Lecture Notes in Control and Information Sciences Editor: M. Thoma
244
Springer London Berlin Heidelberg New York Barcelona I-Iong Kong Milan Paris Santa Clara Singapore Tokyo
H. Nijmeijer and T.I. Fossen (Eds)
New Directions in Nonlinear Observer Design
~ Springer
Series Advisory Board A. B e n s o u s s a n • M.]. G r i m b l e J.L. M a s s e y • Y.Z. T s y p k i n
• P. K o k o t o v i c
• H. K w a k e r n a a k
Editors H. Nijmeijer F a c u l t y o f M a t h e m a t i c a l S c i e n c e s , U n i v e r s i t y o f T w e n t e , P O B o x 217, 7500 A E E n s c h e d e , T h e N e t h e r l a n d s T.I. F o s s e n Department of Engineering Cybernetics, Norwegian University of Science and Technology, N-7034 Trondheim, Norway
ISBN 1-85233-134-8 Springer-Verlag L o n d o n Berlin Heidelberg British Library Cataloguing in Publication Data New directions in nonlinear observer design. - (Lecture notes in control and information sciences ; 224) 1.Observers (Control theory) 2.Nonlinear control theory 3.Feedback control systems l.Nijmeijer, Henk, 1955- II.Fossen, Thor I. 629.8'36 ISBN 1852331348 Library of Congress Cataloging-in-Publication Data New directions in nonlinear observer design / H. Nijmeijer and T.I. Fossen (eds.). p. cm. - (Lecture notes in control and information sciences ; 244) Includes bibliographical references and index. ISBN 1-85233-134-8 (alk. Paper) 1.Observers (Control theory)--Congresses. 2. Nonlinear control Theory--Congresses. I. Nijmeijer, H. (Henk), 1955- . II. Fossen, Thor I. III. Series. QA402.3.N487 1999 99-12174 629.8'312--dc21 CIP Apart from any fair dealing for the purposes of research or private study, or criticism or review, as permitted under the Copyright, Designs and Patents Act 1988, this publication may only be reproduced, stored or transmitted, in any form or by any means, with the prior permission in writing of the publishers, or in the case of reprographic reproduction in accordance with the terms of licences issued by the Copyright Licensing Agency. Enquiries concerning reproduction outside those terms should be sent to the publishers. © Springer-Verlag London Limited 1999 Printed in Great Britain The use of registered names, trademarks, etc. in this publication does not imply, even in the absence of a specific statement, that such names are exempt from the relevant laws and regulations and therefore free for general use. The publisher makes no representation, express or implied, with regard to the accuracy of the information contained in this book and cannot accept any legal responsibility or liability for any errors or omissions that may be made. Typesetting: Camera ready by contributors Printed and bound at the Atheneum Press Ltd., Gateshead, Tyne & Wear 69/3830-543210 Printed on acid-free paper
vi
Acknowledgments The editors are grateful to: S t r a t e g i c U n i v e r s i t y P r o g r a m (SUP) in M a r i n e C y b e r n e t i c s at the Norwegian University of Science and Technology (NTNU), Departments of Engineering Cybernetics, Marine Hydrodynamics and Marine Strutures (Professor Dr.-Ing. Olav Egeland, Program Man-
ager). • A B B (Professor Dr.-Ing. Asgeir J. SCrensen, Technology Manager -
Business Area Marine and Turbochargers) for their financial support. The authors want to thank all the workshop contributors for contributing to this book project. Finally, Mrs. Alison Jackson at Springer-Verlag London should be thanked for editorial suggestions and for helping us with general publishing questions.
Trondheim, February 1999 Enschede, February 1999
Thor I. Fossen Henk Nijmeijer
Contributors Alcorta Garcia, E., Department of Measurement and Control, University of Duisburg, Duisburg, Germany. Ashton, S. A., School of MIS, Coventry University, U.K. Astolfi, A., Centre for Process Systems Engineering, Imperial College of Science, London, U.K. Bastin, G., Centre for Systems Engineering and Applied Mechanics, Universite Catholique de Louvain, Louvain-La Neuve, Belgium. Battilotti, S., Dipartimento di Informatica e Sistemistica, Universit~ di Roma "La Sapienza", Italy. Besan~on, G., Laboratoire d'Automatique de Grenoble, ENSIEG, SaintMartin d'H~res, France. Blanke, M., Department of Automatic Control, Aalborg University, Denmark. Canudas de Wit, C. Laboratoire d'Automatique INPG, ST. Martin d'H~res, France.
de Grenoble, ENSIEG-
Cruz, C., Department of Electronics ~ Telecom., Scientific Research and Advanced Studies Center of Ensenada (CICESE), M~xico. Deng, H., Department of Applied Mechanics and Engineering Sciences University of California at San Diego, La Jolla, USA. Egeland, O., Department of Engineering Cybernetics, Norwegian University of Science and Technology, Trondheim, Norway. El Bahir, L., Department of Control Engineering, Universit~ Libre de Bruxelles, Brussels, Belgium. El Yaagoubi, E. H., LCPI ENSEM
Cassablanca, Morocco.
Fossen, T. I., IDepartment of Engineering Cybernetics, Norwegian University of Science and Technology, Trondheim, Norway and 2ABB Industri AS, Marine Division, Oslo, Norway.
Vlll
Frank, P. M., Department of Measurement Duisburg, Duisburg, Germany.
and Control, University of
Glumineau, A. Institut de Recherche en Cybern~tique de Nantes, France. Hammouri, H., LAGEPT
University of Lyon, France.
Huijberts, H. J. C., Department of Mathematics and Computing Science, Eindhoven University of Technology, The Netherlands. Horowitz, R. Department of Mechanical Engineering, University of California, Berkeley, CA, U.S.A. Isidori, A., IDepartment of Systems Science and Mathematics, Washington University, St. Louis, USA and 2Dipartimento di Informatica e Sistemistica, Universit~t di Roma "La Sapienza", Italy. Izadi-Zamanabadi, R., Department of Automatic Control, Aalborg University, Denmark. Jiang, Z.-P., Department of Electrical Engineering, Polytechnic University, Brooklyn, U.S.A. Junge, L., Drittes Physikalisches Institut, Universitttt G6ttingen, Germany. Khalil, H. K., Department of Electrical and Computer Engineering, Michigan State University, USA. Kinnaert, M., Department of Control Engineering, Universit~ Libre de Bruxelles, Brussels, Belgium. Kocarev, L., Department of Electrical Engineering, St Cyril and Methodius University, Skopje, Macedonia. Kristiansen, D., Department of Engineering Cybernetics, Norwegian University of Science and Technology, Trondheim, Norway. Krstid, M., Department of Applied Mechanics and Engineering Sciences University of California at San Diego, La Jolla, USA. Lilge, T., Institut flit Regelungstechnik, University of Hannover, Hannover, Germany. Lohmiller, W., Nonlinear Systems Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA. Ldpez-Morales, V., Institut de Recherche en Cybern~tique de Nantes, France.
ix Loria, A., Laboratoire d'Automatique de Grenoble, ENSIEG, St. Martin d'H~res, France. Nijmeijer, H., 1Faculty of Mathematical Sciences, Dept. of Systems, Signals and Control, University of Twente and 2Faculty of Mechanical Engineering, Eindhoven University of Technology, The Netherlands. Ortega, R., Laboratoire des Signaux et Syst~mes, Ecole Sup6rieure d'Electricit6, Paris, Prance. Panteley, E., I.N.R.I.A., Rh6ne Alpes, St. Martin d'Hfires, France. Parlitz, U., Drittes Physikalisches Institut, Universitat GSttingen, Germany. Pettersen, K. Y., Department of Engineering Cybernetics, Norwegian University of Science and Technology, Trondheim, Norway. Praly, L., Centre Automatique et Syst~mes, t~cole des Mines de Paris, Fontainebleau, prance. Rodrigues-Cortes, H., Laboratoire des Signaux et Syst~mes, Ecole Sup~rieure d'Electricit~, Paris, prance. Schaffner, J., Institute for Systems, Informatics and Safety, European Commission Joint Research Centre, Ispra, Italy Schreier, G., Department of Measurement and Control, University of Duisburg, Duisburg, Germany. Shields, D. N., School of MIS, Coventry University, U.K. Shiriaev, A., Department of Engineering Cybernetics, Norwegian University of Science and Technology, Trondheim, Norway. Slotine, J. J. E., Nonlinear Systems Laboratory, Massachusetts Institute of Technology, Cambridge, Massachusetts, USA. Strand, J. P., ABB Industri AS, Marine Division, Oslo, Norway. Teel, A., Department of Electrical and Computer Engineering, University of California, Santa Barbara, USA. Tsiotras, P., Georgia Institute of Techology, School of Aerospace Eng., Atlanta, Georgia, USA. Vik, B. Department of Engineering Cybernetics, Norwegian University of Science and Technology, Trondheim, Norway. Zeitz, M., Institut ftir Systemdynamik of Stuttgart, Germany.
und Regelungstechnik, University
Contents Nonlinear Observer Design A Viewpoint on Observability Nonlinear Systems
and
Observer
Design
for 3
G. Besanf~on 1 2 3
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . Basic Definitions a n d P r o p o s e d "Classification" . . . . . . . E x a m p l e s of N o n U n i f o r m a n d U n i f o r m O b s e r v a t i o n . . . . 3.1 N o n U n i f o r m O b s e r v a t i o n : the Case of State-Affine Systems . . . . . . . . . . . . . . . . . . . . . . . . . 3.2 U n i f o r m O b s e r v a t i o n : the Case of U n i f o r m l y O b s e r v able Systems . . . . . . . . . . . . . . . . . . . . . . 3.3 A n E x a m p l e of U n i f o r m O b s e r v a t i o n of N o n - u n i f o r m l y observable S y s t e m s . . . . . . . . . . . . . . . . . . . Observer I n t e r c o n n e c t i o n . . . . . . . . . . . . . . . . . . . S t a t e T r a n s f o r m a t i o n s a n d Observer Design . . . . . . . . . Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Model-Based
Observers
for
Tire/Road
Contact
1
3 4 5
7 8 9 11 15 20 20
Friction
Prediction C. Canudas de Wit, R. Horowitz and P. Tsiotras
2
3 4 7
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . T i r e - r o a d Friction Models . . . . . . . . . . . . . . . . . . . 2.1 P s e u d o - S t e a d y S t a t e Models . . . . . . . . . . . . . 2.2 L u m p e d D y n a m i c Models . . . . . . . . . . . . . . . 2.3 D i s t r i b u t e d D y n a m i c Models . . . . . . . . . . . . . 2.4 Relation Between Distributed Dynamical Model and the Magic F o r m u l a . . . . . . . . . . . . . . . . . . . Problem Formulation . . . . . . . . . . . . . . . . . . . . . . G e n e r a l Observer Design . . . . . . . . . . . . . . . . . . . . A p p l i c a t i o n to the O n e - W h e e l Model . . . . . . . . . . . . . 5.1 Simulation Results . . . . . . . . . . . . . . . . . . . Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . . . . . .
23 23 25 26 28 29 30 32 33 36 39 40 41
xii
Observer D e s i g n for Nonlinear D. Kristiansen and O. Egeland Introduction . . . . . . . . . . 1 Contraction Theory . . . . . 2 System Equations . . . . . . . 3 4 5
6 7
Oscillatory S y s t e m s
. . . . . . . . . . . . . . . . . . 3.1 Analysis . . . . . . . . . . . . . . O b s e r v e r Design . . . . . . . . . . . . . Simulations . . . . . . . . . . . . . . . . 5.1 E x a m p l e 1: 2 - D O F O s c i l l a t o r y Nonlinearities . . . . . . . . . . . 5.2 E x a m p l e 2: C y l i n d e r G y r o s c o p e Conclusions . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . System with . . . . . . . . ............ . . . . . . . . . . . . . . . .
Transformation to State Affine S y s t e m Design A. Glumineau and V. Ldpez-M. Introduction . . . . . . . . . . . . . . . . . . . 1 Defi n i t i o n s a n d N o t a t i o n . . . . . . . . . . . . 2 Problem Statement . . . . . . . . . . . . . . . 3
4
5 6
and
Observers
1 2
. . . . . .
43 44 46 47 49 51 51 52 56 57
Observer
. . . . . . . . . . . . . . . . . . . . . for S t a t e A t t i n e . . . . . . . ........ ........ . . . . . . . . . . . . . . . . . . . . . . . . . . . .
59 60 61 61 62 65 66 67 68 69
f o r N o n l i n e a r Discrete73
Introduction . . . . . . . . . . . . . . . . . . . . . . Differential F o r m s . . . . . . . . . . . . . . . . . . O b s e r v e r D e s i g n using O b s e r v e r F o r m s . . . . . . . . . . . . O b s e r v e r Design using E x t e n d e d O b s e r v e r F o r m s Conclusions . . . . . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . .
Stability Analysis and Observer Diffusion P r o c e s s e s W. Lohmiller and J.-J. E. Slotine
. . . . . . . . . . . . . . . . . . Cubic . . .
59
. . . 3.1 T h e I n p u t - O u t p u t Differential E q u a t i o n Systems ~a - . . . . . . . . . . . . . . . 3.2 S t a t e Affine T r a n s f o r m a t i o n A l g o r i t h m S y n t h es i s O b s e r v e r for S t a t e Affine S y s t e m s 4.1 Physical Example . . . . . . . . . . . . 4.2 Simulation Results . . . . . . . . . . . . Conclusions . . . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . .
On E x i s t e n c e o f E x t e n d e d Time Systems H. J. C. Huijberts
43
Design
. . . . . . . . . . ...... . . . . . . . . . .
73 75 79 84 90 91
for N o n l i n e a r
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . Contraction Analysis . . . . . . . . . . . . . . . . . . . . . . 2.1 Basic Tools . . . . . . . . . . . . . . . . . . . . . . .
93 93 94 94
xiii
4 5 6 A
2.2 N o n l i n e a r O b s e r v e r Design using C o n t r a c t i o n T h e o r y 2.3 Weakly Contracting Systems . . . . . . . . . . . . . N o n l i n e a r Diffusion E q u a t i o n s . . . . . . . . . . . . . . . . . 3.1 C o n t r a c t i o n P r o p e r t i e s of R e a c t i o n - D i f f u s i o n P r o cesses . . . . . . . . . . . . . . . . . . . . . . . . . . 3.2 Observer Design for N o n l i n e a r Diffusion Processes Spatial Discretization and Numerical Implementation . . . . Further Extensions . . . . . . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . . . . . . C o m p u t a t i o n of C o n t r a c t i o n R a t e s . . . . . . . . . . . . . .
96 97 99 100 103 104 105 109 109
N o n l i n e a r P a s s i v e O b s e r v e r D e s i g n for S h i p s w i t h A d a p t i v e
Wave Filtering J. P. Strand and T. L Fossen 1 2
3
4
5 6 7
113
Introduction . . . . . . . . . . . . . . . . . . . . . . . Modeling . . . . . . . . . . . . . . . . . . . . . . . . 2.1 Kinematics . . . . . . . . . . . . . . . . . . . 2.2 Vessel D y n a m i c s . . . . . . . . . . . . . . . . . . . . 2.3 T o t a l Ship Model . . . . . . . . . . . . . . . . . . . . N o n - A d a p t i v e Observers . . . . . . . . . . . . . . . . . . . . 3.1 Observer i n the E F frame . . . . . . . . . . . . . . . 3.2 Augmented Observer . . . . . . . . . . . . . . . . . . A d a p t i v e Observer . . . . . . . . . . . . . . . . . . . 4.1 A d a p t i v e Observer E q u a t i o n s . . . . . . . . . . . . . 4.2 A d a p t i v e Observer Error D y n a m i c s . . . . . . . . . . 4.3 Stability a n d P a s s i v i t y . . . . . . . . . . . . . . . . . Experimental Results . . . . . . . . . . . . . . . . . . . . . . Conclusions . . . . . . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . .
Nonlinear INS
Observer
Design
for I n t e g r a t i o n
. . . . . . . . . . . .
113 115 115 115 118 118 119 123 125 126 126 126 128 130 133
. . . .
. . . . . . . .
of DGPS
and
135
B. Vik, A. Shiriaev and T. L Fossen 1
2 3
4
5
Introduction . . . . . . . . . . . . . . . . . . . . . . 1.1 Nomenclature . . . . . . . . . . . . . . . . . 1.2 Motivation . . . . . . . . . . . . . . . . . . Review of G P S F u n d a m e n t a l s . . . . . . . . . . . . . . . . . Strapdown Equations ..................... 3.1 Local F r a m e R e p r e s e n t a t i o n . . . . . . . . . . . . . . 3.2 Earth Frame Representation ............. 3.3 A n g u l a r Velocity E q u a t i o n s . . . . . . . . . . . . . . N o n l i n e a r Observer Design . . . . . . . . . . . . . . . . . . . 4.1 A n g u l a r Velocity Observer . . . . . . . . . . . . . . . 4.2 Velocity a n d P o s i t i o n Observers . . . . . . . . . . . . Case S t u d y . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . .
. . . . .
135 135 136 138 140 140 141 142 142 143 145 157
xiv
6
Conclusions and Future Work
7
References . . . . . . . . . . . . . . . . . . . . . . . . . . . .
Variants
of Nonlinear
Normal
. . . . . . . . . . . . . . . . .
Form
Observer
158 158
161
Design
J. Schaffner and M. Zeitz Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 Normal Form Observer . . . . . . . . . . . . . . . . . . . . . 2 Continuous Observer . . . . . . . . . . . . . . . . . . . . . . 3 Extended Luenberger Observer . . . . . . . . . . . . . . . . 4 Block-Triangular Observer . . . . . . . . . . . . . . . . . . . 5 Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . 6 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . 7
1
167 171 179 179
181
Output Feedback Control Design
II
161 162 163
Separation Results for Semiglobal Stabilization Nonlinear Systems via Measurement Feedback
of 183
S. Battilotti 1
2 3
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . Notations . . . . . . . . . . . . . . . . . . . . . . . . . . . . Regional Stabilization via Measurement Feedback ......
183 185 186
3.1 3.2 3.3
186 191
Tools . . . . . . . . . . . . . . . . . . . . . . . . . . Applications . . . . . . . . . . . . . . . . . . . . . . S e m i g l o b a l S t a b i l i z a t i o n of U n c e r t a i n N o n l i n e a r S y s tems . . . . . . . . . . . . . . . . . . . . . . . . . . .
References . . . . . . . . . . . . . . . . . . . . . . . . . . . . Observer-Controller
Design
for
Global
Tracking
of
N o n h o l o n o m i c Systems Z.-P. Jiang and H. Nijmeijer Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 Problem Statement . . . . . . . . . . . . . . . . . . . . . . . 2 Reduced-Order Observer . . . . . . . . . . . . . . . . . . . . 3 Output-Feedback Design . . . . . . . . . . . . . . . . . . . . 4 4.1 Backstepping-Based Trackers . . . . . . . . . . . . . 4.2 A Modification . . . . . . . . . . . . . . . . . . . . . Example: A Knife-Edge . . . . . . . . . . . . . . . . . . . . Conclusions and Future Work . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . . . . . .
3
196 205
Separation Principle f o r a C l a s s o f Euler-Lagrange Systems
207 207 208 210 212 213 217 219 225 226
A
229
A. Loria and E. Panteley 1
Introduction
. . . . . . . . . . . . . . . . . . . . . . . . . . .
229
xix
IV
Synchronization
467
and Observers
1 Synchronization Through E x t e n d e d Kalman Filtering
469
C. Cruz and H. Nijmeijer Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 A n E x t e n d e d K a l m a n F i l t e r as Receiver . . . . . . . . . . . 2 Examples . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3 3.1 Synchronization .................... 3.2 Secure C o m m u n i c a t i o n . . . . . . . . . . . . . . . . . Concluding Remarks . . . . . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . . . . . .
2 Nonlinear Discrete-Time
469 472 479 479 483 487 488
Observers for Synchronization
Problems
491
T. Lilge 1 2 3 4
7 8
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . S t a t e E q u i v a l e n c e to a S y s t e m in E x t e n d e d O b s e r v e r F o r m Observer Design via E x t e n d e d Observer F o r m . . . . . . . . A l t e r n a t i v e Observer S t r u c t u r e s via E O F . . . . . . . . . . 4.1 Observer E q u a t i o n s . . . . . . . . . . . . . . . . . . 4.2 M a i n C h a r a c t e r i s t i c s of t h e Observers . . . . . . . . A n E x a m p l e in the F i e l d of C o m m u n i c a t i o n . . . . . . . . . Observer Design for the R6ssler S y s t e m . . . . . . . . . . . 6.1 Observer Design in C o n t i n u o u s - T i m e . . . . . . . . . 6.2 Observer Design i n D i s c r e t e - T i m e . . . . . . . . . . 6.3 Observer Errors for Slow E r r o r D y n a m i c s . . . . . . . 6.4 Observer Errors for Fast E r r o r D y n a m i c s . . . . . . 6.5 Concluding Remarks .................. Discussion a n d C o n c l u s i o n s . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . . . . . .
511
Chaos Synchronization U. Parlitz, L. Junge and L. Kocarev Introduction . . . . . . . . . . . . . . . . . . . . . 1 S y n c h r o n i z a t i o n of S p a t i a l l y E x t e n d e d S y s t e m s 2 Generalized Synchronization . . . . . . . . . . . . . . . . . . 3 Phase Synchronization ..................... 4 Conclusions . . . . . . . . . . . . . . . . . . . . . 5 References . . . . . . . . . . . . . . . . . . . . . . 6
491 494 497 499 499 500 501 503 505 506 506 507 508 509 509
. . . . . . .......
. . . . . . . . . . . .
511 512 515 518 522 522
XV
4 5 6 A 4
Model a n d P r o b l e m F o r m u l a t i o n . . . . . . . . . . . . . . . A Cascades A p p r o a c h to a S e p a r a t i o n P r i n c i p l e . . . . . . . 3.1 Observer Design . . . . . . . . . . . . . . . . . . . . 3.2 S t a t e Feedback C o n t r o l l e r . . . . . . . . . . . . . . . 3.3 A Separation Principle . . . . . . . . . . . . . . . . . A p p l i c a t i o n to R o b o t M a n i p u l a t o r s . . . . . . . . . . . . . . Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . . . . . . A T h e o r e m o n U G A S for Strictly Passive S y s t e m s . . . . .
High-Gain
Observers
in Nonlinear
Feedback
232 234 235 236 237 240 243 244 246 249
Control
H. K. Khalil 1 2 3 4 5 6 7 8 9 10 11 5
Introduction . . . . . . . . . . . . . . . . . . . . . . . Motivating Example . . . . . . . . . . . . . . . . . . Separation Principle . . . . . . . . . . . . . . . . . . S t a b i l i z a t i o n a n d Semiglobal S t a b i l i z a t i o n . . . . . . . . . . Nonlinear Servomechanisms . . . . . . . . . . . . . . . . . . Adaptive Control . . . . . . . . . . . . . . . . . . . . Sliding Mode C o n t r o l . . . . . . . . . . . . . . . . . . U n m o d e l e d Fast D y n a m i c s . . . . . . . . . . . . . . . . . . Discrete-Time Implementation ................ A p p l i c a t i o n to I n d u c t i o n Motors . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . .
Output-Feedback tems
Control
of
Stochastic
. . . . . . . . . . . .
. . . . . . . .
. . . .
Nonlinear
Feedback
Sys269
M. Krstid and H. Deng Introduction . . . . . . . . . . . . . . . . . . . . . . . . 1 P r e l i m i n a r i e s on Stochastic S t a b i l i t y . . . . . . . . . . . . . 2 O u t p u t - F e e d b a c k S t a b i l i z a t i o n in P r o b a b i l i t y . . . . . . . . 3 Output-Feedback Noise-to-State Stabilization . . . . . . . . 4 Output-Feedback Adaptive Stabilization ........... 5 References . . . . . . . . . . . . . . . . . . . . . . . . . 6 P r o o f . . . . . . . . . . . . . . . . . . . . . . . . . . . . A P r o o f . . . . . . . . . . . . . . . . . . . . . . . . . . . . B P r o o f . . . . . . . . . . . . . . . . . . . . . . . . . . . . C Output
249 250 255 258 259 261 262 262 263 263 264
Control of Food-Chain
. . .
. . . .
. . . .
. . . .
Systems
R. Ortega, A. Astolfi, G. Bastin and H. Rodrigues Cortes Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 C o n t r o l l e r Design P r o c e d u r e . . . . . . . . . . . . . . . . . . 2 S t a t e - F e e d b a c k C o n t r o l of a Simple P r e y - P r e d a t o r S y s t e m 3 Output-Feedback Stabilization ................ 4 M ain Result . . . . . . . . . . . . . . . . . . . . . . . . . . . 5 S i mulations . . . . . . . . . . . . . . . . . . . . . . . . . . . 6
269 270 271 276 280 284 287 288 289 291 291 292 295 299 301 305
xvi 7 8 A 7
8
Concluding Remarks . . . . . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . . . . . . Maple Code . . . . . . . . . . . . . . . . . . . . . . . . . . .
306 307 308
O u t p u t Feedback Tracking Control for Ships
311
K. Y. Pettersen and H. Nijmeijer Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 T h e Ship M o d e l . . . . . . . . . . . . . . . . . . . . . . . . . 2 D e s i g n of a n O u t p u t F e e d b a c k T r a c k i n g C o n t r o l L a w . . . . 3 Simulations . . . . . . . . . . . . . . . . . . . . . . . . . . . 4 Bias E s t i m a t i o n . . . . . . . . . . . . . . . . . . . . . . . . . 5 S i m u l a t i o n s w i t h an E n v i r o n m e n t a l D i s t u r b a n c e . . . . . . 6 Conclusions and ~ t u r e Work . . . . . . . . . . . . . . . . . 7 References . . . . . . . . . . . . . . . . . . . . . . . . . . . . 8
311 313 314 319 324 327 329 331
D y n a m i c U C O Controllers and Semiglobal Stabilization o f U n c e r t a i n N o n m i n i m u m P h a s e S y s t e m s by O u t p u t Feedback 335 A. Isidori, A. R. Teel and L. Praly 1 2 3
4
5 6
III
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . Preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . S t a b i l i z a t i o n of N o n m i n i m u m P h a s e S y s t e m s by O u t p u t F eed back . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 3.1 T h e R e l a t i v e D e g r e e O n e Case . . . . . . . . . . . . 3.2 T h e R e l a t i v e D e g r e e G r e a t e r t h a n O n e Case . . . . On Dynamic U C O Feedback . . . . . . . . . . . . . . . . . . 4.1 General Results . . . . . . . . . . . . . . . . . . . . . 4.2 A p p l i c a t i o n to N o n m i n i m u m P h a s e S y s t e m s . . . . . Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . . . . . .
335 336 338 338 341 344 344 347 349 349
351
Fault Detection and Isolation
Fault D e t e c t i o n O b s e r v e r f o r a C l a s s o f N o n l i n e a r S y s t e m s 353 S. A. Ashton and D. N. Shields 1 2 3 4 5
6 7
Introduction . . . . . . . . . . . . . . . . . . . . . . . System Description . . . . . . . . . . . . . . . . . . . Observer Design . . . . . . . . . . . . . . . . . . . . General Detectability Conditions ............... Testable Detectability Conditions ............... 5.1 A S p e c ia l Class ( S t e p - F a u l t s ) . . . . . . . . . . . . . 5.2 Numerical Calculation Procedure ........... Concluding Remarks . . . . . . . . . . . . . . . . . . References . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . .
. . . . . . . .
353 354 354 363 365 368 371 372 373
xvii
N o n l i n e a r Observer for Signal and P a r a m e t e r Fault D e t e c t i o n in Ship P r o p u l s i o n Control 375 M. Blanke and R. Izadi-Zamanabadi 1 2
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . Ship P r o p u l s i o n S y s t e m . . . . . . . . . . . . . . . . . . . . 2.1 Propeller Thrust and Torque ............. 2.2 Diesel E n g i n e P r i m e M o v e r . . . . . . . . . . . . . .
375 376 377 377
2.3 2.4
Hull R e s i s t a n c e . . . . . . . . . . . . . . . . . . . . . A c t u a t o r s for Fuel I n j e c t i o n a n d P r o p e l l e r P i t c h . .
378 378
2.5
Sensors
. . . . . . . . . . . . . . . . . . . . . . . . .
378
3 4
Control Hierarchy . . . . . . . . . . . . . . . . . . . . . . . Structural Analysis . . . . . . . . . . . . . . . . . . . . . . . 4.1 D e s c r i p t i o n of t h e M o d e l . . . . . . . . . . . . . . . 4.2 Formal Representation . . . . . . . . . . . . . . . . . 4.3 Sensor Fusion for R e - c o n f i g u r a t i o n . . . . . . . . . .
379 380 380 380 381
5
I s o l a t i o n of Shaft S p e e d a n d E n g i n e F a u l t s ......... 5.1 Adaptive Observer . . . . . . . . . . . . . . . . . . . 5.2 I d e n t i f i c a t i o n of P r o p e l l e r P a r a m e t e r s . . . . . . . .
384 384 386
5.3
....
388
6
Fault Isolation . . . . . . . . . . . . . . . . . . . . . . . . . 6.1 Re-configuration . . . . . . . . . . . . . . . . . . . .
389 392
7
Simulation Results
. . . . . . . . . . . . . . . . . . . . . . .
394
8
Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . .
395
9
References . . . . . . . . . . . . . . . . . . . . . . . . . . . .
395
Nonlinear Observers for Fault D e t e c t i o n and Isolation P. M. Frank, G. Schreier and E. Alcorta Garcia Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . 1 Preliminaries . . . . . . . . . . . . . . . . . . . . . . . . . . 2 Observer-Based Residual Generation ............. 3
399
3.1 3.2 3.3 3.4 3.5 3.6 4
Identifiability from Usual Maneuvering D a t a
399 400 401 Nonlinear Identity Observer Approach ........ 401 N o n l i n e a r U n k n o w n I n p u t O b s e r v e r A p p r o a c h . . . 403 The Disturbance Decoupling Nonlinear Observer Approach . . . . . . . . . . . . . . . . . . . . . . . . . . 404 Adaptive Nonlinear Observer Approach ....... 406 T h e N o n l i n e a r F a u l t D e t e c t i o n F i l t e r A p p r o a c h . . . 408 O b s e r v e r for F a u l t D i a g n o s i s in B i l i n e a r S y s t e m s . . 410
Nonlinear Observer Design via Lipschitz Condition . . . . .
412
4.1 4.2 4.3
Observer Presentation . . . . . . . . . . . . . . . . . C o n t r i b u t i o n of this O b s e r v e r . . . . . . . . . . . . . Residual Generation . . . . . . . . . . . . . . . . . .
412 415 417
Conclusions . . . . . . . . . . . . . . . ............ References . . . . . . . . . . . . . . . . . . . . . . . . . . . .
418 418
XVlll
4
A p p l i c a t i o n o f N o n l i n e a r O b s e r v e r s t o Fault D e t e c t i o n a n d Isolation 423 H. Hammouri, M. Kinnaert and E.H. El Yaagoubi 1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . .
423
2
R e s i d u a l G e n e r a t i o n for L i n e a r S y s t e m s
424
2.1
3
...........
Problem Statement . . . . . . . . . . . . . . . . . . .
424
2.2
Second Problem Formulation
425
2.3
P r i n c i p l e of t h e S o l u t i o n . . . . . . . . . . . . . . . .
.............
R e s i d u a l G e n e r a t i o n for N o n l i n e a r S y s t e m s
426 .........
428
3.1
Introduction
. . . . . . . . . . . . . . . . . . . . . .
428
3.2
Basic N o t i o n s . . . . . . . . . . . . . . . . . . . . . .
428
3.3
H i g h G a i n O b s e r v e r s for U n i f o r m l y O b s e r v a b l e Systems . . . . . . . . . . . . . . . . . . . . . . . . . .
429
3.4
T h e F h n d a m e n t a l P r o b l e m of R e s i d u a l G e n e r a t i o n for N o n l i n e a r S y s t e m s . . . . . . . . . . . . . . . . .
431
3.5
A p p l i c a t i o n of N o n l i n e a r O b s e r v e r s t o t h e F P R G
. .
Hydraulic System . . . . . . . . . . . . . . . . . . . . . . . .
434 437
4.1
M o d e l l i n g of t h e S y s t e m . . . . . . . . . . . . . . . .
437
4.2
Design of a R e s i d u a l G e n e r a t o r . . . . . . . . . . . .
438
5
Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . .
441
6
References . . . . . . . . . . . . . . . . . . . . . . . . . . . .
441
A
N u m e r i c a l Values used for t h e S i m u l a t i o n of t h e H y d r a u l i c System . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
443
I n n o v a t i o n G e n e r a t i o n for Bilinear S y s t e m s w i t h U n k n o w n 445
Inputs
M. Kinnaert and L. El Bahir 1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . .
445
2
Problem Statement . . . . . . . . . . . . . . . . . . . . . . .
447
3
Design Procedure . . . . . . . . . . . . . . . . . . . . . . . .
448
4
Innovation Monitoring . . . . . . . . . . . . . . . . . . . . .
456
4.1
Introductory Remark . . . . . . . . . . . . . . . . . .
456
4.2
I n n o v a t i o n in t h e P r e s e n c e of A d d i t i v e F a u l t s . . . .
456
4.3
Generalized Likelihood Ratio Test
457
5
..........
D e s i g n a n d V a l i d a t i o n of a F D I S y s t e m for a t h r e e T a n k Process . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
459
5.1
Process Description
459
5.2
Design a n d V a l i d a t i o n of t h e I n n o v a t i o n G e n e r a t o r .
5.3
E v a l u a t i o n of t h e I n n o v a t i o n S e q u e n c e . . . . . . . .
. . . . . . . . . . . . . . . . . .
460 462
6
Conclusions . . . . . . . . . . . . . . . . . . . . . . . . . . .
463
7
References . . . . . . . . . . . . . . . . . . . . . . . . . . . .
463
A V i e w p o i n t on Observability and Observer Design for Nonlinear S y s t e m s Gildas Besan~on
ENSIEG
1
Laboratoire d ' A u t o m a t i q u e de Grenoble BP46 - 38402 Saint-Martin d'H~res, France
Introduction
Given a dynamical system, the observer aims at obtaining an estimate of the current state by only using available measurements. For linear systems, the property of observability, characterized by the K a l m a n rank condition, guarantees the possibility to indeed design an observer. In the case of nonlinear systems, observability is not enough, basically because this p r o p e r t y in general depends on the input of the system. In other words, observability of a nonlinear system does not exclude the existence of inputs for which two distinct initial states cannot be distinguished by using the knowledge of the measured output. This results in the fact t h a t in general, observer gains can be expected to depend on the applied input. Moreover, the existing observers generally tightly depend on some specific structure of the considered system. This chapter discusses such characteristics of observer design for nonlinear systems, basically following the recent work of [1]: on the basis of background definitions, the main observability requirements for observer design are first recalled in Section 2, so as to put into relief the various contexts which can be found. In particular, designs which are non uni.form w.r.t, the input are distinguished from those which are uniform, with a special attention paid to the case of uniform design for non uniformly observable systems in Section 3. Two directions of extension of available designs are then highlighted and illustrated, namely the possibility of interconnecting sub-observers to still obtain an observer in Section 4, and the issue of state transformation to make some observer design possible in Section 5. Some conclusions are finally given in Section 6.
4
2
1. A Viewpoint on Observability and Observer Design for Nonlinear Systems
Basic Definitions and Proposed "Classification"
Let us consider a nonlinear system defined by the following representation:
(~n~)
{~ = f(x,u) y
=
h(x)
x c ~ , u
9
y9
m
,
(1.1)
and let X~,(t, xo) denote its solution at time t, with initial condition x0 at time t = 0 and control u(t). Admissible inputs u(.) axe assumed to be taken in some set L / o f measurable and bounded functions. Observability can then be defined by the notion of indistinguishability [22] (see [11] for a synthesis). D e f i n i t i o n 1.1 Indistinguishability.
A pair (xo, ~o) will be said to be indistiguishable by u i.fVt > O, h(x~(t, xo)) =h(Xu( t, ~o) ). The pair is just said to be indistinguishable, i.f it is so .for any U.
From this definition, observability of (1.1) can be defined as follows: Definition 1.20bservability.
A nonlinear system (1.1) is observable if it does not have any indistinguishable pair o.f states. At this point, one can notice that from this definition, observability does not exclude the possible existence of inputs for which some states are indistinguishable. As an example, the following system: :~1
~
UX2
J:2
=
--x2
y
~
X1
(1.2)
is clearly observable, and yet u _= 0 makes every pair
indistinguishable. This means t h a t in general observability is not enough to be able to design an observer and t h a t the problem of inputs must be taken into account. A particular case of interest, is the case of inputs for which no indistinguishable pair can be found: D e f i n i t i o n 1.3 Universal inputs. An input u is universal on [0, t] i.f.for every pair of distinct states xo ~ 20, there exists T 9 [O,t] such that h(x~(7, xo)) ~ h(x~(7, 20)).
If u is universal on Kt +, it is just said to be universal.
1. A Viewpoint on Observability and Observer Design for Nonlinear Systems
5
From this definition, the notion of singular inputs can be derived: D e f i n i t i o n 1.4 Singular inputs. A non universal input is called singular. As one can guess from example (1.2) above, a typical class of systems which have singular inputs is the class of so-called state-affine systems of the following form:
y
= =
A(u)x Cx.
(1.3)
For such systems, one can define - as for linear time-varying systems - the following quantities: 9 the transition matrix @~(T, t) by: d(I)~('r, t) dT @~(t,t)
-
A(U(T))O~(T,t)
=
Id,
(1.4)
9 the observability Grammian F(t, T, u) by: F(t, T, u) :---
f
t+T
@T(T,t)cTV~2u(T,t)dT),
(1.5)
Jt
9 and some universality index "y(t, T, u), defined as the smallest elgenvalue of F(t, T, u). On this basis one can characterize inputs which are "universal enough" so that an observer design will be possible: D e f i n i t i o n 1.5 [10] Regularly persistent inputs. A n admissible input u is said to be regularly persistent .for system (1.3) if 3 T > O, a > O, and to > 0 such that 7 ( t , T , u ) >_ ~ for t >_ to. These remarks show that in general, the observer gain, as well as its stability property, depend on the input. In view of the above definitions, a particular case of special interest is the one of systems without singular inputs: D e f i n i t i o n 1.6 Uniformly observable systems. A system whose all inputs are universal is called uniformly observable. If, .for every t > O, all inputs are universal on [0, t], the system is locally uniformly observable. A class of "sufficiently regular" locally uniformly observable control affine systems have been characterized in [14]:
6
1. A Viewpoint on Observability and Observer Design for Nonlinear Systems
Theorem
1.1 [14] A n observable nonlinear system in the form:
= y
f(~)+ug(~)
=
(1.6)
with u = 0 as a universal input, and nonsingular Jacobian of ~(~) = (h(~),Lf(h(~)), ... L ~ - l ( h ( ~ ) ) ) at ~0, is locally uniformly observable at ~o i.f and only i.f the change of coordinates x = ~(~) turns it into the .following .form: Xl X2
= =
X2 + ~ I ( X l ) u X3 + ~ 2 ( X l , X 2 ) u
Xn--1
=
Xn+~n--I(Xl,...Xn--1)U
y
=
(1.7) =
+
xI []
Extensions to non-control-affine case mono and multi o u t p u t can be found in [15] and [12] respectively. For such systems, so-called high gain observers m a y exist [16], which are observers with gain and stability independent of the input. However, one can find several other cases of observers which have been proposed irrespective of the input, although the considered systems are not uniformly observable [21, 29, 13, 2]. The only possible explanation for such a phenomenon is t h a t in these cases, the difference between trajectories resulting from two distinct indistinguishable states naturally tends to zero. This is for instance what happens in system (1.2) : for this system u = 0 is singular insofar as it cannot distinguish
However, for such initial conditions and input, the error e2 on trajectories of x2 satisfies 42 = - e 2 , while t h a t of xl is identically zero, and thus, it is clear that the difference between the two trajectories asymptotically goes to zero. In this case, an observer irrespective of the input can be designed, simply as ~:2 = - x 2 Following the terminology of linear systems, this suggests to define some detectability property as follows: D e f i n i t i o n 1.7 Detectability. A nonlinear system (1.1) will be called detectable if for every couple ((2o, ~o), u(.) ) in (j~n • H:tn) • 5t such that there exists to .for which
1. A Viewpoint on Observability and Observer Design for Nonlinear Systems Vt _> to; h(x~(t, xo)) = h(x~(t, 5:0)) then IIx,,(t, xo) -
x=(t, ~0)ll
,0. t ---~oo
In the case of uncontrolled system, we only consider pairs of initial conditions. As a summary, in view of the above observability properties w.r.t, the inputs, one can distinguish the following cases in observer designs: 9 Either the system does not have singular inputs, and in t h a t case, one can hope to be able to design an observer irrespective of the input (uniform observation of uniformly observable systems), but m a y also only find an observer depending on the input (non uni.forTn observa-
tion of uniformly observable systems); 9 Or the system may have singular inputs, and in t h a t case possible observer designs will generally depend on the inputs (non uniform observation of non uniformly observable systems), except in special cases of systems which are detectable in the sense of definition 1.7
(uniform observation of non uniformly observable systems).
3 3.1
Examples of Non Uniform and Uniform Observation Non Uniform Observation: the Case of State-Affine Systems
Let us consider here a system described by the following equations:
y
= =
A(u)x + B(u), Cx
x C ~n, u C ~m y C P:tp.
(1.8)
For such a system, the observability generally depends on the input, and under appropriate excitation, an observer has been proposed with a gain indeed depending on u, as recalled below: 1.2 [10, 17] I.f u is regularly persistent for (1.8), and A(u), B(u) are uniformly bounded on the set of admissible inputs, then there exists 0o s.t. .for any 0 >_ 0o, the following system is an observer for (1.8):
Theorem
= So
= >
A(u)2 S - 1 c T ( c : ~ y) -~- B(~t) - 0 S - d ( u ) V S -- SA(u) + C T C O,
(1.9)
8
1. A Viewpoint on Observability and Observer Design for Nonlinear Systems
and V~ > 0, 30 > 0 : II~(t) - x(t)l I < A e x p ( - ~ t ) , .for some A > O. [] T h i s result c a n c l e a r l y b e e x t e n d e d t o t h e case of s y s t e m s in t h e f o r m ic = A ( s ) x + B(s) for a n y m e a s u r e d signal s w h i c h is r e g u l a r l y p e r s i s t e n t for k = A(s)x, for i n s t a n c e s = (u, y) [18]. W e p r e s e n t in S e c t i o n 4 a n o t h e r t y p e of e x t e n s i o n , b a s e d on i n t e r c o n n e c t i o n s of o b s e r v e r s in t h e form (1.9).
3.2
U n i f o r m O b s e r v a t i o n : the C a s e o f U n i f o r m l y O b s e r v a b l e Systems
W e consider h e r e a s y s t e m of t h e following form:
ic y
= =
A x + ~(x, u), C x e Kt
x C ~:tn, u E1R m (1.10)
0 1 ... 0 / withA=
". ". 0 andC= (1 0... 0 ). 0 ... 0 1 0 ... 0 U n d e r s t r u c t u r e c o n d i t i o n as in (1.7) - e n s u r i n g u n i f o r m o b s e r v a b i l i t y - a n d s o m e L i p s c h i t z c o n d i t i o n on ~, one c a n here d e s i g n a n o b s e r v e r w i t h a g a i n which is u n i f o r m w.r.t, u as recalled h e r e a f t e r ( w h e r e xi - resp. ~ i - d e n o t e s each c o m p o n e n t of x - resp. ~):
Theorem 1.3 [16] If: ,, ~ is globally Lipschitz w.r.t, x, uni.forTnly w.r.t, u; 9 ~xj = - - O , . f o r i = l , . . . n - l , j = i + l , . . . n . then there exists 00 such that for all 0 > 00, the .following system is an asymptotic observer .for (1.10): x 0 andV
> 0,30
= =
A 2 - S - 1 C T ( C 2 - y) + V ( } , u ) -OS- ATs - SA + cTc,
> 0 : Ilk(t) - x(t)ll _<
for some
(1.11)
> O. []
W e will use this r e s u l t in S e c t i o n 4 t o p r o p o s e a u n i f o r m o b s e r v e r for s o m e n o n u n i f o r m l y o b s e r v a b l e s y s t e m . B u t let us first i l l u s t r a t e t h i s p h e n o m e n o n in n e x t s u b s e c t i o n .
1. A Viewpoint on Observability and Observer Design for Nonlinear Systems
3.3
9
A n E x a m p l e o f U n i f o r m Observation of N o n - u n i f o r m l y observable S y s t e m s
To be able to find an observer which is non uniform w.r.t, the input for a system which is not uniformly observable, as well as a uniform observer for a system which is uniformly observable, is to some extent quite consistent. More impressive are cases of uni.form observers for non uniformly observable systems. An illustrative example of the phenomenon is given by the case of systems of the following form:
y
= =
Ax + f ( x , u ) Cx
(1.12)
for which one can find matrices K, D such t h a t
T=(
C
is invertible, ( K C + D)f(x, u) = cp(Cx, u), and A22 + NA12 is stable, with
( ~-T-~T-IA21 All A22 A12) . Such systems indeed, generally admit singular input (like system (1.2) for instance, for which D = (0 1) and K = 0 satisfy the above conditions, and u = 0 is a singular input), and yet: =
w
(A22 + KA12)z + ( K A u + A21 - (A22 + KA12)K)y + ~(y, u)
=
KC+D
z (1.13)
is an observer for (1.12) irrespective of the input [2]. Such systems in fact enjoy the following structure:
x= / All A21
A22
A
r
u)
' y = Cx = (Ip, O)x (1.14)
and as design Such in the
soon as A makes A22 - AA12 asymptotically stable, then one can an observer in the form (1.13) for (1.14). a property can in particular be used to design robust observers sense that the state estimation does not require the knowledge of
F(x,u). For instance, various manufacturing s y s t e m s admit a representation in the form (1.14), and an example can be found in [28] where the t h e r m a l
10
1. A Viewpoint on Observability and Observer Design for Nonlinear Systems
behaviour of some machine-tool spindle-bearing system is considered. Considering temperatures of the various elements involved as state variables, a numerical realization of the systems reads as follows:
---
+ y
=
(1
-8.000.10 5
2.982.10 -6
0.0421 0
-0.0325 2.483.10 -6
1.407.10 - s 0.008O 2.495.10 .5
) 0(u,x)
0)
0.0104 -1.724.10 -4
(1.15) (1.16)
00)x
where Q(u, x) is some inaccurate model of the friction heat flow. This system is thus in the form (1.14), with the nonlinear part in the form B1
B2 ) F(x,u) and one can check that here A = B_z B1 leaves A 2 2 - AA12 stable. Hence an observer can here be designed with complete decoupling of the uncertain part Q. Some more general conditions for such a design to be possible are given in [2], but a general formulation of the idea can be expressed as follows: P r o p o s i t i o n 1.1 If a nonlinear system:
y
=
f(x,u)
=
h(x)
(1.17)
can be transformed into: Zl
z
f l ( Z 1 , Z2, U)
~2 =
f2(zl,z2,~)
y
Zl
z
(1.18)
by change of coordinates z = O(x), such that .for any couple (u, Zl) of admissible functions and any pair of initial conditions z ~ ~ ~o we have ][X~u,zl)(t, z o) _ X~u,*i)(t, ~0)]] __~ 0 when t ---* ~ , then:
z2 =
f2(y, ~2,~) (1.19)
is an observer for (1.17).
1. A Viewpoint on Observability and Observer Design for Nonlinear Systems
11
This illustrates how an observer can be designed on the basis of some state transformation, here in some particular conditions ensuring detectability. Other results on state transformation for observer design are given in section 5.
4
Observer Interconnection
One way to extend the class of systems for which an observer can be designed is to interconnect observers in order to design an observer for some interconnected system, when possible. If indeed a system is not under a form for which an observer is already available, but can be seen as an interconnection between several subsystems each of which would a d m i t an observer if the states of the other subsystems were known, then a candidate observer for the interconnection of these subsystems is given by interconnecting available sub-observers. Notice t h a t in general, the stability of the interconnected observer is not guaranteed by that of each sub-observer, in the same way as separate designs of observer and controller do not in general result in some stable observer-based controller for nonlinear systems (no separation principle). However, Lyapunov-based sufficient conditions can be given so t h a t the existence of sub-observers results in t h a t of an interconnected observer [7]. Consider for instance the case of systems made of two subsystems of the following form:
it1 Y
:
= =
f l ( X l , X 2 , U ) , u C U C 1Rm; fi C ~ function, i = 1,2; f 2 ( x 2 , x l , u ) , x~ E X~ C ~ , i = 1,2; ( h i ( x 1 ) , h2(x2)) T = (Yl, Y2) T, Y~ E H~n~, i = 1,2. (1.20)
Assume also t h a t u(.) E U c / : o o ( ~ + , U), and set Xi := A C ( ~ + , ~ '~') the space of absolutely continuous function from ~7~+ into ~ n , . Finally, when i E {1, 2}, let ~ denote its complementary index in {1, 2}. The system (1.20) can be seen as the interconnection of two subsystems (Ei) for i = 1, 2 given by: (E~)
2~ = f~(xi,v~,u),
y~ = h~(x~),
(v~,u) e X~ x l~.
(1.21)
Assume t h a t for each system (Ei), one can design an observer (Oi) of the following form:
(Oi)
zi = fi(zi, v~, u) + ki(gi, z i ) ( h i ( z i ) - Yi),
gi = Gi(zi, v~, u, gi), (1.22)
for smooth ki, G~ and (zi, g~) E (~:~n~ x(~i), dT~ positively invariant by (1.22). T h e idea is to look for an observer for (1.20) under the form of the following
1. A Viewpoint on Observability and Observer Design for Nonlinear Systems
12
interconnection:
CO)
~ gi
f~(~, ~ , u) + ki(~, ~ci)(hi(~i) - yi); i = 1, 2; G~(2i, ~ , u, ~i); i -- 1, 2
=
(1.23)
Set ei := z~ - xi, and for any u C/4, v~ E A'i consider the following s y s t e m (where k~"(t) denotes gain ki(g~, zi) defined in (1.22)) :
c(~.... )
~
=
f~(z~,v~,u)+k~
(t)(h~(z~)
h~(z~-e~))
T h e n sufficient conditions for (1.23) to be an observer for (1.20) have b e e n expressed in [7] as follows: 1.4 [7] If.for i = 1,2, any signal u C Lt, v~ E A C ( ~ + , ~ n ~ ) , and any initial value rz0 [ i ,gi0~) E ~n~ x dg~, 3Vi(t, el), Wi(ei) positive de.finite .functions such that: Theorem
(i) Vx~ E X~;Ve~ E z~'~;Vt _> 0, ~
(t, ei) + ~
(t, ei)[fi(xi + ei, v~(t), u(t) ) - fi(xi, v~(t), u(t) ) +k~(t)(h~(xi + e~) - hi(xi))] ~ - W i ( e i )
(ii) 3a~ > 0;Yx~ C X~;Vx~ C ~ ; Y e ~
E ~';Ye~
C ~';Vt
>_ 0,
O~(t, e~)[f~(x~, x~ + e~, u(t)) - f~(x~, x~, u(t))] < ~ i ~ X / / - ~ e ~ ) , (iii) a~ + a2 < 2, then (1.23) is an asymptotic observer for (1.20). [] In the weaker case of cascade interconnection, n a m e l y when f l ( X l , x2, u) ---fl (x~, u) in (1.20), a s s u m p t i o n s can be weakened in the following way: Theorem
1.5 [7] Assume that:
L System :~1 fl(Xl,U); Yl = hi(x~) admits an observer (0~) as in (1.22) (without v2), s.t. Vu ~ Lt and Vxl(t) admissible trajectory of the system associated to u: ~-
lim el(t) = 0 a n d
Ile~(t)lldt < +o~
(with e~ :=
Z 1 -- Xl) ;
t ----~0 ( 3
(1.24)
IX. ~c > 0; Vu e U; Vx~ e X2, Ilf2(x2, xl,u)-f~(x2,x~,u){I
~ cllxl-X~lII,
1. A Viewpoint on Observability and Observer Design for Nonlinear Systems
13
III. Vu ELt, Vvl E A C ( f t +, ITtTM ), Vz2, o g2, o 3v(t, e2), w(e2) positive de.finite .functions s.t for every trajectory of ~(u'vl 2 with z2(O) = z ~ g2(O) = gO: (i) Vx2 E X2, e2 E j~n,2, t ~ O, ov
--~(t, e2) +
(t, e2)[f2(x2 + ee, vl(t),u(t)) - f~(x2,vl(t),u(t)) +k~ 1(t)(h2(x2 + e~) - h2(x2))] < -w(e~)
(ii) re2 E / R n2, t >_ O; v(t, e2) >_ z~(e2) (iii) Ve2 E Kgn2\B(O,r),t >_ O;
~--~eo(t, --o~
<_ A(l+v(t, e2(t))).for
.some constants ~,r > 0 and B(O,r) := {e2:11~211 _< ~}. Then: X1
fl (:~1, lt) Jr- k 1 (gl, :~l)(hl (Xl) - hi (Xl))
X2
f2( 1,
gl
- h (xl))
u) + k (02,
(1.25)
----
is an observer" .for (1.20) where fl (xl, x2, u) = f l (xl, u). [] In view of these conditions, and using available observers for systems in some particular forms, one might be able to design observers for further nonlinear systems. As an example, one can obtain in this way, and on the basis of observer (1.9) for system (1.8), a non uniform observer for a class of cascade block state affine systems of the following form:
~c
:bl x2
= =
Al(u,y)xl + Bl(u,y) A2(u,y, xl)x2 + B 2 ( u , y , xl)
2Cq Yl
~~-
Aq(u,y, xl,...Xq_l)Xq-]-Bq(U,y, CIZ1
yq
-~-
Cqxq
Xl,...Xq_l)
(1.26)
where xi E ~ n , : yi C zT~V~,u E ff~'~,y = ( y T ...yT)T = Cx and Ai,~i are continuous functions 9 Here the stability of the interconnected observer can only be guaranteed provided the inputs are "rich enough". Denoting by x~(t, xo) the projection of the solution onto llr~TM + ....... which takes components from 1 to nl + . . . ni of x, a~i(t, u, x0) the extended input
cx
(t, t ,x~
,
14
1. A Viewpoint on Observability and Observer Design for Nonlinear Systems
and B(u) the set {x c s II~(t,u,x)ll < c~, t e [ 0 , + ~ [ } of initial conditions generating bounded trajectories with u, we define this "richness" as: D e f i n i t i o n 1.8 Given E C 1l~"~, an input u will be said to be E - r e g u l a r l y persistent .for E~ if.for any compact K of E such that K A 13(u) ~ ~ there exist to > O, (~ > O, T > 0 such that: Vt>_to;VxEKAB(u), Theorem
Vi=ltoq-1,
~/(t,T,w~(t,u,x))>_a.
1.6 [~] Given E C 1Ft'~, assume that:
9 For i = 2 , . . . q, Ai, Wi are globally Lipsehitz w.r.t. ( X l , . . . x i - 1 ) uni.formly w.r.t. (u, y). 9 Input u is E-regularly persistent .for (1.26). 9 x(O) e E n u(u). then .for any ~ > O, there exist 01 > 0 , . . . ,0q > 0 and A > 0 such that the .following system: A1 (u, y)21 + ~ (u, y) - g~-IcT(c~&~ - yl) A2(u, y, ~,)~2 + ~2(u, y, &~) - S ~ c T ( c 2 2 2 - y2)
~E1
A q ( u , y, X l , . . .
Oc
~1
=
- - 0 ~ 1 -- A T ( ~ , y ) g l -02S2
~q
=
, Xq-1)Xq ~- (~:~q(U, y , : E l , . 9 . , : ~ q - 1 )
-
- glAI(~,y)
m~(u,y,2~)S2
-OqSq - AT (u,y,2l,
+ CyC~,
-
SqlCT(Cqxq
-- Yq)
g~(O) > 0
- S2A2(u,y, xl) + cTc2,
. . . ,~q-1)Sq - SqAq(u,y,~l,
g2(0)
>
0
. . . , ~ q - 1 ) + c T Cq,
gq(O) > 0 (1.27)
is an observer.for (1.26), with: ][2(t) - x(t)[] _< Ae -r [] The above design illustrates the case of cascade interconnection. As an example of "full interconnection" of observers, let us consider system of the following form: d:l X2 Yl Y2
= = = =
Alxl-b fl(Xl,U)+gl(xl,x2,u); A2(u)x2 ~ - f 2 ( x l , u) : : r ClXl C C2x2 C1R p
Xl C ]~ TM x2 E ~ n 2
(1.28)
with: (C1) A, C as in (1.10) and f ( x l , u ) Lipschitz assumptions;
satisfying uniform observability and
1. A Viewpoint on Observability and Observer Design for Nonlinear Systems
15
( 6 2 ) g(Xl,X2,U ) : ( 0 , . . . O, g n ( Z l , X 2 , U ) ) T and gn is globally Lipschitz w.r.t, xl (resp. x2), uniformly w.r.t. ( x 2 , u ) (resp. (Xl,U));
(C3) f2 is globally Lipschitz w.r.t, xl, uniformly w . r . t . u . For such a system, one can easily identify two subsystems for which (1.11) and (1.9) are candidate sub-observers, and on the basis of the associated Lyapunov functions [16, 17], one can check t h a t as soon as u is regularly persistent for ~ = A2(u)z, conditions of theorem 1.4 can be satisfied for 01 large enough. This gives an observer of the following form: X1 s x2 0 602
=
= -=
A13Cl q- fl(a?l, u) q- gl(~l, a:2, u) - - S l l C I T ( C l g C l Yl); A2(u):?2 + f2(:rl,U) - $21c2T(c2:c2 - y2); 01S1 -- A1Ts1 -- SIA1 + cT1c1 - 0 2 S 2 - d 2 ( u ) T & - S e A s ( u ) + C~C2; $2(0) > O.
Notice t h a t here the observer gain is non uniform due to the state affine p a r t of the system. But one could imagine a similar case where some uniform gain can be used, provided that detectability is guaranteed. As an example, consider system (1.28) again, now with 6'2 = 0 and p being some function now enjoying the following property: (C3') ~ is globally Lipschitz w.r.t. Xl, uniformly w.r.t. (u, x2) and there exists V positive definite s.t. V(~,e) 9 ~n2 , IIoy ll _< -9211ell and OV e ) + -~-(t, OV e)[~(~ + e , x(t), u ( t ) ) - ~(~, x(t), u(t))] _< -~111ell 2, for -57-(t, every admissible input function u and absolutely continuous function X. T h e n one can again check t h a t under conditions (C1), (C2), (C3'), an observer can be obtained as follows [7]: :~1 x2 0
5
= = :
A l X l q- f l ( X l , U ) q- gl(Xl,3:2, u) - S l l C 1 T ( C I : ~ I - y l ) qp(x2, :~1, u) -01S1 - ATs1 - SIA1 -/- cT1c1 .
(1.29)
S t a t e T r a n s f o r m a t i o n s and O b s e r v e r D e s i g n
One can notice t h a t observer designs presented till now are all based on a particular structure of the system. The subsequent idea is t h a t these designs also give state observers for systems which can be turned into one of these forms by change of state coordinates. We will call equivalent, two systems related by such a relationship: D e f i n i t i o n 1.9 Given xo E 1R n, a s y s t e m described by: 5c = y =
f(x,u)=fu(x)xe~n,u 9 h(x) 9 1Rp
"~ (1.30)
16
1. A Viewpoint on Observability and Observer Design for Nonlinear Systems
will be said to be e q u i v a l e n t at xo to the system: { ~
=
F(z,u)=F~(z)
y
=
H(z)
~f there exists a diffeomorphism z = q~(x) defined on some neighbourhood of xo such that: Vu
e/R m,
0q5 -~xf~(x) I~=o-l(z)= F~(z)
et
h o q5-1 --- H.
The interest of such a relationship for observer design is then motivated by the following proposition: P r o p o s i t i o n 1.2 Given two systems (El) and (E2) respectively defined
by: X(x,u) { ~ = Z(z,u) h(x) and(r2) y = H(z) and equivalent by z = C~(x), If: Z(~,u) + k ( w , g ( # ) (21)
{ ~ y
= =
(o~)
w
=
~-
=
x(:~,~)+
(v
=
F(w,u,y)
is an observer for (E2), Then: (o~)
{
y))
r(w,u,y)
(0o), ~
k, k(~,, h(:~) - y)
is an observer .for (El). This kind of remark has motivated various works on characterizing (rank observable) systems which can be turned into some "canonical form" for observer design, from the linear one up to output injection [23, 8, 24] to several forms of cascade block state afiine systems up to nonlinear injections from block to block [3, 26, 4, 5]. Using the formalism of differential forms [9] e.g. used in [19], we can indeed characterize systems equivalent to "special forms" of (1.26) (a general characterization would further require the use of explicit PDE's in its formulation). With the following notations:
9 d, L z , iz, A to respectively denote usual differentiation, Lie derivative along a vector field Z, inner product with Z and exterior product of differential forms; 9 d~v := dVl A . . . A d v , , and i x v := ( i x v l , . . . i x v ~ )
ifv = (vl,...v~);
1. A Viewpoint on Observability and Observer Design for Nonlinear Systems
17
9 i x ~ : = { i x w , w 9 f~}, Adf~ : = {wlA...Wd,Wi 9 Q}, df~ : = {dw,w 9 f~} if f~ is a set of differential forms; 9 f~ | O to denote the set of finite linear c o m b i n a t i o n s of elements of ft with coefficients in O; 9 i x w : = i x .... i x , . w if w is an (r + 1)-differential form, a n d X ( X 1 , . . . X ~) an r - t u p l e of vector fields;
=
9 (9(y) to d e n o t e the observability subspace of the considered s y s t e m with o u t p u t y, n a m e l y the smallest vectorial subspace of ~ which contains all o u t p u t functions, and is invariant u n d e r Lie derivation along the vector fields of the system, o b t a i n e d w h e n u describes ~ m ; let us privilege systems (E~,...~') of the following form:
A l ( u , y l ) z l + ~ l ( U , y 1) A 2 ( u ' y2, Z l ) Z 2 ~_ qp2 ( u , y 2 , ~.q
=
y
=
~'~1. . .7~q
Aq(u,
Zl )
yq, Z l , . . . Zq--1)Z q ~- r
y q , . . . Zq_l)
(1.31)
Cqzq ;q . U 9 j ~ m ~Zi 9 ~2~n.i y~ 9 H:C~, a s s u m e d to satisfy the following cascade rank observability condition at x0: for any x in some n e i g h b o u r h o o d of x0,
dimdO(yl)(x) difrtdO(y 2) A dnl O(yl )(x) d i m d O ( y q) A d TM O ( y l ) . . .
/~
dnq-lO(yq-1)(X)
~-
n 1
~.
n2
~-
nq
(1.32)
where ni E Pc'* such t h a t ~ i =q 1 ni = n. We will call those integers cascade observability indices. This c o n s t r u c t i o n means t h a t in (1.31), variables zl of each block are exclusively "observed" by o u t p u t hi, as soon as Z l , . . . zi-1 are k n o w n . T h e characterization of such systems will use the following tools: Given n .... Vm,q 9 fV*, V m - t u p l e s of functions y m = (y~n,...y~,~,,) a n d v , ~ - t u p l e s of vector fields X "~, for 1 < m < q, we define: 9 ~ ( y m ) the space such that dTl(y m) A d'""y "~ -= O, (f~x''(y~,.) = Span~{dLi.,,(y jfn, ) A d vl~~y rn, ,u E ~:~nq, ,1 <_ j -< /]m} | .~k+liY - . x ' ' . ,~,,) = Span~{diI..ix,,,~(~) A d'""ym,u E f~ m, [a~",.(ym)
~.~
..~',,.,
Lk=z~k
m,
[Y )
1. A Viewpoint on Observability and Observer Design for Nonlinear Systems
18
9 Oy~...y,,+l the space of functions such that: I . 9 rL n
m
(d"r
A
A
ix~x"
1
T~),+ 1
(YJ)) Ad~'""+lYm+I AdOY,::~,,,
=0,
j=l
9
f~'"'"(X;)
the space A ~ ~ ( d ' J y
j A A "j-~j
ix.~t~x"(yJ)) A f U ' " ( y TM) |
,.,y~r~ ( ~ " 1 ......... l for rn > 1 and a x ' ( y l ) @ ~_((yl) for m = 1. ?4t
These definitions associate to each block (m) - c o r r e s p o n d i n g to a l] m tuple of o u t p u t s ym _ the set of functions of these o u t p u t s T/(y'~), a space of differential (urn + 1 ) - f o r m s x ' " , a set of functions 6) _Yn ~1 "Y "+~ g a t h e r i n g , ,nTT~, all functions of zl to zm-1 a n d a m o d u l e to characterize the state affine s t r u c t u r e of each block. We can t h e n state: T h e o r e m 1.7 A nonlinear s y s t e m (1.1) cascade observable w.r.t, outputs h = ( y l , . . . y q ) in the sense of (1.32) is equivalent at xo to a s y s t e m I11 . . . t J q (E . . . . ... . ,) descmbed by (1.31) If " a n d o n l y If 9 y i E J~ , ", ( n l , . . . n q ) are cascade observability indices of (1.1) and there exist q Um-tuples of vector fields X m = ( X ] n, X m ] 1 .< r e < q ,. such. that, . .for l < r e < q :
9
9
.
.
l]m.
/
~
1. Lxj,,(y'~') ----0 if j • k and 1 otherwise, .for 1 < j , k < p yl ..,ym, 2. d i m ( f P " ~ ' " (E)) = n,,, - u m on On, ......... I
[ dix,,,~x'"(y ') A A~'-I (A "~-'~ ~x..,ex'(y j) A d~Jyj) = 0 3.
m-1 J dix, " ~x'" ( ym ) A A~:I(A v.,
" ~ - ~ i x. ~ X~ ( yJ ) A d ~ 'y j ) rn--1
nj--uj
.
~'"a~'"(y~) Adh,A A ( A i~,a~"(YJ) Ad~JY~)| k
1=1 q
j=l n , ) --~,:j
~ A(e~"y ~A A "
x~
j=l
This s t a t e m e n t follows previous results of [20], [4] or [6], a n d can be checked by the same kind of arguments: necessity is o b t a i n e d by verifying t h a t conditions I to 4 are indeed satisfied for a s y s t e m of the f o r m (1.31) w i t h X~ ~ = a a n d sufficiency is established by inductively defining new c o o r d i n a t e s OgF" under the form dzj A dzy-1 A ... A dzl = M j ( y j, z j - 1 , . . , z l ) i x J f t Xj (yY) A dZj_l A ... A d z 1 where M can be found on the basis of condition 3 along the same lines as in [6]. T h e p r o b l e m in such a characterization is t o find a p p r o p r i a t e vector fields X i. Let us sketch a constructive procedure giving such vector fields in the case of systems equivalent to (1.31) where each block takes the following
1. A Viewpoint on Observability and Observer Design for Nonlinear Systems
19
form: Zil
~-- A~I (u, z i _ l , Yi)+A~2 (u, _zi-1)zi2
Zi2 Zi3
~-~--
A ~ (u, z_~_~,yi) + A~2(u, z~_~, y~)z~2 + A~a(U, Z~_l)Z~3 Ai31(u,z__i_l , Yi) +A32(u, i i (u, _zi_ 1 , Y i ) Z i 3 Zi-- 1 , Y i ) Z i 2 ~-A33 +A~4(u, z _ i _ l ) Z i 4
Zini_ 1
Ai~-I,I(u, zi-1, Y i ) + A ni~ - l , 2 ( u , z i - 1 , y i ) z i 2 + i Ai.,:-1,3( u, _zi-1, Y i ) Z i 3 Jr- An.~_l,4(u, Zi-1)Zi4
i (~, z~_l)z~, +... + An.,:_l,n.~
z~n~
i Z = An,l(u,_~_l,yi) + A~2(u,Zi_l,yi)zi2+
yi
= Ciz~ = z~l E ~:t
9
A , , , 3 ( u , - i -Z1 ,
Y i ) zi3 -~ A ni ~ 4 ( u , z i - 1 ) z i 4
~- "" ~-
A~,~,
(u Zi_l)Zin,,
(1.33) where ( Z i l , . . . , z~,~,.)T = zi and z~_ 1 = (z~, z T , . . , zT_I) T for i > 2 a n d is e m p t y otherwise 9 Notice t h a t any s y s t e m equivalent to a form (1.31) where A~ (u, z i _ l , yi) = Ai(u, zi_l) - as characterized in [4] - is equivalent to a form where each block has this triangular s t r u c t u r e (1.33). Notice also that, as in the case of state-affine equivalent s y s t e m s where A(u, y) does not d e p e n d on y, e.g. considered in [20], one can c o m p u t e sets of c o n s t a n t control sequences I~ in the form { (uiH, " " " ~tilk)' " " " (~tivil' " . U ..,k } such that: { d L f ~ ( h l ) , k = 0 . . . r l } spans dO(hi) a n d inductively,
{dLf,~(hi) , k = O. ". r~}AAi~_~dO(hl ) spansdO(hi) A Al=l i-1 dO(hi) (where hi denotes the o u t p u t function for Yi and Lfq: (hi) is the vector of c o m p o nents LI, h ... Lf,,t (hi))9 On this basis, one can inductively c o m p u t e candidates for X 1 to X q on the s a m e p a t t e r n , given hereafter for X I : 9 C o m p u t e Y (uniquely) defined by:
gy(hl) L r L L } (hi)
=
0,
=
0 if j % T1, 1 otherwise;
LyLL~(hz)
=
0
and forj=l
forj=l,..rt,
top
(1.34) (1.35)
l=2,...q.
9 B y successive Lie Brackets, c o m p u t e Y,~....... 3 := [fv~,-,,... [f-~, Y ] . . . ] a n d Y~...... , := [f,,,, Y,~,_~ .... ~] for some c o n s t a n t vi's, a n d set: Lf,,, (hi) Z := Yv,,...Vl + - 9 Check: d L z ( h l ) =- 0; L z ( h l ) ~ 0 and finally set: X 1 .--
1
Lz(hl)
Z.
20
1. A Viewpoint on Observability and Observer Design for Nonlinear Systems
One can check by inspection that such a construction necessarily gives a candidate for X 1. The same construction can be used to find X 2 to X n and finally, verification of conditions of Theorem 1.7 reduces to differentiations and tests of linear dependencies. In this way, the particular structure (1.33) can be fully intrinsically characterized, as this is done for several cases in [6].
6
Conclusions
The purpose of this chapter was to draw some lines of recent advances in the problem of observer design for nonlinear systems and highlight several further directions of research. In particular, the problem of the input has been underlined for the observability properties of the systems, and several aspects of observer designs based on interconnection of sub-observers as well as state transformations have been discussed. In terms of the technique used for the design, obviously further methods can be thought of, for instance including optimization [25], sliding modes [27] etc.
Acknowledgement The author would like to thank Professors Hassan Hammouri and Guy Bornard for having awoken and fed his interest in nonlinear observers.
7
REFERENCES [1] G. Besan~on. Contributions d l'Etude et i~ l'Observation des Syst~mes Non Lin~aires avec Recours au Calcul Formel. PhD thesis, Institut National Polytechnique de Grenoble, 1996. Laboratoire d'Automatique de Grenoble. [2] G. Besan~on and H. Hammouri, "On uniform observation of nonuniformly observable systems," Systems ~4 Control Letters, vol. 33, no. 1, pp. 1-11, 1996.
[3]
G. Besanqon and G. Bornard. "A condition for cascade time-varying linearization," in IFA C Proc., Nonlinear Control Systems Design Symposium, Tahoe City, CA, USA, pp. 684-689, 1995.
[4] G. Besan~on, G. Bornard, and H. Hammouri. "Observer synthesis for a class of nonlinear control systems," Europ. Journal of Control, vol. 3, no. 1, pp. 176-193, 1996.
1. A Viewpoint on Observability and Observer Design for Nonlinear Systems
21
[5] G. Besanqon and G. Bornard. "State equivalence based observer design for nonlinear control systems," in Proc. IFAC World Congress, San Francisco, CA, USA, pp. 287-292, 1996. [6] G. Besan~on and G. Bornard. "On characterizing classes of observer forms for nonlinear systems," in Proc. ~th European Control Conf., Br~tssels, Belgium, 1997. [7] G. Besan~on and H. Hammouri. "On observer design for interconnected systems," Journal o.f Mathematical Systems, Estimation, Control, vol. 8, no. 3, 1998. [8] D. Bestle and M. Zeitz. "Canonical form observer design for nonlinear time-variable systems," Int. Journal of Control, vol. 38, no. 2, pp. 419431, 1983. [9] W. M. Boothby. An Introduction to Differentiable Manifolds and Riemannian Geometry. Academic Press, New York, 1975. [10] G. Bornard, N. Couenne and F. Celle. "Regularly persistent observer for bilinear systems," in Proc. of the Colloque International en Automatique Non Lin~aire, Nantes, June 1988. [11] G. Bornard, F. Celle-Couenne and G. Gilles. "Observability and observers," in Nonlinear Systems - T.1, Modeling and Estimation, pp. 173 216, Chapman & Hall, London, 1995. [12] K. Busawon. Sur les Observateurs pour des Syst~mes Non Lin~aires et le Principe de S~paration. PhD thesis, Universit~ Claude Bernard, Lyon I, 1996. [13] D. Dawson, Z. Qu and J. Carroll. "On the state observation and output feedback problems for nonlinear uncertain systems," Systems Control Letters, vol. 18, pp. 217-222, 1992. [14] J.P. Gauthier and G. Bornard. "Observability for any u(t) of a class of nonlinear systems," IEEE Trans. on Automatic Control, vol. 26, no. 4, pp. 922-926, 1981. [15] J.P. Gauthier and A. Kupka. "Observability and observers for nonlinear systems," Siam Journal on Control and Optimization, vol. 32, no. 4, pp. 975-994, 1994. [16] J.P. Gauthier, H. Hammouri and S. Othman. "A simple observer for nonlinear systems - applications to bioreactors," IEEE Trans. on Automatic Control, vol. 37, no. 6, pp. 875-880, 1992. [17] H. Hammouri and J. D. L. Morales. "Observer synthesis for stateaffine systems," in Proc. 29th IEEE Conf. on Decision and Control, Honolulu, Hawaii, pp. 784-785, 1990.
22
1. A Viewpoint on Observability and Observer Design for Nonlinear Systems
[18] H. Hammouri and F. Celle. "Some results about nonlinear systems equivalence for the observer synthesis," in New Trends in Systems Theory, pp. 332-339, Birkh~user, 1991. [19] H. Hammouri and J.P. Gauthier. "Bilinearization up to output injection," Systems ~ Control Letters, vol. 11, pp. 139-149, 1988. [20] H. Hammouri and M. Kinnaert. "A new formulation for time-varying linearization up to output injection," Systems ~ Control Letters, vol. 28, pp. 151-157, 1996. [21] S. Hara and K. Furuta. "Minimal order state observers for bilinear systems," Int. Journal of Control, vol. 24, no. 5, pp. 705-718, 1976. [22] R. Hermann and A. Krener. "Nonlinear controllability and observability," IEEE Trans. on Automatic Control, vol. 22, no. 5, pp. 728-740, 1977. [23] A. J. Krener and A. Isidori. "Linearization by output injection and nonlinear observers," Systems ~ Control Letters, vol. 3, pp. 47-52, 1983. [24] A. J. Krener and W. Respondek. "Nonlinear observers with linearizable error dynamics," Siam Journal on Control and Optimization, vol. 23, no. 2, pp. 197 216, 1985. [25] H. Michalska and D. Mayne. "Moving horizon observers," in IFAC Proc., Nonlinear Control Systems Design Symposium, Bordeaux, France, pp. 576 581, June 1992. [26] J. Rudolph and M. Zeitz. "A block triangular nonlinear observer normal form," Systems ~ Control Letters, vol. 23, pp. 1-8, 1994. [27] J. J. E. Slotine, J. Hedrick and E. Misawa. "On sliding observers for nonlinear systems," Journal of Dynamic Systems, Measurements, and Control, vol. 109, pp. 245 252, 1987. [28] J. Tu and J. Stein. "Model error compensation and robust observer design - part 2: Bearing temperature and preload estimation," in Proc. American Control Conference, Baltimore, Mawland, USA, pp. 33083312, June 1994. [29] B. Walcott and S. Zak. "State observation of nonlinear uncertain dynainical systems," IEEE Trans. on Automatic Control, vol. 32, no. 2, pp. 166-170, 1987.
M o d e l - B a s e d Observers for T i r e / R o a d C o n t a c t Friction Prediction Carlos Canudas de Wit 1, Roberto Horowitz 2 and P. Tsiotras 3 1Laboratoire d ' A u t o m a t i q u e de Grenoble, U M R CNRS 5528 E N S I E G - I N P G , ST. Martin d'H~res, France. 2Department of Mechanical Engineering, University of California Berkeley, CA 94720-1740, U.S.A. 3Georgia Institute of Techology, School of Aerospace Eng. Atlanta, Georgia 30332-0150, U.S.A.
1
Introduction
This contribution is devoted to the problem of tire-road friction estimation. The need for such type of studies, steers from the difficulty of direct sensing of tire forces, slip, slip angles and other external factors. Observer algorithms are, in this context, a low cost alternative for sensors. Tire forces information is relevant to problems like: optimization of Anti-look brake systems (ABS), traction system, diagnostic of the road friction conditions, etc. Literature for tire/road friction estimation is numerous. Bakker et al [1] and Burckhardt [4] describe two analytical models for t i r e / r o a d behavior t h a t are intensively used by researchers in the field. In these two models the coefficient of friction, #, or more precisely, the normalized friction force, i.e. F Friction force # - F~ Normal force is mainly determined based on the wheel slip s and some other p a r a m e t e r s like speed and normal load. Fig. 1 shows two curves, obtained from H a r n e d et al [9], t h a t represent typical # versus s behavior. It is current practice to n a m e the ratio between the friction and the normal forces, #, as being the "coefficient" of friction. Under constant normal force conditions, #, is a constant if and only if the Coulomb model is used to describe friction. Nevertheless, the Coulomb model is too simplistic to suitable represent forces between the rubber tire and the road, which are dominated by the elesto-plastic force/displacement characteristics. Therefore, to consiser it as a constant is a pure idealistic view. p should thus
24
2. Model-Based Observers for Tire/Road Contact Friction Prediction Relali~ship ol ~
andi
Relallonship of ~ and i !
a~pha,'t
,
~
.
.
.
o~
.........
~
o~
. . . .
.
~M~
.......
:4o MP~
:
L ~ e g~avel .......
i
......
;
.
.
.
.
.
~o
_I o~1 i
0.1
02
03
04
05
06
LonOIttdnal sl~
07
08
09
1
-- ol
ol2
0'3
o'4
o'.5 or6 Long#u~naL slW
ol7
018
ol9
FIGURE 1. a) Variations between coefficient of road adhesion # and longitudinal slip s for different road surface conditions (left). b) Variations between coefficient of road adhesion # and longitudinal slip s for different vehicle velocities (right). be viewed more as the ratio between friction and normal forces (i.e. the normalized force), which is indeed a (static or dynamic) function of the system state variables. The expression given by Bakker et al [1], and Paceijka and Sharp [14], also known as "magic formula" is derived heuristically from experimental d a t a to produce a good fit. It provides the tire/road coefficient of friction # as a function of the slip s. The expression in Burckhardt [4] is derived with a similar methodology. The final m a p expresses # as a function of s, the vehicle velocity, v and the normal load on the tire F,~. Kiencke [10] presents a procedure for real-time estimation of #. A simplification to the analytical model by Burckhardt [4] is introduced in such a way that the relation between # and s is linear in the parameters. Kiencke [10] uses a two stages identification algorithm. In the first stage, the value of # is estimated. This estimate of # is used in the second stage to obtain the parameters for the simplified # versus s curve. The paper by Gustafsson [8] derives an scheme to identify different classes of roads. He assumes t h a t by combining the slip and the initial slope of the # versus s curve it is possible to distinguish between different road surfaces. The author tests for asphalt, wet asphalt, snow and ice and identifies the actual value of the slope with a K a h n a n filter and a least square algorithm. Ray [16] estimates # based on a different approach. Instead of using the slip information to derive a characteristic curve, Ray [16] estimates the forces on the tires with an extended K a l m a n filter. Using a tire model introduced by Szostak et al [17], that expresses the tire forces as a function of #, the author tries this model for different values of #. A Bayesian approach is used to determine the value of # that is most likely to produce the forces estimated with the extended K a l m a n filter. The works of Kiencke [10], Gustafsson [8], and Ray [16] do not consider any velocity dependence in the derivation of #, as suggested by B u r c k h a r d t
2. Model-Based Observers for Tire/Road Contact Friction Prediction
25
[4] and Harned et al [9]. An a t t e m p t to consider the velocity dependence for ABS control is presented in Liu and Sun [13]. The authors assume the tire/road characteristics to be known. Due to the limitations in the available data, the authors are not able to compare their algorithm with other methods. There are other works related to the on line identification of the t i r e / r o a d friction, as for example Lee and Tomizuka [12], and Yi and Jeong [18]. However, in these papers only the instantaneous coefficient of friction is identified. The coefficient of t i r e / r o a d friction, or coefficient of road adhesion, # is mainly a function of the longitudinal slip, the velocity of the vehicle and the normal load. The estimators proposed in the literature depends very much on the type of used models, and verification of the hypothesis used for the model derivation. As shown by the figures above, the relation of the curves # - s , depends very much on system operating conditions, such as the vehicle velocity. It is clear t h a t p a r a m e t e r s describing a curve like the one in Fig .1-(a), will not be invariant, as shown in Fig .l-(b). It is thus interesting to introduce models described by parameters t h a t are more likely to be invariant and have physical significance. Theory never exactly matches reality, but some times closely resembles it. To achieve this goal, we propose in this paper to use a dynamical t i r e / r o a d friction model, together with a nonlinear observer specifically designed for this application. This paper is organized as follows: The next section reviews some of the existing tire/road friction models, and also introduces lumped and distributed dynamic representations. In Section 3 we set-up the observation problem, using the particular case of a one-wheel system with lumped contact friction. Inspired from previous works by Canudas-deWit and Lischinsky [6] on adaptive friction estimation and compensation, Section 4 presents a general framework for the design of nonlinear observers for the on-line estimation of the road conditions. In Section 5 we apply this design to the case study case set in Section 3. Finally, Section 6 presents simulation results.
2
Tire-road Friction Models
This section reviews some friction models t h a t can be used for the study of the on-line identification of the friction force (or coefficient, if we consider normalized force). We first present the some of the pseudo steady-state models proposed in the literature, then we discuss some alternative dynamic (lumped and a distributed) models. T h e sep up for this study is the simple case of an one-wheel model with tire-road contact friction, shown schematically in Fig. 2. In this s t u d y we
26
2. Model-Based Observers for Tire/Road Contact Friction Prediction Wheel with
Wheel with
lumped friction F
distributed friction F
or
p
F I G U R E 2. One-wheel system with: lumped friction (left), distributed friction (right) will thus consider a s y s t e m of the form m~
=
F
J&
=
-rF
(2.1) + u,- - cr~w ,
(2.2)
where: m - wheel mass, J
wheel inertia,
r - wheel radius, v
linear velocity,
w - angular velocity, u~ - b r a k i n g / d r i v i n g torque, F
t i r e / r o a d friction force.
Therefore, only longitudinal m o t i o n (longitudinal slip) will be considered. 2.1
Pseudo-Steady
State
Models
This t y p e of models are currently used in the literature. T h e y are defined as one-to-one ( m e m o r y less) maps between the friction F , a n d the longitudinal slip r a t e s, defined as: s=
.... r ~r e -d v
if if
v > rw, v # 0 v
breaking driving
(2.3)
2. Model-Based Observers for Tire/Road Contact Friction Prediction
27
The slip rate results from the reduction of the effective circumference of the tire (consequence of the tread deformation due to the elasticity of the tire rubber), which implies t h a t the ground velocity will not be equal to v = r0a. T h e slip rate is defined in the interval [0, 1]. W h e n s = 0 there is no sliding (pure rolling), whereas s = 1 indicates full sliding. In opposition to steady-state friction models, the pseudo s t e a d y - s t a t e (PSS) models aim at describing the shapes shown in Fig.1 via static m a p s F ( s ) , m a p p i n g s to F, and not as a steady-state relation between contact relative velocity and friction forces. T h e y are n a m e d pseudo steady-state because some of these models depend on the vehicle velocity v, i.e. F ( s , v). T h e models also depends on the operating conditions, although they are only valid under steady-state conditions. One of the most well known models of this type is Pacejka's model (see, Pacejka and Sharp [14] ), also known by the n a m e of "magic formula". This model has been shown to suitably m a t c h experimental data, obtained under particular conditions of constant linear and angular velocity. T h e Pacejka model has the form F ( s ) = e~ sin(e2 arctan(~3~ - ~4(~a8 - a r c t a n ( ~ a ~ ) ) ) ) , where the c~s are the p a r a m e t e r s characterizing this model. The p a r a m e t e r s can be identified by matching experimental data, as shown in Bakker et az [1]. The model proposed by Burckhardt [4] for the t i r e / r o a d friction characteristics is of the form F ( s , v) : (c1(1 - e - c ' ) - eas) e - c ~ v
(2.4)
where cl, 999 , c4 are constants. T h e normal load at the tire is kept constant in this model. Note also the velocity dependency of this model, seeking to m a t c h variations like the one shown in Fig. l-(b). Kiencke and Daiss [11] neglect the velocity dependent t e r m in Eq. (2.4) and a p p r o x i m a t e the curve by F(s) = Ks
8
cls 2 + c2s + 1 '
(2.5)
where Ks is the slope of the F ( s ) versus s curve when s = 0 and Cl and c2 are properly chosen parameters. Notice t h a t Eq. (2.5) is only dependent on the slip s. T h e value of Ks is assumed to be known. Kiencke and Daiss [11] choose a fixed value of a b o u t 30 ~ for it. Alternative, Burckhardt [3] proposes a simpler three p a r a m e t e r s model, F(~)
= ~1(1 - e - c . )
_ ~s
Since these models are highly nonlinear in the unknown parameters, they are not well adapted to be used for on-line identification. For this reason, simplified models like F(~)
= clVq
- ~2~
28
2. Model-Based Observers for Tire/Road Contact Friction Prediction
are used in connection with a linear recursive identification algorithms, has been proposed in the literature. A part from the nonlinearity in the unknown parameters, the m a j o r limitation of this models seems to steer from the fact t h a t the unknown p a r a m e t e r s are not really invariant, they may strongly depend on the tire characteristics (such as compound, tread type, tread depth, inflation pressure, temperature), on the road conditions (such as type of surface, texture, drainage, capacity, temperature, lubricant, i.e. water or snow), and on the vehicle operational conditions (velocity, load), see P a s t e r k a m p and Paceijka [15]. As an alternative to the pseudo static models t h a t depend instantaneously only on s (memory-less models), dynamic models based on the preliminary studies on dynamic friction models of Dahl [7], can be a d a p t e d to suitably describe the road-tire contact friction. The Dahl's models leads to a friction displacement relation t h a t bears much resemblance with stressstrain relations proposed in classical solid mechanics. A potential advantage of such models is their ability to describe some of the physical phenomena found in r o a d / t i r e friction (such as: hysteresis loops, pre-sliding displacement, etc), as well as their dependance on p a r a m eters t h a t m a y have physical meaning. Although the p a r a m e t e r s of such models, m a y also depend on some of the factors mentioned above, other p a r a m e t e r s m a y be more like to be invariant, or be more directly related with the phenomena to be observed, like for instance the change on the road characteristics (i.e. dry wet, etc.). Dynamic models can be formulated as a lumped or distributed models, as shown in Fig. 2. This distinction will be discussed next.
2.2
Lumped Dynamic
Models
A lumped friction model assumes punctual tire-road friction contact. An example of such a model can be derived from the LuGre model (see Canudas et al, [5]). This model differs from the one in [5] in the way t h a t the function g(v) is defined. Here we propose to use the t e r m e -Iv'/v.'lm/2 instead the t e r m e -(v'/~.')2 as in the LuGre model in order to better m a t c h the pseudostationary characteristic of this model (map s H F(s) ) with the shape of the Paceijka's model, as it will be shown later. T h e model of Canudas et al, [5] is written:
: F
=
o-o1 ,-I (o-oz +
(2.6) + o-yv,.) Fn
with, = f f c + (ffs - ffc)
(2.7)
2. Model-Based Observers for Tire/Road Contact Friction Prediction
29
where, a0 - rubber longitudinal lumped stiffness, a l - rubber longitudinal lumped damping, (r2 - viscous relative damping, # c - normalized Coulomb friction, # s - normalized Static friction,
# c _< # s , C [0, 1],
v s - Stribeck relative velocity, Fn - normal force, vr - relative velocity --- (rw - v), z - internal friction state. R e m a r k : This model, has the following i m p o r t a n t properties: (i) if Iz(0)l < # s / a o , thus iz(t)l < # s / n o ,
Vt >_ 0,
(ii) oc > # s >_ g(v~) >_ # c > 0, Vv~ (iii) the right hand side of (2.6) is Lipschitz (globally if v~ is assumed bounded, and locally if not). In particular P r o p e r t y (i) ensures that the internal friction states are bounded and that its upper bound is given by the static friction p a r a m eter (Property (ii). Property (iii), provides existence and uniqueness of a solution to (2.6).
2.3
Distributed
Dynamic
Models
Distributed models assume the existence of an area of contact (or patch) between the tire and the road, as shown in Fig. 2. This p a t c h represent the projection of the part of the tire that is in contact with the road. T h e contact patch is associated to the frame Op, with ( as the axis coordinate. The patch length is L. Distributed dynamical models, as well as their relation with the pseudostatic models, has been studied previously in works of Bliman et al [2]. T h e y propose second order rate independent model (similar to the Dahl ones), and have shown that, under constant v and w, there exist a choice of p a r a m e t e r s t h a t closely m a t c h a curve similar to the one characterizing the magic formula.
30
2. Model-Based Observers for Tire/Road Contact Friction Prediction
Similar results can be obtained by using a model based in the first-order LuGre friction model, i.e.
dhz
t) = F
v~
=
~0}v~{ 5z Lg(vr)
(2.8)
dF((, t ) d ( ,
(2.9)
with g(v~) defined as before and
d F = ( - ~ ~z + vh ~2 + o'2v,.) dF~ , where,
a o / L - rubber longitudinal distributed stiffness per length, d F - differential friction force, dF,~ - distributed normal force [F~/ L], v~ - relative velocity = (rw - v), ~z - differential internal friction state. Note t h a t in this formulation the differential internal friction s t a t e 5z((, t), depends on b o t h time t, and space (. Indeed, Eq. (2.8) describes a partial differential equation.
dhz (r dt
2.4
05z. = -z=(r uq
O~z + --~-((, t) = v~
Relation Between Distributed Dynamical Magic Formula
5z
cr~
Lg(v~) M o d e l a n d the
The linear motion of the differential dF in the patch frame Op is ( = rw, for positive w, and ( --- - r w , for negative ~z (the frame origin change location when the wheel velocity reverses). Hence ( = r]w I. We can thus rewrite (2.8) in the ( coordinates as
d~z dt ((,t)
-
d~z
_
d(
d~zd( d(
d~z
dt ( ( , t ) = - - ~ - I ~ 1
= vr
~o1~1 5 z + s s g n ( r w Lg(v~)
v) .
~olv~l 5z
(2.10)
Lg(vr)
(2.11)
Assuming t h a t v, and w are constant (hence also v~, and s), the above equation describes a linear space invariant system having the sign of the relative velocity as its input.
2. Model-Based Observers for Tire/Road Contact Friction Prediction
31
Considering a positive value for sgn(rw - v) over the space interval [~(t0), ~(tl)], or equivalent over [@, ~1], we have that the solution of the above equation is:
~ ,~,
5Z((1) = 5Z(r
Lg(vT) (1 -
_[_ -
e
-~(r162 ~'~"~)
(y0
Introducing this solution together with Eq. (2.11) in Eq. (2.9), and integrating with 5z(@) = @ = 0, we have that F(s), is given as: For the driving case:
F(s,w) = Fng(s) [l -
(1
- c&s)'g(S) ~ ( e - ~ - l)] +
(2.12)
with ~(s) = # c + ( . s - ~ c ) e - j ~ / ~ ' j ~ / ~
for some constant w, and s E [0, 1]. For the breaking case:
F(s,v)= F~g(s)[1-(1-chs) g(s)crOs( e - ~ -
1)] +
(2.13)
with g(~) = ~ c + ( ~ s - ~ c ) e - ' ~ / ~ ' l ' "
for some constant v, and s E [0, 1]. Figure 3 shows the plot of F(s) with the parameters shown in the table 2.1. Parameter CY0 (r 1 CY2
Value 40 4.9487 0.0018
Unit [N/m l [N. s/m]
IN-s/m]
#s
o.5 o.9
[-]
Vs
12.5
[m/s]
#c
H
TABLE 2.1. Data used for the plot shown in Fig. 3 and Fig. 4 Uncertainty in the knowledge of the function introducing the parameter O, as
g(v~), can
be modeled by
g(vT) = ~(v~) 0 ' where ~(Vr) is some nominal known value for g(vr). Computation of the function F(s, 0), from Eq. (2.13), as a function of 0, gives the curves shown
32
2. Model-Based Observers for Tire/Road Contact Friction Prediction Static
view of the distributed
LuGre friction model
i
O.E
0.7 o 0.E
i
0.5
E ~ 0.,1
.z
vO,3 IJ
~ 0.2
0,1 0'.1
012
013
01.4
015 0'.6 S [Slip rate]
0'.7
0'.8
0'.9
FIGURE 3. Static view of the distributed LuGre model (breaking case, with v = 20m/s = 72Kin~h). This curve shows the normalized friction # = F ( s ) / F , , , as a function of the slip rate s. in Fig. 4. These curves matches reasonable well the experimental d a t a shown in Fig. 1-(a), for different coefficient of road adhesion. Hence, the p a r a m e t e r 0, suitable describes the changes in the road characteristics. Note that the pseudo-static representation, Eqs. (2.12) and (2.13), does not depends on the patch length L. Hence, the p a r a m e t e r s obtained by feeding this model to experimental data, can also be used in the simpler lumped model. This model will be used in the sequel for the observation problem to be defined next.
3
Problem
Formulation
We consider the one-wheel model with the lumped t i r e / r o a d friction model, i.e. mi~
=
Fn(croz + crib) + Fncy2v,
(2.14)
J&
=
-rF,~(C~oz + ~:~,) - cr,,w + uT
(2.15)
o "oiV L z
(2.16)
with,
g(v ) =
+
-
c)e
,
2. Model-Based Observers for T i r e / R o a d Contact Friction Prediction
33
Static view of the distributed LuGre fdction model
0.7,
0.6
==
E o
9~-~~
i t/theta = 0.8
~o, z 0.5
1/theta = 0.6
= 0.~ E
l/theta = 0.4
0.1
0.1
0~.2
01.3
0~.4
0~.5 01.6 s [Slip rate]
0'.7
0.8
0'.9
F I G U R E 4. Static view of the distributed LuGre model, under different values for 1/0. Breaking case, with v = 20rn/s = 72Krn/h. This curve shows the normalized friction # = F(s)/Fn, as a function of the slip rate s.
w h e r e we have n e g l e c t e d t h e t e r m c~2 in t h e e q u a t i o n (2.15), a n d i n t r o d u c e d the p a r a m e t e r 0 to c a p t u r e v a r i a t i o n a n d u n c e r t a i n t i e s in t h e f u n c t i o n g(vr). T h e o b s e r v a t i o n p r o b l e m c a n be now f o r m u l a t e d as follows. Problem formulation: Assume that the lumped friction parameters w i t h 0 -- 1, has b e e n identified off-line, a n d a s s u m e t h a t t h e v a r i a b l e w, a n d v a r e m e a s u r a b l e from s o m e sensors ( t h e n e e d for t h e m e a s u r e of v m a y b e r e l a x e d later). T h e p r o b l e m is t o d e s i g n a n o n - l i n e o b s e r v e r for 0, t h a t allows t h e controller t o m o n i t o r t h e e v e n t u a l c h a n g e s in t h e r o a d conditions.
4
General Observer Design
C o n s i d e r t h e following s y s t e m :
:~ =
A x + B[O~(y,u,x)] + Ru + E y
(2.17)
0 = 0
(2.1s)
y
(2.19)
~-
CTz
w i t h y, 0, ~ ( y , u, x) c R,x ~ R n, a n d u ~ R "~. W e a s s u m e t h a t s y s t e m s t a t e s a r e b o u n d e d , a n d t h a t t h e following holds:
2. Model-Based Observers for Tire/Road Contact l~riction Prediction
34
A1) (A, C) is an observable pair, A2) One of the following properties holds for ~o(y, u, x): There exist a known function oo > Po >_ P(Y, u) >_ O, such that:
WXl,X2,
(i) [~(y,u, x J - ~ ( y , u , x2)l <_p(y,u)llxl-x2[I, (ii) I~o(y,u,x)[ < p(y,u)llxll, Vx, A3) The map r ~-~ ~ of the system
x = ~] =
[A-KCT]~c+Br cT'2
(2.20) (2.21)
is strictly passive, i.e. V Q > 0, 3 P = p T > 0, and K, such that
P[A - K C v] + [A - KCT]T P
=
-Q
(2.22)
PB
--
C.
(2.23)
A4) The trajectories of the system (y(t), u(t), x(t)), satisfy: lim ~(y(t), u(t), x(t)) # 0
t ---*O0
Under this hypothesis, we propose the following observer structure: x
=
A~ + B 0~(v, ~, S:) + R~ + E y + K ( y - ~)) + m,~
(2.24)
0
=
u2
(2.25)
~]
=
cT$c
(2.26)
where ul, and u2, are design variables, which will be defined subsequently. Introducing the error variables: =
x- k
(2.27)
0- 0 y - ~) = c T g ,
(2.28) (2.29)
[ A - K c V ] 2 + Z [ O ~ ( y , u , x ) - 0~(y,u,k)] - Bul
(2.30)
-"2 cT~:,
(2.31) (2.32)
t) = = The error equation becomes
=
~1
=
where, e~(y, u, x) - ~ ( y , ~, 2) = ~ ( y , u, 2) + e [~(y, ~, x) - ~(v, ~, 2)]
2. Model-Based Observers for Tire/Road Contact Friction Prediction
35
Now, defining the L y a p u n o v function
w = ~Tp~ + lg2 a n d using properties A1, a n d A3, we have =
- - ~ T 0 ~ + 20 [9~(y, u, ~) -- 7 - % ]
(2.33)
+2~)0 [~(y, u, x) - ~(y, u, 2)] - y/21
(2.34)
Defining the a d a p t a t i o n law ~2 as ~'2 = 3'P(Y, u, 5:)~
(2.35)
we o b t a i n
<_
- x T Q yc + 21~1101 [~(Y, ~, x) - ~(y, ~, ~)1 - Z~x.
(2.36)
If A2 - (i) holds, then we have,
,r
<
_ ~ T Q ~ + 21~llOlp(u, u)ll:~ll - Y/21
(2.37)
_<
-qll~ll 2 + 211cTIIlOIp(U,u)ll~ll 2 -- ZY~'a
(2.38)
_<
--ll:~ll2(q-- 211cTIIIOIPo) - flv~ ,
(2.39)
where
q
=
/~rninQ.
Since (2.22) holds for any Q, the m i n i m u m eigenvalue of Q can be selected such t h a t the t e r m within the parenthesis of the last inequality is positive, i.e. q = 211cTIII01PO + q0 with any q0 > O, a n d Omax ~_ ]0]. Note t h a t a value of Oma~ can be o b t a i n e d from the knowledge on the road characteristics, as discussed in previous sections. In this case we can simply set Pl = O, to get x~ ~ - q o l l ~ l l 2 . In the second case, when only A2 - (ii) holds, we have
~r
<_ _ ~ T Q ~ c + 2[~[[Olp(y,u)(llx[ I _ [[~l[)
_
y/J1
(2.40)
<- - JcTQ ~c - l Y l [- 20maxp(y, u)([lXllmax -l]21])+sgn(9)t/1] (2.41) which suggests t h a t ~1 should be defined to have a high-gain c o m p o n e n t , i.e. ~'1 = 2 0 m o x K y , u)(llxllmax -- I[:~ll)sgn(~),
36
2. Model-Based Observers for Tire/Road Contact Friction Prediction
where Omax, and ]lxllmax, are respectively constant upper bounds of the p a r a m e t e r 0 and the state norm Ilxll. With this choice of ul we have as before t h a t 9 <_ -qlL~ll 2 . Thus, in b o t h of the cases considered by assumption A2 ~, and 0 are bounded, and ~ -~ 0. Finally, from the error equation (2.30), we have t h a t
u, x)} : 0, which together with assumption A4 leads us to conclude t h a t lim 0 = 8 .
W e have proved the following theorem: Theorem
2.1 Consider the .following system
0 y
=
A x + B IOn(y, u, x)] + R u + E y
(2.42)
= =
0
(2.43)
CTx
(2.44)
under the assumptions A 1 - A4, with y , 0 , ~ ( y , u , x ) u E R m. Then the following observer
with
u, ~)
x
=
A~ + B ~ ( y ,
+ R u + E y + K ( y - 9) + B~,I
0
=
7~(Y,u,S:)~
(2.46)
9
=
C r:?,
(2.47)
{o
"~ =
E R, x 6 R n, and
2 o m ~ p ( y , u)(llxll . . . . - I I ~ l l ) s g n ( 9 )
if if
A2-(i)holds A2-(ii)holds
(2.45)
(2.48)
ensures (under verification (~f A4), that lim ~) = 8.
t---~oO
5
Application to the One-Wheel Model
We consider the one-wheel model with lumped friction as described by the equations (2.14)-(2.16). As formulated previously, we assume t h a t b o t h v
2. Model-Based Observers for T i r e / R o a d Contact Friction Prediction
37
a n d w a r e m e a s u r a b l e variables. To set o u r s y s t e m in t h e s a m e f r a m e w o r k t h a t t h e s t r u c t u r e (2.17)-(2.19), we i n t r o d u c e t h e n e w v a r i a b l e
X = J"~ + rF~0"1z, from w h i c h we get: =
- - -0-O x
+ (ja0
O'1
_ a ~ ) ~ + u~
(2.49)
0"1
(2.50) y
=
(2.51)
"~(:z - rFn0"1z) = w .
Defining x, u a n d y r e s p e c t i v e l y as
Z
~
y=w~
U~
we c a n r e w r i t e t h e a b o v e s y s t e m as =
0~1
0
x -4-
1
0~(y, u, x) +
y A-
0
w h e r e ~ ( y , u, x), is defined as Z.
W i t h this r e p r e s e n t a t i o n we shall now verify c o n d i t i o n u n d e r w h i c h t h e ass u m p t i o n s A1 - A3 hold. T h e last c o n d i t i o n A 4 d e p e n d s on t h e o p e r a t i o n a l c o n d i t i o n s , in p a r t i c u l a r on t h e a p p l i e d t o r q u e u~. Condition i.e
A1 ( l i n e a r o b s e r v a b i l i t y ) . o1
A =
0
0
W i t h A, a n d C d e f i n e d as above,
C~
- ~ ,~l J
we have t h a t c o n d i t i o n A1 h o l d s for a n y values of t h e s y s t e m p a r a m e t e r s .
rank[C, ATC] =rank
[1
_ ~, , o , J
r F ,0 ,oo
]
=
2
J
T h i s r a n k c o n d i t i o n c l e a r l y shows t h a t t h e e x i s t e n c e a n o n - z e r o n o r m a l force F,~ is n e c e s s a r y to b u i l d t h e friction observer. Condition aolry--vl
- ~ z ,
A2 ( g l o b a l L i p s c h i t z
condition).
W i t h ~ ( y , u, x) =
we have t h a t a01ry - vl
Iv(y,u, xl)-v(y,u, xl)l < 9 - ~ - J) izl-z~l <_ p(y,u)lzl-z~l _< p(y,u)ll~l-x~ll
38
2. Model-Based Observers for Tire/Road Contact Friction Prediction
where p(y, u) =
Cro#c[ry-
v].
C o n d i t i o n A3 ( P a s s i v i t y ) . Finding a vector K , so t h a t the m a p r ~ of the system description (2.20)-(2.21), is strictly passive, is equivalent to searching for a vector K = [kl, k2] T, such t h a t the I / O - m a p G(s), defined as
G(s) = C T [Is - A + K C T] -1 B ,
(2.52)
is strictly positive real (SPR), i.e. R e { G ( j w ) } > 0, Vw 6 [0, co] C o m p u t a t i o n of G(s) with the corresponding values for A, B, C gives the map s + fl G(8) =
(2.53)
82 -{- O~18 + C~2
with fl
--
oo kl + (rl J
Oz1
~-
fl - k2 r F n a l J
a2
=
-ilk2
rFn(rl
-~
+klk2
rFnCrl
j2
A sufficient condition to this function be S P R is that k2 < O. T h e n a simple choice for K is thus ]~1
=
0
k2
=
-k,
for some k > 0. From the Kalman-Yakubovich-Popov lemma, we thus ensure with this choice of K that there exist P satisfying the L y a p u n o v equation with P B = C. C o n d i t i o n A4 ( P e r s i s t e n c e o f e x c i t a t i o n ) . To ensure p a r a m e t e r convergence we need to guarantee t h a t lim
t ----*oo
~(y(t), u(t), x(t))
=
lim
olry(t)
-
v(t)l z(t) r o
t --* oo
This implies that the relative velocity should not tend to zero in order for the estimated p a r a m e t e r to converge. This in turn implies t h a t the internal friction state z(t) will not asymptotically converge to zero. Finally, we have,
2. Model-Based Observers for Tire/Road Contact Friction Prediction
39
2.2 Consider the one-wheel model with lumped dynamic .friction (2.~9)- (2.51), then the .following observer:
Theorem
=
_0-0~ + (j0-0 _ 0-~)w + u~ O"1
=
0
=
~) =
(2.54)
0-1
0-01r
~(~-
v) -
0-01r g(vr) -
vl
rFn0-1~),
_
-
(2.55)
(2.56)
(2.57)
with positive nonzero k, and % ensures that all the estimated states are bounded, and that: lim ~ ----X
(2.58)
lim 5 = z.
(2.59)
t ---* ~:) t ----*oO
If in addition, the relative contact velocity does not vanishes, then we also have that lira 0 = 0 . t----*OO
5.1
Simulation Results
Simulations have been performed with the one-wheel system and the l u m p e d LuGre model. The friction parameters used in the simulations are the ones given in Table 1, with the following additional values for the wheel: r -25[cm], m = 5[Kg], J = 0 . 7 5 . m * r 2 = 0.2344[Kgm2], F,~ = 14[Kgm2/s2]. Fig. 5, shows simulation results. Fig. 5-(a) shows the time-profile of the contact friction force resulting form the application of the time torque profile u~(t) shown in Fig. 5-(e). T h e simulation has first an acceleration phase, and then a breaking phase. From Fig. 5-(a), we can see t h a t a b o u t 2 seconds are needed for the friction torque to reach its m a x i m u m value. The observation error of the X, and z is shown in Fig. 5-(d). According to the theorem these two variables should converge to their true values regardless the profile evolution of the system states. This is verified by this curve showing the exponential convergence of the II(~(t), 5)11 to zero. Since the ultimate goal of this work is to be able to on-line estimate this variation, the simulation was done under variations of the p a r a m e t e r 0, representing the road variation conditions (see Fig. 4). Fig. 5-(b) shows in bold lines the value of 0, which evolves within fourth different conditions: the first quarter of the simulation corresponds to dry asphalt conditions. The second quarter corresponds to a sudden change from dry to wet. During the third quarter, there is a smooth variation from wet to snow. T h e last
40
2. Model-Based Observers for Tire/Road Contact Friction Prediction Contact friction force
Estimation of theta 4.~
0.=
0.,
3.~
o~ 0.,
WET 2.~
o ~'-o.
12
"~ -OA
l
-o,E
O~
-o.e -1
DRY
0 ,
2
4
6
8 tO 12 Time [secondsJ
14
16
18
20
-O.5
2
4
6
Applied torque
'
1'0
1'2 2
Time [seconds]
8
10
12
Time [seconds I
14
16
18
20
1'8
2o
Observer error norm
1'4 ' '
1'6 '
1'8 '
0
~
~
;
1'0
,'2
Time [seconds]
~'4
l's
FIGURE 5. a) Contact torque friction F(t) (up left), b)Estimated parameter 0(t), and evolution of 0 (up right), c) Applied wheel torque uT(t) (low left), d)Observe error norm of ()~(t), ~) (low right). quarter is keep constant at the snow conditions. In dotted lines we can see the evolution of the estimate t~(t). As we can observe, a good parameter tracking is obtained, as long as the relative contact velocity is different from zero. During the small time-period when this velocity is small or zero, the adaptation law yields a constant 0(t).
6
Conclusions
We have presented a m e t h o d to estimate on-line the changes in road condition. To achieve this goal we have introduced dynamical friction models that, one hand provide a more accurate description of the contact friction, and one the other hand, allow us to characterize road condition variations via a single parameter. It has been shown that the distributed parameter version of these m o d e l also capture stationary shape profiles between normalized friction and slip rate that are similar to the ones obtained from experimental data (i.e. magic formula).
2. Model-Based Observers for Tire/Road Contact Friction Prediction
41
We have introduced a model-base observer that ensure asymptotic tracking of road condition, under mild conditions implying a non-vanishing evolution of the slip rate. This condition are quite natural in this context (they imply that the vehicle should operate away to the ideal pure rolling condition). Mathematically, this condition correspond to the persistently excitation condition, which is well known in the adaptive control literature. In the context of nonlinear observers, this condition appear as being the characterization of "good " inputs, which are required to recover state observability. The observer presented here has been derived in a general framework allowing to extend our study to the case where the vehicle velocity is not measurable. In particular, assumption A2 - (ii) will allows for this extension, if it can be shown that the assumption A3, also holds. This study and the introduction of other factors like: wheel vertical deformation, and suspension dynamics, are currently under study.
Acknowledgements The LuGre version of the dynamic friction model presented here, was derived during the first author visit at the Department of Aeronautics at the Georgia Institute of Technology ( C N R S / N S F collaboration project). A more complete report on this topic is in preparation. The first author would like also to thanks M. Sorin and P.A. Bliman for the interesting discussion on distributed friction models. 7
REFERENCES [1] E. Bakker, L. Nyborg and H. Pacejka. Tyre Modelling for Use in Vehicle Dynamic Studies. Society of Automotive Engineers Paper 4P 870421, 1987. [2] P. A. Bliman, T. Bonald and M. Sorine. Hysteresis Operators and tire Friction Models: Application to vehicle dynamic Simulator. Prof. of ICIAM. 95, Hamburg, Germany, 3-7 July, 1995. [3] M. Burckhardt. ABS und ASR, Sicherheitsrelevantes, RadschlupfRegel System, Lecture Scripture. University of Braunschweig, Germany, 1987. [4] M. Burckhardt. Fahrnverktechnik: Radschlupfregelsysteme. Verlag, Germany, 1993.
Vogel-
[5] C. Canudas de Wit, H. Olsson, K. J..~strOm and P. Lischinsky. A New Model for Control of Systems with Friction, IEEE TAC, Vol. 40, No. 3, pp.419-425, March 1995.
42
2. Model-Based Observers for Tire/Road Contact Friction Prediction
[6] C. Canudas de Wit and P. Lischinsky. Adaptive friction compensation with partially known dynamic friction model, International Journal o.f Adaptive Control and Signal Processing, Vol. 11, pp.65-85, 1997. [7] P. R. Dahl. Solid Frictioin Damping of Mechanical Vibrations. AIAA Journal, 14, No. 12, pp.1675-1682, 1997. [8] F. Gustafsson. Slip-based Tire-road Friction Estimation. Automatica, 33(6):1087-1099, 1997. [9] J. Harned, L. Johnston and G. Scharpf. Measurement of Tire Brake Force Characteristics as Related to Wheel Slip (Antilock) Control System Design. SAE Transactions, 78(690214):909-25, 1969. [10] U. Kiencke. Realtime Estimation of Adhesion Characteristic Between Tyres and Road. In Proceedings o.f the IFA C World Congress, volume 1, 1993. [11] U. Kiencke and A. Daiss. Estimation of Tyre Friction for Enhaced ABS-Systems. In Proceedings of the AVEG'9~, 1994. [12] H. Lee and M. Tomizuka. Adaptive Traction Control. PATH Technical Report UCB-ITS-PRR-95-32, Institute of Transportation Studies, University of California at Berkeley, 1995. [13] Y. Liu and J. Sun. Target Slip Tracking Using Gain-Scheduling for Antilock Braking Systems. In The American Control Conference, pages 1178-82, Seattle, Washington, 1995. [14] H. B. Pacejka and R. S. Sharp. Shear Force Developments by Psneumatic tires in Steady-state conditions: A review of Modeling Aspects.. Vehicle Systems Dynamics, Vol. 20, pp.121-176, 1991. [15] W. R. Pasterkamp and H. B. Pacejka. The Tire as a Sensor to Estimate Friction. Vehicle Systems Dynamics, Vol. 29,(1997) pp.409-422, 1997. [16] L. R. Ray. Nonlinear Tire Force Estimation and Road Friction Identification: Simulation and Experiments. Automatica, 33(10):1819-1833, 1997. [17] H. T. Szostak,R. W. Allen and T. J. Rosenthal. Analytical Modeling of Driver Response in Crash Avoidance Manuevering. Volume II: An Interactive Tire Model for Driver/Vehicle Simulation. Report no. DOT HS 807-271, U.S. Department of Transportation, 1988. [18] K. Yi and T. Jeong. Observer Based Estimation of Tire-road Friction for Collision Warning Algorithm Adaptation. JSME International Journal, 41(1):116-124, 1998.
Observer Design for Nonlinear Oscillatory Systems Dag Kristiansen and Olav Egeland D e p a r t m e n t of Engineering Cybernetics Norwegian University of Science and Technology Trondheim, Norway
1
Introduction
Numerous vibration phenomena which are theoretically interesting as well as practically i m p o r t a n t can only be understood on the basis of nonlinear vibrations. For instance, the wide field of self-excited, p a r a m e t r i c and auto-parametric vibration demands nonlinear t r e a t m e n t from the very beginning. The sources of the nonlinearities m a y be either geometric, inertiM, material, damping or a combination of these things. Nonlinearities bring a whole range of p h e n o m e n a that are not found in linear systems. In single-degree-of-freedom systems these p h e n o m e n a include multiple solutions, jumps, limit cycles, natural frequency shift, subharmonic and superharmonic resonances, period-multiplying bifurcations, and chaotic motions [12]. Large excitation levels are usually needed to produce periodmultiplying bifurcations and chaotic motions in single-degree-of-freedom systems. In addition to the above mentioned phenomena, the response of nonlinear multi-degree-of-freedom systems can exhibit combinations resonances and modal interactions. T h e latter m a y provide a coupling or an energy exchange between the system's modes and arises if there exists a special relationship between two or more natural frequencies of the linear modes and an excitation frequency. This means that the long-time responses of the system can contain significant contributions in m a n y modes of vibration. The presence of significant responses in more t h a n one m o d e increases the number of modal equations t h a t must be analyzed, and this generally serves to complicate the dynamics of the system. More importantly, modal interactions can lead to dangerously large responses in modes t h a t are predicted by linear analysis to have insignificant response amplitudes. T h e extent of the interaction and its conditions depend on the linear natural eigenfrequencies wi and the nonlinearities of the system. More precisely, autoparametric resonances in systems with n linear natural frequencies (wl,. 99 , w,~) and n corresponding modes (the eigenfrequencies are assumed to real and
44
3. Observer Design for Nonlinear Oscillatory Systems
nonzero) occur whenever two or more eigenfrequencies are c o m m e n s u r a b l e or n e a r l y c o m m e n s u r a b l e (see e.g. [12, 1]). If a harmonic external excitation of frequency f~ acts on a multi-degree-of-freedom system, then in addition to all primary and secondary resonances ( r f ~ ~ s w i , where r and s being integers) of a single-degree-of-freedom system, there might exist other resonant combinations of the frequencies in the form r f ~ ~ S l W l + 9 9 9 + S n W n , n where r and si are integers such that r + ~ i = 1 Isi[ = N, where N is the order of the nonlinearity plus one and n is the number of degrees of freedom. This means for multidegree-of-freedom systems with cubic nonlinearities, to the first approximation, combination resonances may occur
if a
I•
•
a
1•
or a
I•
If
quadratic nonlinearities are added, additional combination resonances m a y occur if ~t ~ [+win + wk[. Thus, a high-frequency excitation may produce large amplitude responses in low-frequency modes that are involved in the combination resonance and vice versa. Interestingly, the concept of modal interactions can also be utilized in control design, see e.g. [14, 13, 3]. In this chapter we will focus on designing full-state nonlinear observers for systems where i n t e r n a l r e s o n a n c e is present. We will assume that we do not have any measurements of the velocities and also that we cannot measure each position separately. Direct applications include e.g. cylinder gyroscopes [4]. As an analysis-tool, we shall use the concept of contraction theory [10]. A short review of this concept is given in Section 2.
2
Contraction Theory
In connection with the observer design, c o n t r a c t i o n t h e o r y will play an important role in the analysis. Here we will give a short review of the theory known as contraction theory which was proposed by [10]. The results are based on ideas from fluid mechanics and tools from differential geometry. The basic idea is to view the system differential equations as an n-dimensional "fluid-flow" described by Euler coordinates. By calculating the squared distance between two trajectories in the "flow-field" one ends up with a concept called contraction region. The interested reader is referred to [10] and the references therein for more on this subject. It is also worth mentioning that a thoroughly mathematical treatment of similar ideas can be found in [2]. Given the nonlinear, non-autonomous system • = f (x, t)
(3.1)
where x C ~n and f :~n • I1~+ ~ R n is assumed to be sufficiently smooth. This equation can be written differentially as 5/r
0 f (x, t) 6x 7xx
(3.2)
3. Observer Design for Nonlinear Oscillatory Systems
45
where 5x is a virtual displacement. The squared distance between two neighboring trajectories can be defined as 5xT6x, which means that the rate of change is given by s dt
= 25x 5
=
(3.3)
Let Area• (x, t) denote the largest eigenvalue of 89( o f + 0,, of T~/ , then
II xll < II xolleJ
....
(3.4)
If )~max (X, t) is uniformly strictly negative, (3.4) shows that [lSx[[ converges exponentially to zero. This implies by path integration that the length of any finite path converges exponentially to zero. Now consider the differential coordinate transformation
6z = o (x, t) 5x
(3.5)
where @ (x, t) is a square matrix. Then a generalization of the squared length is
5zTSz ----5 x T M (x, t) 5x
(3.6)
where M (x, t) = O T (X, t) O (X, t) represents a symmetric and continuously differentiable metric. If M (x, t) is uniformly positive definite, exponential convergence of 5z to zero implies exponential convergence of 5x to zero. We also have that d S z = FSz, where F = ((~ + O ~- 1 5 -O ~ ] , and we can state the following definition and theorem: D e f i n i t i o n 3.1 ([10]) Given the system equations • = f ( x , t ) , a region of the state space is called a contraction region with respect to a uniformly positive de.finite metric M (x, t) = O T (x, t) O (x, t) /f F is uniformly negative definite in that region. Regions where F is negative semi-definite are called semi-contracting, and regions where F is skew-symmetric are called indifferent. T h e o r e m 3.1 ([10]) Given the system equations • = f (x, t), any trajectory which starts in a ball of constant radius with respect to the metric M (x, t), centered at a given trajectory and contained at all times in a contraction region with respect to M (x, t), remains in that ball and converges exponentially to this trajectory. Furthermore global exponential convergence to the given trajectory is guaranteed of the whole state space is a contraction region with respect to the metric M (x, t).
46
3. Observer Design for Nonlinear Oscillatory Systems
R e m a r k 3.1 ([7]) Note that V = 0 T O-~x can be written in terms of Christoffel symbol of the first kind, i.e., [11]: 1 {OMm
rh k =
OMlh Ox k
\-a-7
OMhk ) Ox t
+ - -
(3.7)
Since Fhlk = Fklh [11], we have that Fmkahb k = Fklhakb h = Fmkbha k
(3.8)
where a k is the k-th component of the vector a and bh is the h-th component of the vector b. This means that.for autonomous 0 (x), we can analyze the i.e., dynamics ~ = O• in place of F = O + Ox J
O0f'~O-1,
d (SzTSz) = 25z TS~ dt
(3.9)
since 62 = 5 (O•
=
5x~ + O
0• 25zT 5i = 25xT OT (\ 0OX
5x = -~-x•
+ O~xx~X
f ' ~/I 6X =--~ d (SzrSz) q- o O'~X
(3.10)
(3.11)
A last result which will be used in the observer design is the following [10]: Consider a smooth virtual dynamics of the form
XQ 5Zl ) dt
5z2
(Fll =
F21
0 F22
)(SZl)
(3.12)
5z2
and assume that F21 is bounded. Exponential convergence of 5zl can be concluded for uniformly negative definite F n . Also, if F22 is uniformly negative definite, this implies exponential convergence of the whole system to a single trajectory since F215zl represents an exponentially decaying disturbance in the second equation. We can think of the dynamics of 5zl as the plant, and the dynamics of 5z2 as the observer. By designing the observer such that the system trajectory is contained in the "flow-field" of the observer, this means that the observer is exponential convergent if F u and F22 are uniformly negative definite, and F2a is bounded.
3 System Equations We will assume that our system is given as n nonlinearly coupled oscillators with constant mass M =diag{mi} > 0 and linear viscous damping C =diag{# d > 0, i.e. [12],
3. Observer Design for Nonlinear Oscillatory Systems
M~
=
Cl --
v
y
~qi
=
OV
-Cv
-~q + F
47
(3.13) (3.14) (3.15)
i
where F is an external forcing and V = V (q) will decide w h a t t y p e s of nonlinearities which are present in the s y s t e m (e.g. quadratic, cubic or both). More precisely, V can be w r i t t e n as V = V1 + V2 where V1 is due to the linear spring constants a n d is a s s u m e d to be positive, and V2 reflects the nonlinear coupling terms. Note t h a t our m e a s u r e m e n t y, given by (3.15), m a k e s this p r o b l e m in some sense different from e.g. robotics where one usually can m e a s u r e each position separately. We will a s s u m e t h a t t h e s y s t e m p a r a m e t e r s (mass, d a m p i n g etc.) are known.
3.1
Analysis
C o n t r a c t i o n analysis of mechanical s y s t e m s in H a m i l t o n i a n f o r m were inv e s t i g a t e d in [9, 8, 6], while s y s t e m s in L a g r a n g i a n form were considered in [7]. Here we will give an alternative analysis, which is a direct consequence of energy considerations. We will a s s u m e t h a t 1. V can be w r i t t e n as V = q T p (q) q where P is positive definite. 2. ~
can be w r i t t e n as ~ q = K ( q ) q where K is a square m a t r i x .
Now (3.13) can be w r i t t e n differentially as ( F = 0) Mdv = -Cdq-
OVdt
(3.16)
0q
Using -~q = K (q) q, (3.16) can be w r i t t e n as Mdv = -Cdq
- K (q)
qdt
(3.17)
Since P > 0, there exists a m a t r i x W (q) such t h a t W T (q) W (q) = P (q)
(3.18)
Define
0
x/2W (q)
q
a n d introduce q~ = q
(3.20)
48
3. Observer Design for Nonlinear Oscillatory Systems then dz
~Z
0
~
o de ) v/2W(q) ) ( dq
(3.21)
)
(3.22)
0
( v/-~ 0
Also from (3.19) (3.23) Then, using (3.9) dt =
( 5qTv/-M
V~5r
T (q))
v~o(~(qq/q)Sq
_-- --~qTC5 q - 5qT K (q) 5r +25r
T (q) 0 (W (q) q) 5q 0q
(3.24)
Note that OV m
0q
z
0 (qTWT (q) W (q) q) 0 ( w (q) q ) T w (q) q = K (q) q =2 cOq Oq (3.25)
i.e., KT (q) = 2~vvT (q) 0 (W0q(q) q)
(3.26)
such that d--~
$zTSz
= --SqTCSq
(3.27)
which means that the "flow-field" is semi-contracting. Bounded 5q and ~r and by assuming that K (q) is bounded, leads to bounded 5v (using MSv = - C S q - K (q) 50). Assuming bounded ~ means that 5~ is bounded since MS~ = - C 5 v - ~ 5q. This means that 5q and 5v c o n verges asymptotically to zero.
3. Observer Design for Nonlinear Oscillatory Systems
49
R e m a r k 3.2 As in [7], the above analysis can be regarded as a generaliza-
tion of the energy conservation since d ( 2 6 q T M 6 q + 5r
(q) W (q)
--SqTCSq
(3.28)
d ( l dqTMdq + dCTwT (q) w (q)
-dqTCdq
(3.29)
Multiplying with 7-iv. 1 . d ( 1 v T M v + q T w T (q) W (q) q )
h7 4
=
--vTCv
(3.30)
--vTCv
(3.31)
Observer Design
Since we do not have any measurements of the velocities, we will take advantage of the following result due to [5]: Given the system = f (x, t)
(3.32)
with measurement y = h (x)
(3.33)
and the following general observer X
:?
=
g (~, y, t)
(3.34)
h (~)
(3.35)
where g, h are assumed to be smooth functions. We can state the following result: P r o p o s i t i o n 3.1 ([5]) Given a smooth coordinate transformation of the observer dynamics ~ = N (R, ~), where .for each ~, the mapping ~ ~-~ ~ is in-
vertible, and given the n-dimensional system equations and m-dimensional measurements ~r =
f(x,t)
(3.36)
y
h(x)
(3.37)
=
50
3. Observer Design for Nonlinear Oscillatory Systems
then the observer equations x
=
g(~,y,t)
:9
=
h (R)
(3.38) (3.39)
transform to (3.40)
:~ = g (:~, y, t) + 7a y- (~' - 5')
i.f _
0R (:~ (.~, y), y) g (:~ (R, y) y, t) OR O~ (:~ (R, y ) , y ) O-~Oh(:~ (~, y)) g (Yc (~, y ) , y, t) +~yy
(3.41)
is integrated instead of (3. 38) and (3. 39). T h e proof can be found in [5]. We now propose the following observer for (3.13)-(3.15): Mv
=
s
q
-C9-
~
0~
+F
(3.42)
=
Introduce M ~ = M ~ ' -
"
(3.43)
~), t h e n ( F = O)
M ~ = - C r 1 6 2 OV
(~1) H~
(3.44)
0~
where H =
V1 "
'" "
"Yl ) " and
7n
" " "
~n
M v = M~5 +
" %
= -C9
0 V (s
00
H (~, - v)
(3.45)
We can now view (3.13), (3.14), (3.43) and (3.45) as a hierarchical combination as in (3.12). Since H is bounded, this means t h a t under the assumption t h a t K ((t) and ~O ~ V are bounded, 5(7t and 5~r converge a s y m p t o t i c a l l y to zero.
3. Observer Design for Nonlinear Oscillatory Systems
51
R e m a r k 3.3 Note that due to our measurement (3.15), i.f the observer equation (3.43) is changed to q = 9 - H ( q - q), this implies that (using the relation K T (0) = 2 w T (q) 0(W~l)0))
dt
= ( 5~Tv/-M _ & ~ T H T v / ~
4555 w
(
)
v e,
= - 5 ~ T (C + H) 6~ + ~ ) T H T (C + H) 5~ _~T
( H T K (~1) + 2WT ((?t) 0 (W0~l((?t)q) H'~] 5r
where~=~,r
~z =
o)(o)
0
v ~ W (0) ~q;
,/~W(el)
, ~
=
0
o
(3.46)
'
.
~.0
Generally, there seems to be no conclusion about the contraction behaviour o.f this observer design. However .from (3.46) we see that when the gains in the observer (7i) are in some sense small, this observer "behaves" in the same way as (3.42) and (3.43).
5
Simulations
5.1
E x a m p l e 1: 2 - D O F O s c i l l a t o r y S y s t e m w i t h C u b i c Nonlinearities
Nonlinear oscillations in multi-degree-of-freedom systems with cubic nonlinearities can be found in many physical systems such as the vibration of strings, beams, membranes, and plates for which stretching is significant, the motion of spherical, centripetal, and double pendulums, and the motion of masses connected with nonlinear springs [12]. For a 2-dof mechanical system with cubic nonlinearities, V is given by kl 2 + --~-q2 k 2 2 + alq 4 + ct2q31q2 + a3qlq2 V = --~-ql 2 2 + o~4qlq23 + 0~5q4
(3.47)
where ki > 0 are the linear spring constants and ai are constants. Note that V can be written as V
=
( ql
q2 )
~" + ~
=
(ql
q2 ) P ( q ) (
2+g
89( 2ql + ql )q2
leg
2
3q2
I, 2ql ~- 4(/2] _~a + a5q~ + 89
ql q2 (3.48)
52
3. Observer Design for Nonlinear Oscillatory Systems
Also 0V
_
// kl -t- 4alq12 + 2a3q 2
=
~ a2ql2 + 3a4q2 2 K(q) q
0q
30~2q12 ~-o~4q 2
)(
k2 + 2c~3q 2 + 4a5q 2
ql )
q2 (3.49)
T h e following p a r a m e t e r s were used in the simulations: rnl = rn2 = 1, O~1 = 1, OZ2 ~---0.9, ~3 : 0.8, OL4 = 0.6, O~5 ~---0.5, #1 = #2 : 0.001, k I ---~ 1,
is o i ivo o n 0. nceo >0 ~I~3
d
~ 4
> -
0
(3.50)
c~ 4
> -
0
(3.51)
0~20~4 OL10~5 Jr- - ~ -}- T
> --
0
(3.52)
2 0~3015
2
T h e initial conditions of the plant were: qz (0) = q2 (0) = vl (0) = v2 (0) = 0, while the initial conditions of the observer were ql (0) = 1, q2 (0) = 1, Vl (0) :
0.5, Y2 (0) :
--0.4. T h e r e s u l t s u s i n g ")/1 = "/2 :
-1
is s h o w n in
Figures 1-5
0
-9.5
10
20
30 t~me [s]
40
1
50
60
F I G U R E 1. ql (t) [solid line] and 01 (t) [dotted line].
5.2 Example 2: Cylinder Gyroscope T h e nonlinear d y n a m i c s of a cylinder gyroscope was m o d e l l e d a n d a n a l y z e d by [4]. T h e m o d e l included geometric nonlinearities, a n d it was shown t h a t V is given by
3. Observer Design for Nonlinear Oscillatory Systems
'I
53
t
10
"-0
2o
3o time [s]
4o
so
6o
FIGURE 2. vl (t) [solid line] and ~1 (t) [dotted line].
1
o.
I I I'~ Ib j~ C
-0.2
h
-0.4 ~3.6
10
FIGURE
20
ti
4o
Is]
~o
~o
3. q2 (t) [solid line] and 02 (t) [dotted line].
1
2
1
2
1
2
/ ~]gl -}- alql + ~a4q2 + ~a5q3 [ V = ( ql q2 q3 ) I lasql + lalOq3 1 1 2 1 2 \ 7a7ql + gagq2 ] la - - - la ]a _2 - - 1 a ~2 8r
T ~
I0r
~ 7 ( / 1 - I " - ~ 9~/2
)(ql)
1 1 2 1 2 ~k2 + a2q22 + ~a4ql + ~a6q3 0 q2 1 2 1 2 0 lk3 + a3q 2 + ~asql + ~a6q2 q3 where ki are positive constants and aj are constants depending on the linear axial mode shapes of the gyroscope. Straightforward calculations show
54
3. Observer Design for Nonlinear Oscillatory Systems
U 0 ili" =0.51
i t,me [sl
F I G U R E 4. v2 (t) [solid line] and ~2 (t) [dotted line].
-0.
-1
10
20
30 t,me [s]
40
SO
60
F I G U R E 5. y (t) [solid line] and ~)(t) [dotted line]. that
OV Oq
kl ~- 4alq 2 + 2a4q~ + 2a5q~ + 2asq2 asql § a l o q a a7q 2 + a9q 2 + aloq2 0 k2 + 4a2q2 + 2a4q~ + 2a6q2 + 2a9qlq3 0 3arq2+agq2+az~ k3 + 4aaq2 + 2asq~ + 2asq 2
) ( ql qa
K (q) q T h e following p a r a m e t e r s w e r e u s e d in t h e s i m u l a t i o n s : m l = m2 = m a --- 1, a l = 1, a2 = 1, a3 = 1, a4 = 0.3, a5 = 0.4, as = 0.3, a7 = 0.5, as = 0.5,
3. Observer Design for Nonlinear Oscillatory S y s t e m s a9
=
0.7, alo = 0.3, #1 = #2 = #3 = 0.001,
a n d F -be q; of ~2 in
0 0
k1 =
1, k2 = 9,
k3
55 -----
25,
. Note that with these data, V can be shown to
p o s i t i v e for q r 0. T h e initial c o n d i t i o n s of t h e plant were: q l ( 0 ) ---(0) = qa (0) = vl (0) = v2 (0) = v3 (0) ---- 0, w h i l e t h e initial c o n d i t i o n s t h e o b s e r v e r w e r e 01 (0) = 1, 02 (0) = 1, 03 (0) = - 0 . 2 , ~1 (0) = - 0 . 5 , (0) = - 0 . 4 , ~3 (0) = - 0 . 1 . T h e results u s i n g 71 = 3'2 = 73 = - 1 is s h o w n Figures 6-12.
1 O.E OE 0.4 0.,~ 0
41,2 -0.4 ~).6 ~.8 -1
10
20
30 time [s]
40
50
60
F I G U R E 6. ql (t) [solid line] and ql (t) [dotted line].
-0
-1
-1 .S
10
20
30 time Is]
40
50
60
F I G U R E 7. vl (t) [solid line] and 51 (t) [dotted line].
56
3. Observer Design for Nonlinear Oscillatory Systems
o,
o
i',,,
0
10
2o
~
4o
5O
60
Is]
ti
FIGURE S. q2 (t) [solid line] and 02 (t) [dotted line].
1A 1 0..= C -9.5 -1
iI
-1.5 -2 -25 -3
10
20
3O
4O
5O
6O
t~rne ($]
FIGURE 9. v2 (t) [solid line] and ~2 (t) [dotted line].
6
Conclusions
We have proposed an observer for nonlinear oscillatory s y s t e m s in Lan grangian form with a single m e a s u r e m e n t given by y = 7:i=1 qi. T h e analysis was mainly based on contraction theory which can be found in the papers by Lohmiller and Slotine [5]- [10]. It was s h o w n that the p r o p o s e d observer was asymptotically convergent. T h e observer was simulated first on a 2-dof s y s t e m w i t h cubic nonlinearities, and then on a m o d e l of a cylinder gyroscope. T h e simulations s h o w e d agreement with the theoretical analysis.
3. Observer Design for Nonlinear Oscillatory Systems
57
J,
0
10
20
3O time [s]
4O
5O
6O
FIGURE 10. q3 (t) [solid line] and c)3 (t) [dotted line]. 1.S
~0
-1
-1.5
10
20
30 time [s]
40
50
60
FIGURE 11. va (t) [solid line] and ~a (t) [dotted line].
7
REFERENCES [1] R. Evan-Iwanowski Resonance Oscillations in Mechanical Systems, Elsevier, New York, 1976. [2] P. Hartman. Ordinary Differential Equations, Birkhauser Verlag, Boston, 1982. [3] A. Khajepour, F. Golnaxaghi and K. A. Morris. "Modal Coupling Controller Design Using a Normal Form Method, Part 1 & 2," Journal o.f Sound and Vibration, vol. 205, pp. 657-688, 1997. [4] D. Kristiansen and O. Egeland. "Nonlinear Oscillations in Coriolis Based Gyroscopes," Accepted for publication in Nonlinear Dynamics. [5] W. Lohmiller and J. J.-E. Slotine "On Metric Observers for Nonlinear Systems," Proceedings IEEE International Conference on Control
58
3. Observer Design for Nonlinear Oscillatory Systems
O.
-o.:
-1
10
20
tirn3eO[~]
40
50
60
FIGURE 12. y (t) [solid line] and y (t) [dotted line]. Applications, Dearborn, MI, pp. 320-326, 1996.
[6] W. Lohmiller and J.-J.E. Slotine. "On Metric Controllers and Observers for Nonlinear Systems," Proceedings 35th IEEE Conference on Decision and Control, Kobe, Japan, pp. 1477-1482, 1996. [7] W. Lohmiller and J.-J.E. Slotine. "Applications of Contraction Analysis," Proceedings 36th IEEE Cor~ference on Decision and Control, San Diego, CA, pp. 1044-1050, 1997. [8] W. Lohmiller and J.-J.E. Slotine. "Applications of Contraction Analysis," Proceedings IEEE International Conference on Control Applications, Hartford, CT, pp. 699-704, 1997. [9] W. Lohmiller and J.-J.E. Slotine. "Simple Observers for Hamiltonian Systems," American Control Conference, Albuquerque, NM, 1997. [10] W. Lohmiller and J.-J.E. Slotine. "On Contraction Analysis for Nonlinear Systems," Automatica, vol. 34, pp. 683-696, 1998. [11] D. Lovelock and H. Run& Tensors, Differential Forms, and Variational Principles, Dover Publications, New York, 1989. [121 A. H. Nayfeh and D. T. Mook. Nonlinear Oscillations, Wiley, New York, 1979. [13] S. S. Oueini, A. H. Nayfeh and J. R. Pratt "A Nonlinear Vibration Absorber for Flexible Structures," Nonlinear Dynamics, vol. 15, pp. 259-282, 1998. [14] K. L. Tuer, M. F. Golnaraghi and D. Wang. "Towards a Generalized Regulation Scheme for Oscillatory Systems via Coupling Effects," IEEE Transactions on Automatic Control, vol. 40, pp. 522-530, 1995.
Transformation to State Atfine S y s t e m and Observer D e s i g n A. Glumineau and V. L6pez-M. Institut de Recherche en Cybern6tique de NANTES, IRCyN, UMR 6597 1 rue de la Noe, B.P. 92101, 44321 Nantes cedex 3. F R A N C E
1
Introduction
The observer design problem is completely solved for linear time invariant systems, whereas in the nonlinear case, there is no general theory. In order to tackle this problem, some methods have been employed: Lyapunov-like technique, linearizations, numerical differentiation, and geometric and algebraic methods (cf. [2, 10, 16, 18, 27, 6, 9, 21, 28, 31]). In order to combine the advantages and improve the shortcomings of two different approaches, structural and numerical differentiation have been sucessfully dealed with input time derivatives [25] and input and/or output time derivatives [21]. Table 1 summarizes the existing literature and shows some observer applications. Table 1 L i n e a r i z a t i o n by i n p u t - o u t p u t injection
System =
A( + ~o(y, u)
Approach Geometric: [16, 22, 31] Algebraic: [9, 10, 19]
Applications Motor: Shunt DC, Series De: [5, 24], Flexible joint: [23].
Geometric: = A~ + ~o(y,u,i~,... , u (~))
= A~ + ~(y,--. ,y(S),u,--- ,u (~))
[2s, 15] Algebraic: [26, 25]
[21]
Biological systems: [28], Numerical differentiation: [6].
In order to extend the class of linearizable systems, some results about the transformation of nonlinear systems into state affine systems have been obtained. High-gain observers are useful for state alZfine systems as shown in [3, 12, 30] and the references therein. These observers are based on optimal Kalman's observer and used in physical processes, for instance chemical reactors, distilling columns and mechanical systems [8, 30, I].
60
4. Transformation to State Affine System and Observer Design
T h e following table summarizes the main contributions on the equivalence between a nonlinear system and a bilinear or state afflne system, as well as some observer design applications. T a b l e 1. Authors Construct.
System
= A(u)~ + ~(u, y) = A(u, y)~ + ~(u, y)
[11]
~o
[141 [1]
Yes No
[20]
Yes
Applications
Synch. Generator [17] Inverse Pendulum [1] Chemical reactor [8] Distilling columns [30]
In the following our new results [20] are introduced. One of the contributions of [20] is the definition of a first algorithm to compute the transformed system functions, from the I / O differential equation. The chapter is organized as follows. Section 2 introduces some definitions and notation. Section 3 we state the problem of state affine transformation of nonlinear systems, and gives the aim of our approach we introduce by an example. We define an algorithm t h a t permits to give a NSC in order to solve this problem. Section 4 obtains the synthesis observer for the s t a t e affine system founded in Section 3. This is achieved with a well defined coordinates transformation and a Kalman-like observer. Some conclusions are given in Section 5.
2
Definitions and N o t a t i o n
Consider the nonlinear system:
{~ --
= y
f(x,u)
=
(4.1)
with x ~ M where M is an open and dense subset of ~n, u C ~ m and y E ~. T h e entries of f(., .) and hi. ) are meromorphic functions of their arguments. Let us define the state affine system, considered here
{ A(y(t), (t))Zcz +
(4.2)
where z(t) 9 ..~n y(t) 9 '.~,u(t) 9 ~m. W h e n one measures y(t), one can define ~ :--- (y, u) as a new input a n d as recalled in [13] if it is regularly persistent [3], thus the system
~o { "~--- A(v~). ~. + qo(~)- S - I c T ( c ~ -- y) = - O S - AT(~)S - SA(~)) + c T c
(4.3)
4. Transformation to State Affine System and Observer Design
61
is a Kalman-like observer for ~--~a" Where z(t) C ~ n S(t) E ~ + is a symmetric positive definite matrix and 0 > 0. The norm of the estimation error converges locally exponentially to the origin. From now on, ~ is supposed to be generically observable [25] and will be called observable.
3
Problem
Statement
The goal is to find a state coordinates transformation z ----O(x), such that system ~ (4.1) is locally equivalent to system ~'~.~ (4.2, in order to design the observer ff-~-o (4.3)9 The approach consists in checking t h a t the I / O differential equation associated to the system ~ has the same form than the ~ one. The uniqueness of this equation for an observable system is shown in [29].
3.1
The Input-Output Differential Equation for State Affine Systems ~-~
The I//O differential equation for }-~ verifies
P~
:= y(n) = F n ( A ~ , . . . , A,~_~)+ + F n - 1 (A1, " 9" , A n - l , t/91)+ + A 1 F n - 2 ( A 2 , . . . , A n - l , ~2) + " " +A1A2"'" A n - 2 F 1 ( A n - l , ~gn--1)+ A1A2"" A~-IFo(~,~),
(4.4)
where F~_j (0 <_ j < n), is the sum of all monomials in~ .
.
.
.
u
kl)
'
.
.
.
....
(4.5)
such that n l q l q- 9 ". q- npqp + kll 1 q- . 9 q- kmlm = n - j. F n - j is a function involving all monomials of "degree" ( n - j ) . For instance, Fn verifies ]11(A1,"" , An-l)
f I >(A1, -.. , An_2) + Fn:=Y
f(3)(A1,""", An-3) + 531(-) f~n~--22~l(A1,A2) + ~(n-2)l(')
[l~ nl] (n-x) + (~(n--1)1(')
(4.6)
62
4. Transformation to State Affine System and Observer Design
where Y := [y(n-1)... 9] and ~/31(') (2 < /3 <~ n -- 1), involves all the functions not depending on I / O time derivatives of degree /3. Whereas F n - j verifies
Fn_j
..__-
~(n--j) ~j
^(n--j--l), YJJ(n--j--2) " " " ~gj] 9
[u
]lj(Aj," " ,An-,) f.(2) lA .,.. ~(3)r4 J3j k~3~
An-3) + 53j(.)
[log Aj]('~-J) + 5(~_j)j (.) for l _ < j < n - l a n d F 0 : = ~ n . R e m a r k 4.1 The cascade form of the f i l functions in (4.6), is useful to
compute Ai functions. E x a m p l e 4.1 Let the 3-D nonlinear system )-~a,
Zl ----Al(u)z2 + ~ l ( y , u ) z2 = As(y, u)z3 + ~2(Y, u) ~3 = ~3(y, u) y=-z
(4.7)
1
Its I / O differential equation (4.4) verifies pa
:=
+
y(3):=F3(A1,A2)+F2(A1,A2,~I) A1FI(A2, ~2) + AIA2Fo(7~3)
(4.8)
where, as defined in (4.6) and (3.1),
F3 f(n)
:=
Y . f(A); F2 := ~2) _ [~bl~l]. f(A)
:=
[log(n2n2), loi-'A 1 - l o g ' n I - log 2~1A2] T
F1
:~-- ~2 - ~21og"A2
Fo
:=
(4.9)
~3
where Y := [y(2) $]. One of the contributions of [20] is the definition of a first algorithm to compute the A~ and ~ functions from the I-O differential equation.
3.2
State A j f i n e T r a n s f o r m a t i o n A l g o r i t h m
This algorithm proceeds in two steps. First, one derives all the Ai functions from Fn in (4.4). Then, one finds n - 1 first order partial differential equations and gives a NSC for the existence of a solution. Secondly, substituting
4. Transformation to State Affine System and Observer Design
63
them in (4.4), one solves the following equation y(n) -
F~ =
F~-I(A1,"
" " ,A~-1,~1)
+AIFn-2(A2, ' " , A,~-I, ~2) +'" + AI'" An-2FI(An-1, P,-1) +A1 "-" An-lFo(qon).
(4.10)
S.A.T. Algorithm. S t e p 1. Prom I / O differential equation of ~ ( 4 . 1 ) set P0 := Y("). For k = 1 to n - 1, ck := 1 with c~-l := 0 let:
02 P~ ~ 02 P~ wk .-- Oy(k)Oy(n_k ) dy + OU~k)Oy(,~_k)duj j=l
(4.11)
Forl
0 or dwk A d u r
0 then the problem has no solution.
9 IfdwkAdy=0anddwkAdu=0 then let the Ai functions be any solution of 02 Pa m wk = Oy(k)Oy(n_k ) dy + j=IE
02 Pa
Olt~k)Oy(n_k) duj
(4.12)
For k = n - 1 9 If dw,~-i ~ O, then the problem has no solution. The algorithm stops. 9 If dw,~_l = O, then let A1 be any solution of
1~
OA1
-duj, Cdn--1= ~11 j = l -OUj
(4.13)
where P a ( A ~ , . . - , A,~_I, ~ 1 , " " ~ , ) is the formal I / O equation (4.4) computed for the system ~ .
S t e p 2. Substitute all the A~ functions in (4.4). Forr=l tonlet /Dr := Pr-1 - F n - r + l
(4.14)
~i and let define K r := A1 "A2 . . - A r with An := 1, and the differential form aJ r as
:=
1 L[ |
OP~
dy +
'~ ,~OP~ (nj=l ouj
)duj
]
.
(4.15)
64
4. Transformation to State Affine System and Observer Design 9 If d ~
A dy ~ 0, d ~
A du ~ 0 then the p r o b l e m has no solution.
9 If d ~
A dy = 0, d ~ r A du = 0 t h e n ~r is a solution of
Ar I [~dy ( 0_.~Ad
-J- E j ~ I ~u~,duj - A'A~" m
0_~Ad u . h _ JJ ] ~- wr for 1 < r < n - 1 , AI-.-A,~-I~n:=Pn forr=n.
oy Y "j- E j = I auj
4.1 : The nonlinear system ~ is locally equivalent to the system by a state coordinates transformation z = r if and only if:
Theorem
~
dwk A dy -- 0, dwk A du = 0
(4.16)
for 1 < k K n - 2 dLdn_ 1 = 0
(4.17)
a~r A dy = 0, d~r A du = 0
(4.18)
where wk (1 < k < n - 1), a n d - ~ (1 < r < n) are defined by (3.11) and (4.15) respectively. Thus, if ~ is locally equivalent to ~ a , then the state coordinates transformation z = ~ ( x ) is c o m p u t e d from ~ a : Zl
=
y = h(x)
1
(h - ~ l ( y , u ) )
-
z3
=
(4.19)
A1 9A2 h (~) - A1~2 - ~bl - log"A1 (h - ~1) A1 9 A2
zk
=
y(k-1) _ Pk-1 k-1 I-L=I A~
(4.20)
where k-1
k-1
Pk=-~k H A, +-fik_l + z k H Ai i=1 i=1 for l < k K n , An : = 0 a n d Pl : = Wl. For the details of the proof, see [20]
(4.21)
4. Transformation to State Affine System and Observer Design
4
65
Synthesis Observer for State Affine Systems
The observer ~-~o of ~ ,
can be the realized solving
f; = - O S - A T ( z g ) S - S A O ) ) + c T c
(4.22)
The time varying solution is a definite positive matrice, and 0 E ~+. In order to avoid the time varying computation, [4] propose a transformation, = ~ z where n--1
fi := diag(1-I A,),
with Ao := 1.
(4.23)
i=0
Applying it to system ~ a , one gets = A--~+ r ( o , () y = C(
(4.24)
= (1
(.1)
where V(0,() := Oqo(J) + f i f e - l ( , ~ := flA(0)~t-1 i.e.
1 0 --. 0 0 /
O 0 0 =
1
0
.
0
AI~
.
ooo 0
0 ;f~(o)
=
oi 0
0
n--1 Hr=l Ar~n
0 i
~-
1 = Diag (log 1-I A t - 1 ), 1 < i "( n.
where A0 := 1. Assume that NF(~,()II _< KH(II s with II(lls :~ ( T S ( 9 Thus it is Lipchitzian with respect to (, uniformly w.r.t. 0 with Lipschitz constant K. Then consider the system = ~r
+ r(~,
r
-
s:o'c~(c~
- y)
(4.25)
where So~ is the algebraic stationary solution of 0 ---- -OSoo - "ATsoo - Soo"A + c T c
(4.26)
for 0 large enough. Then, for inputs 0 uniformly bounded by some 0 > 0 : (4.25) is an observer for (4.24), i.e., for 0 large enough H~(t) -- ~(t)l I ~ K ( O ) e x p ( - 0 t / 3 ) l l ~
0- r
(4.27)
66
4. Transformation to State Affine System and Observer Design
For the proof and the details cf. [7]. Thus from (4.23) this observer in the z coordinates have the following form = ~-~-1~
~-lh~
_
(4.28)
substituting (4.25) in (4.28) we finally have
"~= A ( d ) } + ~(~) - f}-I s ~ ' c T ( c ~ - y)
(4.29)
The transformation f} and its inverse is always well defined since the observability assumption.
4.1
Physical Example
In order to illustrate the feasibility of S.A.T. Algorithm, Theorem 4.1 and the observer synthesis, we apply now the algorithm to a physical example. The SISO system is a inverse pendulum, as considered in [1]. The dynamics of the system are :Cl
=
X2 CON X
y
---- X l ,
~_ l \(- a l s i n xl z'22 + .~co~:~1 /(1--acos 2 xl)
,_-,u ~] ,n
(4.30)
where xl denotes the angular displacement of the pendulum shaft from vertical position, 1 is its length, m the mass of the plumb bob, g the force of gravity and a the ratio ~ with M the mass of the bearing trolley, u is the force applied to the car. Then, the system I / O differential equation is given by
(
gm sin p "~
y(2) = - a c o s y \ I m s i n y . ~]2 + u /m(1 -- a cos 2 y)
cosy]
(4.31)
A1Fo(~)
(4.32)
y loi-A1; F1 := 9~1 - l o g A l ~ l p2
(4.33)
a
and for ~-~a, one gets P a - - y(2) = F 2 ( A 1 ) + F I ( A 1 , ~1) +
where F2 F0
:= :=
Applying the S.A.T Algorithm gives AI(y) =
1 and (1--a cos2 y)89
P2 au cos y - g m sin y ~ 2 : = ~ 1 1 ---- lrn(1 acos 2 y)2• " -
-
(4.34)
4. T r a n s f o r m a t i o n to State Affine System a n d Observer Design
67
0.~
................ '
0.1
t
~o -0.1
-0.2
1
2
3
4
s
6
t
3.~ 3
^
z2
....
,
,
, :
1 :
,
:
i
:
i
i
:
i
9
i
9
i
i
f, O.E 0 -~X5
1
2
3
4
5
6
I
FIGURE 1. R e a l a n d e s t i m a t e d s t a t e v e r s u s t i m e (s).
The system (4.30) is then locally equivalent to: = y =- C z
0
0
z +
(0)
~2(Y, u)
(4.35)
= Zl
a n d f r o m (4.20) the state c o o r d i n a t e s t r a n s f o r m a t i o n : Zl = h ( x ) = x L ; z2 = x2(1 - a c o s ~ x l ) ~ .
Thus the observer (4.29) has the following form
= A(y)~ + ~ ( u , y ) - f t - ' S j o l c T ( ~ , - z,)
(4.36)
where Soo is t h e a l g e b r a i c s t a t i o n a r y s o l u t i o n of (4.25). S o l v i n g for Soo o n e
gets $11 = 1/0, $21 = - 1 / 0 2 , $22 = 2/03. T h e n SjolC T = (20 02) T a n d f~-'SjolC T = (20, 02/Al(y) )T.
4.2
Simulation Results
We present the simulation results for a trajectory tracking Yr := cl sin(c2t) where cl = .3, c2 = 1, a n d l = 0 . 3 6 m , t t M = 2 . 4 k g , , m = 0 , 2 3 k g , 0 = 50
68
4
Transformation to State Affine System and Observer Design 0.4
i
0.3
i
H
'
....... i ...........
i-
.
i
i ..................
:
/
/ -o.1 V -o.2
~.......... ~ ....... i . . . . . . . . .
-03~
i
.... i
3
4
5
i
'
!
i. . . . . . . . . . . . . .
i ......................
i
i. . . . . . . . . . . . . . . . .
i
:
. . . . .
1
.........
9
9 ! ..........
........
~
.....
2
i ..........
6 !
. . . . .
~01
:
o ~0'0
. . . . . .
~
.
.
.
.
.
.
.
.
. . . . . . . . . .
...... 1
2
:.
.
.
.
.
.....
9
i .......
.
.
i ....... ; . 3
4
5
6
FIGURE 2. Real and e s t i m a t e d state versus t i m e (s). and~ Zl(0) = z 2 ( 0 ) : = 0, ~1(0) = 0.2, ~2(0) = 3. In Fig. 1, we apply a tracking control with the real state and compare them with the estimated one. This error is practically neglectible. In Fig. 2, we measure only the output and apply the tracking control computed with the estimated states. As we have a pendulum set in our laboratory, next step of this work will be to check this observer in real conditions.
5
Conclusions
A NSC for the transformation to the state affine systems of MISO nonlinear systems has been obtained. The main result is stated in terms of the I / O differential equation and some one-forms obtained in a straightforward way and easy to check. With the help of a regular transformation, one avoids the time varying computation of the observer gain matrice an then a Kalmanlike observer ~-'~o (4.3) for the transformed system ~-~ (4.2) can be directly applied.
4. Transformation to State Atone System and Observer Design
6
69
REFERENCES [1] G. Besan~on and G. Bornard. State equivalence based observer synthesis for nonlinear control systems. Proc. IFA C 13th Triennial World Congress, (San Francisco, USA, Vol. E, 287-292, 1996. [2] J. Birk and M. Zeitz. Extended Luenberger observer for nonlinear multivariable systems, Int. Y. Control, 47, 1823-1836, 1988. [3] G. Bornard, N. Couenne and F. Celle. Regularly persistent observers for bilinear systems Proc. in 29 I.C.N.S., New trends in nonlinear system theory, (Vol. 122 Springer Verlag) June 1988. [4] K. Busawon, M. Farza and H. Hammouri. A simple observer for a class of nonlinear systems, Personal Communication. [5] J. N. Chiasson. 'Nonlinear differential-geometric techniques for control of a series DC motor', IEEE Trans. Contr. Syst. Technology, 2, 35-42, 1994. [6] S. Diop, J. W. Grizzle, P. E. Moraal and A. Stefanopoulou. Interpolation and numerical differentiation for observer design, Proc. ACC 94, Evanston, Illinois, 1329-1333, 1994. [7] J. P. Gauthier, H. Hammouri and S. Othman. A simple observer for nonlinear systems applications to bioreactors. IEEE TAC, 37(6), 885880, 1992. [8] D. Guillaume, P. Rouchon and J. Rudolph. Two simple observers for a class of polymerization reactors, ~th EGG '97, Bruxelles, 1997. [9] A. Glumineau, C. H. Moog and F. Plestan. New algebro-geometric conditions for the linearization by input-output injection, IEEE Trans. Autom. Control, 41,598-603, 1996.
[10] H. Hammouri and J. P. Gauthier. Bilinearization up to output injection, Syst. Contr. Letters, 11, 139-149, 1988. [11] H. Hammouri and J. P. Gauthier. Global time varying linearization up to output injection, SIAM J. Control Optim., 30 1992, 1295-1310, 1992. [12] H. Hammouri and M. de Le6n. Observer synthesis for state-affine systems, 29 th. CDC, Honolulu, Hawai. Dec. 5-7, 1990. [13] H. Hammouri and M. de LeSn. On systems equivalence and observer synthesis. New Trends in Systems Theory, 340-347, 1991. [14] H. Hammouri and M. Kinnaert. A new procedure for time-varying linearization up to output injection, Syst. Contr. Lett., 28, 151-157, 1996.
70
4. Transformation to State Affine System and Observer Design
[15] H. Keller. 'Nonlinear observer design by transformation into a generalized observer canonical form', Int. J. Control, 46, 1915-1930, 1987. [16] A. J. Krener and W. Respondek. Nonlinear observers with linearizable error dynamics, SIAM J. Contr. Optim., 23, 197-216, 1985. [17] J. de Ledn Morales and S. Acha Daza. Stabilization of a class of nonlinear systems, application to a synchronous generator, Proc. of I A S T E Int. Conf. High technology in the Power Industry, Orlando Fl., USA, 23-26, 1998. [18] J. L~vine and R. Marino, Nonlinear systems immersion, observers and finite dimensional filters, Syst. Contr. Lett., 7 (1986), 133-142. [19] V. Ldpez-M. and A. Glumineau. 'Further results on linearization of nonlinear systems by input output injection', Proc. 36th. CDC IEEE, (San Diego, USA), 1997. [20] V. Ldpez-M. and J. de Ledn Morales and A. Glumineau. Transformation of nonlinear systems into state affine control systems and observer synthesis, IFAC CSSC, Nantes, France, July 8-10, 1998. [21] V. Ldpez-M., F. Plestan and A. Glumineau. Linearization by completely generalized input-output injection, Proc. of 6 th IEEE MCCA, Alghero, Italy, 9-11, 1998. [22] R. Marino and P. Tomei. 'Dynamic output feedback linearization and global stabilization', Syst. Contr. Letters, 17, 115-121, 1991. [23] F. Plestan and B. Cherki. 'An observer for a one flexible joint robot by an algebraic method', Proc. IFA C Workshop on New Trends in Design of Control Systems NTDCS'9~ (Smolenice, Slovakia), 41-46, 1994. [24] F. Plestan and A. Glumineau. 'Linearization by generalized inputoutput injection~t for electrical motor observers', Electrimacs 96 (Saint Nazaire, France), 2/3, 569-574, 1996. [25] F. Plestan and A. Glumineau. 'Linearization by generalized inputoutput injection', Syst. Contr. Letters, 31, 115-128, 1997. [26] T. Proychev and R. L. Mishkov. 'Transformation of nonlinear systems in observer canonical form with reduced dependency on derivatives of the input', Automatica, 29, 495-498, 1993. [27] J. Tsinias. Further results on the observer design problem, Syst. Contr. Lett. ,14, 411-418, 1990. [28] D. Williamson. Observation of bilinear systems with application to biological systems, Automatica, 13, 243-254, 1977.
4. Transformation to State Affine System and Observer Design
71
[291
A. J. Van der Schaft. Representing a nonlinear state space system as a set of higher-order differential equations in the inputs and outputs, Syst. Contr. Lett., 12, 151-160, 1989.
[3o]
F. Viel. Stabilit~ des syst~mes control,s par retour d'~tat estim~. Application aux r~acteurs de polym~risation et aux colonnes ~ distiller, Th~se de Doctorat, Universit~ Claude Bernard-Lyon 1, Lyon, 1994.
[31] X. H. Xia and W. B. Gao. Nonlinear observer design by observer error linearization, SIAM J. Contr. Optim., 1, 199-216, 1989.
On E x i s t e n c e of E x t e n d e d O b s e r v e r s for N o n l i n e a r Discrete-Time Systems H. J. C. Huijberts D e p a r t m e n t of Mathematics and Computing Science Eindhoven University of Technology P.O. Box 513, 5600 MB Eindhoven, The Netherlands
1
Introduction
In this chapter, we consider the design of observers for discrete-time nonlinear systems by means of so called (extended) observer forms. Loosely speaking, a system in observer form is a linear observable (continuous-time or discrete-time) system t h a t is interconnected with an o u t p u t - d e p e n d e n t nonlinearity. Observers for this kind of systems m a y be built by building a classical linear Luenberger observer for the linear system, and adding the output-dependent nonlinearity to this observer. Thus, observer design for systems in observer form is relatively easy. By the same token, also observer design for systems that may be transformed into a system in observer form by means of a coordinate transformation and an output t r a n s f o r m a t i o n is relatively easy. Observer design for systems in observer form was first studied, in the continuous-time setting, in [10],[11] (see also [15]). In these papers, conditions were given under which a nonlinear continuous-time s y s t e m m a y be transformed into a system in observer form by means of a coordinate transformation and an output transformation. Basically, these conditions were given in terms of the integrability of certain codistributions. Later on, the observer design for discrete-time systems in observer form was studied (see [1] ,[12],[13] and the references therein), and conditions were given under which a nonlinear discrete-time system m a y be t r a n s f o r m e d into a system in observer form by means of a coordinate t r a n s f o r m a t i o n and an output transformation. These conditions came down to the question whether certain functions could be factorized in a certain way. For singleoutput systems, conditions under which this factorization is indeed possible were given when only output transformations are allowed. (In fact, [13] also claims to give conditions for the multi-ontput case. However, these conditions seem to be incorrect.) One of the purposes of this chapter is to
74 5. On Existence of Extended Observers for Nonlinear Discrete-Time Systems generalize the conditions given in [13] for single-output systems to the case where, besides a coordinate transformation, also an o u t p u t t r a n s f o r m a t i o n is allowed. T h e conditions for the existence of an observer form for continuous-time systems and discrete-time systems given in [10],[11],[1],[12],[13] are quite restrictive. Therefore, generalizations have been considered, b o t h in the continuous-time case and the discrete-time case. In the continuous-time case, so-called generalized observer forms were considered. These generalized observer forms consist of an observable linear system interconnected with a nonlinearity that depends on the output of the system and a finite number of its derivatives. In [6], differential geometric conditions were given under which a continuous-time system m a y be transformed into a generalized observer form by means of so called generalized coordinate transformations (i.e., transformations that, besides the state of the system, also depend on a finite number of time-derivatives of the outputs) and an output transformation. In the discrete-time context, the design of so called extended observers by using extended observer forms was studied in [7],[8]. Here, an extended observer is an observer that, besides the o u t p u t of the system, also depends on a finite number of its past values, while a s y s t e m in extended observer form consists of an observable linear system interconnected with a nonlinearity t h a t depends on the output of the system and a finite number of past output values. In [7],[8], conditions were given under which a given single-output discrete-time system m a y be t r a n s f o r m e d into a system in extended observer form by means of an extended coordinate transformation (i.e., a coordinate transformation t h a t depends on the state of the system and a finite number of past o u t p u t values) and an output transformation. As in [1],[12], these conditions again boilt down to the question whether a given function m a y be factorized in a certain way. A corollary of the results obtained in [7],[8] is t h a t when the n u m b e r of past output values equals n - 1 (where n is the dimension of the state space of the system under consideration), an extended observer form always exists when the system under consideration is strongly observable (for the exact definition of strong observability, we refer to Section 3). In this chapter, differential geometric conditions for the existence of such a factorization for the cases t h a t the number of past o u t p u t values is smaller t h a n n - 1 will be given. It is to be noted t h a t in principle the problem of observer design is a global problem. Therefore one would also like to obtain global conditions for existence of extended observer forms. However, if one knows t h a t the system under consideration evolves on an invariant set, also existence conditions on this invariant set would suffice. In this chapter, all results obtained will be valid on open invariant sets on which some regularity assumptions hold. Of course, this also includes the case where one really would like to have global conditions. The chapter is organized as follows. In the following section, an overview
5. On Existence of Extended Observers for Nonlinear Discrete-Time Systems
75
of results from the theory of differential forms t h a t will be used in this chapter is given. In Section 3, we consider the existence of observer forms for single-output discrete-time systems. The results in this section reformulate and generalize the results in [13]. In Section 4, the existence of extended observer forms for single-output discrete-time systems is studied. Section 5 contains some conclusions.
2
Differential
Forms
In this section we give an overview of results from the theory of differential forms t h a t will be used in this chapter. For details, we refer to [2],[3],[4],[5],[14]. Let V be an r-dimensional vector space over /R. A k-.form w on V is a k-linear completely antisymmetric m a p p i n g w : V • .-. x V ---* ~ , i.e.,
k times
( w , Z e t~)(vvl, vl, v 2 , . . . , vr ~ v ) ( ~ ( ~ l + Zvl,
~2,.
9 9
~k) = (5.1)
~(~,..
, ~))
, vk) + ~,~(v~, ~ , . . .
(VVl,..- ,v k
E
V)(t.d(Vl,-.. ,vk)
=
(5.2) --~d(Vl,--- , V i - l , V i + l , V i , V i + 2 , "
,vk))
Note t h a t a one-form on V is just an element of V*, the dual of V. The space of all k-forms o11 V is denoted by Ak(V*). It is easily checked t h a t the k-linearity and a n t i - s y m m e t r y of a k-form on V implies t h a t all k-forms are zero for k > r. We define A(V*) := A~
*) (t) A I ( V *) (+)... (]9 A~(V *)
(5.3)
where A~
*) :=
We call A(V*) the exterior algebra over V*. An element w c A(V*) is called a .form on V and m a y be written in a unique way as W :wO-[-wl-J-...~-W
r
(5.4)
where wi E Ai(V *) (i -- 0 , - . . ,r). We next define a product on A(V*), the so called wedge product (or exterior product). This product will be denoted by "A". First, let 7/E Av(v*),
76
5. On Existence of Extended Observers for Nonlinear Discrete-Time Systems
w E Aq(V*). Then 7 A w C AP+q(v *) is defined by (7 A ~ ) ( v i , -
9 - , vp+q) =
E(sign(~))7(v~(1),
- 9 -, v~(p)).4v~(p+l),
9 9 -,
7r
v~(p+q))
(5.5)
where the s u m m a t i o n is over all possible p e r m u t a t i o n s of 1 , . . . ,p 4- q, and Vl,... ,Vp+q C V. For 7 = 7 ~ 4- " " 4- 7 r, w = w ~ 4- ... 4- w r, with 7i,w i c Ai(V *) (i = 0 , . - . ,r), we define
T A w : s 7i Aaj j i,j~o
(5.6)
Note t h a t the wedge-product is associative and distributive, but not commutative. Instead, it satisfies 7 A w = (-1)~qw A 77, 7 e AP(V*),w e Aq(V *)
(5.7)
Let v E V be given. Then the interior product v_J : A(V*) ~ A(V*) is defined in the following way. First consider w E Ak(V*). T h e n v J w C A k - I ( V *) is given by (V--J~d)(Vl,''" , V p _ l ) = • ( V , V 1 , ' ' "
where v l , - . . , 8 9 0 , . . . ,r), then
C V. I f w = w ~
, V p _ 1)
(5.8)
", with w i E Ai(V *) (i = r
(v_l w) -- E(v_I w i)
(5.9)
i=0
Next, consider an n-dimensional manifold M. Let T*M denote the cotangent space at x E M and let T*M denote the cotangent bundle of M. Since T*M is an n-dimensional vector space over ~ , we m a y define A k ( T * M ) for a l l x E M , k = 0 , - . . ,n, as n
A ( T ; M ) := E A k ( T * M ) k=0 We then define the bundles Ak(T*M), A ( T * M ) over M by Ak(T*M):=
U
Ak(T*M),(k=O,,n)
(5.10)
xcM n
A(T*M) := ~ A k ( T * M ) k=0
(5.11)
5. On Existence of Extended Observers for Nonlinear Discrete-Time Systems
77
A differential form on M is now defined to be a s m o o t h section of the b u n d l e A ( T * M ) , while a differential k:form on M is defined to be a s m o o t h section of the bundle A k ( T * M ) . So, roughly speaking, a differential ( k - ) f o r m on M is a "prescription" t h a t assigns a ( k - ) f o r m w~ on T * M to every x C M in a s m o o t h way. Note t h a t by this definition a differential 0-form on M is just a s m o o t h function on M . W h e n no confusion arises, we will simply call a differential ( k - ) f o r m on M a ( k - ) f o r m on M in the sequel. T h e wedge product of the forms 7, w on M is defined to be the f o r m (TAw) satisfying
(t1 A w)x = fix A Wx, (Vx 6 M )
(5.12)
Let 7- be a s m o o t h vector field on M , a n d let w be a form on M . T h e n the interior product (7-Aw) is defined to be the form satisfying
(rn ~)~ = rxn.~x, (Vx ~ M)
(5.13)
T h e exterior differential operator d m a p s a k-form w into a (k + 1)-form dw, called the exterior derivative of w. T h e o p e r a t o r d is uniquely defined by the following properties: 1. d is linear: (Va,/3 E H~)(d(a~/+/3w) = a d r 1 + / 3 d w ) 2. If r/is a k-form, then d(r] A w)
d~ A w
=
+
( - 1 ) k r / A dw
3. d 2 = 0 . 4. If f is a 0-form, then d f is the o r d i n a r y differential df of f . 5. d is local: if r / a n d w coincide on an o p e n set U, then dr/--- dw on U. A k-form c~ is called closed if dc~ -- 0; it is called exact if there exists a ( k - 1)-form r] such t h a t w = dr/. Note that, since d 2 = 0, an exact k-form is closed. T h e converse does not need to hold globally. However, it does hold locally, as is reflected by the following theorem. 5.1 ( P o i n c a r ~ L e m m a ) If M is smoothly contractible to a point xo E M , then every closed.form w on M is exact. Theorem
Let r be a s m o o t h vector field on M . T h e Lie-derivative s m a p s a k-form w into a k-form Z;~w. s is uniquely defined by the following properties: 1. If f is a 0-form on M , t h e n
s
=
TJdf
78
5. On Existence of Extended Observers for Nonlinear Discrete-Time Systems 2. /2, is a derivation: /:~-(~ A w) = r/A L:~-w + l:~-~ A w 3. s
commutes with d: s
= d(s
From the definitions of interior product, exterior derivative, and Liederivative one may derive the following identities t h a t will be frequently used in the sequel (here r., r denote smooth vector fields on M , and w denotes a k-form on M). Z;,w = d(T_lw) + [a, 7]_lw = s
(5.14)
T_ldw
- r_l/:ow(Leibniz - formula)
s
= s163
- g~s
(5.15) (5.16)
Denote by ~tl(M) the set of all one-forms on M. T h e n ~ I ( M ) has the structure of a finitely generated module over the ring of s m o o t h functions on M. A finitely generated submodule of ~1 (M) is called a codistribution on M. The minimal number of generators of a codistribution is called its dimension. A codistribution ~ on M is called integrable if it has a set of generators consisting of exact one-forms. The codistribution generated by a set of one-forms w l , . - - , ~dd is denoted by span{wl, 9-. , ~2d}. For a one-form w and a d-dimensional codistribution ~t, we will say t h a t dw ~ Omod~ if and only if dw
A wl
A
. . . A wd
=
O(Vwl,
. . .
,WdE~)
T h e o r e m 5.2 ( P r o b e n i u s T h e o r e m ) Let M be smoothly contractible to a point xo E M , and let Q be a codistribution on M . Then the .following statements are equivalent. (i) ft is integrable. (ii) For every w E f t we have that dw = 0modfL (iii) Let { w l , ' " ,Wd} be a set o.f generators of f~ and let W d + l , ' ' " ,wn be such that { w l , ' " ,wn} generate f t l ( M ) . T h e n ft is integrable if and only i.f there exist smooth .functions Fikj (i, k = 1 , . . - , d; j = i + 1 , . . . ,n) on M such that dwk = E /=1 j=/+l
F~jwi A w j ( k = 1 , . . . ,d)
(5.17)
5. On Existence of Extended Observers for Nonlinear Discrete-Time Systems
79
In what follows, we will also need the following result. Theorem
If w l , " " ,Wd are independent oneare one-forms such that
5.3 ( C a r t a n ' s
.forms, and 7rl,...
, 7rd
Lemma)
7rl A w] + . - . + TcclA Wd = 0 then 71"i E span{w1,..-
3
,02d}
Observer Design using Observer Forms
We start our investigation of observer design for nonlinear discrete-time systems by considering a nonlinear discrete-time system E of the form
{ z(k +~](k)l) _------ Cz(k)AZ(k)+ (I)O)(k))
(5.18)
with state z E ~/~n, output ~ E ~x~,and where A, C are matrices of appropriate dimensions, the m a p p i n g (I) : ~ ~ H~'~ is smooth, and the pair (C, A) is in Brunovsky form. Analogously to [10],[11], a system of this form is called a system in observer.form. For systems in observer form, observer design is particularly simple. Namely, the fact that the pair (C, A) is in Brunovsky form, and thus in particular observable, implies t h a t there exists a m a t r i x K such that all eigenvalues of the matrix A - K C are in the open unit disc. It is then straightforwardly checked that the following system is an observer for E:
{ 2(k + 1) = ~(k)
=
A 2 ( k ) + K ( ~ ( k ) - y(k)) +
q)(~(k))
c~(k)
(5.19)
We next consider a nonlinear discrete-time system E of the form ~ x ( k + 1) y(k) [
=
=
f(x(k))
h(x(k))
(5.20)
where x E ~'~, y E JT~, and the mappings f : /R n --~ /R '~ and h : ~ -~ are smooth. We will say t h a t E can be put in observer form if there exist a diffeomorphism P : JT~n -~ ~ n of the state space and a diffeomorphism p : ~ ---* h'~ of the output space such t h a t in the new coordinates z = P ( x ) and with the new output ~ = p(y) the system E takes the form (5.18), where the pair (C, A) is in Brunovsky form. If E does admit an observer form, an observer for E m a y then be obtained by first building an observer (5.19) for the observer form (5.18) and then letting ~(k) := P - l ( 2 ( k ) ) be the estimate of x(k).
80
5. On Existence of Extended Observers for Nonlinear Discrete-Time Systems
We thus see that observer design for a discrete-time system E of the form (5.20) is relatively easy when E can be put in observer form. This raises the question under what conditions E can be put in observer form. To derive these conditions, we first introduce the so called observable .form of E. Define the observability map ~ : A n ~ j~n by
h(x) ) ho f(x) . ho f~-l(x)
r
(5.21)
where f l : = f , fk := f o fk-1. Assume that the origin is an equilibrium point of E (i.e., f(0) = 0) and that h(0) ---- 0. We then call E strongly observable on an open subset U C ~ n containing the origin if r is a diffeomorphism on U. It follows that if E is strongly observable on U, then s := r forms a new set of local coordinates for E on U. In these new coordinates, E takes the form
I
s l ( k + 1) Sn_l(~-t-1)
s~(k + 1) y(k)
=
s2(k)
~---
8n(k )
=
fs(s(k)) sl(k)
=
(5.22)
where fs : = h o f,~-I o ~ - 1 . The form (5.22) is referred to as the observable
.form of E. We now have the following result. T h e o r e m 5.4 Consider a nonlinear discrete-time system E of the .form
(5.20), and assume that f(O) = O, h(O) = O. Then r can be put in observer .foT-rn on an open invariant subset U o.f (5.20) containing the origin i.f and only if (i) E is strongly observable on U. (ii) There exist .functions p, r "" , r : ~ ---* ~ , where p is a diffeomorphism on h(U), such that on U the .function fs in (5.22) satisfies p o f~(s) = r
+ r
+"" + r
(5.23)
P r o o f . The proof for the case where only coordinate transformations are considered (or, in other words, the case where p = i d a ) may be found in [13],[8],[9]. The case where also output transformations are considered is an (almost) immediate consequence of the case where p = i d a . 9
5. On Existence of Extended Observers for Nonlinear Discrete-Time Systems
81
If functions p, r , r : ~ --* ~ exist such t h a t fs satisfies (5.23), we define the following c o o r d i n a t e change: i--1
z~ := p ( s d - ~
Cj(s~_j)(i = 1 , . - , n)
(5.24)
j=l
In these new coordinates, we obtain the observer form for E: =
z2(k)+r
z~_l(k)
=
z~(k)+~n_l(~(k))
~(k)
=
z~(~)
zl(k+l)
(5.25)
where r : = r o p - 1 . From T h e o r e m 5.4, it follows t h a t the question w h e t h e r or not E can be p u t in observer form m a y be reduced to the question w h e t h e r or n o t the function fs in (5.22) m a y be w r i t t e n in the special form (5.23). We will now derive conditions on fs under which this is possible 9 To this end, we define one-forms w l , - . . , wn by
~i :=
j=l
/~ \ as s )
1,
,n)
(5.26)
For the case t h a t p = i d ~ we t h e n have the following result. 5.5 Consider a discrete-time system E of the .form (5.20) that is strongly observable on an open inva~iant subset U C ~:~'~ containing the origin. Assume further that U is smoothly contractible to the origin, and that the one-forms W l , " " ,Wn in (5.26) generate a codistribution on U. Then E can be put in observer .form with p = idt~ on U i.f and only if the one-forms W l , " " , wn in (5.26) satisfy
Theorem
dwi = 0(i = 1 , . . . ,n)
(5.27)
P r o o f . (necessity) Follows by direct verification. (sufficiency) A s s u m e t h a t the one-forms wi in (5.26) satisfy (5.27). It then follows from (5.26,5.27) t h a t
0--d~
.....
~
dskAdsj(i=l,...,n)(5.28)
~=1 k = i + l
which is equivalent to
(OskOsj 02fs ) = 0 ( j , k = l , - - .
, n ; j 7~ k)
C5.29)
82
5. On Existence of Extended Observers for Nonlinear Discrete-Time Systems
This implies t h a t there exist functions r satisfies (5.23) with p = i d a . 9
,r
: ~ ~ ~ such t h a t f~
We next give conditions under which f~ m a y be w r i t t e n in t h e f o r m (5.23), where p r i d a . To do this, we let r l , . . . ,7~ be vector fields t h a t are dual to the one-forms CO1,"" ,w,~ in (5.26), i.e.,
TiJ wj --- 6ij (i, j = 1 , . . . , n)
(5.30)
where 5ij is the Kronecker delta. T h e o r e m 5.6 Consider a discrete-time system E of the .forvn (5.20) that is strongly observable on an open invariant subset U C ~ n containing the origin. A s s u m e .further that U is smoothly contractible to the origin, and that the one-forms co1,"" ,COn in (5.26) generate a codistribution on U. Then the .following statements are equivalent:
(i) ~ can be put in observer .form on U. (ii) There exists a .function S : U -~ ~7~ such that dCOi := d S A wi(i = 1 , . . . ,n) (iii) The one-forms
CO1,'''
, COn
(5.31)
and the vector fields r a , " " , r~ satisfy
dCOi A COj + dCOj Awi = 0(i, j = 1 , . . - , n)
(5.32)
and
(5.33) P r o o f . ( i ) ~ (ii) Assume t h a t E can be p u t in observer form on U. Define one-forms &i by
~i : =
j=l
\
Osj
It t h e n follows from T h e o r e m 5.5 t h a t we have t h a t d~i = 0(i = 1 , . - . ,n)
(5.35)
Together with the fact t h a t &i = (P' o f~)w~, this gives t h a t O = & S i = d (p,~s
A&~ (5.36)
(p' o f~)d (p,~s ) A w i = d ( - log Ip' v fs]) A col
5. On Existence of Extended Observers for Nonlinear Discrete-Time Systems
83
which establishes our claim for S : = - log {p~ o f~{. ( i i ) ~ ( i ) A s s u m e t h a t there exists a function S : g ~ h~ such t h a t (5.31) holds. Since wn = df~, (5.31) for i = n gives t h a t d S A df~ = 0. B y C a r t a n ' s L e m m a , this gives t h a t d S E span{df~}. Define T : = e x p ( - S ) . We t h e n also have t h a t d T E span{df~}, which implies t h a t there exists a f u n c t i o n /5 : ~ ~ ~ such t h a t T = / 5 o f~. Define p : _t~ ~ ~ by p : = f p ( T ) d % a n d define one-forms & l , " " , &,~ as in (5.34). N o t e t h a t we t h e n have t h a t 9 = Twi (i = 1 , . . . , n), which implies t h a t d&i = Tdw~ + d T A w i = T ( d w i - d S A wi) = O(i = 1 , . . . , n)
(5.37)
Together with T h e o r e m 5.5 this establishes our claim. (ii)~=~(iii) We first show t h a t (5.32) is equivalent to the existence of a unique one-form 71 such t h a t awl = ~] A wi(i = 1 , . . . , n)
(5.38)
Note t h a t if there exists a one-form V such t h a t (5.38) holds, t h e n (5.32) follows immediately. Conversely, assume t h a t (5.32) holds. For i = j, (5.32) gives in particular t h a t dwi A wi = 0 (i = 1,. 9 9 , n). It t h e n follows from the Frobenius T h e o r e m t h a t there exist one-forms P l , " " , pn such t h a t dwi = Pi A a~i(i = 1 , - . . ,n)
(5.39)
rt
Writing p~ = ~ k = l pikcok (i = 1 , - . . , n), we then o b t a i n from (5.32) t h a t 0 = dcoi A wj + doaj A aJi . . . . .
~ k=
(pit: - pjk)wk A w i A wj
(5.40)
I
k#i,j which gives t h a t Pik = p j k ( i , j , k = 1 , . . . , n ; k r i , j )
(5.41)
This i m m e d i a t e l y implies t h a t indeed there exists a unique one-form r / s u c h t h a t (5.38) holds. W h a t remains to be done, is to show t h a t there locally exists a function S such t h a t 7] = d S if a n d only if (5.33) holds. W r i t i n g :
n
Ej=I
7]jO.)j, w e
have that
n
n--1
d~ = ~-~(drlj A wj + ~ j d w j ) . . . . . j=l
~
(s
- g,-j~i)wi A wj
j=l i=j+l
(5.42) B y the Poincar~ L e m m a , this gives t h a t there exists a function S : U ~ such t h a t rI = d S if a n d only if /2~j
=/:~,rli(i, j = 1 , . . - , n)
(5.43)
Using (5.15),(5.14), it is straightforwardly checked t h a t ~y = [T~,Tj]_I w~. Together with (5.43), this yields (5.33), which establishes our claim. 9
84
5. On Existence of Extended Observers for Nonlinear Discrete-Time Systems
4
Observer Design using Extended Observer Forms
In the previous section, we have seen t h a t observer design for a nonlinear discrete-time system (5.20) is relatively easy when the system can be put in observer form (5.18). Unfortunately, however, the conditions for existence of an observer form given in T h e o r e m 5.4 are quite restrictive. In this section, we will relax these conditions by considering so called extended observers and extended observer .forms. We will first explain what is meant by an extended observer. Consider a system E of the form (5.20), and let N E ~V be given. We now assume t h a t at every time instant k > N we do not only know the output y(k) at t i m e k, but also the past outputs y(k - 1),-. - , y(k - N). An extended observer with buffer N for E then is a dynamical system E of the form E : ~(k + 1) --- f ' ( ~ ( k ) , y ( k ) , - - . , y ( k - N ) ) , k _> N where ~ E ~ ' ~ and the m a p p i n g f ' : /R n + g + l ~ ff~'~ is smooth, with the property t h a t x(k) - ~(k) --* 0 (k ~ oo), for all x(0), ~ ( N ) E ff~n. To study the design of extended observers for a discrete-time system E of the form (5.20), we first consider a system E~ of the form
~
[
z ( k + l) 9(k)
= =
Az(k)+O(~(k),... ,~)(k-N)) Cz(k)
(5.44)
where the state z E ~'~, the output ~ E ~ , A, C are matrices of a p p r o p r i a t e dimensions, the mapping ~ : ~ g + l ~ ~,~ is smooth, and the pair (C, A) is in Brunovsky form. Note that for N = 0 the system E~ is identical to the system E in (5.18). Therefore, a system E~ of the form (5.44) will be referred to as a system in extended observer.form with buffer N. As for a system in observer form, the design of an extended observer for a s y s t e m in extended observer form is relatively easy. Namely, it is straightforwardly checked that the system
{
E(k + 1)
=
AE(k) + K(~](k) - y(k)) + O(~)(k),... ,~)(k - N ) )
=
c
(k)
(5.45) where the matrix K is such that all eigenvalues of A - K C are in the open unit disc, is an extended observer for E ~. As in the previous section, we now consider the question under which conditions a given discrete-time system can be put in extended observer form for some N E / N . The transformations we are going to use here, are more general t h a n the ones in the previous section, in the sense t h a t we also allow them to depend on the past output measurements y ( k - 1 ) , - . 9 , y ( k N). More specifically, we will be looking at p a r a m e t r i z e d transformations Z = /:~(X, ~ n , ' ' " , ~N), where z E ~ n , with the property t h a t there exists a
5. On Existence of Extended Observers for Nonlinear Discrete-Time Systems
85
m a p p i n g p - l ( . , f l , ' " 9 , iN) : ~ n __~ ~ n p a r a m e t r i z e d by (fl," " " , i N ) such t h a t for all ( f l , " " , iN) we have t h a t
P(P-l(z, fl,""
,fN),fl,''"
,iN)
:
Z
A m a p p i n g having this property will be referred to as an extended coordinate change. We will then say t h a t the system (5.20) can be put in extended observer .form with buffer N if there exists an extended coordinate change P ( ' , fl," "" , iN) : ~ n --, ~ n parametrized by (fl," "" , iN) and a diffeomorphism p : / R -~ H~ of the output space such t h a t the variable
z(k) := P ( x ( k ) , y(k - 1),--. , y(k - N ) )
(5.46)
satisfies (5.44), where ~) :-- p(y), and the pair (C, A) is in Brunovsky form. As pointed out above, one m a y then build an extended observer (5.45) for z(k) in (5.46). From this extended observer, one then obtains an e s t i m a t e ~(k) for x(k) by inverting the extended transformation P: ~(k) := P - l ( ' d ( k ) , y ( k - 1 ) , . . - , y ( k -
N)),k > N
(5.47)
The following generalization of T h e o r e m 5.4 gives conditions under which a discrete-time system (5.20) can be put in extended observer form. 5.7 Consider a nonlinear discrete-time system (5.20), and assume that f(O) = O, h(O) = O. Let N E be given. Then E can be put in observer .form with buffer invariant subset U C ~ztn containing the origin i.f and only
E of the .form
Theorem
{ 0 , . . . , n - 1}
N on an open if
(i) E is strongly obsewable on U. (ii) There exist .functions r ,r : ~ N + I --~ ~ and a diffeomorphism p on h(V), such that on V the .function fs in (5.22) satisfies n-N
p o f~(s) ---- E
r
, si)
(5.48)
i=1
P r o o f . The proof for the case where only coordinate transformations are considered (or, in other words, the case where p = idt~) m a y be found in [8]. The case where also output transformations are considered is an (almost) immediate consequence of the case where p = idt~. 9
If there exist functions p, r defines the following variables:
,
Cn-N such t h a t fs satisfies (5.48), one
z (k) E Cj(s~_j(k),..- , s l ( k ) , y ( k j=l
(i----1,...,n)
1),--. , y ( k -
N-
1 +i-j))
(5.49)
86 5. On Existence of Extended Observers for Nonlinear Discrete-Time Systems where c~i : = m i n i / t~ y + l by
~)n-N : ~ N + I
1, n - N). Further, define r
~i(~1,''" ,~N+I) : : (~I(p--I(~I),''" ,P--I(~N+I)) (i=l,-.-,n-N) It m a y t h e n be shown t h a t in these variables we o b t a i n the following ext e n d e d observer form: zl(k-4- 1)
=
z2(k)
+ q~l(y(k),... ,y(k -
N))
z,~_g(k+ 1)
=
Zn_NA-l(k)-~-~n_N(y(]'g),''"
z n - g + l ( k + 1)
=
Zn-N+2(k)
= = =
o z (k)
zn_l(k
+ 1)
,y(]~ -- g ) )
(5.50)
Note t h a t T h e o r e m 5.7 generalizes T h e o r e m 5.4. ~hlrther, f r o m T h e o r e m 5.7, we o b t a i n the following result for N = n - 1 (see also [7],[8]). C o r o l l a r y 5.1 Consider a nonlinear discrete-time s y s t e m E of the .form (5.20), and assume that f ( 0 ) = 0, h(0) = 0. Then E can be put in observer .form with buffer N on an open invariant subset U C i~:tn containing the o,~igin i.f and only i.f E is strongly observable on U. Thus, we see t h a t every strongly observable s y s t e m can be p u t in ext e n d e d observer with buffer N = n - 1. F r o m a practical point of view, e.g. when n is large, it m a y be desirable to reduce the size of the buffer. T h e r e fore, we next investigate under which conditions an e x t e n d e d observer with buffer N E { 1 , . . . , n - 2} exists. As in the previous section, these conditions are again given in terms of the one-forms wi in (5.26) a n d the vector fields ~-~ in (5.30). Using the one-forms w~ in (5.26), we define the following codistributions: ~i : = span{wk - wk-1 ] k = i + 1 , . - . , m i n ( n , i + N ) } ( i = 1 , . . . , n ) (5.51)
~i : = s p a n { w ~ , . . - , w i + g } ( i = 1 , . . . , n - - N -
1)
(5.52)
We first consider the case w i t h o u t o u t p u t t r a n s f o r m a t i o n s , i.e., the case where in (5.48) we have t h a t p = i d a . This result generalizes T h e o r e m 5.5. T h e o r e m 5.8 Consider a discrete-time s y s t e m E o.f the .form (5.20) that is strongly observable on an open invariant subset U C ~ n containing the
5. On Existence of E x t e n d e d Observers for Nonlinear Discrete-Time Systems
87
origin. A s s u m e .further that U is smoothly contractible to the origin, and that the o n e - f o r m s wl , . . . , a~n in (5.26) generate a codistribution on U. L e t N E {1,. - 9 , n - 2} be given. T h e n .for p = i d a , E can be p u t in extended observer .form with buffer N on U i f and only i f the o n e - f o r m s wi in (5.26) satisfy dwi ~- 0 m o d f ~ i ( i = 1 , - - - , n )
(5.53)
P r o o f . (necessity) Follows b y d i r e c t v e r i f i c a t i o n . (sufficiency) A s s u m e t h a t t h e o n e - f o r m s wi i n (5.26) s a t i s f y (5.53). N o t e t h a t b y t h e d e f i n i t i o n of t h e wi we h a v e t h a t f~i = s p a n { d s k [ k = i + 1 , - . . , r a i n ( n , i + N ) } ( i = 1 , - - - , n ) D e f i n i n g c~i : = r a i n ( n , i + N ) (i = 1 , - - . , n ) , (5.53) t h e n gives
0 = dwi A dSi+l A 9 .. A d s ~ . . . . . i
E j=l
E
(\ os~.Os~ o s, ))
k=c~i+l
'
A
A
A.. A
(5.54)
"
(i = 1 , . . . , n ) w h i c h is e q u i v a l e n t to
( Osksj o2f ])
= O(j, k = 1 , . . . ,n; IJ - kl > n)
(5.55)
I t is e a s i l y checked t h a t t h i s c o n d i t i o n is e q u i v a l e n t t o t h e e x i s t e n c e of f u n c t i o n s 4 ) 1 , " " , c/)~-N s u c h t h a t fs satisfies (5.48). 9 F o r t h e case t h a t in (5.48) we h a v e t h a t p ~ i d a , t h e f o l l o w i n g r e s u l t holds. Theorem 5 . 9 Consider a discrete-time s y s t e m E o.f the .form (5.20) that is strongly observable on an open i n v a r i a n t subset U C ~T~~ c o n t a i n i n g the origin. A s s u m e .further that U is smoothly contractible to the origin, and that the o n e : f o r m s Wl,. 9 9 , wn in (5.26) generate a codistribution on U. L e t N E {1, - 9 9 , n - 2} be given. T h e n the .following s t a t e m e n t s are equivalent:
(i) E can be p u t in extended observer .form with buffer N on U. (ii) There exists a .function U : ~
-~ ~zt such that
dwi - d S A wi = 0 m o d f t i ( i = 1 , - . . , n )
(5.56)
(iii) The o n e : f o r m s wl," " 9 , wn and the vector .fields ml,. 9 9 , "r,~ satisfy daJi - 0 m o d F t l + s p a n { w n } ( i = 1 , . - . , n - N - 1)
(5.57)
88
5. On Existence of Extended Observers for Nonlinear Discrete-Time Systems dwi =- 0 m o d ~ i ( i = 1 , . - . , n - N - 1)
(~.ss)
dwi -=- 0modf~i(i
(5.59)
= n-
N,...
,n -
1)
[cri, 7,]_lwn = [c~j, T n ] J w ~ ( i , j = 1 , . . , n -
N-
I,''" ,n-- N--1;j=
l,...
s
1)
(5.60)
,n--l)
(5.61)
where the vector.fields or1,... , crn_N_l are de.fined by o'i
:=
T i ~ - " "" n a T i + N ( i
----
1 , . . . , n - N - 1)
(5.62)
P r o o f . ( i ) ~ ( i i ) A s s u m e t h a t there exist functions p, r , C n - N , such t h a t p is a diffeomorphism on h ( U ) a n d fs satisfies (5.48). Define o n e - f o r m s &l,'", 9 by
o?i :=
( O(po_f,) ~ dsj(i = 1,... j=l
\
n)
Osj ]
(5.63)
T h e n it follows from T h e o r e m 5.8 t h a t d&i ~ 0modspan{ 9
- 0~k_ 1 I k =
i + 1,... ,min(n,i + N)} (5.64)
(i = 1 , . - - ,n) Note t h a t we have t h a t &~ = ( p ' o f ~ ) w i (i = 1, .. , n). Defining the f u n c t i o n S := - log IP' o fsl, this gives 1 d '[p ' 0 f s ) ) A --:--: dwi - d S A w i = d (:---k--~.) p'of8 ~ -~ [~-~-~.~ p'of., o?z. . . . . .
p'~I., d&i(i = 1,--- , n)
(5.65) Further, it follows t h a t
span{i~2k -- ~]k--1 I ]~ ~---i ~- 1,--. , m i n ( n , i + N ) }
= f~i(i = 1,--.
,n) (5.66)
O u r claim is then established by combining (5.64),(5.65),(5.66). ( i i ) ~ (i) A s s u m e t h a t there exists a function S : U ~ H~ such t h a t (5.56) holds. Note t h a t from (5.26) we have t h a t wn -- df~. Thus, (5.56) for i = n gives t h a t 0 = dwn - d S A wn = - d S A wn
(5.67)
5. On Existence of Extended Observers for Nonlinear Discrete-Time Systems
89
By C a r t a n ' s L e m m a , this gives t h a t d S e span{dfs}. Define T : = e x p ( - S ) . T h e n we also have t h a t d T E span{df~}, a n d thus there exists a f u n c t i o n t5 : ~ --~ ~ such t h a t T = 15 o f~. Define p : = f i S ( r ) d r , a n d one-forms 9 , 9 as in (5.63). We t h e n have t h a t &i = Taxi (i = 1 , . . . ,n), a n d thus d&i = d T
Awi +
Tdwi = T(d~i - dS
Awi)
=-- 0modf~i(i = 1,--- , n) (5.68)
Together with T h e o r e m 5.8, this establishes our claim. (ii)r F r o m the fact t h a t ~,~ = dfs, it follows t h a t the f u n c t i o n S t h a t needs to exist has to satisfy d S A w n . By C a r t a n ' s L e m m a , this implies t h a t there should exist a function a such t h a t d S = o~w,~ a n d da A w~=O
(5.69)
Thus, the existence of a function S such t h a t (5.56) holds is equivalent to the existence of a function a satisfying (5.69) a n d dwi - (~Wn A wi -- 0 m o d f t i ( i = 1,. -- , n - 1)
(5.7o)
We now have t h a t a two-form w 2 satisfies w 2 _= 0mode2 for some codistrib u t i o n 12 if and only if X_I Y_I w 2 for all X, Y E f t • It is easily checked t h a t we have ~/l = span{r1,- 9 , ri_], r i + N + l , ' ' '
, r~, c~i}
(5.71) (i = 1 , - . . , n -
N-
1)
and 12~ = s p a n { r 1 , . . . , r i _ l , c l i } ( i = n -
N,-.. ,n-
1)
(5.72)
where, analogously to (5.62), we have :=
+...
+
= n - N,...
, n - 1)
(5.73)
NoB, let i E {1,--- , n -- N - 1} be given. We t h e n have 0 = %_1 re-I (da;i - a w n A w i ) = "rkJ re-I dwi = ['rk, re]-I wi (5.74) (k,(=l,...
,i-l,i+N+l,...
,n)
0 ---- Tk--I 0",_1 (dwi - aw,~ A wi) = "rk-I o'{_1 dw{ = [%, o-{]_.1 oJ{ (5.75) (k=l,.-.,i-l,i+N+l,---,n-1)
90
5. On Existence of Extended Observers for Nonlinear Discrete-Time Systems
and 0 = cr~AT,~_I(dcvi - awn A wi) = cri--I7nA dwl - a = [cri, 7-n]-Iw~ - a (5.76) Where in (5.74),(5.75),(5.76) and (5.14). Combining (5.74) ther, (5.76) gives (5.60), while Next, let i E {n - N , - . . , n -
the last equality follows by applying (5.15) and (5.75), we obtain (5.57) and (5.58). Furcombining (5.69) and (5.76) we obtain (5.61). 1} be given. We then have
0 = Tk/Te-I (dwi - o~w,, Awi) = 7k-I r e / d w i = [Tk, Te].-Iwi (5.77)
(k,e =
1,...
,i -
1)
and 0 = 7 k l c~iI (dwi - ozwn A wi) = Tk I cril dwi = [7k, ai] I wi
(5.78) (k=l,...,i-1) Combining (5.77) and (5.78), we then obtain (5.59). m R e m a r k 5.1 Theorem 5.9 generalizes Theorem 5.6. From the .first two items of both theorems this is seen immediately. I f however, one considers the third i t e m of both theorems the generalization is .far.from obvious at .first sight. This is due to the .fact that the equivalence (ii)~==>(iii) in Theorem 5. 6 holds .for general independent one:forms W l , ' " ,Wn, while this equivalence in Theorem 5.9 only holds .for Oneaforms w l , " ' , w , of the .form (5.26).
5
Conclusions
In this chapter, we have given conditions for the existence of extended observer forms and extended observers for single-output nonlinear discretetime systems. All conditions are valid on an open invariant subset of the state space t h a t is smoothly contractible to an equilibrium point of the system and on which some regularity assumptions are satisfied. This raises the question what can be said for the case t h a t (some of) the regularity assumptions are not satisfied. This remains a topic for future research. A further topic for future research would be the question when extended observer forms and extended observers for multi-output discrete-time systems exist. As also mentioned in the Introduction, it seems t h a t the conditions given in [13] for the existence of an observer form when only coordinate transformations are allowed, seem to be incorrect. Preliminary investigations suggest that in fact the problem of coining up with correct conditions m a y be quite intractable. On the other hand however, it m a y
5. On Existence of Extended Observers for Nonlinear Discrete-Time Systems 91 be shown by using the same techniques as in [7],[8] that a strongly observable multi-output system may always be put in observer form with buffer N = n* - 1, where n* equals the maximal so called observability index of the system.
Acknowledgments Part of this research was performed while the author was visiting the Laboratoire d'Automatique de Nantes, Nantes, France, supported by a grant from the R6gion Pays de la Loire. 6
REFERENCES [1] M. Brodmann. Beobachterentwur.f .fiir nichtlineare zeitdiskrete Systeme, VDI Verlag, D~isseldorf, 1994. [2] R. L. Bryant, S. S. Chern, R.B. Gardner, H.L. Goldschmidt and P.A. Griffiths, Ezterior differential systems, Springer, New York, 1991. [3] H. Cartan. Formes diff~rentielles, Hermann, Paris, 1967. [4] Y. Choquet-Bruhat and C. DeWitt-Morette (with M. Dillard-Bleick), Anaysis, manifolds and physics, Part I: Basics, North-Holland, Amsterdam, 1991. [5] H. Flanders. Differential forms with applications to the physical sciences, Dover, New York, 1989. [6] A. Glumineau, C.H. Moog and F. Plestan. New algebro-geometric conditions .for the linearization by input-output injection, IEEE Trans. Automat. Control, 41, pp. 598-603, 1996. [7] H. J.C. Huijberts, T. Lilge and H. Nijmeijer. A control perspective on synchronization and the Takens-Aeyels-Sauer Reconstruction Theorem, to appear in Phys. Rev. E, 1999. [8] H. J. C. Hnijberts, T. Lilge and H. Nijmeijer. Synchronization and observers for nonlinear discrete time systems, submitted to European Control Conference 1999. [9] H. J. C. Huijberts, H. Nijmeijer, and A.Yu. Pogromsky. Discrete-time observers and synchronization, in G. Chen (Ed.), Controlling chaos and bifurcations in engineering systems, CRC Press, Boca Raton, Florida, 1999.
[10] A. J. Krener and A. Isidori. Linearization by output injection and nonlinear observers, Syst. Control Lett., 3, pp. 47-52, 1983.
92
5. On Existence of Extended Observers for Nonlinear Discrete-Time Systems
[11] A. J. Krener and W. Respondek. Nonlinear observers with linearizable error dynamics, SIAM J. Control Optimiz., 23, pp. 197-216, 1985. [12] T. Lilge. On observer design for nonlinear discrete-time systems, Eur. J. Control, 4, pp. 306-319, 1998.
[13]
W. Lin and C.I. Byrnes. Remarks on linearization of discrete-time autonomous systems and nonlinear observer design, Syst. Control Lett., 25, pp. 31-40, 1995.
[14] M. Spivak. A comprehensive introduction to differential geometry, Volume I, Publish or Perish, Houston, 1979. [15] X. Xia and W. Gao. Nonlinear observer design by observer canonical forms, Int. J. Control, 47, pp. 1081-1100, 1988.
Stability Analysis and Observer Design for Nonlinear Diffusion Processes Winfried Lohmiller and Jean-Jacques E. Slotine
Nonlinear Systems Laboratory Massachusetts Institute of Technology Cambridge, Massachusetts, 02139, USA
1
Introduction
The stability of a nonlinear reaction-diffusion process and its convergence rate can be determined very simply. This allows in turn the design of simple observers for such processes. The technique is based on an extension of the recently developed tools of contraction theory. Reaction-diffusion processes are pervasive in physics. Mathematical properties such as existence and smoothness of solutions are well understood for many such processes (Evans [3]). This paper shows that analyzing global stability - are initial conditions or temporary disturbances eventually "forgotten," and if so, how fast? - and determining convergence rates given boundary conditions is very simple for such processes, and shows how this result applies naturally to the design of observers. This is achieved by taking advantage of recent results on stability theory, referred to as contraction analysis (Lohmiller and Slotine [8]), and extending them to partial differential equations describing nonlinear reaction-diffusion processes. After a brief review of contraction analysis in Section 2, Section 3 analyzes the contraction properties of the Laplace operator with general boundary conditions. The result is applied to reaction-diffusion equations of the form
c9_~r = h(t) V2r + g(t) V r + f ( r Ot
(6.1)
whose stability and convergence rates are explicitly quantified, and then to exponentially convergent observer designs for these systems. Numerical aspects are discussed in Section 4, and extensions to other classes of distributed systems in Section 5.
94
6. Stability Analysis and Observer Design for Nonlinear Diffusion Processes
2
Contraction Analysis
Differential approximation is the basis of all linearized stability analysis. W h a t is new in contraction analysis is that differential stability analysis can be made exact, and in turn yield global results on the nonlinear system. We first summarize some basic results of (Lohmiller and Slotine [8]), to which the reader is referred for more details, and then discuss how these results apply naturally to the design of nonlinear observers.
2.1
Basic Tools
We consider general deterministic systems of the form ~=f(O,t)
(6.2)
where 9 is the n • 1 state vector and f is an n • 1 nonlinear vector field, All quantities are assumed to be real and smooth, so that we can write the exact differential relation 0f 5 6 = ~--~(O, t) 5 0
(6.3)
where $0 is a virtual displacement - recall that a virtual displacement is an infinitesimal displacement at .fixed time. Note t h a t virtual displacements, pervasive in physics and in the calculus of variations, are also well-defined m a t h e m a t i c a l objects (Arnold [1] and Schwartz [10]). Consider now two neighboring trajectories in the flow field ~ = f(O, t), and the virtual displacement 50 between them (1). The squared distance between these two trajectories can be defined as ~ 0 T 6 0 , leading from (6.3) to the rate of change
virtual displacement 8 ~
velocity 8
two neighboring traiectories
~K
FIGURE 1. Virtual dynamics of two neighboring trajectories.
6. Stability Analysis and Observer Design for Nonlinear Diffusion Processes d ( ~ ~ T (~~ ) = 2 5 ff2T h Oo = 2 5r T ~--~f~ hrb < 2 /~max ~ O T ~ o
95
(6.4)
where Am~x(q~,t) denotes the largest eigenvalue of the symmetric part of of i.e., the largest eigenvalue of 7(b-~ o of ~ + 1 of T~j. Assume now the Jacobian b-~, that Am~x(q),t) is uniformly strictly negative (i.e., 3 /3 > 0, VO, Vt _> 0, Amax(gP,t) < --Z < 0. ). Then, from (6.4) any infinitesimal length [[5~][ converges exponentially to zero. By path integration, this immediately implies that the length of any finite path converges exponentially to zero. Thus, as in stable linear time-invariant systems, the initial conditions are exponentially "forgotten." We can state the following definition and basic result (Lohmiller and Slotine [8]) T h e o r e m 6.1 The system (~ = f(qh, t) is said to be contracting if -of ~ is uni.formly negative definite. All system trajectories then converge exponentially to a single trajectory, with convergence rate IAm~l, where )~max i8 Of the largest eigenvalue of the symmetric part of -5"~" The system is called semi-contracting if b-~ of is only negative semi-definite, and indifferent if of is skew-symmetric. More precise local versions of the above theorem can also be derived. In addition, departing further from classical Krasovskii-like results for autonomous systems (Krasosvkii [6], Hahn [5] and [7]), the approach can be vastly extended by allowing for a prior differential coordinate transformation, leading to a a necessary and sufficient condition for global exponential convergence (Lohmiller and Slotine [8]). Specifically, the line vector 5~ between two neighboring trajectories in Figure 1 can also be expressed using the differential coordinate transformation = o
(6.5)
where O(q~, t) is a square matrix. This leads to a generalization of our earlier definition of squared length 6~JT 6tII = 5 0 T M
5~
(6.6)
where M(O, t) = o T o represents a symmetric and continuously differentiable metric - formally, equation (6.6) defines a Riemann space (Lovelock and Rund [9]). Since (6.5) is in general not integrable, we cannot expect to find explicit new coordinates ~(O,t), but 6~ and 5 ~ T 5 ~ can always be defined. We require M to be uniformly positive definite, so that exponential convergence of 50 to 0 also implies exponential convergence of 5q~ to 0. Distance between two points P1 and P2 with respect to the metric M is defined as the shortest path length (i.e., the smallest path integral f 2 ~ [15~l] ) between these two points. Accordingly, a ball of center c and radius R is defined as the set of all points whose distance to e with respect
96
6. Stability Analysis and Observer Design for Nonlinear Diffusion Processes
to M is strictly less than R. Computing I
d 5~ = F S ~ dt
or alternatively ~-~
where
F---(E)+o0f)o
0,)
\ ~ - ~ M + 1~I + M~-~
-1
(6.7)
5(I)
(6.8)
we can state the following definition and main result (Lohmiller and Slotine
[8]) D e f i n i t i o n 6.1 Given the system equations ~ = f(O,t), a region of the state space is called a contraction region with respect to a uniformly positive de.finite metric M(O, t) = OTO i f F in (6.7) or equivalently ~O 0 T M ~- - - M Oof 4cP -1(r is uniformly negative de.finite in that region. Regions where F is negative semi-definite are called semi-contracting, and regions where F is skew-symmetric are called indifferent. T h e o r e m 6.2 Given the system equations ~ = f(O,t), any trajectory, which starts in a ball of constant radius with respect to the metric M(O,t), centered at a given trajectory and contained at all times in a contraction region with respect to M(O,t), remains in that ball and converges exponentially to this trajectory. Furthermore global exponential convergence to the given trajectory is guaranteed if the whole state space is a contraction region with respect to the metric M(O,t). It can be shown that the existence of a uniformly positive definite metric with respect to which the whole state space is a contraction region is actually a necessary condition for global exponential convergence. In the linear time-invariant case, a system is globally contracting if and only if it is strictly stable, with F simply being a normal Jordan form of the system and O the coordinate transformation to that form. Note that the metric is unchanged by an additional (perhaps time-varying or state-dependent) orthonormal transformation, i.e., by left-multiplying O by an orthonormal matrix. 2.2
Nonlinear Observer Design using Contraction
Theory
By using a differential approach, contraction theory in a sense treats convergence analysis and limit behavior separately. Guaranteeing contraction means that after exponential transients the system's behavior will be independent of the initial conditions. In a control context, once contraction is guaranteed through feedback, specifying the final behavior reduces to the problem of shaping one particular solution, i.e. specifying an adequate
6. Stability Analysis and Observer Design for Nonlinear Diffusion Processes
97
open-loop control input to be added to the feedback terms, a necessary step of any control method. In an nonlinear observer context, contraction theory is a rather natural tool, since once the observer contraction behavior has been shown and quantified, one needs only verify t h a t the observer equations contain the actual plant state as a particular solution to automatically guarantee convergence to t h a t state. If y((I), t) is the available m e a s u r e m e n t vector, and q) the estimated state, with ~, = y(q), t), this m a y be achieved by simply copying the system dynamics (identity observer) and adding to the right-hand side a term of the form k ( p , t) - k ( y , t) , where the vector field k is selected to guarantee or enhance the contraction behavior of the observer. W h e n the actual system is itself contracting, as will be the case for most of the nonlinear diffusion processes considered in this paper, k needs only to be selected to enhance (speed up) the natural contraction behavior of the system.
2.3
Weakly Contracting Systems
In this section we derive an exponential convergence condition for classes of semi-contracting systems. The idea is simple: whereas in Section 2 we have used only first time-derivatives of a virtual displacement to characterize a flow field, we now perform a complete Taylor series expansion to analyze a semi-contracting virtual dynamics. This section m a y be skipped in a first reading, as it will only be needed for an extension in Section 5. Consider a semi-contracting, analytic virtual dynamics in 5 ~ d The corresponding virtual length dynamics is
d (e~Te~)=--2er dt T
with positive semi-definite F~ = - ~1 ( F + F T ) . Factorizing F s as x / ~ v ~ , say with a Cholesky factorization, allows one to compute the time-derivatives of (~tI/T~ / as
o
T
dt 2 d3 =
+
2 L l v ~ T nlv/-Fs + n o v ~ sT n2x/F~)~k0
98
6. Stability Analysis and Observer Design for Nonlinear Diffusion Processes
with the Lie derivatives L J v ~ ( x , L~
=
t) (Lovelock and R u n d [9])
~;~
-- L ~ v ~ F + ~ L J v ~
vj>_o
dt
As a result 6k~T6k~(t A- T ) along a trajectory x(t) can be written as the Taylor series expansion
(
6q2T6q~(t + T )
<
TT 2
~-.3 2T
-~.4 3T
T~
3~_._~ 4
4~ 5
. 9 9"
) LOv/-~8
LIv/-Fs
~
6ff2T6ff2
_ ~ ~RLOv,-f2 L1,/-f~ ...)
L~,/-f~
~
where all the terms on the right hand side are c o m p u t e d at time t. Note t h a t for a given constant T > 0 the interior matrix above can be shown by complete induction to be uniformly positive definite, which implies a uniformly positive ~. As a result we can conclude on exponential convergence of 116q211 to zero for uniformly positive definite C T C with
C =
( L~ ) L1 v/-Fs
(6.9)
Contrary to linear time-invariant systems the rank of C can be increased with additional Lie derivatives L i v e 8 . However, once C T C is uniformly positive definite for some finite number of Lie derivatives the following ones do not influence the definiteness of c T c anymore. Consider similarly the corresponding semi-contracting metric dynamics
d (5~TMS~) dt
=
_25~Tose;
(6.1o)
with positive semi-definite O = 0~ of T M 0f + 1VI. Factorizing O as - ' - + M~-~ v/-OTv/'O, say with a Cholesky factorization, and a similar a r g u m e n t a t i o n to the previous discussion allows to conclude on exponential convergence
6. Stability Analysis and Observer Design for Nonlinear Diffusion Processes of 115(~I[M : x/5OTMSO to zero for uniformly positive definite c T c
99
with
( L~ ) ---
Llv~
(6.11)
and the Lie derivatives LJv/'O(O, t)
Love6 = LJ+lv/o
--
of L J v / O ~-~ +
LJx/~
Vj > 0
This leads to the following definition and theorem D e f i n i t i o n 6.2 Given the globally analytic system equations ~ = f(O, t), a semi-contraction region of the state space is called a weak-contraction region with respect to the metric M(O,t), /.f c T c in (6.9) or c T ~ in (6.11) is uni.fozTnly positive definite in that region. T h e o r e m 6.3 Given the system equations (~ = f(O,t), any trajectory, which starts in a ball o.f constant radius with respect to the metric g ( o , t ) , centered at a given trajectory and contained at all times in a weak contraction region with respect to M ( ~ , t ) , remains in that ball and converges exponentially to this trajectory. Furthermore global exponential convergence to the given trajectory is guaranteed i.f the whole state space is a contraction region with respect to the metric M(O,t).
3
Nonlinear Diffusion Equations
We now extend and apply the above results to partial differential equations describing nonlinear reaction-diffusion processes. We shall use continuous state vectors 9 in Cartesian coordinates x, defined on a bounded m-dimensional region V. Bold characters will denote the continuous statespace quantities corresponding to pointwise terms. For instance, ~ will denote the state-space vector of components ~ t , and VkO the state-space vector of components Vkr Formally, 9 now lives on the Hilbert space L 2 (V), and differential length is defined by f v 5r so that the derivation of Section 2.1 extends immediately. F~rthermore, continuous state-space quantities can be computed as the limits of regularly discretized versions, as the discretization step tends to zero. For instance, on a one-dimensional continuum of length 1 5r
=
lira n---*+oo~ "
5r 5r i=1
100
6. Stability Analysis and Observer Design for Nonlinear Diffusion Processes
where the 5r are the discretized values. This limiting process will be our main c o m p u t a t i o n tool in assessing and quantifying the stability properties of the systems 9 Specifically, we first analyze the contraction properties of the V and V 2 operators. We then analyze and quantify the convergence rate of the nonlinear reaction-diffusion equation (6.1).
3.1
Contraction Properties of Reaction-Diffusion Processes
One-Dimensional V Operator Consider first the one-dimensional V operator on a one-dimensional continuum of length l, with given left and right b o u n d a r y conditions Cz (t) and Cr(t). We can write VO as the limit of an n • n discretization m a t r i x as
VO
n+l 21
lim
=
~k
-
.
-t-
1
0
0
.
-1
0
1
".
".
0
-1
".
"'.
0
0
".
".
0
1
0
-1
0
"' 9
~
o where Cz(t) and r Jacobian
0
'.
/
are the given left and right b o u n d a r y elements 9 T h e \
0VO 0O
-
n + 1 lim n-~+~ 2 l
0
1
0
0
".~
-1
0
1
".
".
0
-1
".
".
0
.
0
1
0
-1
0
0 "'.
9
.
"'.
.
(6.12)
is skew-symmetric, independently of Cz(t) and Cr(t). More generally, it is straightforward to show t h a t any odd derivative VkO, with k an odd positive integer, is skew-symmetric. One can easily show t h a t the result is unchanged if Vr (t) or V e t (t) are given instead 9
6. Stability Analysis and Observer Design for Nonlinear Diffusion Processes
101
One-Dimensional Laplace Operator Consider now the one-dimensional Laplace operator on the continuum, with given Ct(t) and r We can write [
(n + 1)2 V2~
=
lim
12
-2
1
0
0
.
1
-2
1
'.
.
0
1
'.
".
0
0
.
.
-2
1
.
0
1
-2
1
0
0
'.
1
-2
1
.
.
0
1
"'.
"'.
0
0
'.
'.
-2
1
"'.
"'.
0
1
0 0 +
4,+
0 0
The corresponding Jacobian is (-2 0V2O 00
-
lim (n+l) 2 ~-~+o~ l2
(6.13)
-2 )
whose largest eigenvalue is shown in the Appendix A to be upper bounded by - ~ . Thus, the one-dimensional Laplace operator ~720 with given boundary elements el(t) and r is contracting. Consider instead the Laplace operator with given boundary elements Cz(t) and V e t ( t ) along a continuum of length 1. By adding a mirror image of the system to the right we get a continuum of length 21 and given boundary elements Cz(t) and 7r 2 r Thus the largest eigenvalue of the Jacobian is now -2-~" If instead, Vr and v e t ( t ) are given on the left and right boundary, then in (6.13) the upper left and lower right corners become - 1 . A similar derivation in the Appendix A shows that the resulting matrix is only negative semidefinite, i.e. the resulting Laplacian is only semi-contracting. Note that this can be expected physically, since the system might simply converge to a specific 9 with a constant error over the continuum.
Multi-dimensional Laplace Operator Consider now an m-dimensional continuum, and the Laplacian
V2O =
Ox~ i=1
with boundary condition Cb(t), where the x~ are orthonormal Cartesian coordinates. Discretizing the region along any coordinate axis xi, similarly
102
6. Stability Analysis and Observer Design for Nonlinear Diffusion Processes
to the previous section, the largest eigenvalue of ~~
is at most -~,2~2 ....... ,
where l~,ma~ is the diameter (maximal thickness) of the region in the direction xi. Furthermore, since a discretization along any coordinate axis x~ can be transformed into a discretization along any other coordinate axis using an orthonormal coordinate transformation, the largest eigenvalue of rn the sum V 2 ~ = E i = I c92O is at most 27r2
m
=
-
l?
(6.14)
z,max
i:1
and thus the multi-dimensional Laplace operator V2O with given b o u n d a r y conditions Cb(t) is contracting. If instead VCb(t) is given on the whole boundary, then the Laplacian is only semi-contracting, with ~v~ = 0
(6.15)
E x a m p l e 3.1: Consider the Laplace operator in spherical coordinates 020 2 0~p 1 020 cosu 0~ 1 02~ V2(I, = ~ ++ + - + - r ~r r'~ ~ r 2 sin ~ cO, r 2 sin 2 , cO02 with angles 0, p, and radius r, and assume that Cb(t) is given on the sphere r = ro. The sphere's diameter is 2ro, so that the largest eigenvalue of this 3 ~v2 [] Laplacian is Av~ ------~-~,.
Reaction-Diffusion Equation Collecting the above results and using T h e o r e m 6.2 thus yields Theorem
6.4 Consider the reaction-diffusion equation
0r = h(t)V~r Ot
+ g(t)Vr
+ f(r
(6.16)
with h(t) a continuous and uniformly positive .function of time ( 3ho > O, Vt > O, h(t) >_ ho), and g(t) an arbitrary continuous .function of time. Assume that
h(t)
+
of
is uni.formly negative, where Av~ is given by equation (6.14) or (6.15) depending on the boundary conditions. Then, all system trajectories converge exponentially to a single trajectory ~Pd(x,t), with convergence rate Iho)~v2 + ~ ]. In the autonomous case ( f = f ( r h(t) =_ no, g(t) =_ go) and with constant boundary conditions, the system converges exponentially to a steady-state (I)d(X), which is the unique solution of the generalized Poisson equation 0 = ho V2r
+ go VCd + f(r
6. Stability Analysis and Observer Design for Nonlinear Diffusion Processes
103
T h e above development also implies t h a t all the results on c o n t r a c t i n g systems in (Lohmiller a n d Slotine [8]) can be e x t e n d e d to c o n t r a c t i n g reactiondiffusion processes, with b o u n d a r y conditions acting as additional i n p u t s to the system. For instance, any a u t o n o m o u s c o n t r a c t i n g reaction-diffusion process, w h e n subjected to b o u n d a r y conditions periodic in time, will t e n d exponentially to a periodic solution of the same period. T h e convergence is robust to b o u n d e d or linearly increasing disturbances. (e.g., in the form of a discretization of the c o n t i n u u m for simulation purposes). Also note t h a t the above result can be i m m e d i a t e l y e x t e n d e d to the case where the left-hand side of (6.16) is multiplied by a uniformly positive definite t e r m re(x), by using this t e r m as the metric.
3.2
Observer Design for Nonlinear Diffusion Processes
W h e n the nonlinear diffusion processes are n a t u r a l l y contracting, as described by the above theorem, observer designs b e y o n d merely copying the s y s t e m d y n a m i c s need only be selected to speed up the n a t u r a l c o n t r a c t i o n behavior of the system, as discussed in Section 2.2. In addition, b o u n d e d m e a s u r e m e n t disturbances in a contracting obsever design do not affect the c o n t r a c t i o n behavior, t h o u g h they m a y lead to a b o u n d e d offset in the limit behavior (Lohmiller and Slotine [8]). Finally, numerical implem e n t a t i o n s will discretize the continuum. T h e specific issues linked to this discretization will be discussed in Section 4. Let us now illustrate the design technique on a specific example, which also points out the possible role of b o u n d a r y conditions as a design tool, specific to observers for d i s t r i b u t e d systems. E x a m p l e 3.2: Consider a wafer disk of radius ro (Figure 2) subjected to continuous external light source, similarly e.g. to (Cho and Gyugyi [2]). The dynamic equations in radial coordinates 0 and r are (Groeber et al. [4])
= hV2T _ f (T 4 -- To4) with T > 0 the wafer temperature, To = 500 + 200 sin ~ K the external temperature, h = 1 m2/s a heat transfer constant, f = 10-SK-3s -1 a radiation constant, ro 20 cm, and boundary conditions ( OT )to = 0. According to Theorem 6.4, the system is naturally contracting with convergence rate 4 f T a. This means that an open-loop identity observer =
§
h 2r I
guarantees exponential convergence to T at the rate 4 f T a. The corresponding plant and observer response with initial conditions T(r,t = 0) = 500 + 200 cos 2,~_.._r.~K and T(r, t = 0) = 700 K are illustrated in Figure 3. For compurO tational simplicity the simulation exploits rotational symmetry in 0. Assume now that the actual temperature T(ro, t) is measfired at the boundary r = ro of the disk. Using this measurement as a boundary condition T(ro, t) increases
104
6. Stability Analysis and Observer Design for Nonlinear Diffusion Processes external racllatlon
//// rJ
X2
\\\\ external radiation FIGURE 2. Thermal processing of a wafer. the convergence rate to 4 f T 3 + h , according to Theorem 6.4, while at the same time preserving consistency with the actual plant (i.e., keeping the actual T as a particular solution of the observer equations with their boundary condition). The corresponding observer response with the same initial conditions as above is shown in Figure 4. []
4
Spatial Discretization and Numerical Implementation
This section briefly discusses some of the implications of t h e previous results for the numerical simulation of partial differential equations, which is of particular relevance in the context of observer i m p l e m e n t a t i o n . Let us spatially discretize equation (6.16) by a p p r o x i m a t i n g the c o n t i n u o u s s t a t e vector ~ with ~ ( O , x ) , where 9 is a finite-dimensional s t a t e vector. This discretization leads to an error e in e q u a t i o n (6.16) = h V 2 ~ + f ( ~ , x, t) + e
(6.17)
We can minimize e by requiring a00~ T e z 0 resulting in
M ~ = O---~-~T with M
--
(hv2
+ r)
ocTow Taking the variation of (6.17) a n d 6(-~Te) = 0 lead to 00 0r
1 d (SOTM~O)
TC~ff2To(hV2q2 + f) 0 ~
02/I/T
6. Stability Analysis and Observer Design for Nonlinear Diffusion Processes . . . -
Boo .... 'i .... i4oo looo
:
:
i
0
,
......
..-i ....... , ......
2oo~ 40
.
time t
0
0
105
'!"""
'--
....
i
'":
...,.
"'.
20
radius r
FIGURE 3. Wafer temperature as a function of time. This dynamics can be simplified using the largest eigenvalue Amax of 0q2 T O(hV2~+f/ (9~IJ with respect to M , and the m a x i m a l principal curvature 0,P o~ 0~
0"~,~T e - ~ M I = 0 I~lma= of qJ(O,x), obtained from det ]~-~~ d ( 6 o T M 6 ~ ) <_ (/~rnax Jr-Ile]] ]~],,~=)5'~TMSO a~ If we assume ~5 to be a minimal realization of ~, i.e. require M to be uniformly positive definite, then exponential convergence of 5r to zero implies exponential convergence of 6r to zero. T h e contraction behavior is hence unchanged in regions of small ]le]] ]~]max- This shows t h a t we have to approximate regions with large I~]ma= more precisely in order to preserve contraction.
5
Further Extensions
Besides actual reaction-diffusion or heat transfer processes, m a n y other distributed physical processes can be written as reaction-diffusion equations using a nonlinear coordinate transformation (Evans [3]). This is e.g. the case of the Burgers equation, used to model turbulent flows and shock waves, by applying the so-called Hopf-Cole transformation. Reaction-diffusion equations are also used as computing paradigms in other fields, such as machine vision. In this section, we briefly discuss some further extensions of the analysis, from which exponentially convergent observers could be similarly designed.
106
6. Stability Analysis and Observer Design for Nonlinear Diffusion Processes
9 ,oo
9 , .
=.J.
..-
b ,
- . ,
~
I--
4OO
. - o
,o~
~
,
, ,
, ,
o , ,
o ~ ~ I
, .=.~
9
.--
o*-"
9 . " ' ' ' i
, , ~ 1 7 6 1 7 6 ~
~176
.
. ..
,
.
"~ 200 E 0 3
20 time t
0
0
radius r
FIGURE 4. Estimation error of identity observer.
Nonlinear Functions of the Laplace Operator Consider now the system
0 r = h(V2r x, t) Ot with given boundary conditions Cb(t), where h is continuous function of its argument, differentiable and uniformly strictly monotonically increasing with respect to its first argument V2r The corresponding flmction h(V2qb, x, t) is a continuous state-space vector of elements h(V2r x, t), whose Jacobian is Oh Oh 0V2~ 0~ 0V2~ 0~ Oh Using the symmetric where ~ 0 h is a diagonal matrix of elements 0-'W~" positive definite Jacobian - ~ dynamics
as the metric leads to the virtual length
d -~
O~b
OVe~b O~b
The increase condition on h immediately implies that the system is contracting.
Wave Equation with Nonlinear Damping Consider now the wave equation re(x) 02r Ot2
V2r -
g(0__~r x , t ) - k(x)r (]b
6. Stability Analysis and Observer Design for Nonlinear Diffusion Processes
107
~ ,--
oo~"
,
'o
o~
.,.. ..,.
400
9
....
I-2.,
" ' ' ' ~ "
'
i i
'~ 200 ......
E
'
0 3 20
2
10 time t
0
0
radius r
FIGURE 5. Estimation error with temperature measurement at to. where all functions are assumed to be smooth, m and k are uniformly positive definite functions of x. and g is uniformly monotonically increasing in -ST" 0r The following virtual "energy" dynamics d
6(i)Tdiag(m ) 66 + 6(I}w diag(k)
60
2
(6.1s) guarantees semi-contraction behavior and hence bounded 5ep a n d 56. This implies t h a t any time-derivative of (6.18) is bounded since it can always be expressed as functions of 60 and 6q?. B a r b a l a t ' s l e m m a then implies a s y m p t o t i c convergence of 5r and 6(~ to zero. Veryfing the condition in T h e o r e m 6.3 even shows weak-contraction behavior i.e. t h a t the system is globally exponentially convergent.
Chains of Contracting Processes Chains of contracting processes are themselves contracting. More precisely, consider a smooth virtual dynamics of the form
dt
( ) (0. 502
=
o~1
of~ o~2
5~2
A uniformly negative definite os 04,t implies exponential convergence of 5~1 to zero. In turn, assuming t h a t ~OOl is bounded, o_f2_f 0(I'1 5Ol represents an exponentially decaying disturbance in the second equation, so t h a t a uniformly negative definite ~04~ implies exponential convergence of 5~2 to zero. T h e result extends by recursion to higher-order chains.
108
6. Stability Analysis and Observer Design for Nonlinear Diffusion Processes
E x a m p l e 5.1 : Consider the nuclear chain reaction U 9 223 s -t- n I
N
---*
U, 239 92 "~- "~
U9239 2
----+
~r 239 Ivp93 -~- e -
239 P93
"--*
n 239 Jr- C /-'U94
whose dynamics is, with reaction constants ki, diffusion constants hi, and bounded uniformly positive neutron input n~(t) _ U9238 2 U9239 2
N. 239
h
h
P93
~-2 r r23S I 1V t)92 -- /r
h 2 V~ 2 T ~029329
1TT238
, k 1~T238 ~. r r 2 3 9 -~- I n 0 u 9 2 -- n;2r
~-v2 ~ r 2 3 9 3V IYP93 t
~ rr239 ~g2u92
~ ~ 239 -- E31vP93
Since the output of a contracting process is bounded for bounded input, using the above reasoning recursively shows that the chain reaction is globally exponentially convergent. []
Nonlinear Schroedinger Equation Finally, the nonlinear Schroedinger equation
ih~t
= -h2v2r
+ V(r
can also be studied with only minor modifications, where h is the reduced Planck constant and m the mass. Defining squared length as 5q~T~ * (where 9 indicates complex conjugation), using h 2 0V2dP ihz
-
0V
2m
leads to (assuming that V(r x, t) is holomorphic in r
d'-~d((~T (~*) = (~T (~* ~- ~c~T(~* = -h2~r Im
(0V)_~_ 50*
In the linear case, the system is indifferent, a well-known result.
Acknowledgement The authors are grateful to Christophe Bernard for his help in computing the contraction rates in Appendix A.
6. Stability Analysis and Observer Design for Nonlinear Diffusion Processes 6
109
REFERENCES [1] V. I. Arnold. Mathematical Methods of Classical Mechanics, Springer Verlag, 1978. [2] Y. M. Cho and P. Gyugyi. Control of Rapid Thermal Processing: A System Theoretic Approach, LE.E.E. Transactions on control systems technology, 5(6), 1997. [3] L. C. Evans. Partial Differential Equations, American Mathematical Society, 1998. [4] H. Groeber, S. Erk and U. Grigull. Die Grundgesetze der Waermeuebertragung, Springer Verlag, 1988. [5] W. Hahn. Stability of motion, Springer Verlag, 1967. [6] N. N. Krasovskii. Problems o.f the Theory o.f Stability o.f Motion, Mir, Moskow, 1959. English translation by Stanford University Press, 1963. [7] P. Hartmann. Ordinary differential equations, second ed., Birkhauser, 1982. [8] W. Lohmiller and J. J. E. Slotine. On Contraction Analysis for Nonlinear Systems, Automatica, 3~(6), 1998. [9] D. Lovelockand H. Rund. Tensors, differential .forths, and variational principles, Dover, 1989.
[10] L. Schwartz. Analyse, Hermann, Paris, 1993.
Appendix A: Computation of Contraction Rates Let us compute an upper bound on the largest eigenvalue A of the n x n matrix
(n + 1) 2 12
2
-1
0
0
".
-1
2
-1
".
".
0
-1
.
.
0
0
".
".
2
-1
9.
".
0
-1
2
(6.19)
110
6. Stability Analysis and Observer Design for Nonlinear Diffusion Processes
as n --+ +cx~. Since ,k is the largest eigenvalue of (6.19), the n x n m a t r i x 2 - 2rl
-1
0
0
.
-1
2-2~/
-1
".
".
0
-I
"'.
"'.
0
0
.
.
2 - 27/
-1
".
"'.
0
-1
2-
(6.20)
2rl
l2 is positive semi-definite, where r/--- - 2-CKJTp" ~" Now the principal m i n o r s Ak of (6.20), which m u s t all be positive or zero, can be c o m p u t e d recursively
Ak = ( 2 - - 2r/)Ak_l -- A k - 2 with A 1 = 2 - 2 7 and A2 = (2 - 2r/): - 1. Noticing t h a t A1 _> 0 implies r/_< 1, this yields (1 . r / + V / (. 1
2V/(1 _ . ) 2
Thus, letting c o s a = l - ~ / one can write Ak =
( 1 - .- r/
1) k +.l
7) 2. -
Ak=
and s i n a =
V/(I -- ~/) 2
1) k+l
_ 1
X/I-(1-7/)
( c o s a + i s i n a ) k+l - ( c o s a - i s i n a ) k+l 2i sin a
2,with
0
~,
sin(k + 1 ) a sin
Thus, Ak _> 0 implies t h a t a < ~+1 , for any k > 1. This in t u r n implies that 7r 2 1-~?=cosa >1 -a2> 1(k + 1) 2 71.2
a n d thus t h a t 7/ _< ~W4-/V" As n --* + o o , the largest eigenvalue of (6.19) thus verifies AV2 = - lim 2 ( n + 1 ) 2 27r2 n--++oo l2 71 -< 12 In the simpler case of b o u n d a r y conditions in V@(t) a n d V e t ( t ) , the corresponding n • n m a t r i x is -1
1
0
0
.
1
-2
1
".
".
0
1
'.
'.
0
0
"'.
"'.
-2
1
0
1
-1
(n + 1) 2 12 .
.
".
".
6. Stability Analysis and Observer Design for Nonlinear Diffusion Processes To show t h a t this m a t r i x is n e g a t i v e s e m i - d e f i n i t e as n ~ to show t h a t 1
-1
0
0
.
-1
2
-1
".
".
0
-1
'.
'.
0
0
.
.
2
-1
~, ' .
'.
0
-1
1
111
+ o c it suffices
(6.21)
is p o s i t i v e semi-definite. C o m p u t i n g b y i n d u c t i o n t h e p r i n c i p a l m i n o r s A k of t h e a b o v e m a t r i x leads to A k = 1 for 1 < k < n - 1, a n d A n = 0, h e n c e t h e result. N o t e t h a t , a l t e r n a t i v e l y to this e x p l i c i t d e r i v a t i o n , s t a n d a r d r e s u l t s on eigenvalues of t h e L a p l a c i a n o p e r a t o r m a y also b e used, since t h e a s s o c i a t e d J a c o b i a n m a t r i x is itself s y m m e t r i c .
Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering Jann Peter Strand 1 and Thor I. Fossen 2 1ABB Industri AS Oslo, Norway 2Department of Engineering Cybernetics Norwegian University of Science and Technology Trondheim, Norway
1
Introduction
Dynamic positioning (DP) systems have been commercially available for marine vessels since the 1960's. DP systems are often installed on advanced ships like cable-layers, ice-breakers semi-submersible rigs and offshore supply vessels, for maintaining the horizontal position and orientation by use of the thrusters. See the recent paper by Sorensen, Sagatun and Fossen [15] for an overview of DP systems and references to earlier work. More recently positioning mooring (PM) systems has been developed. This is a control system for thruster assistance of moored structures. Such systems are important for safe operation of floating oil production, storage and offloading vessels (FPSO's) and semi-submersible rigs, which are moored to the seabed by an anchor system. Modeling and control system design of turret-moored ships is treated in Strand, Sorensen and Fossen [16]. Filtering and state estimation are important features of both DP and PM systems. In most cases, accurate measurements of the vessel velocities are not available. Hence, estimates of the velocities must be computed from noisy position and heading measurements through a state observer. Unfortunately, the position and heading measurements are corrupted with colored noise mainly caused by wind, waves and ocean currents. However, only the slowly-varying disturbances should be counteracted by the propulsion system whereas the oscillatory motion due to the waves (1st-order waveinduced disturbances) should not enter the feedback loop. This is done by using so-called wave filtering techniques, which separates the position and heading measurements into a low-frequency (LF) and a wave frequency (WF) position and heading part. The traditional Kalman filter-based estimators are linearized about a set of pre-defined constant yaw angles, typically 36 operating points in steps of 10 degrees, to cover the whole heading envelope between 0 and 360 degrees. When this estimator is used
114 7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering x
Y V (,!
x~.
X li ~' Y~:'
(Sltt i" ( y a w )
FIGURE 1. Reference frames (left) and definition of surge, sway and yaw modes of motion (right).
in conjunction with a linear (often linear quadratic) controller, there is no guarantee for global stability of the total system. In addition, linearization of the kinematic equations will naturally degrade the performance of the system. A nonlinear observer that can replace the traditional Kalman filter based designs was proposed by Fossen and Strand [6] for free-floating ships. This observer includes wave filtering and bias state estimation and it is proven to be GES, through a passivation design. Compared to the Kalman filter, the number of tuning parameters is significantly reduced and the tuning parameters are coupled more directly to the physics of the system. By using a nonlinear formulation, the software algorithms in a practical implementation are simplified. The observer of Fossen and Strand [6] has been applied by Aarset, Strand and Fossen [1], Strand and Fossen [17] and Loria, Fossen and Panteley [9] in output feedback controller design. Nonlinear observer designs for mechanical systems have been discussed by many. For instance, sliding observer designs for nonlinear systems is discussed by Slotine, Hendrick and Misawa [14]. Passivity and observer designs for mechanical systems are treated in Ortega, Loria, Nicklasson and Sira [13]. A similar problem to the ship positioning control is o u t p u t feedback control of robot manipulators, see e.g. Berghuis and Nijmeijer [2], Canudas de Wit, Fixot and Astrom [3], Nicosia and Womei [11], and Nicosia, Tornambe and Valigi [12]. In this paper nonlinear and adaptive observers for DP and P M systems are proposed. These are extensions of the nonlinear observer of Fossen and Strand [6]. The observers are proven to be passive and GES. An observer with adaptive wave filtering is derived, in order to adjust to slowly-varying sea states. The observers have been implemented and tested on a model ship and the results are reported. More details regarding nonlinear observer and controller designs for ships and experimental results can be found in Strand [18].
7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering
115
2 Modeling 2.1
Kinematics
Let the position (x, y) and the orientation (heading) ~ of the ship in the horizontal plane relative to an earth-fixed (EF) XEYEZE-frame be represented by the vector ~ = Ix, y, r An X Y Z - f r a m e is fixed to the vessel body, with the origin often located at the centre of gravity (CG), see Fig. 1. Body-fixed velocities are represented by the vector u = [u, v, r] T where u is the alongship velocity (surge), v is the athwartship velocity (sway) and r is the rotational velocity (yaw). A third frame, the so-called vessel parallel (VP) frame, specifies the desired position (Xd, Yd) and heading Cd of the ship relative to the XEYEZE-frame, and is represented by the vector ~d = [Xd, Yd, r T. In the design of tracking control systems, the reference vector is a smooth time varying signal ~d G C r, where the corresponding reference velocities in the body-fixed frame is Ud = [Ud, Vd, rd] T. T h e reference trajectories are generated by a separate reference generator. T h e linear velocities of the ship and the reference model in the body-fixed and in earth-fixed frames are related by the transformations:
i? = J(r
ild ---- J(r
(7.1)
where the rotation matrix in yaw J(a) : !t~ - - , ~}~3x3 is defined as:
J(a) =
cosa sin a 0
-sina cos a 0
0 ] 0 . 1
(7.2)
Note that J - l ( a ) = JT(a).
2.2
Vessel D y n a m i c s
In the m a t h e m a t i c a l modeling of ship dynamics, it is common to separate the model into a LF model and W F model. The W F motion of the ship is due tolst-order wave loads. The nonlinear LF equation of motion is driven by 2nd-order mean and slowly-varying wave, current and wind loads as well as thrust forces. In the case of moored ships, the restoring forces from the mooring system are treated as a function of the LF position and velocity of the ship. The total motion of the ship is given as the sum of the LF and the W F contributions, see Fig. 2.
Nonlinear Low-Frequency Model Both D P and P M systems are typical low-speed applications. In D P operations the ship will either follow a pre-defined track or maintain a fixed position and heading in the horizontal plane. In tracking operations the
116 7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering Low-frequency (LF) and wave-frequency (WF) motion
WF motion 0
o
~o
1(30
150
time
FIGURE 2. The total motion of a ship is modeled as a LF response with the WF response added as an output disturbance. speed of the reference velocities are small. In P M systems the mooring syst e m will naturally strongly limit the speed. W i t h low speed we m e a n speed in the range of 0 - 3 m/s. Under the assumption of low vessel speed, the following LF ship model is proposed (Fossen [5]):
Mi, + Du + JT(r
= Tthr + JT(~b)b.
(7.3)
Here Tthr C !}~a is a control vector of forces and m o m e n t provided by the propulsion system (thrusters). M C !t~ax3 is the inertia matrix including hydrodynamic added inertia, D E ~3x3 is a linear damping matrix, G E !t~3x3 is a stiffness matrix, due to a mooring system and b is a bias term, accounting for unmodelled external forces and moment. A symmetrical, spread mooring system is assumed, where we for simplicity have placed the earth-fixed frame in the natural equilibrium point of the mooring system. Further, we assume that G is a constant, diagonal matrix. For more details on mooring systems, see Faltinsen [4], Strand et al. [16] and Triantafyllou
[19]. Based on the low speed assumption, the following statements are made: P 1 M is constant and positive definite (Newman [10]):
M=MT>O
and 21}/=0.
(7.4)
P 2 The linear damping matrix D is strictly positive: D>0.
(7.5)
7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering
117
Linear Wave Frequency Model The W F motions are mainly generated by the 1st-order wave forces acting on the ship. Based on linear approximations of existing wave s p e c t r u m descriptions, see Fossen [5] for details, a linear W F model can be formulated as:
~
= A ~ + Ewww
(7.6a)
= C~,~
(7.6b)
where p is the order of the W F model, ~ E !Rap, w~ C ~3 is a zero-mean bounded disturbance vector and Aw, C~ and Ew are constant matrices of appropriate dimensions. The components of the W F motion is represented by the vector r/w = [xw,yw, ~bw]T. In one degree of freedom, this can for example be a 2nd-order d a m p e d oscillator (p = 2):
~{~} (s) = s2 + 2r
e~is
2 w{~i} + Woi
(i = 1, 2, 3)
(7.7)
where (.){,:} denotes the i-th vector element. Here (i is the relative d a m p i n g ratio and Woi is the natural frequency, which is related to the dominating wave frequency of the incoming waves. From a practical point of view, these are slowly varying quantities, depending on the sea state. Typically, the periods of the dominating waves are in the range of 5 to 20 seconds in the N o r t h Sea. In the case of a 2nd-order W F model the matrices in (7.6a)--(7.6b) are:
,]
-2af~
'C~:
[ 0 I ], E ~ =
[0] E~,2
where A = diag {~1, @, ~a}, = diag {Wol, wo~, Woa} , Evo2 = diag {ewl, ew2, ew3} .
(7.9) (7.10) (7.11)
Bias Modeling A frequently used bias model for marine control applications is:
= - T b l b + EbWb
(7.12)
where b E .~3, Wb C {R3 is a zero-mean bounded disturbance vector, Tb C ~axa is a diagonal m a t r i x of bias time constants and Eb a diagonal m a t r i x scaling the amplitude of Wb. T h e bias model accounts for slowly-varying forces and m o m e n t due to 2nd-order wave loads, ocean currents and wind.
118 7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering In addition, a bias model wilt account for errors in modeling of the constant mooring loads, actuator thrust losses and other unmodelled slowly-varying dynamics. Measurements
For conventional ships usually only position and heading measurements are available to the positioning system, whereas accurate velocity measurements are not available. Hence the measurement equation is written: y=v+~
+ vy
(7.13)
which consists of the LF and W F motions and measurement noise Vy c .~3.
2.3
Total Ship Model
When designing the observer, the following assumptions are made in the Lyapunov analysis regarding the ship model: A1
J(r ~ J(~ + r = J(r where Cy A r + r denotes the measured heading. This is a good assumption since the magnitude of the wave-induced yaw motion r will be less than 5 degrees in extreme weather situations and less than 1 degree during normal operation of the ship/rig.
A2
Position and heading sensor noise is omitted, vy = 0, since this noise is negligeable compared to the wave-induced motion.
F r o m Assumptions AI-A2 the total motion of moored and free-floating
ships is represented by the following equations: (7.14a)
= Aw~ + E w w w
(7.14b)
i] = J ( ~ y ) u = -T[-lb + Mi, = -Dr
y = ~+ ~
EbWb
- JT(r
= ~] + Cw~.
(7.14c) + JT(r
+ Tthr
(7.14d) (7.14e)
3 Non-Adaptive Observers Two different non-adaptive observers will be derived in this section. T h e first one is similar to the observer of Fossen and Strand [6] for dynamically positioned (free-floating) ships, where here also the effect of a spread mooring system is taken into account. In the second design, the observer is augmented by a new filtered state of the innovation signals. This adds
7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering
119
more flexibility to the observer design. By using feedback from the highpass filtered innovation in the W F part of the observer there will be no steady-state offsets in the W F estimates. Moreover, by using the low-pass filtered innovation in the bias estimation, these estimates will be less noisy and can thus be used directly as a feedforward t e r m in the control law. The adaptive observer proposed in Section 4 is based on the a u g m e n t e d observer. In the design we use an S P R - L y a p u n o v approach for obtaining passivity and stability of the observers. By including the synthetic wave model in the observer, wave-filtering is obtained, see Definition 1.
Wave .filtering can be defined as the reconstruction of the LF motion components .from noisy measurements o.f position and heading by means of an observer. In addition to this, noise.free estimates o.f the LF velocities should be produced. This is crucial in ship motion control systems since the W F part of the motion should n o t be compensated .for by the positioning system. If the W F part of the motion enters the .feedback loop, this will cause unnecessary tear and wear of the actuations and increase the .fuel consumption. D e f i n i t i o n 7.1 ( W a v e F i l t e r i n g )
3.1
O b s e r v e r in the E F f r a m e
The observer in this section is similar to the observer in [6], except t h a t the effect of a spread mooring system attached to the ship is included.
Observer Equations A nonlinear observer copying the ship-mooring dynamics (7.14a)-(7.14e) is: = A ~ + KI~)
;) = J ( r
+
b = --Tb-1/~ + K3y
M ~ = - D ~ - JT(fv)G~ ) + J T ( f v ) b + Tthr + JT(~y)K4~] 9 =
+
where ~ = y - ~) is the innovation vector and K1 E ~2.p• ~3• are observer gain matrices to be determined later.
(7.15a)
(7.15b) (7.15c) (7.15d) (7.15e) K2 ' K3 ' K4 C
Observer ET~ror Dynamics The estimation errors are defined as ~ = ~ - ~, ~ = ~ - ~), b = b - / ~ and = v - ~. Hence, from (7.14a)-(7.14e) and (7.15a)-(7.15e) the observer
120 7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering error dynamics is: :
A~
- KI~]
: fl(r
+
E~w~
(7.16a)
- K2y
(7.165)
: - T b l b - K3y + Ebwb M ~ : - O f , - JT(r
(7.16c)
+ jT(jy)~ _ jT(~y)K4~
9 = ~ + c~.
(7.16d)
(7.16e)
By defining a new output
50 A K4~I + G@ - b A Co,co
(7.17)
and the vectors
the error dynamics (7.16a)-(7.16d) can be written in compact form as:
M~, = - D e , - Jr(r
(7.19a)
)o = Ao?co + BoJ(~y)i, + EoW
(7.19b)
where
[ Aw-K1Cw Ao =
-I(l
0
-K2Cw
-K2
0
-K3Cw
-K3
--Tb 1
Co=[ K4C Bo =
I 0
, Eo:
4+C 0 0
] ,
-I], 0 Eb
.
Next the requirements on the observer gain matrices for stability and passivity of the observer error dynamics is provided.
Stability and Passivity By rewriting the observer error dynamics as (7.19a)--(7.19b) stability of the observer is provided by a SPR-Lyapunov design. The error dynamics is shown in Figure 3 where two new error terms ez and e , are defined as:
cz
A
-- flT(r
eu A J(r
(7.20)
Thus, the observer system consists of two linear blocks, interconnected through the bounded transformation matrix J(r Based on the physical properties of the ship dynamics, we can make the following statement:
7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering
121
FIGURE 3. Block diagram of the observer error dynamics.
P r o p o s i t i o n 7.1 The mapping Cz H 5 is state strictly passive and the block 7-ll in Fig. 3 is strictly passive. Proof. Let S1 ~--
I DTMD
(7.21)
be a positive definite storage .function. From (Z19a) we have: 1
: - - 2 i T ( D + DT)D + 5TE~
E'Tr >-- $1 + fl5 TE'
(7.22)
(7.23)
where fl ---- 1/~min(D q- D T) > 0 and/kmin(.) denotes the minimum eigenvalue. Thus, (7. 23) proves that cz ~-~ ~ is state strictly passive [8]. Moreover, since this mapping is strictly passive, post-multiplication with the bounded transformation matrix J(r and pre-multiplication by it's transpose will not affect the passivity properties. Hence the block Tll is strictly passive. [] Passivity and stability of the total system will be provided if the observer gain matrices K1, ..., K4 can be chosen such that the mapping c, ~-* So is passive. This is obtained if the matrices Ao, Bo, Co in (7.19a)--(7.19b) satisfies the KYP Lemma which is stated as below: L e m m a 7.1 ( K a l m a n - Y a k u b o v i c h - P o p o v ) Let Z ( s ) = C(81--J4)--1~ be a n x n transfer .function matrix, where .4 is Huvwitz, (.4,/~) is controllable, and (.4, C) is observable. Then, Z(s) is strictly positive real (SPR) if and
122 7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering only if there exist positive de.finite matrices P = p T and Q = Q T such that: "P*A + *ATT~ = _ Q,
13TJ i) = ft.
(7.24)
Proof: See e.g. Khalil [8]. []
Given a set of observer gains K1,...,/s the existence of the system to satisfy the KYP Lemma can be checked numerically by using the Frequency Theorem, originally formulated by Yakubovich [20], explicitly contained in Gelig, Leonov and Yakubovich [7]: T h e o r e m 7.1 ( F r e q u e n c y T h e o r e m )
Consider the system
= *Ax + 13u
(7.25a)
(7.25b)
y = CTx
where x E ,~n, u E ,~m, y E ,~m and,A, 13, C are real matrices of appropriate dimensions. Suppose the pair (*A,13) is stabilizable and det(jwI,~ - *A) ~ 0 Vw E !}~1. There exists a P : 7~T > 0 with
7) +*ATP < 0,
7~13+C = 0
(7.26)
i f the .following conditions
Re
( C T ( j w I n --
lirno w2Re (C T (jwIn -
,A) -1 13) < O, V~ E !}~1
(7.27)
*A)-I 13) < 0
(7.28)
hold. []
If the Frequency Theorem is satisfied for A = Ao, 13 = Bo, CT = - C o , the mapping c, H 4o (block ~2 in Fig. 3) is SPR and the observer error dynamics system is passive and GES as stated in the following: T h e o r e m 7.2 ( P a s s i v e O b s e r v e r ) The nonlinear observer (7.15a)-(7.15d) is passive. P r o o f . Since it is established that Tll is strictly passive and TI2 is SPR, the nonlinear observer is passive. [] T h e o r e m 7.3 (ISS a n d G E S O b s e r v e r ) The observer (7.15a)-(7.15d) with disturbance w is input-to-state stable (ISS). In addition, the observer error dynamics is rendered GES i.f we disregard the zero-mean disturbance, w~-O.
P r o o f . Consider the .following Lyapunov .function candidate: Vo =
+
7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering
123
Time differentiation of Vo along the trajectories of D and Xo yields: ~:o = - D T ( D + DT)D + Xo -T (PoAo + droPo) JCo + 2DTjT(r -- 2DT jT(r
(7.29)
+ 2~ToPoEow.
If the K Y P Lemma is satisfied .for the mapping e, ~ AoT 10o = - - Q o and BoT Po = Co, Vo can be written as:
zo, with PoAo +
17o = --DT(D + DT)D -- YcTQo&o + 2?cToPoEow.
(7.30)
From (7.30) it is seen that
~:o < O,
II~:o[I > 2
HQO1Eow[I
(7.31)
which shows that the observer is ISS. Moreover, in the disturbance free case, w =- O, the equilibrium point of the error dynamics is GES. []
Regarding the choice of observer gain matrices, the tuning procedure can be similar as for the observer for free-floating ships in Fossen and Strand [6]. Pole placement techniques can also be applied.
3.2 Augmented Observer The proposed observer in Section 3.1 can be further refined by augmenting a new state. The augmented design provides more flexibility and it is the basis for the adaptive observer in Section 4. We start by adding a new state, x i , in the observer, which is the low-pass filtered innovation ~: ~cI = - T f i x l
§ ~1= - T ? l x I
+ ~ + C~,~
(7.32)
where x I E !tP and T$ =diag{T$i,Ti2,T$3 } contains positive filter constants. High-pass filtered innovation signals can be derived from x I by:
~.f = - T f l x f + ~ = - T ? l x f -t- (] + Cw~
(7.33)
Thus, both the low-pass and high-pass filtered innovation is available for feedback. Moreover,
-
i+~I:'Y{O(S)T[: ~,{i}(s~ } , l+Tfis ~
\
(i = 1,2,3)
(7.34)
2
The cut-off frequency in the filters should be below the frequencies of the dominating waves in the WF model (7.6a)--(7.6b). Augmented Observer Equations
124 7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering The augmented observer is formulated as: = A~ + KlhYf
(7.35a)
= J(r163 + K2~ + K21xf -t- K2hYf
(7.35b)
= -Tb1~, + Ka9 + Ka~xl
(7.35c)
M~, = - D i - Jr(r
+ JT(r
+ Tthr (7.35d)
~_ jT(@y) (K4y ~- K41xj: -I- K4hYf)
9=~+c~
(7.35e)
where xy is the low-pass filtered innovation vector and Yl is the high-pass filtered innovation given by (7.32) and (7.33), respectively. Here Klh E ,~6x3 and K21, K2h, /(31, K41, K4h C ~3x3 are new observer gain matrices. Augmented Observer Error Dynamics
The augmented observer error dynamics can be written compactly as: (7.36a)
MP, = - D~, - J T (~py)Cj:a
(7.365)
~ca = AaS:a + B,,I(*py);' + Eaw
where
[
(7 7)
Za A_~K4~1 + K41xf + K4hYf -t- G~ -- b ~ Ca~ga,
(7.38)
and
Aa =
A~. - KlhCw - ( K 2 -~- K2h)Cw c~
-KaC~ 0 I 0 0
Ba =
Ca =
--Klh - ( K 2 -t- K2h) I -Ka Ea=
'
(K4 -t- K4h)Cw
0 0 0
Klhrf 1 K2fr) -1 - K21 -Kal
0 0 0 _Tb-1
0 0 0
Eb
(K4 -I- K4h) -F- G
- K 4 h T i I -t- K41
-i]
The signals Sf and x I are extracted from xa by Yl = ChiC. and xf = Cz2a where Cs--[C~
I-T~-'
Passivity and Stability
0],
Cz=[O
0
I
0].
(7.39)
7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering
125
Let Va = ~'TM ~ + YcTPaYca
(7.40)
be a Lyapunov function for the observer error dynamics (7..36a)--(7.36b). As before, the cross-terms of ~ and ~a in the expression for Va are cancelled by using an SPR-Lyapunov design, where it is required that: A T pa + PaAa -- - Q a ,
B T pa = Ca
(7.41)
The existence of a Pa = p T > 0, satisfying the KYP Lemma for the augmented observer error dynamics, can be tested numerically for a fixed set of observer gains by using the Frequency Theorem with 7~ = Pa, .4 = Aa, 13 = Ba, CT = - C a and u = e~. If so, the passivity and stability properties are similar as for the observer in Section 3.1 where: Va = --;'T(D + D T ) ; ' -- xTQaxa + 2y:TaPaE~w
(7.42)
and ~'a < 0,
[[Xa[I ~> 2 [[QalEaw[]
(7.43)
4 Adaptive Observer In this section we treat the problem when the parameters of Aw in the W F model (7.6a) are not known. The parameters vary with the different sea-states in which the ship is operating. Gain-scheduling techniques, using off-line frequency trackers and external sensors such as wind velocity and wave radars can be used to adjust to the W F model to varying sea states. However, this can be circumvented by using an adaptive observer design. Since the wave models are decoupled, A and ~ in A~ are diagonal matrices, and we have: A~o(0~)= [
0 _~2
T
I ] ~ [ 0 -2A~ = -diag(0~l)
I ] -diag(0w2)
(7.44)
S}~3
where 0~ = [0~1, T 0~2 ] , 0wl, 0~2 E , contains the unknown wave model parameters to be estimated. We start with the following assumption: A3
( C o n s t a n t e n v i r o n m e n t a l p a r a m e t e r s ) . It is assumed that the unknown parameters ~ and A in the W F model (7.7) are slowly varying and within the range of
0 <00J~~min"~<02~<<~maxw~ } ,
i = 1, 2, 3
(7.45)
such that A~ is Hurwitz. Hence, the unknown wave model parameters are treated as constants in the analysis, such that b~ = 0
(7.46)
126 7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering
~.1
Adaptive Observer Equations
The adaptive version of the observer is equal to the augmented observer (7.35a)-(7.35e), except from the W F part which is now using the estimated W F parameters, 0~, such that: ~.
^
^
= A~(O~)~ + Klh~lf.
(7.47a)
The adaptive update law for 0~ remains to be decided.
4.2
Adaptive Observer Error D y n a m i c s
The adaptive W F observer error dynamics is:
= A ~ - Aw(O~)~ - KlhYf + E~w~
(7.48)
By adding and subtracting A ~ , defining BwT~(~)0~ = (A~ - A~(t~))~
(7.49)
where 0~ : (~ - 0~ denotes the estimation error, T
^
A
T~(~) : [ diag(~l)
diag(~2) ],
(7.50)
I
(7.51)
where ~ = [~T, ~T]T, ~1, ~2 9 !}~3, and by using (7.33) then (7.48) can be rewritten as: (dw - KlfCw)~
- Klf?)
r
^
+ BwTw(~)Ow
+ K l f T ~ l x i + E~w~
(7.52)
The observer error dynamics can be written compactly as:
M~ -- - D 5
(7.53a)
- J T (~y)Ca~c a
2a = Aj:~ + B~J(r
+ HaT~(~)O~ T ^ - + E~w
(7.535)
where
Ha= [ 4.3
0 0 0
(7.54)
Stability and Passivity
Let -T w - - 1 Ow ~ Vad ---- DT M 5 + s:T p a ~ a + OwE
(7.55)
7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering
127
be a Lyapunov function for the observer error dynamics (7.53a)--(7.53b), where F~ -- F T > 0 C .~}~6x6 is an adaptive update gain matrix. Thus:
l~rad : __~,T(D + DT)F] _ 2 F ] T f l T ( r nt- 2 ~ T j T ( r
a ~ _ x a ( m a PTa . . . i .
PaAa):~a
a + 20wTw(~)g2 -T ^ T PaXa -
+ 2SaPaEaw + 20Tr-lb~
(7.56)
In the adaptive case, we want the W F adaptive law to be u p d a t e d by the high-pass filtered innovations signals9 Hence, it is required that:
ATpa + PaAa = - Q a
(7.57a)
BTaPa = Ca
(7.57b)
HT p~ = Ch
(7.57c)
The existence of a Pa = p T > 0 that satisfies (7.57a)-(7.57c) can be tested numerically by using the Frequency Theorem with 7 ~ = P , , .4 = Aa, U = [gT,-T 0~] T and B=[
Ba
Ha ]
=-
[ Ca
"
(7.58)
Since there exists a solution, the expression for l)~d becomes: I)ad = --YcTQaY:a -- DT(D + DT)D -1- 2ycTpaEaW -T ^ ~ + 20wr~ ~T - 1 0~ "+ 20,~T,~(~)Chz,
(7.59)
which suggests that the adaptive update law should be:
Ow o,,,,--o b~ = - r w T ~ ( ~ ) C h 2 a = - F ~ T ~ ( ~ ) ~ I
(7.60)
V~d = --YcTQaYca -- ~'T(D + DT) ;' + 2s:TpaEa w
(7.61)
Thus,
and error dynamics of the adaptive observer is:
M~, = -OF, - JT(r Yca= A j c a + BaJ(r 9
= -FwT~(~)Chia
(7.62a)
+ HaTw(~)0w + Eaw T
^
~
(7.625) (7.62c)
From (7.61) we see that the system is ISS. If the ship is exposed to W F motions, the system will be P E and 0w --* O~.
128 7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering Basin
wi~r X, Y
wave generator
FIGURE 4. Left: Experimental setup. Right: Picture of Cybership I.
5
Experimental Results
Both the augmented and the adaptive observer have been implemented and tested at the Guidance, Navigation and Control (GNC) Laboratory, D e p a r t m e n t of Engineering Cybernetics at the Norwegian University of Science and Technology (NTNU). We have used Cybership I, a model ship of scale 1:70, see Fig. 4. A nonlinear P I D controller is used for maintaining the ship at the desired position ( X d , Yd) and heading ~d- The P I D control law is written: Tc = - - M J T ( r
-- Cd)
(
K p e ~- K d e -~ K i
/0 ) e d~
(7.63)
where e is the estimated LF position deviation, defined as: e ~- J T ( r
- - ~?d)
(7.64)
An illustration of the experimental setup is given in Fig. 4. The experimental results are scaled to full scale by requiring that the F r o u d e n u m b e r Fn
U
-- - -
--
constant,
(7.65)
where U is the vessel speed, L is the length of the ship and g is the acceleration of gravity. The scaling factors are given in Table 7.1 where m is the mass and the subscripts m and s denote the model and the full-scale ship, respectively. The length of the model ship is L m : 1.19 m and the mass is mm = 17.6 kg. A full scale ship similar to Cybership I has typically a length of 70-90 meters and mass of 4000-5000 tones. In the scaling we used Ls = 70Lm meters and ms = 4500 tones.
7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering I position:
129
Ls/Lm
I linear velocity: I angular velocity: I linear acceleration:
1
I angular acceleration:
L,~/Ls
I force:
ms/' m
I moment:
m,~L,~
rn.~ L m
I time: TABLE 7.1. Scaling factors used in the experiments (Bis scaling).
The experiment was performed in three phases:
9 Phase I (No waves). Initially the ship is maintaining the desired position and heading with no environmental loads acting on the ship (calm water). When the data acquisition starts, a wind fan is switched on. There is no adaptive wave filtering and the observer is identical to the augmented design in Section 3.2. The effect of the wind loads are reflected in the bias estimates in Fig. 6. 9 Phase II (Waves, adaptive wave .filter off). After 1700 seconds the wave generator is started. In this phase we can see the performance of the observer without adaptive wave filter. In the wake model we are assuming that the dominating wave period is 9.2 seconds and the relative damping is 0.1, see Fig. 5. 9 Phase III (Waves, adaptive wave .filter on). After 2800 seconds the adaptive wave filter is activated. The estimates of dominating wave period and relative damping are plotted in Fig. 5 for surge, sway and yaw. A spectrum analysis of the position and heading measurements shows that the estimated wave periods converge to the true values, that is wave periods of approximately 7.8 seconds and relative damping ratios of 0.07, see Fig. 5. In Fig. 6 the measured position deviation and heading are plotted together with the corresponding LF estimates. The effect of the adaptive wave filtering is clearly seen in Fig. 8, where the innovation signals are significantly reduced in Phase III, when the adaptation is active and the wave model parameters start converging to their true values. The effect of bad wave filtering is reflected by noisy control action by the thrusters in phase II, see Fig. 8. A zoom-in of the heading measurement together with the LF estimate is given is Fig. 7 both for phase II and III. Here we see that the LF estimates have a significant W F contribution when the adaptive wave filter
130 7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering Estimated wave periods [see]
Estimated relative damping ratios [-]
9.5
0.3
8.5
o2
7.5
"~
9
7 ...r.~..~.~?.................
....~..:~.~:................
,~ o
6.5 I000
<
I
2000
F~
3000
11 ~ time [sec]
4000
II1
5000
lO00
I
2000
II
3000
~" time [see]
4000
5000
111
FIGURE 5. Estimated wave periods (left) and estimated relative damping ratios (right) for surge (solid), sway (dashed) and yaw (dashed). The adaptive wave-filter is activated after 2800 seconds.
is off. This is the reason for the noisy control action in phase II. T h e other zoom-in shows excellent LF estimation when the adaptive wave filter is active and the wave model parameters have converged to their true values. Hence, it can be concluded that adaptive wave filtering yields a significant improvement in performance compared to a filter with fixed p a r a m e t e r s and varying sea states.
6
Conclusions
In this paper we have derived a nonlinear passive observer for m o o r e d and free-floating ships. By adding low- and high-pass filtered innovation signals in the design, we have additional flexibility in the design. This a u g m e n t e d design is extended to a new observer with adaptive wave filtering. Experiments with a model ship shows that the adaptive observer will significantly improve the performance of the ship positioning system. This results in reduced magnitude of the observer innovation, better filtering properties and reduced control action by the propulsion system.
ACKNOWNLEDGEMENTS T h e authors are grateful to Dr. Anton Shiriaev, Dept. of Eng. Cybernetics, N T N U , for valuable discussions on passivity and frequency domain m e t h ods. This work is sponsored by the Research Council of Norway and ABB Industri AS.
7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering
131
Estimated bias in surge [kNI
Measured (grayl aJld estimated LF (solid) x-posilioiL deviation [m]
0 -10 -20 -30 -40 -50 -60 -70
-3
-80 -90 1000 Measured r
2O00
30OO
4O0O
50O0
and estinlated LF (solid) y-p~Jsition deviation [nil
1~
2000
3000
4000
Estimated bias in sway IkN I
20OO
tO(O)
5000
Maurcd(graveslimatcdLFtsolidanadcsireddt)tledheJdi.nldeg]
3000
4000
Estimated bias in yaw [kNm] 0
-136
-200
-138
-400
-140 -142 -800
-1441
-ICO(
-146 1000
<
2000
3000
>< 1
4000
5000
><
>
II time [see]
0
1c00
<
Ill
200o
300o
~-< I
4000
5000
><
>
It
III time [see]
F I G U R E 6. Left column: Measured position and heading together with corresponding LF estimates. Right column: Estimated bias in surge, sway and yaw.
Z o o m - I n : Measured (dotted) and estimated LF (solid) heading (dashed)
I l L Adaptive W a v e - F i l t e r On
11: Adapti;'e Wave-Filter Off -135
-135
-136
-136 -137
...
I :"
" "~
.4 ""
,: --
.t.
~.
-137
'.
-138
-138
-139
-139 -140
-140
-141
-141 -142
"'
'~V
-142
-143
-143
-144
-144
-145 2300
Ldeg]
2320
2340
2360
t i m e [sec]
2380
2400
-145 4900
VVVVVV JVVVVV 4920
4940
4960
4980
tirae [see]
F I G U R E 7. Zoom-in of measured and estimated LF heading. Left: Observer without adaptive wave-filtering. Right: Observer with adaptive wave-filtering.
5000
132 7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering
~
Innovation in x..~e,~ou [m]
thrust in sm'sc
80 70 60 5O 40 30 20 10 0 -10
Innov~
I000
in y-d~-~on Ira]
2000
30~0
~
5000
Commmsded~a'u~ m ~wry Dr~rJ
0.8
180
0.6
160 140
0.4
1:20
0.2
I00 0
80
-0.2
60
-.0.4
~0
-0.6
20
-0.8
0
zm~o-~tionin ~
[~ld
C~,,,m,,,~d tt~umin ~
[Sdqm]
0.8 1400
0.6
1200
0.4
1000
0,2
8OO 0
6OO
-0.2
4OO
-0.4
200
-0.6
0
-0.8
1000
I
2000
3000
I!
4000
Ill
5000
-2OO
1000
< [
2000
II
~00
4000
~00
III
F I G U R E 8. Left column: Innovation in position and heading. Right column: Commanded thrust in surge, sway and yaw.
7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering 7
133
REFERENCES [1] M. F. Aarset, J. P. Strand and T. I. Fossen. Nonlinear Vectorial Observer Backstepping With Integral Action and Wave Filtering for Ships. Proc. of the IFAC Conference on Control Applications in Marine Systems (CAMS'98), Fukuoka, Japan, 1998, pp. 83-89, 1998. [2] H. Berghuis and H. Nijmeijer. A Passivity Approach to ControllerObserver Design for Robots. IEEE Trans. on Robotics and Automation, RA-8(6):740-754, 1993. [3] C. Canudas de Wit, N. Fixot and K. J. Astrom. Trajectory Tracking in Robot Manipulators via Nonlinear Estimated State Feedback. IEEE Trans. on Robotics and Automation, RA-8:138-144, 1992. [4] O. M. Faltinsen. Sea Loads on Ships and Offshore Structures. Cambridge University Press, 1990. [5] T. I. Fossen. Guidance and Control of Ocean Vehicles. John Wiley & Sons Ltd, 1994.
[6] T. I. Fossen and J. P. Strand. Passive Nonlinear Observer Design for Ships Using Lyapunov Methods: Full-Scale Experiments With a Supply Vessel. Automatica. AUT-35(1)3-16, 1999. [7] A. H. Gelig, G. A. Leonov and V. A. Yakubovich. Stability of Nonlinear Systems with Unique Stationary Points. Nauka. Moscow, 1978. [8] H. K. Khalil. Nonlinear Systems. Prentice Hall Inc., 1996. [9] A. Loria, T. I. Fossen and E. Panteley. A Cascaded Approach to a Separation Principle for Dynamic Ship Positioning. IEEE Trans. on Control Systems Technology (submitted). [10] J. N. Newman. Marine Hydrodynamics. MIT Press. Cambridge, Massachusetts, 1977. [11] S. Nicosia and P. Tomei. Robot Control by Using only Joint Position Measurements. IEEE Trans. Automat. Contr., AC-35:1058-1061, 1990. [12] S. Nicosia, A. Tornambe and P. Valigi. Experimental Results in State Estimation of Industrial Robots. Proc. Conf. Decision and Control, Honolulu, HI, Dec. 1990, pp. 360-365, 1990. [131 R. Ortega, A. Loria, P. Nicklasson and H. Sira. Passivity-Based Control of Euler-Lagrange systems. Springer-Verlag, 1998. [14] J. J.-E. Slotine, J. K. Hedrick and E. A. Misawa. Sliding Observers for Nonlinear Systems. ASME J. Dynam. Syst., Measurement, Control, Vol. 109, pp. 245-252, 1987.
134 7. Nonlinear Passive Observer Design for Ships with Adaptive Wave Filtering [15] A. J. Scrensen, S. I. Sagatun and T. I. Fossen. Design of a Dynamic Positioning System Using Model-Based Control. Journal of Control Engineering Practice. CEP-4(3):359-368, 1996. [16] J. P. Strand, A. J. Scrensen and T. I. Fossen. Design of Automatic Thruster Assisted Position Mooring Systems for Ships. Modeling, Identification and Control, MIC-19(2):61-75, 1998. [17] J. P. Strand and Thor I. Fossen. Nonlinear Output Feedback and Locally Optimal Control of Dynamically Positioned Ships: Experimental Results. Proc. of the IFAC Conf. on Control Applications in Marine Systems (CAMS'98), Fukuoka, Japan, pp. 89-95, 1998. [18] J. P. Strand. Nonlinear Position Control Systems Design for Marine Vessels. Doctoral Dissertation. Department of Engineering Cybernetics, NTNU, Trondheim, Norway, 1999. [19] M. S. Triantafyllou. Cable Mechanics for Moored Floating Systems. BOSS'94 pp.67-77, Boston, MA, 1994. [20] V. A. Yakubovich. A Frequency Theorem in Control Theory. Siberian Mathematical Journal SMJ-14(2):384-420, 1973.
Nonlinear Observer Design for Integration of D G P S and INS Bjornar Vik, Anton Shiriaev and Thor I. Fossen D e p a r t m e n t of Engineering Cybernetics Norwegian University of Science and Technology N-7034 Trondheim, Norway
1
Introduction
This section contains the notation used in this chapter and the main motivation for the work with references to previous research.
1.1
Nomenclature
In the rest of this chapter, we will use the notation: w~ b ~ ~3
:
v a E !}~3
: : :
a~ ~ so(3)
Angular velocity of frame a relative to frame b, decomposed in frame c. Linear velocity decomposed in frame a. 3 • 3 skew-symmetric cross product matrix. Rotation matrix from the b-frame to the a-frame.
The navigation frames used in the following are: i-~ame
:
e-~ame /-~ame b-Lame
: : :
The inertial frame located at the center of the E a r t h (ECI-frame). Earth-Centered, Earth-Fixed (ECEF) frame. Local geographic, North East Down (NED) frame. Body-frame.
T h e origins of the i- and c-frames are assumed to coincide. These frames are, however, rotated relative to each other with the E a r t h r o t a t i o n rate. The /-frame origin is located in the center of the navigation system. T h e b o d y frame (b-frame) is assumed to be aligned with the vehicle's roll, pitch, and yaw axes with the origin located in the vehicle's rotation point. In this chapter, it is assumed that the platform containing the inertial measurement unit (sensors) is in the b-frame. It is also assumed t h a t there is no lever a r m between the GPS and the inertial measurements. T h e position and velocity variables are defined according to:
136
8. Nonlinear Observer Design for Integration of DGPS and INS
~3, q, r] T
p~ Ve Vl
P q
1.2
IX, y, Z] T = = =
[Vx,Vy,Vz] [vN, [p,,
~--
[51,...,
=
[ql,q2,q3,q4] T
5n] T
: Angular velocity of the b o d y frame relative to the local frame. : Distance from center of E a r t h to the navigation system in E a r t h coordinates. : Velocity of vehicle in E a r t h coordinates. : North, East and Down velocities. : Vector of n pseudorange observations. : Vector of n deltarange observations. : Euler p a r a m e t e r s (unit quaternions).
Motivation
The development of GPS technology the last two decades has had a great impact on navigation. For local area operations, it is now possible to achieve meter accuracy by using Differential GPS (DGPS), and sub-decimeter accuracy by using Carrier Differential GPS (CDGPS). T h e development of wide area augmentation systems (WAAS) is expected to give m e t e r accuracy across entire continents. However, the integrity and reliability of G P S are not on a sufficient level. Interference on d a t a links, and several types of disturbance causing loss of lock on satellites, are c o m m o n problems for m a n y G P S applications. The latter can be a problem at latitudes between 60 and 70 degrees North, where the satellite coverage is low to begin with. In addition to the time spent reaquiring satellite signals, the ambiguity integers must also be recalculated after each loss of lock in applications where C D G P S is used. A strapdown Inertial Navigation System (INS) is basically a cluster of accelerometeres and gyros known as an Inertial Measurement Unit (IMU), and a computer. The INS computer calculates position, velocity, heading and attitude using t h e strapdown equations. The INS is self-contained, and does not depend on an external signal to function properly. However, the long t e r m drift resulting from integrating small m e a s u r e m e n t errors in the gyros and accelerometers is a m a j o r problem, and only very expensive units can be expected to have sufficient accuracy for a long period of time without re-initialization. T h e complementary properties of G P S a n d INS, suggest t h a t integration of GPS with low-cost INS will give an affordable, highly reliable, and accurate solution. There are several levels of integration possible, see Maybeck [4]. T h e most c o m m o n approaches shown in Figures 1 and 2, b o t h s u b t r a c t the INS position m e a s u r e m e n t from t h a t of the GPS system, and feed this error signal into a K a l m a n filter (KF). T h e K F model is usually a Markov process or integrated white noise for each of the error sources in the G P S a n d INS systems. T h e estimated INS errors are subsequently either s u b t r a c t e d from the INS measurement to give the o u t p u t signal (open-loop), or fed back to the INS strapdown c o m p u t a t i o n processor (closed-loop). W h e n the position measurements from GPS are lost, the K F is not updated, and the INS has
8. Nonlinear Observer Design for Integration of DGPS and INS
137
GPS Receiver
k Kalman Filter
I I
Strapdown E~176
Gyros Accelerorneters IMU
c,o0 . .P,V,q ... ]= P,V,q
+
INS
FIGURE 1. Open loop integration of INS and GPS. to coast through the outage using the last error u p d a t e from the filter. This is often referred to as uncoupled integration in the open-loop case, and loose integration in the closed-loop case. The filter can be designed with a slow u p d a t e rate due to the slow dynamics of the error model. GPS receivers have an internal K F that computes position, velocity and time ( P V T ) solutions, and the outputs are correlated typically with 1-10 seconds correlation time. In order to avoid problems with the second KF, this filter needs to be updated sufficiently slower. In addition, the timing and lags between the two filters can be difficult to predict and control, see Tazartes and Mark [9]. A different approach to integration, often referred to as tight integration, is to use the raw accelerometer, gyro, pseudorange, and deltarange measurements from the sensors. One main advantage with this approach is t h a t even in the case where less t h a n the necessary amount of satellites are available for a P V T solution, the remaining pseudoranges and deltaranges provide information t h a t will help the INS coast through the outage. Also, the accuracy of the solution is known to be better when using uncorrelated measurements in the estimator. T h e cost is increased complexity a n d bandwidth because the filter dynamics includes the vehicle dynamics. However, with the increased computing power available in the future, this is going to be less of a problem. W h e n the nonlinear strapdown equations enter into the integration filter, a linear K F should not be used. In this chapter, we therefore propose nonlinear observers for tight integration t h a t estimate the accelerometer and gyro biases, scale factor errors and misalignments using GPS measurements. The outline of the chapter is as follows: In Section 2 some G P S basics are reviewed, and in Section 3 the INS strapdown equations are given. T h e stability and convergence properties of the proposed observers are analyzed in Section 4, and Section 5 contains a case study. Our conclusions are given
138
8. Nonlinear Observer Design for Integration of DGPS and INS
GPSRece!~'er.~/ ~ ~,
GPS ~ ........
Velocity
Aiding
~J
GPSErrorEstimates
/ /
P,V,q
GPS
1
tsl I~a,ma~ I
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
L Gyros
i IAccaler~ IMU
#
H ~
Strapdown Equations
). P,V,q
--
.
INS
t'- ~--~ iNS ErrorEstimates
Corrected
P,V/:I
1
FIGURE 2. Closed loop integration of INS and GPS. in Section 6.
2
Review of G P S Fundamentals
T h e raw GPS measurement is called pseudorange, and it is the distance from the receiver antenna (here assumed to be located at the origin of the body frame) to the satellite antenna. There are two basic ways of measuring the pseudorange, code and phase measurements. Phase m e a s u r e m e n t s are the most accurate, but also the least reliable measurements. To convert pseudoranges into user position, four m e a s u r e m e n t s are needed. In addition to the three unknown position variables, the clock bias is unknown. W i t h DGPS, the satellite clock errors cancel out when the differential corrections are applied, and only the receiver clock bias will be unknown. The clock bias multiplied with the speed of light gives the position error, which can be very large. Given the cost of very precise atomic clocks, it is a better solution to include the clock bias a m o n g the unknowns using four satellites instead of three. Given n satellites, the basic pseudorange measurement equation is:
(8.I) where i E [1,-. 9, hi, and x~ E ~R3 is the position of satellite i. This position is transmitted with the navigation message, c is the speed of light, T* is the clock bias and cpi is the composite of other smaller pseudorange errors a n d
8. Nonlinear Observer Design for Integration of DGPS and INS
139
noise. Note that 9-* is common to all n equations. The position is usually found by linearizing (8.1), that is:
(8.2)
A p ~9 = - A J A x + 9- + epi where
~ =
(x~ - x0) ~ v/(x~ - x0)T(x~ - x0)
(8.3)
and 9- = c9-*, x0 is an estimate of x, and A x is the position error. If we stack n measurements together in a vector, we obtain: Ap = A . A x .
(8.4)
where A x . = [Ax T, AT] T E ,~4, Ap E !l~n and
_Alx _AIy _Alz 1 _A2z _A2y _A2z 1 A.= _A~
_A~y _A~
E ~nx4
(8.5)
1
A x . can now be found from
Z~x.= h**Ap
(8.6)
Notice that, the pseudo-inverse At. -- rATA.] - 1 A T of A. always exists9 Hence, by using a KF for instance, x and 7- can be computed from a minim u m of four pseudorange measurements. The distance from the user to a satellite in view is approximately 20,000 kin, and A. is therefore not very sensitive to position errors since A~ is the line-of-sight (LOS) unit vector from the user to satellite i. Thus, with one exception which will be addressed later, we can consider A. constant throughout this chapter, even though it is a function of both satellite and user positions. W h e n calculating position only, and not clock bias, we will use the following LOS matrix
_AI~ _AIy _AI~ __A2x
__A2Y
__A2z
~,~3
A=
_A~
(8.7)
_A~y _An~
The pseudo-inverse A t of A will always exist. The deltarange equation is written: 6i = Ar
-v)
+ef* +c~
(8.8)
140
8. Nonlinear Observer Design for Integration of DGPS and INS
where v / is the velocity of satellite i, f* is the clock drift (or frequency bias), and r is the composite of other smaller deltarange errors and noise. v and f = cf* can be calculated equivalently to (8.6) from A v . = At.AS
(8.9)
where A v . = [Av T, A f] T, or directly from v . = A~(5 - 5sat)
(8.10)
=
(8.11)
where v . = [v T, f]T, and
is the known satellite velocity component of the deltarange vector. T h e main focus of this chapter is the INS dynamics, and for simplicity it is assumed that 6pi = 0 and c~ = 0. The clock bias and clock drift are the only GPS errors that will be estimated together with the INS errors. More details about GPS navigation processing can be found in Parkinson and Spilker [5], Chapter 9.
3
Strapdown
Equations
PVA measurements from an INS are derived from acceleration measurements along the three orthogonal axes in the body (vessel-fixed) frame, and from angular velocity (gyroscope) measurements of the rotation of the inertial frame relative to the body frame. For applications on the Earth, the inertial frame can be well approximated by the Earth-Centered-Inertial (ECI) frame. The GPS measurements are usually given in the E C E F frame, while the raw INS measurements are given in the body frame. The best choice of reference frame for integration will depend on the application and the priorities of the system integrator.
3.1
Local Frame Representation
The large majority of strapdown implementations are mechanized in the local frame. The basic velocity equation decomposed in the local frame can be found in standard textbooks, e.g. Britting [1], and it is written: 91 = R~a b + E l - [2S(w~e ) + S(wlel)] v l
=
(8.12)
(8.13)
where a b E ~3 is the measured acceleration, and ~t = gl _ S(w~)S(w~e)pt is the plumb-bob gravity where ~ l = [0, 0, gD]T
(8.14)
8. Nonlinear Observer Design for Integration of DGPS and INS
141
p~ E ~3 is the position vector from the center of the E a r t h to the b o d y frame decomposed in the Earth-fixed frame, w~e is the rotation rate of the E a r t h decomposed in the local frame, and wzel is the rotation rate of the local frame relative to the Earth-fixed frame (due to vehicle m o v e m e n t z and w~l l are in over the E a r t h surface) decomposed in the local frame, w~e general slow compared to the angular velocity wbb of the b o d y frame relative to the NED frame. T h e y can therefore, even though they are functions of the linear velocity v l, be assumed known. Finally, gD is the acceleration of gravity, which can be estimated from the following equation:
gD = Cl (1 - 2h/ro + 2esin2#) - G2(1 - 3sin2tt) - G3(1 - 3sin2# + h/ro)cos2#
(8.15)
where G1, G2, G3, r0 and e are positive E a r t h model constants, h is altitude and p is geodetic latitude. From (8.15) we see that the acceleration of gravity decreases as the height increases. T h e differential equation for the height is:
= --gD -- az
(8.16)
Thus, the INS stand-alone vertical channel is unstable since this is a pure double integrator. Integration with G P S height measurements can be used to stabilize the vertical channel. However, in the event of a G P S dropout, the vertical channel of the INS would go unstable. Therefore, a b a r o m e t e r is usually employed in the integration filter for applications in the air, while for marine applications a height constraint can be used.
3.2
Earth Frame Representation
It has been suggested, Wei and Schwartz [12], that an E a r t h frame mechanization can offer some advantages over the local level frame. An increase in computational speed was found, while the accuracy was found to be comparable to t h a t of a local level implementation. On the other hand, as seen from (8.14), the gravity equations are rather simple and directed only along the vertical axis in the local level frame. In the E a r t h frame, however, difficulties arise in representing the E a r t h ' s gravitational field precisely in the navigation computer, see T i t t e r t o n and Weston [10]. Gravity will have a component along all three axes, and all three axes need to be stabilized. The use of barometer data to stabilize all three axes would require more investigation into the practicability and detail of implementation, see e.g. Tang and Howell [8]. In this chapter, we will not deal with this problem, because GPS measurements are assumed to be present at all times, but we present an observer mechanized in the e-frame as an alternative to the system integrator. We will therefore consider the velocity equation decom-
142
8. Nonlinear Observer Design for Integration of DGPS and INS
posed in the E a r t h frame, t h a t is: +e = R ~ l ~ a b + R r $
- 2S(CO~)v ~
(8.17)
l~e ~ V e
3.3
(8.18)
A n g u l a r Velocity E q u a t i o n s
T h e rotation m a t r i x R~, which gives us roll, pitch, and yaw angles, can be calculated from
R~= R~S(COfb)
(8.19)
b = COil b + C~ which leads to T h e gyros measure COib
l ,b = R~S(CObb) -- S(COil)R
(8.20)
z @ coel" t Another representation of a t t i t u d e is Euler p a r a m e where coilt z coie ters (unit quaternions). T h e set 7-/of unit quaternions is defined as ~={qlqrq=l,q=[rh
e T I T , r l E ! I ~ , e E ! t ~ a}
(8.21)
T h e most c o m m o n error representation used for unit quaternions is ~t = Ct ~ q E 7-/, where the product C~ is defined as
This is equivalent to R = I~TR. W h e n the error is zero, l~ = I and cl = [• 0, 0, 01T. T h e dynamics associated with the unit quaternions takes the form: ~--~ 2
[f]I Jr- S ( c ) ]
coib -- -2
[7]I
-
-
S(s
COil
Finally, we note t h a t the relation between the rotation m a t r i x R~ and q is Rtb(q) = I3~a + 2r/S(e) + 2S2(5)
4
(8.24)
N o n l i n e a r Observer D e s i g n
T h e main task of the observers is to estimate the most i m p o r t a n t errors of accelerometers and gyros. In addition, filtering of G P S and INS positions,
8. Nonlinear Observer Design for Integration of DGPS and INS
143
velocities, and angular velocity measurements are achieved. T h e most dominating errors are usually biases, scale-factor errors and misalignment errors. The biases and scale-factor errors are usually precompensated through calibration and t h e r m o m e t e r feedback, and the misalignment errors (which include non-orthogonality of the sensor axes) are m a d e as small as possible during mounting of the IMU. However, there are still residual errors t h a t need to be estimated. In the following, the errors will be modeled in a standard way as: [I + A(~, OL)O.Jimu] -~- b l
(8.25)
a b = [I + /k(e,/3)aimu] + b2
(8.26)
b OJib
~---
where
[ Cyx ~ Cxy Cxz] ~y Cyz
A=A(~,r
(8.27)
Here cr = [erx, cry,
r = [r
r
Cyx,%=, r
Czy]T
are six small misalignment angles, b l E !~3 and b2 E ~3 represent gyro and accelerometer biases, respectively. Notice that all error signals are decomposed in the body frame. T h e magnitude of these errors are in general directly proportional to the price of the IMU. The effect of white noise on stability will not be considered in this chapter. However, all results can easily be extended to show t h a t the solutions will converge to a ball around the origin when white noise is considered. W i t h o u t loss of generality, we will therefore restrict our analyses to the case of zero noise.
4.1 Angular Velocity Observer W h e n bias, scale-factor and misalignment errors are included, (8.23) is written: 1
1[
2
--c T
-e T
[~I-s(~)]
]
l
(8.28)
~'
where A1 = A(n, a). The gyro error models are assumed to be described by the 1st-order models: 1~1
=
- T 1 1 b 1 -Jr- W 1
(8.29)
k
=
-W21t~ + w2
(8.30)
&
=
-Tala+w3
(8.31)
144
8. Nonlinear Observer Design for Integration of DGPS and INS
driven by bounded signals wi E N a (i -- 1, 2, 3). The matrices Ti > 0 (i = 1, 2, 3) are diagonal matrices of time constants, n = [n~, ny, nz] r are three gyro scale factor errors, and a = [axy, C~z, aye, ayz, azx, azy]r are six small gyro misalignment angles, bl = - b g y r o represents the biases of the gyros. The nonlinear observer of Saleudean [6] is extended to include bias and error update laws according to:
[
~
1
_gT
2
[?)I -- S(~)]
(8.32)
02il
9 1 g~ = -T~ 1~,~ + ~sgn(9) -
(8.33)
1
k = - T ~ - l k + ~diag(g) a)imu sgn(ff])
(8.34)
(~ = - T 3 1 5 + iF(g)Wimu sgn(~)
(8.35)
where 0 0 r(e) =
e2
0 g3 0
gl 0 0 0 0 g3
0 gl 0 g2 0 0
(8.36)
The error model is found by combining (8.29)-(8.35):
1[ [~I + S(g)] ]E , imu l Kl sgo( ,l bl= -Tllt)l
1
- ~gsgn(~)
,837, (8.38)
1 ~ = -T~-I~ - ~diag(g) wire. sgn(9)
(8.39)
1 5 = - T 3 1 5 -- ~r(g) Odimu sgn(~)
(8.40)
Note that the equilibrium points (~/, g, I~1, ~, 5) = (4-1,0, 0, 0, 0). T h e o r e m 8.1 ( E x p o n e n t i a l l y S t a b l e A n g u l a r V e l o c i t y O b s e r v e r ) The equilibrium points (=t=1,0,0,0,0) of the error model (8.37)-(8.40) are exponentially stable. P r o o f . Consider the following Lyapunov function candidate: 1~ T _ I ~ T _ 1 { ( ~ - 1)2 + g g ifT) > 0 V = lt)lT~)i + ~a; n + ~ c~ + -2 (~ + 1)2 + gTg i f ~ < 0
(8.41)
8. Nonlinear Observer Design for Integration of DGPS and INS
145
T h e time derivative along the trajectory of (8.37)-(8.40) is:
=
+
+
+ {
if <0if >0
_-- _ l ~ T T 7 1 ~ 1 _ 1gl~T/sgn(~) -- ~ T T 2 1 ~ -- l ~ T d i a g ( [ ) Wimu sgn(~)
--&Tw31a -- 2&TF(g) Wimu sgn(~) + ~ 1-T [/TklCdimu-]- bl -- gl~sgn(?~)] s~n(?)) = -I~ITTI-lbl - ~TT21~--&TT31(~ -- ~1 [ T g l [
<0,
Vbl,~,(~,c # 0
(8.42)
where we have used: ~ + ~T~ _= 0
~T/Ykl02irnu : ~Tdiag(~)~im u q_ ~TF(~) 02imu
(8.43)
(8.44)
From the relation r + ~--T[ -- 1 it is seen t h a t as ~ --* 0 exponentially, I~1 --* 1 exponentially. This completes the proof. 9
4.2
Velocity and Position Observers
In this section, we will show how tightly integrated observers for position and velocity can be designed. The observers will be u p d a t e d with raw G P S measurements, in order to take advantage of G P S measurements when there is less t h a n four satellites available. W i t h less t h a n four satellites available, these observers will still perform better t h a n the stand-alone INS. See e.g. Lewantowicz [3] for a study on graceful degradation of an integrated solution with less t h a n four satellites in view. The stability proofs below will of course not be valid in this case. Note t h a t the raw GPS pseudorange and deltarange measurements are scalars, and as such not decomposed in a reference frame. However, the LOS unit vectors t h a t give the direction from the G P S receiver antenna to the satellite antennas, are easiest to calculate in the E a r t h frame. The main reason for this is t h a t the satellite coordinates in the navigation message are given in the E a r t h frame. This means t h a t we need to transform positions and velocities to the E a r t h frame before the transformation to pseudoranges and deltaranges can be done. We have chosen to design three different observers which are mechanized in the local frame, E a r t h frame, and LOS-subspace of the E a r t h frame, respectively.
146
8. Nonlinear Observer Design for Integration of DGPS and INS
Local Frame Formulation The update mechanism of the observers takes the following form:
[
#~
] =(AC)~(pgp~--V/(p~--O~)T(p~--O~)--c'7- ) = [ (A~)*
(8.45)
for position, while velocity is given by
(,~e)~.
] (~gps-~sat-Aea~v '
C/)
(8.46)
where ~ and 9l are the position and velocity estimation errors, Pgps and 5gp~ are the pseudorange and deltarange vectors measured by GPS, ~at is given by (8.11), A~. and A ~ are defined in (8.5) and (8.7), and (A~)*. 9 ,~a is the last row of (A~)~. The superscript e is used to emphasize that the LOS vectors are decomposed in the E a r t h frame. 15%eft, ~ and ] are estimated Earth frame position, local frame velocity, receiver clock bias, and frequency bias. The l-frame velocity and position update equations (8.12)-(8.13) with bias, scale-factor and misalignment errors included are written: ~l = a / [(I + A2)aim u + b2] + ~z _ [2S(w~e) + S(wZz)] v l
(8.47)
1b~ = R~v I
(8.48)
1~2 = - T 4 1 b 2 + w4
(8.49)
= - T 5 1 e + w5
(8.50)
= -T~-~/3 + w6
(8.51)
] = -t7af + w7 = f + w8
(8.52)
(8.53)
where A2 = A(e,/3), c = [ex, ey, ez]T a r e three accelerometer scale factor errors, and /3 = [/3xv,/3xz,/3v~,/3yz,/3z~,/3z~] T are six small accelerometer misalignment errors, b2 = -b~cc represents the biases of the accelerometers, T and f are the scalar GPS clock and frequency biases, and wi E ~3 (i = 4, 5, 6) and WT, w8 C ~ are white noise. T~ (i = 4, 5, 6) are large positive definite time constant matrices, and t7 > 0 is a large time constant, all
8. Nonlinear Observer Design for Integration of DGPS and INS
147
assumed to be known. The following observer is proposed for (8.47)-(8.53): ~ l = 1~ [(I + A2)aimu + 1~2] -4- ~ l _ [2S(w~e) + S(w'el)] 9t + K2~" + ( R ~ ) T p e
(8.54)
b e ~- R~g/d- K3p s 132 = -T411~2 q-
(8.55)
(I:~Zb)Tv'
(8.56)
= -T~-I~ + diag(aimu)(l~/)Tv "/ = T ^l T~l
(8.57)
- T ~ - ~ + T (a~m.)(Rb) v
(8.58)
) = - t ~ 1] + k4f + g-
(8.59)
~- = ] + kh~
(8.60)
where T is defined in (8.92), K/ = K T > 0, (i = 2, 3) are two 3 x 3 gain matrices, and k4 and k5 are scalar gains. The error dynamics takes the form: v l = 1~ [/k2aimu + ]32] -4- E [(I + A2)aimu + b2] - [2S(w~e) + S(Jet)] 91 - K29t - (R~)T~ e --e
p = R~fi z - g 3 ~ ~ b2 = - T 4 1 l ~
-
(8.61) (8.62)
(I:[tb)Tv'
(8.63)
= - T 5 1 g - diag(aim,)(t~)T9 ' T ^l T l ~=-T~-I/3-T (airnu)(Rb) "r
(8.64)
) = - - t 7 1 / - k 4 ] - 7-
(8.66)
r = ]-
(8.67)
(8.65)
kh~
where E = R~ - l ~ . Now, (8.61)-(8.67) can be written: = f(x, t) + g(t) where
x =
;?,],
(8.68)
,
1~ [/X2aimu + 1~2] -- [2S(w~e ) + S(wlel)] ~l__ Z 2 ~ . / - (R~)T~e RT~ l - K 3 ~ e -
f(x, t) =
-T51g-
-W~l;~
(1%)Tr
diag(aimu) (I?t~)T9 l _
T
^l
T (aimu)(Rb)
T
l
kj-
f - kh~ (8.69)
148
8 Nonlinear Observer Design for Integration of DGPS and INS
and g(t) = [ E [ ( I + A 2 ) a i m u + b 2 ]
0 0 0 0 0 0] T
(8.70)
By recognizing that the rotation matrix R~, as seen from (8.24), is a nonlinear function of the quaternion q, and using the angular velocity observer above, it can be seen that the error matrix E = R~(q) - R~(~I) -~ 0 as t --* oc. Thus, it is established that g(t) --* 0 as t -~ o<3 since (I + A2)aimu + b2 = a, the true acceleration of the vehicle, obviously is bounded. The following two lemmas will be useful for analyzing the stability of (8_68). L e m m a 8.1 ( Y a k u b o v i e h , 1964) Suppose that 3 a.function V(x) satisfying the conditions: (I) V(x) is Lipschitz in every bounded region. (II)
V(x)
oo as x
oo.
( I I I ) 3 ( > 0 and a continuos function c~(x) > O, well defined when [x[ > such that for any solution x(t) of the system
5: = f(x, t)
(8.71)
when [x] > { the relation dv(x(t))
< -~(x(t))
(8.72)
hold almost everywhere. Choose the number 7? > 0 such that
F = E{V(x) _< 77} D E{lxl _< ~}
(8.7a)
Then: a) F is an invariant set of (8.71). b) For any solution x(t) of the system (8.71) ~T _> 0 where x ( T ) c F.
P r o o f . See Yakubovich [11]. 9 R e m a r k 8.1 Lemma 8.1 remains true ~f the relation (8.72) becomes valid .for any solution after some time T* > 0, which may depend on initial conditions, provided that this solution o.f (8. 71) is bounded on [0, T*].
8. Nonlinear Observer Design for Integration of DGPS and INS
149
L e m m a 8.2 ( A s y m p t o t i c S t a b i l i t y o f F o r c e d S y s t e m s ) Suppose that .for the system ~ = f ( x , t ) , there exist a scalar Cl-smooth .function V(x) and a continuos .function ~(x) > 0, such that (I) There exist class 1C .functions ~1,~2 such that V x
~Cl(IXl) ~ V(x) ~ ~2(Ixl)
(8.74)
(II) Along any solution x(t) of the system ~r = f ( x , t ) the following relation holds
dv(x(t)) < -~(x(t))
(8.7~)
limsup I-~xx~] - 0
(8.76)
(Ill)
Ixi-~ ~(x)
Then,
a) A n y solution x(t) o.f the system = f(x, t) + g(t)
(8.77)
where g(t) -~ 0 as t --~ cxD, tends to zero as t --* cx~.
b) Ve > 0 there exist a 5 > 0 and a T** = T**(g(t),e) _> 0 such that ff Ix01 < 5, t h e n V t _>T**: [x(t, x0)] < e. P r o o f . Given g(t). Suppose that Ix(t)[ does not tend to zero as t --~ oo. Then there are two possible cases: limsup Ix(t)l --, oo or [x(t)l < R. t~o<)
1) Let limsup [x(t)l --* cxz. Choose any e > 0. T h e n 3 T > 0 such that $--* oO
Vt > T : [g(t)l < e. Due to the assumptions, the derivative of V(x) along any solution of (8.77) satisfies the inequality
_~V(x(t)) ~ - a ( x ( t ) ) + ov(~)., ~gt~ )
(8.78)
Due to the assumption 8.76), there exist a N > 0 such that
Oy(x)
J
< ~(x) A~
Vx e ~ \ B ( 0 ,
N)
(8.79)
150
8. Nonlinear Observer Design for Integration of DGPS and INS Therefore, Oy(x) c Ox 1 < - a ( x ( t ) ) ( 1 - ~) = - l a ( x ( t ) ) Vx 9 ~ n \ B ( 0 , N)
dv(x(t)) < -~(x(t)) +
(8.80) Due to Lemma 8.1 and (8.74), x(t) is bounded. The contradiction proves that Ix(t)[ _< R. 2) Let [x(t)[ be bounded, and assume that x --~ 0 as t ---* ~ . Take any small 5 >0. If x --~ 0 as t -~ oo, 3 {T ,~}n=l, Tn ---* +c~ such that Ix(T~)[ > 5 >0, :: Vn_>l
(8.81)
Due to (8.74) there exist a 51 > 0 and a 52 > 0 such that
E{]x]~_52}cE{V(x)<_51}cE{[x[<_5}
(8.82)
Choose 1 e= ~
0V(x) ]-1 max j [ min a(x)] ~<[x(t)[
and a T* = T*(e) >_ 0 such that Vt >_ T* : ]g(t)] < e. Again using (8.78), we have that Vt _> T*, when 52 < Ix(t)l < R dv(x(t))
_< - c ~ ( x ( t ) ) + 0~(xX) g(t )
0V(x) < - ~ ( x ( t ) ) + --5-s ~
___- 2 ~ ( x ( t ) )
(8.83)
Taking advantage of Lemma 8.1 and Remark 8.1, we conclude that there exist T** > T* such that the solution x(t) will belong to E {V(x) < 51}
(8.84)
This is a contradiction to (8.81)-(8.82), and thus Ix(t)] --* 0 as t --* oo. This completes the proof of part a). 3) To prove b), it is necessary to point out that due to Step 1, if the initial condition x0 belongs to E {Ix01 < 5} then there exist a R5 such that
x(t, x0) e E{IxI _< R~} Vx0 9 E{Ix0[ _< 5}, Vt > 0
(8.85)
8. Nonlinear Observer Design for Integration of DGPS and INS
151
Choose any small e > 0. Due to (8.74) there exists a 61 :> 0, and a 52 > 0 such that
E{Ixl _< 62} C E{V(x) < 61} C E{Ixl < ~}
(8.86)
Fix any 6 > 0. Then following Step 2, there exists a T* : Vt > T*, 62 < Ix(t)] < R5 where the relation
tV(x(t))
(8.87)
is valid. Then,
V(x(t)) -- V ( x ( T * ) ) +
~'(x(r
_< V ( x ( T * ) ) - ~1( t - T*)ao where
c~ o =
(8.88)
min a(x). 52
The inequality (8.88) implies that any solution x(t) starting with E { I x I < R~} at time t = T* should reach the set E { I x I _< 52} at least at time T** -- T* + 2 max V(x) OLO1 Ixl_
(8.89)
Otherwise, it will contradict with the positive definiteness of V. Due to Lemma 8.1, the set E { V ( x ) < 51} is invariant, and due to the inclusion (8.86) Vt > T**
x(t) e E{lx I _< ~} This completes the proof of part b)
R e m a r k 8.2 ( Q u a s i - E q u i A s y m p t o t i c S t a b i l i t y ( Q E A S ) ) The results of Lemma 8.2 guarantees that the equilibrium point x -- 0 of the .forced system ~ = f ( x , t ) + g(t) is quasi-equi asymptotically stable in the large, as de.fined by Lakshmikantham et al. [2]. The next theorem considers the convergence and stability properties of the system (8.68): T h e o r e m 8.2 ( Q E A S l - F r a m e O b s e r v e r ) For any solution of the 1.frame observer error equation (8.68), Ix(t)l --+ 0 as t ~ oo and Q E A S follows .from Remark 8.2.
152
8. Nonlinear Observer Design for Integration of DGPS and INS
P r o o f . Consider the unforced system • = f(x, t), where f(x, t) is given by (8.69). Let V be a Lyapunov Function Candidate for ~ = f(x, t): 1 1 v= ~(~)r~+ (5e)rOe+1 Gv~,2
(8.90) It is easily seen that the conditions on V in Lemma 8.2 are satisfied. T h e derivative of V along the system trajectories is:
= ( ~ ) ~ v ~ + (~e)%~
+~
+ ~j
+ ]] +
~
[I~ / (/~2aimu q- ~)2) --(2S(w~e)'1- S(w/e/))~.l K2~r (~r + (f)e)T [R~9/ _ K3~e]
(-7r T _ _
I~T [T411~2 + (R~)T9I] _gTdiag(aim,)(t?t~)T9t
-- flT T T (aimu) (I:tlb) T ~rl -- gTw51g - ~T T61 ~
--(t71 +
k4)]--
f T --
k5"c2 + T/
= --(,~l)TK2Cct _ ( ~ ) T K 3 f ~ -- 1~2TT411~2 - g T w 5 1 g - ~TT61 ~ -- (t~-1 + k 4 ) f 2 - k5? 2 =-a(x)
< OVx r 0
(8.91)
where T have been designed such that
/~2aimu -- T(aimu)fl - diag(aimu)e = 0
(8.92)
Thus, Condition II in Lemma 8.2 is satisfied. In addition it is seen that Ov limx__.oosup~
= 0
(8.93)
Hence, Conditions I III of Lemma 8.2 are satisfied, and it follows that ]x(t)l ~ 0 as t --* c~ and therefore the system is QEAS. 9
Earth Frame Formulation The update equations for position and clock bias are the same as for the local frame observer. For velocity and frequency biases, we now have the update equations:
[ ]
(8.94)
8. Nonlinear Observer Design for Integration of DGPS and INS
153
The Earth frame velocity and position update equations (8.17)-(8.18) with bias, scale-factor and misalignment errors included are written: /,e = R~R~ [(I + A2)aim u + 52] + a~/g I - 2S(wee)V e
(8.95)
pe ~ V e
(8.96)
The proposed observer for (8.95)-(8.96) is: re ~ R~R~ [(I + Az)aim u + ~)2] + R ~ g t - 2S(~
~re
+ K29 ~ + ~
(8.97)
1~e = 9 ~ + K3~ ~
(8.98)
b2 = - T 4 1 6 2 + (I:~lb)T(R~)Tge
(8.99)
= - T 5 1 ~ + diag(aimu) (I?t~)T(R~)T9 ~
(8.100)
---- -W61/~ -{- "~T(aimu)(I:tlb)T(al)T~ce
(8.101)
) = -t
(8.102)
V + kj + +
~- ----f + k5"Y
(8.103)
and the error dynamics is: V ~---R~l~ / /~zaim u + ~)2 + R ~ E [(I + i z ) a i m u + bz]
2S(w~e)9 e - K29 e - ~ p~e ~ T e - K3~ ~
(8.104)
b2 = - T 4 1 b 2 - (I~)T(R~)T9 *
(8.106)
-
(8.105)
= - W 5 1 g - diag(almu)(R~)T(R~)T9 ~ =-T61/~ -
TT (aimu)(Rb) ^l T e T~e (R,) v
(8.107) (8.108)
) = - - t T l f - k n f i - 7-
(8.109)
~- = f -
(8.110)
ks?
C o r o l l a r y 8.1 ( Q E A S e - F r a m e O b s e r v e r ) Any solution of the e-frame
observer error equations (8.104) (8.110) will tend to zero as time goes to infinity and QEAS .follows .from Remark 8.2. P r o o f . The proof is analogous to that of Theorem 8.2. We write the systern (8.104)-(8.110) on the form ~ = f(x, t ) + g ( t ) . The following Lyapunov Function Candidate for the unforced system ~ = f(x, t) satisfies the conditions on V in Lemma 8.2: V = l(~.re)Tge-{I _T_
+~e
l(f~e)Tf)e+l]i)2T]i)2
1-T -
e+~3
1-2
3+~f
1_2
+~r
(8.111)
154
8. Nonlinear Observer Design for Integration of DGPS and INS
The derivative along the system trajectories is
--
~TT51 ~ _ / ~ T T 6 1 / ~
--
(t71 q- k n ) / 2 - k 5 ~=2
= - a ( x ) < 0, Vx ~ 0
(8.112)
Further arguments are the same as in Theorem 8.2, and it can be concluded that Ix(t)[ -~ 0 as t --* o c . . . L O S Formulation
In this section, we will show how the velocity equation can be transformed to pseudorange and deltarange differential equations. This makes it possible to use raw GPS measurements directly to update the observer velocity equation. The outputs of the observer are pseudoranges and deltaranges. The vehicle position and velocities can be calculated from (8.6) and (8.9), or from other algorithms that doesn't require linearization. The equations (8.95) (8.96) can be transformed to satellite LOS space by defining ~ = 5 + 6sat= Aeve+6sat, and further that 5 = Av e + A~9 ~. Note that the dynamics of the 5 in this case only is due to the dynamics measured by the IMU. This gives the system: = ACR~R~ [(I + n2)aim u -{- b2] q- A r - (2Ar = 6 + G~t
-
hr162
' (8.113) (8.114)
where 5 E !~n, and p E !~'*. R e m a r k 8.3 I,VI = 1 (i = 1, ..., n). Hence, ~,~ will mainly be caused by the angular motion due to the movement of the satellites. It is therefore a very small term, but it will have significant effect when integrated over a long period of time. Ae can, however, be assumed to be known. R e m a r k 8.4 Note that when the number of satellites the receiver is locked onto goes below .four, the matrix A e should keep the L O S vector(s) o.f the last satellite(s) that .fell out until there is .four satellites in view again. If this is not done, position and velocity solutions cannot be computed. Satellite orbits can be predicted accurately .for hours after the last satellite orbit data download. The update part is now simply [ t5 ? ] = [ IAi J ( p g p ~ - j S - 4 ) f ] = [ (A~)~. ] @ g p s - ~ t - - 5 - - f )
(8.115) (8.116)
8. Nonlinear Observer Design for Integration of DGPS and INS
155
The following observer is proposed for (8.113)- (8.114):
A2)aimu
A~D~fil [
1~2] + ~ ~ttg -
(8.117) (8.118) (8.119) (8.120)
+ K26 + fi
b=
+ 5sat + K3t5 -T41112 § (I{Ib)T(R~)T(Ae)T6
i= - T 5 1 g § diag(aimu)(Rlb)T(a~)T(Ae)T6 -T61~
+
TT(aimu)(I{lb)T(al)T(Ae)T6
- t ; 1] + h i +
(8.121) (8.122) (8.123)
~-
~-= f + ks~ The closed-loop system is: ~ : AeR[t?tlb [/~2airnu § 112] § A~R~E [(I + h2)aimu § 52] - (2A~S(w[~) - h~)(Ae)t6 - K25 - t5
(8.124)
= 3 - K3Z
(8.125) (8.126)
f,2 : - W 2 1 f~2 - ( R ~ ) T ( R [ ) r ( A ~ ) r [
e : - T 5 1 ~ - diag(aimu)(IZtlb)T(R~)T(Ae)T6
(8.127) (8.128)
(8.129)
= --t7 l f - k4f -
(8.13o)
C o r o l l a r y 8.2 ( Q E A S O b s e r v e r for LOS F o r m u l a t i o n ) Any solution o.f the observer error equations (8.12~)-(8.130) will tend to zero as time goes to infinity, and QEAS .follows .from Remark 8.2. P r o o f . The proof is also here analogous to that of Theorem 8.2. We write the system (8.124)-(8.130) on the form ~ = f(x, t) + g(t). The following Lyapunov Function Candidate for the unforced system ~ = f(x, t) satisfies the conditions on V in Lemma 8.2:
V = 16T5 + IpTp+1[~2T[~22 2 1 -2 + + I [ T [ + -1~- T /3+-~f -
2
1 -2
-~r
(8.131)
156
8. Nonlinear Observer Design for Integration of DGPS and INS
{D
I~1c
5 ~-
~ %
I
o 0.5
I
O ~
20
40
!
I
.................
~-0 5
0
60 time[s]
80
1 O0
120
60 time[s]
80
1 O0
120
"
20
40
~' 0.02
o.o15.
. . . . . . . . . . . . .
-
o t~
............
i
0.01 . . . . . . . . . . .
. . . . . . . . . . .
. . . . . . .
:. . . . . . . . . . . . . . . . . . .
.....
i
:
:. . . . . . . . . . .
. . . . .
............
i
"
0.005
v(
20
40
60
80
1 O0
120
time[s]
F I G U R E 3. The figure shows the estimated position, velocity and acceleration bias errors. Differentiating V along the s y s t e m trajectories gives F
3
= - 3 T [K2 + ( 2 A e S ( ~ o ~ ) - h~)(A~)tJ
3-
fiT K afi
-- ]~2T4117)2-gTTsl~ " -/~TT6-1/~ -- (t71 4- k 4 ) / 2 - k s - F 2 =
-~(x)
(8.132)
If the gain matrices are chosen according to: K2 - (2A~S(w~i) -
he)(A") * > 0
(8.133)
K3 > 0
(8.134)
k4, k5 > 0
(8.135)
this results in: l/=-a(x)<0,
Vx#0
(8.136)
F u r t h e r a r g u m e n t s are the s a m e as in T h e o r e m 8.2, a n d it can be concluded t h a t Ix(t)l ~ 0 as t --~ oo. 9
8. Nonlinear Observer Design for Integration of DGPS and INS
157
0.03 -~- 0.02 E
~. o.ol .m
uJ
0 -0.010
20
40
60 time[s]
80
1O0
120
5x103 , ~9
~4 ~3
...........
i
. . . . .
9
i . . . . . . . . . . . . . . .
. . . . . . . . . .
i. . . . . . . . . . . .
i ...............
i. . . . . . . . . . . . .
9. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
0
c5
0
20
40
60 time[s]
80
1O0
120
F I G U R E 4. The top figure shows E. The maximum error corresponds to about 2 (deg) heading error. The lower figure shows the error of the estimated gyro bias.
5 Case Study T h e / - f r a m e observer have been simulated on a simple ship t r a j e c t o r y w i t h r a t h e r slow dynamics. T h e scale factor observability a n d misalignments are best u n d e r highly d y n a m i c conditions, while biases are observable also at low speed maneuvers. T h e accelerometer bias was set to 0.5 mg, a n d t h e gyro bias was set to 0.06 deg/s, for all three axes. Thus, the sensors are at the lower end of the price range. Accelerometer a n d g y r o m e a s u r e m e n t s were available at 50 Hz, while G P S position, velocity, a t t i t u d e a n d h e a d i n g m e a s u r e m e n t s were available at 5 Hz. T h e following scenario is simulated: At time t = 0 each bias is set, a n d there is no u p d a t e from GPS. At t = 40 s, the G P S reference m e a s u r e m e n t is enabled, while at t = 60 s, bias estimation is started. At t = 80 s, the G P S reference falls out again and the bias e s t i m a t o r is stopped. F r o m Fig. 3 we can see t h a t the position, velocity, a n d bias errors t e n d to zero, a n d t h a t the position a n d velocity errors of the free r u n n i n g INS diverge slower after the biases are estimated. Fig. 4 shows a similar result for the a n g u l a r velocity observer. This is one of the m a i n reasons for e s t i m a t i n g biases, because it enables the s y s t e m to handle longer periods of G P S absence.
158
6
8. Nonlinear Observer Design for Integration of DGPS and INS
Conclusions and Future Work
In this chapter, nonlinear observers for integration of GPS and INS position velocity and attitude have been presented. The angular velocity observer was proven to be exponentially stable, while the position and velocity observer was proven to be Quasi-Equi Asymptotically Stable in the large. This guarantees that all estimation errors tends to zero. The position and velocity observer have been designed for three different coordinate frames, making it possible to use the observer in different system architectures. Simulations have shown that the observers perform well. Future work will include more extensive comparative simulations as well as experimental validation.
Acknowledgements This work was sponsored by Navia Maritime, Division Seatex and the Norwegian Research Council. The authors are grateful to Dr. John-Morten Godhavn at Navia Maritime for his comments on nonlinear observer design for GPS/INS navigation systems. The first author is also grateful to Halgeir Ludvigsen at the Department of Engineering Cybernetics, NTNU for commenting on the nonlinear stability analyses. 7
REFERENCES [1] K. R. Britting. Inertial Navigation Systems Analysis. Wiley Interscience, 1971. [2] V. Lakshmikantham, V. M. Matrosov and S. Sivasundaram. Vector Lyapunov Functions and Stability Analysis of Nonlinear Systems, Kluwer Academic Publishers, 1991. [3] Z. H. Lewantowicz and D. W. Keen. Graceful Degradation of GPS/INS Performance With Fewer Than Four Satellites. Proceedings of the Institute of Navigation, National Technical Meeting, 1991. [4] P. S. Maybeck. Stochastic Models, Estimation and Control, Vol. 1. Academic Press, New York, 1979. [5] B. W. Parkinson and J. J. Spilker (Eds.). Global Positioning System: Theory and Applications, Volume L American Institute of Aeronautics and Astronautics, Inc., Washington DC, USA, 1995. [6] S. Salcudean. A Globally Convergent Angular velocity Observer for Rigid Body Motion. IEEE Transaction on Automatic Control, T A C 36:1493-1497, 1991
8. Nonlinear Observer Design for Integration of DGPS and INS
159
[7] P. G. Savage. Strapdown Inertial Navigation, Lecture Notes. Strapdown Associates Inc., Minnetonka, MN, USA, 1990. [8] W. Tang and G. Howell. Integrated GPS/INS Kalman Filter Implementation Issues. Proceedings o.f the ION GPS-93, pp. 217-224, Salt Lake City, Utah, USA, 1993. [9] D. A. Tazartes and J. G. Mark. Integration of GPS Receivers into Existing Inertial Navigation Systems. Journal of the Institute of Nayigation, JIN-35:105-119, 1988. [10] D. H. Titterton and J. L. Weston. Strapdown Inertial Navigation Technology. IEE, London, UK, 1997. [11] V. A. Yakubovich. Method of Matrix Inequalities in the Theory of Stability of Nonlinear Control Systems. Part I: Absolute Stability of Forced Oscillations. Journal of Automation gd Remote Control, JARC-7:1017 1029 (English translation pp. 905-917), 1964. [12] M. Wei and K. P. Schwartz. A Strapdown Inertial Algorithm Using an Earth-Fixed Cartesian Frame. Journal of the Institute of Navigation, JIN-37:153-167, 1990.
Variants of N o n l i n e a r N o r m a l Form Observer Design J. Schaffner 1 and M. Zeitz 2 1Institute for Systems, Informatics and Safety, European Commission Joint Research Centre, Ispra, Italy 2Institut for Systemdynamik und Regelungstechnik, University of Stuttgart, Germany
1
Introduction
Normal form coordinates are a very appropriate basis for analysis and design of nonlinear systems. But the transformation of physical coordinates of a nonlinear system into a normal form representation is requiring very restrictive assumptions [10, 24]. Therefore, various modifications of nonlinear normal form approaches have been developed in order to enable an extended application of the normal form coordinates. In this paper, three variants of the nonlinear normal form observer design are presented in the development of which the authors have been involved. The normal form observer design is in some sense dual to the exact inputstate linearization (confer [8, 1, 9, 7, 21] or see text books [6, 12]). In the next section, this design method will be recapitulated for nonlinear single output systems E without input E:
x=f(x),
x(0)=x0,
y=h(x),
xeMc~:t
'~, y e Y C ~ : t
1 (9.1)
where f , h e C ~ , f ( 0 ) -- 0, h(0) = 0. The observability properties of (9.1) required for the considered observer design variants are related to the observation space O or the observability map
M
-'~
j~n:
q(x) = h(x), L / h ( z ) , . . .
, L~}-lh(x)
(9.2)
and the observation codistribution dO or the observability matrices
dq(x) =
dh(x) Lfdh(x) . L~}_l dh(~)
cT cTA
, dq(O)=
with A = Of(O)
.
Ox
cTA '~-1
cT = Oh (o1
'
(9.3)
The injectivity of the map q(x) and the regularity of the matrices dq(x) and dq(O) are used to define the observability of system (9.1) and of its Taylor linearization (A, cT), respectively.
162 2
9. Variants of Nonlinear Normal Form Observer Design Normal
Form
Observer
The design of a nonlinear observer becomes very simple, if the system (9.1) can be represented by means of the state transformation x* = ~ ( x )
(9.4)
in the observer normal form coordinates [6, 12]: E*:
5:*=A*x*+c~(y),
s*(0)=O(s0)---s0,
y = c * T s * = x 1. (9.5)
This form is characterized by the matrices (A*, c *T) in dual Brunovsky form and by the output injection c~(y). The coordinates s* allow the design of a nonlinear observer with an exactly linear error dynamics. This property is related to the output dependent vector field c~(y), which is reproduced exactly in the observer equations + l*. ( y -
s"* = A*5:*
with
c*Th: *) , 5:*(0) = s0, ^ * 5: =
l*:Re{A/(A*-/*c*T)} <0,
i=l,...,n
.
(9.6)
The normal form observer E* comprises a dynamical part and the algebraic transformation 9 -1 from coordinates 5:* to original coordinates 5:. The design of the constant gain vector l* is based on the linear observer error dynamics E* : x = ( A * - / * c *T) S , 5:*(0) = x 0 - S ; , such that E* possesses eigenvalues with negative real part. The normal form observer design is applicable, if the transformation (9.4) does exist and can be determined from the first order partial differential equations (pdes) ~'*
0,I, ~ S
[adn]lr(x),. --
~.
""
r(x)=dq-~(s)[O,...
^.
,adl]~'(x),r(x)] --
,0,1] T,
= In
In-(nxn)
with
(9.7)
unity matrix.
(9.8)
The n 2 pdes (9.7) for the components Oi(s),i = 1 , . . . ,n have state dependent coefficients, which are formulated by means of T(S), i.e. the last column of the inverse d q - l ( x ) of the observability matrix (9.3). The following two conditions for the existence of the diffeomorphism O(x) are corresponding to the existence of the vector v(x) in (9.8) and to the integrability of the pdes (9.7) [6, 12]: T h e o r e m 9.1 ( E x i s t e n c e of N o r m a l F o r m O b s e r v e r ) The state transformation (9.5) of system (9.1) into the observer normal form (9.5) exists and can be calculated from (9.7), (9.8), i.f and only if r a n k d q ( s ) = n,
j )] = 0 ,
a d f T ( x ) , a d ~-(s
(9.9) i,j=0,1,...,n-1.
(9.10)
9. Variants of Nonlinear Normal Form Observer Design
163
The rank condition (9.9) is sufficient for the local observability of the system (9.1), which is satisfied in most cases. The commutativity condition (9.10) is yet very restrictive, especially for higher order systems. Moreover, an analytical solution of the pdes (9.7) for n > 2 is often rather difficult to find. However, the elegance of the normal form observer design and of the exact linearization of the observer error dynamics, respectively, have caused a lot of investigations to loosen these restrictions. In the next sections, three variants of the nonlinear normal form observer design are presented, which extend its application to a wider class of nonlinear systems. The continuous (normal form) observer [22, 25] can be applied if the observability rank condition (9.9) is not fulfilled, but the system has an injective observability map (9.2). If the commutatitvity condition (9.10) is violated, it is worth considering the design of an extended Luenberger observer [1, 23, 2, 3]. This approach is similar to the extended Kalman filter and requires only the observability rank condition (9.9). The third variant concerns the block-triangular (normal.form) observer [16, 17, 18, 19], where the presence of multiple outputs is used to decompose the original system in subsystems that have to be connected in a cascade. This structure allows a decentral (normal form) observer design.
3
Continuous Observer
Most state estimation methods for smooth nonlinear systems (9.1) are smooth and require that the Taylor linearization (A, c T) of the system is observable (detectable)[20], i.e.: rank dq(O) = n.
(9.11)
Under the assumption that the observability rank condition (9.9) of Theorem 9.1 is fulfilled, normal form observers (9.6) are even generically smooth. If condition (9.9) is violated, a continuous normal form observer may still be possible. The potential of continuous observers was pointed out by Krener in [11], where the following example has been explicitly constructed to show the non-existence of a smooth observer and the intuitive set-up of a continuous observer: E:
~=Ax,
y=x3;
~:
~--A2+I.(~r~-2)
with I > A .
(9.12)
The first order system E has no smooth observer, because the Taylor linearization (A -- A, c = 0) is not observable, i.e. rank dq(O) = O. But E is in fact observable, also in the origin, because of the injectivity of the observability map q(x) = x 3, q - - l ( y ) ~___~/~. The constructed observer ~ has an asymptotically stable error dynamics ~ : x ---- (A - l)2. The first systematic design m e t h o d of continuous observers for a class of nonlinear systems was developed in [22]. The systematics includes the
164
9. Variants of Nonlinear Normal Form Observer Design
continuous normal form observer as a special case [25]. The design method of continuous observers is based on the smoothness properties of the observability map q E C ~ and its inverse q-1 E C O as in Example (9.12). In [22], such a map is named as a semi-diffeomorphism: D e f i n i t i o n 9.1 ( S e m i - D i f f e o m o r p h i s m ) A smooth map 9 : F~'~ --~ Ktn with a continuous inverse 0 -1 : Kt '~ --* Ktn is called a semi-diffeomorphism. It is easy to see that the scalar map O(x) = x p (p > 1, odd) is a semidiffeomorphism, but not a diffeomorphism. If 9 is a semi-diffeomorphism, denoted ~ = O(x), then one can formally transform system (9.1) into :
~ - X f O cI)--l(~) : ? ( ~ ) ,
~ ( 0 ) : (I:~(X0) = ~0,
Y= hor
----h(~) (9.13)
where ] and h are smooth or continuous. The uniqueness of the solution ~(t;~0) requires that the components .~(~), i = 1 , . . . ,n are Lipschitz continuous. Under this condition, the trajectories of the two system representations (9.1) and (9.13) are equivalent. This has led in [22] to D e f i n i t i o n 9.2 ( T r a j e c t o r y - E q u i v a l e n c e ) B y m e a n s o.f the semi-diffeom o r p h i s m ~ the s y s t e m (9.1) is trajectory-equivalent to the transformed s y s t e m (9.13), if the latter has the property of uniqueness of solutions: x(t;xo) = ~-l(~(t;~(x0)))
and ~(t;~o) = ~ ( x ( t ; ~ - l ( ~ 0 ) ) )
(9.14)
Trajectory-equivalence of the two system representations can be used for the design of an observer in order to study its convergence in the most appropriate coordinates. This is the key idea of the proposed design method of continuous observers [22, 25]. The design coordinates for the continuous observer are defined by the semi-diffeomorphic observabihty map (9.2), by which the system (9.1) is transformed into observability normal form [10, 24, 6, 12]:
-_
with
, ~ ( 0 ) = q ( x o ) = ~0,
Y --:/t(~) = r
(9.15)
(i)
~(~) ----L ~ h o q - l ( ~ ) E C ~, i > 0
(Lipschitz-C~
(ii)
~(t;~o) = q(x(t; q-l(~o))) , x(t; xo) = q-l(~(t; q ( x o ) ) ) (trajectory-equivalence of E and E ) ,
(iii)
~/(~) ----~, d~/-- I,~ (E - structurally observable).
9. Variants of Nonlinear Normal Form Observer Design
165
The properties (i)-(iii) permit the design of a continuous observer with the following structure E:
~ = ] ( 5 ) + 1(5, Y) with l(~, y) = 0, 5(0) = 50, 5~ ---- q - l ( 5 ) . (9.16)
The continuous observer E comprises a dynamical part and the algebraic transformation q-1 from coordinates 5 to original coordinates 5. At least the algebraic part is C~ Therefore, no differential equation exists for the estimation ~(t). Due to the observability property (iii) of E, the design of the injected correction 1(5, Y) is possible with an arbitrary method, where only the particular form of f (~) or ~(~), respectively, must be considered. The desired convergence 5(t) ---* ~(t) can be obtained locally by a linear weighting of the difference between the output and its estimation: l(5, Y) -- I. (y - ~1), l 9 ~ . Summing up, it can be stated that: T h e o r e m 9.2 ( E x i s t e n c e o f C o n t i n u o u s O b s e r v e r ) A continuous observer ~ (9.16) exists, i.f the system (9.1) possesses a semi-diffeomorphic observability map q C C a , q-1 E C O and a Lipschitz continuous obse~'vability normal form E (9.15). The design of a continuous observer is requiring two steps. At first the observability map (9.2) and the observability normal form (9.15) are determined. Then the convergence of the dynamical part of the observer must be guaranteed by a suitable choice of the correction vector 1(5, Y)If these design steps are applied on Krener's example (9.12), a different continuous observer is obtained E:
~=3A~+/.(y-~)
with / > 3 A ,
~=(~)1/3
(9.17)
with a dynamical part in the coordinates of the observability normal form : ~ = 3A~, y = ~. In this special case, E is linear which enables a simple design of the observer gain [. Krener's example (9.12) can also be used to illustrate a first application of the proposed design of continuous observers. For the parameter A : 0, system (9.12) presents a so-called critical observation problem, which is in a loose sense dual to the critical stabilization problem [4, 11]: D e f i n i t i o n 9.3 ( C r i t i c a l O b s e r v a t i o n P r o b l e m ) The critical observation or observer design problem is characterized by non-observable (nondetectable) modes of a marginally stable Taylor linearization (A, c T) having eigenvalues with zero real part: Re(A~(A)} = Re{A~(A - / c T ) } = 0, i e {1,... ,n} Vl 9 ~ .
(9.18)
Of course, both continuous observers E in (9.12) and in (9.17) also represent a solution of the critical observation problem.
166
9. Variants of Nonlinear Normal Form Observer Design
The second example of a critical observation problem (9.18) E:
5~= [ Xlx~x~ ]
y~xx
(p>l'~
(9.19)
is leading to the design of a continuous normal form observer of second order. In the Taylor linearization (A = O,c T -= [1,0]), the mode x2(t) is marginally stable and non-observable, i.e. represents the critical mode. By means of the semi-diffeomorphic observability map ~ ---- q(~) ---[Xl , X p2J| T , x = q - l ( ~ ) = [~1, ~l/v]T , the observability normal form (9.15) is obtained P~1~2
'
Y = ~t
(9.20)
which is smooth. Moreover, the representation E fulfills both conditions of Theorem 9.1 for the transformation x* = (~(~) into the observer normal form (9.5): d~/(~) = I2
-~ r(~) -- [0, 1]T --~ [T(~),ad/T(~)] = 0.
A solution of the four pdes (9.7) is x* = [41, ~2 - P~2/2] T, by means of which the observer normal form E* : 5~* = [x~ +py2/2, 0] T, y = x~ can be derived from (9.20). These coordinates allow an exactly linear design of the dynamical part of the continuous normal form observer with constant gains l~,2 > 0
l~ (y
~*~)
'
(s~ + pS~*~2/2)l/p
"
The algebraic part comprises the transformation ~ = q-1 o ~ - 1 ( ~ . ) from ~* via observability coordinates ~ to original coordinates ~ and is continuous due to q-1 E C ~ This continuous normal form observer design explained for Example (9.19) means an extension of the smooth normal form observer (9.6), since the observability rank condition (9.9) is no longer required in original coordinates:
T h e o r e m 9.3 (Existence of Continuous N o r m a l Form Observer) A continuous normal.form observer (9.16) with an algebraic part ~z ----q - l o
(~--1(~.)
e x i s t s , if and only if the obse~vability normal .form (9.15)fulfills the commutativity condition (9.10).
The design of a continuous normal form observer is rather similar to the two-step transformation into the observer normal form E* via the observability normal form E as proposed by Keller [7] and by Phelps [13]. Yet in those contributions, the observability rank condition (9.9) is required in order to obtain a diffeomorphic transformation x* = ~ o q(x). Thus, the
9. Variants of Nonlinear Normal Form Observer Design
16T
smooth normal form observer (9.6) can be seen as a special case of the continuous one. Of course, the continuous (normal form) observer design can be applied also to systems with multiple outputs and with smooth inputs [25].
4
Extended Luenberger Observer
The rather restrictive commutativity condition (9.10) and the analytical solution of the pdes (9.7) in course of the normal form observer design are bypassed by using the extended Luenberger observer [1, 23, 2, 3]. This observer has a classical structure consisting of a simulation part and an injected difference of the measured and the estimated o u t p u t weighted by an 5~-dependent gain vector l(~) E : ~ - - - f ( ~ ) + / ( 5 ~ ) . [ y - h(~)], ~(0) = 5~0 with
(9.22)
l(Jc) = {~9oad~ + . . . + p,~_ladn_f I + ad~_f] o T ( x ) } / Z ( x ) ,
(9.23)
T ( ~ ) = dq-l(~c) [0,... , 0,/3(i)] T .
(9.24)
The nonlinear Ackermann formula (9.23) for the gain vector l(~), which contains the dynamical parameters Pi, i = 0 , . . . , n - 1 and the functional degree of freedom ~(~) # 0, is derived from the observer error dynamics
E:
x=f(5:)-f(x)+l(~).[h(x)-h(Jc)],
@(0)=~0-x0.
(9.25)
In a first step, ~, = E - E is brought into the observer normal form representation E* = E* - E*. For this operation, the state transformation x* = O ( x ) defined by (9.7), (9.24) and the o u t p u t transformation
Y* = "T(Y) = c*Tx* = x~
(9.26)
are applied to (9.1) and (9.22). The inverse output transformation ~/- l(x~) is related to the functional degree of freedom in (9.23), (9.24) by /~(5~) = d ~
~
(9.27)
In the second step to derive the formula (9.23), the observer error dynamics transformed into normal form coordinates x* -- A*~* + c~o'y-l(~) - t~o'/-l(x~) + l*(~*). [~-l(x~) - . y - l ( ~ ) ] , ~*(0) -- ~ - x~ /*(x*) ---- -~x
with l o V(5;*),
(9.28) \~--~x]
~(x) = ad~-fT(~)
168
9. Variants of Nonlinear Normal Form Observer Design
is linearized along the estimated trajectory ~ ( t ) -- Ol(~(t)): L*
E*:
x = ( A * - pc*T)~ * + O2([5~*[)
with
(9.29)
det (AI,~ - A* + pc *T) -- Po + p l A + ... + pn--1)~ n-1 + )~n. For this, the equations (9.7), (9.23), (9.24) and (9.27) have been inserted into (9.28). By choosing the dynamical parameters p = [Pn-1,... ,po] w in (9.29) appropriately, the eigenvalues of the matrix (A* - pc *T) can be assigned in a desired manner. The development of the design formula of the extended Luenberger observer (9.22)-(9.24) is leading to the following: D e f i n i t i o n 9.4 ( E x t e n d e d T a y l o r L i n e a r i z a t i o n ) The extended Luenberger observer (9.22)-(9.24) has an extended Taylor linearization (9.29) of the error dynamics (9.28) in observer normal form coordinates, which can be stabilized by an eigenvalue assignment. T h e o r e m 9.4 ( E x i s t e n c e of E x t e n d e d L u e n b e r g e r O b s e r v e r ) The extended Luenberger observer (9.22)-(9.24) exists, if the system (9.1) .ful.fills the observability rank condition (9.9), and if the observer error 5c*(t) is sufficiently small ([x*l << 1) allowing an extended Taylor linearization (9.29) of the error dynamics (9.28). R e m a r k 9.1 ( C h o i c e of ~(~)) By a suitable choice o.f the .function ~(~), the calculation (9.23) of the observer gain vector l(~c) can be simplified such that the recursive application of the ad-operator becomes as simple as possible. A computer-aided calculation o.f l(~) can be carried out easily using a symbolic programming language [3], e.g. MATHEMATICA o r MAPLE. The design of the extended Luenberger observer for systems with inputs and multiple outputs is shown in [2, 3]. Compared to single output systems, multiple output variables Yi, i = 1,... , m, offer more degrees of freedom ~i(x), i -- 1,... , m. This fact can be used to simplify the calculation of the Ackermann formula (9.23) even more. The design of an extended Luenberger observer (9.22)-(9.24) is demonstrated for the second order polynomial system E:
~= I axl-bxix2
(xl,2>0),
y:x2.
For this system, the observability matrix (9.3) dq(x) =
[ 0 cx2
1 1 CXl - d
has full rank for cx2 r O. According to (9.24), the vector
(9.30)
9. Variants of Nonlinear Normal Form Observer Design
169
is calculated. It is simplified by choosing the function fi(~) --- ci2 # 0. Now the Ackermann formula (9.23)
l(~) = { [poad~ + pladl_f + ad2_i] o ={P0
01
[11 0
a -c~2b3c2 +
+Pl
} / fi(~) =
c3c2(a -
bJ?2)
(9.31) } / (c:~2)
and the corresponding observer equations (9.22) are containing less arialytical operations (by a factor of 5) than without a functional degree of freedom 3(5~) or without an output transformation (9.26), respectively. Moreover, the output transformation 7(y) = ! In y, which is calculated from (9.27) with fi(x) = cx~, allows the transformation x* = ~ ( x ) of system (9.30) into the observer normal form (9.5), since the commutativity condition (9.10) is fulfilled by v = [1,0] T and adfv = [(bx2 - a), -cx2] T. Hence, a normal form observer (9.6) does exist for Example (9.30): E*:
~*= [ ~ + ~=
}(alny-by-d) d(a--b2y)
+ l~'(llny-:~) ] + l~ (~lny ~ )
(9.32) '
[ ax~--bexp(cx~)+x~ ] exp(c:~)
The constant observer gains l~,2 can be specified by eigenvalue assignment for the linear error dynamics in normal form coordinates ~*
:
X
z
*
--12
0
.
For the gains l~" = Pl and l~ = P0, E* is identical with the Taylor linearization of the error dynamics of the extended Luenberger observer with the gain vector (9.31). Finally, a relation between the extended Luenberger observer and the normal form observer design for linear time-variant systems will be established [3]. Therefore, at first the observer error dynamics (9.25) is linearized. Then the extended Taylor linearization is transformed into normal form coordinates. This means a reverse order of the two operations compared with the derivation of the extended Luenberger observer gain formula (9.23). Under the assumption that the observer error ~(t) is sufficiently small (15~1 << 1), an extended Taylor linearization of observer error dynamics (9.25) aiong the trajectory 5:(t) yields E:
l(t)cT(t)] 5z+ O2([5~(t)[) A ( t ) = O f (~(t)) cT(t) ~Oh (~(t)) ~ = [A(t) -
with
l(t)~l(~(t))
(9.33)
170
9. Variants of Nonlinear Normal Form Observer Design
In order to design the time-variant gain vector /(t), the linear system (A(t), cT(t)) is transformed by x* = O(t)x into the time-variant observer normal form E*: x* = A ' x * + c~(t)y, a~*(0)=O(0)x0=x~, y=3(t)c*rx * =3(t)x;. (9.34) This form is characterized by the constant matrices (A*, e *T) in dual Brunovsky form, the time-dependent output injection a(t)y, and the timevariant degree of freedom 3(t) :/: O. Moreover, (9.34) is consistent with the nonlinear observer normal form (9.5), if there, additionally, the output transformation (9.26) is taken into account. The defining equations for the time-variant transformation matrix ~(t) are [15]: 9 (t) [ N ~ - l v ( t ) , . . . , Nlr(t),r(t)] = I n
r(t)=dq-l(t)[O,...,O,~(t)] T,
with
(9.35)
gAr=-~r+A(t)r,
(9.36)
or(t) dq(t) =
MAcT(t) .
,
MAtT = &T + cT A(t).
(9.37)
M~t-1 cT(t) These linear equations are corresponding to the nonlinear ones (9.7), (9.8) and (9.3), because the linear time-variant differential operators MA and NA can be represented as Lie derivatives LI and adf for the vector field f ( x ) = A ( t ) x applied to dh(x(t))~-cT(t) and r(x(t))~r(t):
MAcr(t)&LAxdh(x),
NAv(t)~adAxr(x).
Unlike in Theorem 9.1 for the existence of the nonlinear observer normal form (9.5), only the condition rank dq(t) = n or the observability of singleoutput systems (n(t), cT(t)), respectively, is required for the determination of the transformation matrix R,(t). As a result of the extended Taylor linearization and of the linear timevariant normal form representation of (9.25), the error dynamics (9.29) of the extended Luenberger observer is obtained, if the time-variant Ackermann formula [3, 15]
l(t) = {[p0NA~ + . . . +pn_lN] -a + N~] o r(t)}/~(t)
(9.38)
is inserted into (9.33). This formula is consistent with the nonlinear one (9.23), when the correspondence between NAr(t) and adAxr(x) is considered. The calculation of the gain vector l(t) as well as of the transformation matrix O(t) are essentially simplified by an appropriate choice of the function/3(t) ~ 0 in (9.34) and (9.36), which is similar to the nonlinear case.
9. Variants of Nonlinear Normal Form Observer Design
171
The time-variant Ackermann formula (9.38) has received a new application in course of the flatness-based tracking control, where a nonlinear observer with time-variant gains has to be designed based on the extended Taylor linearization along a reference trajectory [14, 15].
5
Block-Triangular Observer
Block-triangular observers [16, 17, 18, 19] are designed in a decentral approach, i.e. they are designed by considering subsystems. The decentral method uses degrees of freedom, which are generally available in the observer design for systems with multiple outputs E:
5~=f(x),
y=h(x),
x e M c 1 R n, y e Y C . l ~ p, p > l .
(9.39)
A survey of the literature on decentral observer design can be found in [19]. The first step of the decentral approach is to decompose the system (9.39) into single output observable subsystems P
Ei:
&i = f ~(x), yi=hi(x),
xi e IR ~, Yi e J~zt, E n i = n .
(9.40)
i=1
For the decomposition, the multiple output observability map r n z--1 l~
q(x)=
hl(x),Lfhl(X),...,~f
,q(x),... ,
.... hp(x),L, hp(x),... ,L p-lhAx)]
T
(9.41)
with
P i=1
is used where the parameters ni indicate possible subsystem dimensions. To ensure an observer design by subsystems, the single o u t p u t subsystems (9.40) must be observable by their related output Yi alone. However, in (9.41) there is a condition only for the sum of the subsystem dimensions ni. This fact is related to the general ambiguity, how a system can be observed by means of multiple outputs. Because of these degrees of freedom, there are usually several possibilities to decompose a system. Moreover, the decompositions may not only differ in the dimensions n~ of the subsystems but also in the type of the connections: Subsystems
a)
c)
b) Xl
Xl
Yl
FIGURE 1. Possible connections of two subsystems E1 and E2 [16, 17, 18, 19].
172
9. Variants of Nonlinear Normal Form Observer Design
(99 can be connected mutually by their states (see Figure la), or they can be connected in a cascade or block-triangular .form (see Figure lb), respectively9 A mutual connection by the outputs (Figure lc) is a special case of a block-triangular form. D e f i n i t i o n 9.5 ( B l o c k - T r i a n g u l a r F o r m ) A multiple output system rep-
resentation E/x: ~
= fA(z,y),
with z
=
[zT,...
Y ---- h A ( z ) , , Z pT] T , Z i = [Zil , . . .
~1A : Z1 --~ fllA(zl,Y2, . . . , Y p ) , i2 = 12 (zi, z2, y3,...,
Z/A : Zi ~" fli ( Z l , ' ' ' z p A : ~Sp ~- f p
( 1,
, Z inl] T ,
i
and
= 1,...,p
Yl --~ hlZX(Zl) yp),
,zi,Y~-I,''"
(9.42)
= ,Yp),
Yi
=
yp =
. ,Zp),
is called a block-triangular form9 The subsystems Eia of this .form are assumed to be locally observable by their related outputs Yi.
R e m a r k 9.2 ( O u t p u t T r a n s f o r m a t i o n ) A further degree of freedom to produce a block-triangular .form (9.42) is an additional output transformation y A __ ,,/(y) , by which a triangular state connection o.f the output equations can be considered [16, 17, 18, 19]. D e f i n i t i o n 9.6 ( B l o c k - T r i a n g u l a r O b s e r v e r ) The set of subsystem observers E ~ , i = 1 , . . . ,p which in a decentral approach are designed for a system in block-triangular.form (9.42) is called a block-triangular observer. The block-triangular decomposition is very suitable for a decentral observer design. This fact will be shown below for the special case of the blocktriangular normal form observer, which was introduced by Rudolph and Zeitz in [16]9 D e f i n i t i o n 9.7 ( B l o c k - T r i a n g u l a r O b s e r v e r N o r m a l F o r m ) A multiple output system representation with subsystems E~ A:
. z. ,i = A ~.z i. + ~ i A (zj~), Y~=Ci. T z~9 = z i *l ,
i,j=
l,
. ,p
(9.43) with (A*, c*T) in dual Brunovsky .form is called a block-triangular observer normal form.
The block-triangular observer normal form permits the design of an approxlm9
at
~.~:
e normal
~:
form
observer
. wlth
constant
gain. vectors
I i.A
.^.+ z~ . . . . z~ , .T^., = Aizi a i (zji) + t i "tYi -- Ci Z i ) .
(9.44)
9. Variants of Nonlinear Normal Form Observer Design
173
The error ~* -- ~* - z i* of this subsystem observer has an approximately linear dynamics L* E~/~: zi
(A*
-
,.A ti Ci, T ,) -Z*i
-
A
.
~-
~.
/X
.
C~ (Zj< i + Zj~) -- Oti (Zji) if it is guaranteed that the errors ~ < i of the preceding subsystem observers ~.A ~. j~ (or z~>i, respectively) of subsequent subsystems is excluded. This fact enables the decentral observer design. Since the decentral approach does not permit exactly linear subsystem error dynamics, in general, the conventional multiple output normal form observer, which is designed in a central approach, will be preferred. The structure of the multiple output observer normal form
y=C*x*=[x~l,...,X~l]
E*: :~*=A*x*+c~(y),
A * =
"
"..
0
...
"
,
C * =
Ap
T with
"
"..
"
0
...
CpT
(9.45)
and the (ni • n~)-matrices A* and (1 • ni)-vectors c *T in dual Brunovsky form is determined by the observability indices ni, confer e.g. [10]. However, the form (9.45) may as well be interpreted as a special case of a blocktriangular form (9.42) with a cascade of subsystems connected only by the outputs (see Figure lc). To obtain (9.45) from (9.39), a state transformation x* = O(x) (9.4) must exist and can be calculated [9, 6, 12] from the pdes ego Ox [adn} _ r r ~(x) ] i = 1,. . . , p ; r . . . .1,
7"i(x) from
Lv~Llyhj(x) = { O, ~j,
, nil = I,~ with
0 < l < nj - 1, l =nj-1;i,j=
l , . . . ,p.
(9.46) (9.47)
T h e o r e m 9.5 ( E x i s t e n c e o f M u l t i O u t p u t O b s e r v e r N o r m a l F o r m )
The state transformation (9.4) of a system (9.39) into a multiple output observer normal .form (9.45) does exist, if and only i.f rank d q ( x ) = n ,
(9.48)
[ad}Ti(x), a d ] r y ( x ) ] = 0
(9.49)
.fori, j = l , . . . , p ;
l=O,...,ni-1;
r=O,...
,nj-1.
174
9. Variants of Nonlinear N o r m a l F o r m Observer Design
If however the conditions (9.48) and/or (9.49) of the central approach are violated, it is worth checking whether a decentral approach will lead further. Yet, a multiple output system (9.39) seldom has a block-triangular observer normal form (9.43) in its original coordinates. A concept for the corresponding state transformation was introduced by Rudolph and Zeitz [16], which was further investigated in [17]. The starting point of this transformation is the system (9.39) in a multiple output observability normal form [10, 24, 6, 12]:
~inl ~(~,,. with
( =
~i(~1,'''
99 ,...
,~p) =
~p)
,(
, ( ~ = [~gl,
Lr~hi~ q - l ( ~ )
.. ,(~n~l r
(9.50)
.
It is determined by using the multiple output observability map ~ = q(x) (9.41), which for observable systems is diffeomorphic or injective. The further transformation from (9.50) into (9.43) is obtained in two steps that have to be performed successively for each subsystem [16]: (A) A state transformation z = q~(() from observability normal form (9.50) into block-triangular form (9.42). (B) A state transformation z* = (I,(z) from block-triangular form (9.42) into block-triangular observer normal form (9.43). Hence, the total state transformation from original to block-triangular observer normal form coordinates (and vice versa) consists of three parts: z*= ~o~oq(x), x = q-1
o
~/-10
~D-l(z*)
.
(9.51) (9.52)
R e m a r k 9.3 ( S t r u c t u r e o f B l o c k - T r i a n g u l a r O b s e r v e r ) In the new coordinates z* a block-triangular normal .form observer can be designed; the resulting estimates ~* have to be transformed back to original coordinate estimates ~ by means of the inverse transformation (9.52). Thus, the observer comprises a dynamical part (9.44) and an algebraic part (9.52).for the estimates. If the observability map q(w) (9.41) is a semi-diffeomorphism (Definition 9.1), the block-triangular observer is continuous (Theorem 9. 2). The idea how to transform a system from observability normal form (9.50)
9. Variants of Nonlinear Normal Form Observer Design
175
to a block-triangular form (9.42) is shown below: ( O~gj )
\
~ = ]~(~j<~, ~ , y~>~,gj>~,... , Yj>~), y ~ = ~ l , (A)
z i = ~ i ( ~ i , . . . ,~p),
~ij < n j - 1 ,
i,j=l,...,p,
/,,
z~ = I i (~j<~, ~ , yj>~),
y~=h~(zi)=Zil .
Subsystems Ei of an observability normal form (9.50) are usually mutually connected (see Figure la). Since the observability coordinates ~i of a subsystem Ei can be interpreted as the output Yi and its time-derivatives d~Y~ r = 1 , . . . n ~ - 1 (see (9.41) and (9.50), the desired transformation dt ~ , ~ i has just to eliminate the o u t p u t derivatives of subsequent subystems dryj Ey>i, i.e. dt---j-, r = 1 , . . . , n j - 1 for j > i. Thus, a cascade connection of
subsystems is ensured. The influence of subsequent outputs Yj>i in Ei is no problem in a decentral observer design, for they are measured and, hence, known variables. In order to remove output derivatives d~yj>i dt ~ , r = 1 , . . . , n j - 1 in a subsystem Ei, the outputs yj>i are interpreted as inputs of the subsystem. Now, the results of DelMeau and Respondek [5] for the elimination of input derivatives can be applied as published in [16]. To calculate the transformation ~i(~i, 999 , ~p), i = 1 , . . . , p - 1 , (the last subsystem needs not to be transformed, see the structure of EpA in (9.42)) the extended subsystem state ~i,e ~- ~T Yj>i, ... ,
(~.,:j) ] Y j>~J
(9.53)
and the extended vector field p
v~j+l
= j=id-1 hi--1
y j - (z-l) /=1
0
0
(9.54)
yj
0
with r:l
are introduced. In the following theorem, which is taken from [16], sufficient conditions are given for the existence of the state transformation zi=~i(~i,e),
i=1,...
,p-1.
(9.55)
T h e o r e m 9.6 ( E x i s t e n c e o f B l o c k - T r i a n g u l a r F o r m ) A single output subsystem Ei, i = 1 , . . . ,p - 1 of an observability normal .form (9.50) can locally be transformed into a subsystem ~ A of a block-triangular.form (9.42)
176
9. Variants of Nonlinear Normal Form Observer Design
by a state transformation zi = ~2i((i,e) , ~f
ad~
0 ,ad (a~j""~ 0 yj
=0
where
O< - q <-a i j 0 < r < a~l
(9.56)
P
The transformation kl'~(~i,~) is a solution of a system of n i . E (aij + 1) j=i+ 1 first order pdes, denoted in vector form
0~
q
O~i'e
- ~0 yj
--ad
0
~ - - -(~.,:j) 0,
q=O,...aij,
l <- i < j < p_.
(9.57)
T h e o r e m 9.7 ( N e c e s s i t y a n d S u f f i c i e n c y o f T h e o r e m 9.6) The conditions (9.56) of Theorem 9. 6,for the existence of a state transformation are necessary and sufficient for a subsystem decomposition o.f orders ni <_n j, 1 <_ i < j < p. For subsystems of orders ni > nj, 1 <_ i < j <_ p, the conditions (9.56) are only sufficient. In [16, 17, 19], the two theorems are proved and discussed. After the state transformations into a block-triangular form (9.42), the generated subsystems are sometimes in block-triangular observer normal form (9.43). If not, state transformations z* = ~ i ( z ) into (9.43) have to be tried [1@ ~:
(B)
~
~
=
$ ~zx( j < z ~,z~,yj>~)
z* = • i ( z i , . . . ..
E *zx:
,Zp), /x
, i,j=
*
y ~ = h~(z~) = z~l ,
l,...,p-1,
zi = A * z * + a i ( z / < i , y i , y j > i ) ,
Yi=Zr
*
T h e o r e m 9.8 ( E x i s t e n c e o f B l o c k - T r i a n g u l a r O b s e r v e r N o r m a l F o r m ) d subsystem EiA in a block-triangular.form (9.42) is transformable into a block-triangular observer normal .form subsystem ~./x (9.43) if (i) [ad~z~Ti, a d .I~ ' r i lj = O, k J i
L~
0 <<_r,s <_ n i - 1
with r~ .from
(9.58)
L~L~ik-lh~(zi )=1,
(9.59)
i
.~
(ii) ad'llA~-i,
=0, k=0,...,ni-2, =0,
O
j=i+l,...,p.
(9.60)
i
Condition (i) of this theorem is identical to condition (9.10) for the state transformation of a single output system (9.1) into an observer normal form (9.5), whereby the other states zj r zi are considered as parameters. T h a t transformation is now applied on a single o u t p u t subsystem of (9.42) and must produce a very special subsystem right hand side. However, at
9. Variants of Nonlinear Normal Form Observer Design
177
the same time, it may not destroy the triangular connection of subsystems, which was generated by the first transformation step zi = ~Pi((i,~). This requirement is ensured by condition (ii). If the conditions of Theorem 9.8 are not fulfilled, a different observer m e t h o d has to be used for the respective subsystem. Moreover, one can still go back to the original system (9.39), select a new set of subsystems, and check whether a block-triangular observer normal form may be generated now. This flexibility in using every possible degree of freedom is the advantage of the decentral approach. The two transformation steps from observability normal form (9.50) via block-triangular form (9.42) to the block-triangular observer normal form (9.43) are illustrated in the following example: X2 x 2 -}- X l X 4 -}- x 2 x 3 x4
,
y :
[Xl,
X3] T .
(9.61)
(X212 For this system, no state transformation into a multiple o u t p u t observer normal form (9.45) exists. Since (9.61) is already given in observability coordinates, i.e. ~ = q(x) = x, in a decentral approach two observable subsystems can be chosen easily by grouping the state variables into corresponding subsystem state vectors: ~ 1 : ~1 :
[~11,~12] T = [Xl,X2] T
~ 2 : ~2 = [~21 ~22] T z [323,x4] T
These subsystems are mutually connected like in Figure la. To bring the system into a block-triangular form (9.42), the influence of x4 = ~21 = Y2 in E1 has to be eliminated by a state transformation of the first subsystem, zl = ~1(~1,~)- For this, the extended subsystem state ~l,e = [~11, ~12, Y2, Y2]T and the extended vector field F1 = ] 1 ( ~ 1 , Y 2 , 5 2 ) ~0 0 52~--~,~ + y2~---~,~ have to be introduced. The conditions (9.56) of Theorem 9.6 reduce to the expression
0 0 ~11 -0~12 Oy2" Hence, the state transformation does exist. From the first order partial differential " 0~I:1~ -}- 0 ~ 1 -- 0, the state transformation equations 01II1 052 -- 0 a n d ~11~---~ Oy2 Zl = [~U, ~ 1 2 - ~U~21] T is obtained. Since the second subsystem E2 is already in a block-triangular form E2A, no transformation is necessary, i.e. z2 = [ 21, 22] r. by the two vector fields ~
0
adyl 052'
0 and adFi 052
-
178
9. Variants of Nonlinear Normal Form Observer Design
Hence, Example (9.61) has the block-triangular form z12 + E ~ : i~ =
YlY2
z12 + y l y 2
Z22 (z12 + yly2) 2
,
y = [z11, z21] T
(9.62)
Because subsystem E1A of this representation is still not in a block-triangular normal form, the additional state transformation z~ = Ol(Z) has to be performed. Using the vector fields 7-1 : [0, 1]T and a d l e r 1 = [-1, -1] T, the conditions (i) and (ii) of Theorem 9.8 are fulfilled; and the state transformation z~ -- [zu, z12 - Zll] T is determined. For the second subsystem E ~ again no transformation is needed, i.e. z~ --- [z21, z22] T. The final block-triangular observer normal form of Example (9.61) is calculated as zi'2 + yl(1 + Y2) 0 Z~ 2
E. ~ : / . =
,
y =
, , Z~I] T . [Zll
(9.63)
[Z~2 -[- Yl (1 + y2)] 2 Now for the first subsystem E~/x a single output normal form observer with constant gains l~i, i = 1, 2 can be designed:
[ Z12 . -~-. yl(1. -~-. Y2). -t-
~1 '~ : ~1 =
/11 (Yl lh
(yl-
Zll) h)
]
(9.64) '
Ensuring that this observer is converging sufficiently fast by assigning the eigenvalues of the error dynamics properly, the variable z{2 is approximately known by its estimate ~2. Thus, for the second subsystem E; zx a single output normal form observer with constant gains l~i , i = 1, 2 is possible as well: :*
[
z! ^2 [Z~2 ~-Yl~, • ~- Y2)] 2
-k ~-
/~I'(Y2-Z~1) ] l~2 (Y2--Z~I) '
(9.65)
The estimates in original coordinates are calculated in the algebraic part of the observer (Remark 9.3): =
o
o
=
(9.66)
For slightly modified equations of Example (9.61), i.e. for f2 (x) -- X l X 4.-~ x 2 x 3 , the block-triangular representation is even identical with the blocktriangular observer normal form. For f2(x) = x l x 4 , the system (9.61) can be transformed to a block-triangular form (9.42), but not to a blocktriangular observer normal form (9.43), since condition (ii) of Theorem 9.8 is violated. In such a case, an additional output transformation may be considered [17], or a different appropriate observer design method for the subsystem has to be chosen [3].
9. Variants of Nonlinear Normal Form Observer Design
6
179
Conclusions
The study of the normal form observer design and its variants is confirming the experience that the observer design is far more complicated to solve than the feedback or controller design of nonlinear systems. The presented normal form observer design variants illustrate that a progress in the solution of the nonlinear observer design problem is possible in various directions: via the application of a semi-diffeomorphic state transformation derived from the observability properties in case of the continuous observer; through an extended Taylor linearization and the additional output transformation used by the extended Luenberger observer; and by a subtle utilization of degrees of freedoms of multiple outputs for a subsystem design of the block triangular observer. ACKNOWLEDGEMENT: As it can be concluded from the list of references, the presented nonlinear observer design methods are originating from cooperations with D. Bestle, J. Birk, A. J. Krener, R. Rothfufl, J. Rudolph, and X. Xia. The work was supported by Deutsche Forschungsgemeinschaft.
7
REFERENCES [1] D. Bestle and M. Zeitz. Canonical form observer design for non-linear time-variable systems. Int. Journal of Control 38, 419-431, 1983. [2] J. Birk and M. Zeitz. Extended Luenberger observer for nonlinear multivariable systems. Int. Journal o.f Control 47, 1823-1836, 1988. [3] J. Birk. Rechnergesttitzte Analyse und L/)sung nichtlinearer Beobachtungsaufgaben. VDI-Fortschritt-Berichte Nr. 8/294, VDI-Verlag, D~isseldorf, 1992. [4] R. W. Brocket. Asymptotic stability and feedback stabilization. In: R. W. Brocket, R. S. Millmann and H. J. Sussmann (eds.): Differential Geometric Control Theory. Birkhiiuser, Bosten, 181-191, 1983. [5] E. Delaleau and W. Respondek. Lowering the orders of derivatives of controls in generalized state space systems. Journal of Mathematical Systems, Estimation, and Control 5 , 1 27, 1995. [6] A. Isidori. Nonlinear Control Systems (3rd edition). Springer-Verlag, London 1995. [7] H. Keller. Non-linear observer design by transformation into a generalized observer canonical form. Int. Journal of Control 46, 1915-1930, 1987. [8] A. J. Krener and A. Isidori. Linearization by o u t p u t injection and nonlinear observers. Systems and Control Letters 3, 47-52, 1983. [9] A. J. Krener and W. Respondek. Nonlinear observers with linearizable error dynamics. SIAM J. on Control and Optim 23, 197-216, 1985.
180
9. Variants of Nonlinear Normal Form Observer Design
[10] A. J. Krener. Normal forms for linear and nonlinear systems. In: M. Luksic, C. Martin and W. Shadwick (eds.): Differential Geometry: The Interface between Pure and Applied Mathematics. American Mathematical Society, Providence, 157 189, 1987. [11] A. J. Krener. Nonlinear stabilizability and detectability. In: U. Helmke, R. Mennicken and J. Saurer (eds.): Systems and Networks: Mathematical Theory and Applications, Vol. I, Akademie Verlag, Berlin, 231-250, 1994. [12] H. Nijmeijer and A. J. Van der Schaft. Nonlinear Dynamical Control Systems. Springer-Verlag, Berlin 1990. [13] A. R. Phelps. On constructing nonlinear observers. S I A M J. on Control and Optim. 29, 516-534, 1991. [14] R. Rothfufi, J. Rudolph and M. Zeitz. Flatness-based control of a nonlinear chemical reactor model. Automatica32, 1433-1439, 1996. [15] R. Rothfufi. Anwendung der flachheitsbasierten Analyse und Regelung nichtlinearer Mehrgr0t3ensysteme. VDI-Fortschritt-Berichte Nr. 8/663, VDI- Verlag, Dttsseldorf 1997. [16] J. Rudolph and M. Zeitz. A block triangular nonlinear observer normal form. Systems and Control Letters23, 1-8, 1994. [17] J. Schaffner. Zum Beobachterentwurf fiir nichtlineare Systeme mit mehreren Met3grOt]en. VDI-Fortschritt-Berichte Nr. 8/620, VDI-Vetlag, Diisseldor.f, 1997. [18] J. Schaffner and M. Zeitz. Decentralized block triangular observer design for nonlinear systems. Proe. European Control Conference ECC'97, Brussels, 1997. [19] J. Schaffner and M. Zeitz. Decentral nonlinear observer design using a block-triangular form. To appear in Int. J. of Systems Science, 1999. [20] X. H. Xia and W. Gao. On exponential observers for nonlinear systems. Systems and Control Lettersll, 319-325, 1988. [21] X. H. Xia and W. Gao. Nonlinear observer design by observer error linearization. SIAM J. on Control and Optim.27, 199-216, 1989. [22] X. H. Xia and M. Zeitz. On nonlinear continuous observers. Int. Journal o.f Control66, 943 954, 1997. [23] M. Zeitz. The extended Luenberger observer for nonlinear systems. Systems and Control Letters9, 149-156, 1987. [24] M. Zeitz. Canonical normal forms for nonlinear systems. In: A. Isidori, (ed.): Nonlinear Control Systems Design - Selected Papers .from IFA CSymposium, Capri/Italy 1989, Pergamon, Oxford, 33-38, 1989. [25] M. Zeitz. Nichtlineare stetige Beobachter. Z A M M - Zeitsch~ft .fiir Angewandte Mathematik und Meehanik78, $1137-$1140, 1998.
Separation Results for Semiglobal Stabilization of Nonlinear S y s t e m s via M e a s u r e m e n t Feedback Stefano Battilotti Dipartimento di Informatica e Sistemistica Universit~ di Roma "La Sapienza" Via Eudossiana 18-00184, Italy
i
Introduction
Recently, the problem of semiglobally (rather than globally) stabilizing a nonlinear system via output feedback has gained a renewed interest ([4], [5], [161, [18], [17], [141, [15], [8]). Here, semiglobally means that one is requiring that tile region of attraction of the equilibrium point contains at least an a priori given compact set of the state space. The earlier works of Esfandiari and Khalil ([4], [5]) have shown that fully feedback linearizable nonlinear systems, which are generally not globally stabilizable via dynamic output feedback, are instead semiglobally stabilizable. The basic ingredients are input saturations and high gain observers: large values of the observer gain guarantee that the error between the state and its estimate, generated by the observer itself, goes to zero "sufficiently fast", while input saturations rule out destabilizing effects such as "peaking" ([12]). The combination of these two successful ingredients, together with the key concept of "complete uniform observability", has led to the unifying work of Teel and Praly ([14]): for the class of systems
:~ = y =
f(x,u,d(t)) C(x,d(t))
(1.1)
where d(t) is some exogenous disturbance, it is proved that semiglobal stabilization via state-feedback plus complete unifoTnn observability imply semiglobal stabilization via dynamic output.feedback. Complete uniform observability implies that a state feedback u(x), which semiglobally stabilizes (1.1), can be written as a known function 9 of y, u and their higher order derivatives (but independent of d(t)). Besides, a high-gain observer is designed to reconstruct the higher order derivatives of y. A key feature of this design procedure is the possibility of
184
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems
taking y as a state, i.e. the dynamical behavior of y and their higher order derivatives can be modeled through ~) =
y(ny+l)
y(1)
Cn~+l(x(t),u(t),... ,u( .... )(t),d(t))
(1.2)
This indeed allow to act as if the output were not corrupted by uncertainties. An i m p o r t a n t problem has been left open so far: what can we do either when (1.2) is not available or the observer gain cannot be pushed arbitrarily large? Say, consider the uncertain system
it(t)
=
Az(t) + B2u(t) + B l ~ ( u ( t ) , z ( t ) , t )
y(t)
=
C2x(t) + Cl(U(t),z(t),t)~(u(t),x(t),t)
(1.3)
where ~(u(t),x(t),t) is some exogenous disturbance (vector), of which nothing but some bounds are known. Note that (1.3) is m a d e up of a nominal linear system, perturbed by nonlinear terms and uncertainties. Under which conditions (1.3) is semiglobally stabilizable via measurement feedback, i.e. when only the measure of y is available? In this paper, we give a general theorem on the regional stabilization of (1.3) via measurement feedback (Theorem 1.1), which stands as a generalization of a previous result on quadratic stabilization of linear uncertain systems ([6]). Our approach is completely different from the ones pursued in the literature and it is strongly based on 7-/~ control tools. Moreover, we recover into a general framework most of the existing results on the semiglobal stabilization via output .feedback (see Section 3) and generalize t h e m into the more appealing perspective of stabilization via measurement .feedback In particular, in Section 3.2.1, we will discuss the case C1 (u(t), x(t), t) = 0 ("uncorrupted outputs") and see how high gain observers arise in this case; in Sections 3.2.2 and 3.2.3, we discuss the case of input saturations and o u t p u t saturations, which can be seen as particular nonlinearities perturbing a linear system. Finally, using Theorem 1.1, in Section 3.3, we give some general tools for semiglobally stabilizing via arbitrarily bounded measurement .feedback a significant class of nonlinear uncertain systems ( T h e o r e m 1.2), which include at least feedforward systems and homogenous systems ([18], [13]). Some key features of our design are given by allowing b l o c k - s t a t e equations and uncertainties in the outputs: moreover, our design procedures end up with linear controllers and quadratic Lyapunov functions. As a particular-
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems
185
ization of our results, we o b t a i n t h a t s y s t e m s
Zl z2
= =
z2 q-P11(z,u,t) z3 + P21(z,u,t)
zn
=
u+p,~l(Z,u,t)
yj
~-
zjq-Pj2(z,u,t),
:
j=l,...,n
(1.4)
with zj, y, u E ~ and z = col(z1,..., z~), are always semiglobally stabilizable via arbitrarily b o u n d e d m e a s u r e m e n t feedback as long as p j l ( z , u , t) a n d pj2(z, u, t), j = 1 , . . . , n, are higher order in ( Z j + l , . . . , z,~, u), u n i f o r m l y with respect to t and Zl,. 9 zj (see [18] for state feedback), while s y s t e m s in the form
y
=
Az+Bu+~l(z,u)u
(1.5)
=
z+~(z,~)~
(1.6)
with ~j(z,O) = 0, j = 1,2, for all t and z, ( A , B ) in Brunowski f o r m a n d z E ~'~, y E ~ n a n d u E /~, are semiglobally stabilizable via arbitrarily b o u n d e d m e a s u r e m e n t feedback as long as ~l(z, u)u a n d ~ 2 ( z , u ) u are of at least order one and zero, respectively, with respect to the "generalized" dilation 5l(z,u) = (ll-'~zl,... , I - l z n _ l , z,~, lu) a n d uniformly w.r.t, t (see [13] for definitions and related results for state feedback).
2
Notations 9 B y ,\max{S} (Am/n{S}) we denote the m a x i m u m ( m i n i m u m ) eigenvalue of a given m a t r i x S;
if Ilvll denotes the 2 - n o r m of any given vector v, by IIAII we denote the induced 2 - n o r m of any given m a t r i x A a n d we have IIAII = V/Amax{ATA}; by IIvlIA we denote the A - n o r m of v, i.e. IlvllA =
vCTG; 9 by $ 7)~ we denote the set of n x n positive definite s y m m e t r i c m a trices; 9 for a n y v e c t o r - v a l u e d function ~ / : / R s --* ~ , we d e n o t e by ~/~ its i - t h c o m p o n e n t ; for any m a t r i x M we denote by Mi its i - t h row.
186
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems
3
Regional Stabilization via Measurement Feedback
3.1
Tools
Let us consider the s y s t e m
~c(t) = y(t) =
Ax(t) + B2u(t) + B]~(u(t),x(t),t) C2x(t)+Cl(u(t),x(t),t)~(u(t),x(t),t)
(1.7)
with t E ~'~ a.e., state vector x(t) E ~'~, input vector u(t) E 1Rm, ~(u(t), x(t), t) E JT~k, A, Bj, j = 1,2, and 6'2 (resp. C1(.,.,.)) known matrices (resp. m a t r i x - v a l u e d function) with a p p r o p r i a t e dimensions. T h e structure of (1.7) is characterized by a nominal linear s y s t e m pert u r b e d by nonlinear (unknown) terms. T h e vector ~ ( - , . , .) c a p t u r e s b o t h the uncertainties a n d e x o g e n o u s disturbances, which p e r t u r b the n o m i n a l system. We say t h a t q) : ~ m X ~ X ~ --~ ~ k is an admissible u n c e r t a i n t y if
continuous w.r.t, the first two a r g u m e n t s a n d Lebesgue measurable w.r.t, the third a r g u m e n t
9 it is
9 for e a c h t E ~ , x E ~ n
anduE~'~
~(u,x,t) ElJ(u,x)de=f {vC ~ k : i i v i l < s ( u , x ) }
(1.8)
for some known continuous positive semidefinite function s : ~ m x
T h e class of follows
admissible feedback laws we consider is characterized as
= k(y,~) h
=
v ( y , o-),
cr E ~:t q
(1.9)
with continuous k : ~ P x ~ q -~ ~ m and v : ~ P x h~ q ~ ~ q , vanishing at the origin. In this section, we are interested in the regional stabilization of (1.7) u n d e r the constraint (1.8). Let i/) be the set of admissible uncertainties : ~ ' ~ x ~ n x ~ ~ ~ k and let (x(t, xo, (to), cy(t, x0, c~0)) d e n o t e the trajectories of (1.7) (1.9), at time t with initial condition (x0,c~o). We say t h a t the s y s t e m (1.7) is uniformly locally asymptotically stabilizable
via measurement-feedback (ULASMF) with region of attraction containing f~e C ~ n x ~ q if there exists an admissible control law (1.9) such t h a t along the trajectories of (1.7)-(1.9)
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems
187
1. ( U n i f o r m s t a b i l i t y ) Ve E ~ + there exists 5~ E / R + such t h a t
(x(t'x~176 )
(1.10)
~(t, x0, 00)
for all
c {v e
iivil _<
2. ( U n i f o r m b o u n d e d n e s s such t h a t
t 9 [0, +oo)
plus attraction)
~(t,~0,~0)
a.e. a n d
$ E 7);
there exists M E /R +
< M
(1.11)
for all (x0, cr0) E f ~ , t E [0, +oo) a.e. and ~ E 79, and Ve E ~ + there exists T~ E ~ + such t h a t
( x(t, xo,~o) ) ~(t, x0, ~0)
(1.12)
< ~
for all (x0, or0) E fl ~, t E ITs, +c~o) a.e. and ~ E 79. If, in addition, fl ~ contains an a priori given compact set $ • YY C • ~ q we will say that (1.7) is uniformly semiglobally asymptotically stabilizable. T h r o u g h o u t the paper, we will consider compact sets containing the origin of the state space. Assume that s2(u, x) can be taken some quadratic function of u and x, whenever iluli _< A(c) and x lives in ~
e(c) doj {~ c z ~ :
Ilvll~,(~) _< c2 }
for some positive number c and where PSF(C) is a positive definite symmetric solution of the Riccati equation, associated to the s t a t e - f e e d b a c k 7-/oo linear control problem (see [3]), with ~(u, x, t) as exogenous disturbance and s(u, x) as penalty variable. Assume also t h a t one is capable to find a positive definite symmetric solution Pol(C) of the dual Riccati equation, associated to the output injection T / ~ linear problem (see [3]), with as exogenous disturbance and s(u, x) as penalty variable. If P s F ( c ) is sufficiently "small" than Pos(c), then (1.7) is U L A S M F with a region of attraction [V(c), which in general m a y suffer from an intrinsically nonlinear phenomenon, pointed out in [7], and commonly referred to as vanishing region o.f attraction. This negative phenomenon can be counteracted if an additional nonlinear inequality, involving Psg(c) and Pos(c), is satisfied. This inequality requires the knowledge of two continuous functions ~7(') and 5(-): the first one is instrumental in constructing a stabilizing controller (1.9), since k(y, a) is taken as the composition of ~7(') with a linear controller v = F2(c)a, where F2(c) comes from solving the s t a t e - f e e d b a c k
188
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems
7-/o0 linear control problem, with ~(u, x, t) as exogenous disturbance and s(u, x) as penalty variable; the second one pops up in the candidate Lyapunov function of the closed-loop system (1.7)-(1.9) and, if suitably designed, prevents the phenomenon of vanishing region of attraction so that any given a priori compact set $ x 14; can be included in the region of attraction of the closed-loop system. Different choices of these functions 7(') and 5(.) allow to recover into a general framework most of the existing results on the semiglobal stabilization via output .feedback of (1.7) (see next sections) and to put them into the more general perspective of stabilization via measurement .feedback. The above discussion leads directly to our first theorem. For simplicity of computations and resolving formulas, we will assume that B1C T (u, x, t) = 0 for all u E / R m, x E f i n and t E f i . T h e o r e m 1.1 Assume BiCTl ( u , x , t ) = 0 .for all u E fire, x E j~n and t E f i . Moreover, assume that for each c E fi+ 9 ( s t a t e f e e d b a c k ) there exist PSF(C), QsF(c) E ,SP n, such that -.for some "7(c) E fit + and A(c) E (0, oo] and.for some known matrices El(c) and E2(c) one has
II (u,x,t)l [ <<_s(u,x) d_t 1 [ IIE (c) ll2 + IIE2(c)ull2] (1.13) .for all t E f i , u E f i m such that ]]ui] _< A(c) and 2
C2
- Rl(c) clef ET(c)E2(c) E STem; - the .following Riccati equation is satisfied
u F(c)
de____f ATpsF(C) + P s F ( c ) A
(c)(- 1 B1B1 T - B2R7I(c)8 )P F(c)
+
Ps
+
ET(c)EI(c)=--QsF(C)
(1.14)
9 ( o u t p u t i n j e c t i o n ) there exist R2(c) E S P Y and P o i ( c ) , Q o ~ ( c ) E $7~n, such that -.for all u E -Yztm such that IiuII _< A(c), x E f~(c) and t E f i R2(c) > C1 (u, x, t)CT(u, x, t)
(1.15)
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems
189
- the .following Riccati inequality is satisfied
de__J ATpoI(C) + P o i ( c ) A + P o I ( C ) B I B T p o I ( c )
~.~oi(C)
CTRa-I(c)C2 < - Q o i ( c ) (1.16)
ET(c)EI(C)
+
72(e)
9 (relative speed) - Q m ( c ) aej 7 ~ ( c ) Q o i ( e )
Qsp(c) > 0
_
- Pm(c) de__j72(c)Pot(c ) _ PSF(C) > 0
9 ( n o n l i n e a r c o u p l i n g ) there exist a E (0, 1) and C O .functions 6 : -~ (0, 1 - a] and 71 : 1Rm ---* ~ m such that if F2(c)
def =
-R~I(c)BTpsF(C)
(1.17)
~(s)
aof =
as +
(1.18)
o
6(~))d~
one has ~(F2(c)x)
=
F2~(c)x,
i = 1,...,m
(1.19)
and
IIv(F2(e)(x - e)) - F~(c)xlb~,(~) - I l x l l ~ ( ~ ) IlF2(e)ell~,(c) + Ilell~,,~(~)
< a + 5(IHl2gm(c))
(1.20) for all x E ~(c) and e e KU such that 0 < ~(]le][p,,,(c)) <_ c2. Under the above assumptions, (1.7) is U L A S M F with region of attraction containing
~e(c ) def~___{(Vl,V2) 9 ~ n X ~ n :
[]Vl[[2PsF(C) ~_ ~O([[V1 __ V2t[Pm(c))2 __~C2} (1.21)
whenever ][~l(F2(c)cr(t))][ <_ A(c) for all t >_ O. A n admissible stabilizing controller is given by
6
u
= =
~(F2(c)~) H ( c ) a + B2~(F2(c)a) + G(c)y
g(c)
=
A + --~(c) B1Bx PsF(C) -- G(c)C2
G(c)
=
~/2(c)Pml(c)CT R2-1(9)
(1.22)
with 1
T
(1.23)
190
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems
Remark
1.1 The value a = 0 can be allowed in T h e o r e m 1.1 as long as
0
R e m a r k 1.2 T h e design p a r a m e t e r A(c) allows to take into account input saturations, see Section 3.2.2. R e m a r k 1.3 T h e o r e m 1.1 states that regional stabilization can be achieved if the following can be achieved: a semiglobal state .feedback stabilization problem, with controller u = F2(c)x and region of a t t r a c t i o n being the points inside some level s e t / : of a Lyapunov function VSF, an output injection stabilization problem, assuming the state x remain i n s i d e / : , a linear coupling condition which guarantees the observer error converge to zero "sufficiently" faster t h a n the state x does whenever (1.7) is plugged with u -- F2 (c)x and a nonlinear coupling condition which determines the width of the region of attraction. If Cl(u,x,t) is constant a n d (1.13) holds for all x, u and t, T h e o r e m 1.1 recovers a well known result on quadratic stabilization of linear uncertain systems ([6]), since in this case (1.20) is satisfied with a = 0, 6(s) --- 1 and ~(s) = s for all s. Moreover, the region of attraction is all o.f~ n • ~q. For a general system (1.7), this simple choice of 5(s) and ~(s), although still satisfying (1.20), does not prevent the p h e n o m e n o n of vanishing region of attraction to appear (see [7]). T h e flexibility in the choice of the functions 5(s) and ~?(s) is crucial in recovering most of the existing results on robust semiglobat stabilization, as it will be shown in the next sections. R e m a r k 1.4 From the proof of T h e o r e m 1.1 it follows t h a t the conclusions of T h e o r e m 1.1 hold if the inequality in (1.14) is replaced by inequality and (1.16) together with the relative speed constraint are replaced by the existence of matrices Pro(c), Qm(c) E S79n such t h a t
[A + 2--2~BIBTpsF(C)--G(c)c2ITpm(C) ( q- Pm(c)[A q- 21-~,cBIBTI psF(c)-G(c)C2] q- FT(c)RI(c)F2(e) __() + with
G(c)
1 pm(c)(BiBT + G(c)R2(c)GT(c))Pm(c) < -Q,~(c) .r2(c) as
in (t.23).
(1.24)
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems
3.2
191
Applications
3. 2.1 Uncorrupted Outputs Revisited This section is devoted to recover in the framework of measurement feedback the case in which C1 (u, x, t) = 0 for all u E /R m, x E ~ and t ~ ([11], [4], [5], [8], [14], [15]). If this is the case, for each c E ~ + the matrix R2(c) (see (1.15)) can be taken any positive definite matrix. If (A, B1, C2) is (possibly after a change of coordinates) left invertible with no zero dynamics ([9]), the output injection, relative speed and nonlinear coupling constraints can be met as follows. Assume that for each c r ~ + there exist PsF(c), Qsg(c) E 8 P n satisfying (1.14) for all'7(e) E ~:~+ (i.e. the gain between w and z = s(u,x) can be taken arbitrarily small). Without loss of generality, we can assume that El(-) and ~/(.) have been chosen so that ~/(c) < [IEl(C)ll for all c 9 ~ + . Moreover, for each given compact set $ assume the existence of c 9 t~ + such that S C f t ( ~ ) . Under the above assumptions, one can find continuous functions Q(o)(C),
P;~(c) and P ~ ) ( c ) : ( ~ , + o o ) - ~
S P n and e~)/ : ( ~ ,
+oo)--,
~ + with the following properties: 1. the following Riccati equation is satisfied ~(to,),.~ a~f ATpg}.)(c) + P(~f)')(c)A + 1~(91 ~(lo, ) ( c ) B 1 B ~"~--(lol)[e. ' ~ I k J~ ~_ET(c)EI(c)
--T,~(lo,)(C) -1C2
2(c )
,~(lo,)
,.)
(e)
(1.25)
E1 c 2 + ~ ) for all loi E ( ~ , .
ol
lira
~ J=~or
loI-*+oo
kCj'ot
llPg ")(e)ll lki
(c)
0,
(1.26)
for some k > 1
(1.27)
(lo~) (C)} = ~ - ~ lim .~min{P~, 1ol--~+oo
(1.28)
lim e ~ ) ( c ) lo~+oo lOI
-- +oo
(1.29)
= 0
(1.30)
lim I]R~'~ 101---,-~-oo
Using (1.26), (1.28) and (1.29), choose lOi,l(C ) for all lot E [Ioi,1(c), +oo)
( ~~2 ( c ) e ~,
,
+cx~) such that
192
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems
1. P ~ ~
d~j 72(c)p(~?,)(c)
PSF(C) > 0
_
2. w}(l~ ( C) def =
- Qs
(e) > 0
and pick L(c) E J~+ such that, if
um,~,~,i
----- max {]F2i(c)x]} ~e~(c)
7?~(s~)
=
s~min{1, u m ~ # }
~(81,...,8m)
~-
COI(TI1(81),...,~lm(Srn)) 1 ifs>_O ~ ifs<0
6(s)
(1.31)
then (1.20) holds for all Ioi C [loi,1(c),+oo), x E Q(c) and e E /R n such t h a t IlellR(:o, 2 (c) _> L(c). Note t h a t with our choices the region of attraction of the closed-loop system (1.7)-(1.22) is given by ~ e ( e )def {(Vl .02 ) E ~ n 1 ln(1 + Itvl
+ loz
2 X ~ n : tlVlllpsl.,(c) 2
,,,
_< c 2}
(1.32)
Since
lns k
lira ~ = 0 , s-~+oo 8
Yk_>l
and from (1.27), it follows that, for each pair of compact sets $, • lira
sup
~1
to~-*+oo xC,S;aE~V I loI
C / R n,
ln(l+[]x-crl[~(,:o,)(c))}=0
We
conclude that for each pair of compact sets $, ]A2 c ~n one can pick lol E ~( ~~(c) , +oe) sufficiently large in such a way that fie(c) contains
9S • l/Y. For the class of systems (1.7), by using the above arguments one can also recover the fact t h a t semiglobal stabilizability via state .feedback (in
the sense of (1.14)) plus complete uniform observability implies semiglobal stabilization via output feedback. Indeed, under these assumptions, the output can be taken as a state and, thus, one can assume C l ( u , x , t ) = 0 for all x , u and t. However, since throughout this paper we consider a nominal systems which is linear, Teel and Praly's separation result remains still more general t h a n ours as long as C1 (u, x, t) = 0 for all x , u and t and the dynamical model (1.2) is available.
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems
193
3.2.2 Input Saturations Let us consider the system ~(t)
=
A x ( t ) + B2u(t) + B l ~ ( u ( t ) , x ( t ) , t )
y(t)
=
C2x(t) + C l ( u ( t ) , t ) ~ ( u ( t ) , x ( t ) , t )
(1.33)
with B 1 C T ( u , t ) = 0 for all u E /R m and t E /R (the case B 1 C T ( u , t ) 0 for some u and t can be studied as well). We will make the following assumptions ( H 1 ) the pairs (A, B1) and (A, B2) are stabilizable and the eigenvalues of A have nonpositive real part ( H 2 ) the pair (C2,A) is detectable " ( ) : ( 0 , 1 ] ~ 8 7 ~ n and % A E ( H 3 ) there exist continuous j'f u n c tw ns ~"~(') S F , I~f~F Kt + such that
/. II~(u,x,t)ll 2 _< ~llE2ull 2 for some E2 C ~ •
for aU t C ~ ,
x C 1~n and u C 1Ftm such that ]lulI _< A 2. R1 de_fETE2 is invertible and there exists R2 E 8 P p such that
R2 > cl(u, t)C~(u, t) .for all u E ff~m such that Iiul] _< A and for all t E 2R; 3. the .following Riccati equation is satisfied ,~4(lsf) ' ~SF
__ def
~---
( I s F ) "~ ,d A T p ( ~ F) -b .I DSF
P(lSF)fBIBT ~ SF
k
V
B D--1BT~p(ISS) 2x%1 2 } SF
_ r '~r ) q S~F" )
(1.34)
.for all lsF 6 (0, 1] 4-
lim ] ] Q ( ~ ' ) i l = O a n d
l,q F----*O
lim
l~,;~ ---*0
[pq~')l]--O
The interest in the class of systems (1.33) relies on the possibility of taking into account input saturations. As an example, consider ~(t)
=
Ax(t) + B a l ( u ( t ) )
~(t)
=
C2x(t) + Ca2(u(t))
(1.35)
where cyl, a2 are locally Lipschitz continuous, uniformly with respect to t, and such that
C(g2(u, t) - u) ~- C l ( u , t)(ol(U, t) - u) -[- (y3(u, t)
194
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems
for some continuous 6"1(',-) and c,3(.,-), with cra(0, t) = 0 for all t e tg. Clearly, (1.35) can be rewritten as (1.33) with ~(u(t), x(t), t) = C*l(u(t), t ) u(t), B1 = B2 -- B and y(t) --- ~(t) - Cu(t) - rr3(u(t), t). In (1.35) the term c*2(u) may also capture any (unknown) erro affecting C2x, due to torque disturbances, etc. etc. Assumptions (H1)-(H3) are exactly the same invoked in [16], Lamina 3.1 (see also ([8]). By (H2), there exist Qol, P o l c ST ~n such that the following Riccati inequality is satisfied ~k~OI
clef ATpox + P o I A + P o z B 1 B T p o I - CTR~IC2 <_ - Q o I
(1.36)
. ~(l.~F) def n(lSF) Let lsp E /R + be such that 2",~" = 72P01 - /-'SF > 0 for all lsF E (0, [*SF]. This choice is always feasible by H3.4. Fix compact sets S, }/V C ~ n and let c E ~r~+ be such that
c~ >
sup
{ll<J~,sF, + ]>
xES;cyCW;lsFC(O,I}F ]
-
SF
Define a = 0, r](s) = s and 6(s) = 1. Pick ISF E (0, l}F ] such that
1. if F~'~) ~ -RI1B~P~'? ), one ha, IIF~'~%II < ZX for all ~ such that (x, ~r) e ~ ( c ) 2. Q ~ ' )
d~f=72Qo I - "~SFf)(<"")> 0
Property 1 above can be satisfied by H3.4 and since
/ n(/~.,)
lir2otl _< (h V-'-s~
,
II + ~
p(l,.~..)
s,~ II)c
for some kl, k2 ~ / R + and for all ~ such that
(x,o')e~edef{(vl,v2)e~'~Xg{'~:llVl=II~?F> + Ilvl -- V2II<,..,.F> } 2 <_ c2 With our choices, (1.20) is satisfied for all x 9 a(c) and e 9 ~ n \ { 0 } . By applying Theorem 1.1, one conclude that the region of attraction ~ contains at least S x I/V.
3.2.3 Output Saturations Let us consider the system
2(t)
=
Ax(t) + B2u(t)
y(t)
=
C2x(t) + C l ( u ( t ) , x ( t ) , t ) ~ ( u ( t ) , x ( t ) , t )
with x 9 Ktn, satisfying the following assumptions: (nl)
(d, B2, C2) is in prime canonical form ([9])
(1.37)
1. S e p a r a t i o n
Results
for Semiglobal
Stabilization
of Nonlinear
Systems
195
( H 2 ) .for any C O function c(') : (0, 1] --~ ~ + such that limc(0 = + o c there '
l--,O
exist C O .functions 7 ( ) , / ~ ' ) : (0, 1] --~ h~ + such that
ii ,(u,x,t)ll
<
IIC2xl?
IlCl(u,x,t)cT(u,x,t)l[
M < (7(0)2(R(0) -1 - 1
(1.38)
(7(z))2
-
<_ I ~ 0
(1.39)
>
(1.40)
0
.for s o m e M E Kt +, .for all t E (0, 1], t E fit, u E fit m and x E 1R'~ such that ttC2xll <_ c(O and, in addition,
] -~ ( c
(l) 2 n - - l - - 2 h
)
( ( 7 )(l)
2
( P ~(l))
--1
-1),,~-~
=
+oc
(1.41)
where h = n - l, n - 2, . . . , O.
Note t h a t if y -- sat(C2x), where sat (s) -- s m i n { 1 , T~}, one can take ~(u,x,t)
= sat(C2x) - C2x, C l ( u , x , t )
= 1, / ~ 0 = 1 and 7(0
c(')
with ]imT(0 = 1 and I]~(u,x,t)l] 2 < ~(~(,))~ for all x E /R n such t h a t -
IIC2xil _< c(0. Moreover, ( 7 ( 0 ) 2 ( P ( 0 ) -1 - 1 = ~2c U) --1
so t h a t (1.41) is
satisfied (w.t.o.g. we can assume t h a t c(0 > 89 for all l E (0, 1]). From (H2) it follows that, whatever the C o function R~ ) : (0, 1] --, ~ + is, one has (1.42)
IiC2xi]2 + ]IuiI21"
II~(u, ~, t)ll 2 <
(7(0)2
for all t E ~ , l E (0, 1], u E h'~m and x E f f ~ such t h a t IIC2xiI ~ c(O. Let R~ ) be such t h a t lim R~ t) = 0. It is easy to see t h a t there exist l* E (0, 1] l---*0
and C o functions P (S)F ~ C SoF(~) ATP(l)
~(l) A
* S F ~- * S F
p (rn) ~'qcTn o()
:
(0, 1]
D(O D [ D ( I ) x - - I D T p ( l )
-- I S F - O 2 ~ J t l
+ P 2 ) A ~-- "(')D f SFZ'~2[*~l
)
!
S P ~ such t h a t
/:)2 * S F zV
~-'2 * S F
c T c 2 = -r~(z)
"~SF
(1.43)
(7(O)2R-IcTc2 <
(1.44)
for all l C (0,/*]. Indeed, define hi = ~ and h2 = ~!i . Let -PsF, O be the unique (stabilizing) positive definite symmetric solution of -PsE, oA + A T p z F , O -- -fiSF, oB2BT-PsF, O + c T c2 = 0
For each fixed C O function Q ~ ) :
(1.45)
(0, 1] ~ S P ~ such t h a t l--~0 l i m Q ~o~,) -- 0 pick
C O function --(') P S F : (0, 1] --~ S P ~ such t h a t --(l)
~(I) r~T--~(l) -~(t) P S F A + A~ T ' ---SF -- "-PSF B 2~2 r SF + c T c 2 = --WSF
(1.46)
196
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems
for all l e (0, 1] and limT~ ) = -fisg, o. Define l--+0
o,.
Ps ')F Q(0 SF
(1.47)
Z(O-P(~)FZ(O
=
7 ( 0 ~ ( 0 7(0r~(0~-h~
~
~
"gSF
L~
\~1
(1.48)
]
where Z (0 = diag{(R~0) hi , (R~0 ) h ' + h 2 , . . . , (R~O)hl+h2(n--1)}. Let c(0 = e(R~0) - h , , with e E K/+ (independent o f / but dependent on the compact set S x W to be included in the region of attraction) to be specified later. Moreover, define k(0 = ((~(0)2(P40)-1 - 1)~,~'-~ and pick
p(O = W(O-fioW(O
(1.49)
where W(0 = d i a g { ( k ( 0 ) n - 1 , . . . , 1}, P0 is any positive definite symmetric solution of
-fioA + AT-rio -- c T c2 = --Qo -~(0
for Q0 c S:P n. Using (1.41)-(1.41) and 2hi = h2, pick wSF such that for some l* C (0, 1] -
-
Q(O sF
< k (0 W(O-Q~
(1.50)
-
2
Ps 0F
--
<
P0(0
(1.51)
p~)
=
p0(0_ ,P(0sg
(1.52)
Q~)
=
k(O W(O-Q~
(1.53)
for all l C (0,/*]. Define
2 Finally, fix compact sets S, W C ~ n and pick e E ~ + such that s
- ~es;o~w;ze(0,1]
'~
(note that e < +oo by (1.40)). This concludes the proof of (1.43) and (1.44). By Remark 1.4 and (1.54), it follows that, under (HI) and (H2), (1.at) is semiglobally stabilizable by arbitrarily bounded measurement feedback.
3.3
Semiglobal Stabilization of Uncertain Nonlinear S y s t e m s
In this section, using Theorem 1.1 and the arguments of Section 3.2.2, we give some general tools for achieving large regions of attraction for uncertain system using arbitrarily bounded measurement feedback. A basic feature of our design is that we allow for block state equations and uncertainties
1,
Separation Results for Semiglobal Stabilization of Nonlinear Systems
197
in the outputs and we end up with linear controllers and quadratic Lyapunov .functions. We will recover and generalize some recent result on the semiglobal stabilization via state feedback of
AlZl(t) + Bl~l(z(t),}'n+l(t),t)
Zl(t)
=
yl(t)
-=-- C1221 (t) -~ Cll~I21(z(t), z'n+l (t), t)
~(t)
=
A~z~(t) + B ~ ( z ( t ) , ~ + l ( t ) , t )
~(t)
--
C,~2zn(t) + C,~lq2,~(z(t),~+l(t),t)
(1.55)
with zj e ~ J , z = col(z1 . . . Zn) T, Zn+l e j~m the input vector, ~j 9 ~PJ the o u t p u t vector, A j , B j , Cjl and Cj2 matrices of suitable dimensions,
~j(z, ~, t) 9 ~:tkj are admissible uncertainties and j~= l rj = r. We will assume that C j l B ~ --- 0 for all j -- 1 , . . . , n and rank (Bj) = rj for all j ---2 , . . . , n (these assumptions can be relaxed and are motivated by simpler calculations and formulas). We propose the following family of measurement feedback controllers. Let
..(0 @
=
n~Z)c~j+ Bj2zj+, +G~Z)yj,
aj e ffFJ, j = 1 . . . n (1.56)
where vl = Yl
-
C11KI~2 2<j
(1.57) and
1
D(o
_ a
II~l)
=
Aj + - ~ L ~ j l L ~ j l , SF, j u
G~l)
z
^lj2/~(l) , T r~(l)~-I ~ mj]~ - l r~j2k2vj2 ]
)cj2
(1.5s)
(.) P~j (-) and R)2 (.) are C O functions with 1 C (0, 1], Bjl = Bj, 7j C JTt+, P)F,j, with domain (0, 1] and codomain $7 ~rj and $:PPJ, respectively, and Bj2 and Rjl are matrices of appropriate dimensions. In order to understand how to choose the parameters characterizing the
198
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems
above family of controllers, we need some more definitions. For, define Z1
Q
=
zj-'~j 0
Zj Zj p~ VSFS(Vl) Wj(Ul,?32)
= =
if j = 1 if2<j
(1.59)
col(zj,...,z,,) col('~j,...,Tn+l)
(1.60)
d~._f_ 3'~Pg},j - "(')SF,j
(1.62)
4~=f IlvlI[2ps(,)F.j,
(1.63)
def =
(1.61)
j=l,...,n
[IVl lips(t),:/ 2 2 q - I ] V l -- V2[]p(O,,~j
(1.64)
If no ambiguity arises, we will usee~.alternatively the notations Z2 and ~. Given h C /R +, compact sets Sj, levy, j = 1,..., n, and C o matrix-valued functions Nji(.,.), 1 <_ j <_ n, j + 1 < i < n, and /Vii(','), 2 _< j _< n, j <_ i < n, of appropriate dimensions define the following C o functions c~ )
d~_f 2,~_1
g#)
=
~'(n/) 1
de f
sup
ILRalBn2~ Inax
v,~v,,~((,,)<_d2
k(Z) d~j 4.
{Wn(vl,v2)}
(1.65)
o.,,011 1+
(1.66)
{I](,d[}
(1.67)
sup
{
O
I~rn-l,n(z,,, 2n)(.ll ~ I[r
2
'
} (1.68)
"
def
9
CII)
de f
C(l) k(l)
--
~2 1
def ~
/g}l)
de f
i+1
(1.69)
i+1
(1.70)
max 2n_i+2
.
sup 0< VSF,. ((,.)
K If j Q4k,
.......
,j=i,..., (1.71)
"
def
"
C(1)
def
~(l) b(1) ~'2 '~2
1
(1.72)
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems
199
with Fji as follows: = for i = 2 , . . . , n
N T ( z 2 , Z2)Nli(Z~,Z2)
(1.73)
and 473?
NT(zj,Zj)Njj(Zj
Zj)
(1.74)
"/j-- 1 T Zj+ t, 2j+ ~)F yg( Zj+ I, 2~+ ~) F j~(
--
2
"/3-1
gji(Zj+l, Zj+l)gji(Zj+l,
+ 2Nj (Zj+I, T
Zj+l)
~
(1.75) for all j = 2 , . . . , n and i = j + 1 , . . . , n. Finally, let D! l) J
+ $3(l)
--1
T {ATD(s) Bj2k"j" SF, j
--1
T [zlTp(s)
---- B+j+I,1 Aj+lR;11B~2p(l)Fj, ~- R j l
Q(2 ,j + --
[~(l)
---- B-~-j+I,1 [nj+l~jllBff2p(sl)F,j ~- -~jl J ~ j 2 ~ ' ~ j ~ S F , j ~-'~SF, j ] ]
(1.76) The main result of this section is the following. T h e o r e m 1.2 Let Sj, ]/Yj, S j , ~ j C ~ r ~ , j = 1 , . . . , n, be given compact sets with Sj C Sj and YYj = ~/~j. If there exist ~/j E K~+, Rjl C 8~ )r~+' ,
C O .functions P ( ) 5 ' ~r jr)('), ,P()Oij, QoIs() : (0, 11 -~ 8:P~J, j2 E S P pj, j = 1,... ,n and matrix-valued C O functions Eji(., .), 1 < Rq) j _< n, j + l < i < n + l , Eji(','), 2 < j <_ n, j < i < n + l , Nji(.,'), 1 <_j < n, j + l < i < n, Nj~(.,.), 2 < j <_ n, j < i < n, of appropriate dimensions, such that Kj C ~kj •
(A1) .for all l E (0,1] and j = 1 , . . . , n
ATP(l)
p(l) A.
p(l) tBjlBjTt u
AV p(l)
pq) A . D(l) I:~ D T D(l) ['~T [ 1 9 ( I ) ) - I c j 2 < y3(l) j ~ OI,j ~- ~ O I , j " g ~- r O I , j ~ ) j l ~ j l ~ Ol,j -- ~ j 2 \ ~ j 2 -- --"~OI,j (1.77)
~ --T. with N i l ~ Bj, B j 2 = B j K j and r,q) "~j2 > CjlCjl, ( A 2 ) there exists l~" E (0, 1] such that for all l E (0, l~] and j = 1 , . . . , n ,)/2p(z) OI,j -- ~(l) ' SF,j
>. . . . . h p (SF, 0 j
(1.78)
3j2r~(1) 'qcOI,j - Qq) SF, j
> --
(1.79)
hQ SF, (0 j
200
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems .for some h E IR + and, in addition, =
!imII.~[j[ I < b--,13
o
(1.81)
+oc
- - -
.for j = 1, . . . , n;
(A3) for all l 9 (0, 1]
1 ~ IINAZJ+~,ZJ+I)r ~[ ~Y3 j+l<{<,~
II~j(z, ;,+~, t) - Ks~s+~ II2 <-
2
j-Fl~i
and
ii~jp~j-l,l~-9-1,2
SF, j - P ' j - l w J - I , I
IINji(Zj+l'2j+l)~ill2
1 []I~jj(Z,Z~jI[2 -~ E j+l
+ ~
liE#(z,z~f]
(1.83)
j<~<_~+l .ford = 2 , . . . , n
(A4) there e~:ists l~ E (0, 1] such that for all I 9 (0, l~] and j = 1 , . . . , n - 1
IIR;~~BTJ:V~ rsF,/ll ( 1 +
V/~)~--<
max
(,) { ll~j+l I]}
(1.84)
Q(•)> r~ 3
--
Q(~) > sF, j
-
2
(t)
(l)
T
(l)
1673+lkj+l(Dj ) D j (1.85) 3~, 2 ~(z) ,r,(1).,Tc,(l) z'ya+l~j+l~Dd ) D~ (1.86)
and. in addition, .for all l C (0, l~] and whenever
vs~,~(r
<_ c~~)
IL,+,ll _
(1.87)
o_II 1 +
(1.88)
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems
201
.for i = 1 , . . . , n , + k s(0- - l " " k ,?~1 (0 R s - - l , l
2,~-J+~ n ~ 4_ ~ , ~
<-
if2_<j_<s-2,4<s
2ETs(z, z-)El~(z, ~ < -
(1.89)
,-(Or,
2,'
if 3 < s < n + 1
nl~
if s =
4
2
(1.90)
167 = _
7 -1
k(.0
j=2,...,n
(1.91)
j=2,.
(1.92)
J
16~y~/VT(z,~]VJJ( z , z - ' ) 2 73-1
-<
r)(0~SF, j,
. .,n
then there exists a C O .function c() : (0, 1] -+ ~ + and a positive definite quadratic .function V(Zl, a l , . . 9 zn, crn) such that f o r each 5 9 J~rt+ there exist l* 9 (0, 1] such that .for all l 9 (0, l*]
( B 1 ) ~2(l) D 81 x 1471 x .-. x S~ x l/Yn, where f~(O
a~j
{(Vtl,V12,...,Vnl,Vn2) V(Vll , v12,...
(B2)
] ] Z n + l (t)][ < ~ .for all t
9 ~t r' X ~r~ X -.. X ~ ' ~ X ~ "
:
(1.93)
, Vnl , Vn2 ) ~ C (/) }
>_ 0 a n d the t i m e d e r i v a t i v e o . f V ( Z l ,
(Yl, 9 9 9 , zn,
~,~), along the trajectories of (1.55)-(1.56), is negative definite whenever
9 a(')
(1.94)
R e m a r k 1.5 Theorem 1.2 gives a sufficient condition for semigtobally stabilizing (1.55) via arbitrarily bounded measurement feedback. Some key features of Theorem 1.2 consist of allowing for block-state equations and uncertainties in the outputs. As it will be shown in the proof of Theorem 1.2, our design procedures end up with linear controllers and quadratic Lyapunov functions, which can be used together with any systematic design tool such as backstepping or forwarding. It is only a m a t t e r of technicalities
202
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems
([1], [2]) to see that Conditions (A1)-(A4) are satisfied for systems zl
=
z2 + p l x ( Z , u , t )
z2
=
z3 + p 2 1 ( z , u , t )
$n
=
u+p~l(z,u,t)
yj
=
zj + p j 2 ( z , u , t ) ,
j = 1,...,n
(1.95)
where z j , y , u C f t and z = col(z1,... ,Zn) and P j l ( Z , u , t ) and p j 2 ( z , u , t ) , j = 1 , . . . , n, are higher order in ( z j + l , . . . , Zn, u), uniformly with respect to t and z l , . . . , zj (see [18] for state feedback), and, in addition, for systems
y
=
Az+Bu+~l(Z,u,t)u
(1.96)
=
z+~2(z,u,t)u
(1.97)
where ~j(z, 0, t) = 0, j = 1, 2, for all t and z, (A, B) is in Brunowski form, z E ~ n , y E /R n, u E ~ and ~ l ( Z , u , t ) u and ~ 2 ( z , u , t ) u are of at least order one and zero, respectively, with respect to the dilation 51(z,u) = ( l l - n z l , . . . , l - l z ~ _ l , z,~, lu) and uniformly w.r.t, t (see [13] for definitions and related results for state feedback). We shortly illustrate how to satisfy Assumptions (A1)-(A4) through the following example. Let us consider Xl
~
X2
~2
=
sin X3 @ Xl v2
X3
=
V
(1.98)
(see the ball and beam example in [17]). Assume that only xl and x3 (angular and linear positions of the ball and beam) are available for feedback. Assume also that the measure of xl is affected by some error, which depends on t and x3, it is bounded with respect to t and it is zero near x3 = 0, uniformly with respect to t. Our model is finally given by Xl
=
Z2
22
=
sinx3 +XlV 2
X3
:
V
=
Xl +
:
X3
Y2
(X3, t)
(1.99)
with ~1(', ") a C O (unknown) function, bounded with respect to the second argument and zero near z3 = 0. Let zl = (xl,z2) T, z2 = x3, ?'2 = z3 and z3 = v. We have AI=
(0
0
1 ) Bll= 0 '
(0
1
0) 0
'
B12= (0) 1
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems C12 ~ - (
1 0 ),
1 )
Cll = ( 0
203 (1.100)
A 2 = O , B21= ( 1 0 ), B 2 2 = 1 , 6'22=1, C21 = ( 0
1 )
(1.101)
~l(Z,~,t)- Klz2 = ( sinxa- ~2 +) xlz~
o)
~2(z, ~, t) - K2z3 = ( 0 Let 3'1,3'e > 1, 0 < Rjl < 3'2, j = 1, 2, and Q(0
~(0
( le0
SF, 1 = ~ 0 1 , 1
~-
0
) l
Moreover, define 1
1
-~i ~- K~'il
7,),~.
i ---- 1, 2
It is to see that
p s(0
F, 1
l 1+~
l
( 2lv/'~ -2l ) n(t)
po(O
1
and
Q(l) SF,2
_- /~2(p(/) ~2 ~, S F , 2 )
01,2
~
m(o~,e
--
R(l) 22
~-
~1~22)
-- kl 0 1 , 2 1
(1.102) (1.103) (1.104)
_-2D "/2 lt21t,rD(~) l 0 1 , 2 )~-2
satisfies (A1)-(A2) with p(t) S F , 2 to be specified later.
(1.105)
204
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems
Assume that I~bl(Xa,t)] <_ k[x3[ h, h _> 2. Let 51, 52 C s ( sin(@+z'2)-sin~2+sing2-z'2+xl~
be such that )
2
~)1 (X3, t) --~ "1"~1[N122(~2, ~.3)~2 -+- g22z~22 -{- E123(z 1 , z'3)~]
(1.106) ~2 _~.
- 1 (,) (,)2 t IIRll PSF,1G1 II ~21(x3, ) -~ "112
whenever ]?'Jl -< 5j, j = 2,3, for some C O functions N12(',-) and/Y22(', "), with E12, independent of %2, and with E13(z1, z'3) = v/2"111xlzal - Let c[l) = p(0 h with h2 = 2 sup _ {(~+((2_rr2)2}, 5~1) = ~z) = 2/?41-1X/~-22P}F,2(O SF,2 2,
(2 E~2 ;a:aEl4,2
and k (') = 4
4F212 ~ /D(l) ~2
F212(~2, ~'2) sup ( /D(1) ~2) Ir I_
"2~, x SF,2]
where F12(', ") = N12(', '). Moreover, let c~0 = ~2"(z)~(')'~2, 5{') = 2HRll 1BT V -r'SF'I V ci" and
= V P~'~)'a"
(A4) can be met as long as '1j,
Rj,, 5J), RS~, j = 1,2,/Y22(', "), E13(','),
" SF,2 are such that E12, /~22 and p(z) "12 > Rjl
(1.108)
0
<
~,2~(z) l l * 01,1 _ p(0 SF, I
(1.109)
Rj2 5~l)
> <
1 min{Sj,6~ 0}
(1.110) (1.111)
E23(zl,~3)
<
R21k~O
(1.112)
-
8
< -
'112Rll -,~
(1.113)
2-(0
It)"/2/V2
E22
<
Ru
-
8
(1.114)
A -2rP(0 _
(l)
ml Q(s) SF,1
~2
16.12
(1.115)
--
>
•
1R^ 2h(l) { D(l)'~T Fl(1)
(1.116)
~> --
qO~'2b(Z)[r
(1.117)
\ ~ 1 ] *al r
~"I2'~2 k~'l ) ~'1
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems
205
for all l e (0, l*], l* e (0, 1], and whenever
I~ll < I
c[z)p(t)
SF,22 iv(l) _ t~,(t)
p(O
SF,11 SF,22
~2 '
(1.118)
kuSF,121
1(21 -< v~2, 1~21 _< 5[') and [~31 < 5(0, with P~ll) =
/~//(1+'~1 )
1
p(t) 12
--
/~22
~
(1.119) (1.120)
e'El - ~1
(1.121)
Note that (1.108)-(1.117) impose that p(0 SF,2 "-- #v~ for some # C fit +. 4
REFERENCES [1] S. Battilotti, More results on tile semiglobal stabilization of uncertain nonlinear systems via measurement feedback, Proc. o.f the Conference on Decision and Control (CDC'98'), 1998. [2] S. Battilotti, Semiglobal stabilization of uncertain block-feedforward systems via measurement feedback, NOLCOS, 1-3 July 1998. [3] J. C. Doyle, K. Glover, P. P. Khargonekar and B. A. Francis, State space solutions to standard 7-/2 and 7~oo control problems, I E E E Trans. Autom. Contr., 34, 831-847, 1989. [4] F. Esfandiari and H. K. Khalil, Output feedback stabilization of flflly linearizable systems, Internat. Journ. Contr., 56, 1007-1037, 1992. [5] H. K. Khalil and F. Esfandiari, Semiglobal stabilization of a class of nonlinear system using output feedback, I E E Trans. Autom. Contr., 38, 1412-1415, 1993. [6] P. Khargonekar, I. R. Petersen and K. Zhou, Robust stabilization of uncertain linear systems: quadratic stabilizability and Hoo theory I E E E Trans. Autom. Contr., 35,356-361, 1989. [7] P. Kokotovic and R. Marino, On vanishing stability regions in nonlinear systems with high gain feedback, I E E E Trans. Autorn. Contr., 31, 967-970, 1986.
206
1. Separation Results for Semiglobal Stabilization of Nonlinear Systems
[8] Z. Lin and A. Saberi, Robust semiglobal stabilization of minimum ohase input output linearizable systems via partial state and output feedback, IEEE Trans. Autom. Contr., 40, 1029-1042, 1996. [9] S. Morse, Structural invariants of linear multivariable systems, S I A M Journ. Contr. Optim., 11,446-465, 1973. [10] A. Saberi, Z. Lin and A. Teel, Control of Linear systems with saturating actuators, IEEE Trans. Autom. Contr., 41,368-378, 1996. [11] Semiglobal robust regulation of nonlinear systems, Colloquium on Automatic Control, Lect. Notes in Contr. and Inform. Sciences, 215, 27-55, 1997. [12] H. J. Sussmann and P. V. Kokotovic, The peaking phenomenon and the global stabilization of nonlinear systems, IEEE Trans. Autom. Contr., 36, 424-439, 1991. [13] R. Sepulchre, Slow peaking and low-gain designs for global sstabilization of nonlinear systems, 36th IEEE Conf. Dec. Contr., San Diego, CA, December 1997. [14] A. Teel and L. Praly, Tools for semiglobal stabilization by partial state and output feedback, SIAM Journ. Contr. Optim., 33, 14431488, 1995. [15] A. Teel and L. Praly, Global stabilizability and observability imply semiglobal stabilizability by output feedback, Syst. Contr. Lett., 22, 313-325, 1994. [16] A. Teel, Semiglobal stabilizability of linear null controllable systems with input nonlinearities, IEEE Trans. Autom. Contr., 40, 96-100, 1995.
[17] A. Teel, Semiglobal stabilization of the ball and beam using output feedback, Am. Contr. Conf., 1993. [18] A. Teel, Using saturation to stabilize a class of single input partially linear composite systems, Proc. IFAC NOLCOS'92, 232-237, 1992.
Observer-Controller Design for Global Tracking of N o n h o l o n o m i c Systems Zhong-Ping Jiang I and Henk Nijmeijer 2' 3 IDepartment of Electrical Engineering, Polytechnic University, Six Metrotech Center, Brooklyn, NY 11201, U.S.A. 2Faculty of Mathematical Sciences, Dept. of Systems, Signals and Control, University of Twente, P.O. Box 217, 7500 AE Enschede, The Netherlands. 3Faculty of Mechanical Engineering, Eindhoven University of Technology, PO Box 513, 5600 MB Eindhoven, The Netherlands.
1
Introduction
Recent progress on the control of nonholonomic dynamic systems has been summarized in several excellent surveys - see, for instance, [16, 4, 2]. As it is seen from the rather complete list of numerous references in those survey papers, this nonlinear control problem has attracted attention of many researchers. Most of these papers have been devoted to the asymptotic feedback stabilization of some important classes of controllable nonholonomic systems. As it is well-known in the nonlinear control community, according to Brockett's necessary condition for feedback stabilization [1], there is no C 1 (or, even, C ~ state-dependent feedback which can asymptotically stabilize a system in these classes at the origin. As a consequence, the linearization of the system is not controllable, although the original nonlinear system itself is controllable (in the nonlinear sense). Several novel nonlinear control designs have been proposed ranging from time-varying smooth or nonsmooth techniques to discontinuous feedback strategies (see [2, 4, 16] for relevant references). When turning our attention to the tracking problem, the technical difficulty appears to be more tractable because the linearization around the reference trajectory often is uniformly controllable. As a matter of fact, this idea has been pursued in a number of recent work - see, for instance, [13, 22, 23, 6, 2]. Some other approaches are also proposed in [17, 5]. The purpose of this article is to continue our recent work on the statefeedback tracking control of a class of nonholonomic systems in chained form [8, 9, 10], where semiglobal and global tracking results were obtained beyond local results pertaining to the idea of linearization. Here, instead
208
2. Observer-Controller Design for Nonholonomic Systems
of assuming full-state information, we will study the problem of global output-feedback tracking. The reasons why we are particularly interested in nonholonomic systems in chained form are twofold. Firstly, the chained form has proved useful in modelling the kinematics of many nonlinear mechanical systems with nonholonomic constraints. This has been shown in the pioneering paper [20], where this class of systems was originally introduced, and in papers by several other authors where the control of nonholonomic systems in chained form, or extended chained form, is the main issue (see [16] for the details). Secondly, nonholonomic systems in chained form appear to be the largest class of nonholonomic dynamic systems in the literature for which systematic controller design methods are available. Like in our previous papers [11, 7, 8, 9, 10], the main recursive technique we will invoke in this paper is the integrator backstepping which has found applications in several control problems. We refer the reader to [18] for an expository introduction to this technique and several original references. Our control design procedure is divided into three parts: 1. design a Luenberger-like reduced-order time-varying observer; 2. introduce an appropriate system of coordinates and apply backstepping to design an observer-based output-feedback control law u2; 3. we invoke either Jurdjevic-Quinn method or a cascade design idea to complete the design of the other control input ul. It is of interest to note that we are able to allow any saturation level on ul. In a companion paper [19] of the second author, an alternative approach was independently developed on the basis of the main results of [3, 21]. The rest of this chapter is presented as follows. Section 2 briefly formulates the problem of global output-feedback tracking. Then, we introduce a reduced-order time-varying observer in Section 3. In Section 4, two abovementioned methods involving the application of backstepping are presented to design desired output-feedback control laws. In Section 5, an extension of our results to the extended chained-form, or the so-called dynamic model, is discussed in detail with the help of a simple benchmark nonholonomic knife-edge example. The simulation results demonstrate the effectiveness of our methodology. Finally, we close this article with some concluding remarks and follow-up problems for future work.
2
Problem Statement
The problem we deal with in this paper is the output-feedback global tracking for a class of nonholonomic mechanical systems which can be transformed into chained form
2. Observer-Controller Design for Nonholonomic Systems
x2
~
u2
X3
~
X2Ul
"Xn
~
Xn--lUl
y
=
(Xi, Xn) T
209
(2.1)
where x = ( x l , . . . ,xn) is the state, y is the output, u = (ul,u2) is the control input with Ul satisfying the following constraint
lul(t)l ~ Ulmax
(2.2)
with u 1 m a x a positive constant. More precisely, given a desired reference path Xd(t) described by :~ld
~-
ttld
:~2d
~
~t2d
X3d
z
X2dUld
:~nd
-~
X(n-1)dUld
(2.3)
with Ud = (Uld, U2d) as the bounded and continuous time-varying reference input, we wish to design a dynamic output-feedback law u = #(t,y,x),
)~=v(t,Y,X)
(2.4)
which forces x(t) asymptotically track Xd(t) as t ~ ~-CX). We want to find a systematic way to construct such an output-feedback law (2.4) for any given Ulma• > suPt_>0 ]Uld(t)l. It should be mentioned that the time-dependence in (2.4) comes from the reference signals ud(t) and xd(t). In terms of differentially fiat systems theory [6], the o u t p u t y = (Xl, x~) is a fiat output of system (2.1). Clearly, this is the minimal number of state components we require to know so as to solve the output-feedback tracking problem. For many physical systems which can be brought into a form of (2.1) (see, for instance, [20, 16]), ul takes the meaning of a linear velocity; see also the knife-edge example in Section 5. So the condition (2.2) imposes a practically meaningful constraint on the input ul. On the other hand, the saturation condition (2.2) will also help a successful observer design, as shown in the next section.
210
2. Observer-Controller Design for Nonholonomic Systems
Reduced-Order Observer
3
T h e purpose of this section is to introduce a Luenberger-type "observer" for the unmeasured states of the linear time-varying (LTV) ( x 2 , . . . , xn)system of (2.1) if we regard ul as a time-varying signal and xn as the output. Obviously, there does not exist an a s y m p t o t i c observer - - which asymptotically recovers the unmeasured states ( x 2 , . . . , x n - 1 ) - - irrespective of the time-varying function Ul (t). Roughly speaking, this LTV s y s t e m is uniformly observable only if Ul (t) is persistently exciting. For instance, a null function ul (t) renders the unmeasured states unobservable. W i t h this observation in mind, our observer yields an a s y m p t o t i c estimator for the unmeasured states ( x 2 , . . . , X~-l) whenever the input ul is chosen appropriately. A similar idea has recently appeared in the independent work [14] on local observer designs. In order to design a reduced-order observer, introduce the new variables ~1
:
Xn-1
-- k l ( t ) X n
~2
=
x~-2
-
=
x2
~-2
-
k2(t)x~
(2.5)
k,_2(t)x,
where every ki is a real-valued function of time t to be chosen later. Differentiating these variables with respect to time yields ~
~-2
= ~i+lul
+ ki+lx~ul
- ki(~l + klx,~)ul
- kix,~ , 1<
i < n - 3
(2.6)
= ~ - k~_~(~l + k l X n ) ~ , -- ~ _ ~ X ~
This leads us to introduce the minimal-order observer of the type ~ ~n--2
= ~i+lul =
+ k~+lx~ul
- k~(~'l + k l x ~ ) u l
U2 -- k n - - 2 ( ~ ' l Jr k l X n ) U 1 --
- ]~ix~,
1 < i < n - 3
]gn--2Xn
(2.7)
(110 0}
Denote the observation error { = i
(~1,... ,~,~-2) with ~ = ~ i - ~ for 1 <
From (2.6) and (2.7), we have
2.. ~ =
-k2
U1
.
0
.
-k~-3 -k~_2
.
0 0
1
.
-..
0
.
0 0
~"
-.. .-.
(2.s)
1 0
For a given C o bounded function Uld(t) (which will be understood as the reference input signal in Section 4), define h~ as hi(t) = 1 if U l d ( t ) >_ 0 and
2. Observer-Controller Design for Nonholonomic Systems
211
h~ i t) ---- - 1 otherwise. Using this notation, (2.8) is rewritten as
I =
:=
--k2hl klhl
ilUldl ~- h l i U 1 - - U l d ) )
i[Uldl +
h l i U l -- U l d ) )
.
hi0 .
.
0 hi .
... .. 9
00 I
.
-kn-3hl
0
0
.--
hl
-kn-2hl
0
0
"'"
0
Ait)~
i2.9)
Notice that A becomes a time-invariant matrix if the sign of Uldi t) remains unchanged and the ki's are constant. In this case, it is easy to select the k~'s to make A asymptotically stable with A(A) less than any prescribed stability margin A* < 0. This situation happens when the virtual reference object moves either forward or backward along the reference trajectory Xd (cf. [13, 8]). When the sign of Uld(t) is not constant, we can pick the design functions ki's appropriately so that Ait ) is asymptotically stable. To this purpose, notice that hi takes either the value of 1 or - 1 . Let K = ( k l , . . . ,k,~-2) and let ( K + , K _ ) be a pair of vectors which render A(t) asymptotically stable corresponding to the case when hi = 1 and th --- - 1 , respectively. Given a constant matrix Q = QT > 0 and any t _> 0, there is a unique matrix P(t) = pT(t) > 0 such that
A(t)T p(t) + P(t)A(t) = - Q
i2.10)
In particular, Pi t) = P+ if uldi t) > 0 and Pi t) ----P_ if Uld(t) < 0, with P+ > 0 and P_ > 0 solving the Lyapunov matrix equation (2.10) for i K, hi) = iK+, 1) and i K, hi) = i K - , - 1 ) , respectively. The following result shows that our reduced-order observer produces an asymptotic estimate for the unmeasured states i x 2 , . . . ,xn-1) of system i2.1), provided that ul(t) is persistently exciting in the sense of i2.11). P r o p o s i t i o n 2.1 Let Uldi t) be any bounded C o function./full t) is bounded
and there exist two constants to >_0 and ~ = 7(to) > 0 such that
[to+t
1 liminft~oo7 Jto
[Uld(T)[dT > 7,_>toSUp[Ul(T)
-- UId(T)[
i2.11)
--1 --1 and if 7 > Amax(P)Amax(Q)Amin(P))~min(Q), then, .for any "(i0) e ]Rn-2, the solution ~i t) of (2.9) exponentially converges to 0 as t --~ oo.
P r o o f . Consider the positive-definite and radially unbounded function
Vit,"0 = ~Tp(t)~
(2.12)
212
2. Observer-Controller Design for Nonholonomic Systems
Noticing t h a t /5 = 0 almost everywhere (a.e.), the time derivative of V along the solutions of (2.9) satisfies
(z = ( - l U l d ( t ) l -
h,(ul(t)
- Uld(t)))~TQ~
a.e.
(2.13)
Then,
ami.(Q) l~r< _[Uld(t)[Arnax(r)V
ama~(Q)
+ /~min(P--------~lUl(t)- Uld(t)lV
a.e.
(2.14)
Using the variation of constants formula and Gronwall-Bellman inequality [15], it follows from (2.14) that, for all t _> to,
V(t,'~(t))
V(to,~(to))exp(
~ --
~"
)~min(Q) /~max (P) fro" lUla('r)ldr
/~max(Q) (t -- to) sup litl(T) -- Uld(T)I ) (/~Prain -~ "r>to
(2.15)
In view of (2.11), there exist two positive constants 5 and T such that, for all t > T > to,
'[
t -- to
luld(T)ldr -- 7 sup tul(r) -- Uld(r)I >_ 5 )
(2.16)
T__~tl)
From (2.15) and (2.16), we have
V ( t , g ( t ) ) <_ V ( t o , ~ ( t o ) ) e x p ( - 5 ( t - to)),
Vt >_ T
which completes the proof.
(2.17) 9
R e m a r k 2.1 If the left-hand side of (2.11) is positive, this "persistent excitation" condition can be met via two different ways: (i) design a s a t u r a t e d single-input Ul with the saturation level as small as possible; (ii) design a (not necessarily saturated) control ul so t h a t U1(t) -- Uld(t ) tends to 0 as t ~ oo. In b o t h cases, ~ = ")'(to) can be made large for some to > 0. We develop this idea in the next section.
4
Output-Feedback Design
Using the observer (2.7), we will design a dynamic output-feedback law (2.4) to drive the tracking error x(t) - xa(t) to zero. To this end, we notice that x, =
,~\_, + g,~_, + k,,_,(t)x,~
v2 < i < n - 1
(2.18)
2. Observer-Controller Design for Nonholonomic Systems
213
Introduce the new variables ~1
~
X n -- X n d A
i~ =
~-1 - (x(n-~+~)d-- k~-~Xnd)
~n
X l -- X l d
~-
V2 < i < n -
1
(2.19)
Let 9 = ( x 2 , . . . ,xn), ~d -= (X2d,... ,Xnd) and ~---- (~1,... ,I,~-1). If all signals are bounded and the conditions of Proposition 2.1 hold, then (2.19) together with Proposition 2.1 implies that ~(t) --'2d(t) converges to 0 if and only if ~(t) converges to 0. Notice that, when ki is selected as in Section 3, ]~ = 0 a.e. for all 1 < i ~ n - 2. Differentiating the variable ~ along the solutions of (2.1)-(2.3) yields
~1 :
(~2 + ]~l~l)Uld -~- ~1Ul ~l- (~2 "~- X(n--1)d -~- ]gl;1)(Ul -- Uld)
(~i+1 + k i l l -- ki-1(~2 + k l ~ l ) ) U l d -~- [~i+1 -t- ki~l - k i - l ( ~ 2 ~- k1~1) -+- X ( n - i ) d -- k i - l X ( n - 1 ) d ]
~n--1 ~n
it2 -- kn--2(~l ~- ]r
--
(Ul -- U l d )
(U2d -- k n - 2 X ( n - 1 ) d l t l d )
Ul -- Uld
(2.20) In the sequel, we will apply the backstepping approach to the transformed system (2.20) in order to design two desired output-feedback control laws u subject to (29 The first one is based on a combined application of backstepping and Jurdjevic-Quinn methods. The second one is a mixture of backstepping and cascade designs.
4.1
Backstepping-Based
Trackers
Noticing the lower-triangular structure of the (~1,... , ~n_l)-subsystem of (2.20) with u2 as the input, the backstepping technique will be first applied to design the control u2. Then, the design of the single-input ul is carried out via the Jurdjevic-Quinn m e t h o d [12]9
Step 1 : Begin with the ~l-subsystem of (2.20) with @ viewed as the virtual control. Let zl = ~1 and write the ~l-subsystem in more compact form Zl = (~2 -~ ~21(~1)) Uld -t- ~lUl -~- r (t, ~1, ~2)(ul - Uld)
(2.21)
Consider the quadratic function
yl =
(2.22)
214
2. Observer-Controller Design for Nonholonomic Systems
We have
+ zl~lUm + Zlr
1/i = zl (r + r
~1, ~2)(ul - uld)
(2.23)
P e r f o r m the change of variable z2 = ~2 - al(Uld 2t-1 , z l ) where O~1 = - - C l ~ t l2t-1 d Zl - r
(2.24)
with Cl > 0 a design p a r a m e t e r and l > n - 2 an integer. Then, (2.23) implies Yl = --~lUldZl ~ 2l 2 + zlz2Uld + z1~1u1 + Zlr
~1, @)(ul -- Uld)
(2.25)
For later reference, the z2-dynamics satisfy "~2
~"
(~3 -r-W2\ -- .I. [U2l--1 "~ /" ~ U ld"t-~2~, -I t )~1 ~y U1 ld ' U2l--3~ ld ld, t~l,~2}) -[-r ( t, ~I, ~2, ~3)(Ul -- Uld)
(2.26)
where qo2(t) is d e p e n d e n t on Uld(t) and kl.
Step i (2 < i < n - 2) : Assume that, at Step i - 1, we have designed i - 1 virtual control functions aj (1 < j < i - 1) and o b t a i n e d new variables , 2/--1
2(/--j)+1
zj+l =~j+l --aj(Uld , . . . ,Uld assumed that, for all 1 _< j _< i, 2:j
:
(~j
--,
, 2/--1
-t-1-i-~ji, Uld
(j--l)
Uld
2t-a-
,Uld
Uld,...
,~l,.-.,~j).Furthermore, (j-l)
2(l-j)+l ,t~ld
t~ld
f
,~1,...
,
itis
Cj))
Uld
/
%
+~j(t)~lUl + Cj(t, ~1,--. , ~j+l)(Ul - Uld)
(2.27)
W i t h respect to the solutions of (2.27), the time derivative of V/_I ---- l z l 2 ~- . . . L
-1- l z 2
2
-
1
(2.28)
satisfies i--1
~-1
-- E
i--1 CjUldZj ~ Zi--1Zi~ld
j=l
j=l
i-1 "~ E
ZjCj@, ffl,.--,
f f j + l ) ( U l -- Uld)
(2.29)
j=l
where ~ol(t) = 1. We wish to prove t h a t the above properties hold for the (fix,... ,ff~)subsystem with ffi+l considered as the virtual control. To this end, consider the quadratic function t~ = V i _ l ( Z l , . . . , z~-l) + l z ~
(2.30)
2. Observer-Controller Design for Nonholonomic Systems
215
In v i e w of (2.27) and (2.29), differentiating V~ w i t h respect to t i m e yields i--1
v,-
i--1
CSUldZj + E 5=1
i--1
ZJ~J(t)~lUl Ar E
j=l
+Zi (~iq-1 -~- r
Uld i
~- Zi~gi~lUl ~- Zir
2/--1
Letting Zi+l -- ~i+1 - aikUld 2/-1
a i = --C~Uld
ZJCj(Ul -- u l d ) + Zi--lZiUld
5=1
~
,.."
,"
-- Uld)
2(l-i)-F1 (i--1) , Uld Uld
2/--1
zi -- z i - 1 -- W~tUld
2(/--i)-bl
,...
,Uld
, ~1,
(2.31)
9 - 9 , ~i)
where
( i - 1 ) .~
,ql,...
Uld
,~i)
(2.32)
w i t h ci > 0, it follows from (2.31) that i
.o
=
i
V'
_ z__ c s u l d21 z5
2
+
ZiZi+lUl d
+
5=1
i
E ZJ~J~Ul "~ E ZJr j=l 5=1
Uld)
(2.33) S t e p n - 1 : At this step, w e c o n c e n t r a t e on the design of the true control u2. Consider the L y a p u n o v function candidate
1 2
Vn-1 = V n - 2 ( Z l , . . . , zn-2) -[- ~Zn_ 1
(2.34)
F r o m Step n - 2 and (2.20), it holds n--2
yn_l
=
n--1
-- ~ CjU211dZy-~ Zn_2Zn_lUld -~5=1 5=1 n--1
+
zsr
-
ld) + Z _l (u2 +
zs~#~lu1 n-1)
(2.35)
j=l
w h e r e #J~-I is a function d e p e n d e n t on (t, u l , ( 1 , . 9 9 , ~,~-l), r is a function of (t, ~ 1 , . . . , ~n-1) and ~ n - 1 is a function of U l d ( t ) and its derivatives up to order n - 2. B y choosing the control law u2 as
U2 = --Cn--1U211dZn--1 -- Zn--2Uld -- C n - - l ( t , U l , ~ l , . . - , ~ n - - 1 )
(2.36)
w e obtain n--1
Yn-1 : - E 5=1
n--1
Cjlt2ldZ2 -~ E
n--1
ZJ~PJ~Ul AF E
j=l
ZJCJ(Ul - uld)
(2.37)
5=1
S t e p n : In order to design a control law for u l , let us consider t h e L y a p u n o v function
Vn = V n - l ( Z l , . . .
1 2 , Zn-1) -}- "~n
(2.38)
216
2. Observer-Controller Design for Nonholonomic Systems
Along the solutions of (2.20), with (2.37), the time derivative of Vn satisfies -
=
--
CjUldZ j -Ij=l
ZjqPj~lUl -Jr- ~n -["=
Zjq~j
(U 1 - -
Uld ) (2.39)
j=l
This leads us to choose the control law Ul as
U 1 = Uxd -- O" ~n -[-
Zjr
(2.40)
j=l
where a in C 1 is a saturation function such t h a t or(r) = r for small signals r, rot(r) > 0 for all r e IR \ {0} and s u p r e ~ Icr(r)l = (~m < OO. The saturation level crm is selected to meet the input constraint (2.2) with U l m a x > supt_>O {Uld(t)[ : = U l d , m a x . Under this choice (2.40), (2.39) gives
2 nl ~'~n ~- -- E j=l
CJUldZj "~ E
( Zj~Dj
--
nl ~n ~- E
j=l
)( Zj(/)j
nl (7
~n 71- E
j=l
) Zj~)j
j=l
(2.41) Finally, we are in a position to formulate the following tracking result. P r o p o s i t i o n 2.2 A s s u m e that the reference trajectories Xid(t) (2 _< i _~ n) and reference input ud(t) are bounded. It is .further assumed that the derivatives of uld(t) up to order n - 2 are bounded on [0, cx~). If there exists a constant eu > 0 such that liminf lUld(t)[ > t--*~
(2.42)
~u ,
then, cym can be tuned towards any level of size ul m~x - Uld,max SO that, .for any initial tracking error x(O) - xd(O) C IR n and any initial condition ~(0) E IR '~-1, the trajectory ( x ( t ) - - x d ( t ) , ~(t)) of system (2.1), (2. 7), (2.36) and (2.r is bounded with the .following properties lim Ix~(t) -~,~_~(t) - k~_i(t)x,~(t)l
=
0
lim Ix(t) - xd(t)[
=
0
t----*oO
t---*oO
Furthermore, the convergence rate in (2.r
V2 < i < n -
1 (2.43) (2.44)
and (2.r162 is exponential.
P r o o f . We first prove the boundedness property. By assumptions, we can choose a sufficiently small constant am such t h a t the conditions of Proposition 2.1 hold. As a consequence, the observation error ~(t) exponentially
2. Observer-Controller Design for Nonholonomic Systems
217
converges to 0 and thus the property (2.43) is satisfied. We can rewrite the ~-system (2.20) in more compact form
4
=
f(t,-~)
~,
=
-c~
(2.45) 4- + ~ z 5 r
(2.46)
j----1 It is directly checked that f is linear in ~ for any fixed t. Therefore, the closed-loop solutions ~(t) = (~(t),4~(t)) and x(t) - xd(t) do not exhibit finite escape. Given a positive constant e, by means of the Schwartz inequality, (2.41) gives
%
1
< -
,v'rc5
j=l n--1
(n l )
-
-
+ Z
j=l
zs,
n--1
/
o- 4n + y-s zsr j
j=l (2.47)
j=l From (2.42), there exist a time instant to > 0 and design parameters cj's such that
cju~(t) - s > 0 Vt >_ to
(2.48)
Then, it follows from (2.47) that n--
1
Vn(z(t)) < V,~(z(to))+j~lftoll~3gluil2dT.=
t
(2.49)
With the help of Proposition 2.1, (2.49) completes the proof of the first statement since the ~j's and ul are bounded. Back to the inequality (2.47), using the fact that ai = 0 if 41 . . . . . 4i = 0, a direct application of Barb~lat lemma [15] and Proposition 2.1 yields the last statement (2.44). 9
4.2
A Modification
In the first step, the Jurdjevic-Quinn method was used to design a controller ul to diminish the effect of the (Ul - Uld)-related terms on the (~1,... , 4n_l)-subsystem of (2.20). In this subsection, we pursue the line of a cascade design. T h a t is, we design ul in such a way that xl(t) - Xld(t) converges to zero, regardless of the (41,... ,4,~-l)-Subsystem design. For
218
2. Observer-Controller Design for Nonholonomic Systems
instance, looking at the ~ - s u b s y s t e m of (2.20), we can simply choose the controller Ul as U l = U l d -- O ' ( ~ n )
(2.50)
where (7 is a saturation function as defined above. The global output-feedback tracking result is stated below. P r o p o s i t i o n 2.3 Assume that the re.ference trajectories Xid(t) (2 _~ i _~ n) and re.ference input ud(t) are bounded. It is .further assumed that the derivatives o,f u d(t) up to order n -
2
bounded on
[0,
If
there exist
an integer I >_ n - 2 and a constant ~ > 0 such that 1 fto+t litra~inf ~ Jto lUId(T)IdT
>
0 Vto _> ~
liminf i f t---~oo
t~
lUld(V)12ldT
>
(2.51)
0
t Jto
then, .for any initial tracking error x(O) - X d ( O ) E ]R n and any initial condition ~(0) C IR ~-1, the trajectory (x(t) - x d ( t ) , ~ ( t ) ) of system (2.1), (2.7), (2.36) and (2.50) is bounded. Furthermore, (2.43) and (2.4~) hold with exponential convergence. P r o o f . As in the proof of Proposition 2.2, we can prove that the closed-loop trajectories do not exhibit finite escape. Thanks to the choice (2.50), the closed-loop solution ~ ( t ) satisfies ~,~ = -c~(~,~)
(2.52)
and converges to zero when t --~ cx). Moreover, there exist a finite time instant t ~ (probably dependent on the initial condition ~n(0)), two positive constants Pl (dependent on the initial condition ~,~(0)) and P2 (independent of the initial condition ~n(0)) such that I~n(t)l _~ Pl e x p ( - p 2 t )
Vt > t~
(2.53)
Notice that t ~ -- 0, pl --- [~n(0)l and P2 = c~0 if (r(r) = (~0r for c~0 > 0. Without loss of generality, we may assume that Pl e x p ( - p 2 t ) is so small for t > t ~ that c~(~n(t)) = ~n(t) and lUl(t) -- Uld(t)l <: pl exp(--p2t)
~/t >_ t ~
(2.54)
With the aid of (2.54), pick a sufficiently large to >_ t ~ so as to check (2.11) and (2.51). As a result, by Proposition 2.1, I~(t)l _< ql e x p ( - q 2 t )
Vt _> to
(2.55)
2. Observer-Controller Design for Nonholonomic Systems
219
where ql > 0 is a constant which depends on ~(0) and q2 > 0 is a constant which does not depend on g(0). Let us now look at eq. (2.37). By virtue of (2.54) and (2.55), noticing the fact that every Cj is overbounded by a~l + ai21zl with (hi1, a~2) a pair of positive constants, there exist positive constants c, al, a2, bl and b2 such that 1
V,~-I <_ --cu2ldV~-l(z) + a(t)Yn-l(z) + b(t)Y~_l(z)
(2.56)
where a(t) and b(t) are two time-varying signals satisfying
]a(t)l Ib(t)l
<_ al exp(-a2t) ~ < blexp(-b2t) J
Vt > to
(2.57)
From (2.56) and (2.57), Proposition 2.3 follows readily. Indeed, (2.56) im2 plies that W(z) = V~ l(z ) satisfies
< -O hC l w(z) + O ha(t)w + 0 5b(t)
(2.58)
The rest of the proof goes like the proof of Proposition 2.1. R e m a r k 2.2 It should be mentioned that Proposition 2.3 can be extended to a simplified dynamic extension of (2.1). That is, it is composed of (2.1) and two integrators ~il = Vl, /t2 = v2, with (Vl,V2) considered as the new control. In the next section, we illustrate this extension via a simple benchmark mechanical system with nonholonomic constraints.
5
Example: A Knife-Edge
We illustrate the presented output-feedback design methodology with the help of a simple knife-edge, which moves on the plane and has often served as an elementary illustrative example for theoretic studies on nonholonomic control systems. We refer the reader to [17] for details on this system. The knife-edge dynamics satisfy the following differential equations [17]: ~ ~
_
-
-
3' s i n r 1 6 2 m m T1 3" c o s r 1 6 2 m
xcsinr
=
m
(2.59)
~)ccosr
where (xc, Yc) denotes the coordinates for the center of mass of the knifeedge, r denotes the heading angle measured from the x-axis, and ~'1 is the
220
2. Observer-Controller Design for Nonholonomic Systems
pushing force in the direction of the heading angle, T2 is the steering torque about the vertical axis through the center of mass. The constants (m, Ic) are the mass and the moment of inertia of the knife-edge respectively, and 7 is the scalar constrain multiplier. Note that the fourth-equation in (2.18) represents the nonholonomic constraint on the velocity of the knife-edge system. It was shown in [9] that, after a suitable transformation of coordinates and state feedback, the system (2.59) can be brought into Xl
=
x4
x2
=
x5
X3
=
X2X4
X4
=
Vl
x5
=
v2
(2.60)
As in [17], consider the reference trajectory cr(t)=t,
xrc(t)=sint,
yrc(t ) = - c o s t
Vt > 0 ,
(2.61)
which corresponds to the center of mass of the knife-edge moving along the circle centered at the origin of unit radius with uniform angular rate. In the transformed x-coordinates, the desired trajectory is: Xld(t )
:
t,
X2d(t) = 0,
X4d(t)
=
Utd(t) = 1 ,
X3d(t) = 1 ,
X5d(t) =U2d(t) = 0 .
(2.62) (2.63)
For this particular reference trajectory (which is a straight line in the new xcoordinates), a global state-feedback tracking control law has been derived in [9] via a recursive approach. For simulation purposes, we recall that the coordinate and feedback transformation leading to (2.60) is Xl
=
r
X3
=
x~sinr162
X5
=
k, cos r + $c sin r + r
x4= sin r + y~ cos r
(2.64)
T2
Vl V2
x2=x~cosr162
m
=
7-1
-- + m
T2
. ( - x c sm r + Yc cos r
q~2(xccos r + yc sin r
For the new system (2.60), we consider y = (Xl, x3) as the o u t p u t and assume that the other states (x2, x4, xs) are unavailable to the designer.
2. Observer-Controller Design for Nonholonomic Systems
221
We first introduce an observer to reconstruct the unmeasured state x4. Introduce a new variable ~a = x4 - k3xl with ]r > 0 a design parameter, which satisfies
~a=-ka(a- k~xl+vl
(2.65)
Then, the reduced-order observer is introduced .2.
~3 = -ka~'3 - k 2 x l + Vl
(2.66) ,&
which leads to an exponentially stable linear dynamics (3 = -k3~3 where
5=
3-g3.
Consequently, the unmeasured state x4 = ~a + k a x l + ~'a can be exponentially recovered via the observer (2.66). Next, we turn to the observer design for the unmeasured states (x2, xs). Guided by the development in Section 3, introduce the new variables (l = x2 - klx3
,
(2 = x5 - k2x3
(2.67)
where K = (kl, k2) is a vector of design parameters, which are constant here because the sign of Uld = X4d : 1 does not change. Direct computation yields ~1
=
(2+k2xa-kl((l+klx3)x4
42
=
V2 -- k 2 ( ~ l - + - k l x 3 ) x 4
(2.68)
Since x4 is not measured in the present case of e x t e n d e d chained form (2.60), in contrast to the observer (2.7) for the chained form case, the following observer is introduced in which the estimate x4 := ~'3 + k 3 x l of x4 is used in place of x4: ~1
~-
~ 2 - I - k 2 x 3 -- ~ l ( ~ ' l - } - k l X 3 ) ~4
~2
=
v2 - k 2 ( g l + k l x 3 ) ~ 4
(2.69)
Letting ~'1 = ~1 - ~'1 and ~-2 -= ~2 - ~ , (2.68) and (2.69) imply
(kl
--,-k20
-
k2
(~1 (x4 - 1 ) + (~'1 + klXa)g3)
(2.70) Clearly, we can pick two parameters kl and k2 such that A is an asymptotically stable matrix. For simulation use, take kl = 2 and k2 = 1. T h e second term in the last brackets of (2.70) is new comparing with the chained form
222
2. Observer-Controller Design for Nonholonomic Systems
case, a special class of the so-called Chaplygin (kinematic) form. Bare in mind that the first term depends upon x4 which is a state component and thus, unlike in the case of chained form, is not free to choose. These terms together prevent us from applying Proposition 2.1 in order to conclude the exponential convergence of the observation error (~1, ~2). Nevertheless, as we will show below, we can still design an output-feedback control law to achieve the global tracking task. Before designing such a controller, we introduce a system of coordinates under which our synthesis is developed N
~1 ~-- X 3 -- X 3 d ,
~3 :
r
~2 ---- ~1 -- ( X 2 d - -
~2 -- ( X 5 d -- k 2 X 3 d )
= r
,
klX3d)
~4 --~ Xl -- X l d
(2.71)
-- (~4d -- k 3 ~ )
Then,
/
r
=
(r + k1r
+ ~1x4 + (r + k1r
+ (r + k1r162
~2
:-
~3 "[- ]g2~1 -- kl(~2 -[- k l ~ l ) ( 1 at- ~5 at. k3~4)
~3
=
v2 - k2(r
+ k3r
+ k1~1)(1 + r + k3~4)
(2.72) where x4 ---- 1 + r + k3r + ~'3
(2.73)
Notice that the states of system (2.72) are measured and available for feedback design. If ~'(t) - ~(t) goes to 0 as t --~ oo, then ~(t) converges to 0 if and only if x(t) - xd(t) does. In other words, we have converted the global output-feedback tracking problem into a global state-feedback regulation issue. In the sequel, the design of our desired dynamic output-feedback controllers vl and v2 will be developed according to the second m e t h o d proposed in Section 4.2. First, we observe that the (~4, ~s)-subsystem of (2.72) can be easily made GES (globally exponentially stable) at the origin. Indeed, a direct application of integrator backstepping generates a Lyapunov function candidate
1 2 1+
W1 -- ~
where c4 > 0 is a design parameter.
(@ + k3~4 + c4~4) 2
(2.74)
2. Observer-Controller Design for Nonholonomic Systems
223
To render IPdl nonpositive when ~3 = 0, we are led to choose the control law vl = -c5(r
+ k3r + c4~4) - r - c4(r
+ k3r
c5 > 0
(2.75)
which gives
r162 = -c4r
- c5(r + k 3 6 + c4r 2 + [r + (kz + c4)(r + k3r + ~4r
~-3
Now, consider the quadratic Lyapunov function 1~
(2.76)
Then, we have
W2
=
--C4r
-- C5(r ~- k3r 4 -[- c4r
2 - k3~"2
4-r
Jr- (k3 -[- c4)(r
~- c4r
-~- k3r
(2.77)
Hence, 1)r is negative-definite if c4 > 0.5e -1, c5 > 0.5e-l(k3 + c4) 2 and k3 > c, with e > 0 being arbitrary. In addition, given any c > 0, we can select the design parameters c4, c5 and k3 appropriately such t h a t
~/i/'2(r
r
~'3) < -cW2(r
r
~"3)
(2.78)
From (2.78) and (2.76), it follows that there exist two positive constants 51 and 52 such t h a t
1(r
I < 511(r
)
(2.79)
Next, it remains to design a suitable control law for the input v2. As above, we approach this goal by an application of backstepping to the (r r r subsystem of (2.72). Without going into details, a direct application of backstepping generates a Lyapunov function
where zl = r
z2 = @ - a ~ l ( r
Ot1
=
--(C 1 + kl)r
0~2
=
--C2Z2
--
Zl
z3 = r
, --
r
- 0t2(r
r
r
and
C1 > 0
k2r
- c1(r
(2.81) -[- k1r
q- r A- k3r )
(2.82)
If we choose the control law V2 :
Oa2 ~t -
--C3Z3-- Z2 -4- (k2 -4- ~ 1 }t,~2 A- klr
+0(~2,. "HT-~((,3 +/;2r ~,2 . OOL2 - *
+ ' ~ ' 4 ((,5 + k3r
-- k1(r 00~2
+ k1r
q- r -4- k3r A- r + k3r 2
+ -~--. (vl - k3r - k3r
(2.83)
224
2. Observer-Controller Design for Nonholonomic Systems
with c3 > 0 a design parameter, it follows t h a t
~0~2 ~--
- - e l z2 -- C2z2 -- C3 z2 -{- Zl(~2 -~- ]g1~1)(~5 -~- k3~4) -- z 3 " ~ 4 ~3
C90~2
+(zl + (cl + kl)z2 - ~ - 1 z3)(~lx4 + (r + k1r
(2.84)
Since the matrix A in (2.70) is asymptotically stable, there exists a unique solution P = p T > 0 to the Lyapunov m a t r i x equation
P A + A T p = --I2
(2.85)
w h e r e / 2 is the 2 x 2 unit matrix. Consider now the quadratic function
V(zl,z2,z3,~l,~2)
-~- Y 3 ( z l , z 2 , z 3 )
7t- (~1, ~2)P(~l,
~2) T
(2.86)
In view of (2.71), (2.73) and (2.84), from (2.79), it follows the existence of a positive constant ~ > 0 and two exponentially converging signals a(t) > O, b(t) >_ 0 such t h a t
f/ < _ - ( ~ - a ( t ) ) V + b ( t ) ,
Vt>O
(2.87)
From (2.87), like in the proof of Proposition 2.1, we can invoke the variation of constants formula and Gronwall-Bellman inequality to conclude the exponential convergence of V(zl (t), z2 (t), z3 (t), ~l(t), ~2 (t)) and, therefore of the tracking error x(t) - xd(t), to 0 as t goes to ~ . The simulations in Figure 1 were obtained with the following values of design parameters and initial conditions
kl = 2 ,
k2 ----k3 ~- 1 ,
r
Cl ----c2 = c3 = c 4 = 1, r
1(0) =
2(0) =
c5 = 3,
(2.88)
3(0) = 1
The responses indicate t h a t the tracking error exponentially converges to 0 under mild control effort.
2. Observer-Controller Design for Nonholonomic Systems (xc(t}-sin(t),
1.
y c(I)+cos(t),
~,(t)-t)
",
x (t)-~n(t) y~(tI+cos(t)
. ................
-0 5
9 0
2
4
8
t
8
J
10
225
ll2
1'4
12
14
J
16
i
18
secs
2
~o
i
f....
......
"
9...........
I .................. % 6
8
10
16
18
sees
FIGURE 1. Global output-feedback tracking of the knife-edge (2.59).
6
Conclusions and Future Work
The problem of global output-feedback tracking was addressed for a class of nonholonomic systems in this paper. T h e presented design methodology is a natural extension of our recent state-feedback tracking algorithms proposed in [8, 9, 10]. More specifically, when considering a flat o u t p u t of a chained-form system in this class as the only accessible measurements, we first design a Luenberger-like reduced-order time-varying observer in order to recover the remaining unmeasured states. Under a condition of persistant excitation on the reference input Uld(t), the observation error was shown to converge to zero at an exponential rate if ul is chosen appropriately. Then, based on this observer and using the chained form structure, two constructive methods involving the backstepping technique have been proposed to design desired output-feedback tracking controllers. An extension to the simplified dynamic model was discussed via a simple b e n c h m a r k nonholonomic knife-edge system. It is of interest to mention t h a t an arbitrary s a t u r a t i o n level can be imposed on the control input ul. However, we are unable to extend our approaches to cover the case when the other control input u2 is saturated. T h e chained form represents a good model for m a n y nonholonomic mechanical systems in the ideal case, t h a t is, when the disturbances are ignored. However, almost all physical systems are subject to some kind of disturbance. It turns out to be necessary to examine the robustness of the global tracking p r o p e r t y which was guaranteed by our current trackers. In short, the following problems are meaningful from a practical point of view and deserve our further investigation:
226
2. Observer-Controller Design for Nonholonomic Systems
1. What happens if all control inputs of a nonholonomic system in chained-form are subject to some L~-type constraints? In relation to the knife-edge example, further difficulties arise here in the boundedness of the controller for the dynamic extension of the considered chained models. 2. In case when uncertainties occur in nonholonomic mechanical systems, how do we give a good mathematical description of these uncertainties? If the nominal system is transformable into a chained form, how will these uncertainties affect the coordinates and feedback transformation and the stability obtained from the undisturbed chained form? We are also interested to know how to modify our proposed tracking controllers in [8, 9, 10] and in this paper in order to maintain stability properties in the presence of uncertainties. 3. Experimental work ought to be done on some laboratory-type robots so as to test the effectiveness of our proposed tracking approaches. 7
REFERENCES [1] R. W. Brockett, Asymptotic stability and feedback stabilization, in: R.W. Brockett, R.S. Millman and H.J. Sussmann, eds., Differential Geometric Control Theory, pp. 181-191, 1983. [2] C. Canudas de Wit, B. Siciliano and G. Bastin (Eds), Theory of Robot Control. London: Springer-Verlag, 1996. [3] M.-S. Chen, Control of linear time-varying systems by the gradient algorithm, Proc. 36th IEEE Conf. Dec. Control, pp. 4549-4553, San Diego, 1997. [4] J.-M. Coron, Stabilizing time-varying feedback, NOLCOS'95, Tahoe City, CA, pp. 176-183, 1995. [5] G. Escobar, R. Ortega and M. Reyhanoglu, Regulation and tracking of the nonholonomic double integrator: A field-oriented control approach, Automatica, 34, pp. 125-131, 1998. [6] M. Fliess, J. Levine, P. Martin and P. Rouchon, Flatness and defect of non-linear systems: introductory theory and examples. Int. J. Control, 61, pp. 1327-1361, 1995. [7] Z. P. Jiang, Iterative design of time-varying stabilizers for multi-input systems in chained form, Syst. Contr. Letters, 28, pp. 255-262, 1996. [8] Z. P. Jiang and H. Nijmeijer, Tracking control of mobile robots: a case study in backstepping, Automatica, 33, pp. 1393-1399, 1997.
2. Observer-Controller Design for Nonholonomic Systems
227
[9] Z. P. Jiang and H. Nijmeijer, A recursive technique for tracking control of nonholonomic systems in chained form, to appear in: IEEE Trans. Automat. Control, Feb. 1999. [10] Z. P. Jiang and H. Nijmeijer, Backstepping-based tracking control of nonhotonomic chained systems, Proc. European Control Conference, 1-4 July, 1997, Brussels. [11] Z. P. Jiang and J.-B. Pomet, Combining backstepping and timevarying techniques for a new set of adaptive controllers, Proc. 33rd IEEE Conf. Dec. Control, pp. 2207-2212, Florida, I994; also in: Int. J. Adaptive Contr. Signal Processing, vol. 10, pp. 47-59, 1996. [12] V. Jurdjevic and J.P. Quinn, Controllability and stability, J. Diff. Eqs., 28, pp. 381-389, 1979. [13] Y. Kanayama, Y. Kimura, F. Miyazaki and T. Noguchi, A stable tracking control scheme for an autonomous mobile robot, Proc. IEEE 1990 Int. Conf. on Robotics and Automation, pp. 384-389, 1990. [14] W. Kang and A. J. Krener, Nonlinear observer design, a backstepping approach, preprint, 1998. [15] H. K. Khalil, Nonlinear Systems. Prentice Hall, Upper Saddle River, N J, 2nd edition, 1996. [16] I. Kolmanovsky and N. H. McClamroch, Developments in nonholonomic control problems, IEEE Control Systems Magazine, Vol. 15, No. 6, pp. 20-36, 1995. [17] I. Kolmanovsky and N. H. McClamroch, Hybrid feedback laws for a class of cascaded nonlinear control systems, IEEE Trans. Automat. Control, 41, pp. 1271-1282, 1996. [18] M. Krstid, I. Kanellakopoulos and P. V. Kokotovi6, Nonlinear and Adaptive Control Design. New York: John Wiley & Sons, 1995. [19] E. Lefeber, A. Robertsson and H. Nijmeijer, Output feedback tracking of nonholonomic systems in chained form, preprint, October 1998. [20] R. M. Murray and S. Sastry, Nonholonomic motion planning: steering using sinusoids, IEEE Trans. Automat. Contr., 38, pp. 700-716, 1993. [21] E. Panteley and A. Loria, On global uniform asymptotic stability of nonlinear time-varying systems in cascade, Systems 8z Control Letters, 33, pp. 131-138, 1998. [22] C. Samson and K. Ait-Abderrahim, Feedback control of a nonholonomic wheeled cart in Cartesian space, Proc. of the 1991 IEEE Int. Conf. Robotics and Automation, Sacramento, pp. 1136-1141, 1991.
228
2. Observer-Controller Design for Nonholonomic Systems
[23] G. Walsh, D. Tilbury, S. Sastry, R. Murray and J. P. Laumond, Stabilization of trajectories for systems with nonholonomic constraints, IEEE Trans. Automat. Contr., 39, pp. 216-222, 1994.
A Separation Principle for a Class of Euler-Lagrange Systems Antonio Lorfa v and Elena Panteley t ~C.N.R.S., UMR 5228, Laboratoire d'Automatique de Grenoble, ENSIEG, St. Martin d'H~res, France. *I.N.R.I.A., Rh6ne Alpes, Projet BIP, St. Martin d'H~res, France.
1
Introduction
The solution to the state feedback tracking control problem of fully damped Euler-Lagrange systems (in particular, rigid-robot manipulators) has been known from many years now - for a literature review, see e.g. [22, 27] -. Nevertheless, a drawback of many of the available results in the literature is that they require the measurement of joint velocities which may be contaminated by noise. An ad hoc solution, often taken in practice, is to numerically differentiate the joint positions. However, it has been shown experimentally [2] that this method is inefficient for high and slow velocities. This has motivated researchers in the robotics community to solve the global output feedback control problem of robot manipulators. This problem has been open for many years now. As in the regulation control problem, an approach alternative to numerical differentiation, that has been widely considered in the literature, is to design an observer that makes use of position information to reconstruct the velocity signal. Then, the controller is implemented replacing the velocity measurement by its estimate. Even though the certainty equivalence does not apply for general nonlinear systems, the rationale behind this approach is precisely that the estimate will converge to the true signal, and this should in turn entail stability of the closed loop. As far as we know, some of the earliest works on state estimation for robot manipulators are [20, 17] and some of the references therein. See also [21] for some interesting experimental results. In [17] the authors used a nonlinear observer that reproduces the robot dynamics, in a P D plus gravity compensation scheme. The authors prove the equilibrium is locally asymptotically stable provided the observer gain satisfies some lower bound determined by the robot parameters and the trajectories error norms. See also [7] where a sliding mode approach is taken.
230
3. A Separation Principle for a Class of Euler-Lagrange Systems
The authors of [4] proposed a linear observer-computed torque scheme which exploits the feedback linearizing property of the computed torque scheme providing an efficient tuning technique. Later, using the same tuning idea [3] presented a systematic procedure that exploits the passivity properties of robot manipulators into the design of controller-observer systems to solve both the position and tracking control problems. Local asymptotic stability was proved for sufficiently high gains. Later in [16], based on a computed torque plus PD-like controller first appeared in [27], we added an n-th order "approximative differentiation filter", to eliminate the necessity of velocity measurements. In t h a t paper we proved semiglobal asymptotic stability of the closed loop system hence showing that the domain of attraction can be arbitrarily enlarged by increasing the filter gain. Some more recent results addressing the same problem are for instance: [11, 18], and [19]. The authors of [11] proposed the first adaptive controller for flexible joint robots by using only position measurements. Simultaneously, [18] proposed a globally asymptotically stable observer-based controller needing only link (position and velocity) measurements and later in [19] they extended this result to link position feedback. The approaches mentioned above, rely on a Lyapunov design, that is, the principal aim is to design an observer and a controller such that, the total time derivative along the closed loop trajectories, be negative definite. A common drawback however, is the appearance of high order terms in the derivative of the Lyapunov function, and which can be dominated only for small states. In the best case, one can prove that the region of attraction can be enlarged for large control gains. As an a t t e m p t to bound the cubic terms in the time derivative of the storage function we presented in [13], as far as we know, the first smooth controller which renders the one-degree-of-freedom (dof) EL system. Our approach relies on a computed torque plus P D structure and a nonlinear dynamic extension based on the linear approximate differentiation filter. The main innovation in our controller, which allows us to give explicit lower bounds for the controller gains, in order to ensure GUAS, is the use of hyperbolic trigonometric functions in a Lyapunov function with cross terms. Global uniform asymptotic stability is ensured provided the controller and filter gains satisfy some lower bound depending on the system parameters and the reference trajectory norm. Unfortunately, the performance of our approach can be ensured only for one dof systems and nothing can be claimed for the general multivariable case. Independently, in [6] Burkov showed by using singular perturbation techniques, that a computed torque like controller plus a linear observer is capable of making a rigid joint robot track a trajectory starting from any initial conditions. The main drawback of this result is that no explicit bounds for the observer and control gains can be given. Thus, the author proves in an elegant way, the existence of an output feedback tracking controller t h a t
3. A Separation Principle for a Class of Euler-Lagrange Systems
231
ensures GUAS. Later, A.A.J. Lefeber proposed in [10] an approach which consists on applying a global output feedback set point control law (for instance an EL controller) from the initial time to until some "switching time" t~, at which it is supposed t h a t the trajectories are contained in some pre-specified bounded set. At time t~ one switches to a local o u t p u t feedback tracking control law (such as any among those mentioned above). The obvious drawback of this idea is that the controller is no longer smooth, furthermore, the switching time may depend on bounds on the unmeasured variables. The results contained in I10] concern the existence of the time instant ts such that the closed loop system is GUAS. Most recently, based on the controller of [13] the authors of [28] proposed a dynamic output feedback controller for the multivariable case. The Lyapunov stability proof for the closed loop system is carried out relying on a nonlinear change of coordinates (See Eqs. 35 and 39 of that reference). This change of coordinates is not invertible, and therefore the controller the authors propose in [28] is not implementable without velocity measurements, for any intial conditions of the dynamic extension. In [5] an elegant alternative result for one-degree-of-freedom systems was reported. The controller proposed in [5] is based upon a global nonlinear change of coordinates which makes the system affine in the unmeasured velocities. This is crucial to define a very simple controller which has at most linear growth in the state variables, as a matter of fact the proposed controller is of a P D + type. This must be contrasted with the exponential growth of the control law proposed in [13], due to the use of hyperbolic trigonometric functions. Hence, from a practical point of view, the controller of [5] supersedes by far that of [13]. The work of [5] suggests that more attention should be payed to the modelling stage of the control design. As far as we know, the position tracking control problem stated at the beginning of this section for any initial conditions and for n-degrees-offreedom EL systems still remains openr In this chapter we will present a solution to this problem, for a class of n-degrees of freedom EL systems (including robot manipulators). The systems belonging to this class, allow a factorisation which does not exhibit the Coriolis effects in the dynamic model. Inspired by [5, 9, 24], we consider a kinematics model which in other words, provides a global change of coordinates. As it will become clear later, the model considered here covers a fairly wide class of EL systems, however, in general, it is very difficult to find such factorisation. Our main result is to prove that, for this class of EL systems, it is possible to design a state observer and a state feedback controller independently of each other. T h a t is, we will establish a separation principle for a class of EL systems. Our results are an extension of [14] to the tracking problem, i.e., to the time-varying case. In a more general context, some work on separation principles for nonlinear systems has been done recently for local stabilisation of input-output linearizable systems [1] and for the case of
232
3. A Separation Principle for a Class of Euler-Lagrange Systems
nonaffine systems in [12]. Our results differ from those of in the latter references in that, neither high gains nor bounded feedbacks are required. Moreover, we consider here time-varying systems. This chapter is organised as follows. In next section we present the model we consider here. In Section 3.1 we construct a state estimator and prove global exponential stability in closed loop with the plant dynamics and kinematics. In Section 3.2 we construct a state feedback controller and prove global exponential stability. In Section 3.3 we establish our separation principle, i.e., we prove that, if the state-feedback control law is implemented using the state estimates, the overall closed loop system is uniformly globally asymptotically stable (UGAS). Finally, in Section 4 we discuss our results, when applied to robot manipulators. N o t a t i o n s . In this chapter, we use 1[-[I to denote the Euclidean norm of vectors and induced norm of matrices. The symbols km and kM are used for lower and upper bounds on [[K[[. The symbols := and =: mean "equal by definition". A continuous function/3 : N_>0 --~ 11~>0 is said to be of class /~ (/3 E )U), if j3(s) is strictly increasing and/3(0) = 0
2
Model and Problem Formulation
We consider in this chapter, fully actuated Euler-Lagrange systems with generalised coordinates q E N ~, and control inputs u C 11('~, i.e., --
dt
Oq
--
u
(3.1)
where the Lagrangian s ~)) := T(q, O) - ~;(q). It is assumed that the kinetic energy function is of the quadratic form,
:r(r O) = 10TD(q)q where the inertia matrix D(q) C N nx'~ is positive definite and uniformly bounded. The potential energy function, ~;(q), is assumed to be uniformly bounded from below, i.e., we assume that there exists a real number c, such that l;(q) > c for all q E ll~'~. As it is well known, using the Christoffel symbols of the first kind [26, 24], the system (3.1) can be rewritten in the form
D(q)~j + C(q, dt)dl + g(q) = u
(3.2)
where, in our notation, the matrix C(q, O) contains the terms corresponding to centrifugal and Coriolis effects, and the vector g(q) :-- ~ Oq As it is discussed in [13], a common drawback of o u t p u t feedback controllers relying on Lyapunov design, is that certain 3rd order terms t h a t
3. A Separation Principle for a Class of Euler-Lagrange Systems
233
appear in the Lyapunov function derivative, cannot be dominated. These high order terms arise since the Coriolis and centrifugal forces vector in (3.2), has a quadratic growth in the generalised velocities, which are not measured. The global change of coordinates introduced in [5] for one degree-offreedom systems overcomes this problem by rewriting the dynamics with functions which are linear in the unmeasured velocities. A physical meaning for this "change of coordinates", which makes best sense when considering mechanical systems, is that this can be regarded as a kinematic model. The result of [9] for output feedback control of boats in slow motion tasks, combined with the underlying ideas in [5] suggest that, if we could rewrite the model (3.2) in a way which exhibited these kinematic relations and a dynamic model, linear in the unmeasured states, the problem of o u t p u t feedback tracking should be considerably simplified. For setpoint control, a first step has been undertaken in [14] where a separation principle for dynamic positioning of ships was already proven. Thus, inspired by the results of [9, 5] and motivated by those in [14], in this chapter, we extend the latter to the tracking problem. For this, we will assume that there exists a function J : R n --~ It('~xn, with the following properties P 1 J(q) is invertible for all q E IR'~ and satisfies 0 < kj,~ _< [[J(q)[[ <_ kjM <( CX3.
P 2 dJ(q) dt =: j ( q ' ~)) is globally Lipschitz in ~, uniformly in q; i.e., there exists k~ > 0 such that,
[]J(q,x) - J(q,y)l I < k~[[x- y[[
Vq C IRn.
(3.3)
If a function J(q) with the properties above is available, we can rewrite the model (3.2) as [j-TMj-i]
(q)• + [--J-T M J - 1 J J - 1 ] (q,(t)(t + J - T (q)v(q) = J - T (q)T (3.4)
by defining [J-TMJ-1] (q) =: D(q), with M being positive definite and constant, [ - J - T M J - 1 J J - 1 ]
(q,~) =: C(q,O), J-m(q)v(q) =: g(q)and
J-X(q)T :----u. We will consider in this chapter, EL systems for which a J(q), satisfying P 1 and P 2 , exists. Our motivation to consider EL systems which can be modelled by (3.4) is that these equations are equivalent to
Mi, + v(q) = T
(3.5)
~t = J(q}u.
(3.6)
234
3. A Separation Principle for a Class of Euler-Lagrange Systems
where the dynamics (3.5) of the system have been expressed separately from the kinematic model (3.6). Notice that under the assumption that the inertia matrix D(q) is positive definite, there exists a uniformly bounded function A : ]R'~ --, ~n• such that A(q)TA(q) > 0 for all q 9 IR'L While this property is true in general for positive definite matrices, it is usually very hard to find such factorisation for n degrees of freedom systems. In [8, 9, 14] the model (3.5,3.6) represents the dynamics an kinematics of a surface vessel, for small motion applications. In [5], the author proposed a "change of coordinates" similar to (3.6) for one-degree-of-freedom systems. The control problem we solve in this chapter is the following. D e f i n i t i o n 3.1 ( G l o b a l o u t p u t f e e d b a c k t r a c k i n g ) Let qd : I~>_0 ~n be twice continuously differentiable and assume there exists /3d > 0 such that, max{llqd(t)ll, IIqd(t)l]} < /3d, uniformly in t. Assume that only q is available for measurement. Under these conditions, .find a dynamic controller T( t, q, ~), ~ ----r ~, q), such that, for any initial conditions (to, q(to), O(to), ~(to)) 9 IR>0 x ~n x R n x ~m, the system (3.~) in closed loop with 7(t, q, ~), be uniformly globally asymptotically stable (UGAS).
3
A Cascades Approach to a Separation Principle
Our main result in this chapter is a separation principle for EL systems under the assumptions made in the previous section. Our control design relies on defining an observer and a control law, with the aim at having a cascaded closed loop system, i.e., we seek for an error dynamics of the form :
r2
:
Xl = fl(t, Xl) q-g(t,X)X2
= Y2(t, X2)
(3.7) (3.S)
w h e r e x 1 9 ]I~2n, X 2 9 lt~2n, t h e f u n c t i o n s f l ( t , Xl), f l ( t , Xl), a n d g ( t , x ) are continuously differentiable and, both subsystems, E2 and
}-]0 : Xl = f l ( t ,
Xl)
(3.9)
are UGAS. Our motivation for considering this class of systems, is that the sufficient conditions for UGAS of cascades, are often easier to verify than to find a Lyapunov function for the system (El, E2), with a negative definite time derivative. In particular, in this chapter we will use Theorem 2 from [23]. In classic Lyapunov control design, one aims at designing a control law which yields a Lyapunov function with a negative definite derivative. In our control design for EL systems in the form (3.5,3.6), the system E2 will be the estimation error dynamics, hence, our first goal is to construct
3. A Separation Principle for a Class of Euler-Lagrange Systems
235
an exponentially convergent observer. The system E ~ will correspond to the plant in closed loop with a state feedback controller. Then, E1 will correspond to the system (3.5, 3.6), in closed loop with the output feedback controller. In other words, g(t, x)x2 will correspond to nonlinearities of the system that result from implementing the state feedback control law, using the state estimates, instead of the true values. Then, to analyse the stability of the overall system, we will invoke [23, Theorem 2]. Hence, our design is made with aim at verifying the conditions of that theorem.
3.1
Observer Design
The observer design is based on [9]. With respect to the result in the last reference, we relax the assumption that the dynamic model (3.5) is internally damped. Consider the observer
M~, + v(q) = T + MKo2(q)~ = j ( q ) i + Ko,
.
(3.10)
(3.11)
where Ko: 9 ~n• and Ko2(q) 9 I~nxn are to be defined later and we denote the estimation error q = q - 0, correspondingly for the other variables. The estimation error dynamics (3.5, 3.6), (3.10, 3.11) is
= -Ko2(q)~
(3.12)
= J(q)D - Ko, q.
(3.13)
P r o p o s i t i o n 3.1 ( E x p o n e n t i a l l y c o n v e r g e n t o b s e r v e r ) Let P1, P2 be positive definite, Ko: be such that P1Ko: + K ~ P1 is positive definite, and let Ko~(q) := P21J(q)TP1. Then, the origin (~,~) = (0,0) of the system (3.13,3.12) is uniformly globally stable (UGS). Furthermore, assume that the trajectories q(t) and u(t), starting at (to, qo, vo) are globally uni.formly bounded, i.e., there exist c > 0 and ~ 9 3: such that II[q(t); u(t)]ll ~(llqo; Poll) + c .for all t > to > 0 and all (qo, vo) 9 ~'~• Then, the origin is UGAS. R e m a r k 3.1 The assumption on the uni.form boundedness o.f the plant trajectories is needed here to establish UGAS .for (3.12,3.13), however, this condition will be relaxed later when considering the overall closed loop system. That is, when introducing the output .feedback controller. P r o o f of P r o p o s i t i o n 3.1. Consider the control Lyapunov function candidate Vo(~, ~) = :1
(qTplq-f- ~Tp2~)
(3.14)
236
3. A Separation Principle for a Class of Euler-Lagrange Systems
where Pi E N '~x'~ and/~ e IE~x" are positive definite matrices. The time derivative of Vo(q, F,) along the trajectories of (3.13,3.12) yields
Vo(4,~') = --~ql-T (PIKol + K ~ P 1 ) q + 4 T p I J ( q ) Y ' - qTKo2(q)Tp2~ (3.15) hence, using the definition Ko2(q) = P~-ij(q)Tp1, we obtain that
vo(4, r,) = - - i1-T q (P1Ko~ + K x P1)4 .
(3.16)
Since by assumption, P1K m + K ~ P 1 is positive (semi-)definite, the time derivative Vo(4, ~) is negative semidefinite. We conclude that the origin of the system is uniformly globally stable. To prove global exponential stability we rely on the a theorem, from [15], which is repeated in the Appendix A for the sake of completeness. To t
apply Theorem 3.3, let (i := q, (2 := ~, W ( t , ( i ) := ~ 4 T p i 4 , G(t,() :=
J(~l + O(t)) = J(q), P := Ps and h(t, (1) := -Ko~ 4. With these definitions, it is clear that the system (3.13,3.12) is of the form (3.51,3.52). Hence, we simply have to verify that conditions A1 - A 2 hold. The bound (3.53) is clearly satisfied with Pi (') = rnax{ko:M ,PIM } - T h e bounds (3.54,3.56) hold due to the property P l . Also, the inequality (3.55) is satisfied since, using J(q, 0) = 0 and (3.11,3.13), we can compute J(q,q)
<<
J(q,q) - J(q,O)
+ J(q,O)
k} (ll/]ll + HJ(q)[u(t) - ~]]ll + kO,M I]411)
_< k} (llqll + kj (c + r
-o]H)llvll)+ k o ~ 11411)=: pz(ll(l[) -
Finally, it is immediate to verify that Assumption A 2 holds true with 0:1() = 0.5plrn, 0:2(') = 0.5plM, and , = A~(P1Ko~ + K ~ P ~ ) . We conclude that the origin (4, P) = (0, 0) of (3.13,3.12) is globally uniformly asymptotically stable and uniformly IocaIly exponentiaIly stable.
3.2
State Feedback Controller
Having designed an exponentially converging state observer, we proceed now to derive a state feedback controller for the system (3.5, 3.6). In the next section we will prove that this state feedback controller can be implemented using the state estimates and the observer above, leading to uniform global asymptotic stability of the closed loop. As in [25], let us define the virtual error s := q + Kp~, where ~ := q - qd, and Kp is positive definite. From (3.6), we also have that
= J ( q ) - - 4d + Kp4.
(3.i7)
3. A Separation Principle for a Class of Euler-Lagrange Systems
237
The control design consists on finding r(t, q, q) such that the closed loop system have the form
=
--Kds--O
(3.18)
q =
-KpO+s.
(3.19)
L
The obvious reason we aim at such closed loop system, is that (3.18, 3.19) is globally exponentially stable for any positive definite matrices Kd and Kp. This can be verified, using the Lyapunov function candidate
vc(t, 0, s) = 1
(110112+ ilsl12)
(3.20)
which is positive definite and radially unbounded in the state (0, s). Thus, to derive 7(t, q, 0), we evaluate the time derivative on both sides of (3.17), using (3.5), to obtain
= J(q)M -1 [7 - v(q)] + J(q, q)~
--
qd
"~-Kpq,
(3.21)
hence, substituting the right hand side of (3.18) in (3.21) we have that the latter is satisfied if we apply the control law 7 = T*, with
T* =v(q) + M J ( q ) - l [--O+~d-- Kpq-- J(q,(t)v-- gds] .
(3.22)
This control law can be implemented under the assumptions that velocity and position measurements are available, J(q) is invertible, and using (3.6). We conclude that the system (3.5, 3.6) in closed loop with (3.22) is globally exponentially stable for any positive definite matrices Kp and /(d.
3.3 A Separation Principle We show in this section that, the control law (3.22) can be implemented using the state estimates provided by the observer of Section 3.1 and UGAS can still be achieved. This is our main result. Consider the controller
v(q) + MJ(q) -1 [ - 0 + qd -- Kpq - J(q, J(q)i)~ - Kdg](3.23) q
:=
q--qd:=J(q)5--qd
(3.24)
g
:=
( J ( q ) 5 - (td + Kp~).
(3.25)
We are now ready to state our main result. T h e o r e m 3.1 ( S e p a r a t i o n p r i n c i p l e ) The system (3.5,3.6) in closed loop with the observer (3.10,3.11) and the controller (3.23)-(3.25), is UGAS.for any positive definite matrices Kp, Kd, P1, P2 and Ko~ such that PIKol + K ~ P1 is positive definite, and Ko2(q) := P21j(q)P1.
238
3. A Separation Principle for a Class of Euler-Lagrange Systems
R e m a r k 3.2 In order to prove Theorem 3.1 we will invoke [23, Theorem 2] (see Theorem 3.2) which is .formulated for cascades of UGAS systems. It can be proven along the same lines as in [23], that, under the conditions of that paper, the cascade of a subsystem ~1 being GES, with a subsystem E2, being UGAS and ULES, yields a UGAS and ULES cascade. Therefore, even though we do not state it explicitly in Theorem 3.1, the closed loop system is UGAS and ULES. P r o o f o f T h e o r e m 3.1. Comparing (3.23) to (3.22), we have that "~ =
gl
:~_
~-* + g l ( t , s , ~ , ~ , ~ )
(3.26)
M J ( q ) -1 [(Kp + Kd)J(q)~ 4- J(q, g t ) u - J(q,~t)u + J(q, q)P] (3.27)
q
:=
J(q)fJ
(3.28)
and we remark that we have abbreviated ~ + qd = q and j ( q ) - I [s - Kp~ + qd] ----u for simplicity in the notation. Thus, the closed loop system (3.5,3.6); with 7 = ~, (3.10, 3.11) and (3.23) yields the system
--
--Kds -- ~
+ g(t, s, ~, ~, q)
(t + J(q(t))F, -Ko2(q(t)) (t 1
(3.29)
(3.30)
where the interconnection term
g(t,s,~,~,~) :=
[ gl(t,s,~,~,Kt 0 ) I.
In order to study the stability of (3.29-3.31), we present below a theorem which follows from the proofs of [23, Theorems 1 and 2] for nonautonomous cascades. T h e o r e m 3.2 Assume that the system (3.9) is UGAS and that the trajectories of (3.8) are globally bounded, uniformly in the initial states and the initial time ( xo, to). If moreover, Assumptions A3 - A5 below are satisfied, then the solutions x(t, to, Xo) of the system (3. 7,3.8) are globally uniformly bounded. If.furthermore, the system (3.8) is UGAS, then so is the origin of the cascade (3.7,3.8). A3 There exists a Lyapunov.function V ( t, xl) .for (3.9) such that V : I~>_o• I~n --~ 1~>_o is positive de.finite (that is V(t,0) ---- 0 and V(t, xl) > 0 for all xl ~ O) and radially unbounded, which satisfies 0_~____Y i]Xlll _~ c t g ( t , xi) UXl
VllXl[I _ 7/
(3.32)
3. A Separation Principle for a Class of Euler-Lagrange Systems
239
where q , r] > O. We also assume that o ~ (t, xl) is bounded uniformly in t .for all Ilxlll <_~?, that is, there exists a constant c2 > 0 such that for all t > to > 0 OV
-< c2
Vllxlll -< v
(3.33)
A 4 There exist two continuous .functions 01, 02 : ]I{>O ~ ]I~>_O, such that g(t, x) satisfies
ltg( t, X) I! <_ 01(119J211)q-02(t12:2tl)11Xlll
(3.34)
A 5 There exists a class 1~ function r such that, .for all to > O, the trajectories of the system (3.8) satisfy
jT
l[x2(t,to,x2(to))lldt < r
(3.35)
In order to apply Theorem 3.2, let xl := col(~, s) and, x2 := col(q, ~). Notice that, g(t, ~, 0, 0) ~ 0 and, the zero input dynamics of (3.29),
-Kas-
~
'
corresponds to the closed loop system with the state feedback control law r*, which was designed to be GES for any positive definite Kp and Kd. Also, it is clear that the Lyapunov function (3.20) satisfies A3 above. We show now, that g(t, ~, s, q, r,) as defined in (3.31,3.27), satisfies Assumption A 4 of Theorem 3.2. For this, consider the following bounds: II(Kp + gd)J(q)>l[ [0i(q, 0 ) - J(q, ~)] z~
<_ (kpM + kdM)kjM[lb'll [ j ( q , J ( q ) u ) - J(q,J(q)s
=
< kSilJ(q)F, llll~'ll < kSkJM (llsll + kvMl[0l[ +/3d) I]~ll --
J(q, g(q)i)~
~jm
<_ ( J(q, g(q)i) - J(q,0) ) I1~11
<_ k'jkjMl["-- [llloII
_<
(k-m(,,sll + kpMl'0"+ ' a +
where we have used properties P 1 and P 2 and J(q, 0) ~- 0. It follows that
240
3. A Separation Principle for a Class of Euler-Lagrange Systems
g( t, ~, s, q, ~ ) satisfies (3.34) with
01(114; Pl])
"--
O2(llq; 11)
.-
mM [ I kjm (k'pM+ kdM)kjM + 2kjkjM~d]~jm J -]- mMk;kjMkjm H~][2 2]~;~jMmM kym
(kpu + 1)11 11.
Therefore, the trajectories (~(t), s(t), q(t), F(t)) of the overall closed loop system (3.5,3.6), (3.10,3.11), T = ~ and (3.23), are uniformly bounded for any initial conditions. Since qd(t) and Oa(t) are uniformly bounded, so are 0(t) and q(t) for all initial conditions. Finally, from (3.6) and P r o p e r t y P 1 it follows that ~(t) is also uniformly bounded for all initial conditions. At this point, we can invoke Proposition 3.1 to conclude that the origin (4, ~) = (0, 0) of the time-varying system
= -Ko2(q(t))q = Z(q(t))~ - Kol~ is uniformly globally asymptotically stable. Moreover, this property is also uniform in the initial conditions of the trajectories q(t, to, qo) and y(t, to, v,) since these are globally uniformly bounded. From these properties, it is clear that there exist r E K: such that the trajectories x2(t) = [q(t); P(t)] satisfy
(3.35). We conclude that the cascaded system (3.29,3.30) is UGAS.
4
9
Application to Robot Manipulators
Our main result applies to EL systems (3.1) with quadratic kinetic energy function, and potential energy bounded from below. However, the control implementation involves the knowledge of a matrix J(q) such that the inertia matrix can be written as D(q) -- J ( q ) - T M J ( q ) - l . While for systems, such as marine vessels, the Jacobian J(q) is a simple orthogonal rotation matrix, in general, this factorisation is very hard to find (such is the case of robot manipulators). In this section we give the insight on an alternative result for robot manipulators. We start with the expression of the kinetic energy of a robot manipulator, assuming that each articulation is actuated thru a rigid transmission. Following the notation of [24], let pi denote the position of the centre of mass of the i-th link, expressed in the Cartesian coordinates of the base frame hence, pi c N a. Let, 1hi denote the linear velocities of the centre of mass and, wi denote the angular velocity of the i-th link, expressed in the base frame, due to the rotations (if any) of the i-th link. Define mi as the mass of the i-th link and I / the inertia tensor of the i-th link, expressed
3. A Separation Principle for a Class of Euler-Lagrange Systems
241
in an inertial reference frame, with origin at the centre of mass of the i-th link. Define also, the position and orientation Jacobians for the i-th link as the maps, J ) : N~ --~ N axn and J~ : ~ --~ N3x~ from the space of linear and angular velocities into generalised velocities, i.e., let
~i = J~p(q)il wi = J~)(q)gt.
(3.36) (3.37)
W i t h all these definitions, and neglecting the inertia contributions of the actuators, we have that the kinetic energy of the manipulator is given by
r(0' q) = 21 ~
(rni0Tj~(q)T
jig(q)O+oTj~(q)TRi(q)i~Rm(q)j~(q)o )
(3.38) where Ri(q) is the rotation matrix expressing the orientation of the i-th inertial frame, in the base frame. Using, (3.36,3.37), the expression (3.38) is equivalent to
:r(p,
i T -i
=
i "~
i=1
where ~ = Ri(q)Tw~ is the angular velocity of the i-th link, expressed in the coordinates of its inertial frame. At this point, let us define the vector v := col[;S1,... ,~5,~, w~, ... , w~] and the matrix J ( q ) E ]~6nx,~ as
J (q)
fl(q)
J~(q)
R~(q)J~(q)
:=
(3.40)
RTn(q)J~)(q) hence we have that :=
,7(q)O.
(3.41)
Observe that, since there are n generalised (independent) coordinates, the column rank of J(q) is n. Therefore, we can also write O :=
[j(q)Tj(q)]-lj(q)Su.
(3.42)
Using all these definitions, the kinetic energy function can be rewritten (with an obvious abuse of notation) as T ( v ) = vTA4u
(3.43)
242
3. A Separation Principle for a Class of Euler-Lagrange Systems
where A/t := blockdiag{mlI, ... , rnnI, I~, ... , I~} where I E IR3x3 is the identity matrix and we recall that I~ is the constant inertia tensor of the i-th link, i.e., referred to the inertial frame. Concerning the potential energy of the manipulator, this is given by L/(p) := ~
rnigTop~
(3.44)
i~1
where go E N3 is the vector of gravity acceleration. Therefore, using (3.43,3.44) we can derive the dynamic model of the manipulator, A/tu + v = 7-
(3.45)
where the constant vector v:--[
0L/(P) T Op , O, ... , On
IT
and which is subject to the holonomic constraints imposed by the kinematics equation (3.41). Two observations are important at this point. Firstly notice that the manipulator model (3.45,3.41) is of the form (3.5,3.6) except of course, for if(q) which is not square. Moreover, it can be shown that J(q) satisfies P 1 and P 2 . Indeed, P 1 holds since if(q) is full column rank. This is equivalent to the assumption that the inertia matrix is positive definite and its induced norm is bounded from above. Secondly, differentiating (3.41) with respect to time and substituting for ~, in (3.45), we obtain that
[JTJ~j](q)~+[JTA42](q,~1)o+jT(q)v=JT(q)T.
(3.46)
The inertia matrix D(q) := y(q)TA/tff(q) is positive definite i.f and only i.f the "Jacobian" J ( q ) is of full column rank and A/[ is positive definite. Also, it is easy to see that, defining the Coriolis and centrifugal matrix C(q, il) := [ j T A 4 f f ] (q, 0), the matrix 0.5D(q, q ) - C ( q ,
(1) is skew-symmetric. Finally,
the gravity forces vector
g(q) .-- OU(p(q)) _ j ( q ) T v " Oq We remark that this does not mean that 0p(q) Oq = j ( q ) . Comparing the systems (3.45,3.41) to (3.5,3.6) and from Theorem 3.1,
3. A Separation Principle for a Class of Euler-Lagrange Systems
243
one could conjecture that the controller
v § .hd,J'(q) [-(t + qd-- K p q - J(q,J(q)P)P- Kd~] (3.47) q
j(q)
:=
J(q)D-(~d
(3.48)
:=
(J(q)P - qd + Kp~)
(3.49)
:=
j(q)t := [j(q)T j ( q ) ] - I j(q)T
(3.50)
together with the observer (3.10,3.11), renders the robot manipulator system (3.46), UGAS. However, it shall be remarked that the observer system was designed under the implicit assumption that the coordinates ~ are linearly independent. In the case of the robot manipulator, ~ c ]l~6n and these variables are subject to 5n holonomic constraints. The physical meaning of these constraints is clear if we recall that a manipulator consists of an open kinematic chain of n rigid linked bodies. Therefore, strictly speaking, we cannot talk about UGAS, more specifically, of global (uniform) attractivity of the origin (~, P, ~, s) = (0, O, O, 0). Yet, based on the results of the previous section, we conjecture the following.
Consider the closed loop system (3.46), (3.10,3.11), (3.473.50). Under the conditions of Theorem 3.1, the origin (~, ~, ~, s) = (0, 0, 0, 0) is UGS and the signals ~(t), ~(t) := J(q)P(t), (t(t) and q(t) converge uni.formly to zero, .for any initial conditions. C o n j e c t u r e 3.1
5
Conclusions
We have addressed in this chapter, the problem of output feedback trajectory tracking of Euler Lagrange systems. For a subclass of these systems (including manipulators), characterised by certain factorisation, we have provided a separation principle. Our main result establishes that, if a globally exponentially stabilising state feedback controller can be implemented using the state estimates provided by a globally exponentially convergent observer; the overall closed loop system, remains uniformly globally asymptotically stable. Even though our results cannot be directly applied to robot manipulators, for these systems, we have conjectured the weaker property of UGS plus global uniform convergence of part of the state variables. These include the position and velocity tracking errors. The proof of the latter is currently under investigation.
Acknowledgements This work was realised while the authors were with the Dept. of ECE, University of California at Santa Barbara, USA, under grant NSF-9812346.
244
3. A Separation Principle for a Class of Euler-Lagrange Systems
The second author is on leave from the IMPE, Academy of Sciences of Russia, St. Petersburg, Russia. 6
REFERENCES [1] A.N. Atassi and H. Khalil. A separation principle for the stabilization of a class of nonlinear systems. In Proc. ~th. European Contr. Conf., Bruxelles, BE, 1997. CD-ROM Paper no. 952. [2] P. B61anger. Estimation of angular velocity and acceleration from shaft encoder measurements. In Proc. IEEE Conf. Robotics Automat., volume 1, pages 585-592, Nice, France, 1992. [3] H. Berghuis and H. Nijmeijer. A passivity approach to controllerobserver design for robots. IEEE Trans. on Robotics Automat., 96:740-755, 1993. [4] H. Berghuis, H. Nijmeijer, and P. L~hnberg. Observer design in the tracking control problem of robots. In Proc. IFAC NOLCOS, pages 588-593, Bordeaux, France, 1992. [5] G. Besan~on. Simple global output feedback tracking control for onedegree-of-freedom Euler-Lagrange systems. In IFA C Conf. on Systems Structure and Control, pp. 267-271, 1998.
[6] I. V. Burkov. Mechanical system stabilization via differential observer. In IFA C Conference on System Structure and Control, pages 532-535, Nantes, France, 1995. [7] C. Canudas de Wit, K. Astrom, and N. Fixot. Computed torque control via a nonlinear observer. Int. J. Adapt. Contr. Sign. Process., 4(3):443-452, 1990. [8] T. I. Fossen. Guidance and control o.f ocean vehicles. John Wiley Sons Ltd., 1994. [9] T. I. Fossen and A . Grcvlen. Nonlinear output feedback control of dynamically positioned ships using vectorial observer backstepping. IEEE Trans. Contr. Syst. Technol., 6(1):121-128, 1998. [10] A. A. J. Lefeber. (Adaptive) Control of chaotic and robot systems via bounded feedback control. Master's thesis, University of Twente, The Netherlands, 1996. Available from the author upon request to A. A. J. Lefeber~math. utwente, nl . [Ii] S. Lim, D. Dawson, J. Hu, and M. S. de Queiroz. A n adaptive link
position tracking controller for rigid link, flexible joint robots without velocity measurements. In Proc. 33rd. IEEE Conf. Decision Contr., pages 351-357, Orlando, FL., 1994.
3. A Separation Principle for a Class of Euler-Lagrange Systems
245
[12] W. Lin. Bounded smooth state feedback and a global separation principle for non-affine nonlinear systems. Syst. Contr. Letters, 26:41-53, 1995. [13] A. Loria. Global tracking control of one degree of freedom EulerLagrange systems without velocity measurements. European J. of Contr., 2(2), 1996. See also: Proc. 13th IFAC World Congress 1996, vol. E, pp. 419-424. [14] A. Loria, T. I. Fossen, and E. Panteley. Global output-feedback dynamic positioning of ships: A cascaded approach. Technical report, Norwegian University of Science and Technology, November 1997. [15] A. Loria, T I. Fossen, and A. Teel. UGAS and ULES of nonautonomous systems: Applications to integral control of ships and manipulators. In Proc. 5th. European Contr. Conf., Karlsruhe, Germany, 1999. Submitted for presentation. [16] A. Loria and R. Ortega. On tracking control of rigid and flexible joints robots. Appl. Math. and Comp. Sci., special issue on Mathematical Methods in Robotics, 5(2):101-113, 1995. eds. K. Tchon and A. Gosiewsky. [17] S. Nicosia and P. Tomei. Robot control by using only joint position measurement. IEEE Trans. on Automat. Contr., 35-9:1058-1061, 1990. [18] S. Nicosia and P. Tomei. On the control of flexible joint robots by dynamic output feedback. In Proc. 4th. Symp. Robot Contr., pages 717-722, Capri, Italy, 1994. [19] S. Nicosia and P. Tomei. A tracking controller for flexible joint robots using only link position feedback. IEEE Trans. on Automat. Contr., AC-40(5):885-890, 1995. [20] S. Nicosia and A. Tornamb~. High-gain observers in the state and parameter estimation of robots having elastic joints. Syst. Contr. Letters, 13:331-337, 1989. [21] S. Nicosia, A. Tornamb~, and P. Valigi. Experimental results in estimation of industrial robots. In Proc. 29th. IEEE Conf. Decision Contr., pages 360-365, Honolulu, HI, 1990. [22] R. Ortega and M. Spong. Adaptive motion control of rigid robots: A tutorial. Automatica, 25-6:877-888, 1989. [23] E. Panteley and A. Loria. On global uniform asymptotic stability of non linear time-varying non autonomous systems in cascade. Syst. Contr. Letters, 33(2):131-138, 1998.
246
3. A Separation Principle for a Class of Euler-Lagrange Systems
[24] L.
Sciavicco and B. Siciliano. Modeling and control of robot manipulators. McGraw Hill, New York, 1996.
[25]
J. J. Slotine and W. Li. Adaptive manipulator control: a case study. I E E E Trans. on Automat. Contr., AC-33:995-1003, 1988.
[26] M.
Spong and M. Vidyasagar. Robot Dynamics and Control. John Wiley & Sons, New York, 1989.
[27]
J. T. Wen and S. D. Bayard. New class of control laws for robot manipulators, Parts I and II. Int. J. of Contr., 47-5:1288-1310, 1988.
[28] F.
Zhang, D. M. Dawson, M. S. de Queiroz, and W. Dixon. Global adaptive output feedback tracking control of robot manipulators. In Proc. 36th. I E E E Conf. Decision Contr., pages 3634-3639, San Diego, USA, 1997.
Appendix A: A Theorem on UGAS for Strictly Passive Systems T h e o r e m 3.3 [15] Consider the system
41 = 42
=
h(t, ~1) -~- a(t, x)~ 2 ( OW(t,~l) ~ T --PG(t,~)T
~
]
(3.51) ,
(3.52)
where ~x C IR"~, ~2 E N m, P = p r > 0 and W :1~'~ x N>_o ~ N>_o is a C 1 .function satisfying certain properties (see A1). If Assumptions A1 and A2 below hold, then the origin of the system (3.51,3.52) is UGAS. A1 There exist continuous nondeereasing .functions flj : ]I~>_O~ 11~>0, 0 -=1, 2, 3) such that, for all t > O, x 6 N n+m max
{
]lh(t,~l)l[,
OW(t,~l) ~1
}
-< Pl(llSlll)llStll
(3.53)
IIG(t,~)ll
~
p~([]~ll)
(3.54)
IlG(t,~)ll
-< p3(II~I]).
(3.55)
Furthermore, .for each compact set K C N n+m there exist b m > 0 such that
c(t, ~)Tc(t, ~) >_b2mI .for all (t, ~) C K x R>_o.
(3.56)
3. A Separation Principle for a Class of Euler-Lagrange Systems
247
A 2 There exist class-]Coo .functions (~1 and c~2 and # > 0 such that ~1(I](1[]) ~ W(t,~x) < ~2([l~x][) cgW(t,~l) O W ( t ' ~ l ) h ( t , ~ l ) < --#N~ll] 2 Ot + 0~ -"
(3.57) (3.58)
Moreover, i.f a2(s) o( s 2 then the origin is uniformly locally exponentially stable (ULES); i.e., there exist "Yl,'Y2 and r > 0 such that ]l{oll -< r ~
]]~(t,to,{o)[I ~ 71]1~o[]e-'y~(t-t~ ,
(3.59)
where ~(t,to,~o) is the solution of (3.51,3.52) starting at (to,~o). R e m a r k 3.3 It is important to mention that the inequality (3.55) used here, differs .from the original condition f max
OG(t,~)
l ---5V- '
OG(t,~)
}
_<
pa(ll ll), i=1,2
(3.60)
with p(.) continuous nondecreasing, used in [15]. However, the condition (3.60) is used in that re.ference, in the proo.f of Theorem 3.3, to imply (3.55). We have modi.fied this condition here .for the purposes of this chapter.
H i g h - G a i n Observers in Nonlinear Feedback Control Hassan K. Khalil Department of Electrical and Computer Engineering Michigan State University East Lansing, MI 48824-1226, USA
1
Introduction
The use of high-gain observers has evolved as an important technique for the design of output feedback control of nonlinear systems. The basic ingredients of this technique are (1) a high-gain observer that robustly estimates the derivatives of the output; (2) a globally bounded state feedback control, usually obtained by saturating a continuous state feedback function outside a compact region of interest, that meets the design objectives. The global boundedness of the control protects the state of the plant from peaking when the high-gain observer estimates are used instead of the true states. The technique was first introduced by Esfandiari and Khalil [14] and since then has been used in about forty papers, about two thirds of which represent the work of Khalil and coworkers while the rest represents the work of about twenty nonlinear control researchers. This chapter is intended as a tutorial/survey paper on the use of highgain observers in nonlinear control. In Section 2, we use a second-order example to illustrate the main ideas of the technique. In Section 3, we review some recent results by Attasi and Khalil which give a fairly general separation principle for a class of nonlinear systems. These separation results can be used to derive most of the results reported in the following sections. There are a few exceptions where the results cannot be derived using the separation approach, like the adaptive control results of Section 6 in the lack of full persistence of excitation and the sliding mode control results of Section 7. Sections 4 through 10 survey work on stabilization, nonlinear servomechanisms, adaptive control, sliding mode control, robustness to fast unmodeled dynamics, discrete-time implementation, and application to induction motors.
250
4. High-Gain Observers in Nonlinear Feedback Control
2 Motivating Example Consider the second-order nonlinear system :~1
=
~2
=
X2
r
y
=
Xi
(4.1)
where x = [xl, X2] T. Suppose u = 7(x) is a state feedback control t h a t stabilizes the origin x = 0 of the closed-loop system xl ~2
= =
x2 r
(4.2)
To implement this feedback control using only measurements of the o u t p u t y, we use the observer Zl. x2
= =
52+hi(y-51) r u) + h 2 ( y -321)
where r u) is a nominal model of the nonlinear function r estimation error 2 =
22
=
(4.3)
u). T h e
ix1
x2 - 52
satisfies the equation L
xl L x2
= =
-h121 +22 -h25h + 5 ( x , 2 )
(4.4)
where 5(x, 2) -- r 7(5)) - r 7(5)). As in any a s y m p t o t i c observer, we want to design the observer gain H = [hi, h2] T to achieve a s y m p t o t i c error convergence, that is, l i m t ~ 2(t) = 0. In the absence of the disturbance term 5(x,2), asymptotic error convergence is achieved by designing the observer gain such that the matrix Ao =
I
-h1 -h2
1 ] 0
is Hurwitz; t h a t is, its eigenvalues have negative real parts. For this secondorder system, Ao is Hurwitz for any positive constants hi and h2. In the presence of 5, we need to design the observer gain with the additional goal of rejecting the effect of the disturbance t e r m 5 on the estimation error 2. This is ideally achieved, for any disturbance t e r m 5, if the transfer function
111]
Ho(s) = s2 + his + h2
s + hi
4. High-Gain Observers in Nonlinear Feedback Control
251
from 5 to :? is identically zero. While this is not possible, we can design the observer gain such that the transfer function Ho is arbitrarily close to zero. By calculating the H ~ norm of Ho(s) it can be seen that the norm can be made arbitrarily small by choosing h2 >> hi >> 1. In particular, taking hi = c~.__~'lc h2 = ~22
(4.5)
for some positive constants ~1, c~2, and e, with e << 1, it can be shown that
Hence, l i m ~ 0 Ho(s) = 0. The disturbance rejection property of the highgain-observer design (4.5) can be also seen in the time domain by representing the error equation (4.4) in the singularly p e r t u r b e d form. Towards that end, define the scaled estimation errors 771 =
:71 e
?72 = :?2
~,
(4.6)
The newly defined variables satisfy the singularly p e r t u r b e d equation s
=
ci72 =
--a1~]1
+ ~2
(4.7)
-a2~11 + eS(x, :?)
This equation shows clearly that reducing e diminishes the effect of the disturbance term 5. It shows also that, for small e, the dynamics of the estimation error will be much faster than the dynamics of x. Notice, however, that the change of variables (4.6) may cause the initial condition ~71(0) to be of order O(1/e) even when :?1(0) is of order O(1). With this initial condition, the solution of (4.7) will contain a term of the form (1/e)e -at/~ for some a > 0. While this exponential mode decays rapidly, it exhibits an impulsive-like behavior where the transient peaks to O(1/e) values before it decays rapidly towards zero. In fact, the function (1/e)e -at/~ approaches an impulse function as e tends to zero. This behavior is known as the peaking phenomenon. It is important to realize that the peaking phenomenon is not a consequence of using the change of variables (4.6) to represent the error dynamics in the singularly perturbed form. It is an intrinsic feature of any high-gain-observer design that rejects the effect of the disturbance term 5 in (4.4); that is, any design with h2 >> hi >> 1. This point can be seen by calculating the transition matrix e A~ and noting t h a t the (2,1) element is given by -2h2 when 4h2 > h~, and
:h:4h2{exP[
sin(t )
exP[(hi
252
4. High-Gain Observers in Nonlinear Feedback Control
when 4h2 < h~. The magnitude of the coefficient of the exponential mode is greater than ~ in the first case and h2/hl in the second one. Thus, as we increase hi and h2/hl, we drive this coefficient toward infinity. The peaking phenomenon was studied in the context of high-gain state feedback control of linear [39, 15, 29] and nonlinear [31, 52] systems. Its investigation for high-gain observers appeared as early as 1979 in the work of Polotski [45]. The impact of observer peaking on closed-loop stability was studied by Esfandiari and Khalil [14]. Examples 3 and 4 of [14] illustrate the peaking phenomenon and show how it could destabilize the closed-loop system, as the impulsive-like behavior is transmitted from the observer to the plant. A key contribution of [14] is the following observation. If the state feedback control 7(x) is a globally bounded function of x, it will provide a buffer that protects the plant from peaking. In particular, during the peaking period, the control ~y(5) saturates. Since the peaking period shrinks to zero as e tends to zero, for sufficiently small e the peaking period becomes so small that the state of the plant x remains close to its initial value. After the peaking period, the estimation error becomes of order O(e) and the feedback control 7(5) becomes O(e) close to 7(x). Consequently, the trajectories of the closed-loop system under o u t p u t feedback asymptotically approach its trajectories under state feedback as e tends to zero. This leads to recovery of the performance achieved under state feedback. The global boundedness of ~,(x) can be always achieved by saturating the state feedback control, or the state estimates, outside a compact region of interest. The analysis of the closed-loop system under output feedback proceeds as follows. The system is represented in the singularly perturbed form Xl
:
52
:
X2
{~1
=
- - a l ~ l + 72
e~2
=
-a2~l+eS(x,5)
(4.8)
where 21 = xl -Erh and 52 = x2-r/2. The slow subsystem of (4.8), obtained by setting e -- 0, is the closed-loop system under state feedback (4.2). The fast subsystem is er~ = [ - a l 1] def --Oe2 0 r/ = Ao~/ Let V(x) be a Lyapunov function for the slow subsystem (4.2), which is guaranteed to exist for any stabilizing state feedback control ~,(x), and let W(TI) : ~TpoT] be a Lyapunov function for the fast subsystem, where P0 is the solution of the Lyapunov equation PoAo + A T p T = --I. Define the sets fie and E by fie = {Y(x) _< c} and E = {W(~?) _< pc2}, w h e r e c > 0 is chosen such that fie is in the interior of the region of attraction of (4.2).
4. High-Gain Observers in Nonlinear Feedback Control
. . . . . . . . . . . . . . . . . . . . .
~
253
. . . . . .
....................
X~ FIGURE 1. Illustration of fast convergence to the set fie • E. The analysis can be divided in two basic steps. In the first step we show that for sufficiently large p there is c~ > 0 such t h a t for each 0 < e < cl the origin of the closed-loop system is asymptotically stable and the set ~c x E is a positively invariant subset of the region of attraction. T h e proof makes use of the fact t h a t in ~c x E, ~ is O(e). In the second step of the analysis we show t h a t for any bounded ~(0) and any x(0) 9 ~b, where 0 < b < c, there exists e~ > 0 such t h a t for each 0 < e < e~ the t r a j e c t o r y enters the set ~c • E in finite time. The proof makes use of the fact t h a t f~b is in the interior of ~tc and 7(Yc) is globally bounded. Hence, there exits a time 2"1 > 0, independent of e, such t h a t any trajectory starting in ~-~bwill remain in ~ for all t 9 [0, T1]. Then, using the fact t h a t the fast variables ~/decay faster t h a n an exponential mode of the form ( 1 / e ) e -at~E, we can show that the trajectory enters the set [ ~ • E within the time interval [0, T(e)] where lim~-~0 T ( e ) = 0. Thus, by choosing e small enough we can ensure t h a t T(c) < T1. Figure 1 gives a sketch t h a t illustrates this behavior. The full-order observer (4.3) provides estimates (21, ~2) of the full s t a t e vector which are then used to replace (x], x2) in the feedback control law. We can use the fact that y -- xl is measured in two different ways. On one hand, we can use only 22 to replace x2 in the control law, while using the measured Xl. This approach does not change the analysis of the closed-loop system and we obtain the same results as before. On the other hand, we can use a reduced-order observer t h a t estimates only ~2. Such an observer is given by (v = Yc2 =
- h ( w + hy) + r w + hy
(4.9)
where h = c~/c for some positive constants a and e with e << 1. It is not difficult to see t h a t the high-gain reduced-order observer (4.9) exhibits the
254
4. High-Gain Observers in Nonlinear Feedback Control
peaking phenomenon, and that global boundedness of the state feedback control plays the same role as in the full-order observer case. The high-gain observer (4.3) or (4.9) is basically an a p p r o x i m a t e differentiator. This point can be easily seen in the special case when the nominal function r is chosen to be zero; for then the observer is linear. For the full-order observer (4.3) the transfer function from y to ~ is given by --+
(68) 2 + Oqt~.S + OL2
[1]
as
E --~ 0
8
and for the reduced-order observer (4.9) the transfer function from y to 22 is given by (e/a)s + 1
--* s as e--* O
Thus, on a compact frequency interval, the high-gain observer a p p r o x i m a t e s /) for sufficiently small e. Realizing t h a t the high-gain observer is basically an a p p r o x i m a t e differentiator, we can see that measurement noise and unmodeled high-frequency sensor dynamics will put a practical limit on how small e could be. As we will see later on, despite this limitation there are interesting applications where the range of permissible values of e allowed successful application of high-gain observers in experimental testing. It is useful to note that for low-frequency (slow) m e a s u r e m e n t noise, we can handle the effect of measurement noise as part of the state feedback control design. This idea was used by Khalil in [22, 24, 25]. To illustrate the idea, let us reconsider the system (4.1) with m e a s u r e m e n t noise v ( t ) where ~)(t) and ~)(t) are bounded: :~1
~
X2
22 y
= =
r Xl -bv
(4.10)
Suppose we want y to track a reference signal r. In the error coordinates el---y-r,
e2=~-
the system is represented by el e2
= =
e2 r
(4.11)
The effect of measurement noise now appears as a bounded disturbance in the state equation, and can usually be handled by robust control techniques. As for the observer design, with el as the measured o u t p u t we can use the same high-gain observer as before.
4. High-Gain Observers in Nonlinear Feedback Control
255
3 Separation Principle T h e combination of globally bounded state feedback control with high-gain observers allows for a separation approach where the state feedback control is designed first to meet the design objectives, then the high-gain observer is designed, fast enough, to recover the performance achieved under state feedback. This separation approach is used in most of the papers t h a t utilize high-gain observers 9 It is proved in a generic form in the work of Teel and Praly [53], where it is shown t h a t global stabilizability by state feedback and uniform observability imply semiglobal stabilizability by o u t p u t feedback. A more comprehensive separation principle is proved by Atassi and Khalil [10]. T h e y consider a class of multi-input-multi-output nonlinear systems of the form :~ y
= = =
Ax + Br z, u ) Cx
r
=
q(x,z)
(4.12)
where u is the control input, y and 4 are measured outputs, and x and z constitute the state vector. The r x r matrix A, the r x p matrix B, and the p x r m a t r i x C, given by 0 0 A = block d i a g [ A i , . . . ,Ap],
1 0
...... 1 ...
0 0
Ai =
0 0
B = block d i a g [ B i , . . . , Bp],
...... .........
0
1 0
~"i X r i
Bi =
r~x1
C--blockdiag[Ci,...,Cp],
C~= [ 1
0
......
0 ]ixr,
where 1 < i < p and r = rl + ... + r ~ , represent p chains of integrators. The system is assumed to satisfy appropriate regularity conditions. The goal of [10] is to design feedback control to stabilize the origin of the closed-loop system using only the measured o u t p u t s y and ~. A twostep approach is followed. First a partial state feedback control t h a t uses m e a s u r e m e n t s of x and 4 is designed to asymptotically stabilize the origin9
256
4. High-Gain Observers in Nonlinear Feedback Control
Then a high-gain observer is used to estimate x from y. The state feedback control is allowed to be a dynamic system of the form 0
=
F(0, x,~)
(4.13)
The control (4.13) is implemented using =
r(~,~,;)
(4.14)
r
=
where the state estimate ~ is generated by the high-gain observer = A k + Be0(&, ~, u) + H ( y - C k )
(4.15)
The observer gain H is chosen as
H = block diag[H1,... , Hp],
Hi =
(4.16) o~i / E r i -- 1 ri--1/ i rl OLri/~
rl x I
where e is a positive constant to be specified and the positive constants c~} are chosen such that the roots of 8r~ -~ OLIi 8 r ~ - - I -~ " " " ~- OJr i - - 1 81 ~- O~ir~ z
0
are in the open left-half plane, for all i = 1 , . . . , p. The function r ~, u) is a nominal model of r z, u). The function r is required to be locally Lipschitz in its arguments over the domain of interest and globally bounded in x. For the purpose of analysis, the observer dynamics are replaced by the equivalent dynamics of the scaled estimation error ?]ij ~---
^ Xij -- Xij ~r~--j
(4.17)
for 1 < i < p and 1 _< j _< ri. Hence, ~ = x - D(c)?] where ,?]1rl,...
, ?]pl, . . . , ? ] p r p ] T
?7
~-
[7711,...
D(E)
=
block d i a g [ n l , . . . ,Dp], n~ = d i a g [ e r ' - l , . . . ,1]r,x~,
The closed-loop system is represented by k
0 ei7
=
Ax+Br
=
r
z,
x -
---- F ( ~ , x - D(e)?],() ---- Ao?] + e B 5 ( x , z, ~, D(c)?])
(4.1S)
4. High-Gain Observers in Nonlinear Feedback Control
257
where
5(x, z, ~9,D(e)~) = r
z, 7(z9, 2, ()) - r
~, 7(~, 2, ~))
1 and ~A0 = D - 1 (~)(A - HC)D(e) is an r x r Hurwitz matrix. It is shown in [10] that the o u t p u t feedback controller (4.14) recovers the performance of the state feedback controller (4.13) for sufficiently small e. The performance recovery manifests itself in three points. First, the origin (x = 0, z = 0, z9 = 0, 2 = 0) of the closed-loop system under o u t p u t feedback is asymptotically stable. Second, the o u t p u t feedback controller recovers the region of attraction of the state feedback controller in the sense that if 7~ is the region of attraction under state feedback, then for any compact set S in the interior of T~ and any compact set Q C_ R r, the set S x Q is included in the region of attraction under o u t p u t feedback control. Third, the trajectory of (x, z, ~)) under output feedback approaches the trajectory under state feedback as e --* 0. Performance recovery is shown in three steps. First, boundedness of trajectories starting in the specified compact set is established by regulating the parameter e such that state estimation is fast enough. Of paramount importance at this stage is the global boundedness of the control function. Then, these trajectories are shown to be arbitrarily close to the origin after a finite time interval; thus a property of ultimate boundedness is established. Finally, local asymptotic stability of the origin is argued in three cases: the case where perfect knowledge of the system's nonlinearity is available (r -- r the case where the origin under state feedback control is exponentially stable, and the case where the origin under state feedback control is asymptotically by not exponentially stable combined w i t h an imperfect knowledge of the system's nonlinearity. In the last case certain conditions were imposed on the growth of the modeling error due to the imperfect knowledge of the system's nonlinearity. Atassi and Khalil extended the results of [10] in two different directions. In [8] they proved the separation principle for the more general case when the state feedback control renders a certain compact set positively invariant and asymptotically attractive. This more general result allows the separation principle to be applied to a number of control tasks beyond the stabilization of an equilibrium point. Examples include finitetime convergence to a set [13], ultimate boundedness [54], servomechanisms [22, 37, 38, 18, 24], and adaptive control [23, 2, 3], In [9], they extended the result of [10] in a different direction. T h e y reviewed various techniques for the design of high-gain observers and classifled them into three groups. First, pole-placement algorithms which lead to either a two-time scale structure as in [14] or a multiple time-scale structure as in [47]. Second, Riccati equation-based algorithms which lead to either an //2 Riccati equation as in [12] and [48, Section 4.4.1] or to an Hoo Riccati equation as in [44] and [48, Section 4.4.2]. Third, Lyapunov equation-based algorithm as in [16]. T h e y showed that separation results
258
4. High-Gain Observers in Nonlinear Feedback Control
similar to those of [10] can be obtained for any one of the other high-gain observer designs provided the state feedback control is globally bounded.
4
Stabilization and Semiglobal Stabilization
Stabilization of nonlinear systems using high-gain observers appeared before the work of Esfandiari and Khalil [14], which was the first paper to draw attention to the impact of peaking on closed-loop stability and to suggest the use of globally bounded feedback controllers. The results which were obtained without globally bounded controllers are either local results where the region of attraction shrinks with decreasing e, or global results which require global Lipschitz conditions. Note that in the case of the global results peaking is present but it does not destabilize the system due to the restrictive assumptions used. It is our opinion, however, that such global results are not very useful because the presence of peaking presents a clearly unacceptable transient response. Examples of the local stabilization results can be found in the work of Tornamb~ [55] for a class of inputoutput linearizable systems and the work of Nicosia and Tornamb6 [41] on robots with elastic joints. Examples of the global stabilization results can be found in the work of Khalil and Saberi [27], Saberi and Sannuti [47], and Gauthier, Hammouri, and O t h m a n [16] for different classes of inputo u t p u t linearizable systems. The paper [27] does not explicitly use high-gain observers, but it uses approximate differentiators which are equivalent to linear high-gain observers, as shown in Section 2. Esfandiari and Khalil [14] studied the stabilization of fully-linearizable uncertain systems using high-gain observers and robust state feedback control techniques. They first gave local and global stabilization results under output feedback. Then, they illustrated the peaking phenomenon and showed how it could lead to shrinking of the region of attraction and even destabilization of the system. They then suggested the use of saturation to render the state feedback control globally bounded and overcome the destabilizing effect of peaking. They used singular perturbations as the main tool of analysis, which is the same approach used later on by Khalil and coworkers in all their work. This paper has been the impetus for a number of research contributions by Khalil's group as well as by several other researchers. A key feature of the combination of globally bounded state feedback control and high-gain observers is the recovery of the region of attraction. This feature was made explicit in the follow-up work by Khalil and Esfandiari [26] where they showed that their approach can achieve semiglobal stabilization. Teel and Praly [53, 54] picked up on the work of Esfandiari and Khalil and developed some fundamental tools for semiglobal stabilization using high-gain observers and saturation [54]. Their results covered stabiliza-
4. High-Gain Observers in Nonlinear Feedback Control
259
tion as well as ultimate boundedness. A key technical contribution of their work is the use of Lyapunov functions to prove asymptotic stability of the closed-loop system under output feedback without resorting to singular perturbation arguments as in Khalil's work. It should be mentioned, however, that while the Lyapunov argument of [54] is more elegant than the singular perturbation argument as a way of proving asymptotic stability, the singular perturbation argument shows that trajectories under o u t p u t feedback approach trajectories under state feedback as e tends to zero, a property which cannot be shown using Lyapunov theory. In [53], Teel and Praly combined results from Tornamb~ [55] and Esfandiari and Khalil [14] to give the first non-local separation principle for nonlinear systems, which we have already discussed in the previous section. Other stabilization results using high-gain observers include the work of Lin and Saberi [35], Praly and Jiang [46], Lin and Qian [34], and Jiang, Hill, and Guo [21]. Except for [21], the other papers combine high-gain observers with saturation to make the feedback control globally bounded, and use either the singular perturbation approach of Khalil or the tools of Teel and Praly. The paper [21] does not use saturation but avoids peaking by a special choice of the initial state of the observer. This is an option that is valid only for systems of relative degree two. The idea can be illustrated by the second-order example of Section 2. Recall from (4.6) that peaking is induced by the fact that rh (0) = [ X l ( 0 ) - :Cl (0)]/s Since xl is measured we can, in essence, choose 21(0) equal to xl(0), or within O(e) from it, thus avoiding an O(1/e) initial value of r/l(0). There may be some difficulty in implementing this idea due to measurement delays.
5
Nonlinear Servomechanisms
In a series of papers [22, 37, 38, 24, 25], Khalil and coworkers used high-gain observers in the nonlinear servomechanism problem. The paper [22] considers a class of input-output linearizable systems with no zero dynamics and with the disturbance satisfying a strict feedback structure. By using the tracking error and its derivatives as state variables, the system is represented as a chain of integrators with all the uncertainty in the state equation satisfying the matching condition. The paper then designs a robust controller which can achieve regional or semiglobal tracking. T h e r e are three basic ingredients of the approach used in [22]. First, by studying the dynamics of the system on the zero-error manifold, a linear internal model is identified. The internal model generates not only the trajectories of the exosystem but also a number of higher-order harmonics generated by the nonlinearities. A linear servo compensator is then synthesized and augmented with the plant. Second, the separal~ion approach is used to design a robust output feedback controller, where a state feedback controller
260
4. High-Gain Observers in Nonlinear Feedback Control
is designed first and then a high-gain observer, that estimates the derivatives of the output, is used to recover the performance achieved under state feedback. A key tool in this approach is the saturation of the state feedback control outside a compact region of interest. Third, to achieve robust regional or semiglobal stabilization of the augmented system (formed of the plant and the servo compensator), the state feedback design uses an effective strategy, whereby a robust control is designed as if the goal was to stabilize the origin. This control brings the trajectories of the system to some neighborhood of the origin in finite time. Near the origin, the robust controller acts as a high-gain feedback controller that stabilizes the disturbance-dependent zero-error manifold. It is emphasized that the output feedback controller of [22] is an error-driven controller which could not have been implemented by state feedback since the change of variables from the original states of the system to the states where the design is calculated is disturbance-dependent. The results of {22] were extended by Mahmoud and Khalil [37, 38] to input-output linearizable systems with zero dynamics, where [37] deals with the special case of constant exogenous systems and [38] deals with the more general case of time-varying exogenous signals. In [24] and [25], Khalil extended the results of [37] and [38] by designing the controller as a universal one that uses limited information about the plant under control. In particular, in the case of constant exogenous signals [24], it is shown that the controller can be designed knowing only the relative degree of the plant and the sign of its high-frequency gain. It is shown in [24] that such universal regulator reduces to the classical PI controller followed by saturation for relative-degree-one systems, and to the classical PID controller followed by saturation for relative-degree-two systems. In the more general case of time-varying signals [25], the controller can be designed knowing only the relative degree of the plant, the sign of its high-frequency gain, and the characteristic equation of the internal model. Moreover, it is shown in [25] that a ~ error in the characteristic equation of the internal model results an O(~) error in the steady-state tracking error. Isidori [18] unified the approach of Khalil with the general theory of nonlinear servomechanisms, as presented in [17]. He showed that for a class of systems that exhibit a triangular structure, exponential stabilizability of the plant at the origin is a sufficient condition for the existence of a globally-defined zero-error manifold. The manifold representation of [18] plays a key role in deriving the results of [25]. Scrrani and Isidori [49] extended the results of [18] removing the minimum-phase assumption. In a different approach to the universal regulator design, A1varez-Ramirez, A1varez, and Su~rez [7] used high-gain observers in the design of o u t p u t feedback control for a class of nonlinear systems of relative degree one. They showed that their controller is equivalent to a standard PI controller with antirest windup structure. The result has some similarities with the universal regulator of [24].
4. High-Gain Observers in Nonlinear Feedback Control
261
6 Adaptive Control The use of high-gain observers in adaptive control of nonlinear systems simplifies the analysis considerably because it reduces the Lyapunov-based adaptive design to a design under state feedback. In effect, it reduces a high-relative-degree problem to a relative-degree-one problem. This line of research was pursued independently by Khalil [23] and Jankovic [19, 20]. Khalil [23] considers a single-input-single-output minimum phase nonlinear system which can be represented globally by an input-output model. The model depends linearly on unknown parameters which belong to a known compact convex set. The paper designs a semiglobal adaptive output feedback controller which ensures that the o u t p u t of the system tracks any given reference signal which is bounded and has bounded derivatives up to the order of the system. The reference signal and its derivatives are assumed to belong to a known compact set. They are also assumed to be sufficiently rich to satisfy a persistence of excitation condition. T h e design process is simple. First it is assumed that the o u t p u t and its derivatives are available for feedback and the adaptive controller is designed as a state feedback controller in appropriate coordinates. Then, the controller is saturated outside a compact region of interest and a high-gain observer is used to estimate the derivatives of the output. It is shown via asymptotic analysis that, for sufficiently small e, the adaptive o u t p u t feedback controller recovers the performance achieved under state feedback. One drawback of [23] is that persistence of excitation is required not only for parameter convergence, but even for tracking error convergence. Without persistence of excitation, it is only shown that the mean square tracking error is of order O(e). This drawback was removed in the work of Aloliwi and Khalil [2, 3] where tracking error convergence is shown without persistence of excitation. To arrive at this result, [2] and [3] analyze the closed-loop system under o u t p u t feedback directly, rather than the separation approach used in [23]. This is one of the few results reported in this paper which cannot be proved using the separation principle of Section 3 because convergence under state feedback cannot be represented as asymptotic stability of a positively invariant set. Robustness of the adaptive controller of [23] is studied in [2, 3], where two robustness results are shown. First, it is shown that the adaptive controller is robust to sufficiently small bounded disturbance. Second, for a wide class of not-necessarily-small bounded disturbance, a robustifying control component is added to achieve a small ultimate bound on the tracking error, provided an upper bound on the disturbance is known. Similar results are obtained for unmodeled dynamics in [4]. Jankovic [20] derives an output feedback adaptive controller for a class of nonlinear systems using a high-gain observer combined with saturation. The class of systems is similar to the one used in [23], but the approach differs from [23] in two aspects. First, he uses a modified version of the
262
4. High-Gain Observers in Nonlinear Feedback Control
observer-based identification scheme of [32]. Second, the design is not based on a separation approach as in [23]. The result of [20] is semiglobal and requires a persistence of excitation condition for tracking error and parameter error convergence. The persistence of excitation condition is relaxed in [19] by choosing the adaptation gain sufficiently large. The early work on the use of high-gain observers in adaptive control, especially [23], has inspired a number of interesting extensions. Lee and Khalil [33] apply the adaptive controller of [23] to the control of an n-link robot manipulator with unknown load, using only joint position measurements. High-gain observers are used to estimate joint velocities and the control inputs are saturated outside a compact region of interest. Seshagiri and Khalil [50] design an adaptive output feedback controller for a nonlinear system where Radial Basis Function (RBF) neural networks are used to model the system's nonlinearities. This is essentially an application of the results of [3] since RBF networks are linearly parameterized in the weights and the approximation error can be treated as bounded disturbance. Zhang, Ge, and Hang [56] carry the neural network application one step further by working with multilayer neural networks which depend nonlinearly on the weights. Miyasato [40] shows how the use of high-gain observers can simplify the traditional model reference adaptive control of linear systems.
7
Sliding Mode Control
Oh and Khalil [42, 43] used high-gain observers in sliding mode control. The first paper [42] deals with stabilization of an input-output linearizable system with no zero dynamics, while the second paper [43] deals with tracking in the presence of disturbances and allows zero dynamics. Due to discontinuity of the sliding mode control, the separation approach of Section 3 cannot be used. The approach used in [42, 43] is to design the high-gain observer first; then design the sliding mode control as a globally bounded function of the state estimates to ensure attractivity of the sliding manifold. The results are semiglobal and show ultimate boundedness with an ultimate bound of order O(e).
8
Unmodeled
Fast Dynamics
Since high-gain observers extend the bandwidth of the controller, it is important to study the robustness of the control design to unmodeled fast (high-frequency) dynamics. Such a study was initiated by Aldhaheri and Khalil [1] who studied the robustness of the stabilizing controller of [14] to unmodeled actuator dynamics. Their analysis confirms the intuition gained from singular perturbation theory [30] that the design will be robust pro-
4. High-Gain Observers in Nonlinear Feedback Control
263
vided the actuator dynamics are sufficiently fast relative to the dynamics of the nominal closed-loop system. An interesting finding of [1] is the fact that the actuator dynamics need not be faster than the observer dynamics. Matlmoud and Khalil [36] extend the results of [1] in two directions. First, they consider sensor dynamics in addition to actuator dynamics. Second, they work with a general stabilizing state feedback controller in the spirit of the separation principle of Atassi and Khalil [10]. T h e y show that, given any globally-bounded stabilizing state-feedback control, the closed-loop system performance can be recovered by a sufficiently fast high-gain observer in the presence of sufficiently fast actuator and sensor dynamics. The actuator dynamics needs not be faster than the observer dynamics, but the sensor dynamics should be sufficiently faster than the observer dynamics.
9
Discrete-Time
Implementation
Dabroom and Khalil [11] studied discrete-time implementation of linear high-gain observers and their use as numerical differentiators. It is shown in [11] that discretization using the bilinear transformation m e t h o d gives better results than other discretization methods. Moreover, many of the available numerical differentiators are special cases of the bilinear discretetime equivalents of full-order or reduced-order high-gain observers.
10
Application to Induction Motors
One application of high-gain observers which has been carried successfully to experimental testing is its use in the control of induction motors. T h e r e are two distinct applications of high-gain observers in induction motors: first, their use to estimate the rotor speed from the rotor position measurement; second, their use to estimate the derivative of the stator current in senesorless control, i.e., control without measurement of the rotor position. The first application appears in the work of Aloliwi, Strangas and Khalil [28, 5, 6] and the second one in the work of Strangas et al [51].
Acknowledgement This work was supported by the National Science Foundation under grant number ECS-9703742.
264
11
4. High-Gain Observers in Nonlinear Feedback Control REFERENCES
[1] R. W. Aldhaheri and H. K. Khalil. Effect of unmodeled actuator dynamics on output feedback stabilization of nonlinear systems. Automatica, 32(9):1323-1327, 1996. [2] B. Aloliwi and H.K. Khalil. Adaptive output feedback regulation of a class of nonlinear systems: convergence and robustness. IEEE Trans. Automat. Contr., 42:1714-1716, 1997. [3] B. Aloliwi and H.K. Khalil. Robust adaptive output feedback control of nonlinear systems without persistence of excitation. Automatica, 33:2025-2032, 1997. [4] B. Aloliwi and H.K. Khalil. Robust adaptive control of nonlinear systems with unmodeled dynamics. In Proc. IEEE Conf. on Decision and Control, pages 2872-2873, Tampa, FL, December 1998. [5] B. Aloliwi, H.K. Khalil, and E.G. Strangas. Robust speed control of induction motors. In Proc. American Control Conf., Albuquerque, NM, June 1997. WP16:4. [6] B. Aloliwi, H.K. Khalil, and E.G. Strangas. Robust speed control of induction motors: application to a benchmark example. 1998. Submitted for publication. [7] J. Alvarez-Ramirez, J. Alvarez, and R. Su~rez. Robust PI control of a class of nonlinear systems. 1998. Submitted for publication. [8] A. N. Atassi and H. K. Khalil. A separation principle for the control of a class of nonlinear systems. In Proc. IEEE Conf. on Decision and Control, pages 855-860, Tampa, FL, December 1998. [9] A. N. Atassi and H. K. Khalil. Separation results for the stabilization of nonlinear systems using different high-gain observer designs. 1998. Submitted for publication. [I0] A.N. Atassi and H.K. Khalil. A separation principle for the stabilization of a class of nonlinear systems. IEEE Trans. Automat. Contr., 44, 1999. To appear. See also Proc. European Control Conf., Brussels, July 1997. WE-A-A-4. [II] A. Dabroom and H.K. Khalil. Discrete-time implementation of highgain observers for numerical differentiation. Int. J. Contr., 1999. To appear. [12] J.C. Doyle and G. Stein. Robustness with observers. Automat. Contr., AC-24(4):607-611, 1979.
IEEE
Trans.
4. High-Gain Observers in Nonlinear Feedback Control
265
[13] F. Esfandiari and H.K. Khalil. Observer-based design of uncertain systems: recovering state feedback robustness under matching conditions. In Proc. Allerton Conf., pages 97-106, Monticello, IL, September 1987. [14] F. Esfandiari and H.K. Khalil. Output feedback stabilization of fully linearizable systems. Int. J. Contr., 56:1007-1037, 1992. [15] B.A. Francis and K. Glover. Bounded peaking in the optimal linear regulator with cheap control. IEEE Trans. Automat. Contr., AC23(4):608-617, 1978. [16] J.P. Gauthier, H. Hammouri, and S. Othman. A simple observer for nonlinear systems application to bioreactors. IEEE Trans. Automat. Contr., 37(6):875-880, 1992. [17] A. Isidori. Nonlinear Control Systems. Springer-Verlag, New York, 3rd edition, 1995. [18] A. Isidori. A remark on the problem of semiglobal nonlinear output regulation. IEEE Trans. Automat. Contr., 42(12):1734-1738, 1997. [19] M. Jankovic. Adaptive output feedback control of nonlinear feedback linearizable systems. Int. J. Adaptive Control and Signal Processing, 10:1-18, 1996. [20] M. Jankovic. Adaptive nonlinear output feedback tracking with a partial high-gain observer and backstepping. IEEE Trans. Automat. Contr., 42(1):106-113, 1997. [21] Z.P. Jiang, D.J. Hill, and Y. Guo. Semi-global output feedback stabilization for the nonlinear benchmark example. In Proc. European Control Conf., Brussels, July 1997. FR-A-K-8. [22] H.K. Khalil. Robust servomechanism output feedback controllers for a class of feedback linearizable systems. Automatica, 30(10):1587-1599, 1994. [23] H.K. Khalil. Adaptive output feedback control of nonlinear systems represented by input-output models. IEEE Trans. Automat. Contr., 41(2):177-188, 1996. [24] H.K. Khalil. Universal regulators for minimum phase nonlinear systems. In Proc. American Control Conf., Philadelphia, PA, June 1998. [25] H.K. Khalil. On the design of robust servomechanisms for minimum phase nonlinear systems. In Proc. IEEE Conf. on Decision and Control, pages 3075-3080, Tampa, FL, December 1998.
266
4. High-Gain Observers in Nonlinear Feedback Control
[26] H.K. Khalil and F. Esfandiari. Semiglobal stabilization of a class of nonlinear systems using output feedback. IEEE Trans. Automat. Contr., 38(9):1412-1415, 1993. [27] H.K. Khalil and A. Saberi. Adaptive stabilization of a class of nonlinear systems using high-gain feedback. IEEE Trans. Automat. Contr., AC-32(11):1031-1035, 1987. [28] H.K. Khalil and E.G. Strangas. Robust speed control of induction motors using position and current measurement. IEEE Trans. Automat. Contr., 41:1216-1220, 1996. [29] H. Kimura. A new approach to the perfect regulation and the bounded peaking in linear multivariable control systems. IEEE Trans. Automat. Contr., AC-26(1):253-270, 1981. [30] P.V. Kokotovic, H.K. Khalil, and J. O'Reilly. Singular Perturbations Methods in Control: Analysis and Design. Academic Press, New York, 1986. [31] P.V. Kokotovic and R. Marino. On vanishing stability regions in nonlinear systems with high-gain feedback. IEEE Trans. Automat. Contr., AC-31(10):967-970, 1986. [32] M. Krstic, P.V. Kokotovic, and I. Kanellakopoulos. Adaptive nonlinear output-feedback control with an observer-based identifier. In Proc. American Control Conf., pages 2821-2825, San Francisco, CA, June 1993. [33] K.W. Lee and H.K. Khalil. Adaptive output feedback control of robot manipulators using high-gain observers. Int. J. Contr., 67(6):869-886, 1997. [34] W. Lin and C. Qian. Semiglobal robust stabilization of nonlinear systems by partial state and output feedback. In Proc. IEEE Conf. on Decision and Control, pages 3105-3110, Tampa, FL, December 1998. [35] Z. Lin and A. Saberi. Robust semi-global stabilization of minimumphase input-output linearizable systems via partial state and output feedback. IEEE Trans. Automat. Contr., 40(6):1029-1041, 1995. [36] M.S. Mahmoud and H.K. Khalil. Robustness of high-gain observerbased nonlinear controllers to unmodeled actuators and sensors. 1998. Submitted for publication. [37] N.A. Mahmoud and H.K. Khalil. Asymptotic regulation of minimum phase nonlinear systems using output feedback. IEEE Trans. Automat. Contr., 41(10):1402-1412, 1996.
4. High-Gain Observers in Nonlinear Feedback Control
267
[38] N.A. Mahmoud and H.K. Khalil. Robust control for a nonlinear servomechanism problem. Int. J. Contr., 66(6):779-802, 1997. [39] T. Mita. On zeros and responses of linear regulators and linear observers. IEEE Trans. Automat. Contr., AC-22(3):423-428, 1977. [40] Y. Miyasato. A simple redesign of model reference adaptive control system and its robustness. In Proc. IEEE Conf. on Decision and Control, pages 2880-2885, Tampa, FL, December 1998. [41] S. Nicosia and A. Tornamb~. High-gain observers in the state and parameter estimation of robots having elastic joints. Systems Contr. Lett., 19:331-337, 1993. [42] S. Oh and H.K. Khalil. Output feedback stabilization using variable structure control. Int. J. Contr., 62:831-848, 1995. [43] S. Oh and H.K. Khalil. Nonlinear output feedback tracking using highgain observer and variable structure control. Automatica, 33:18451856, 1997. [44] I.R. Petersen and C.V. Holot. High-gain observers applied to problems in disturbance attenuation, H-infinity optimization and the stabilization of uncertain linear systems. In Proc. American Control Conf., pages 2490-2496, Atlanta, GA, June 1988. [45] V.N. Polotskii. On the maximal errors of an asymptotic state identifier. Automation and Remote Control, 11:1116-1121, 1979. [46] L. Praly and Z.P. Jiang. Further results on robust semiglobal stabilization with dynamic input uncertainties. In Proc. IEEE Conf. on Decision and Control, pages 891-896, Tampa, FL, December 1998. [47] A. Saberi and P. Sannuti. Observer design for loop transfer recovery and for uncertain dynamical systems. IEEE Trans. Automat. Contr., 35(8):878-897, 1990. [48] A. Saberi B.M. Chen and P. Sannuti. Loop Transfer Recovery: Analysis and Design. Springer-Verlag, New York, 1993. [49] A. Scrrani and A. Isidori. Robust output regulation for a class of non-minimum phase systems. In Proc. IEEE Conf. on Decision and Control, pages 867-872, Tampa, FL, December 1998. [50] S. Seshagiri and H.K. Khalil. Output feedback control of nonlinear systems using RBF neural networks. 1998. Submitted for publication.
268
4. High-Gain Observers in Nonlinear Feedback Control
[51] E.G. Strangas, H.K. Khalil, B. Aloliwi, L. Laubinger, and J. Miller. Robust tracking controllers for induction motors without rotor position sensor: analysis and experimental results. IEEE Trans. Energy Conversion 1999. To appear. [52] H.J. Sussmann and P.V. Kokotovic. The peaking phenomenon and the global stabilization of nonlinear systems. IEEE Trans. Automat. Contr., 36(4):424-440, 1991. [53] A. Teel and L. Praly. Global stabilizability and observability imply semi-global stabilizability by output feedback. Systems Contr. Lett., 22:313-325, 1994. [54] A. Teel and L. Praly. Tools for semiglobal stabilization by partial state and output feedback. SIAM Y. Control ~ Optimization, 33:1443-1488, 1995. [55] A. Tornamb~. Output feedback stabilization of a class of nonminimum phase nonlinear systems. Systems Contr. Lett., 19:193-204, 1992. [56] T. Zhang, S.S. Ge, and C.C. Hang. Adaptive output feedback control for general nonlinear systems using multilayer neural networks. In Proc. American Control Conf., pages 520-524, Philadelphia, June 1998.
Output-Feedback Control of Stochastic Nonlinear Systems Stabilization, Disturbance Attenuation, and A d a p t a t i o n Miroslav Krsti5 and Hua Deng Department of Applied Mechanics and Engineering Sciences University of California at San Diego La Jolla, CA 92093-0411, USA
1 Introduction Despite huge popularity of the LQG control problem, the stabilization problem for nonlinear stochastic systems has been receiving relatively little attention until recently. Efforts towards (global) stabilization of stochastic nonlinear systems have been initiated in the work of Florchinger [6, 7, 8] who, among other things, extended the concept of control Lyapunov functions and Sontag's stabilization formula [24] to the stochastic setting. A breakthrough towards arriving at constructive methods for stabilization of broader classes of stochastic nonlinear systems came with the result of Pan and Ba~ar [21] who derived a backstepping design for strict-feedback systems motivated by a risk-sensitive cost criterion [1, 11, 19, 23]. In [2, 3], for the same class of systems as in [21], we designed inverse optimal control laws, which, unlike those in [21], can be designed in an automated manner (via symbolic software). Important extensions were reported by Tsinias [27, 281 . In this chapter, we address the output-feedback problem for stochastic nonlinear systems. The output-feedback problem has received considerable attention in the recent robust and adaptive nonlinear control literature [18, 17, 22, 26, 14, 12]. The present chapter is the first to address the outputfeedback problem in the stochastic setting. We present three results. First, in Section 3, we design an output-feedback (observer based) backstepping control law which guarantees global asymptotic stability in probability. The result in Section 3 solves only the equilibrium stabilization problem under the assumption that the noise vector field is vanishing (which preserves the equilibrium) and the assumption that a bound on the noise covariance is known. These assumptions allow some interesting nonlinear systems but exclude linear systems with additive noise! For this reason, in [4] we addressed systems with nonvanishing noise vector
270
5. Output-Feedback Control of Stochastic Nonlinear Systems
field and unknown bound on covariance, and derived both "robust" and "adaptive" controllers for the class of strict-feedback systems. In Sections 4 and 5, we extend these results to the class of o u t p u t feedback systems. The class of systems that we consider is the stochastic version of the output.feedback form, which is the broadest class for which global output-feedback controllers currently exist in the deterministic setting.
2
Preliminaries
on Stochastic
Stability
In this section, we briefly review some stochastic stability concepts and theorems from [16], which will be used in the following sections. Consider the nonlinear stochastic system
dx = f(x)dt + g(x)dw,
(5.1)
where x C lR n is the state, w is an r-dimensional Wiener process with incremental covariance E(t)E(t)Tdt, i.e., E {dwdw T } = E(t)E(t)Wdt, where E(t) is a bounded function taking values in the set of nonnegative definite matrices, f : ]R '~ ~ ]R n and g : ]Rn --+ ]Rn• are locally Lipschitz, and f(0) = 0. We first state notation which will be used in the sequel. For a matrix
x = [Xl, x2,-.. ,x l,
]X[.T" a~ (Tr {xT x } ) 1/2 z (Tr { x x T } ) 1/2
(5.2)
denotes the Frobenius norm, and obviously, IXl7 = Icol(X)l
(5.3)
where col(X) = [xT, xT,. .. ,xn]T T. D e f i n i t i o n 5.1 The system (5.1) is noise-to-state stable (NSS) ifVe > O, there exists a class Ids .function/3(., .) and a class Id function 7('), such
that P{[(x(t)l 3(Ixol,t)+~/(sUPtolE(s)E(s)TIT)} > _ l - e , Vt_> 0, Vxo E ]Rn\{0}. (5.4) NSS is a stochastic analog of input-to-state stability [25]. T h e o r e m 5.1 ([16]) Consider system (5.1) and suppose there exists a C2 .function V(x), class l~oo .functions oq, ol2 and p, and a class 1~ .function a3, such that cq(Ixl) _< V(x) <_ o~2(Ixl),
OV
1
(~T T 02V
(5.5)
)<
(5.6)
5. Output-Feedback Control of Stochastic Nonlinear Systems
271
Then the system (5.1) is NSS. Consider the nonlinear stochastic system (5.1) with additional assumptions that g(0) -- 0 and E(t) - I. D e f i n i t i o n 5.2 The equilibrium x -= 0 o.f the system (5.1) is 9 globally stable in probability ifVe > 0 there exists a class IC function ~/(. ) such that p {Ix(t)l < ~(Ix01)} > 1 - ~,
vt > O, Vx0 e ~ n \ { o } ,
(5.7)
9 globally asymptotically stable in probability if Ve > 0 there exists a class ICI: function 13(', ") such that
P{]x(t)[ < 3(Ix0l,t)} _> l - - E ,
Yt _> 0, Vx0 e IRn\{0}.
(5.8)
T h e o r e m 5.2 ([16]) Consider system (5.1) and suppose there exists a C2 .function V(x) and class ]Coo .functions ~1 and c~2, such that al(Ixl) <__V(x) <_ ~2(Ixl)
(5.9)
OV 1 T r r T 02V / EV(x) = --~xf(X ) + -~ ~g -O--~x2g~ <_ - W ( x ) ,
(5.10)
where W ( x ) is continuous and nonnegative. Then the equilibrium x ----0 is globally stable in probability and P { t ~ m W ( x ) = 0} = 1.
(5.11)
This theorem is a stochastic analog of LaSalle's theorem, and the main Lyapunov theorem comes as a corollary. T h e o r e m 5.3 Consider system (5.1) and suppose there exists a C2 function V(x), class ~o~ .functions al and a2, and a class ~ .function a3, such that OLl(lxl) ~ V(x) ~ oL2(]xl),
s
OY 02V ~~ <_ -a3(Ixl). = -~xf(X ) + 1Tr2 { gW-~x2g
(5.12) (5.13)
Then the equilibrium x = 0 is globally asymptotically stable in probability.
3
Output-Feedback Stabilization in Probability
In this section we deal with nonlinear output:feedback systems driven by white noise. This class of systems is given by the following nonlinear stochas-
272
5. Output-Feedback Control of Stochastic Nonlinear Systems
tic differential equations:
dxi dxn y
= =
X~+ldt + ~ i ( y ) T d w , udt + ~n(y)Tdw
z
Xl ,
i--l,...
,n-1 (5.14)
where ~ (y) are r-vector-valued s m o o t h functions with ~ ( 0 ) = 0, a n d w is an independent r-dimensional s t a n d a r d W i e n e r process. Since the states x2, 99 9 , xn are not measured, we first design an observer which would provide exponentially convergent estimates of the u n m e a s u r e d states in the absence of noise. T h e observer is designed as
x~
---- :~+l + k i ( y - 5:l),
i=l,..-,n
(5.15)
where 2n+1 = u. T h e observation error 2 = x - 2 satisfies
-kl ~dt + ~(y)T dw
d~c -kn =
0
...
0
Ao~cdt + ~(y)Tdw,
(5.16)
where A0 is designed to be a s y m p t o t i c a l l y stable. Now, the entire s y s t e m can be expressed as
dy d22
Ao~dt + ~(y)T dw (5:2 + 5:2) dt + ~l (y)T dw [5:3 + k2 (y - :~,)] dt
din
[u ~- k n ( y - :~1)] d t .
d2
(5.17)
O u r o u t p u t - f e e d b a c k design will consist in a p p l y i n g a backstepping procedure to the s y s t e m (y, ~ 2 , " " , ~n), while also taking care of the feedback connection t h r o u g h the 2 system. In the s t a n d a r d backstepping m e t h o d for deterministic s y s t e m s [9] (where dw/dt would be a b o u n d e d deterministic disturbance), a sequence of stabilizing functions a i ( ~ , y), where J:i = [ ~ 2 , " " , ~i] T, is c o n s t r u c t e d recursively to build a L y a p u n o v function of the form n
i~1
where P is a positive definite m a t r i x which satisfies A T P + PAo -- - I , a n d the error variables z~ are given by Zl
=
y
Zi
-~-
2Ci -- Oli--1 ( ~ i - I , Y ) ,
(5.19) i : 2,''"
,n.
(5.20)
5. Output-Feedback Control of Stochastic Nonlinear Systems w
273
~l
[[
:~-system
r~
y
W
-system
9
P FIGURE 1. Feedback structure of the system (5.17) The Lyapunov design for stochastic systems cannot be performed using the quadratic Lyapunov function (5.18) because of the term
!~
a2v /
in (5.10). We instead employ a quartic (fourth order) Lyapunov function 1 4 + (~Tp~)2. ~z~
V=~
(5.21)
i=1
Our presentation of the backstepping procedure here is very concise: instead of introducing the stabilizing functions a~ in a step-by-step fashion, we derive them simultaneously 9 A reader who is a novice to the technique of backstepping is referred to [17]. We start by an important preparatory comment. Since ~i(0) = 0, the c~'s will vanish at ~ = 0, y = 0, as well as at 5i = 0, where 5i = [zl,.-- , z~]T. Thus, by the mean value theorem, a i ( ~ i , y ) and ~(y) can be expressed respectively as i
~(Z,y)
:
~z~z(Z,y),
(59
/=1
{(y) where c~{z(~{,y) and r
:
yr
are smooth functions.
(59
274
5. Output-Feedback Control of Stochastic Nonlinear Systems
Now, we are ready to start the backstepping design procedure. According to It6's differentiation rule [20], we have (~2 + &2) dt
dZl = dzi
+ ~l(y)Tdw
(5.24)
i--1 O0~i_1 if:i+1+ kixl - E ~ (2~/+1 ~- k/X1) /=2
=
21 \(02ai_1 Oy2 )~I(y)T~I(Y)] dt i
=
OOLi_1 0----~ (2C2~- X2)
C~y-l~l(y)Tdw
(5.25)
2 , ' ' ' ,n.
As we announced previously, we employ a Lyapunov function of a quartic form =
1 4
1
n
b (5:Tp2) 2
7
\
(5.26)
/
i=2 where b is a positive constant. This form of the Lyapunov function clearly indicates that we view the system as a feedback connection in Figure 1. The first two terms in (5.26) constitute a Lyapunov function for the (Y, X2,"" ,2n)-system, while the third term in (5.26) is a Lyapunov function for the ~-system. Even though not obvious from the calculations that follow, we achieve a nonlinear smallgain global stabilization (in probability) in the style of [13]. Now we start the process of selecting the functions ~ ( ~ , y) to m a k e / : V negative definite. Along the solutions of (5.16), (5.24) and (5.25), we have
32
s
T
i=2 i--100Zi_ 1
-- E
l=2
02~-----~(2~/§ ~- klXl)
O0~i_1
0----~ (:g2 "4-:E2)
1020~i_1 T 3 n 2 00~i--1 2 T - - - - ( ~ 7 2 ) ~91(Y) ~I (Y)] q-~ EZi ( ~ ~I(Y) ~01(Y) 2
~=2
-b2w p~]~l 2 + 2bTr {~v(y) ( 2 p ~ T P = --b2wp~cl2l2 + 2bTr {~v(y) ( 2 p 2 2 T P
\
+ +
Y /
~cTP~P) ~TP2P)
~(y)T } ~(y)W}
+y3
n
[
i--1
+Ez31ai+zi.1+k~21_EOai-1 L
,=2
0~i--1 (~2 _~t_:~2) o----7-
5. Output-Feedback Control of Stochastic Nonlinear Systems
1(0~a~-1)
2 ~k aY 2 <
-
]
3 '~
(Ooti_lh 2
~i(y)T~oI(Y) "4-2EZ/2 ~ T ] i~2
[
1 1 1 bA-3bnv/-~e221PI4-4 ~=27/~ 4 ~
]
r
i
1~14+ya ~ + ~ t ( y )
3-43 '~ + 3 ~y+~y+~--~(~I(y)Tr
275
~
r
I~(y)I4y ]
i=2 i--1 Oo~i_ 1 ,~-1 [0~ _N_., 0a~_1 § 3 i~-~gi:~1 (:~/A-1~- k/Xl) -~:2 L ~ o~l ---g~y x2 3 ,
4
1
3 4 (OOQ--lh ~
+ ~ Z i + 4~i_lZi + 7V~ k Oy J Zi 2lO2t~i-l~l(y)T~l(y) 0y 2 3 ~ cQcI~i_I~ 4
n--1 'r-~CQ~n 1
3[
102a,~_1
1
3
( Oa,~-, ~ }
2 op~ ~I(Y)~I(Y) +4T_~ z~ + ~ \
3
OC~n-1
oy ] zn
]
+4-~ \ ~ J
J
(5.27)
where A > 0 is the smallest eigenvalue of P. The second equality comes from substituting ~ = z~ + a~-l, and the inequality comes from Young's inequalities in Appendix A. At this point, we can see that all the terms can be cancelled by u and (~i. If we choose El, C2 and ~i to satisfy
bA - 3bnx/-~e2[PI 4 - -~ i=2 ~/~
and
~i and
u as
4e~ - p > O,
(5.2s)
276
5. Output-Feedback Control of Stochastic Nonlinear Systems
3
3.-4.
T
o~1 = - c l y - ~)I(Y) r
3- 4
3 ~
"~(~Y - -~e~Y - ~ E ~? (~)I(y)T ~)I(y))2Y i=2
3b~v/-ff lr
14y
(5.29)
i--100~i_l
00~i--1
~i = - e ~ z , - k , ~ l + ? _ _ . - S g - ( ~ § 3 "
-~5~z,
t=2
t
1
3
4
y
(00~i_1~ ~3
2 3
46~_1z'-~v3\ Oy ] z ~ - ~ \ n--l OO~n_l
u = -cnz~-k.~ +~--g~ l-~2
+
020~i--1
1 ~
^
e
~W
/
(00~i--1~ 4
Oy ] z~
(5.30)
~
(X~+l+kZ~)
l
Oc~n-l Fc2_+ 1 02C~n_1qol(y)Wqol(y) Oy 2 Oy2 1
3
~3 (OOLn--l~ ~
3
fOOLn_l~ 4
-45~_-----~z~-~\ 0y ] z~-4--~ \ 0y ] ~ '
(5.31)
where ci > 0, then the infinitesimal generator of the closed-loop system (5.16), (5.24), (5.25) and (5.31)is negative definite:
; v < - ~ c~z2 - pl~f4
(5.32)
i=1
With (5.32), we have the following stability result. T h e o r e m 5.4 The equilibrium at the origin o.f the closed-loop stochastic
system (5.17), (5.31) is globally asymptotically stable in probability.
4
\
~lty)~lty)
Output-Feedback Noise-to-State Stabilization
In this section we deal with nonlinear output:feedback systems driven by white noise with bounded but unknown covariance. This class of systems is given by the following nonlinear stochastic differential equations:
dxi dxn y
= xi+ldt + ~i(y)T dw, = udt +pn(y)Wdw ~
Xl~
i=1,...
,n-1
(5.33)
5. Output-Feedback Control of Stochastic Nonlinear Systems
277
where T~ (y) are r-vector-valued smooth functions, and w is an r-dimensional Wiener process with incremental covariance E { dwdwT } =E(t)E(t)W dt. The observer is designed as in (5.15), and the entire system can be expressed as (5.17). The error variables z~ are defined as in (5.19), (5.20). With It6's differentiation rule, we have
dzi
=-
dzi
=
(22 + 22) dt
+~l(y)Tdw i-i Oa~_l
2~+i + ki21 - E
/=2
02t
(5.34)
1 102C~i_l
2 \ 0y 2 ) ~I(y)T~Tcfll(Y)] at i
=
~OLi--1
(2/+1 "~-klXx)
Oy (22 q- 22)
0 ~----1 ~91(y)Wdw (5.35) oy
2 , . . . ,n.
As in Section 3, we employ a quartic Lyapunov function
1 4+ ~ 1 ~~ z2 + ~ b (2 wP2)2 V(z, ~) = ~y i~2
(5.36)
Now we start the process of selecting the functions a i ( 2 i , y ) to make s in the form (5.37)
~ v ~ -p(2, y, 2) + ~(l~l)
where p is positive definite, radially unbounded, and 7 is a class/C function. Since ~(y) is a smooth function, according to mean value theorem, we can write it as
~(y) = ~(0) + y~b(y)
(5.38)
where ~b(y) is a smooth function. Along the solutions of (5.16), (5.34) and (5.35), we have ,~
s
n
[
i--I
i=2
t
z=2 O:~z
= (kz+i + kt2i)
3~
ba~-i
]
1 (02c~i_1~
2 (o~_1~ 2
i=2 -b2Tp212[ 2 + bTr { ~ ( y ) E w ( 2 p ~ 2 T P
+
2Tp2P)
E~(y) T }
(5.39)
278
5. Output-Feedback Control of Stochastic Nonlinear Systems
Therefore EV
<
-bc?Tpyclx]~+ 3bnv~[P]2
1 1)
]~~176
+ e-~ + 2~e42 [:~[4
3 v~lPI e21~(o)Tg~(O)llEi4+3bnv/-fflpi2 2
+3bnv/-~ip,2(~]qo(0)Tr +y3 (al + z2 + 5:2) +
n
[
q_ ~
,r
+
iEis
r
y3
3y2~Ol(y)TEETtfll(y) i--1
- N-" O a i - 1
Oc~i- 1
+~z~=~ ~,+z~+~+k~ ~= o~ (~+~+k~')----N--~(~+~) 2~ (~y2)~ol(y)T~']~rT~OI(y -t-
-< - bA-3bnx/KIPI2
I~(0)w~(0)l+
~_y3 (:l,1 "t- 73 (r
Z/2
~01(y)T~]ET~oI(y)
+
-4._z~
la?14
3* "i'-73 ~ ~y ~- 7E~y
E2
q- E Z/30Li q- ]ci:~l i---2
2
=
(2~/§ q- klXl)
~olty ), , T~,cy~) , ,~2 z~ + +~11102ai-lt...~__y2
X2
35~ z~ + ~ z i
3 _4 (OOLi_l~ai 3 (00~i--1~ 4 ] naT?I? ~ Oy ) Ziq-~i ~ Oy ) Zi (qoI(y)Tqo'(Y))2 +
+ ~ y-~
i=2
+3bnv~lPI2(~+@)[E, s
)
+ 1 Ird 4
(5.40)
where A > 0 is the smallest eigenvalue of P. The inequalities come from substituting ~ = z~ + a~-l, and Young's inequalities in Appendix B and (5.68), (5.69), (5.70), (5.71) in Appendix A. At this point, we can see that
5. Output-Feedback Control of Stochastic Nonlinear Systems
279
all the terms can be cancelled by u and a~. If we choose e~, e2, e3, e4 and ~ to satisfy
1+ bA--3bnv~IPI 2(2~2 I~(0)w~(0)l+~
1 ~ ~1 ~~== n4 1 4~14 ----p> 0, ~-~24]-(5.41)
and c~ and u as
OL1
--Cly--
~ 3bn~[
3~ 34 '3~ (~01(Y) T~ I ( Y ) ) 2Y - - '~Y-'~E~y
e2 I~b(y)T ~b(y)14y5) (5.42)
r ]2
i--1
-c~zi - k ~ l + ~
Oti
Oai-1
Oa~-i ^
02o~i_l / ~W z , 2 3 4 __1 f~--~-y2 3 _ ~Cz~ ~l[Y) ~I[Y)) Zi 4
3 4- (Oa~_l~ ~
1
45~ 1 zi
3 (O0~i--l~ 4
-~'i~ \ oy ] z~-~-~ \--5~y ] (~l(Y)~(Y))=z~(543) c~
U
(5.44)
where c~ > 0 and 5,~ = 0, then the infinitesimal generator of the closed-loop system (5.16), (5.34), (5.35) and (5.44) satisfies:
f_.V < -
c~z~_p[~14 +
3bn -
i=1
+3bnv/-n,Pl2(~+~)lE, 8
IPL2e~I~(0)T~(0)I+~ 3~r
1~14
i=2
(5.45)
With (5.45), according to Theorem 5.1, we have the following stability result. T h e o r e m 5.5 The closed-loop stochastic system (5.33), (5.15), (5.44) is
NSS.
280
5
5. Output-Feedback Control of Stochastic Nonlinear Systems
Output-Feedback
Adaptive
Stabilization
In this section, we deal with output-feedback systems (5.33) with an additional assumption that ~i(0) -- 0. Since ~ (0) -- 0, by the mean value theorem, ~(y) can be expressed as ~(y)
=
yr
(5.46)
where r is a smooth function. As we will see in the sequel, to achieve adaptive stabilization in the presence of unknown E, it is not necessary to estimate the entire matrix E. Instead, we will estimate only one unknown parameter 0 =]1 EET 112~using an estimate 0. Employing the same observer (5.15), the entire system is: d& dy dx2
= = :
Aob:dt + ~p(y)T dw
[e3 + k~ (y - ~)] dt
(5.47) o =
7~-,~(5,y, 8),
where a,~ and Tn are functions to be designed. In the adaptive backstepping method, the error variables zi are given by za
=
y
(5.48)
Zi
=
xi-ai-1
(&,-1,y, 8 ) ,
i = 2 , . . . ,n.
(5.49)
According to It6's differentiation rule, we have dZl
:
dz~ =
(:c2 + :~2) dt + ~ l ( y ) T dw i--1 O0~i_l (X/+I -{-k/"T1) ~i+1+ k ~ - ~ 0~---[/=2 1/O~a~-l'~
i
=
(5.50)
, ,T~,,-.T
, ,
Oai-~O
OC~i-1
Ov (~2 + ~ )
dt-
2,...,n.
~i(y)rdw (5.51)
As in the previous sections, we employ a Lyapunov function of a quartic form. In this case it also includes the parameter estimation error
=l[ rxT IJL -8,
1 4
1 "
b (~T/o;~)2_~_
V(z, ~, O) = ~y + ~ ~ z~ + ~ i-~2
1 "2
V o,
(5.52)
5. Output-Feedback Control of Stochastic Nonlinear Systems
281
where b is a positive constant and P satisfies
AT p + PAo = - I .
s
Now we start the process of selecting the functions ai(~i, Y, O) to make in the form
Z;V _< - p ( b , y, b, t})
(5.53)
where p is a positive definite function in ~, y, b for each value of 0. Along the solutions of (5.16), (5.50) and (5.51), we have
s
=
y3 ((21 ~_ Z2 ~_ X2) ~- 3y2~I(y)T~-~T~I(Y) n i--1 -[-~-~Z/3i=20~i~-Zi+l~-ki:~l ~=
OXl (Xl+l-~klXl)----~-y [x2-bx2)
ay 2 ] ~I(y)T~TqDI(Y)
2
O~0 J
3
i=2 --bf:Tp~c[~c[ 2 + bTr {q0(y)r w ( 2 p ~ T P
[
_< - b),
~
3 4-
IPI-~~ i=2 i
3 4
3bnv~lpl2~
+~1 (r162 n
~
2
[
[
i=2 1 3/020~i--1~2,, 3 f 0~i--1 ~ 4
O0 + f c r P 2 P ) ET(y) T } - - 7
li:l%y a
~I-~'~(~)I(y)T@I(y))2Y
3 - 11 (~I(y)T~I(y)) 2 y] y + ~(n C -- 1
0Oti__1
00~i--I
^
- - X 2
/=2
+~\ oy ] z, llS2 ~llL
3 4
OqOti_lb]
o~
3 4~00~i-1~ 4
1
/
Ob
~'
282
=
5. Output-Feedback Control of Stochastic Nonlinear Systems
-[bA-3bnv/-n
2 1~ _
1]
1
3n+l 2 T "l- T (?~'sl(Y) ~/)I(Y)) Y
t.
3 4
3 4
3bnx/~lpl2e~
-~-E z3 i~-ki:~l 3~
3 ^]
"--~/(X/-t-I-I-l~gl)
3 ~[Ooti_l] #
1
2
l.y - ~3z 4n \ (0~i_1 ay )~ 4] i=2
-O0li-1
3/0o~i_lXl 4 ^
4 - ~ y - -~ ~ ~ z ~ \
Ir
1 3(020ti_1~ 2
i=2
00~i--1D]
Oy 2
)
(5.54)
where A > 0 is the smallest eigenvalue of P, x,~+l = u, z,~+l = 0, an = u. The inequalities come from substituting :~i = zi +c~_ 1, Young's inequalities in Appendix C and (5.68), (5.69), (5.70), (5.71) in Appendix A. Let
T1 --
3bnxf~lPI2c 2 2
7-~ =
Ti-1 + z3wi,
o
~,~
=
Ir
14y4
3 4
+ ~y
(5.55)
i = 2,''' ,n
(5.56) (5.57)
where
02i__ lz3 (020~i-1) 2 3 (~Oli--1~ \ Oy2 +~zi \ B y )
Then
4
(5.5s)
5. Output-Feedback Control of Stochastic Nonlinear Systems
s
283
-[bA3bnv~P21~2-' ] ]-4/_~2~/4 l~ ~14]~ 1}4
<
3n+l
+36~y + 3r n
"4-E
[
L
2
+ 3bnv/-nlPI2c22 I~(Y)I4YO+ 3y~] i--1 00~i_1 (:~/+1 ~- k/~:l)
+k i : ~ l -
z30~i
i=2
T
E /=2
OOLi--1 ^
Oy x2
0X-'--"~
4_~z,+ 4~_ z , + - ~
O0~iEff1 ~ "~Z3~)j O0 j=2
\ Oy ] z~-w~O
(5.59) j =2
c30
J
If we c h o o s e el, e2 and ~?i to satisfy
bA
3bnx/'~lpi2
2 r2
1 n 1 -~E
i=2
1
(5.60)
:~ 4e4-p>O, ~i
a n d a i a n d u as
(~1
=
--Cly
3n41 --
3bnv/-~LP ]2e2
2
Jr
i--1 O~i
+
=
-- ~3 y
(r
4
3 4_ -- ~c~y
3 ^
~yo
-
(5.61)
0c~_1 (~:~+~+ k15:1) +
OOLi_1 ^ x2
/=2
4~LlZ~-~"~\ oy } 00~i--'------~1 ~[Z3COj-~ ~ j=2
j=2
O~n
~
Z 300gj--1 02"
(5.62) (5.63)
284
5. Output-Feedback Control of Stochastic Nonlinear Systems
where c~ > 0 and 5n = 0, then the infinitesimal generator of the closed-loop system (5.16), (5.50), (5.51) and (5.63) satisfies: n
s
<_ - ~
ciz 4 - p1214.
(5.64)
i~1
Since z = 0 and 2 -- 0 implies x --- 0, by Theorem 5.2, we have the following result. T h e o r e m 5.6 The equilibrium x = 0, ~ =11 EET I1~ of the closed-loop system (5.33), (5.15), (5.57) and (5.63) is globally stable in probability and
P {tli-+mx ( t ) = 0 and t--+oolim~ ( t ) = 0} = 1.
(5.65)
R e m a r k 5.1 Since s is nonpositive, EV(t) is nonincreasing. Since V is also bounded from below by zero, E Y ( t ) has a limit. Since z(t) and ~(t) converge to zero with probability one, thus E ~0(t)2~ has a limit. Denote this limit by A02. Thus
1.i.m. O(t) =11 z z
T
+Ae.
(5.66)
It is well known [5] that convergence in the mean implies convergence in probability, i.e.
=0, W>O.
REFERENCES [1] T. Ba~ar and P. Bernhard, H~-Optimal Control and Related Minimax Design Problems: A Dynamic Game Approach, Boston, MA: Birkhauser, 2nd ed., 1995. [2] H. Deng and M. Krsti~, "Stochastic nonlinear stabilization--Part I: A Backstepping Design," Systems ~ Control Letters, vol. 32, pp. 143150, 1997. [3] H. Deng and M. Krstib, "Stochastic nonlinear stabilization--Part II: Inverse optimality," Systems 8J Control Letters, vol. 32, pp. 151-159, 1997. [4] H. Deng and M. Krstid, "Stabilization of stochastic nonlinear systems driven by noise of unknown covariance," Proceedings of A CC, Philadelphia, 1998. [5] J. L. Doob, Stochastic Processes, New York: John Wiley &: Sons, 1953.
5. Output-Feedback Control of Stochastic Nonlinear Systems
285
[6] P. Florchinger, "Lyapunov-like techniques for stochastic stability," SIAM Journal of Control and Optimization, vol. 33, pp. 1151-1169, 1995. [7] P. Florchinger, "Global stabilization of cascade stochastic systems," Proceedings of the 3~th Conference on Decision ~ Control, New Orleans, LA, pp. 2185-2186, 1995. [8] P. Florchinger, "A universal formula for the stabilization of control stochastic differential equations," Stochastic Analysis and Applications, vol. 11, pp. 155-162, 1993. [9] R. A. Freeman and P. V. Kokotovi6, Robust Nonlinear Control Design: State-Space and Lyapunov Techniques, Boston, MA: Birkhauser, 1996. [10] G. Hardy, J. E. Littlewood, and G. Polya, Inequalities, 2nd Edition, Cambridge University Press, 1989. [11] M. R. James, J. Baras and R. J. Elliott, "Risk-sensitive control and dynamic games for partially observed discrete-time nonlinear systems," IEEE Transactions on Automatic Control, vol. 39, pp. 780-792, 1994. [12] M. Jankovic, "Adaptive nonlinear output feedback tracking with a partial high-gain observer and backstepping", IEEE Transactions on Automatic Control, vol.42, pp. 106-113, Jan. 1997. [13] Z. P. Jiang, A. R. Teel, and L. Praly, "Small-gain theorem for ISS systems and applications," Mathematics of Control, Signals, and Systems, vol. 7, pp. 95-120, 1995. [14] H. K. Khalil, "Adaptive output feedback control of nonlinear systems represented by input-output models", IEEE Transactions on Automatic Control, vol.41, pp. 177-188, Feb. 1996. [15] R. Z. Khas'minskii, Stochastic Stability of Differential Equations, Rockville, Maryland: S & N International publisher, 1980. [16] M. Krsti6 and H. Deng, Stabilization of Nonlinear Uncertain Systems, Springer, 1998. [17] M. Krsti6, I. Kanellakopoulos, and P. V. Kokotovi6, Nonlinear and Adaptive Control Design, New York: Wiley, 1995. [18] R. Marino and P. Tomei, Nonlinear Control Design : Geometric, Adaptive, and Robust, New York : Prentice Hall, c1995. [19] H. Nagai, "Bellman equations of risk-sensitive control," SIAM Journal of Control and Optimization, vol. 34, pp. 74-101, 1996.
286
5. Output-Feedback Control of Stochastic Nonlinear Systems
[20] B. Oksendal, Stochastic Differential Equations-An Introduction with Applications, New York: Springer-Verlag, 1995. [21] Z. Pan and T. Ba~ar, "Backstepping controller design for nonlinear stochastic systems under a risk-sensitive cost criterion," submitted to SIAM Journal of Control and Optimization, 1996. [22] L. Praly and Z. P. Jiang, "Stabilization by output feedback for systems with ISS inverse dynamics", Systems 8J Control Letters, vol. 21, pp. 19-33, July 1993. [23] T. Runolfsson, "The equivalence between infinite horizon control of stochastic systems with exponential-of-integral performance index and stochastic differential games," IEEE Transactions on Automatic Control, vol. 39, pp. 1551-1563, 1994. [24] E. D. Sontag, "A 'universal' construction of Artstein's theorem on nonlinear stabilization," Systems 8J Control Letters, vol. 13, pp. 117123, 1989. [25] E. D. Sontag, "Smooth stabilization implies coprime factorization," IEEE Transactions on Automatic Control, vol. 34, pp. 435-443, 1989. [26] A. Teel and L. Praly, "Tools for semiglobal stabilization by partial state and output feedback", SIAM Journal on Control and Optimization, vol. 33, pp. 1443-1488, Sept. 1995. [27] J. Tsinias, "The concept of 'exponential ISS' for stochastic systems and applications to feedback stabilization", preprint, 1997. [28] J. Tsinias, "Stochastic input-to-state stability and applications to global feedback stabilization", submitted to International Journal on Control for special issue on Breakthrough in the control of nonlinear systems, 1998.
5. Output-Feedback Control of Stochastic Nonlinear Systems Appendix
287
A
In this and the following appendix, we use Young's inequality [10, Theorem 156]:
xy <_ plX[ p +
[yl q,
(5.67)
where e > 0, the constants p > 1 and q > 1 satisfy (p - 1)(q - 1) : 1, and (x, y) C lR 2. Applying these inequalities leads to y3z2
y3~2
<
3c~ 4
--
~01 Y §
_< ~3e ~9 Y 4 §
n
(5.68)
AI~z 4
451
x _ ~ < ~- , y3 ~ 4 + 1 i~l~
(5.69)
n--1
i=2
Z3Ziq_1
< --
-3~ 2
4 "=
(5.70)
~ Zi4 § 1 - - 1 4" ~ i=3 6 4 1 Z,
~-~ 300~i-1~ 3 /~2 ~ ~00~i--1~ ai 4 1K-~ 1 -4 - ~ z~ o-----~x~ < ~ ~ \ o~ ] z~ + ~ ~=~ ~, 3 / ~2 ~ \ /O0~i--l~ 4 /..~..2=xl~L 1 4(5.71) oy ] ~ z~+~ 9=
.= ~i
z~\ ay ] ~I(Y)T~I(Y)
2 i--2
1 ~OC~i__l~ 4 Zi4 3 ~"~ ~ 2[ "~ <___34 ~~--2 ~i \ Oy ] § ~'~ik~I(y)T~I(y)]2 --
(5.72)
i--~2
2bTr { ~ ( y ) ( 2 P ~ : T P § 5:wPYcP)~(y) w }
< 2bn I~(Y)( 2 P ~ T P § ~Wp~p)~(y)W [o~ < 2bnx/~ I~(Y) (2 P ~ T P § xTpycP) ~(Y)T I < 6bnx/~y2[r
~y4[~b(y)14 §
(cf. (2.10)) 3bnx/~e2lp[4[xl 4 ,
where the e's, 5's, ~'s and ~'s are positive constants to be chosen.
(5.73)
288
5. Output-Feedback Control of Stochastic Nonlinear Systems
Appendix
B
Similar to Appendix A. in the following inequalities, e's and ~'s are constants to be chosen.
~1 (y)T]EET~I(y) i=2
< 4 A''~c? k,--~--Y ] z,4
(q01(y)T~l(y))27t_ 3
~sc,[
i-~2 ~z
2 ~
]ET I2
(5.74)
i=2
b ~ { ~ ( ~ ) ( 2 P ~ ~P + ~ P ~ e ) ~(~)~ } < 3b~v~l~(~)~(~)tIPl~l~l~lzl ~
= 3bnv~]p]2 [~(0)T ~(0)i]~[2 IEI2 + 6bnv~lp]2ly~(0)Tr +3bnv~lpI2y21tb(y)T ~(y)il~l~i]EI2 -<
3bnv~[P[ 2 2e22 I~(~176
+
2~2 2 i~(0)T~(0)ll~p 3bnv~[P[ 2
+3~v~lPl~L~(0)T~(~)l~y~l~l ~ + n 2 e4 2 y4[r162 + 3bnv~lPI
1~12IEI2
~
L~l~ +
3bnv~lPl 2
[E[4
2
< 3bnvqlPI 2 I~(0)T~(0)I+ ~1 + I:~l4 3bnv~'P'2e2 (~ + ~) + 2 [~(0)T~(0) [[]E[4 + 3bnv/~[P[2 E2 +3bnv~]p,2 (2,P(o)Tr
+
[El8
2 ~]r162
(5.75)
3y2~I(y)T]E~T~I(Y) ~ Y3 4 (~l(y)Tvarphil(y)) 2 ~- 3l~14 (5.76)
1020i-lza~l(y)T]EET~l(y) ~ Z1 ( ~02c3~ l ( y )1 20y 2
~1(y~21 +~,E,4
W
(5.77)
5. Output-Feedback Control of Stochastic Nonlinear Systems
289
Appendix C Similar to Appendix B, in the following inequalities, e2 is a constant to be chosen.
2 3~
_
~1 (y)T y],~-']T~pl(y) (0o~i_1"~ 4
3 ~
4
--< 4 i=2 \ Oy ] zi [I ~ T [[L A-~ EY4i=2(~bl(y)T~bl(y))2(5"78)
bWr{~(y) (2P~T P + ~TP~P) ~(y)~} 2 II ~ T
<- 3bnv/-~y21r
Iloo
2
< 3bnv/'nL'lP[2y4j~'(Y)[ 4 II EET I[2 -~ 3bnv'~lPJ ~ ]y:]4 2 2e 2
(5.79)
3v2~, (v) TEET~ol (v) 3 -<~ (~I(y)T~I(y)) 2 + ~y4 jj ~T IJ~
3 4 II y]~']T [[2oo = 3Y 4 (•I(y)T•I(Y)) 2 ~- ~Y 1 02ai_1 3 <- l z 6 (0~a,_1.~2
T
(5.80)
W 1 4
\ 0Y2 ] [[ ~-"~T IlL +~Y (~)I(y)T~I(Y)) 2
(5.81)
Output Feedback Control of F o o d - C h a i n Systems Romeo Ortega 1, Alessandro Astolfi 2, Georges Bastin a and Hugo Rodrigues-Cortes I 1Laboratoire des Signaux et Syst~mes, Ecole Sup~rieure d'Electricit~, Paris, France. 2Centre for Process Systems Engineering, Imperial College of Science, London, UK. 3Centre for Systems Engineering and Applied Mechanics, Universite Catholique de Louvain, Louvain-La-Neuve, Belgium.
1
Introduction
In this chapter, we consider the problem of output feedback control of a class of non-linear mass balance models that describe the behavior of certain food-chain systems. These models are of interest, amoung other fields, in environmental engineering. The control approach we use to solve the stabilization problem builds upon some recent developments on passivity-based stabilization of p o r t controlled Hamiltonian systems reported in [7], [5]. Since the design procedure is applicable to a broad class of mass-balance systems of similar structure (such as comportamental systems and stirred tank reactors, see [1] and the references therein), we present it in a rather general form. In this technique the original Hamiltonian structure of the system is preserved in closed-loop, and only the energy function and the dissipation are modified via the control. Preservation of the Hamiltonian structure allows stabilization to be understood in terms of energy. These feature makes the method very appealing in applications, since the action of the control has a clear physical interpretation that simplifies its comissioning. This task is particularly difficult in mass-balance systems where the control (and the system state) should be positive. One further advantage of the method, central for the developments in this paper, is that the restriction of disposing only of output-feedback (as opposed to full-state feedback) can be naturally incorporated into the controller design. In particular, we show here that to obtain an o u t p u t feedback control strategy, some of the natural damping of the mass-balance equations should be removed, leaving only the d a m p i n g of the measurable coordinate, which is necessary to ensure asymptotic stability. To better
292
6. Output Feedback Control of Food-Chain Systems
explain this modification we present first a state-feedback solution for the simplest second order model. In this case we leave untouched the natural damping of the system and apply verbatim the m e t h o d proposed in [7]. A careful observation of the energy-shaping plus damping injection conditions of [7] reveals that with a, rather unusual, injection of positive damping we can easily obtain an output-feedback solution. Furthermore, the new solution is a simple linear controller, while the state-feedback controller is nonlinear, and rather involved. It is interesting to note that, the injection of positive damping allows us to obtain a stabilizing controller for the n th order model, while the solution without removal of damping cannot be extended beyond the second order case. Some simulation results are presented to illustrate the properties of the controller, and we conclude the chapter with some open questions and final remarks.
2
Controller Design Procedure
In this section we review the basic material of [7] presented in a form suitable for the problem considered in this chapter. Even though we deal with mass-balances instead of energy-balances, to keep up with the standard notation we will use throughout the word "energy". We consider, so-called port-controlled Hamiltonian models of the form [6], [11], [101
:
= [J(x)
+g(x)u,
(6.1)
where x E ~ _ C ~'~, u C ~ C ~m, are the mass variables, and the control, respectively. The set ~ _ is the n-dimensional positive orthant. The smooth function H(x) : ~n --* ~, which typically represents the total stored energy, will denote for our mass-balance systems the total mass, and it will be non-negative. The matrices
J(x)=-JT(x),
R(x)=RT(x)>O,
Vxe~_,
capture the internal interconnections and the natural damping of the system, respectively, while g(x) defines the interconnection of the system with its environment. We assume measurable the q-dimensionai o u t p u t vector function y = h(x). This output should not be confused with the natural outputs associated to the port-controlled Hamiltonian system E defined as
gT (x ~OH(x~ The control objective is to stabilize, via output-feedback, an equilibrium c . ~ preserving in closed-loop the Hamiltonian structure. T h e latter property allows us to provide an energy interpretation of the control action.
6. Output Feedback Control of Food-Chain Systems
293
We will consider only static controllers, but as shown in [7] the procedure can be easily modified to incorporate controller dynamics. Following the principles of passivity-based control [S], [10], we will achieve the stabilization objective by the standard energy-shaping plus damping injection stages. T h a t is: 1. Assigning to the closed-loop an energy function Hd(X), which should have a strict local minimum at ~. (That is, there exists an open neighbourhood B of 9 such that Hal(x) > Hd(2) for all x E B.) We will define
Hd(x)A=H(x) + H~(x) where
Ha(x)
(6.2)
is a function to be defined.
2. Injecting some additional damping
Ra(x)
Rd(X)A=R(x) + Ra(x)
to get
>_ 0, V x E ~ _
T h a t is, we look for an output-feedback control
OH
[J(x) - R ( x ) ] - ~ x (x) +
g(x)u(h(x))
u(h(x))
= [J(x) -
(6.3) such that
OHd Rd(X)]~ (x)
holds V x E ~ _ , with Hd(x), Rd(X) defined by (6.2) and (6.3), respectively. In this way, the closed-loop dynamics will be defined as = [J(x) -
Rd(x)l-~(x),
(6.4)
and along the trajectories of (6.4) we will have
dH
[0gd(x)l T
OHd( )
(6.5)
Thus, 2 will be a stable equilibrium. For ease of presentation we will assume throughout the following: Assumption
A
[J(x) -
Rd(x)] is
invertible for every x E ~ .
It is important to remark that this does not imply that the closed-loop system is fully damped. T h a t is, we do not require Rd(x) > 0, Vx E ~ _ . Actually, it is shown in [7] that Assumption A is not needed for the proof of the proposition below. We have the following basic result.
294
6. Output Feedback Control of Food-Chain Systems
P r o p o s i t i o n 6.1 [7] Given J(x), R(x),H(x),g(x). Assume we can find and output-feedback control u(h(x)) and a matrix Ra(x) such that R(x) § Ra(x) >_O, Assumption A hold, and the vector function g ( x ) , defined as,
K(x)~[J(x) - (R(x) + R~(x))]-l[R~(X)~x(X ) + g(x)u(h(x))]
(6.6)
satisfies 9 (Integrability) K(x) is the gradient of a scalar .function. That is, 0x (x) =
(6.~)
9 (Equilibrium assignment) K(x), at 2, vemfies
OH Ox (2)
K(~:) -
(6.8)
9 (Lyapunov stability) The Jacobian of K(x), at ~, satisfies the bound
OK 02 H Ox (~) > - -5~-x 2(2)
(6.9)
Then, 9 will be a locally stable equilibrium of the closed-loop. It will be asymptotically stable if, furthermore, the largest invariant set under the closed-loop dynamics contained in x e ~
N B { --~--x( )
-~d( )-O-'~--x i, ) = 0
(6.10)
equals {~}, where Hd(X) is given by (6.2). The latter condition will be automatically satisfied if we can achieve full damping, that is, if R~(x) > 0/07- eve~ x e ~?~. Proof First, notice that, using (6.2), (6.3) and Assumption A, the identity (6.4) may be equivalently written as
Ox ( z ) = [ J ( x ) - Rd(x)]-I[R~(x)
(x) + g(x)u(h(z))]
(6.11)
For every given u(h(x)), R~(x), this is a linear PDE. A necessary and sufficient condition for the solvability of this PDE (on every contractible neighbourhood of Nr~ + ) is that the gradient of the right hand side of (6.11) is a symmetric matrix. From (6.3), (6.6) and (6.11) we see that
og~
K(x) = --~-x (x)
(6.12)
6. Output Feedback Control of Food-Chain Systems
295
Henceforth, the matrix mentioned above will be symmetric iff the integrability condition (6.7) of the proposition is satisfied. The stability proof is concluded invoking standard Lyapunov stability arguments [4]. Namely, from i6.5), we conclude that, under the standing assumptions, Hd(x) qualifies as a Lyapunov function. Asymptotic stability follows from a direct application of La Salle's invariance principle and i6.10). [:]DD R e m a r k 6.1 Notice that the construction above does not require the ex-
plicit derivation of the Lyapunov function Hdix). This can be obtained, though, as a by-product integrating K i x ) OH---~ix ) R e m a r k 6.2 Port-controlled Hamiltonian models (6.1) encompass a very
large class of physical nonlinear systems, strictly containing the class of Euler-Lagrange models considered, for instance, in [8]. They result from the network modeling of energy-conserving lumped-parameter physical systems with independent storage elements, and have been advocated in a series of recent papers [6], [11] as an alternative to more classical Euler-Lagrange (or standard Hamiltonian) models.
3
State-Feedback Control of a Simple Prey-Predator System
As pointed out in the introduction, to motivate our output-fedback control (which is given in the next section) we present first a state-feedback stabilizer for a simple second order food-chain system. The controller is obtained from a verbatim application of the m e t h o d described above. This is a systematic technique that can be efficiently combined with symbolic computation. See, for instance, the simple Maple code given in Appendix A.
S y s t e m Model We consider the normalized second order prey-predator system isee e.g.
[3]) =
f(x)-zl
=
-f(x)
-
+ u
(6.13)
The state variables xl, x2 represent the amount of mass of the two species (preys and predators) involved in the system. The function f i x) describes the predation mechanism, we consider here the classical Lotka-Volterra mechanism f(x) = XlX2. The terms - x l , - x 2 i n (6.13) represent the natural mortality of the species, while the control action u is a feeding inflow
296
6. Output Feedback Control of Food-Chain Systems
rate of preys. For the output feedback case, we will consider that the variable available for measurement is the last one in the chain, in this case,
X2. The evolution of the system is clearly restricted to the positive orthant with u > 0. T h a t is, x i ( 0 ) > 0 , a n d u ( t ) >_0, Vt > O ~
xi(t) >_ O, Vt >_ O
It is possible to show that any equilibrium of the open-loop system with a lit constant input fi _ 0 is globally asymptotically stable. The control objective is, then, to asymptotically stabilize a given non-zero equilibrium 5: E ~R~_ with a positive control. The achievable equilibria axe ~" ----[:~1,X2]T ~-- [X~,1] T, with x~ > 0 the reference for xl. If we define the total mass
H(x)
=
x 1 -J- x 2
the system (6.13) may be written in the form (6.1) with J(x)=
[
0 --XlX2
xlx2 0
]
[
R(x)=
'
Xl 0
0 X2
]
g(X)..~_g =
[0] 1
The skew-symmetry of J(x) captures the mass-conservative feature of the system without inflows and outflows.
State-Feedback Stabilization Since the system is already fully damped, i.e., R(x) > O, Vx ~ O, x E ,~_, it seems reasonable (as our first try) to set R~(x) = 0. T h a t is, we will not inject additional damping, but rely instead on the natural damping of the system to ensure the attractivity. In this case, the vector function (6.6) reduces to
K ( x ) = [ kl(X)
k2(x) I
__
--U(Z)
I-'~-XiZ2 [ 1 1
From which we immediately conclude that
x2k2 (x) = kl (x)
(6.14)
The integrability condition (6.7) in this two-dimensional case reduces to
Ok1
ox2 (x) =
Ok2 (x),
which, combined with (6.14), yields the linear P D E 0kl
Oxl
0kl clx2
( z ) - x2 ~ - - - ( x )
= 0
(6.15)
6. Output Feedback Control of Food-Chain Systems
297
A family of solutions of this P D E is easily obtained as
k (x)
=
~(x)
=
Xl+lOgx:,
for all differentiable functions O(.). From (6.14) we also obtain k2(x)
=
The equilibrium condition (6.8) imposes k1(2)
1
(6.16)
Hence, 0(.) must be such t h a t 0(C(2)) = - 1 , where C(2) = 21 + l o g 22 ---x~. It is clear then t h a t we cannot take O(4) = 4. We propose the function
9 (r
= cl e•
with cl, c2 constants to be defined. (Although this choice of function might seem a bit contrived, we should note that this is the function t h a t results if we directly apply the m e t h o d of undetermined coefficients to the P D E (6.15). See Appendix A). The equilibrium condition ~(~(2)) --- - 1 fixes the first constant as Cl = - exp -c~x~ We will now verify the Hessian condition (6.9). Some simple calculations yield OX (X) = ClC 2 expC2r
~'21 x-'~'• ~'2[1 x2_ c')1 ~2
-- 0X 2 (X)
,
which evaluated in the equilibrium point gives
T h e determinant of this m a t r i x is 1, hence it is positive definite iff c2 < 0. We will investigate now the asymptotic stability properties. To this end, we see t h a t the a~-limit set (6.10) is defined as {x e N~_ n B I - x,(1 + kl(X)) 2 - x~(1 + k2(x)) 2 = 0 } , which consists only of the points x = 0 and x = ~:. But, it can be easily shown, t h a t x = 0 is an unstable equilibrium of the closed loop dynamics. We have established the following result.
298
6. Output Feedback Control of Food-Chain Systems
P r o p o s i t i o n 6.2 Consider the system (6.13), with f ( x ) -- XlX2, in closedloop with the positive control
u(x) = (1 + XlX2)X ~ exp c(xl-x~)
(6.17)
with x~ > 0 the reference for Xl, and c < O. Then, all trajectories starting in x(O) E ~2+, will converge asymptotically to the desired equilibrium point
1) DDD Let us summarize the calculations carried out above: 1. Fix the added damping Ra(x) - to 0 in this case, since the open-loop system is fully damped -; 2. Define the vector K ( x ) , (6.6), as a function of u(x); 3. Use the integrability conditions (6.7) to eliminate the control and obtain a linear P D E (6.15) to be solved for K ( x ) ; 4. Find a solution of this P D E that satisfies the equilibrium (6.8) and Lyapunov stability conditions (6.9); 5. Derive the control law (6.17) from the definition of K ( x ) . R e m a r k 6.3 As pointed out in Remark 2 as a by-product of our analysis we can get a Lyapunov function, which in our case is
Hd(X)
:
Xl "~ X2 -H(x)
1 k
";-X 2
k~zl-x*~ 1 exp ~ '~ + 7 - (1 + x~) Y H,,(x)
where the third and .fourth right hand constant terms are added to enforce Hd(~2) ---- O. It is worth noting that Hd(x) above is the classical Lyapunov function for the stability analysis of Lotka-Volterra ecologies (see e.g. [3] and [9] among many other references). The design procedure of this paper allows to rediscover this Lyapunov function in a very natural way. R e m a r k 6.4 There is an easier way to derive the structural constraint (6.14) that does not require the inversion of the matrix J ( z ) - Rd(x). To this end, rewrite (6. 6) as
OH [J(x) - (R(x) + Ra(x))]K(x) = [Ra(x)--~xx (X ) + g(x)u(x)] The first equation of (6.18) for this example yields
--Xlkl(X ) "~- X l X 2 k 2 ( x
) ~- 0
(6.18)
6. Output Feedback Control of Food-Chain Systems
299
which, upon division by x l , is precisely (6.1~). The second equation simply de.fines the control law, in terms of K ( x ) , as
(6.19)
u(x) = -x2k2(x) - xlx2kl(x)
It is precisely this observation that will motivate the modification, introduced in the next section, that yields an output-feedback stabilizer.
4
Output-Feedback
Stabilization
There are two i m p o r t a n t drawbacks of the solution proposed in the previous section. First, it requires m e a s u r e m e n t of all the state 9 Second, it can not be extended to treat the general food-chain system model, which is of the form 51
~
XlX 2 -- X 1
5 2
~
X2X 3 -- XlX 2 -- X 2
5 3
~
X3X 4 -- X2X 3 -- X 3
5n
z
--X(n_I)X
=
x~
y
n - - X n -]- U
(6.20)
To prove the second statement, let us write the model in the form (6.1) with H ( x ) = E ~ l x i and 0 --XlX
J(x)
XlX2 2
0
" ""
0
0
X2X 3
" 9"
0
0
0
...
0
=
0
xl 0
0 x2
... ...
0 0 = RT(x)
R(x) =
0
0
...
= --JT(x)
> O, g ( x ) = g =
x~
Then, notice t h a t the distribution spanned by the vector fields defined by the column vectors obtained from the first n - 1 rows of J ( x ) - R ( x ) is not involutive. Consequently, the key P D E [J(x) - R(x)] ~ x a (x) = g u ( x )
300
6. Output Feedback Control of Food-Chain Systems
can not
be solved. In this section we show how, for our second order example (6.13), these limitations can be overcome modifying the damping of the closed-loop. In the next section we extend this result to the general n - t h order model (6.20). Towards this end, let us remove the damping from the first coordinate. T h a t is, define Ra(x) like
Ra(x)= [ -xlO 00] Notice the negative sign. With this choice, the vector function (6.6) becomes now
g(x)=
kl(X)
1
k2(z)
_
-
~
Choosing the control law as the simple o u t p u t - f e e d b a c k u(x2) =
cx2 + 1,
with c some constant to be defined, yields
K(x)
=
~1
(6.21)
which is clearly the gradient of a scalar function. Hence, the integrability condition (6.7) is satisfied. We will now verify if we can find a constant c such that the remaining stability conditions of Proposition 6.1 are also satisfied. The equilibrium condition (6.16) imposes c ----x~. For the Hessian condition (6.9) we first observe from (6.21) and ~ H ( x ) ----0, t h a t
~x~X~=-~x
~=
o
Evaluated in the equilibrium point gives
0ox22 H(~) d= O-~xK(~)[ = 71, o ] 0
1 '
which will be positive definite for any x~ > 0. Finally, a s y m p t o t i c stability is ensured because the w-limit set (6.10) is now defined as
{ which consists only of the point x = ~.
-
1-o
},
6. Output Feedback Control of Food-Chain Systems
301
T h e new L y a p u n o v function is
Hd(x) ~- Xl ~- x2 --x~ ln(xl) -- ln(x2) --(x~ -F 1 -- x~ ln(x~)), H(x)
H~(x)
where the third right h a n d c o n s t a n t t e r m is, again, a d d e d to enforce Hd(2) = O. We have established the following result. P r o p o s i t i o n 6.3 Consider the system (6.13), with f ( x ) = XlX2, in closedloop with the positive output-feedback control u(x2) --- 1 + x~x2
(6.22)
with x~ > 0 the reference for x l . Then, all trajectories starting in x(O) E 7~2+, will converge asymptotically to the desired equilibrium point (x~, 1). DDD R e m a r k 6.5 To increase the speed of convergence it is possible to inject some additional damping on the actuated coordinate x2. To this end, we choose
olxl=[Xl 0] 0
(r - 1)x2
'
with the desired damping a constant 1 < r < 1 + x~. Going through the calculations we get the control law u(x2) = r + (x~ - r + 1)x2
(6.23)
It can be shown that this control law is also globally asymptotically stabilizing. Notice that with r = 1 we recover the controller (6.22).
5
Main Result
In this section we present the generalization of the previous result to t h e n - t h order case. T h e o r e m 6.1 Consider the general food chain system (6.20) in closed-loop with the output:feedback positive control u ( z n ) = m x n + m + Xx,
?n--
n-1 2
.for n odd, and n
u(x,~) ---- ( m + x~)xn +-~,
n
m---- -~ - I
302
6. Output Feedback Control of F o o d - C h a i n Systems
.for n even, with x~ > 0 the reference f o r Xl. Then, all trajectories starting in x(O) E ~ _ will converge asymptotically to the desired equilibrium point $
=
-
[X 1 , / 2 ,
"', Xn]"
DDD Proof M o t i v a t e d b y t h e d e v e l o p m e n t s of t h e s e c o n d o r d e r case a b o v e we p r o p o s e to r e m o v e t h e d a m p i n g from all n o n - a c t u a t e d c o o r d i n a t e s . T h a t is, we choose
Ra(x)=
-xl 0
0 -x2
0
0
"'"
O
9 9 9
0
9 " 9
O
W e will n o w verify t h e t h r e e c o n d i t i o n s of P r o p o s i t i o n 6.1.
9
Integrability
T h e key e q u a t i o n (6.11) b e c o m e s t h e n
0 --XlX2
0 0
xlx2
0
0
X2X3
0 0
0 0
9 .. 9 ..
0 0
"'"
0 0
0
. . . .
Xn_lX
Xn--lXn n
k,(x) k2(x)
n-l(X)
Xn
--X 1 --X 2
--Xn_ 1
w h i c h c a n b e c o m p a c t l y w r i t t e n as f l ' ( x ) K ( x ) = ~(x). Now, , ~ ( x ) a d m i t s a f a c t o r i z a t i o n of t h e f o r m 0 -1
1 0
0 1
.-. .-.
0 0
0 0
J(x) = diag{x,}
diag{xi} 0 0
0 0
0 0
... ....
0 1
1 ___1
6. Output Feedback Control of Food-Chain Systems
303
This leads to
0
1
0
"""
-1
0
1
.-.
0 0
Xl~I(X) X2k2(X)
-1 -1
Xn--l~n--l(X) X~kn(X)
-1
0 0
:
0 0
0 0
0 0
0 -1
1 _1
Xn
XT~
From which we obtain a system of equations of the form x2k2(x)
=
-1
-Xlkl(X)+X3k3(x)
=
-1
-x2k2(x)+x4k4(x)
=
-1
)
=
-1
k,~(x)
-
u(x)
-Xn_2kn_2(x
) ~- x n k n ( x
-xn-lkn-l(X)-
(6.24)
Xn
Notice that from the first equation of (6.24) we have 1
k2(~) -
X2
Subsequently, the functions k i ( x ) , for i even, have a unique solution, which is furthermore of the form k i ( x ) = ki(xi). Now, choosing C
kl(z) -
Xl
we can also obtain a unique solution k i ( x i ) , K ( x ) is finally given by P
K(x) = [
1
1 x2
Xl
"""
m xn-- 1
for i odd. The vector function
_rat
c
Xn
IT J
,
IYt - -
I1 - - 1 2 '
for n odd, and
K ( )x
-=
c
1
Xl
x2
_ m-t-c 9. .
x~_ 1
,~ -~-
] T
x,L
'
. ~ = ~ -n1 ,
for n even. It is clear that, in both cases, the integrability conditions are satisfied. Also, from the last equation of (6.24) we compute the control law U(X)
---- - - X n [ X n _ l
kn_l
( X ) "~- k n ( X ) ]
304
9
6. O u t p u t Feedback Control of F o o d - C h a i n Systems
Equilibrium Assignment
T h e e q u i l i b r i u m c o n d i t i o n is
K(2)-
OH ax
=
-
[1
=
..
Q
'
x~_l,
x~
w h i c h is satisfied w i t h c = x~.
9
L y a p u n o v Stability
W e will n o w verify t h e Hessian c o n d i t i o n . S o m e s i m p l e c a l c u l a t i o n s y i e l d
0
OK
ai(x)
=
0
0
0
...
0
0
~
0
0
.-.
0
0
.--
0
0
0
0
0
0
0
~
0 0
0
(6.25)
..-
0
0
-..
0
.--
0
0
T h i s m a t r i x will b e p o s i t i v e definite for a n y x E ~ _ a n d a n y x~ > 0. F i n a l l y , t h e w l i m i t set for n o d d is defined as {x C , ~ N 13 [x,,-(m+x;) ~- 0} a n d ~n
{x C !}~_ N 13 [ x , ,x,, - ( ~ ) __- 0} for n even. I n b o t h cases t h e w l i m i t set consists o n l y of t h e p o i n t xn = 2n. This, t o g e t h e r w i t h u n i q u e n e s s of t h e e q u i l i b r i u m , c o m p l e t e s t h e p r o o f of a s y m p t o t i c stability. 6 . 6 The proposed control design can be easily applied to the more general class of Lotka-Volterra ecologies defined as .follows:
Remark
xi
=
x~(-k~+Zaijxj)
3:n
~- Xn(-kn q- E
i=l,...,n-1
anjXj) -k u
with k~ > 0 the natural mortality rates, aij ~- - a i j , V i ~ j, the predation coefficients and u the .feeding rate of species xn, with u(t) >_0 Vt. The procedure yields the classical Lyapunov function .for Lotka-Volterra ecologies
~
xi - g% l n ( x i ) ,
i=l
and we obtain the .following output .feedback control law u(xn)
=
+
-
with ~t the constant control that assigns the desired equilibrium, and 0 < A < ~X,e an arbitrary design parameter.
,
6. Output Feedback Control of Food-Chain Systems
6
305
Simulations
Numerical simulations of the second order model (6.13) were carried out in order to show the performance of the proposed controllers. T h e p a r a m e t e r s used in the simulations were, c -- - 0 . 2 for the state feedback controller (6.17), and r -- 1,2.1, for the o u t p u t feedback controller (6.23). T h e desired equilibrium of the system is 9 --- [1.2, 1] T. The initial conditions in all the simulations are xl (0) = 2 and x2(0) ----2.
2.,~ 2 1.8 1.6 14
1.2
0.8 0.6 0.5
1=,5
2
21.5
3
xl
FIGURE 1. Open-loop trajectory
For the sake of comparison, in Fig. 1 we present the behaviour of the open loop trajectory in the state space with a constant input fi -- 2.2, while Fig. 2 depicts the behaviour of the state and ouput feedback controllers. Finally, the control signals are shown in Fig. 3. As seem from the Figs. 2, 3 the addition of damping effectively increases the convergence rate with the additional advantage of reducing the control effort.
306
6. Output Feedback Control of Food-Chain Systems
~sta
tefeedback
1.z
O. output feedback ~
0.~ 0.5
1
1.5
xl
2
2.5
FIGURE 2. State space of the closed-loop trajectory
statefeedback
damping output feedback eedback +
/
1~0
time
[sec]ll5
;~0
25
FIGURE 3. Control signals
7
Concluding Remarks
We have illustrated in this chapter how the application of the p a s s i v i t y based controller design technique of [7] allows us to solve o u t p u t - f e e d b a c k stabilization problems for a class of m a s s - b a l a n c e systems. T h e procedure is illustrated in detail with an n - t h order food-chain model. It can, mutataemutandi, be applied also to other m a s s - b a l a n c e models studied in [1], [3],
6. Output Feedback Control of Food-Chain Systems
307
[9]. For instance, it can be shown that for the compartmental model of Section 4 in [1] the technique yields also asymptotically stabilizing controllers. However, we require in this case the knowledge of the full state. We have not stressed here the advantages of taking a physically-based approach for controller design, see e.g. [8], [7], [10] for a detailed discussion. We should underscore, however, that the preservation of a physical interpretation to the control action (in terms of damping injection) was instrumental for our result. Finally, we bring to the readers attention the simplicity of the resulting control law. This important feature is a characteristic of passivity-based controllers. As shown in this chapter the approach of [7] provides a flexible methodology to design controllers for physical systems. As discussed in t h a t paper, we can also aim at modifying the internal interconnection structure J ( x ) . In this way, we recover some of the results obtained with the technique of controlled Lagrangians, reported in [2]. Current research is under way to explore this interesting possibility for mass-balance systems.
Acknowledgements The first author would like to express his deep gratitude to Bernhard Maschke and Arian van der Schaft, with whom the basic principles underlying the developments reported here were obtained.
8
REFERENCES
[1]
G. Bastin and L. Praly. Feedback stabilization with positive control of a class of dissipative mass balance systems, accepted IFA C World Congress, Beijing, 1999.
[2]
M. Bloch, E. Leonhard and J. Marsden. Controlled Lagrangians and the Stabilization of Mechanical Systems I: The First Matching Theorem. IEEE Conf. Decision and Control, Tampa, FL, 1998.
[3] J. Hofbaner and K. Sigmund. Evolutionary Games and Population Dynamics. Cambridge University Press, 1998. [4] H. Khalil. Nonlinear systems. Prentice-Hall, 2nd edition, 1996. ISBN 0-13-22824-8.
[5]
B. Maschke, R. Ortega and A. van der Schaft. Energy-based Lyapunov functions for forced Hamiltonian systems with dissipation. IEEE Conf. Dec. and Control, Tampa, FL, 1998.
6. Output Feedback Control of Food-Chain Systems
308
[6]
B.M. Maschke and A.J. van der Schaft. Port controlled Hamiltonian systems: modeling origins and system theoretic properties. Proc. 2nd IFAC Syrup. on Nonlinear Control Systems design, NOLCOS'92, pp.282-288, Bordeaux, 1992.
[7] R. Ortega, B. Maschke, A. van der Schaft and G. Escobar. Passivity-based control of port-controlled Hamiltonian systems, LSS-SUPELEC, Prance, Int. Rep., 1998. [8] R. Ortega and A. Loria, P. J. Nicklasson and H. Sira-Ramirez. Passivity-Based Control o.f Euler-Lagrange Systems. SpringerVerlag, Berlin, Communications and Control Engineering, 1998. [9] F. Sendo and J. Ziegler. The Golden Age of theoretical Ecology. Lecture notes in B i o m a t h e m a t i c s , Springer Verlag, 1978. [10] A. van der Schaft. L2-Gain and Passivity Techniques in Nonlinear Control. Lect. Notes in Contr. and Inf. Sc., Vol. 218, SpringerVerlag, Berlin, 1996. [11] A. van der Schaft and B. Maschke. The Hamiltonian formulation of energy-conserving physical systems with external ports. Archiv fiir Elektronik und Ubertragungstechnik, 49, pp. 362-371, 1995.
Appendix A: Maple Code In this appendix we present a Maple code that guides us in the solution of the example of Section 2. The calculations proceed as follows 1. Definition of the system (with JmRZX=J(x) - R(x) and gu~gu(x)) : > with(linalg) : >
JmR := matrix(2,2,[-xl,xl*x2,-x1*x2,-x2S);
JmR:=
> gu
[ -xl [ [ - x l x2
x l x2] ] -x2 ]
:= v e c t o r ( [ 0 , u ( x l , x 2 ) ] ) ;
gu := [0, u ( x l ,
x2)]
2. Computation of K(x) and its Jacobian > K := multiply(inverse(JmR) [
,gu) ; u(xl,
x2)
K := [, [ 1 + x l x2
u(xl,
x2)
I
] x2 (1 + x l x2)]
6. O u t p u t Feedback Control of F o o d - C h a i n S y s t e m s > Jac
:= j a c o b i a n ( K , [ x l , x 2 ] ) ;
Jac
:=
[ [ [u(xl, .
.
.
.
d --- u(xl, dxl
x2) x2 .
.
.
.
.
.
.
[
.
.
.
.
.
.
2
.
.
.
.
.
[ [ [ u(xl, .
.
.
.
u(xl, .
.
.
.
~
.
.
.
.
.
d --- u(xl, dxl .
.
.
.
.
.
.
.
.
x2
.
x2) x l .
.
.
.
.
.
.
.
.
.
.
.
2
.
.
.
.
.
.
.
.
.
1 + x l x2
.
.
.
.
.
.
.
.
.
] ]
I
.
.
.
.
d --- u(xl, dx2
x2) x l .
.
.
.
.
.
.
.
2
.
.
.
2 (i + xl x2)
]
x2)
u(xl, +
.
(i + xl x2)
x2)
. . . . . . . . . . . . . . .
x2
.
] x2)] ]
(i + xl x2)
[ 2 [(i + xl x2)
u(xl,
.
1 + x l x2
x2) .
d --- u(xl, dx2
x2)
[(i + xl x2)
.
309
x2
.
.
.
.
.
.
.
.
.
] x2) ] ] .
.
.
.
.
]
x2 (i + xi x 2 ) ]
(i + xl x2)
]
3. D e f i n i t i o n of t h e t e r m
) _ 0kl
> eql2
:= Jac[2,1]-Jac[1,2];
u(xl, eql2
d --dxl
x2)
u(xl,
x2) u(xl,
x2) xl
:= 2 (I + xl x2) d --- u(xl, dx2
x2
(1 + x l
x2)
2 (1 + xl x2)
x2)
+ 1 + xl x 2 4. D e t e r m i n a t i o n of t h e c o n t r o l u(x) w h i c h s o l v e s e q l 2 = 0, i.e., w h i c h ensures the integrability condition.
310
6. Output Feedback Control of Food-Chain Systems > u_star:=rhs(pdesolve(eql2=O,u(xl,x2))); x2 u_star
) (i + xl x2)
:= _ F I ( exp(-xl)
Notice that in the line above _ F I ( - ) is any differentiable function. 5. Evaluation of the Hessian for the given control expression. > subs(u(xl,x2)=u_star,evalm(Jac)); [ x2 d [_FI( ........ ) x2 --- Z1 [ exp(-xl) dxl [. . . . . . . . . . . . . . . . . . . . . . . . . . [
i + xl x2
x2 d _FI( . . . . . . . . ) xl --- ZI exp(-xi) dx2 , ..........................
i + xl x2
[ x2 [_FI( ........ ) [ exp(-xl)
i + xl x2
] ] ] ]
i + xl x2]
d --- Z1 dxl
. . . . . . . . . . . . . . . . . . . . . . . . . . . .
[ [
i + xl x2
x2
x2 _FI( ........ ) exp(-xl) . . . . . . . . . . . . .
2
(I + xl x2)
x2 _FI( ........ ) xl exp(-xl) +
d --- ZI dx2
. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
x2 (I + xl x2)
x2
x2
] ] ] ]
(I + xl x2)] ]
x2 ~.1 := _FI( ........ ) (I + xl x2) exp(-xl)
6. The design can be concluded selecting a function F I ( - ) that satisties the equilibrium assignment and Lyapunov stability conditions of Proposition 6.1. In Section 2 we have chosen _ F I ( ~ ) -- ~k.
Output Feedback Tracking Control for Ships K. Y. Pettersen I and H. Nijmeijer 2 1Department of Engineering Cybernetics, Norwegian University of Science and Technology, Trondheim, Norway 2 Faculty of Mathematical Sciences, University of Twente, Enschede, The Netherlands 2 Faculty of Mechanical Engineering, Eindhoven University of Technology, Eindhoven, The Netherlands
1
Introduction
For most ships, measurements of the ship velocities are not available. For feedback control of the ship, estimates of the velocities must therefore be computed from the position and heading measurements. The ship position is typically measured using the Navstar differential global positioning systern (DGPS), while the heading is usually measured by a gyro compass. As the position measurements are quite corrupted by noise, numerical position differentiation is not desirable. Instead, an observer should be used to obtain velocity estimates from the position measurements. In conventional ship control systems, the estimation problem is solved using a linear Kalman filter. A linearized ship model is then used. T h e kinematic equations of motion are typically linearized about a set of 36 constant yaw angles (separated by 10 deg in order to cover the whole operating area of 360 deg). For each of these linearized models Kalman filters and feedback control gains have to be computed. The control and filter gains are then modified on-line using gain-scheduling techniques. The drawback of this approach is the considerable amount of tuning work, and the ad hoc nature of the approach which does not guarantee the desired stability and convergence properties. In [8] a nonlinear observer is developed and is proven to be globally exponentially stable. Hence, only one set of observer gains is needed to cover the whole state space. The observer is developed independently of the ship control scheme. In [1] a feedback control law for dynamic positioning of ships is developed based on the estimates from the observer in [8], giving a globally exponentially stable closed-loop system. In both these works, the dynamic positioning (DP) problem for ships is considered, and the observer and the control law are thus developed based on a ship model not
312
7. Output Feedback Tracking Control for Ships
including Coriolis and centripetal forces and moments. For the DP-problem it is a valid assumption to disregard the Coriolis and centripetal forces and moments acting on the ship, while for tracking control where the velocities of the ship cannot be assumed to be close to zero, these forces and moments must be considered in both the observer and the controller design. In this work we consider the output feedback tracking control problem for ships. The Coriolis and centripetal forces and moments must thus be included in the ship model, leading to quadratic velocity terms in the dynamics. Moreover, instead of designing an open-loop observer, we seek to combine the observer and controller design such that the controller exploits the underlying observer structure and vice versa, in order to find a computationally simple control law and observer. The observer-controller scheme is designed using a passivity-based approach. The idea of passivity-based control methods is to reshape the system's natural energy via state feedback, in order to achieve the control objective. In this way the passivity property of the system is preserved in the closed loop, and therefore the approach has been named the passivity-based approach. This approach has gained much attention, and based on this approach [17] proposed a solution to the robot position control problem, and [13] solved the problem of robot motion control. Also, for adaptive robot control the passivity-based approach has been studied extensively, see for instance [15],[9] and [12]. The output feedback control problem for robots has been considered by several authors, for instance [4] and [16] where observers based on the sliding mode concept were proposed, and in [10] where a linear high-gain strategy was proposed. The observers proposed were developed independently of the robot control scheme. In [5] a modified version of the computed-torque controller was proposed, and local exponential stability of the closed-loop system was proven. In [11] some known state-feedback controllers were considered, using velocity feedback from a nonlinear observer, and local asymptotic stability of the closed-loop systems were proven. In [2],[3] the output feedback control problem for robot manipulators was solved using the passivity-based approach. A key point in [3] was the fine tuning of the controller and observer structure to each other, providing solutions of the output feedback control problem that were conceptually simple and easily implementable in industrial applications. The output feedback tracking control problem for ships has been addressed in [14] and [18]. Both these works consider a 1 degree of freedom (DOF) nonlinear model and address the yaw angle tracking control problem (autopilot design). In [14] a passive control law without velocity feedback is proposed and proved to asymptotically stabilize the desired yaw angle. In [18], based on the ideas presented in [3] an observer-controller structure is proposed and proved to semi-globally exponentially stabilize the desired yaw angle. In this chapter we use the same ideas as presented in [3] to design a 3 DOF output feedback tracking controller for ships. However, in
7. Output Feedback Tracking Control for Ships
313
this chapter we use the same ideas as presented in [3] for the controllerobserver design for ships. However, the control law proposed in [3] uses feedback from the position measurements together with the estimated velocities. For the ship, the gyro compass measurement noise will typically be less than 0.1 deg. However, the position measurements are quite corrupted by noise, as the DGPS ineasurement noise will be in the range of 1-3 m. Therefore, filtering of the position measurements is necessary, and we seek to find a tracking control law that uses feedback from the filtered position variables. In Section 2 the ship model is presented. In Section 3 we develop a controller-observer combination for output feedback tracking control of ships, and prove that the closed-loop system is semi-globally exponentially stable. If the Coriolis and centripetal forces and moments are negligible, as for the special case where the desired trajectory is a constant position and orientation, the system is globally exponentially stable. In Section 4 simulation results for this o u t p u t feedback tracking control scheme are presented. Then the problem of bias estimation is addressed in Section 5, and simulations for the output feedback tracking control scheme including bias estimation are presented in Section 6. Finally, conclusions are given in Section 7.
2
The Ship Model
The ship model is based on [7, 6]. We use the earth-fixed vector representation M(~b)/~ + C(~b, @)//+ D ( r
~-~
(7.1)
where r~ = [x, y, r The variables x and y are the position variables, while ~/~ is the yaw angle. The vector ~-~ E ~3 is the control vector. The model has the following properties: The matrix of inertia including hydrodynamic added inertia effects, M ( r is symmetric and positive definite. The symmetry property is based on the ship having starboard and port symmetries together with the assumption of low speed, as opposed to high-speed applications, as we assume that the ships considered are conventional ships, not high-speed crafts. The matrix is bounded with respect to r 0 < Mm < [IM(~b)]l < MM
V~ C S 1
(7.2)
The Coriolis and centripetal matrix, also including added inertia effects, C(~b,//), satisfies the properties
6(r
C(r
=
C(~,,w)v
V~l, E S 1, Vv, w e '~3(7.3)
o~v -t- •w)
:
ozC(~), v) -t- Z C ( ~ , w)
V~) c S 1 , Vv, w E ~3 (7.4)
314
7. Output Feedback Tracking Control for Ships
~CM > 0
< CMitVll
lic(r
V~ 9 S 1, Vv 9 #
Fhrthermore, the m a t r i x / I f - 2C, where {/1)/}0 = metric
8T(f/[(~)) -- 2C(r
?)))8 = 0
0r
(7.5)
~o, is skew sym-
Vr 9 S 1, V?), 8 9 ~}~3
(7.6)
The damping is assumed to be linear in/1, which is a good assumption for low-speed applications and for cruising at a constant speed. The hydrodynamic damping matrix, D ( r is in general non-symmetric. T h e hydrodynamic damping is due to wave drift damping and laminar skin friction, and it is dissipative
sTD(r
>0
V r C S 1, V8 9 ~3\{0}
(7.7)
Moreover, the damping matrix is bounded with respect to
0 < Dm < IID(~')ll <_ DM
3
Vr E S 1
(7.8)
Design of an Output Feedback Tracking Control Law
In this section we develop a combined observer-controller structure for tracking control of a ship when only the position and heading measurements are available. The observer-controller design is based on passivity in the sense t h a t both the controller and observer are designed such t h a t the closed-loop system matches a predefined desired energy function. This approach was proposed for the control of robot manipulators in [3]. T h e controller proposed by [3] uses the position measurements and the estim a t e d velocities for feedback. For the ship, using a gyro compass for the heading measurement, the measurement of the yaw angle r is quite accurate. The measurement noise will typically be less t h a n 0.1 deg. However, the measurements of the position variables x and y will be corrupted by measurement noise, typically in the range of 1 - 3 m. Therefore, filtering of these measurements is necessary and we want to develop a controller t h a t uses filtered position variables. To this end, we consider the following Lyapunov function candidate
V(~l,< ~,~)=
-~TM(~)~+ !~2K~I~+~[M(~)~ + }~K~ (7.9)
7. Output Feedback Tracking Control for Ships
315
where Kpl and Kp2 are symmetric, positive definite matrices. The variable = 7]-/1, where/1 is the estimate of the variable ~1. Define the error variable = / / - ~d, where ~?d(t) is the reference. We further define //1
~--- //d -- A l e
(7.10)
i]2
=
d ~ / / - - A2/1
(7.11)
sl
=
d^ //1 d^ ~-~= ~-~e+Al~
(7.12)
s2
=
//_i12 = ~-~ d _ + A2//
(7.13)
We propose the following observer-controller scheme d^
~V
=
z + Kd/1
M(~)~
=
-C(r
(7.14) - K l S l - K p ~ + M(r
+M(r
Te =
(7.15)
C ( ~ , ~2)(77 2 - Sl) -~- D(~))~ 2 - K l S l - K p l e ~- M ( r - ( K p 2 - M(~)A~)/1
1 (7.16)
We then find that the time-derivative of V along the system trajectories is 1/
=
--sT(K1 -- 6 ( r
s2))sl + s T M ( ~ ) K d s 2 -- s T ( M ( r
-- A2)
A-C(•,//- 82))82 - sT D(~)s2 -- eTKpAle - ~TKp2A2/1 (7.17) The observer and controller matrices Kp1,Kp2, A1,A2,Kd and KI are constant, symmetric and positive definite. In the following we choose
A1
=
A2 =
diag{All,A12,A13}
(7.18)
diag{A21,A22,A23}
(7.19)
Kpl
=
al
(7.20)
Kp2
=
A2
(7.21)
Kd
=
A2 +Idi3•
(7.22)
K1
=
/113•
(7.23)
where ld
>
2CMVM A2,M + ~
(7.24)
11
>
1 M 2 r* 2--M---~k,~2,M + ld)
(7.25)
316
7. Output Feedback Tracking Control for Ships
where A2,M is the maximum eigenvalue of the matrix A2, and V M is the maximum of the reference velocity//d. The time-derivative of the Lyapunov function candidate is then
--8T1(1113• -- 6 ( ~2, 82))81
-]- sT(M(O)A2 +
ldM ( r ) )s2
--&TATAI& -- z)TATA2~
(7.26)
Using the matrix properties given in Section 2 we find that this is upper bounded by
?
<
- ( l l - CM[Is2[I)[ISlI[ 2 + MM(ld + A2,M)[[Sl[[ lls211 - ( l u M m - CMIIi7 - s2lI)llsul[ 2 - vmlls2]] 2 (7.27)
-IIh~all 2 -IIA:OlI 2 By completing the squares, we find that ~
1 M 2 (lg + n2,M))[[Sl[[ 2 2 Mr~
--(/1 - - C M I I s 2 1 1
1 --(-~ldMm - CMllil -- ~211 - 1MmA2,M)] I~112
-Dmllszl] u -IIAla[] 2 -IIA2~]I l
(7.28)
Noting that -- 82 : 81 "~- ~d --
Ale
-
AuG
(7.29)
we see that if (7.24-7.25) are satisfied, then there exists a region ~t in which for some ~ > 0 1 / < -aJly][ 2
Vy Ct2
(7.30)
where y = [81, AI~, 82, A2~] T. By Lyapunov theory we thus have t h a t y = 0 is an exponentially stable equilibrium of the system (7.1)-(7.14-7.16). We find an estimate of the region of attraction as follows. If the inequality
M,n (1 + v~)llyll < 2---~M(Zd - A2,M) - VM
(7.31)
is satisfied, then by Cauchy-Schwartz
CM(llSlll + [IAI~II + IIA2~II) < - ~ ( Z d - A2,M) - CMVM
(7.32)
If the inequality 11 [ly[I < CM
1 M2 q 2MmCM ~d+A2'M)
(7.33)
7. Output Feedback Tracking Control for Ships
317
is satisfied, then 1 2
Cull~211
Iv1 m
(Id +
(7.34)
A2,M)
Define = min{
1
Mm
A2 M) -- VM), 11
(1 + ~)(2---5-~M (ld --
'
CM
1
M2
2 M--~M
l, k~d "~- A2,M)}
(7.35) Then (7.30) holds for f~ defined by (7.36)
{Y 9 ~121 Ilyll <5} Furthermore, note that 1 12 pml[Y[I < V(y) <~PMllY[
(7.37)
where Pm: PM
=
min{Mm,Al-,~a, A2,1}
(7.38)
max{UM, A~,~, A2,m} -1
(7.39)
,/'Pro 5
(7.40)
Thus if ]ly(O)lJ < V PM
then < -c~l[y(t)[I 2
Vt > 0
(7.41)
We have thus proved the following proposition Proposition 7.1 Consider the ship (7.1) and the observer-controller scheme d
~V
^
M(~)~
=
z + (A2 + ldI3•
=
-C(r +M(r
Te
=
- L181 - A l e -~- M ( r
(7.42) 1
+ ldI3•
(7.43)
C(~2,772)(?) 2 - 81) -~- D(~2)~ 2 - 1181 - A l e Jr- M ( r
1
-(A2 - M(r
(7.44)
318
7. Output Feedback Tracking Control for Ships
where A1 =diag{All,A12,A13} > 0 and A2 =diag{A21,A22,A23} > O. Under the conditions Id
>
2CMVM A2,M + ~
(7.45)
I~
>
2 Mm
1 MS (A2,M +
(7.46)
where VM is the maximum value of the reference velocity ild, A2.M = max{A21,A22 A23} and CM, MM, Mm are de.fined in (7.2-7.5), the closedloop system is locally exponentially stable. A region of attraction is given by
A = {y 9 ~121 I]yll <
5}
(7.47)
where y = [81, AIR, s2, A2~]T
1 M.~ A2,M) - VM), ll 6 = min{ (1 + V"2) (2--~M(ld -~M
1 M ~ (ld + A2,M)} 2Mm------~M
Pm = minfMm,A~,~, A~,~} and P . = ma~{MM, Al,m, -1 A2,m}. -1
R e m a r k 7.1 The exponential convergence of y implies the exponential convergence o.f x = [sl, ~, s2, (?]T as y and x are linearly related.
R e m a r k 7.2 Note that the region of attraction can be made arbitrarily large by choosing the control parameters l d and ll large enough. This means that the closed-loop system is semi-globally exponentially stable.
R e m a r k 7.3 Note that.for the special case where the desired trajectory is a constant position 74 = constant (dynamic positioning), then the closed-loop system is semi-globaUy exponentially stable if
la
>
A2,M
(7.48)
li
>
1 2 M~4 M~ IA ~I~,M + ld)
(7.49)
R e m a r k 7.4 >From (7.28) we see that it is due to the Coriolis and centripetal te~n that the result is semi-global as opposed to global. For the dynamic positioning problem it can be assumed that the Coriolis and centripetal forces and moments are zero, and then CM = O. Then, under the conditions (7.48-7.49) the closed-loop system is globally exponentially stable.
7. Output Feedback Tracking Control for Ships
319
2.5 2 1.5 1 o.s Q. -0. -1 -1.5 -1
0
1
2 3 y-position
5
6
FIGURE 1. The ship trajectory (-) and the desired trajectory (- -) in the xy-plane.
R e m a r k 7.5 Due to the combined observer-controUer design, ~n which the observer and controller structure are chosen in accordance with each other, we have obtained a computationally simple observer and controller. The tuning of the system will therefore be quite simple. We may choose A1 and A2 to give the desired rate of convergence of ~ and ~ (A1 < A2 as we want the estimation error to converge to zero faster than ~). Then, we choose the control parameters ld and 11 satisfying the inequalities (7.~5-7.~6). The choice of ld and 11 will be a trade-off between the size of the region of attraction on the one side and the thruster limits and amplification of measurement noise (since these parameters can be viewed as the derivative gains) on the other side. The observer-controller design does however not take thruster limitations into consideration, and the closed-loop system does not respect such bounds if the initial conditions are poorly chosen.
4
Simulations
T h e simulations were performed using the m a t h e m a t i c a l model of Cybership I, a model ship of scale 1:70 of an offshore supply vessel. Cybership I has a mass of 17.6 kg and a length of 1.2 m. The centre of gravity is located at x c = - 0 . 0 4 m aft of midships, and this is the origin of the body-fixed coordinate system. Assuming t h a t the Froude n u m b e r is constant, we have
320
7. Output Feedback Tracking Control for Ships
actual ,
3 2
....
-101 -2
and
desired
actual and desired xdot [m/s]
x [m]
.
O.
,
...:
.
.,:
.
.
.
.
0.2
.
0.1
:
.
0
50
100 150 200 actual and estimated x [m]
. . . . . . . .
"20
0
:
,
0.2'
:\1
1O0
150
:
~ 50
0. . . . : 100 150 200 actual and estimated xdot [m/s]
i i
:
50
100
....
o
50 1O0 150 200 measured and estimated x [m]
50
i
.
.. . . . . .
0 0
.
~....
1
~
.
-0. 0
150
200
200
FIGURE 2. The position variable x (-) and the desired position Xd (- -) [m], the velocity variable x (-) and the desired velocity J:d(- -)[m/s], the position variable d z^ (- -), the x (-) and its estimate 2 (- -), the velocity x (-) and its estimate ~7 measurement of x (-) and the estimate ~ (- -). the following relationship between the speed of the ship and the m o d e l ship Us ~ 8.37UM(m/s)
(7.50)
where the subscripts ~q and M denote the ship a n d the model respectively. In the simulations, we introduced input m a g n i t u d e s a t u r a t i o n ~-~,.2~x= 10 N and ~-~x = 10 Nm. T h e m e a s u r e m e n t frequency was 50 Hz. In the simulations, the m e a s u r e m e n t noise in the position variables x a n d y were in the m a g n i t u d e of 10 cm, corresponding to m e a s u r e m e n t noise of m a g n i t u d e 7 m for the original supply vessel. T h e heading m e a s u r e m e n t noise was of m a g n i t u d e 0.1 deg. No apriori i n f o r m a t i o n of the s t a t e variables was assumed.
7. Output Feedback Tracking Control for Ships actual and desired ydot [m/s] : : :
actual and desired y [m] 0.1
6
o.1
.
.
.
.
.
.
.
.
.
o.o.
.
00 0
.: 0
6 4
....
0
:
50
.......
i
:
1 O0
150
.
.
.
.
.
.
i :
; ........
:: 9 .......
.
.
.
.
.
.
.
.
.
.
:~....
; ......
o.+.
50 1 O0 150 200 measured and estimated y [m] : : .
.
50 100 150 200 actual and estimated ydot [m/s]
O. o .
.
.
actual and estimated y [m] 6
321
-0 0
.......
50
~ ..........
~.........
~ ....
100
150
200
200
F I G U R E 3. The position variable y (-) and the desired position yd (- -) [m], the velocity variable ~) (-) and the desired velocity ?)d(- -)[m/s], the position variable y (-) and its estimate ~) (- -), the velocity y (-) and its estimate dz) dt (--),the measurement of y (-) and the estimate ~ (- -).
The desired trajectory was
Xd
=
2.5sin(O.O2t)
(7.51)
Yd
=
2.5(1 -- Cos(O.O2t)
(7.52)
rd
=
O.02t
(7.53)
corresponding to the model ship moving at 5 cm/s along a circle of radius 2.5 m. The initial value of the desired trajectory was at the origin of the earth-fixed coordinate system. The initial values of Cybership I were [x,y, r
7r r T = [-1,0,-~,0,0,0] T
(7.54)
322
7. Output Feedback Tracking Control for Ships
~__ " i ......i i,
actual and desired psi [rad] 4
:
i
3 ........ :
actual and desired psidot [rad/s]
:
i
........ :
-o.s
O0
50
1 O0
150
200
0
actual and estimated psi [rad] 4
9
,
4
.
.
0
50
.
.
.
.
.
.
.
1 oo
150
20O
.
50 1O0 150 200 measured and estimated psi [rad] 9
50
actual and estimated psidot [rad/s]
if~ i ............ 0
::........
~o
i
0
50
.., 1O0
i 150
91 200
.
.
.
.
.
.
.
.
1O0
.
.
.
.
.
.
.
.
150
.
200
FIGURE 4. The yaw angle r (-) and the desired yaw angle Cd (- -) [rad], the yaw rate r ( - ) a n d the desired yaw rate r -)[rad/s], the yaw angle r (-) and its estimate ~ (- -), the yaw rate r (-) and its estimate ~dr ^ (- -), the measurement of r (-) and the estimate r (- -). T h e observer and controller p a r a m e t e r s were chosen as 0.05 0 0
A1
=
A2
=
ld
=
2.5
11
=
10
0 0.05 0
0 0 0.05
[0 o 0] 0 0
0.5 0
0 0.5
(7.55) (7.56) (7.57) (7.5s)
Note t h a t the conditions (7.45-7.46) and the estimate of the region of attraction in (7.47) are conservative. In order to reduce the thruster forces c o m m a n d e d by the controller, we chose l d and ll below the bounds given in (7.45-7.46) in the simulations. We see in Figure 5 t h a t the natural logarithm of the norm of [~T, ~T]T w a s upper bounded by a decreasing straight line, showing t h a t the convergence was still exponential. The natural logarithm
7. Output Feedback Tracking Control for Ships
taul [N]
323
tau2 [N]
,~I
'~0
50
100
150
200
tau3 [Nm]
)0 The natural logarithm of the norm of [etatilde,eha
1 01
! .....
i .......
:
....... :
0
-2 -3 -5
-10
...........................
50
100
-4...
150
200
-6
50
100
150
200
FIGURE 5. The surge control force T1, the sway control force T2 [N], the yaw control torque T3 [Nm] and the natural logarithm of the norm of [~T, ~T]T. of the norm became quite influenced by noise as it took values lower t h a n approximately - 2 , corresponding to the norm value being lower t h a n 0.15. This corresponds to the fact t h a t the measurement noise was of m a g n i t u d e 0.1, and thus the influence of the noise became significant as the n o r m reached such low values. The simulation results are shown in Figures 1-5. In Figure 1 we see how the ship moves in the x y - p l a n e . We see in Figures 2c4c t h a t the position estimates ~, ~ and the yaw angle estimate r converge to the position variables x, y and the yaw angle r respectively, despite the measurement noise. The estimated position and yaw angle together with the noise-corrupted measurements which are the inputs to the observer, are shown in Figures 2e-4e. In Figures 2d-4d we see t h a t the velocity estimates are quite noise-corrupted. T h e impact of the noise on the velocity estimates will depend on the "derivative gains" Id and 11. This can be seen from (7.42), where we see t h a t the measurement noise is fed directly into the velocity estimate, and the choice of the control p a r a m e t e r s A2 and ld, and by (7.46) also 11, will therefore be a trade-off between the size of the region of attraction (7.47) and the influence of the m e a s u r e m e n t noise on the velocity estimate d~) T h e influence of the m e a s u r e m e n t noise on the dt " velocity estimate is reflected in the control force and moment, Figure 5.
324
7. Output Feedback Tracking Control for Ships
(In practice the actuator dynamics will give low-pass filtering, removing the high-frequent signals from the actual control input.) In Figures 2 a 4a and 2b-4b we see that despite the influence of the measurement noise, the controller performance is good. This may be due to the second-order filtering of the ship from force and moment to position and yaw angle. In particular the position and yaw angle tracking control is good, and for ship control this is more important than the velocity tracking.
5
Bias E s t i m a t i o n
The ship will be subject to environmental disturbances due to wind, waves and currents. We do not want the control law to react to the high-frequent components of the disturbances, because this would give increased wear and tear on the actuators. The controller should instead compensate for the slowly-varying bias of the disturbance. We therefore want to include bias estimation in the control scheme. A frequently used bias model for maritime control applications is the 1st order Markov process [8] = -T-lb + Bn
(7.59)
where b C ~}~3 is the vector of bias forces and moments and T is a diagonal matrix of positive bias time-constants. The bias model is driven by zeromean Gaussian white noise n E N3, and B is a diagonal matrix scaling the amplitude of the white noise. This model can be used to describe slowly varying environmental forces and moments due to second-order wave drift, ocean currents, wind and also to describe unmodeled dynamics. For the analysis we assume that n = 0, since the bias estimator model will be driven by estimation errors. The ship model is then M(r
+ C(r
+ D(r
= T~ + b
(7.60)
We use the observer developed in Section 3 together with a bias estimator proposed by [8], and we use the bias estimate in the controller d ~-~?) = M(r
=
-6(r +M(r
d --b dt ^
(7.61)
z + (h2 + ldI3• -llSl-
hl~+ M(r
(7.62)
+ ldI3x3)A2~
(7.63)
-T-lg + Kb~) - b -~- C(~), ?)2)(~ 2 -- 81) ~-
- ( A 2 - M(r
D(r
-
1181
-
AlL +
M(r (7.64)
7. Output Feedback Tracking Control for Ships
325
where Kb = KbT > 0. We cannot use the analysis of [8] to prove stability of the system, as the analysis is based on an open-loop model of the ship, and also the ship model does not include Coriolis and centripetal forces and moments as it is modeled for dynamic positioning purposes, not for tracking control. To prove that our closed-loop system is semi-globally exponentially stable, we use the Lyapunov function candidate V(Sl,~, s2,~, b)
IT
=
I T AI~ § l s T M ( r
~s 1 M ( r
§ ~
§
§ l"ybTb
(7.65)
where b --- b - b, and where -y is a positive constant. The time-derivative of V is then
--8T1(1113•
-
-
-s~(~dM(r
C(~,D,82))s I §
sT(M(~b)A2 +
+ C ( r ~ - s2))s2 - 4 D ( r
IdM(r - s~
--~TAITA18 -- ~)TATA2O -- 7bTgbb - .,/~TT-I~)
(7.66)
Using the matrix properties we find that this is upper bounded by ~r
__~ --(/1 -- CMI[S2I[)I[Sl[i 2 + MM(Id + A2,M)I[Sl[[ [IS2[[
(7.67)
- ( l d M m - CMIlil -- s21I)lls2112 -- Dmlls2112 + 11]s2111]')'b[I "7 1 11"7~,1: -[IAl~ll 2 - Ilh2OlJ 2 + KbMA~-llbblJ []A27)1[ - ~---~M I where KbM and TM are the maximal eigenvalues of the matrices Kb and T respectively. By completing the squares, we find that
l~r ~
S (Zd + A2,M))lIsllI 2 2x M Mm
--(Zl --CMIlS21[
(7.68)
- ( l ldMm - CMlli7 -- s2l[ -- 1MmA2,M - 2 TM )llsill 2 "7 1 1 7 _ 2Kb2MA22)ll'7/~[]2 -Dmlls2112 -IIAl~ll 2 - ~llA27)ll 2 - ( 2TM" We see then that if the following conditions are satisfied
A2m
KbM
<
ld
>
A2,M § ~
11
>
1 M 2 (A2 M + ld) 2 Mm '
(7.69)
2 X/"TTM 2CMVM
4TM
§ Mm----~
(7.70) (7.71)
326
7. Output Feedback Tracking Control for Ships
for some 7 > 0, then there exists a region f~ in which for some a > 0
? _< -allyll 2
(7.72)
Vy Cgt
where y = [Sl,AI~,S2,A2~,"ff)]T. By Lyapunov theory we thus have t h a t y = 0 is an exponentially stable equilibrium of the system (7.60-7.64). For tuning purposes, we can interpret the condition in (7.69) as an upper b o u n d on the inverse of the integral time constant of the estimation error ~, el. (7.63-7.64). Furthermore, the condition (7.70) can be interpreted as a l o w e r bound on the derivative gain of the estimation error ~), and (7.71) as a lower bound on the derivative gain of the error ~. We can find an estimate of the region of attraction along the same lines as in Section 3, and we then have the following proposition
Proposition
7.2 Consider the ship (7. 60) and the observer-controller with
bias estimator d
^
M(r
=
- llSl - Ale + M(r
-C(r
=
1
+ ldI3x3)A2~l
+M(r d^ --b dt
(7.73)
z + (As + l d h x 3 ) ~
(7.74)
- T - 1 D + Kbr]
(7.75)
--D -t- C(~), ?)2)(?) 2 - 81) Jr- D ( ~ ) ) ~ 2 -(As
llSl
- M(~b)A22)~
-
Ale
nt- M ( r
1
(7.76)
where A1 =diag{All,A12,A13} > 0 and A2 =diag{Am,A22,A2a} > O. Under the conditions A2m
KbM
<
ld
>
A2,M-t--
(7.78)
ll
>
2 Mm
1 M• (A2,M + la)
(7.79)
(7.77)
2x/~TM 2 CM VM 4TM + - Mm Mm 7
where KbM is the maximum eigenvalue of the symmetric positive matrix Kb, VM is the maximum value of the reference velocity ild , TM is the maximum of the bias time constants, A2,M = max{)~21,/~22,)~23}, A2,m = min{Am,A22,A23}, where CM,MM, Mm are defined in (7.2-7.5) and ~/ is some positive constant, the closed-loop system is locally exponentially stable.
7. Output Feedback Tracking Control for Ships
327
A region of attraction is given by
A - - {y E ~121 []y][ <
5}
(7.80)
where y = ----[81, AI~, s2, A2~, 7b] T
5
=
1 Mm min{ (1 + v/2)(2---~M (ld -- A2'M) M2 1 ~v~M
ll CM
2TM ~/CM
A2,M)}
l
VM),
(7.81)
2 M m C M ( d ~-
9 --1 --1 --1 --1 --1 P,~ = mm{M~,A1,M,A2,M, 7 --1 } and PM = maX{MM, Al,m, A2,m,3' }.
R e m a r k 7.6 For dynamic positioning the desired trajectory is a constant position, ~d = constant, and then the closed-loop system is semi-globally exponentially stable i.f KbM
<
A2m 2x/~/TM
(7.82)
ld
>
A2,M + 4TM Mm~/
(7.83)
ll
>
2 Mm
1 M ~ (A2 M + ld)
(7.84)
'
Moreover, .for dynamic positioning purposes the Coriolis and centripetal .forces and moments can be assumed to be zero. Then, from (7.68) it is seen that under the conditions (7. 82-7. 84) the closed-loop system is globally exponentially stable.
6
Simulations with an Environmental
Disturbance
The simulations were performed with a disturbance bias that initially was
b0': [ ]0
(7.85)
which was of size ~oo of the control force magnitude saturation. The matrix of bias time constants was T =
1000 0 0
0 1000 0
0 ] 0 1000
(7.86)
328
7. Output Feedback Tracking Control for Ships
2..= 2 1.~ 1 9-~ O.E
-0.5 -1 -1.5 -t
0
1
2
3
4
y-position
5
6
FIGURE 6. The ship trajectory (-) and the desired trajectory (- -) in the xy-plane.
[0 0 0]
The bias estimator gain matrix was chosen as Kb :
0
0
0.2 0
0 0.2
(7.87)
Furthermore the parameter 7 -- TM = 1000. The other observer-controller parameters, and also the noise conditions and the reference trajectory, were chosen equal to those given in Section 4. Note that the conditions given in (7.69-7.71) are conservative. The parameter Kb chosen for the simulations is chosen above the bound of (7.69) in order to obtain faster convergence of the bias estimate. Furthermore, as in Section 4 the parameters l d and ll are chosen below the bounds of (7.70-7.71) in order to reduce the thruster forces commanded by the controller. The simulation results are shown in Figures 6-11. We see in Figure 11 that the natural logarithm of the norm of [?~T,~T, bIT was upper bounded by a decreasing straight line, showing that the convergence was exponential. From Figures 7-9 we see how the position estimates follow the position variables x, y and r despite the measurement noise. The velocity estimates are quite noise-corrupted, giving control inputs that are quite influenced by the noise as seen in Figure 10, but despite this the controller performance is good. In particular, the position and yaw angle tracking control is good, despite both measurement noise and the environmental disturbance. In Figure 11 we see that the bias estimates converge quite slowly, but the convergence of the vector loT, aT, ~]T is still exponential. Simulations with a longer time-scale show that the bias error
7. Output Feedback Tracking Control for Ships
3
actual a n d d e s i r e d x d o t [m/s]
actual and d e s i r e d x [m] 9 , , 9
actual a n d estimated x [m]
!o
.
o.1
.
.
.
.
.
.
.
.
.
.
.
.
-0
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.......... .........
0
0
0.6
Lo i oo
50 100 150 200 actual a n d e s t i m a t e d x d o t [m/s] : : :
0
-0 0
3
329
i
50
100
i ........
150
200
m e a s u r e d a n d e s t i m a t e d x [m] . . .
F I G U R E 7. The position variable x (-) and the desired position Xd (- -) [m], the velocity variable x (-) and the desired velocity ~?d(- -)[m/s], the position variable x (-) and its estimate ~ (- -), the velocity ~? (-) and its estimate ~dx^ (- -), the measurement of x ( ) and the estimate ~ (- -). l) converges to zero.
7
Conclusions and Puture Work
In this work a nonlinear observer and feedback control law was p r o p o s e d for o u t p u t feedback tracking control of ships. As the m e a s u r e m e n t s were quite c o r r u p t e d with noise, the control law used the filtered m e a s u r e m e n t s together with the estimated velocities for feedback. T h e observer a n d controller design were combined in order to utilize the observer s t r u c t u r e in the controller design a n d vice versa, in order to develope a c o m p u t a t i o n ally simple observer a n d control law. T h e resulting s y s t e m was p r o v e d to be semi-globally exponentially stable. If the Coriolis a n d centripetal forces and m o m e n t s were negligible, as for the special case where the desired traj e c t o r y was a c o n s t a n t position a n d orientation, the s y s t e m was globally exponentially stable. Furthermore, bias e s t i m a t i o n was i n t r o d u c e d in order to c o m p e n s a t e for the bias of environmental forces, a n d the o u t p u t feedback
330
7. Output Feedback Tracking Control for Ships
actual
6
.
and desired y [m] . .
-
50
1O0
150
: :
0 -0
~ 2 0 0 actual estimated 6 : and T : y [m] :4 .......... !.......... i ..................
-20
actual
0.1 0.0 0 1 ~
0
O.
200
-0~
.
and desiredydot [m/s] : : 9 : .... 0 ~
,
,
,
.
9
9
50 1O0 150 200 actual and estimatedydot [m/s]
0
0
m e a s u r e d and estimated y [m] 6
.
.
i
0
50
1O0
:
150
200
FIGURE 8. The position variable y (-) and the desired position ya (- -) [m], the velocity variable ~) (-) and the desired velocity ~)d(- -)[m/s], the position variable y (-) and its estimate ?) (- -), the velocity ~) (-) and its estimate ~ y (- -), the measurement of y (-) and the estimate 7) (- -).
d^
tracking control scheme including the bias estimator was proved to give a semi-globally exponentially stable system. The results were illustrated by simulations. The position and heading measurements of the ship will include the oscillatory wave motion. It is not desirable that the controller reacts to this wave motion, because this gives increased wear and tear on the actuators and increased fuel consumption. Therefore wave filtering should be included in future work.
Acknowledgments The authors would like to thank T. I. Fossen and A. A. J. Lefeber for the interesting discussions regarding the topic.
7. Output Feedback Tracking Control for Ships actual and desired psi [rad] 4
1
9
,
,
. . . . . . . . .
i
. . . . . .
.....
O0 4
.=i...
9 . . . . .
4
0
. . . . . . .
:
1O0
........
::
,!,
i ......
i ........
150
~ . . . . . . . . . . . . . . . . . . . . . . . . . . .
0
2
:
50 1O0 150 200 measured and estimated psi [rad] : : :
50
....
.
50 1O0 150 200 actual and estimated psi [rad] 9
0
:
actual and desired psidot [rad/s] : : :
-o.!l~s O~]r
331
0
50 1O0 150 200 actual and estimated psidot [rad/s] : : :
50
1O0
150
200
200
FIGURE 9. The yaw angle r (-) and the desired yaw angle ~Pd (- -) [rad], the yaw rate ~ ( - ) a n d the desired yaw rate Cd(- -)[rad/s], the yaw angle r (-) and its estimate r (- -), the yaw rate ~ (-) and its estimate ~~b ^ (- -), the measurement of ~p (-) and the estimate r (- -).
8
REFERENCES [1] M. F. Aarset, J. P. Strand and T. I. Fossen, Nonlinear Vectorial Observer Backstepping with Integral Action and Wave Filtering for Ships, Proceedings of the IFAC Conference on Control Applications in Mafine Systems (CAMS), Fukuoka, Japan, October, 1998. [2] H. Berghuis, Model-based Robot Control: from Theory to Practice, Ph.D. dissertation, Univ. Twente, Enschede, The Netherlands, 1993. [3] H. Berghuis and H. Nijmeijer, A Passivity Approach to ControllerObserver Design for Robots, IEEE Transactions on Robotics and Automation, Vol. 9, No. 6, pp. 740-754, 1993. [4] C. Canudas de Wit and J.J.-E. Slotine, Sliding Observers for Robot Manipulators, Automatica, Vol. 27, pp. 859-864, 1991. [5] C. Canudas de Wit, N. Fixot and K. J. Astrom, Trajectory Tracking in Robot Manipulators via Nonlinear Estimated State Feedback, IEEE
332
7. Output Feedback Tracking Control for Ships taul [N]
10
tau2 [N]
C -5 -100
50
100
150
200
150
200
,0
tau3[Nm]
0
-10
0
50
100
FIGURE 10. The surge control force T1, the sway control force ~'2 [N] and the yaw control torque ~'3 [Nm].
Trans. Robotics Automat., Vol. 8, pp. 138-144, 1992. [6] T. I. Fossen, Guidance and Control of Ocean Vehicles, John Wiley &: Sons Ltd., Chichester, 1994. [7] T. I. Fossen and O.-E. Fjellstad, Nonlinear Modelhng of Marine Vehicles in 6 Degrees of Freedom, International Journal of Mathematical Modelling of Systems, Vol. 1, No. 1, pp. 17-27, 1995. [8] T. I. Fossen and J. P. Strand, Passive Nonhnear Observer Design for Ships Using Lyapunov Methods: Full-Scale Experiments with a Supply Vessel, Automatica, Vol. 35 No. 1, 1999. [9] I. D. Landau and R. Horowitz, Applications of the Passive Systems Approach to the Stability Analysis of Adaptive Controllers for Robot Manipulators, Int. J. Adaptive Control and Signal Processing, Vol. 3, pp. 23-38, 1989. [10] S. Nicosia, A. Tornamb~ and P. Valigi, Experimental Results in State Estimation of Industrial Robots, in Proc. Conf. Decision and Control, Honolulu, HI, Dec. 1990, pp. 360-365.
7. Output Feedback Tracking Control for Ships
333
bias} and its estimate [N]
biasx and its estimate[N] 0.02-
0.1. =
o.Oiol. ........ ~~-.!
0.1 0.0.= ....... 0
!
:. . . . . . . . .
: . . . .
i ........
! ......
ij~J
i .......
-0.0, I
-0.05
.... /
......... ........
.... ~ ..........
-0.1
-0.15
......
~0
-0-20
150
-0.o~i /
: .....
1~0
200
biasz and its estimate [Nm]
o.o4(
......
io
......... ....
,,o
,oo
The natural logarithm of the norm of [etatilde,ehat,b
0.0. =
0.51 . . . . . . . . . . :. . . . . . . . . . . . . . . . . . . .
0
:. . . . . . . . .
0 -0.05
-0.5
/
9i:H
-0.1
-1
i ........................
-1.5 -0.15
ii -0.20
-2
i 50
1 O0
150
200
-2.5
50
1 O0
150
200
FIGURE 11. The disturbance bias (-) and its estimate (- -) in the x - direction and the y - direction [N], the disturbance bias about the z - axis (-) and its estimate (- -) [Nm] and the natural logarithm of the norm of [~T ~T,~)]T. [11] S. Nicosia and P. Tomei, Robot Control by Using only Joint Position Measurements, IEEE Trans. Automat. Contr., Vol. 35, pp. 1058-1061, 1990. [121 R. Ortega and M. W. Spong, Adaptive Motion Control of Rigid Robots: A Tutorial, Automatica, Vol. 25, pp. 877-888, 1989. [13] B. Paden and R. Panja, Globally asymptotically stable ' P D + ' controller for robot manipulators, Int. J. Control, Vol. 47, pp. 1697-1712, 1988. [14] M. Faulsen, O. Egeland and T. I. Fossen, A Passive Feedback Controller With Wave Filter for Marine Vehicles, International Journal of Robust and Nonlinear Control, vol.8,no.15, pp.1239-1253,1998. [15] J.J.-E. Slotine and W. Li, On the Adaptive Control of Robot Manipulators, Int. Y. Robotics Res., Vol. 6, pp. 49-59, 1987. [16] J.J.-E. Slotine, J. K. Hedrick and E. A. Misawa, Sliding Observers for Nonlinear Systems, ASME J. Dynam. Syst., Measurement, Control,
334
7. Output Feedback Tracking Control for Ships Vol. 109, pp. 245-252, 1987.
[17] M. Takegaki and S. Arimoto, A New Feedback Method for Dynamic Control of Manipulators, ASME J. Dynam. Syst., Measurement, Control, Vol. 102, pp. 119-125, 1981. [18] B. Vik and T. I. Fossen, Semi-Global Exponential Output Feedback Control of Ships, IEEE Transactions on Control Systems Technology, TCST-5(3):360-370, 1997.
D y n a m i c U C O Controllers and Semiglobal Stabilization of Uncertain N o n m i n i m u m Phase Systems by Output Feedback A. Isidori 1, A. R. Teel 2 and L. Praly 3 1Department of Systems Science and Mathematics, Washington University, St. Louis, MO 63130 and Dipartimento di Informatica e Sistemistica, UniversitA di Roma "La Sapienza", 00184 Rome, ITALY. 2Department of Electrical and Computer Engineering, University of California, Santa Barbara, CA 93106, USA. aCentre Automatique et Syst~mes, t~cole des Mines de Paris, 35 rue St. Honor6, 77305 Fontainebleau c6dex, FRANCE.
1
Introduction
One of the most active research issues in nonlinear feedback theory is the synthesis of feedback laws which robustly stabilize an uncertain system with limited measurement information. In the case of output feedback without uncertainty, one of the major achievements in this area of research has been the "nonlinear separation principle" proved in [6], where it is shown that (semi)global stabilizability via state feedback and a property of uniform observability imply the possibility of semiglobal stabilization via o u t p u t feedback. To cope with the restricted information structure, the stabilization of [6] includes an approximate state observer (whose role is actually that of producing approximate estimates of a number of "higher order" derivatives of the output) earlier developed in [3] to cope with a similar (though more restricted) stabilization problem. A "robust" version of this stabilization result was given in [5], where it was shown that, in the presence of parameter uncertainties, semiglobal stabilization via output feedback is still possible if a state feedback law is known which robustly globally stabilizes the system and its value, at any time, can be expressed as a (fixed) function of the values, at this time, of a fixed number of derivatives of input and o u t p u t (a uniforTnly completely observable (UCO) state feedback, in the terminology of [5]). The design tools introduced in [3] and [5] have been recently used in [2], where a new (iterative) procedure has been proposed for the robust stabilization of certain classes of nonlinear systems. This procedure is not based
336
8. Dynamic UCO Controllers and Stabilization by Output Feedback
on the idea of solving separately a problem of state feedback stabilization and a problem of a s y m p t o t i c state reconstruction. Rather, it is based on the recursive u p d a t e of a sequence of "dynamic" output feedback stabilizers: specifically, the basic result of [2] is t h a t if a suitable subsystem of lower dimension is robustly stabilizable by dynamic o u t p u t feedback, so is the entire system. From the point of view of the approach of [5], the condition on which the result of [2] relies (that happens to be necessary in the case of linear systems) can be viewed as a condition for the existence of a dynamic feedback driven by functions t h a t are expressible in terms of the o u t p u t and its derivatives, i.e., driven by U C O functions. In this chapter we review and extend the result of [2] and we show how this result can also be obtained as a special case of a general stabilization result based on the existence of a dynamic feedback driven by U C O functions. More specifically, after some preliminary definitions in Section 2 including our definition of uniform semiglobal practical a s y m p t o t i c stability, we discuss stabilization of n o n m i n i m u m phase nonlinear systems by output feedback in Section 3. This discussion is split into two parts: the relative degree one case in Section 3.1, and the higher relative degree case in Section 3.2. The main results of these sections are t h a t if a reduced order, auxiliary system can be stabilized by dynamic output feedback then the original n o n m i n i m u m phase system can be stabilized by dynamic o u t p u t feedback. In Section 4 we show how the results of Section 3 can be viewed as special cases of a general result on semiglobal practical a s y m p t o t i c stabilization by output feedback. In Section 4.1 we present some additional definitions, including the notions of uni.formly completely observable (UCO) functions and uniform semiglobal practical a s y m p t o t i c stabilizability by dynamic UCO feedback, and a general output feedback stabilization result which expands on the ideas in [5]. This result is specialized to the case of n o n m i n i m u m phase nonlinear systems in Section 4.2. In this section, we compare and contrast the controllers developed in Section 3 explicitly for the n o n m i n i m u m phase nonlinear system case to the controllers t h a t result from following the synthesis steps presented in [5].
2
Preliminaries For simplicity all nonlinear functions in this chapter will be assumed to be sufficiently smooth so t h a t all needed derivatives exist and are continuous, all differential equations have solutions, etc. 9 We will use ~ n ( r ) , with r > 0, to denote a closed ball of radius r in ~n. 9 Unless otherwise noted, #(t) is a measurable function taking values in a compact set P C ~ P . The set of such functions is denoted A/I~,.
8. Dynamic UCO Controllers and Stabilization by Output Feedback
337
9 The origin of a nonlinear dynamical system
= f(x,#(t), k) ,
(8.1)
with x E ~ n and k E Lr~c, is said to be uniformly semiglobally practically asymptotically stable in the parameter k if for each pair of strictly positive real numbers 0 < r < R < cxD there exist k E ~ c an open set O D Bn(R), a function V : O --~ ~ > 0 that is proper on O and strictly positive real numbers 0 < q < Q < cx~ such that i.)
~n(R)
ii.)
Bn(r) D {~ E O : Y(~) < q},
c
{~ E O : V(~) < Q},
iii.) and
OV Oxf(X'#'k) <0
V#E~,VxE
{~EO:q
Uniform semiglobal practical asymptotic stability implies: for each pair of strictly positive real numbers 0 < r < R < cx~, there exist k E ~ c and T > 0 such that, for all initial conditions in ~ n ( R ) , all resulting trajectories x(t) of (8.1) with k = k are such that x(t) E B . ( r ) for all t >_ T. It also can be shown to imply: for each pair of strictly positive real numbers 0 < r < R < oe, there exist k E E~c, a compact set `4 C_ ~,~(r) and an open set G D ~ n ( R ) such that, for the system (8.1) with k = fc, the set .4 is uniformly asymptotically stable with basin of attraction G. By this we mean: 9 for each E > 0 there exists 5 > 0 such that all trajectories starting in a 5-neighborhood of .4 remain in an E-neighborhood of .4 for all time, and 9 for each c > 0 and each compact subset of ~ there exists T > 0 such that all trajectories starting in the compact subset enter within T seconds and remain thereafter in an c-neighborhood of ,4. In fact, due to recent converse Lyapunov function results (see [4], [1], [7]), these latter properties are equivalent characterizations of uniform semiglobal practical asymptotic stability. However, we are using the Lyapunov formulation here so that we can more directly appeal to the results on semiglobal practical asymptotic stabilization like [5, Proposition 3.1] where a Lyapunov formulation was used.
338
3
8. Dynamic UCO Controllers and Stabilization by Output Feedback
Stabilization of N o n m i n i m u m Phase S y s t e m s by
Output Feedback 3.1
The Relative Degree One Case
Most methods for robust stabilization of a nonlinear system by relative degree one output feedback rely on the hypothesis that the system has an asymptotically stable zero dynamics. The main reason why this hypothesis is assumed is that most of the methods in question use "high-gain" feedback in order to keep the output small, thereby enforcing a behavior whose asymptotic properties are essentially determined by the asymptotic properties of the zero dynamics. In particular, asymptotic stabilization occurs only if the latter is asymptotically stable, i.e., if the system is minimum phase. Consider robust (with respect to disturbances #(t)) stabilization of the origin for the system
9
= f0(z,y,,(t)) =
(8.2)
q(z,y,~(t))+b(y)u
where z E ~ n - 1 , Y C ~ , u C ~ , #(.) G AJp and b(y) ~ 0 for all y. In the case of uniformly globally asymptotically stable zero dynamics, i.e. (see [4]) when there exists a smooth, positive definite and proper function V(z) such that
OV Ozf~
Vz%O,
V#~P ,
the control law
u-
1
b(y)
ky,
where k is a sufficiently large number, solves the problem of semiglobal practical asymptotic stabilization of the origin. This follows from the fact that, given a compact set in (z, y) not containing the origin, for large enough ov f0(z, 0, #) - ky 2 in the derivative of the k the negative definite term -5~-~ composite Lyapunov function
U(z, y) = V(z) + y2 , i.e., in
OV
Oz fo(z, ~ , , ) + 2y[q(z, y, ~) - ky] ,
is able to dominate all nonnegative terms on the given compact set. In the case where the original output does not yield an asymptotically stable zero dynamics, one approach is to look for a new o u t p u t function, of
8. Dynamic UCO Controllers and Stabilization by Output Feedback
339
the form y - y* (z), for which the resulting system is uniformly minimum phase. Then, by following the reasoning above, the control 1 b(y) k ( y - y * ( z ) )
u-
may be used to achieve robust semiglobal practical stabilization of the origin. The potential drawback to this approach is that it requires the measurement, or at least the robust observability via the actual measured output y and the input u, of the term y*(z). Looking at the structure of the system (8.2), we see that the main information about the z subsystem that is robustly observable through the measurement y and the input u is the term q(z, y, it(t)) and perhaps its derivatives. The discussion that follows, in this and the next subsection, describes one very efficient way, suggested in [2], to use the information contained in q(z, y, it(t)) to design a stabilizing feedback law without actually requiring a measurement of q(z, y, p(t)). We will suppose A s s u m p t i o n 8.1 For the auxiliary system
= =
fo(z,~,it(t)) q(z,~,it(t)) ,
(8.3)
the controller =
N(~),
(8.4)
with N(O) -- O, is such that the origin o.f the system (8.3),(8.~) is uniformly globally asymptotically stable. Under this assumption, we can state the following result for the system (8.2) under the action of the controller
=
n(~) + M k [ y - g ( ~ ) ] 1 g 9N
-
b(y) [ - 0 T
+ Mk[y
1
-
(8.5)
- k[y -
Note that this is simply a dynamic feedback of the original (nonminimum phase) output y. T h e o r e m 8.2 Under Assumption 8.1, the origin of the system (8.2), (8.5)
is uniformly semiglobally practically asymptotically stable in the control parameter k. P r o o f . The result is established by noting, with the help of the input transformation
l
[ ONu~
u~ON
u = b - ~ --5"L(~) + (1 - - ~ - M ) v
]
,
(8.6)
340
8. Dynamic UCO Controllers and Stabilization by Output Feedback
that the system ~? = =
~1 --
fo(z,y,,(t)) n(~)
-
Mv
ON ( ON q(z,y,p(t)) +-~-~n(7~) + 1 -
]
(8.7)
v
with output 0 = y - N(W) has relative degree one with high-frequency gain identically equal to one, is minimum phase and can be written, globally, in a form that matches (8.2). Specifically, in the coordinates (z, ~, 0) where := ~ + MO, we have:
=
f 0 ( z , N ( ~ - MO) + O,#(t)) L(~ - MO) + Mq(z, N(~ - MO) + 0, #(t))
=
q(z,N(~ - MO) + O,l~(t)) + v .
=
0
(8.8)
By Assumption 8.1, when 0 is set to zero, the origin of the (z, ~) dynamics is uniformly globally asymptotically stable. It follows from the discussion above that the choice v = -kO is semiglobally practically stabilizing for the origin of (8.8). And, since N(0) = 0, the origin of (8.8) corresponds to the origin of (8.2),(8.5). Moreover, with this choice for v we see from (8.6) and the ~b equation in (8.7) that we recover the control law (8.5). A R e m a r k 8.1 If a controller of a form more general than (8.4) like 2~ =
=
exists (in the case where ~ depends on g we would need an assumption that guarantees a solution ~ to the second equation), a controller of the form (8.4) can be obtained by dynamic extension as =
~v+l
=
--frt(~v+l
-- Y)
with m a positive number. Instead of achieving uniform global asymptotic stability for the auxiliary system, this controller would, in general, achieve uniform semiglobal practical asymptotic stability in the parameter rn, at least in the case where the functions #(t) are restricted to have uniformly bounded derivatives. While this would complicate the above discussion, the conclusion of the theorem would still be the same. z~ R e m a r k 8.2 As discussed in [51, various local conditions can be imposed on the system (8.8) to guarantee uniform semiglobal asymptotic stability, as opposed to only uniform semiglobal practical asymptotic stability.
8. Dynamic UCO Controllers and Stabilization by Output Feedback
3.2
341
T h e R e l a t i v e Degree G r e a t e r t h a n O n e Case
T h e result of the previous section, on stabilization by dynamic o u t p u t feedback, can be extended to the case of outputs with relative degree greater t h a n one. Consider a nonlinear system modeled by equations of the form
~,
=
f(z, 41,... ,~-,#(t))
(8.9)
~,
=
q(z,41,... ,4~,#(t)) + b(4)u
Y
=
41
in which z E ~ n - r , #(.) C A/Ip and b(4) r 0 for all 4- This normal form m a y result from applying a globally defined, perhaps # dependent, coordinate transformation to a nonlinear system given in some other form. T h e only measurement t h a t we will assume is available is the o u t p u t y. W h a t we will show is that if a particular reduced system can be stabilized with measurements of 4 and q(z, 41,.. 9 , 4r, #(t)) then the system (8.9) can be stabilized with measurement of y only. W i t h the system (8.9), we associate an auxiliary system
5:a =
f~(xa, u~,#(t))
(8.10)
= h (xa,
in which z
"~
Xa'l /
41
X& ~
k x~,2 / j
and
f(z,
= (.
41, 9 9 . , 4r--1, Ua, ~(t))
fa'l(Xa'Ua) ) 42 ~
4r-- 1
?da
342
8. Dynamic UCO Controllers and Stabilization by Output Feedback
and
hgxa,
:= q(z,
r-1, ua,
A b o u t this system, we a s s u m e the following: Assumption
8.3
The controller ~b = u~ =
L(~,x~,2) + M y a N(~,x~,2),
(8.11)
with N ( 0 , 0 ) = 0, is such that the origin of the system (8.10),(8.11) is uniformly globally asymptotically stable. Under this assumption, we can s t a t e the following result for the s y s t e m (8.9) under the action of the controller ~b =
L(~,x~,2) +
u
b(~)
--
Mk[~r - N(~,x~,2)]
Mk[i~ - N ( ~ , x ~ , 2 ) ] ] + q ON f~,2(x~,2, ~ ) - k [ ~ - g ( ~ , Xa,2)]J Ox~,2 [L(~,xa,2) +
(8.12) Q
Note t h a t this is a d y n a m i c feedback of the o u t p u t y a n d its first r - 1 derivatives. 8.1 Under Assumption 8.3, the origin o.f the system (8.9), (8.12) is uniformly semiglobaUy practically asymptotically stable in the control parameter k.
Lemma
P r o o f . T h e p r o o f is the s a m e as the p r o o f of T h e o r e m 8.2. W i t h the i n p u t transformation
U = b - 1~ [ - ~ON - ~ L ( p , x ~ , 2 ) + ~ON f ~ , a ( x ~ , 2 , ~ ) + ( 1 - - -ONM~ ~ )
v]
(8.13)
we get the s y s t e m
Mv ~r ~--- ha(xa, ~r, #(t))-~ ~b =
L(~,xa,2) -
L(qo, Xa,2) +
(8.14) O--~a,2fa,2(Xa,2,Cr) +
1-
v
that, with o u t p u t 0 = ~ - N ( ~ , x,,,2), has relative degree one w i t h highfrequency gain identically equal to one, is m i n i m u m p h a s e a n d c a n be written, globally, in a form t h a t m a t c h e s (8.2). Specifically, in t h e c o o r d i n a t e s
8. Dynamic UCO Controllers and Stabilization by Output Feedback
343
(xa, ~, O) where ~ := qo + MO, we have:
k.. = I~(xo, N(~-MO, x~.,~)+O,.(t)) =
L(~ - MO, x~,2) + M h ~ (x~,N(~ - MO, x~,2) + 0 , # ( t ) )
=
h~(x~,N(~-MO,
(8.15)
x~,2)+O,#(t))+v.
By Assumption 8.3, when 0 is set to zero, the origin of the (x~, ~) dynamics is uniformly globally asymptotically stable. It follows, as before, that the choice v = - k O is semiglobally practically stabilizing for the origin of (8.15). And since N(0,0) -- 0, the origin of (8.15) corresponds to the origin of (8.9),(8.12). Moreover, with this choice for v we see from (8.13) and the equation in (8.14) that we recover the control law (8.12). A The dynamic controller (8.12) uses the state variables ( 1 , . . - , 4 ~ , i.e., the derivatives up to order r - 1 of the output y of system (8.9), as input. Thus, in order to find an output feedback controller, these variables must be replaced by appropriate estimates, which can be provided by a dynamical system of the form il = P~l + Qy
(8.16)
/ 110 0/
in which the matrices Q and P have the form
P .
-g2c~_2 0 1 ... . . . . . . -g"-lCl 0 0 ... --grco 0 0 ...
0
,
Q =
1 0
1 /
-g2c~_2 -gr-lcl --grc o
(8.17)
As shown in [3], it is convenient to saturate the resulting control law, at least where the estimates of ~ appear, so as to avoid the occurrence of finite escape times for large values of g. For example, we can replace the controller (8.12), which for ease of notation we now write as =
(8.18) =
with the controller --
(8.19) where (re(-) is a (by abuse of notation both a scalar and vector) saturation function (re(v) = v . m i n
1,~ T .
344
8. Dynamic UCO Controllers and Stabilization by Output Feedback
A controller of this type is able to robustly semiglobally practically asymptotically stabilize the plant (8.9). In fact, using the methods of [5] for example, it is possible prove the following result. T h e o r e m 8.4 (See also [2]) Under Assumption 8.3, the origin of the sys-
tem (8.9), (8.16), (8.19) [with C(., .) and g(., .) de.fined by the identi.fication between (8.12) and (8.18)] is uniformly semiglobally practically stable in the control parameters (k, g, e).
4
On Dynamic UCO Feedback
The basic observation of [2], summarized in Section 3.2 and on which the result of Lemma 8.1 rests, is that the term q(z, ~ 1 , . . . , ~r-1, ~r, #(t)) in the system (8.9) can be (and, in a nonminimum phase system, has to be) "isolated" from the rest of the system, using measurements only of the output and its first r - 1 derivatives, and treated as a separate source of information for feedback. Then, having a dynamic controller driven by the output its first r - 1 derivatives, as in Lemma 8.1, it is straightforward using ideas initially developed in [3] to find a dynamic output feedback controller that induces the desired properties, as in Theorem 8.4. From this point of view, the contribution in [2] is the identification of a natural (in fact, for linear systems it can be shown to be necessary) condition (Assumption 8.3) that guarantees the existence of a dynamic feedback that is expressible in terms of the output and its derivatives. Then Theorem 8.4 can be viewed as a special case of a more general result that is essentially contained in [5] (see [5, Proposition 3.1 and footnote 5]), namely that semiglobal practical stabilization by dynamic uniformly completely observable (UCO) feedback implies semiglobal practical stabilization by dynamic output feedback. We make this result explicit below.
4.1
General Results
Consider multi-input, multi-output nonlinear control systems
ic = y =
f(x,u, tt(t)) h(x,u, it(t))
(8.20)
with #(.) E Ad~,. The definition of uniformly completely observable (UCO) dynamic feedback, given next, at times implicitly constrains #(t) to be sufficiently smooth, where sufficiently smooth has to do with the number of times the output needs to be differentiated to reconstruct the UCO function.
8. Dynamic UCO Controllers and Stabilization by Output Feedback
345
D e f i n i t i o n 8.1 A .function ~(x~ u, ~) is said to be uniformly completely observable (UCO) with respect to the system (8.20) if it can be expressed as a function of a .finite number o.f derivatives of the output y and the input u, i.e., if there exist two integers ny and nu and a function q2 such that, .for each solution o.f U (nu+l)
y
=
f(x,u,#(t))
~
V
=
h(x,u,#(t))
(8.21)
we have, .for all t where the solution makes sense, ~ ( x ( t ) , u ( t ) , # ( t ) ) = r ( y ( t ) , . . . , y ( ' ~ , ) ( t ) , u ( t ) , . . . ,u('~"')(t)))
(8.22)
where y(i) denotes the ith time derivative o.f y at time t (and similarly .for u(~)).
R e m a r k 8.3 As in [5, Footnote 6], note the strong requirement that 9 is independent of #(t). On the other hand, note that the functions ~i ,
q(~l,... , ( r , # )
for the system (8.9) are UCO since we can write
~ = y(~-l),
q(~l,...,
~T, ~(t)) = y(r) _ b(y)u A
Our next definitions, on uniform semiglobal practical asymptotic stabilizability by dynamic UCO or output feedback, are closely related to our definition of uniform semiglobal practical asymptotic stability. However, as was the case in [5], we don't insist that the states of the dynamic compensator eventually become small in the closed-loop. We formulate the definition in Lyapunov function terms but, again, the definition could be formulated in terms of trajectories. D e f i n i t i o n 8.2 The origin of (8.20) is said to be uniformly semiglobally practically asymptotically stabilizable by dynamic UCO feedback if .for each pair o.f strictly positive real numbers 0 < r < R < o~ there exist: 9 a UCO .function ~(x, u, #) 9 .functions 0 and n, 9 compact sets Cn~ and Cnz , with Cn~ a subset of the interior of Cvl, 9 an open set (9 D B n ( R ) x Cnl , 9 a .function V : (9 -~ ~ > o that is proper on (9, and
346
8. Dynamic UCO Controllers and Stabilization by Output Feedback
9 strictly positive real numbers 0 < q < Q < c~ such that
i.)
( ~ ( R ) • c,,) c {~ 9 o : v(r _< Q},
ii.)
(-B~(r) • c,s) ~ {~ 9 o : v(r
< q},
iii.) and OV o x F ( x , ~) < o
V#E7 ),
VXE {~EO:q
where X and F ( X , #) are defined by =: F ( X , p(t))
(8.24)
with u = ~ (~, a ( x , u , ~(t)))
(8.25)
(and where, .for simplicity, we assume the right-hand side of (8.25) is independent of u). D e f i n i t i o n 8.3 The origin of (8.20) is said to be uniformly semiglobally practically asymptotically stabilizable by dynamic o u t p u t feedback if, in
the previous definition, we can always take a(x, u, p) = h(x, u, #). R e m a r k 8.4 In these definitions, we could allow the right-hand side of (8.25) to depend on u if we impose an extra condition that guarantees a solution to (8.25). /~ It will follow from the proof of [5, Proposition 3.11 (much like what is suggested by [5, Footnote 5]) that we have: T h e o r e m 8.5 Let p(.) E M p be sufficiently smooth with a uniform bound
on an appropriate number of derivatives. If the origin of the system (8.20) is uniformly semiglobaUy practically asymptotically stabilizable by dynamic UCO feedback then it is uniformly semiglobally practically asymptotically stabilizable by dynamic output feedback. S k e t c h o f P r o o f . Fix 0 < r < R < c~. From the assumption of uniform semiglobal practical asymptotic stabilizability by dynamic UCO feedback, this fixes a UCO function a(x, u, #), a corresponding function 9 that is used to reconstruct a from derivatives of y and u, functions 0 and ~, compact sets C~s and C,~l, an open set O, a function V and strictly positive real
8. Dynamic UCO Controllers and Stabilization by Output Feedback
347
numbers 0 < q < Q < oc. Now we apply the proof of [5, Proposition 3.1] to the control system
=
(f(x,ul,#(t))
o(n,u2) )
Y2 where the UCO feedback
( n(~,~(x, ul,#(t))) ~(X,Ul,#(t)) )
(ul)__
u2
induces the properties for the function V that are assumed in the proof of [5, Proposition 3.1] if we define the objects 1Cz~, ~zl, pl, c~ and cl used in the proof of [5, Proposition 3.1] as
and
c~:--Q,
ul:=q,
cz:=Q+l.
From here we follow the proof of [5, Proposition 3.1], but noting that dynamic extension is only needed on the input Ul and no estimates of the derivatives of Y2 -- ~ are needed. /~
~{.2 Application to N o n m i n i m u m Phase Systems We now apply this general result to the problem considered in Section 3.2. We start with an assumption that is a combination of Assumption 8.3 and Remark 8.1. Assumption
8.6
The controller u~
=
i@,~a,2,y~)
=
N(~,x~,2,y~) ,
(8.26)
is such that 1. N(0, 0, 0) = 0
and, for simplicity, N(~, Xa,2, ha(xa, ua, #)) indepen-
dent o.f ua, 2. the origin of the system (8.10), (8.26) is uniformly globally asymptotically stable; 3. the .functions #(.) are restricted so that ON Oh~ .. .I
I
348
8. Dynamic UCO Controllers and Stabilization by Output Feedback
is bounded in t > 0, uniformly in #(.), on each compact subset of the state-space. Under this assumption, we can state the following result for the system (8.9) under the action of the controller .
;
L
\
/
u
=
-ksgn(b(
))
(8.27) -
xa,
,
Note that this is a dynamic UCO feedback for the system (8.9) since, as noted in Remark 8.3, x~,2, ~ and h~(x~, ~r, #(t)) are UCO with respect to the system (8.9). L e m m a 8.2 Under Assumption 8.6, the origin of the system (8.9), (8.27) is semiglobally practically asymptotically stable in the parameter k. P r o o f . Follows from the discussion in Section 3.1. (See also [5, Lemma 2.2 (Semiglobal backstepping I)].) A The final result then follows from Theorem 8.5 and Lemma 8.2. C o r o l l a r y 8.1 Under Assumption 8.6, the origin of the system (8.9) is semiglobaUy practically stabilizable by dynamic output feedback. The controller given by Corollary 8.1, which is constructed following the proof of Theorem 8.5, is different from the one given by Theorem 8.4 together with Remark 8.1. In particular, the controller of Corollary 8.1 has the form of an observer = P~ + Qy ,
(8.28)
like in (8.16) but with 7 / E / R ~+1, where z/~+l is an estimate of ~r, plus an estimated and saturated dynamic UCO feedback
it
=
V :
cre(k2(v- u))
(8.29)
-klsgn(b(~))(~r- N(~,3~a,2,~r- b(~)u)) ,
like in (8.19). Compared to the controller (8.28),(8.29), the controller (8.16), (8.19) together with remark 8.1 has one less state and can be interpreted as using a reduced-order observer structure to accomplish the goal of robust semiglobal practical asymptotic stabilization. In [5, Section 6.2], a particular nonminimum phase nonlinear system, whose auxiliary system (using the terminology of the present chapter) is semiglobally asymptotically stabilizable by (static) UCO feedback, was considered as an illustration of the result that semiglobal practical asymptotic stabilization by (static) UCO feedback implies semiglobal practical asymptotic stabilization by dynamic output feedback. The controller used in t h a t section is the type of controller suggested by Corollary 8.1.
8. Dynamic UCO Controllers and Stabilization by Output Feedback
349
5 Conclusions This chapter presented a simple design method by which it is possible to robustly stabilize, using output feedback, a significant class of uncertain nonlinear systems whose zero dynamics are unstable. The assumption made for such systems was shown to imply the existence of a stabilizing dynamic feedback that is driven by functions that are uniformly completely observable (UCO). In this light, the result for nonminimum phase nonlinear systems was shown to be a special case of the more general result that semiglobal practical asymptotic stabilization by dynamic UCO feedback implies semiglobal practical asymptotic stabilization by dynamic output feedback. The controllers developed in this chapter specifically for nonminimum phase nonlinear systems were compared and contrasted to the controllers that prove the general stabilization result.
Acknowledgements The first author was supported in part by NSF under grant ECS-9707891, by AFOSR under grant F49620-95-1-0232, and by MURST. The second author was supported by in part by the NSF under grant ECS-9896140 and by the AFOSR undergrant F49620-98-1-0087.
6
REFERENCES [1] A. N. Atassi and H. K. Khalil. A separation principle for the control of a class of nonlinear systems. In Proc. of the 37th IEEE Conf. on Dec. and Contr., pp. 855-860, Dec. 16-18, 1998, Tampa, FL. [2] A. Isidori. A tool for semiglobal stabilization of uncertain nonminimum phase nonlinear systems via output feedback, preprint, November 1997. [3] H. K. Khalil and F. Esfandiari. Semiglobal stabilization of a class of nonlinear systems using output feedback. IEEE Trans. on Automatic Control, 38 (1993), pp. 1412-1415. [4] Y. Lin, E. D. Sontag and Y. Wang. A smooth converse Lyapunov theorem for robust stability. In SIAM J. Control and Optimization, 34(1996), pp. 124-160. [5] A. R. Teel and L. Praly. Tools for semiglobal stabilization by partial state and output feedback. SIAM J. Control Optim., vol. 33, no. 5, pp. 1443-1488, September, 1995.
350
8. Dynamic UCO Controllers and Stabilization by Output Feedback
[6] A. R. Teel and L. Praly. Global stabilizability and observability imply semi-global stabilizability by output feedback. Systems ~ Control Letters, vol. 22 (1994) 313-325. [7] A. R. Teel and L. Praly. A smooth Lyapunov function from a class-/EL: estimate involving two positive semidefinite functions. In preparation.
F a u l t D e t e c t i o n O b s e r v e r for a Class of Nonlinear Systems S. A. Ashton and D. N. Shields School of MIS, Coventry University, Coventry, United Kingdom.
1
Introduction
Different approaches for fault diagnosis, based on both hardware and software, have been given, as partially described in [7] [2]. Related work on various classes of observers has been proposed, including bilinear observers [10] [3], quasi-linear observers [2] [4], nonlinear high-gain observers [6] [9], nonlinear canonical-form observers [6] [11] and nonlinear observers, based on the existence of linearizing transformations [9] [5]. Many design methods for the robust residual generation, involving observers, rely on the solution of complex partial differential equations [2] [8] or the solution of a set of algebraic-differential (or algebraic-difference) equations [7] [4]. Observer designs obtained in [10] [3] [4], based on bilinear system models, involve only the solution of nonlinear algebraic equations or Ricatti equations, and on-line residual generation is possible for low-dimensional systems. Similar propositions are reported in [1] for systems modelled by quadratic polynomial models. However, a complete theoretical analysis is lacking of the problems associated with observer stability, fault-sensitivity, disturbance decoupling and numerical tractability of the combined observer and residual generator. This chapter extends the work in [10] and [1] and provides a design for a nonlinear fault detection observer for a class of polynomial systems such that the corresponding observer error dynamical system is linear for the fault-free case, but otherwise, nonlinear with respect to inputs, outputs and faults. A set of theoretical propositions are given for establishing the robustness, stability, existence and detectability properties of the observer, called a robust fault detection observer (RFDO). Many practical systems fall into the class of systems considered here [10] [3] [4] [1]. This chapter is organized as follows. A system description is given followed by the main section, Section 3, on observer design. Section 4 gives general sufficient conditions for detectability and Section 5 gives testable conditions for detectability including a special class of faults. Concluding remarks are given in Section 6.
354
1. Fault Detection Observer for a Class of Nonlinear Systems
2 System Description A continuous-time s y s t e m is considered with p o l y n o m i a l nonlinearities u p to degree three. T h e s y s t e m m o d e l considered is t h a t of the form m
Jc(t) = Ax(t) + Red(t) + K a f ( t ) + Bu(t) + E
i u i (t)A~,x(t)
i----1
+~
xi(t) [Axxx(t) i + Rid(t) + K i f ( t ) ]
i----1
+
~xi(t)xJ(t)
[Ai~J~xx(t) + Rind(t) + K~Jf(t)],
(2.1a)
i=1 j = l
y(t) = Cx(t) + K s f ( t ) + E~d(t) = x l ( t ) + K s f ( t ) + Esd(t),
(2.1b)
where x(t) E R '~ is the s t a t e vector, p a r t i t i o n e d as, x(t)
~- [X 1 ( t ) t , x 2 ( t ) ' ] '
,
where xl(t) C RP, x2(t) C N n-p, u(t) C Nm is the input vector, y(t) C R p is the o u t p u t vector a n d d(t) E I~q is a disturbance. Here, the fault v e c t o r is p a r t i t i o n e d as
f ( t ) = [fa(t)', f s ( t ) ' ] ' e N v, where v = vl + v2, fa(t) E ]RTM represents c o m p o n e n t or a c t u a t o r faults a n d fs(t) E I~TM represents sensor faults. For convenience, w i t h o u t loss of generality, the c o n s t a n t matrices in (2.1a) a n d (2.1b) are p a r t i t i o n e d as i i C = [Ip,Opx(,~_p)], A = [A1,A2I, A~: = [AI~,~,A2~] ,
A~x = [A t . . A. ~.x ]
~ij 1 ij = [~ij A~x~ L"I . . . . "2xx~j
(2.2)
w h e r e C E I l ~ pxn, A1, Al~x(z i 9 = 1 , - - . ,n), A lijx ~ ( , , 9j 9 = 1,... ,n), Ai~x(i 9 = id 9 9 = 1 , . - . ,n), 1,..-,m)e It~nxp a n d A2, A 2i ~ ( z . = 1 , . - - , n ) , A2xx~(~,y Ai2~(i = 1 , - . - ,rn)C N '~x('~-p). Also, B C IR'~xm, Ea C R nxq, Es @ R pxq a n d h a , /(8 E N px(v) a r e all constant matrices. It is a s s u m e d (2.1a) is controlled by the input u(t) so that: Assumption
1.1 u(t) is bounded and such that for bounded d(t) and
f(t) = 0 (all t) x(t) exists and is bounded.
3
Observer Design
For the s y s t e m described by (2.1a)-(2.1b) an observer is proposed. T h e observer is linear in z(t) a n d involves bilinear, q u a d r a t i c a n d cubic t e r m s
1. Fault Detection Observer for a Class of Nonlinear Systems
355
in y(t) and u(t). T h e candidate observer is given by m
~(t) = F z i t ) + Juit ) + E u i ( t ) H ~i Y i t ) + Hy(t) i=l P
P
P
+ E Y i ( t ) g ~ x Y i t) + E E y ' ( t ) y J ( t ) H ~ x y i t ) , i=l
i3.1)
i=l j=l
where z(t) E ~ d is a linear estimate of T x i t ). A fault detection signal is defined as
e(t) = L,z(t) + n2y(t),
where
c
d~
(3.2)
> 1).
D e f i n i t i o n 1.1 The system (3.1)-(3.2) is a robust fault detection observer
(RFDO) with respect to the system (2.1a)-(2.1b) for a class of faults, CI, i.f 1. for all bounded u(t), dit), and z(O ) (t >_ 0), and with f(t) ---- O, the error dynamic .for e(t) is assymptotically stable, so that lim e(t) = O,
t----~OO
lim e(t) = O,
t----~OO
where e(t) = z(t) - Tx(t) and e(t) is given by (3.2). 2. for all bounded u(t), d(t) and y(t) (t >_ O) there exists at least one .fault vector f(t) # O, f i t) E Cf, such that
# o, (t >_ to) with e(to) = O. 1.1 Assumption 1.1 and condition Definition 1.1(I) ensure that z(t) exists and is bounded for zero faults and bounded ziO ). (Note Cf is not restricted here.) Condition Definition 1.1(1) also ensures that both the observer error and fault signal converge to zero for any disturbance when no .faults are present. Condition Definition 1.1(2) ensures that at least one .fault exists, which can be detected for all bounded nit), dit ) and Yit), given that at a specific time the error signal was zero. Remark
D e f i n i t i o n 1.2 The RFDO (3.1)-(3.2) is called a strict R F D O (SRFDO)
if.for a class f(t) E Cf, with f(t) • O, and any bounded set {u(t), d(t), y(t)}, there exists some to >_ 0 such that, given e(to) = O,
c(t) # o, (t > to).
356
1. Fault Detection Observer for a Class of Nonlinear Systems
R e m a r k 1.2 An SRFDO ensures that all f(t) E Cf can be detected, al-
though not necessarily distinguishable. Design Problem: Find matrices F, J, H, T, H ~ ( i , . ,. m),. H ~.i( z , 9 - - , p ) , H~x~(z, i 9 99- ,p), L1 and L2 and parameters d and do so that Definition 1.1 is satisfied. First condition Definition 1.1 (1) is considered. Condition Definition 1.1(2) is addressed in Section 4. Consider the observer error
e(t) = z(t) - Tx(t).
(3.3)
Using (2.1a)-(2.1b), (3.1) and (3.3) there obtains
~(t) = n I (t) + B I (t) + QU (t) + CU (t) ,
(3.4)
where
L I (t) = Fz(t) + Ju(t) + Hy(t) - T lAx(t) + Bu(t) + Ead(t) + Kaf(t)], m
(3.5a)
m
u (t)H~y(t) - T E u (t)A~x(t), i=1
(3.5b)
i=1
P
QU (t) = E yi(t)H~y(t) i=1 n
- T E xi(t) [A~x(t) + Eid(t) + g~ f(t)] ,
(3.5c)
i=1 P
P
CU (t)= E
E yi(t)yj(t)H~xy(t) j=l
i=1
n
- TE i=l
n
E xi(t)xJ(t) [A~xx(t) + EiJd(t) + g i J f ( t ) ] , (3.5d) j=l
where (3.5a)-(3.5d) correspond to linear, bilinear, quadratic and cubic terms, respectively, in x(t). Using (3.4) and (3.2) the following proposition holds true. 1.1 If T (T ~ 0), J, H, F, L1 (L1 ~ 0), L2, Hil~x(i = 1,... ,m), H~x(i -- 1,... ,p) and H~x~(i --= 1,... ,p) can be .found such
Proposition
1. Fault Detection Observer for a Class of Nonlinear Systems
357
that the following conditions are satisfied (for some do, d _> 1)
0 > ~ e ( A i ( F ) ) ;i = 1,-.. ,d
(3.6a)
H = [TA - F T ]
(3.65)
J = TB
(3.6c) (3.6d)
L2 = - L 1 T ( ~ Od,(n--p-kq)
= [TA - F T ] D - TEor
(3.6e)
(3.6f)
Od,(,~-v+q) = L1Tf} H~x Od,(n--p-kq)
= T A ~ ,i i
.
i
=TA~;z
= 1,... ,m
(3.6g)
= 1,-.- , m
(3.6h)
2H~ =TB;~'~;i= Od,(n--p-.bq) Od,(n--p)
(3.6i)
1 , . . . ,p
E I F ) ; i = 1,'-. ,p
=T(B~-
(3.6j)
= TBix+Pq2; i = 1 , . . . , n - p
Od,v = T K i + P ; i - =
l,...
(3.6k)
,n-p
(3.61)
Od,q-=TE~+P;i = 1,--- , n - p
(3.6m)
Ou,q = T [K1, . . . , K p] [Ip | Od,q = T [El,
6 H ~0
(3.6n)
= 1,... ,v
- "" , E p ] DE.,,Z i . 9 = 1 , " " ,q
--TB~,0
(3.6o)
9 i , j = 1,. .. ,p
Od,(=-p+q) = T ( B ~ J ~
(3.6p)
- [Eij + EJi]F) ; i , j = 1 , . . . ,p
d,n ~ 1 1,7,DiTpj+p. 9xxx ~i ~a4 ~
1,
. . ., n.
(3.6q) (3.6r)
P
Od,q = T [ E ij+p + EJ+Pl];i = 1,.-. , n ; j ----1,-. 9 , n - p
(3.6s)
Od,q = T[KiX, ' ' ' , KiPl[Ip | l~(j)]E~; i, j = 1 , . . . , v
(3.6t)
Od,~ = T [ K O+p + KJ+Pi];i = 1,--- , n ; j = 1,--- , n - p
(3.6u)
Od,q = T [ E ~1 + E l i , . . . , E ip + ElP]DJE~; i , j = 1 , . . . ,q, (3.6v) where l~(i) = [0,... ,0, 1, 0,.-- ,0]' E ]R'~ is the i-th unit vector, | K r o n e c k e r product and B i x x _- A x ~i
+ [ A I ~ , " " " , A ~n]
B ~i jx = ( A x x xij+ A ~ x x +j i[ [ A x ~ + A ~l j]
+ [[A~x + A ~ ]
is the
[In | jl
, ... , [ An~j + A j~n ] ]
[In |
, - - . , [ A ~ + A ~ x ] ] [I~ | l~(j)]),
D ~ = [Ip | lq(i)] E~ + [[Ip |
...
lip | lq(q)]] [Iq | Eflq(i)]
(3.7)
358
1. Fault Detection Observer for a Class of Nonlinear Systems
and where
o= r
O(,~_p)•
=
[~,
I,~_p
[ O(n--p)Xq
In--p
oq,(~_p)],
'
(3.8)
then e(t) and e(t) are implicitly decoupled from d(t) and satisfy, respectively, ~(t) = Fe(t) + W(t)f(t),
(3.9)
where W(t) is of the form W(t) = LI* + BI*(t) + QU*(t) + CU*(t), where LI* = [HKs - TKa] ,
(3.10a)
m
i Be* (t) = E u i (t)H~xK~,
(3.10b)
i=1
P
QU* (t)= E
(yi(t) - Ip(i)'Ksf(t)) [2H~xK~ - T K i]
i=l P
+ E tp(i)'K~f(t)H;~K~,
(3.10c)
i=l P
P
c u * (t) = E E l(i)'Ksf(t)l(j)'Ksf(t) [H~xK~ - T K ij] i=1 j=l P
P
+ E E Yi(t)l(j)'K~f(t) [T[Kij +Kji] - 3 H ij~ K ~ ] i=1 j=l P
P
- T K ij] + ~ , ~-~yi(t)y~(t) [3H)~K~ ij
(3.10d)
i=1 j=l
and e(t) = L1 [e(t) - TOKsf(t)] .
(3.11)
P r o o f . Firstly, consider the fault detection signal, e(t), given by (3.2). Using the partition T -- [T1,T2], where T1 E ~ d • and 7"2 E ]Rd• (2.1b) and (3.3) the signal e(t) can be expressed as
e(t) = Lie(t) + L2Ksf(t) + L2Esd(t) + [L1T1 + L2] x~(t) + r~T2x2(t).
1. Fault Detection Observer for a Class of Nonlinear Systems
359
Now, consider L I (t) given in (3.5a). Using the partitions given in (2.1b) and (2.2), (3.5a) expands to L I (t) = Fe(t) + [FT1 - TA1 + H] xl(t) + [FT2 - TA2] x2(t) + [J - TB] u(t) + [HKs - TK] f ( t ) + [HEs - TEa] d(t). The sufficient conditions for LI(t) and c(t) to be independent of x(t), d(t) and u(t) are then given in (3.6b)-(3.6f). When these conditions hold true LI(t) and e(t) become respectively L I (t) = Fe(t) + LI* f ( t ) , where LI* is given in (3.10a) and e(t) = Lie(t) + L 2 K s f ( t ) . Now consider the bilinear terms in (3.5b). Using (2.1b), (3.3) and the partition for A,x i , in (2.2), there obtains m
m
B I (t) = E
u~(t) [H~,x - T A Ii ~ ] xl(t) + E
u i (t)H~xE~d(t)
i=l
i=1 m
i u i (t)A2~x2(t) + BI* (t) f(t),
__T E i=1
where BI* (t) is given in (3.10b). The sufficient conditions for BI*(t) to be independent of x(t) and d(t) are given in (3.6g)-(a.6h). When these conditions hold true B I ( t ) reduces to B I * ( O f ( t ) . To obtain the most general conditions for the observer to exist, given in Proposition 1.1, the non-unique structure of the polynomial forms used in (2.1a) and (3.1) must be considered. It is assumed, without loss, that Hix and H~J~ have unique forms which satisfy the conditions HIx = [ H ~ , . . . ,HPxl[I | lp( i)]; i = 1 , . . . ,p, ij ji li H**~ = H~x x = [H~x~, 9.. lj
= [H . . . . . .[Hg~, .j l .
.
pj
, g. ~ ] [. I
pi
N/p(j)] il
| lp(i)] .
.- , H ;ip~ ] [ I | lp(j)]
jp ,HJxx~][IQlp(i)];i,j = 1 , . . . ,p.
Next, consider (3.5c) which can be expanded partially in terms of x l ( t ) , x2(t), d(t) and f ( t ) using (2.1b), (2.2) and p
x i (t)A~x= , - E i=1
n--p
E i=1
i=1
n--p
x l' ( t ) A x' x -t- E x 2 ( it ) A ~ ,+p x , i=l
p
x 2' ( t ) A'+P l x x x l ( t )= - E i=1
x li ( t ) [ A l+p l x ~ , . . . , A ~ ] [ I | Ip(i)]x2(t),
360
1. Fault Detection Observer for a Class of Nonlinear Systems
where xl(t) and x2(t) are independent. Hence, (3.5c) can be written as
QU (t) = QU~ (t) + QUid (t) + QUxdf (t) + QU* (t) f(t), where QU* (t) is given in (3.10c) and where p (t)
n--p
xi(t)[H~ x
=
TA~xxlxl(t )
-
x2(t)A2~x2(t ) i+p
TEi
-
i=1
i=1 P
-
~(t)[A2~x ~ -~- r41+p [''lxx,''"
TEx
,
AI~][In-p | lp(i)]lx2(t),
i=l p
n--p
QUxd (t) = E
xil(t) [2H~E8 - T E i] d(t) - T E
i=1
x~ (t)Si+Pd(t)
i=1
P
+E
lp(i)'E~d(t)H~Esd(t),
i=l p
n--p
QUxdf (t) -= T E
lp(i)'E*d(t)Ki f(t) - T E
i=1
x~(t)Ki+P f(t)"
i=1
Sufficient conditions for QV(t) to be independent of x(t) and d(t) are thus
2H~x-- TBxxO;~ -- 1,... ,p l
Od,(n_p)
~--
*
TB~+Pq2; i --- 1,.. 9 , n - p
Od,(~-p+q) = TB~q2; i ---- 1,..- , p Od,v =TK~+P;i = 1,... ,n - p 2 H ~ E 8 = T E ~ ; i = 1,... ,p Od,q=TEi+p;i
=
1,... , n - p
(3.12a) (3.125) (3.12c) (3.12d)
(3.12e) (3.12f)
Od,q=T [K1, 9" , K p] [IpQlv(i)]Es;i = 1,... ,v
(3.12g)
1 " , H ~p ] lip @ Es]D iEs,"i = 1,.. " ,q, Od,q= [H~x,"
(3.12h)
where (I) and ko are given in (3.8) and D iE8 is given in (3.7). In obtaining the conditions (3.12a), (3.12b) and (3.12h) the following result is used: n
O=-Ex~(t)A~x(t),
iff
0 = A i + [A1, ... ,A~][In |
(3.13)
i=l
Using (3.12a), (3.12e) can be combined with (3.12c) to give (3.6j). Substituting (3.12e)into (3.12h) gives (3.6o). Hence, sufficient conditions (3.12a)(3.12h) reduce to (3.6i)-(3.6o) which, if true, imply that QU(t) reduces to QU*(t)f(t).
1. Fault Detection Observer for a Class of Nonlinear Systems Next, consider the expansion of (3.5d) in terms of xl(t), by using (2.1b), (2.2) and the equivalences
d(t)
p n--p
i=lj=l P =--
P
EE
x~ (t)mlj ( t ) [ m, i~~I,+` .P.
AI.~]~,~[I |
Ip(j)]x2(t),"
i=1 j=l P
P
Z~
x~(t)lp(j)'E.d(t)HiJ.~E~d(t)
i=l j•l P
q
~1
9 H ; = ] IS | EJ~(j)]
act),
i=1j=-1 rt
Tt
EE
xi(t)xJ(t)A~
i~l j=l p
p
p n--p
~(t)~(t)A~ i=Ij=l n--p
i~1~1
p
n--pn--p
x2(t)xl(t)A~.__ __
+
i=1 j=l
where
xl(t)
x2(t)
and
~
j
i+py+p
i=1 j=l
are independent. There obtains
CU (t) = CU~ (t) + CU~d (t) + CU~ei (t) + CU* (t) f(t), where
CU* (t) is
given in (3.10d) and where
P
P
i~I
j=l
(t)[H;~ -
TAlx~]xl(t )
n--p n--p
- T E E x2(t)x2(t)d2~x~'J
i+m+Px2(t)
i=1 j=l n--p n--p
i=I
j=l
p
p
i=1 j-~l
where
F;j ~-
[A i+pjTp [''lxxzc
[[AljTP 4- A j+pll 2xxx],
+ itx~'2mxx - -
f~nj+p aj+pnll[In |
" " " , L'eX2xxx + "~2xxx JJ
361 and
f(t)
362
1. Fault Detection Observer for a Class of Nonlinear Systems
and, also, where P
P
CU~d (t)= E E x~ (t)x~ (t) [3H~xEs
-
T E ij] d(t)
i=l j=l
p
n--p
- T E E x~ (t)x~(t)[E iJ+p + E j+pi] d(t) i=1 j=l
n--p n--p -
T E E x~(t)x~(t)E~+PJ+~d(t) i=1 j=l P
+ E
P
[3xl (t) + l(i)'E~d(t)] l(j)'E~d(t)HiJ~Ssd(t),
E
i = 1 j----1 P
P
cu~es (t) = T ~ ~ [2x~ (t) + l(i)'E~d(t)] l(j)%d(t)K~J f(t) i=l j=l p
n--p
- T E E x~(t)x~(t) [K ij+p + K j+pi] f(t) i=l j=l
n--pn--p -
T E E xi2(t)xJ~(t)Ki+PJ+Pf(t)" i=l
j=l
A set of sufficient conditions for CU(t) to be independent of x(t) and d(t) is then ~j
6H~ x = TB~;i,j = 1,... ,p --TY#+PJ+PffI"i,j 1, . . .,n. Odin - - "L a J x x x ~1 =
(3.14a)
p
(3.14b)
Od,(n-p+q) = T B ~ q 2 ; i,j = 1,.-. ,p -- rrl~i+PJTPm. Z d,n--~Z)xx x "~,~,j ~ 1,''"
(3.14c) (3.]4d)
,n --p
6H~iE~ = T[E ~i + E3~];i,j = 1,... ,p
(3.14e)
Od,q
--TIE ij+p + EJ+Pi];i = 1,--- ,p;j = 1,.-- , n - p (3.14f) = T[E i+pj+p + EJ+Pi+P];i,j = 1,... , n - p (3.14g)
Od,q
= T [ K i l , -.. ,KiP][Ip|
Od,q
= 1,... ,v
(3.14h)
Od,(v)
= T [ K ij+p + KJ+Pi];i = 1,..- , p ; j = 1 , . . . , n - p
(3.14i)
Od,(v)
= T[Ki+pJ+B § KJ+pi+p];i,j = 1,.. . , n - p
(3.14j)
= [g~l, .-- , H~P][I~ | E~]DiE~;i,j = 1,..- , q,
(3.14k)
Od,q
1. Fault Detection Observer for a Class of Nonlinear Systems
363
where 9 and ~ are given in (3.8) and D ~ Es is given in (3.7). In obtaining (3.14c)-(3.14e), (3.14g) and (3.14j) the following result is used:
O=
x~(t)x~(t)AiJx2(t),
iff
0 ----A ~j + A ji,
i=l j~l
where xl(t) and x2(t) are independent. Also, for deriving (3.14a) and (3.14b) the following is used n
n
0=EExi(t)xJ(t)AiJx(t),
iff
i=l j=l
0 = A ij + A j~ + [[mil + A l i ] , ... , [Ain + A~']][In | In(j)] + [[Aj ' + AlJ], ..- , [Ajn + AnJ]][In | l~(i)]
(3.15)
and (3.14k) is obtained by using (3.13). Using (3.14a), (3.14e) can be combined with (3.14c) to give (3.6q). The conditions (3.145) and (3.14d), (3.14f) and (3.14g), and (3.14i) and (3.14j) can be combined to give the conditions (3.6r), (3.6s) and (3.6u), respectively. Finally, using (3.14e), (3.14k) can be written as (3.6v). Thus, sufficient conditions (3.14a)-(3.14k) reduce to (3.6p)-(3.6v), which, if true, imply that CU(t) reduces to CU*(t)f(t).V7 R e m a r k 1.3 If (3.6a)-(3.6v) hold, then ~(t) by (3.~), is independent of
d(t), and x(t). A subset of these conditions have been used in [10] [3] [4] .for bilinear systems and in [1].for quadratic systems. The full set is compact, much more general and the non-uniqueness of polynomial forms has been addressed.
4
General Detectability Conditions
A general set of sufficient conditions will be given for a REDO and a SRFDO to exist for f(t) C C I, where C I is defined as the restricted class
Cf = {f(t); f(t) = a.q.q(t)}, (to _< t ~ to + h),
(4.1)
where g(t) ~ 0 is a scalar function and _.a E IRv, _.a r 0. P r o p o s i t i o n 1.2 Assume (3.6a)-(3.6v) are satisfied. Then for class C I (h =
1. (3.1)-(3.2) is a REDO i.f for some f(t) E C I (a) Hl(s,a_.)X(s) ~ 0 for all s and (b) rank(Hl(s,a)X(s), H2(s, a)) r rank(H2(s,a)) for all s,
364
1. Fault Detection Observer for a Class of Nonlinear Systems
where ~ is the s and
operator with respect to time T, T = t -- to,
H1 (s, ~) -- L1 [(sI - F ) - I [jg, Jgg, Jggg] - T O K ~ [ I , O, 0]], (4.2a) X(s) = s
g2(t), g3(t)]', (t = 7- + to)
(4.2b)
H2(s,_a) = L l ( s I - F ) - l [ J u , Jy, Jyv, Jy]
(4.2c)
and where Jg = [HK, - TKa]a_,
(4.3a)
P
Jgg = E lp(i)' K , ~ [TK' - H' Ks] a_.,
(4.3b)
i=l P P
J99g= E
E
lp(i)'Ksalp(j)'Ksa_[HiJKs - TKiJ] a_,
(4.3e)
i=1 j=l
J~ = [H~x, 9- - , g m] [Ira | Ks_~],
(4.3d)
Jy = ( 2 [ H L , - 9 9 g L ] [ z p | K~] -
T[K1, ... , KP]) lip | hi,
J y y = [3[g~xx, 11
...
(4.3e)
l p + g~xxx,... pl ,H~cxx ,gppxx][X~(p+2) |
T [ K l l ... , K ip + KP~,- 9 9 KPP]] [I~ (p+2) | P
JY = E lp(j)'Ksa_ ( T [[K lj + Kill, ... , [K pj +
(4.3f)
K3Pl]
j=l
. 3[H~x~, . . lj.
pj
, H~x]
[Ip
| K~]) [Ip | _a].
(4.3g)
2. A SRFDO exists if (la) and (lb) hold for any f(t) C C/. P r o o f . Considering the class Cf and taking the Laplace transform of the residual in (3.11) there obtains (t = T + to) ~(S) = L1 [~(s) - T~Ksa_g(s)],
(4.4)
where ~(s) = 1:e(7) and g(s) = Ce(T). From (3.9),
~(s) = (sI -- F ) - l s Expanding the summations in W(t), (4.4) can be written in the m a t i x form
f(s) = Hi (s, ~ ) X ( s ) + H2 (s, a_)Y(s), where Hl(S,_a), X(s) and H2(s,_~) are given in (4.2a)-(4.2e) and where
Y(s) = s
y'(t)g(t), yy(t)g(t), y'(t)g2(t)] '
1. Fault Detection Observer for a Class of Nonlinear Systems
365
and where
yy(t) = [(yl (t))2,... , yl (t)yP(t), (y2(t))2,.. " , y2(t)yP(t),... , (yP(t))2]. [] R e m a r k 1.4 Condition Proposition 1.2(la) is only testable if g(t) is known
which is usually not the case (only ~ is known).
5
Testable
Detectability
Conditions
A set of testable (numerically tractable) sufficient conditions will be given in this section for C I defined in (4.1) for fixed h _> 0. P r o p o s i t i o n 1.3 Assume (3.6a)-(3.6v) are satisfied. Then
1. system (3.1)-(3.2) is a RFDO if there exists at least one f(t) E Cf such that (a) L~TOK~_ r 0 or
(b)
i. J~a* # O and i~. Rank(J;a*, J~) r Rank(J~), where J~ E N d~215 and J~ E ]~do• are constant and G* E N ad+l depends only on g(t), where J~ = L1 [-TOKsc~, F* ([Id | Jg], lid | Jgg] , [Id | Jggg])] , (5.1a) G* = [g(to + h), Gg, Ggg, Ggggl' ,
(5.1b)
J~ = L1F* [[Ia | J~], [Ig | Ju], [Id | ]y], [Id | Jyy]],
(5.1c)
N=d(m
+-~P(p + 5 ) ) ,
(5.1d)
and where Jg, Jgg, Jggg, Ju, Jy, Jy and Jvy are given in (J.3a)(4.3g) and
F* = e Fh [F~ Cg . . [G . O .,
, Fd-1], , G gd - - l]' ,
. . . . Ggg [G~
G.~. = [a~
...
(5.2a) (5.2b)
(~.d-1] t ,_gg j ,
(5.2c)
Cd-ll '
(5.2d)
~~ g g g J
366
1. Fault Detection Observer for a Class of Nonlinear Systems
and where Gkg = ~0 h ak(T)g(7 + to)dT,
(5.3a)
h
Gkgg= / Ggggk :
ak(T)g2(7 + to)dT,
(5.3b)
ak(T)g3(T +
(5.3c)
to)dT.
2. system (3.1)-(3.2) is a SnFDO for the class Cf if (la) or (lb) holds true .for any f(t) E Of. P r o o f . By assumption, for is
f(t) E CI, the residual from (3.11), at t = to+h,
e(t0 + h) = L1 [e(t0
+ h) - TOK~_.g(to + h)].
(5.4)
Solving (3.9) with e(to) = 0 and using the Cayley-Hamilton t h e o r e m [6], ak(7) (k = 1 , - . - , d 1) exist such t h a t
d-1 e -F~r ~_ E a k ( T ) F k ,
k=0 and then
e(to + h) can be written as hd-1
e ( t o + h l = e F h fo EakO-)FkW(T+to)a_g(w+to)dT.
(5.5/
k=O T h e error, (5.5), can be written as
d--1
(
k=0 ~'rt
+E
p
- ~+E ~txKs~V;
i=1 P P
+EE
i=1 j=l
[2Hx~xKs- TK ~] -~Y;~
i=1
[3H:~Ks - T K ij] a_Yigjk '~
+ E E I ( j ) ' K ~ a _ ( T [ K i J + K ji] - 3H~K~)a__Y~ i=l j=l
)
,(5.6)
1. Fault Detection Observer for a Class of Nonlinear Systems
367
whe,~ a~, a ~ ~nd a G ~e g~ve~ i~ (5.3a)-(5.3c), J~, J~9 and J ~ given in
(4.3a)-(4.3c), G k=
~e
and where
/o ak(T)r
+ to)g(r + to)dr,
(5.7a)
r~;k= ~ ~kO_)y~(r+ to)g0 + to)dr,
(5.7b)
/,
h
h
r[;Jk = fo ak(r)Y~O- + t~ I"
+ to)g(r + to)dr,
(5.7c)
+ to)d~.
(S.7d)
h
Y;'2 = Jo ~(r)y'(~ + t~
The summation signs can be eliminated in (5.6) giving
e(to + h) = J;G" + J;Y*,
(59
where J~', G* and J~ are given in (5.1a)-(59 and Y* E IRu depends upon {u(t),y(t),g(t)}, where Y is given in (5.1d) and where
r* = [G, r~, ro~, ~9]'
(5.9)
and where ,
,vg
,
..
[]-l(d--1)
...
Ug(d-1)]
t
(5.30a)
(59 9 , yl(d--1) ~gg
rg_ [y ,o 9
yip0 + y;10
,''"
, Y;g(
,v;p0 9
d--l)] t ,
(5.1oc)
v l(d-1) (5.30d)
where U~k, ygk, ygjk and yg~gkare given in (5.7a)-(5.7d) above9 [] C o r o l l a r y 1.1 Let the assumptions of Proposition 1.3 hold so that (5.8)
holds true. 1. If J~ =_ O, for all a.q., then system (39 class Cf.
is not a RFDO for the
368
1. Fault Detection Observer for a Class of Nonlinear Systems
2. J{ -- 0 .for any a_ ~ 0 iff all the following hold true: O-=LIT~Ks,
(5.11a)
0 = L~ [HK~ - TKa],
(5.11b)
0 = LI* (T[KI, ..
. , K p] _
[Ip | K~]DJKs
0 = L~/~/( [Iv~ |
+
[Hx~x, I ... , HPx][/p | K~]) D iK~' (5.1 lc)
|
|
DL ,[Ip,~Qlv(v)]] [G~ + G ~ , ] ) ,
+ [[Ip~ |
(5.11d)
where (5.11a)-(5.11d) hold for k -- 1,... ,d - 1, where L~ = L l e F h F k, I(=
(5.12)
[Hl~x,... , H i p s , . .
-T[Kll, .-.
,Kip, ..
, g ~ l x , . . 9 ,HPP~] lip2 @Ks] ,Kpl, ... , K pp]
and where D Ks ~ = [I~ | lv(i)] K~, + [[Ip | lv(1)],--.
[Ip |
[I|
G~, = [Ivp | Kflv(i)] [Iv | Kflv(j)] ,
(5.13a) (5.135)
where i , j = 1,... ,v. P r o o f . From (5.8) if Y* = 0 then e(t0 + h) = J~G* and hence, Corollary 1.1(1) holds true. Now, also, J~ _~ 0 for any _~ ~ 0 iff
0 =- LITOK~_, 0 =- n~ [Jg, Jgg, Jggg], for k = 1,-.- , d - 1, where L~, Jg, Jgg and Jggg are given in (5.12), (4.3a)(4.3c). Using the equivalences given in (3.13) and (3.15) these equations can be written as (5.11a)-(5.11d) and hence, Corollary 1.1(2) holds true. []
5.1
A Special Class ( S t e p - F a u l t s )
Consider the class of faults Cf described in (4.1) where g(t) = 1 (to < t < to + h). Thus, step-type faults are considered along direction _a. P r o p o s i t i o n 1.4 Assume (3.6a)-(3.6v) are satisfied. When g(t) = 1 Proposition 1.3 can be simplified to
1. system (3.1)-(3.2) is a RFDO if there exists at least one f(t) C Cf such that
1. Fault Detection Observer for a Class of Nonlinear Systems
(a) LITr
369
#0
or
(b)
i. 3 ; # 0
and ii. Rank(Jr, J~) # Rank(3~), where J~ E IRd~ and 3~ E l~d~ J~ = L*I [Jg + Jg~ + ']ggg]
-
where -
L1Tg2Ks~_,
(5.14a)
J~ = L1F* [lid | J~], [Id | [Jy + Jy]], [Id | Jyy]] , (5.14b) P M=d (m+ 5(p+3)) , (5.14c) and where L~ = L~F -1 [e ~ - ~ ]
(5.15)
and Jg, Jgg, Jggg, Ju, Jy, Jy and Jyy are given in (~.3a)-(~.3g). 2. system (3.1)-(3.2) is a SRgDO.for the class CI, i.f.for any f(t) 6 Cf , (la) o~ (Ib) holds t~.e. P r o o f . From proof of Proposition 1.3, when g(t) = 1, the residual in (5.4) can be reduced to
e(to + h) =L1 [e(to + h) - T O K ~ ] ,
(5.16)
where
e(to + h) = e Fh fo
ak(~')FkW (T + to)a_dT.
(5.17)
From Proposition 1.3, when g(t) = 1, there follows h
G~ = Gkgg -= akggg = fo ak(7)dT and this leads to d-1
h
h
E ffO ak(7)Fkd~'= fo e-F~d~k=O
_-F-1 [I_e-Fh].
(5.18)
Also, note that when g(t) = 1 (5.19)
370
1. Fault Detection Observer for a Class of Nonlinear Systems
Using the equivalences in (5.18) and (5.19), (5.17) becomes e(to + h) = F -1 [eFh - Id] [Jg + Jgg + Jggg]
k=O
i=1
+ E
i=1
I(j)'Ksc~ (T[K ij + K ji] - 3 H ~ x K s ) a_.Yik
j=l "~- ~
[3HixJxKs-TKiJ] o~yijk)
where h
U ik =
ak(T)U~(T + to)dT,
(5.20a)
y~k = In ak('r)Y~(T + t0)dT,
(5.205)
Ph yijk --_
/0
ak(w)yi(7 + to)yJ(w + to)dv
(5.20c)
and Jg, Jgg and Jggg are given in (4.3a)-(4.3c). These results can then be used to write (5.16) in the form e(to + h) = 3; + 3~1>*,
(5.21)
where 3~ and 3~ are given in (5.1a) and (5.1c) and 1~* C ~M is defined as
where U =
[
y_
[y10,...
ul~
... ,
]p=[ynO,..
umO, "'" , u l ( d - 1 ) , ypO,...
,yl(d-l),...
"'" , U rn(d-1)
]'
(5.22a)
,
,rP(d-1)]',
,YlP~ + ypl~ ... ,YPP~
yll(d-1), .. , y l p ( d - 1 ) + Y P l ( d - 1 ) " "
(5.22b)
, ,YPP(d-I)
]' ,
(5.22c)
where U ik, Y i k and yijk are given in (5.20a)-(5.20c) above. [] C o r o l l a r y 1.2 Let the assumptions of Proposition 1.4 hold so that (8.21) holds true.
1. Fault Detection Observer for a Class of Nonlinear Systems
371
1. If 3~ -- O, all a_., then system (3.1)-(3.2) is not a RFDO for class, Cf of .faults. 2. 3~ =- 0 .for all ~ ~ 0 iff (5.11c), (5.11d) hold true, with L~ replaced by L~ in both, and
(5.23)
0 = L~ [HKs - TKa] - L I T ~ K s holds true, where L~ is given in (5.15).
P r o o f . Corollary 1.2(1) holds by letting Y* = 0 in (5.21), for which e(to + h) = J{'. If 3{" - 0 for all _~ r 0 iff ^~
0 =--L1Jg - L1TOK~_, 0 -- L~ [J~,, J,,9],
where Jg, Jg9 and Jggg are given in (4.3a)-(4.3c). >From the proof of Corollary 1.1 these equations can be written as (5.11c) and (5.11d) (replacing L~ by L~ in both) and (5.23), thus proving Corollary 1.2(2). []
5.2
N u m e r i c a l Calculation Procedure
The gain matrices in the design of (3.6a)-(3.6v) can be calculated efficiently. The equations given in (3.6h), (3.6j)-(3.6o) and (3.6q)-(3.6v) can be arranged to give the form (5.24)
Od,A; = T Z ,
where the order of the contribution to the equation in (5.24) is not important and where N ' - n - P (2m + 3v + 3q + (p + n)(q + v + 1) + p ( p -
n) + n(2 + n))
+ 2 ( 2 m + p ( 3 + p ) + 2v(v + 1) + 2q(q + 1)). Equation (3.6e) and (5.24) are combined to give the form 0d,(5q+Tn-6p = [[TA - FT] ~ - TEa [Iq, 0q,(,~_p)] ,TZ] .
(5.25)
By splitting the two terms on the right hand side of (5.25) there follows two equations in F and T which are equivalent to (5.25) Od,(5q+Tn-6p) = F T X 1 + T X 2 or Od,(5q+7n_6p ) ~- [FT, T] X ,
372
1. Fault Detection Observer for a Class of Nonlinear Systems
where X 1 = [~, On,(4q-1-6n--5p)] , [Eo
-
Aa, Z],
X = [Xl, X2]'. Using the algorithms developed in [10] [11, F, T, L1 and L2 (and the other gains) can now be calculated using SVD decompositions. Then conditions (la) and (lb) of Proposition 1.4 can be tested (or similar conditions in Proposition 1.3 if g(t) is known). A recursive algorithm for the complete design of a RFDO (or SRFDO) can be given along the lines developed in [10] [11 .
6 Concluding Remarks A nonlinear fault detection observer has been proposed in this chapter for a nonlinear system involving polynomial nonlinearities of bilinear, quadratic and cubic forms. Proposition 1.1 gives sufficient conditions for the error dynamics and fault detection signal to be robust with respect to a disturbance and Propositions 1.2-1.4 give conditions for a fault to be detectable (RFDO and SRFDO). Fault isolation can be performed by using a bank of RFDO's [7]. The design procedure here involves only efficient linear matrix calculations, thus ensuring easy assessment of fault detectability.
1. Fault Detection Observer for a Class of Nonlinear Systems
7
373
REFERENCES [1] S. A. Ashton, D. N. Shields and S. Daley. Application of a Fault Detection Method for Pipelines, System Science, Vol. 23, No. 2, pp. 97-109, 1997. [2] P. M. Frank. On-line Fault Detection in Uncertain Nonlinear Systems Using Diagnostic Observers : A Survey, Int.J.Systems Sci, Vol. 25, No 12, pp. 2129-2154, 1994. [3] A. Hac, Design of Disturbance Decoupled Observer for Bilinear Systems, ASME, J. Dynamic Syst. Measure. Control, Vol. 114, NO. 12, pp. 556-562, 1992. [4] M. Kinnaert, Y. Peng and H. Hammouri. The Rmdamental Problem of Residual Generation for Bilinear Systems up to Input Injection, Proc. IFAC con.f. ECC'95, Rome, Italy, pp. 3777-3782, 1995. [5] A. J. Krener and A. Isidori. Linearization by Output Injection and Nonlinear Observers, Systems and Control Letters, Vol. 3, pp. 47-52, 1983.
[6] H. Nijmeijer and A. Van der Schaft. Nonlinear Dynamical Control Systems, Springer Verlag, 1990. [7] R. Patton, P. Frank and R. Clark., Fault Diagnosis in Dynamic Systems, Theory and Applications, Prentice Hall, 1989. [8] R. Seliger and P. M. Frank. Robust Component Fault Detection and Isolation in Nonlinear Dynamic Systems using Nonlinear Unknown Input Observers, Preprints of SAFEPROCESS '91, Sept. 10-13, BadenBaden, FRG. Vol. 1, pp. 313-318, 1991. [9] X. H. Xia and W. B. Gao. Nonlinear Observer Design by Observer Error Linearization, SIAM J. of Control and Optimization, Vol. 27, pp. 199-216, 1989. [10] D. Yu and D. N. Shields. Bilinear Fault Detection Observer and its Application to a Hydraulic System, Int. Jnl. of Control, Vol. 64, No. 6, pp. 1023-1047, 1996. [11] A. N. Zhirabok. Fault Diagnosis in Nonlinear Systems with Uncertainies, Proc.of IFAC Syrup., Sa.feprocess '97, Hull University, Vol. 1, pp. 528-533, 1994.
Nonlinear Observer for Signal and P a r a m e t e r Fault D e t e c t i o n in Ship Propulsion Control Mogens Blanke and Roozbeh Izadi-Zamanabadi Department of Control Engineering Aalborg University Fredrik Bajers vej 7C DK-9220 Aalborg, Denmark
1
Introduction
Faults in ship propulsion and their associated automation systems can cause dramatic reduction on ships' ability to propel and maneuver, and effective means are needed to prevent that faults develop into failure. The chapter analyses the control system for a propulsion plant on a ferry. It is shown how fault detection, isolation and subsequent reconfiguration can cope with many faults that would otherwise have serious consequences. T h e chapter emphasize analysis of re-configuration possibilities as a necessary tool to obtain fault tolerance, showing how sensor fusion and control system reconfiguration can be systematically approached. Detector design is also treated and parameter adaptation within fault detectors is shown to be needed to locate non-additive propulsion machinery fault. An adaptive observer is suggested for this purpose, est trials with a ferry are used to validate the principles. Propulsion system availability is crucial for a ship's ability to maneuver. Nevertheless, control systems associated with propulsion required to be fail-operational or fault-tolerant. Instead, local safety systems protect machinery. They prevent continued operation or start-up if sensors inform that local shut-down. While fail-safe for each piece of machinery, the local safety approach is not globally fail-safe for the ship. The consequence has been many events where consequences vary from irregularity to major economic loss and causalities. Several events could have been prevented if automation systems had been designed to be tolerant to faults, with overall availability in mind. Fault-tolerant control (FTC) is a methodology where analytical redundancy is employed using software that monitors the behavior of components
376
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships
and function blocks. Without hardware redundancy, some faults m a y inevitably cause a plant shut-down, but the F T C strategy is t h a t the m a j o r i t y of faults, and in particular the ones with severe consequences, are accommodated. T h e objective is to keep plant availability but accept reduced performance as a trade-off. The first step to achieve fault tolerance is efficient detection and isolation of faults. This is a particular challenge when a system is non-linear. In this chapter, an active solution to the F T C problem is employed where on-line fault detection and isolation can trigger a discrete event signal to a supervisor-agent when a fault is detected. The supervisor-agent will activate remedial actions. Re-configuration possibilities are analyzed for a ship propulsion system consisting of a main engine with a controllable pitch propeller. It is shown that combined p a r a m e t e r and o u t p u t estimation is required and an adaptive observer is proposed for fault detection. A continous-time non-linear observer is shown to possess very useful features and can be used during b o t h detection and re-configuration. Simulations on a model of a ferry [12] illustrate performance for a selected fault scenario.
2
Ship Propulsion System
This section introduces m a t h e m a t i c a l models for ship speed, propeller and prime mover, the essential propulsion system components. T h e purpose of the modeling is to obtain information to design fault detection and isolation (FDI) modules for essential faults and to give the prerequisites for design of re-configuration when faults occur. The block diagram in Fig. 1 illustrates the structure of the propulsion system.
Shaft
YP!
~
~
mech~sm mc
J ship- .d L . u I
.....
I
T
I
.....
I
I Q~(~Vam) Ship sI~d
FIGURE 1. Structure of dynamic relations for CP propeller, shaft and diesel engine.
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships
377
I
..............
r.n. ".r
_i~. ~ n m .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
0".,-~.~3 u.
N ~ ~ ,im ) .
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
.
FIGURE 2. Hierarchy of controllers for propulsion system. The handle gives input to a combinator, efficiency optimizer, and ship speed control. Lower level controls are shaft speed (governor), propeller pitch and diesel overload blocks.
2.1
Propeller Thrust and Torque
Controllable pitch (CP) propellers have blade angle (pitch) controlled by a hydraulic servo system. Developed thrust and torque are functions of pitch, shaft speed and flow velocity through the propeller Tprop Qprop
=
:
fT_prop(O, n, Va) fQ_prop(O,n, lra)
(2.1)
These can be shown to approximately follow quadratic relations, for thrust
Tprop -~ ~nln~'~ Inl n + T,~vnYa
(2.2)
Qp~op = Q0 bl n + Qn~,o, IOl in[ n + QnvoO [hi V~
(2.3)
and for torque
These relations give a quite good approximation in the steady state cases whereas they are less applicable during large transients. The term Q0 [nl n accounts for the torque at zero pitch.
2.2
Diesel Engine Prime Mover
Elaborate details of the dynamics [3] are not important in this context, but would be for detailed design of FDI for the engine. Here, diesel torque can
378
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships
be considered linearly related to the fuel index, without dynamics involved, Qeng = K y Y
(2.4)
The dynamics of propeller and shaft is merely that of rotating inertias subjected to torque balance between prime mover torque and load torques, d r l r n2 ~ ~ t ) ~-n(Qeng-Qprop-Qf)
(z.5)
The dynamics of the prime mover and its control system is tightly coupled to the speed dynamics of the ship through the propeller (2.3). The structure of prime mover control was also shown in Fig. 1. The measured shaft speed is compared with a reference speed and the governor (speed controller) regulates the fuel injection to the engine to obtain the desired speed. Limit curves are incorporated for shaft speed dependent torque and air pressure.
2.3
HuU Resistance
Ship's resistance to motion through the water can be described to the first order by a resistance curve, which is a third to fifth order polynomial in u. The order of the polynomial is higher the closer the ship operates into the wave making region. The resistance curve is known a priory but with some uncertainty. The first order equation m(] = R(U) + (1 - t)Tprop + Text is a sufficient approximation in this context.
2.4
Actuators for Fuel Injection and Propeller Pitch
The actuators can both be modeled as first order dynamic systems with limits in rate of change and in output. The electro-hydraulic pitch control system is described by the following equations: u~ = kt (0rer - G ~ )
z maX(0min, min(uo, Om~x)) = max(0min, min(0, Om~,))
(2.6)
The diesel actuator is equivalent to this with command Yc from the governor, rate limits Y C [Yd-, Yd+] and o u t p u t Y E [0, 1].
2.5
Sensors
Sensors for propeller pitch and fuel index are conventional angle transmitters. Shaft speed is usually measured by a set of pulse pickups. A maximum
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships F u n c t i o n blocks R combi:
C optim: -
nd Vgd
~)d
=C
Service
= RCB(ha)
--
hd v~m nm
OP
Ym um
379
normal (fault)
(n, ~) demand { freeze (input fault) } best efficiency use estimate (inp.fault) } { roll-back (ref. fault) } alter limits (diesel fault) }
nd
Vgd
C over:
~
= C_OL
nm
~m
constant U
u~
{ freeze ha(Urnfault) }
C speed: hd • C_SS I" Um I
I.
C shaft: Y c = C
" " "
avoid overload {freeze (fault)} { use estimate (fault) }
.
SP (nT,nm)
estimate Um(Umfault) } { roll-back(Ur fault) } shaft control { estimate n(nmfault) } { } r o l l - b a c(nT k fault)
TABLE 2.1. Function blocks treated as virtual components. logic selects the higher of the two signals. This protects against drop out of one of the pick ups but not against a "high signal" fault or failure in a common processor/rate counter servicing both channels. The ship speed is measured by magnetic log, Pitot tube or Doppler log. The two former measure water speed close to the hull and are quite prone to fluctuations from the turbulence and cross flow.
3
Control Hierarchy
The control hierarchy includes controllers for: shaft speed; propeller pitch; diesel overload control; combinator curves from handle position to generate reference values of n and ~; efficiency optimization using n and v~; constant ship speed control. The signal flow between these function blocks is shown in Fig. 2. The interested reader can find details about the control functions in [12]. The input-output of each block is listed in Table 2.1. The table lists the service of the function block in normal operation and the desired function in case of specific faults. The listing of desired remedial actions is a result of a combined fault-propagation and structural analysis of the propulsion system, including the possibilities for re-configuration after serious faults
380
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships Constraint
Description
: fnm = n f~ : Orn = ~
f
f~ : U m :
U
f~ : Ym = Y f~ : Ky ----Kyc f~ : Qeng = K y Y
re: -Qeng = Qprop + Q I f~ : Qprop = fQprop ] ~ : Tprop = fTprop
f~o: R(U) = fRu(U) f~l: R(U) = - Text - (1 - t) Tprop
I]
sensor_n sensor_t9 sensor_U sensor_Y engine gain engine torque shaft balance propeller torque propeller thrust hull resistance. ship speed
TABLE 2.2. Static constraints for shaft [4]. The table list their input and output, faults considered, and re-configuration possibilities. An example of this analysis is provided in the next section.
4
Structural Analysis
Structural analysis [7, 10, 17] is the study of properties which are independent of the actual values of the parameters.Constraints, here used as a synonym for relations, between variables and p a r a m e t e r s from the operating model are used in the analysis. The links are represented by a g r a p h or a table, on which the structural analysis is made.
4.1
D e s c r i p t i o n o f the M o d e l
The model of the system is considered as a set of constraints, 5c = { f f , f~, ... ,- 9- , f ~ } that are applied to a set of variables Z = X U )(. X denotes the set of unknown variables while ) ( is the set of known variables: sensor measurements, control variables, constants, and parameters, and reference variables. The constraints are the relations imposed between values of the variables, as given by the relevant physical laws. The constraints for the propulsion system are listed in Table 2.2.
4.2
Formal Representation
T h e structure of the system is described by the following binary relation: S:$-x
Z ~ {0,1}
c z S(][, zj) = 1 (f~' J) --* S ( f [ , z j ) = 0
iff f [ applies to zj, otherwise.
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships
381
These relations can be represented by an incidence table or the equivalent digraph. Fig. 3 a) shows the structural table for the propulsion system. Some constraints may be expressed through non-isomorphic mappings for certain variables. Such variables can not be re-constructed through an inverse mapping from knowledge about remaining variables. Elements with this property are marked by M's (for multiple), replacing the l'es in the incidence table and unidirectional arcs in the corresponding digraph. An example of such a constraint is f~: it is always possible to compute the value of Qprop from f~ when ~, n, and V~ are known. However, knowing the values of Qprop, n, and V~ does not enable calculation of a unique r in all cases. This fact is not apparent from the equations in this chapter but is apparent when looking at the underlying propeller characteristics. The non-isomorphic problem for the Qprop relation is only present in a narrow range of transient conditions (during crash stop).
~.3
Sensor Fusion for Re-configuration
In control systems, re-configuration can be obtained either by means of hardware redundancy or the use of software redundancy. In the case where hardware redundancy exists, the scope of design is FDI algorithms and hardware switching. When analytic redundancy is available, fault tolerance is obtained by means of sensor fusion: the value of the signal which is lost or corrupted due to faults, is reconstructed using known values of other signals. The structural analysis approach is usually employed to obtain analytical redundancy relations for FDI [9]. It can, however, be used without difficulties for sensor fusion as well, since a constraint relation can be used to re-construct a signal from the other measured variables. An example for the propulsion system is a critical fault in the shaft speed measurement which can be accommodated by estimating shaft speed from other available measurements.
Fault in the Shaft Speed Measurement A critical fault in the propulsion system is a failure in the measurement of shaft speed. The constraint f~ represents this device in Table 2.2. A fault occurrence means that the constraint f~ does not hold, e.g. the values of the variable nm are not correctly related to the values of the variable n. Figure 3 b) shows that variable n is involved in 3 relations which are specified by the constraints f~, f~, and f~. Since the constraint f~ is not valid, there are two other possible ways of calculating the values of the variable n, namely through constraints f~ and f~. As it is shown in Figures 4 a) and 4 b), the ship speed can be described as a function of the other
382
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships v r
n
r~ e.~
I
I;
1
I
1
I:
1
1 1
I: I;
1
1
1 1
M
1
M
1
[,
fu ~
1 1 1 1 1
1
a)
"
../
\\
."
r_
r
b)
~r
FIGURE 3. a) The structural representation of the model by a (binary) table. l's are replaced by x's to indicate causality (calculability) between variables, b) Corresponding digraph representation. known variables as:
% = L(om, Kin, K~c, Urn) ~ = f~(O~, U~)
(4.7) (4.S)
The process to apply the sensor fusion based on this approach is the following: For the interested variable (for instance n) identify the set of related constraints (f~ and f~) and 9 choose one of the available constraints 9 check the causality for the constraint in order to find out t h a t the variable can be c o m p u t e d through this constraint.
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships
e @~__..~.___~
383
T ~
.....
e
b)
~,r
FIGURE 4. Sensor fusion methods based on structural representation: shaft speed calculation through a) propeller thrust equations f~ and b) propeller torque equation f~. 9 for all the variable connected to the chosen constraint search backward until all end variables are known variables. R e m a r k 2.1 The described procedure shall find all the existing paths from the unknown variable to sets of known variables. Some of these paths may include loops, which are related to the existing control or natural loops in a system. By examining all the constraints, the set of equations/relation by which the variable can be calculated is identified and can be used for re-configuration purposes. For the shaft speed failure, the m e t h o d is illustrated graphically in Figures 4 a) and 4 b). Grey dashed arrows show the calculation paths to the known variables. Using quadratic representation of the propeller torque, the variable n
384
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships
can be estimated from the constraint f~, but estimation of n based on static relations is obviously too primitive. A non-linear observer is employed instead.
5
Isolation of Shaft Speed and Engine Faults
This section deals with the problem of detecting whether a shaft speed fault or engine fault has occurred. The relevant dynamics to be considered was described above, leading to the constraints f7 to f11. T h e task at h a n d is to estimate a signal fault in n,~ and a p a r a m e t e r fault in K u. T h e dynamic equations directly determining shaft speed are
Itn = Q~ng - Qp~op -
QI
(5.9)
Q~,~g = K~Ym Taking ship speed U as a measured variable - a valid a s s u m p t i o n when Um is non-faulty, Qp,-op = Qo In[ n + Q~,,~ Ivgl Inl n § Q~,~y, v9 In[ (1 - w o ) U m
(5.10)
In the sequel, we use Qe,~u - Q~,~v,~ (1 - w0) for brevity. Shaft speed is positive in a controllable pitch installation, so
~ = ~ 1 ( K ~ Y m - Q f - Q o n 2 - Qonn [vg[n 2 -- Q o n u v q n U m )
(5.11)
Following the benchmark definition in [13], we need to consider faults in either shaft speed measurement or in the diesel torque coefficient, n m -= n + n f
Ku = Kyc - Ky I
(5.12)
and The detection task is hence increased from a single fault shaft speed sensor fault detection to a more complex one of simultaneous additive and non-additive faults. An adaptive observer providing simultaneous state and p a r a m e t e r estimates is a natural choice as a candidate for detection of the two particular faults.
5.1
Adaptive Observer
The dynamic relation (5.11) can be written in a form which is linear in the unknown p a r a m e t e r -- (I)(x, u2, u3) + Oul y=x
(5.13)
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships
385
using 1
= ~ ( - Q o ~ 101 n 2 - Qo,~uOnUm - Qon 2 - Q f)
x=n,
(5.14)
U l = Y m , u 2 = U m , u 3 = O m , O-- Ky It
An adaptive observer can then be build by using the measured inputs: Ym, Urn, Z~m and the measured state nm. It is noted that the more general case was treated in [8]. However, the detailed assessment of the Lipshitz conditions, that determine the gains in the adaptive observer, are easily made too conservative to get useful results. A few comments are thus considered appropriate. This leads to the following theorem. T h e o r e m 2.1 An adaptive observer .for the problem
1 (KyYm - Qe,,~ IO[ n 2 - QonuOnUm - Qon 2 - Q f )
i~ = Tt
(5.15)
is the state estimator ~=~"t
1 (_Qo~nOm~2 _ QonuOm~Um - Q0~ 2 - Q f) + Y m ~ + L ( n m - ~) (5.16)
with parameter updating O = PYre (nm - ~)
(5.17)
The adaptive observer is semi-globally asymptotically stable with Ym > 0,
P > 0,
Qeng'max (o~nmax ~ - ~ ) L > It nmax \ nmin
(5.18)
where It, Qeng,ma~, nmax, nmin, O~ and ~ are plant specific parameters.
[]
The nonlinear torque function 1
O(n,u) = y~ ( - Q o ~ I ~ 1 n2 - Qo~u vgmnUm - Qo n2 - Qf ) is Lipshitz II~(n, u) - ~(~,u)ll < ~11n -
~ll
since 9 (n, u) - ~ ( ~ , u) --It 1
y~ ( - ((Q0 + Q~n~ 10ml) n + Qo~u~mUm) - (Qo + Q ~
[~m[) fi) (n - ~)
386
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships
Practical diesel torque constraints and ship speed being dynamically related to torque lead to -
((Qo + Qonnl
ml) n +
QonumUm) <
o~Qeng'max nmin
-
-
(Qo + Qonn [~m]) ~ "~ /3Qeng'max
nmax
hence liD(n, u) - (I)(fi, u)[[ < "y []n - ~[[, where the Lipshitz coefficient is
~[ -
Qeng,max(o~nmax+/3) It nmax k, nmin
A Lyapunov function for the observer error
is V -- rtPn T ~- ~ T . Where P > 0 is a scalar. Then, using the notation (5.14), and details of the proof in [8], = 2((I)(n, u) - ~(~, u ) ) P ~ + 2ulOP~ - 2 ~ L P ~ + 20 d~ dt < 0 iff L > ~,
P > 0 and n C_ [0, nm~x], ~ C [0, nm~x].
It is noted that to obtain V < 0 it is required that f ( x , u ) O - f ( & , u)O ~ O. This implies that 0 --* 0 iff & -~ x. This requires persistent excitation in u(t).
R e m a r k 2.2 The parameters to calculate the ~/ value are, typically: a = 0.1,t3 -- 3, nmax ---- 3nmin. The a and t3 values are found from (2.6) and (5.11) using observation that maximum shaft speed is limited to 1.09 n~om, even during a crash stop. R e m a r k 2.3 The propeller coefficients are taken to be known parameters in the observer. With inherent parameter uncertainty, system identification is needed for practical application. Parameter convergence will require persistent excitation. 5.2
Identification of Propeller Parameters
Direct identification of the physical parameters is conveniently done by adjusting model parameters directly until reaching the minimum of the 2-norm of the deviation between the system output and the estimate [6]. A batch processing on selected data is easily carried out, using standard methods from system identification. It must, however, be validated for each selected sequence, that the excitation in the selected data is persistent. A nice approach to this is a sensitivity analysis [14].
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships
387
The functional of the output vector for the ship is predicted by the nonlinear model using measured input excitation # as input to the model,
9 (k) = f ( ~ ,
0)
(5.19)
where 0 is a vector containing estimates of parameters, #N is the input vector with N samples, and f (#, 0) is the relevant non-linear model. T h e discrepancy between measurement and model prediction, when both are excited by the input signal #, is the model error, e(k) = y ( k ) - ~l(k). A performance function V (0) to be minimized is then conveniently taken to be quadratic, N
(5.20) k=l
The parameter estimate 0 based on N input-output data points, #N and YN, is the value 0N that minimizes V (\ t t N. , Y1N , O ]
(5.21) The estimate 0N is obtained through minimization of this criterion. Some identification methods require the Hessian H , which can be approximately determined from the model gradient ffr (k)
0~) (k) 00
(k) -
(5.22)
and H=
o2v (O)
1 "
^ ^ =~E ffj(k) fliT(k) O0 O0 T k~l
(5.23)
The gradient 9 (k) can be determined analytically in some cases, but is always available through numerical differentiation. The normed root mean square output error C RM
S n
--
CRMS _ _ YRMS
(5.24)
is a more significant number for expressing the model fit than the not-scaled performance function V (0). The relative normed Hessian is then H,.,~ -- y~t2MsnHL
(5.25)
where L is a diagonal matrix L = diag(0) . However, a good fit, i.e. small values of enMS,~ and V, only indicates that the model structure is adequate
388
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships
for expressing the system behavior for a particular input signal. In [14] it is shown how characteristic sensitivity measures are very convenient for determining whether a good fit also implies accurate parameter estimates. The minimum sensitivity of the parameter dependent part of the model error Cp,RMSn (8) with respect to one relative parameter 8n, for arbitrary values of the remaining parameters, is found to be Si rnin = ~/(H~ "-1 (N),~)
(5.26)
Also, the ratio R of the maximum and minimum sensitivities in any direction in the parameter space is essential
R-
Smax Stain'
Smax• ~
and
Srnin = ~
(5.27)
where A denotes eigenvalues of Hr~ and represent the sensitivity in the parameter space. These sensitivity measures are used for input design. In the sequel, we also need sensitivity measures for the individual parameters
/
Si = vHrn(Og)ii
and
Ri -
S, Si mi'--~n
(5.28)
Determination of Parameter Accuracy It can be shown [15] that the estimated parameter error is inverse proportional to the sensitivity Simin. The total, relative estimated error for the i'th parameter can be determined as x m ~ritot ~" CRMSn -[-CRMSn (5.29)
S~minVW S~min
where 6RMSnXand CRMS n m are the root mean square errors caused by noise and under modeling, respectively. The determination of parameter uncertainty for a given input signal is used in the sequel to run a batch processing identification for determination of propeller parameters, and later in assurance of identification quality when a possible diesel gain fault is suspected.
5.3
Identifiability from Usual Maneuvering Data
The parameters Q~nn and Q~nu in (2.3) are identifiable if their sensitivities are adequate, according to (5.29), given a partiqular input sequence. Figure 5 shows a sensitivity plot for the two physical parameters using d a t a measured at sea-trials with the passenger ferry Dronning Ingrid [13] under usual maneuvering. This result is important for the practical applicability of the concept.
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships Normed
0.013
estimation
~,
0.07
error variance
m
,
versus parameter
389
variation
i
............... i ............... i .................................. i.................................................................. O.OE
........... +........... i................ i .................~.......................................................... /i i i [ /
0.0~
-"-; .........
i ................
"- ................
i ................
i . . . . . . . . . . . . . . . . . ~.............................
"-----" .......... i/
RMS(e.) RMS(n)
0.04
0.03
........... t ~,. ...... ~............... i................ ',............... ~ ....
"-
0.02
..........
.~ .........
~
:.+ ..........
~
i
i
i
I ................
~ .................
L .-..-~-.....L
.......... +..............
',
2
..............
+ ..............
001 0 -40
~ -30
-20
i
i
- 10
0
Parameter variation
i
i
i
10
20
30
40
(%)
F I G U R E 5. Sensitivity results: The normed variance of the estimation error calculated for variation in Q~,~ and Q~,~ya.
6
Fault Isolation
T h e c o m b i n e d p a r a m e t e r a n d sensor fault detection in one observer give rise to i n t e r d e p e n d e n c y in the fault isolation. Since remedial actions are widely different to a c c o m m o d a t e each of the two faults, fault isolation is crucial. W h e n used for detection a n d isolation, there are two observer error signals to consider e,~ = nm - ~ = n + A n K~
eo=~+
AK~
It
(6.30)
R~
It - 0 + A 0 - 0
(6.31)
where A n a n d A K y are the sensor a n d engine faults, respectively. T h e observer error signals are derived f r o m from equations (5.15) to (5.17),
~,~ =
_ (q~n~m \ ~
(n + ~) +
Q ~ mItV m
)
+ L / en - Ymeo - L A n - YmAO (6.32)
Re = PYme,~ + P Y m A n
(6.33)
T h e linearized transfer functions in an o p e r a t i n g point, {fi, 0, (7, Y'}, are easily determined, from the Laplace t r a n s f o r m of the two fault signals to those of t h e "raw" observer error signals. Using the a b b r e v i a t i o n s
o~ = 2 Q~'~'~0~
+ Q-~~u 0 0
(6.34)
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships
390
N ( s ) = s 2 + (c~ + L)s + p ~ 2
(6.35)
we get
(sL + p ~ 2 ) 8]~ N(s) A n (s) - ~--~(s)AO (s)
e,(s) =
eo( s) --
P Y (s + a)
N(s)
(6.36)
P172 A n ( s ) - -77-77_~AO( s) lv~s)
(6.37)
Since the two raw observer error signals are not linearly dependent, diagonal isolation [11] can be achieved. Nice low pass properties are obtained by defining the filtered residuals as follows,
P?
rn(8)
{
s
"t
P?
I + ST e~
=
e
(8) + e•
(6.38)
_py2 (sL + p y 2 ) AO(s)
+ r0(8) -
--An(s) I + ST
-
(6.39)
This choice assures that a steady state error gives rise to a m e a n value change in the residual. This is one of the fundamental properties t h a t should be achieved to enable easy detection also in the presence of noise [1]. Observer
error
z~kyocCurs
when
0 . 2
0
"
:
:
1_
:
L . . . . . . .
-
1
] "
....
.-..
,,
,
I m
J
L
i
I~== - 0 . 2
...............
~...............
* .................
~
...........................
*. . . . . . . . . . . . . . . . . . . .
-0.4 ..........................................................................................................
~
...............
; ...............
-0.6
0
500
1000
1500
2000
Estimated engine gain (Aky
x 108
2500
3000
3500
occurs)
9..~;=" 1 . 1
[
0
500
1000
1500
2000
2500
3000
3500
Time[see.]
FIGURE 6. Time history for obtained state error and parameter estimate when gain fault occurs.
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships
391
Simulation results To illustrate the performance of the algorithm, test d a t a collected on the Danish ferry Dr. Ingrid, were used. Fig. 6 shows a gain fault in Ky occurring at time 2500 second. T h e gain changes stepwise down by 5%, corresponding to the partial failure of a single cylinder. The raw observer error signals are plotted. Balancing of gain factors for P and L can be made such t h a t further filtering is not necessary. The e~ signal is well below a reasonable threshold value. The second fault considered is the critical failure of the shaft speed measurement. A negative fault in shaft speed m e a s u r e m e n t is shown in Fig. 7. T h e simultaneous discrepancy in b o t h signals make it necessary to use further filtering as proposed to achieve isolation.
O b s e r v e r e r r o r w h e n Anlo w o c c u r s
0.2[ '= - 0 2
~,
.................... ~. . . . . . . . . . . . .
0
500
x 105
i .................... ~...................
1000
1500
~ ...................
2000
i .................... ~. . . . . . . . . . . . . . . . . . .
2500
~ .................
[ ..................
i .....................
~ ...................................
............
~5 0
3500
E s t i m a t e d e n g i n e g a i n (Anlo w o c c u r s )
:I ............. i......... i..................;.............. i ................ ,. ...............
3000
500
1000
1500
2000
2500
3000
t
3500
Time[sec.]
FIGURE 7. Time history for obtained state error and parameter estimate when a negative fault in shaft speed measurement.
A positive fault in shaft speed was simulated and plotted in Fig. 7. T h e response of the Ky deserves some comments. W h a t happens is t h a t the nm high fault is immediately reacted to by the shaft speed governor, which drives the fuel index rapidly to zero. Y being zero is a violation of the conditions for convergence of the non-linear observer. This is quite obvious, since a zero gain is the present in the p a r a m e t e r update, (5.17). This phenomenon is not accounted for by the linear filter analysis above t h a t should provide fault isolation. It is noted t h a t none of the above simulations were m a d e with re-configuration activated. This is treated below.
392
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships Observer
0.2
error when
Z~qhig h o c c u r s
;
"T
~iii:i!:i:!i:iiii?-liiiiil!!ii!i! :iiiiil !!:!!ili:ii:!.iiiilCiiiiiiiiiiiiiiiiii iiii1
~ ~ -0.2 -0.4 -0.6
0
500
1000
Estimated
x 10 e
1500
engine
2000
gain
2500
3000
3500
' 3000
3500
(Z~r/hig h o c c u r s )
1.14 1.13
'
0
500
1000
1500
Time
2000
"
" 9 2500
[see.]
FIGURE 8. Time history for obtained state error and parameter estimate when a positive fault in shaft speed measurement.
6.1 Re-configuration Re-configuration actions depend on the actual fault t h a t has occurred. T h e key issue in this context is t h a t there is difference in the time it takes to verify which of the possible faults have occurred. Simultaneously there is a dramatic difference in the requirements of how fast a fault must be detected, isolated and accommodated. If the n-fault has happened, the diesel engine can reach and over-speed condition within fractions of a second. If the diesel engine gain has deteriorated, action within a minute span of t i m e will suffice. Since the n-fault is the most severe, and the time to isolate the actual fault is longer t h a n the required time to re-configure for the n-fault, the supervisor will need to choose the worst case a s s u m p t i o n until final isolation can be made. If necessary, re-configuration will thus first assume t h a t an n-fault has happened, and a c c o m m o d a t e to this situation. Having re-configured the n-measurement, isolation might run until finished, and the final remedial action take place. Continued observation for correct isolation of the cause of the fault is not possible when only shaft speed is monitored. In an actual application, ship speed, pitch angle and fuel throttle sensors would all be supervised, and more elaborate schemes could be employed to detect possible faults. In the solution chosen, all signals used are available by the governor. Limiting fault detection and a c c o m m o d a t i o n to signals available by the governor, would enhance the integrity of this device. Re-configuration to a c c o m m o d a t e the n fault is then to switch the controller to use the estimate of n instead of the faulty m e a s u r e m e n t rim. W h e n
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships
393
a gain fault of the engine has occurred, the remedial action is to change the overload limits within the governor to the reduced capability available from the engine. Since the two reactions are entirely different, proper isolation of the two faults is crucial. Detection and isolation of a change is done using s t a n d a r d m e t h o d s for change detection. This is not immediately possible with only one residual, redundant information in the system is needed. The possibilities can be derived from the structure diagrams. T h e y show t h a t observation of ship speed through the thrust constraints is indeed feasible. T h e slow dynamics of ship speed, and the fast reactions of a diesel engine make it necessary, however, to assume a worst case fault in the shaft speed measurement. When a discrepancy in n is observed, diesel control is maintained using the open loop nonlinear observer. In due time, other measures will show whether the fault was the less serious cylinder defect of the engine, and the re-configurated sensor signal could be switched to normal, while other appropriate steps are taken to a c c o m m o d a t i o n the fault now isolated. T h e simulations show the performance of the adaptive observer with re-configuration when a shaft speed sensor failure occurs.
II Symbol II Unit f T , ,Pr~
fQ_prop It Ky n
R(U) op Tpr
Te~t
1-t u
89
N gm
kgm 2 Nm fads -1 N N N '171,8- 1 TnS - 1
l-w
Qeng
Qf Qprop
Yd
Nm Nm Nm 0..1 -1..1
II Explanation thrust function torque function total inertia torque coefficeint sha~ speed hull resistance propeller thrust external force thrust deduction factor ship speed flow at propeller wake fraction diesel torque shaft friction propeller torque fuel index propeller pitch
TABLE 2.3. List of Symbols
II
394
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships Shaft speed signals with and without reconfiguration 14
13.5
System reconfigured
Real shaft speed with reconf.
(6~s.)
./
Ref . . . . . . shaft speed
.........
"~ 12.5
~
{ "-. [I " ~ IL
11.5 11
'','~
~ominal sh.tspeed '-
Reals.ft speed
I ~'",r "N / Fault detected N N / (681 sA "x:
without reconf.
~ ~
[
10,5 Fault occurs
41 67O
(680 s.) t
680
,
i
61)0
!
700
i
710
720
[Sec.] Fuel index measurements with and without reconfiguration Time
1
{3.9 ~Z9
O.7
I il
06 O,5
Fauh occurs
04
Fault detected
0.3
-ill J{
~..i.//
,,._~
",,.... Fuel index "". . . . inal "'...
~-, Foe} index
k
~ ,r
(reconf.)
..... " \ . .
x \..~
System reconfigured
_
Fuel index (Faulty case)
_
-\
0.2
\
0.1
J,o
do
~o
,;o
Time
[sec.}
rio
"-'
'
72O
......
730
FIGURE 9. A zoom in on the shaft speed and fuel index values in the worst case, using the observer generated shaft speed.
7
Simulation Results
The m e t h o d s are applied to the ship propulsion benchmark where the following scenario is simulated. A shaft sensor failure occurs at t = 1000(s). A statistical fault detection m e t h o d (CUSUM) has detected the sensor failing high at t --- 1001(s). The supervisor a c c o m m o d a t e s the fault at t = 1002(s). by activating a dedicated procedure that estimates the variable n. The calculated variable replaces the measured one in the shaft speed control loop. The upper part of Fig. 9 shows the real shaft speed for nominal, faulty and re-configured cases. The non-linear observer is used and w h e n the A n fault is detected, observer gains L and P are immediately set to zero. T h e Ky parameter is taken as the value it had a few seconds before the fault was detected. The estimated shaft speed is immediately used by the shaft speed controller as a substitute for the faulty sensor signal. The lower part of the figure shows fuel index. It is seen that switching from the faulty nm to the estimated fiq results in an overshoot of less than 5% in shaft speed. The failure has been simulated to happen exactly during
2, Nonlinear Observer for Signal and Parameter Fault Detection for Ships
395
a transient command to obtain a worst case condition. It is essential that the 9% critical limit of over-speed shut down of the main engine is not reached. The resulting overshoot is thus well below the critical over-speed shut down limit. For all figures, the curves represent: normal case (solid), faulty case (dash dotted), and re-configured case (dashed), reference signal (dotted).
8
Conclusions
This chapter has analyzed fault detection and isolation, and re-configuration possibilities for a ship propulsion system with a main engine and a controllable pitch propeller: It was demonstrated how fault-tolerance could be achieved against critical sensor failure and cylinder malfunction of the prime mover engine. A non-linear adaptive observer was designed for fault detection and re-configuration, and filters for efficient isolation of faults was found. A structural analysis was used to obtain a scheme of consistent re-configuration possibilities when the goal was to achieve uninterrupted prime propulsion of the ship. Simulations of a model of a ferry illustrated how a critical failure of the shaft speed measurement could be accommodated by the controller. The unavoidable penalty in control quality was found to be quite small and certainly acceptable against the alternative, which was a temporal loss of main propulsion of the ship. The essential contributions of the chapter were combining structural analysis and re-configuration design, and applying this approach to a realistic case using a nonlinea, adaptive observer.
Acknowledgments This research was partly funded by the European DISC II project under ECC DG7. Support of collaboration between the Universities of Lille (France) and Aalborg (Denmark) was received from the European Science Foundation under the COSY program. 9
REFERENCES [1] M. Basseville and I. Nikiforov. Statistical Change Detection, Prentice Hall Inc., 1994. [2] M. Blanke. Consistent design of dependable control systems. Control Engineering Practice 4(9): 1305-1312, 1996. [3] M. Blanke and J. S. Andersen. On dynamics of large two stroke diesel engines: New results from identification. Proceedings 9th IFAC World
396
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships
Conference. Budapest, Hungary, 1984. [4] M. Blanke and R. Izadi-Zamanabadi. Reconfigurable control of a ship propulsion plant. Proceedings IFAC Conference CAMS'98, Fukuoka, Japan. pp. 51-58, October 1998. [5] M. Blanke and R. Izadi-Zamanabadi and T. F. Loostma. Fault Monitoring and Re-configurable Control for a Ship Propulsion Plant. In: Journal of Adaptive Control and Signal Processing, 1999. [6] M. Blanke and R. Knudsen. A sensitivity approach to identification of ship dynamics from sea trial data. Proceedings IFAC Conference CAMS'98, Fukuoka, Japan. pp. 261-269, October 1998. [7] J. Ph. Cassar, M. Staroswiecki and P. Declerck. Structural decomposition of large scale systems for the design of failure detection and isolation procedures. International Journal of System Science 20 (1): 31-42, 1994. [8] Y. M. Cho and R. Rajamani A systematic approach to adaptive observer synthesis for nonlinear systems. IEEE Transactions of Automatic Control 42(4), 534-537, 1977. [9] V. Cocquempot, R. Izadi-Zamanabadi, M. Staroswiecki and M. Blanke. Residual generation for the ship benchmarck using structural approach. Proceedings CONTROL'98. Swansea, U.K., 1998. [10] P. Declerck and M. Staroswiecki. Characterisation of the canonical components of a structural graph for fault detection in large scale industrial plants. ECC'91. Grenoble, France. pp. 298-303, 1991. [11] Janos J. Gertler. Fault Detection and Diagnosis in Engineering Systems, Marcel Dekker Inc., 1998. [12] R. Izadi-Zamanabadi and M. Blanke. Ship propulsion system as a benchmark for fault-tolerant control. Control Engineering Practice (to appear), 1999. [13] R. Izadi-Zamanabadi and M. Blanke. A ship propulsion system model for fault-tolerant control. Tech. report, Department of Control Engineering, Aalborg University, 1998. [14] M. Knudsen. A sensitivity approach for estimation of physical parameters. Proceedings 10th IFAC Symposium on System Identification. Vol. 2 p.231, Copenhagen, Denmark, 1994. [15] M. Knudsen. Determination of parameter estimation errors due to noise and under modelling. Proceedings IEEE Instrumentation and Measurement Technology Conference, Brussels, 1996.
2. Nonlinear Observer for Signal and Parameter Fault Detection for Ships
397
[16] L. Ljung. System Identification - Theory .for the User. Prentice Hall Int., 1987. [17] M. Staroswiecki and P. Declerck. Analytical redundancy in nonlinear interconnected systems by means of structural analysis. IFACAIPAC'89, Vol. II. Nancy, France. pp. 23-27, 1989. [18] A. Tiano and M. Blanke. Multivariable identification of ship steering and roll motions. Proceedings Institute o.f Measurement and Control (UK). Special Issue on Marine Systems Modelling and Control, 1997.
N o n l i n e a r O b s e r v e r s for F a u l t Detection and Isolation P. M. Frank, G. Schreier and E. Alcorta Garcia Department of Measurement and Control University of Duisburg 47048 Duisburg, Germany
1
Introduction
One of the essential requirements of fault tolerant control (FTC) is fault detection and isolation (FDI) [9, 44]. The main task of fault detection and isolation can be roughly described as the early determination (detection) and localization (isolation) of faulty elements of a dynamic system, as well as the time of the appearance of the faults. Because of the great relevance of FDI in industrial plants as well as the availability of appropriate methods, this subject has become a fundamental issue of research within the control community during the last 25 years [4, 14, 22, 27, 31, 34, 46]. Among the well-established concepts of fault detection and isolation using analytical redundancy, i.e. based on mathematical models, are the parity space approach, the parameter estimation approach and the observerbased approach, where the parity space approach can be considered as a special version of the observer-based approach. It has also been shown that the parameter estimation approach has some interesting relationships to the observer-based approach [2, 11, 23, 30]. This is why the observer-based approach has become a most relevant subject of research in connection with FDI. The traditional observer-based approach to FDI makes use of linear models in the observer even if the plant under consideration is (as is usual) nonlinear [14, 16, 31, 33, 34]. Only in the last decade is there a trend to design nonlinear diagnostic observers for FDI in nonlinear systems [1, 7, 12, 13, 20, 26, 28, 38, 39, 47]. It is worth noting that the use of observers for FDI requires some special design efforts, because diagnostic observers are output observers with different goals from the state observers used for control. This point has often been overseen in the FDI literature in the past. As mentioned above, design methods for diagnostic observers for nonlinear systems found in the literature are often based on the assumption that the system works during normal operation in the neighborhood of a cer-
400
3. Nonlinear Observers for Fault Detection and Isolation
tain operating point. Clearly, in m a n y such cases linearisation is possible, however the linearisation errors can cause some difficulties in the F D I algor i t h m such t h a t these errors can be misinterpreted as faults and hence lead to false alarms. This is why approaches using nonlinear observer design for F D I tasks are becoming more and more i m p o r t a n t in order to augment the performance of the resulting fault detection systems. During the research of the last decade on nonlinear diagnostic observers, some results have been achieved for certain classes of nonlinear systems not only for fault detection but also for fault isolation [20, 29, 38, 41]. Nevertheless, a general theory of nonlinear F D I as well as the design of nonlinear diagnostic observers is still missing. The main reason for this is t h a t the estimation of the set or of sub-sets of the state or the measurement vector of a nonlinear syst e m is not well solved even if there are no disturbances or the nonlinearity of the system is independent of the disturbances. At present, there are att e m p t s to overcome the difficulty of analytical t r e a t m e n t of the nonlinearity by using non-analytical (qualitative and knowledge-based) methodologies such as neural networks or fuzzy techniques. In this chapter we will limit our consideration to the analytical approaches to the design of nonlinear diagnostic observers [17, 18, 50, 51]. The purpose of this chapter is to give a brief survey of the state of the art of nonlinear diagnostic observers for F D I in deterministic nonlinear dynamic systems, where quantitative models are available [3]. This survey is mainly covered by the first part of the chapter. T h e methods are discussed in terms of robustness and similarities between the different approaches, and open problems are pointed out. The second part of the chapter introduces the basic ideas of the design of a nonlinear diagnostic observer based on the explicit use of the Lipschitz constant. For simplicity of notation, the time dependence of the functions will be dropped throughout the chapter. The brackets ( ) will be used to express the functions' arguments and the brackets [] to signify Lie brackets (in addition to their standard use for matrices and to indicate priority of operations). As usual, II l I means the Euclidean norm of a vector.
2
Preliminaries
Consider a nonlinear system, described by the equations:
y
=
f(x,u,0s,
0d),
=
h ( x , u , Ofs).
x ( 0 ) = x0
(2.1)
where x C ]Rn is the state vector, u E ]Rm is the input vector, y E IRp is the output vector of the system, 01 C IR l represents the actual system parameters, i.e., when no faults are present in the system, Of = Ofo where Ofo
3. Nonlinear Observers for Fault Detection and Isolation
401
is the nominal parameter vector (understanding "fault" as an u n p e r m i t t e d parameter deviation in the system), Of~ E IR l~ represents the parameters in the o u t p u t equation (if a sensor fault occurs Of~ ~ OleO, where Of~o represent the nominal parameters in the o u t p u t equation) and ~d E ]R l" represents modelling mismatches (if the model of the system is perfectly known, 0d = 0). In an ideal case, a residual r(t) will be zero if no faults are present, different from zero when a fault is present (fault detection) and only manifest its i th element if a fault in the i th element is present (fault isolation). T h e procedure for evaluating the redundancy given by the mathematical model of a system is frequently divided into the following two steps [14]: R e s i d ual g e n e r a t i o n is the construction of signals that are accentuated by the changes in the parameter vector (faults) and R e s i d u a l e v a l u a t i o n is the decision and isolation of the occurrence of changes in the parameter vector (faults).
3
Observer-Based Residual Generation
In this section, we briefly review the principal observer-based residualgeneration methods for fault diagnosis which have been developed in recent years for special classes of nonlinear dynamic systems.
3.1
Nonlinear Identity Observer Approach
This approach to fault diagnosis was first proposed in [26] for the detection and isolation of component faults (see also [13]); for a more general class of faults in [14] and for further design considerations in [1]. T h e starting point is the nonlinear model (2.1) and the following observer structure
r
=
f ( z , u , Ofo,O) + K ( z , u ) [ y - f l ]
=
y - h(z,u, Of~o).
(3.2)
The design of the observer (3.2) is carried out on the assumption that no faults (Of = Ofo and Of~ = OleO) and no modelling mismatches (Od = O) A are present in the system. Defining the estimation error e = x - z, the differential equations governing its dynamics could be written by
=
F(z, u, Ofo, O)e - K(z, u)H(z, u, OleO, O)e ~-O1 (e j , t)
r
=
H(z,u, Oi~o,O)e+ 02(eJ,t)
(3.3)
402
3. Nonlinear Observers for Fault Detection and Isolation
where
F(z, u, Ofo, O) = Of(x, OXU'0f0 , 0) x=z
(3.4)
and
H(z, u, Ofso) -
Oh(x, u, Oi~o) ~=~ Oz
(3.5)
O1 (eJ, t) and 02(e j, t) represent the second- and higher-order terms with respect to e. The terms Ol(eJ, t) and O2 (e j, t) will be neglected here. The remaining problem is to design the matrix K(z, u), in such a way that the equilibrium e = 0 of (3.3) is asymptotically stable. A solution to this problem, with the assumption that h(x, u) = Cx and ker[C] # 0, was proposed in [1]. Following [1], the matrix K(z, u) takes the form
K(z, u) = r - l fi'(z,
u)CTQ
(3.6)
where the matrix P = pT > 0 is positive definite, and should be assigned such that
R T p of( ....Oxen~176x=z R is a negative definite matrix, where the m a t r i x / J / i s the highest rank right orthogonal matrix to C. The matrix valued function fi~(z, u) is given by
F(z,u)=diag
E
j=l
Ir
+ r
}
i = 1,-.. ,n,
where ~ j is the ij th element of the matrix P ~ 0 x
(3.7)
x=z' and Q is a m a t r i x
satisfying C T Q C - I >_ O. With this selection of the matrix K(z, u), the equilibrium e = 0 is asymptotically stable for the first order approximation [1]. In some cases the use of a constant matrix K(z, u) will be sufficient to guarantee stability of (3.3), as pointed out in [13]. It could be the case, for example, when f(x, u) has a special form i.e. f(x, u) = Ax + g(x)u and the function g(x)u satisfing a Lipschitz condition. In this case the design problem becomes similar to that for Thau's observer [43]. If f(x, u) = f(x) + g(x)u and if the vector-valued functions f(x) and g(x) satisfy some technical conditions, the observer design with constant K(z, u) = K can be achieved in a similar way to the one given in [21].
3. Nonlinear Observers for Fault Detection and Isolation
3.2
Nonlinear
403
Unknown Input Observer Approach
A direct extension of the unknown input observer (UIO) results in linear systems to the nonlinear case was considered in [47]. T h e approach takes advantage of the structure of the system model, which is assumed to be in observable canonical form [6]. In this case, a constant s t a t e t r a n s f o r m a t i o n could be used (as in the linear case), and a complete design procedure can be achieved. The class of systems considered in [47] can be described as follows
= y
Ax + B(y,u) + Ed+ K(x,u)ff
(3.8)
=
where d -- d(x, u, Od) represents the unknown inputs, f f A= Of - Ofo represents a component or an actuator fault to be detected and f~ A=Of~ - Of~o the sensor faults. The question of how to obtain a system representation according to (3.8) from a more general nonlinear system representation by a transformation of the state vector is not of concern in this section. Existance conditions and design considerations of the required transformations have been studied by Birk and Zeitz [8]. A fault-detection observer for the system (3.8) is given by
r
=
Fz+J(y,u)+Gy
=
L l z + L2y.
(3.9)
T h e following conditions on the observer matrices are necessary in order to provide total decoupling from the unknown input d and sensitivity to the fault vector f f .
TA-
FT
=
GC
F stable
J(y,u)
=
TB(y,u)
L I T + L2C =O
,
T E =O
rank{TK(x,u)}
=
rank{K(x,u)}
G If these conditions are fulfilled, the residuals obey the equations
= r
Fe-
GK(x,u)ff +TK~(z,u)f~
----- L i e + L 2 K s ( x , u ) f ~
(3.1o)
404
3. Nonlinear Observers for Fault Detection and Isolation
/x where e = Tx - z. A drawback of the nonlinear UIO (NUIO) approach as formulated in [47] is the difficulty of transforming a general system (2.1) into the required form (3.8). As for the linear case, the existence conditions of the NUIO are also very restrictive.
3.3
The Disturbance Decoupling Nonlinear Observer Approach
An alternative to the NUIO approach, considering a more general class of systems, was proposed in [38, 39]. The basic idea was the same as for the NUIO, but a nonlinear state transformation instead of a linear one is used. The class of systems that can be treated with this approach is described by
= A(x, u) + E(x, u)Od + K(x, u)f/
y = C(x, u)
(3.11)
where the matrices
A(x,u)
~= f(x,u, Ofo,O)
E(x, ~) ~= Of(x, ~, f/o0e + 0/o, Od, )
/~=o.=o
K(x, u) A= Of(x, u, frof/+ Of~ Od,) f/=O,,=O ff A=Of -0/o. The model (3.11) corresponds to a first- order approximation of (2.1) at the set point f/ = 0, 0 d : O. Here, the second- and higher-order terms with respect to f/ and Od are not considered. The first step is to find a nonlinear state transformation z = T(x) in order to decouple the faults from the disturbances. This can be met if the condition OT(x) E(x) Ox
= 0
(3.12)
is satisfied, where it is assumed that E(x, u) = E(x), which implies that the required transformation depends only on the state x (T(x)). Note that T(x) is not a similarity transformation, i.e. the transformed states span only a subspace of the one spaned by x. If the matrix E(x, u) has an explicit dependence on u(t), a transformation depending on the input u(t) (T(x, u)) will be required. Consequently, the
3. Nonlinear Observers for Fault Detection and Isolation
405
transformed system will depend on the derivative of u(t). Some ways of avoiding this difficulty are pointed out in [38, 39]. The necessary and sufficient condition to solve (3.12) is given by the Frobenius theorem [38, 39] and can be formulated as follows: Assume that the rank of E(x) is equal to q for all x and T(x) is a (n - q) vector. There exists a solution T(x) for the equation (3.12) if and only if
rank (E(x)
[e~(x), ej(x)]) = q
(3.13)
for i,j = 1 , . . . ,q and all x, where ei(x) denotes the i th column of the matrix E(x) and [ei(x), ej (x)] is the so-called Lie-bracket (w.r.t.x). If the existence conditions of the nonlinear state transformation are fulfilled, the transformed system can be described as
-
y*
--
cgT(x) [A(x, u) + K(x, u)fi] Ox c*(z,u,y)
(3.14)
where the output has been transformed in order to obtain a new output, depending only on the state z, the input u and the original o u t p u t y. T h e second step is the design of an observer for the reduced system (3.14). Some approaches were considered in [38, 39]: i) A design based on the nonlinear identity observer as described in Section 3.1. ii) If the transformed system (3.14) is in an observable canonical form, the observer can be designed directly as in Section 3.2 with d = 0. iii) The use of a parallel model of the system (i.e., observer without feedback). R e m a r k 3.1 The decoupling condition (3.12) was.formulated independently
using a different approach, the so-called "algebra o.f.functions" [41]. 3.2 A similar approach to the one considered in i) was also introduced in [41]. The observer matrix H is selected in order to stabilize the sensitivity equations of the observer with respect to parameter variations. An optimal index is also given .for the case when the decoupling condition (3.12) is not satisfied. Unfortunately, the result is extremely complex, as pointed out by Shumsky [41]. Remark
406
3. Nonlinear Observers for Fault Detection and Isolation
Note that the approach in i) could be considered as a robustification of the nonlinear identity observer (NIO) presented in Section 3.1.
3.4
Adaptive Nonlinear Observer Approach
One problem of the observer-based methods for fault diagnosis is their weakness in detecting slowly developing faults, especially when model uncertainties are present [15]. An adaptive observer-based residual generator approach to overcome this difficulty has been proposed independently in [7] and in [12]. In [7] a nonlinear observer is used in order to detect a leak in a pipeline, and the nonlinear open-loop observer's performance is improved with the adaptation of a friction coefficient. In [12] a more general case is considered. Based on a modified version of the observer proposed in [5], an adaptive residual generator is developed. Following [12], consider a nonlinear system described by
y
=
a(x) + q0(x, u) + Q(x, U)Od + G(x, u)f + g(t)
=
c(x)
x0 = x(0).
(3.15)
where the output y is considered scalar for sake of simplicity, a : ]an --~ ]a,~, Q : ]R n X ]R rn ~ ]~nxl,#, G : ] a n X ] a m ~ ] a n xl, g : ] a ~ ] a n and c : ]a~ ~ ]a are assumed to be known and smooth enough, f represents abrupt changes, and 0 d C ]aid is an unknown vector which represents, for example, unknown time-varying parameters, slowly varying faults or part of the nonlinearities of the system. Assume that
o <
IlOll -<
M <<
c~
The adaptive residual generator is given in two steps: first, a transformation ~ = T(x) C ]ak, k <_ n has to be found (if possible), defined on a neighbourhood of the initial state x0 such that:
y
=
[ 0
..-
0
1 ]~
(3.16)
and rank[Z(y, n)] = 1. Necessary and sufficient conditions for the existence of the transformation ~ = T(x) can be found in [12]. T h e second step consists of designing the adaptive residual as
3. Nonlinear Observers for Fault Detection and Isolation
=
407
FZ+Oo(y,u)+qg(y,u)~+Lly
+ [ v(t) z~(t)
r(t)
=
y(t) -
~(t)
=
rcRt)r(t)
~/(t)
=
R Y ( t ) +C~k(y,u),
=
kTV(t)+r
r
Y(0) ----0
where F is a positive definite matrix, R is a stable matrix, the elements lj of L1 are assigned such that s n + 1,~s'~-1 + ... + 12s + ll -- 0 is Hurwitz, k T = [ 0 -.. 0 1 ]. Let a matrix Q satisfy
Q F Q -1
R kT
=
•• ]
(3.17)
then
Qg(Y'U)
A= [ C~n(y'u)
E ]~nxl
(3.18)
and the conditions 1. r is bounded, ~)(T) is bounded except possibly at a countable number of points 2. 3 a, ~ such that 0 < a I < f:+Z
(~(T)~)T(T)dT
3. 3 M1 such that IY(t)OI < M1 < c~ are required in order to guarantee that ]y - zkl < K < cx~ Vt. It is usually assumed that 11011is small. Thus, it is reasonable to suppose that the estimation errors are restricted to a small range around zero [12]. A different approach is considered in [48]. It is based on a modified version of the nonlinear observer proposed recently in [21]. T h e systems considered are all observable (in the sense of [21]), and transformable into the following form
= y where
F'(x) + G'(x)u + T'(y, u)Od + p ( x ) f
---- Cx
(3.19)
408
3. Nonlinear Observers for Fault Detection and Isolation
Xn, r
F'(x)
=
[X2 , ' ' ' ,
a'(x)
=
[gl(Xl) ,''"
=
[1 0 ..- 0]
C
9. .
0
,--~1
g n ( X l ' ' ' X n ) ] T,
,
T1
9
.
9
T8
9
0
.
0
T,
.
.-.
0
0d E ] a S Sc~ will b e d e f i n e d l a t e r . I f Od ~- f : O, t h e observability g u a r a n t e e s the existence of the transformation [21]. For sake of simplicity, the SISO case is considered here; the reader is referred to [48] for the M I M O case. Under the assumption that the functions gi are Lipschitz, an adaptive observer for the system (3.19) is given by
r k
0d
=
F ' ( z ) + G ' ( z ) u + T'(y, u)Od + S L 1 C T [ z , - y]
=
zl - y
(3.20)
=
ff[zl-y]
where S ~ is the solution of 0 = - ~ / S ~ - A T S ~ - S ~ A + C T c and A is the anti-shift operator A : lR '~ --* IRn, Ai,j --- 6~,j-1. Unlike [12], in [48] no mention is made concerning to the behaviour of the adaptive observer (3.20) if 0 d is time-dependent.
3.5
The Nonlinear
Fault Detection
Filter Approach
In this section, an extension of the fault-detection filter for linear systems to a class of nonlinear systems is considered. As for the linear case, detection filters are output observers that produce residuals with directional properties [20]. Based on a T h a u - t y p e nonlinear observer, the results given in [45] for the fault-detection filter for linear systems are generalized in [20] for a class of nonlinear systems. Assume t h a t a nonlinear system is represented as 8
=
+ fix) +
+ i:1
y
=
Cx
(3.21)
where F{ E IRn, s is the number of p a r a m e t e r changes (faults fi = 0i{-010{) considered, r is a function of x(t) a n d / o r u(t) depending of the p a r a m e t e r
3. Nonlinear Observers for Fault Detection and Isolation considered, and x(t), y(t) and u(t) are as in (2.1). Fir follows:
Fir
=
409
is defined as
O[Ax +Oofif(x)+ Bu] oi,=Olo,
(3.22)
The representation used here is slightly different from t h a t used in [20] in order to maintain uniformity of presentation in this chapter. Defining the matrix F as:
F = IF1 F2 --. Fs]
(3.23)
and following [20], some assumptions are required: 1. The nonlinear t e r m f(x) is Lipschitz, i.e. I I f ( x l ) - f( )ll -< llxi - x211 2. (A, C ) i s observable
3. C F is of rank s. This means t h a t the relative degree of the fault vector with respect to the output is equal to one. Consider the next T h a u - t y p e observer for the system (3.21)
r
=
Az+f(z)+Bu+H[y-Cz
=
y - Cz
l (3.24)
where the matrices
H
=
Qh
=
Qh[CF] ~ + H E [ I - CF[CF]}]C AF1 - E
I I I ~j~jvj
j=l
Fi
=
~
s
AF=
s
s
j=I
a~v~ ii
j=l
HF
=
1pcT[I-
CF[CF]']Tc
(3.25)
and )~i, ~i are the eigenvalues and eigenvectors of the m a t r i x AH = A - H C respectively, e > 0 is a positive constant such that there exists a s y m m e t r i c and positive definite matrix P t h a t the modified Riccati equation
410
3. Nonlinear Observers for Fault Detection and Isolation
0
=
A H P + pATH + P[I~/2 - 1 C T C F ] P + I + Ie
(3.26)
is satisfied, a/ are constants such that Fi can be represented by the sumn mation ~ j = l ~J~'J, CF = [I -- CF[CF]t]C, and the superscript ~ means the pseudo-inverse of a matrix. Note that the solutions of (3.25) and (3.26) can only be obtained by an iterative algorithm. Note that (3.26) given in [20] has a misprint. R e m a r k 3.3 The Thau-type observer (3.24) .for sensor fault detection and
isolation was presented in [29]. In this ease the selection of the matrix H is simplified because no directionality is required. A different approach, including a more general class of systems, was considered in [42]. This approach is based on a generalization of the notion of (h,f)-inva riance using the so-called "algebra of functions" [41]. T h e same idea, but utilizing the disturbance decoupling approach, was proposed in [38]. Based on the invariance principle (or on the disturbance decoupling approach), a set of state transformations for the system (2.1) is defined. Each transformation maps the state of (2.1) (or the system (3.11)) into a subsystem that depends only on one fault or on a set of faults, and is robust to the rest of them. At this point, nonlinear identity observers are used to build a bank of residuals that have the desired fault directionality.
3.6
O b s e r v e r f o r F a u l t D i a g n o s i s in B i l i n e a r S y s t e m s
Fault-diagnostic methods for bilinear systems have been studied only in recent years, maybe because sometimes the linear approaches (such as the UIO) can be extended to bilinear systems, or because bilinear systems are a special case of more general nonlinear systems. The study of bilinear systems is, however, important, because a set of physical systems (nuclear reactor systems, suspension systems, fermentation processes, hydraulic drives, heat exchange systems, etc.) c o u l d b e m o d e l e d by bilinear equations [32]. Further, it is possible to take advantage of the special model structure in order to improve the design of the residuals. Different approaches have been proposed [25, 40, 48, 49]. In [40, 48, 49] the unknown input fault detection observer approach (in different versions) is extended to bilinear systems. In [25] a more general (and maybe a more realistic) class of bilinear systems was considered. The approach includes systems represented by
3. Nonlinear Observers for Fault Detection and Isolation
p
ic
=
2
A j u j x + ~ ( u , y ) + E ( E ~ x + F~)fi
Aox + E j=l
y
=
411
i=1
(3.27)
Cx.
For sake of simplicity, like in [25], only two possible faults are considered here. The approach given in [25] is reviewed. A fault-detection filter for the system (3.27) is given by P
=
Aoz + E [ A j u j z
+ Bjujy] + Pq2(u,y) + boy + R - 1 L ~ [ L l y - L2z]
j=l
r
=
L l y - L2z
R
=
-OR - A~R - Rs
+ L~L2
P j=i
if the conditions A ~ P - PA~ + [?iC
=
0
L1C - L 2 P
=
0
=
o
P [
F2 ]
with L1, L2 and P non-zero, and u(t) a 0-strictly persistent input [25] for the system P
j=l
q
=
(3.28)
L2rl,
are satisfied. Here u(t) is said to be a 0-strictly persistent input for the system (3.28) if: 3t0 > 0, 3a > 0, such that for any t > to
fo t
e--O(t--s) ~)u ( ~ - 8) T c T C~)u(f;
where I is the mxn identity matrix, r and O must be positive real.
-
s)ds
>_ o~ TI
is the transition matrix of (3.28)
412
4
3. Nonlinear Observers for Fault Detection and Isolation
Nonlinear Observer Design via Lipschitz Condition
Consider a class of nonlinear systems described by E2
=
Ax + Bu + /(x,u)
(4.29)
y
=
Cx
(4.30)
where x E ]R n, y E IRp, u E IR m and f ( x , u ) E IR q and the matrices A, B, C and E have appropiate dimensions. The matrice A, B, C and E are known. In this section, each component of the nonlinearity carl be a nonlinear function on the state and the input too. First, an observer is designed for the system (4.29), (4.30). Then its stability is discussed. It is shown in which cases this observer can be applicable. Finally the residual generation for fault detection and isolation is presented. A nonlinear observer for a class of nonlinear regular systems was presented by [24]. [36] discussed the same observer as [24], but better results of the upper bound of the Lipschitz constant were obtained by [36]. Generalizing the Lyapunov-like equation [37], the upper bound of the Lipschitz constant can be augmented/greater. The observer design was discussed also for nonlinear singular systems. [10] presented a reduced order observer for nonlinear system, which is independant on the control variable. A m e t h o d to reconstruct the whole state of a class of nonlinear singular systems is given in [19]. In this section, the gain matrix of the observer presented by [19] will be obtained with a more general Lyapunov-like equation. So a better solution for the upper bound of the Lipschitz constant can be found. 4.1
Observer Presentation
So that the observer can be designed, the following three hypotheses must be satisfied. 9 The row vectors of the matrices C and E must be a basis of the n-dimensional vector space:
rank
C
= n
9 The linear part of the system has to be observable:
rank
sE - A) C = n
(H1)
3. Nonlinear Observers for Fault Detection and Isolation
413
9 The nonlinearity f ( x , u) satisfies a Lipschitz condition, which requires that there exists a positive constant, e, such that IIf(3c, u) -
f(x,u)ll
< ell~ - yl[
(H3)
Moreover, if the measurement matrix has full row rank, i.e.: rank(C) = p
(4.31)
is satisfied, the matrix computation is much easier. Under these hypotheses the following procedure can be used to design an observer for FDI. P r o p o s i t i o n 3.1 The parameterized system equation (~,~) =
Nz+Ly+Gu+Rf(3c,
u)-P-1CT(~)-y)
(4.32)
5c =
z + Ky
(4.33)
~) =
C~
(4.34)
where k, z E IR n, ~) E IRp and f(&, u) C IR q and the matrices N, L, G, R, K and P have appropriate dimensions, is a stable observer of the nonlinear system (~.29), (4.30), where the Lipschitz constant E must hold the following inequality: < e0(a, ~)
(4.35)
with Eo(a, ,~) = Ami,~((2 - a ) c T c + ~P)
2~max(Pn)
(4.36)
and the matrices satisfy the .following conditions: N - p-1cTc G-RB NRE+LC-RA RE + KC
stable
(4.37)
=
0
(4.38)
=
0
(4.39)
=
In
(4.40)
and the matrix P, depending on the parameters a and ~, is the solution of the Lyapunov equation /u
_{_P N - o~cTc -}- ~ P = 0
(4.41)
Note that the parameters ((~,~) have to be ckosen so that the matrix P is positive definite and the condition (4.35) is satisfied.[:]
414
3. Nonlinear Observers for Fault Detection and Isolation
P r o o f . The estimation error is defined as:
e =
2 -
x
(4.42)
With the estimated state (4.33) and the condition (4.40), the estimation error becomes
(4.43)
e = z - REx
Taking into account (4.32) and (4.29), the time derivative of the estimation error becomes
=
Ne + (LC-
RA-
NRE)x
+ (G-
RB)u
+ R ( f ( Y : , u ) - f ( x , u ) ) - P - ~ c T ( ~ ) -- y)
(4.44)
Using the matrix conditions (4.38) and (4.39), the error dynamics are governed by the following equation
= (N - P-1cTC)e
+ R ( f ( 2 , u) - f ( x , u))
(4.45)
To discuss the stability, the direct method of Lyapunov is applied. Consider the following Lyapunov function (4.46)
V = eTpe
This function V is positive definite if and only if the time constant m a t r i x P is positive definite, i.e., if the eigenvalues of P are positive. So the second step is the discussion of the negative definiteness of the time derivative. Taking into account the error dynamic (4.45) and the Lyapunov equation (4.41), the time derivative becomes
II = --eT((2 -- a ) C C + ~ P ) e + 2 e T p R ( f ( : ~ , u) -- f ( x , u)) The of P R , if E is by the
(4.47)
second term must be overestimated by the greatest singular value because R can only be a square matrix if the system is regular, i.e., a square matrix with full rank. The first term can be overestimated smallest eigenvalue of the matrix:
T2
=
--
§
)CC +
,
P)llelr:
u) - f(x, u)l I
(4.48)
3. Nonlinear Observers for Fault Detection and Isolation
415
Applying the Lipschitz condition (H3), the time derivative of the Lyapunov function can be overestimated as follows:
9 = (--Am,~((2 -- a ) C C + ~P) § 2e~maxPR)Ne[I 2
(4.49)
Now it can be concluded t h a t the time derivative of the L y a p u n o v function is positive definite, if the condition (4.36) is satisfied. [] An observer for the above nonlinear systems can be designed, if the nonlinearity satisfies locally the Lipschitz condition. It has been proved above t h a t the a s y m p t o t i c stability holds if the Lipschitz condition (H3) is satisfied. There are two p a r a m e t e r s to design the observer. T h e main problem is the stability, but if there are different pairs of the p a r a m e t e r s a and ~ which satisfy the stability condition, the dynamics of the observer can be a second criterion to choose the parameters. A further interesting point will be the conditions of the p a r a m e t e r s a and ~ so that the positive definiteness of the L y a p u n o v m a t r i x P can be guaranteed. Note that the matrix c o m p u t a t i o n of the proposed observer satisfying the conditions (4.37)-(4.40) is presented in [19]. Normally, a system is not singular. But it can be said t h a t a model of a system is not fully known. In the next section details the kind of systems to which the proposed observer can be applied.
4.2
C o n t r i b u t i o n of this O b s e r v e r
A nonlinear observer for nonlinear, singular systems was presented in section 4.1. This observer is a generalization of the observer presented by [19, 24, 36, 37]. T h e proposed observer can be applied for failure diagnosis of nonlinear, singular systems. It can be shown t h a t this observer is also applicable for nonlinear systems with unknown inputs, which can be described by the following equations:
ic =
Akx + Bku + fk(x,u) + Dkd
(4.50)
y
cx
(4.51)
=
where d C lR n-q is the vector of unknown inputs and q has to be smaller t h a n n. D is a known matrix, whose rank is equal to ( n - q ) . If there exists a m a t r i x Ek, so t h a t EkDk = 0 and the hypotheses of the presented observer for the nonlinear singular system are all satisfied, the observer can also be used to reconstruct the state of the nonlinear system under unknown inputs.
416
3. Nonlinear Observers for Fault Detection and Isolation
The analytical equation of the nonlinear system with unknown inputs can be transformed into the equations of the nonlinear singular system by multiplying (4.50) from the left by the matrix Ek
Ekic = E k A k x + E k B k u + E k f k ( x , u) + E k D k d
(4.52)
Taking into account that E k D k = 0, (4.52) can be compared with the differential equation of the singular system (4.29). The following relations can be obtained
E
=
Ek
(4.53)
A
=
EkAk
(4.54)
B
=
EkBk
(4.55)
f(x,u)
=
Ekfk(x,u)
(4.56)
If the condition E k D k =- 0 is satisfied, it can be concluded, that the proposed observer can be well applied for a class of nonlinear regular systems, nonlinear singular systems and nonlinear systems with unknown inputs. Now it will be shown that the state can be well reconstructed applying the proposed observer for a class of nonlinear uncertain systems. Consider the nonlinear uncertain system described by the following equations:
5c =
(A~ + A A u ) x + (B~ + AB~,)u + f ~ ( x , u )
(4.57)
y
Cx
(4.58)
=
Under the constraint that the uncertain matrices AA and A B are parameterized as in [35]:
AA~, = D~,VFI
(4.59)
ABe, -- D~,VF2
(4.60)
Taking into account the parameterization of the uncertain matrices (4.59) and (4.60), the differential equation of the nonlinear uncertain system can be written in the form:
ic
=
A~,x + B~u + f~(x, u) + D ~ , ( V F l x + VF2u)
(4.61)
Comparing this differential equation with the differential equation of the nonlinear system with unknown inputs (4.50), it can be concluded that these two equations are equal if the unknown inputs are defined as follows:
3. Nonlinear Observers for Fault Detection and Isolation
d -- ( V F l x + VF2u)
417
(4.62)
This shows t h a t the proposed observer can be applied for the above class of nonlinear systems, even if the system is uncertain or if it has unknown inputs.
4.3
Residual
Generation
In the last paragraph, an observer for a class of nonlinear singular systems was proposed and different cases were presented of systems to which this observer can be applied. Now the aim is to give an approach for generating the residuals for nonlinear singular systems with faults a n d / o r parameter uncertainties. Consider a nonlinear singular system with faults 8f and uncertainties Od of the form:
Eic
=
A x + B u + f ( x , u ) + 62(x, U)Od + Of
(4.63)
y
=
Cx
(4.64)
where the matrices A, B, C and the nonlinear m a t r i x ~ ( x , u ) are well known. T h e residual of this observer can be defined as follows:
r
--
C(2-x)
(4.65)
where ~ - x is the state estimation error e, and the residual is equal to the estimation error of the output. T h e dynamic of the state estimation error becomes:
--- ~ - R E ~
(4.66)
Taking into account the system dynamics with faults (4.63), the observer dynamics (4.32) and the matrices conditions (4.37)-(4.40), the state estimation error becomes:
= (N - P-1cTC)e
+ R ( f ( ~ , u ) - f ( x , u ) ) - Rq2(x,u)Od -- ROf (4.67)
Determining this proposed residual, faults a n d / o r uncertainties can be detected.
418
5
3. Nonlinear Observers for Fault Detection and Isolation
Conclusions
In this chapter the different approaches to the design of nonlinear observers for residual generation for FDI in nonlinear systems have been briefly reviewed. The survey also incorperates some recent results obtained with a nonlinear observer that has been designed for a class of nonlinear singular systems. This observer is designed with a Lyapunov-like equation with two degrees of freedom. This allows the determination of an upper bound of the Lipschitz constant better than in [19]. The whole state of the system can be reconstructed if the three hypotheses given in the chapter are satisfied and if only part of the process is modeled. This observer design can also be applied to FDI of nonlinear systems with unknown inputs or for a class of nonlinear systems with uncertainties. Note however, that the relationship of of the fault sensitivity with the degrees of freedom is still an open problem. As can be seen, the fault detection problem for nonlinear systems is still neither generally nor completely solved. This chapter presents ideas for residual generation with nonlinear observers under the restriction to certain classes of nonlinear systems. Also, the fault isolation problem is a further interesting issue, in which there are still many open questions, because of the well known difficulties associated with the design of nonlinear observers not only for feedback control but also for fault diagnosis. 6
REFERENCES [1] H. K. Adjallah, D. Maquin and J. Ragot. Non-linear Observer-Based Fault Detection. 3 rd IEEE Conference on Control Applications, United
Kingdom, pp. 1115-1120, 1994. [2] E. Alcorta Garcfa and P. M. Frank. On the Relationship Between Observer and Parameter Identification Based Approaches to Fault Detection. 13th World Congres o.f IFAC, San Francisco, USA, Vol. N, pp.
25-29, 1996. [3] E. Alcorta Garcia and P. M. Frank. Deterministic Nonlinear Observerbased Approaches to Fault Diagnosis: A Survey. Control Engineering
Practice, Vol. 5, pp. 663-670, 1997. [4] M. Basseville. Detecting Changes in Signals and Systems - A Survey.
Automatica, Vol. 24, pp. 309-326, 1988.
[5]
G. Bastin and M. R. Gevers. Stable Adaptive Observers for Nonlinear Time-varying Systems. IEEE Transactions on Automatic Control, Vol.
33, pp. 650-657, 1988. [6] D. Bestle and M. Zeitz. Canonical Form Observer Design for Nonlinear
3. Nonlinear Observers for Fault Detection and Isolation
419
Time- invariant Systems. International Journal of Control, Vol. 38, pp. 419-431, 1983. [7] L. Billmann and R. Isermann. Leak Detection Methods for Pipelines. Automatica, Vol. 23, pp. 381-385, 1987. [8] J. Birk and M. Zeitz. Extended Luenberger Observer for Non-Linear Multivariable Systems. International Journal of Control, Vol. 47, pp. 1823-1836, 1988. [9] M. Blanke, R. Izadi-Zamanabadi, S. A. Bogh and C. P. Lunau. Fault Tolerant Control Systems - a Holistic View. Control Engineering Practice, Vol., pp., 1997. [10] M. Boutayeb and M. Darouach. Observer Design for Non Linear Descriptor Systems. 34th Conference on Decision and Control, New Orleans, USA, pp. 2369-2374, 1995. [11] G. Delmaire, J.-P. Cassar and M. Staroswiecki. Identification and Parity Space Techniques for Failure Detection in SISO Systems Including Modelling Error. 33 rd Conference On Decision and Control, Florida, USA, pp. 2279-2285, 1994. [12] X. Ding and P. M. Frank. Fault Diagnosis Using Adaptive Observers. SICICI'93, Singapore, 1992. [13] P. M. Frank. Advanced Fault Detection and Isolation Schemes Using Nonlinear and Robust Observers. 10th World Congress on Automatic Control IFAC'87, Vol. 3, pp. 63-68, 1987. [14] P. M. Frank. Fault Diagnosis in Dynamic Systems Using Analytical and Knowledge- based Redundancy - A Survey and some new Results. Automatica, Vol. 26, pp. 459-474, 1990. [15] P. M. Frank. On-line Fault Detection in Uncertain Nonlinear Systems Using Diagnostic Observer: A Survey. International Journal of Systems Science, Vol. 25, pp. 2129- 2154, 1994. [16] P. M. Frank and X. Ding. Frequency Domain Approach to Optimally Robust Residual Generation and Evaluation for Model Based Fault Diagnosis. Automatica, Vol. 30, pp. 789- 804, 1994. [17] P. M. Frank. Application of Fuzzy Logic to Process Supervision and Fault Diagnosis. IFAC Safeprocess, Finland, pp. 631-538, 1994. [18] P. M. Frank. Analytical and Qualitative Model-based Fault Diagnosis - A Survey and Some New Results. European Journal of Control, Vol. 2, pp. 6-28, 1996.
420
3. Nonlinear Observers for Fault Detection and Isolation
[19] B. Gaddouna Ouladsine, G. Schreier and J. Ragot. Asymtotic Observer for a Nonlinear Descriptor System. CESA IMA CS, Symposium on Control, Optimization and Supervision, France, pp. 374-379, 1996. [20] V. Garg and J. K. Hedrick. Fault Detection Filters for a Class of Nonlinear Systems. American Control Conference, Seattle, USA, pp. 1647-1651, 1995. [21] J. P. Gauthier, H. Hammouri and S. Othman. A Simple Observer for Nonlinear Systems, Applications to Bioreactor. IEEE Transactions on Automatic Control, Vol. 37, pp. 875-880, 1992. [22] J. Gertler. Model Based Fault Diagnosis. Control-Theory and Advanced Technology, Vol. 9, pp. 259-285, 1993.
[23] J. Gertler. Disgnosing Parametric Faults: Form Parameter Estimation to Parity Space. American Control Conference, Seatle, Washington, USA, pp. 1615-1620, 1995. [24] M. A. Hammami. Stabilization of a Class of Nonlinear Systems Using an Observer Design. 32 nd Conference on Decicion and Control, pp. 1954-1959, 1993. [25] H. Hammouri, M. Kinnaert and E. H. E1 Yaagoubi. Fault Detection and Isolation for State Affine Systems. European Journal of Control, Vol.4, pp. 2-16, 1998. [26] D. Hengy and P. M. Frank. Component Failure Detection Using Local Second-Order Observers. IFAC Workshop, Kyoto, Japan, 1986. [27] R. Isermann. Process Fault Detection Based on Modeling and Estimation Methods A Survey. Automatica, Vol. 20, pp. 387-404, 1984. tection and Isolation. European Control Conference, pp. 1970-1974, 1993. [28] M. Kinnaert, Y. Peng and H. Hammouri. The Fundamental Problem of Residual Generation for Bilinear Systems up to Output Injection. European Control Conference, Italy, pp. 3777-3782, 1995. [29] V. Krishnaswami and G. Rizzoni. A Survey of Observer-Based Residual generation for FDI. IFAC Safeprocess, Finland, pp. 34-39, 1994. [30] J.-F. Magni. On Continuous Time Parameter Identification by using Observers. IEEE Transactions on Automatic control, Vol. 40, pp. 1789-1792, 199s [31] L. A. Mironovskii. Functional Diagnosis of Dynamic Systems. Automation and Remote Control, pp. 1122-1143, 1980.
3. Nonlinear Observers for Fault Detection and Isolation
421
[32] R. R. Mohler and W. J. Kolodziej. An Overview of Bilinear System Theory and Applications. IEEE Transactions on Systems, Man and Cybernetics, Vol. SMC-10, pp. 683-688, 1980. [33] W. Nuninger, F. Kratz and J. Ragot. Structural Equivalence Between Direct Residuals Based on Parity Space and Indirect Residuals Based on Unknown Input Observer. IFA C Safeprocess, United Kingdom, pp. 462-467, 1997. [34] R. J. Patton. Robust Model-based Fault Diagnosis: The State of the Art. IFAC Safeprocess, Finland, pp. 1-24, 1994. [35] I. R. Petersen. A Stabilization Algorithm for a Class of Uncertain Linear Systems. Systems and Control Letters, Vol. 8, pp. 181-188, 1987. [36] G. Schreier, J. Ragot, R. J. Patton and P. M. Frank. Observer Design for a Class of Nonlinear Systems. IFAC Sa.feprocess, United Kingdom, pp. 498-503, 1997. [37] G. Schreier, P. M. Frank and F. Kratz. Stability Discussion of an Observer for a Class of Nonlinear Systems. IAR Annual Conference, France, pp. 68-75, 1998. [38] R. Seliger and P. M. Frank. Fault Diagnosis by Disturbance Decoupled Nonlinear Observers. 30 th Conference on Decision and Control, England, pp. 2248-2253, 1991. [39] R. Seliger and P. M. Frank. Robust Component Fault Detection and Isolation in Nonlinear Dynamic Systems. IFA C Safeprocess, Germany, pp. 313-318, 1991. [40] D. N. Shields. Quantitative Approaches for Fault Diagnosis Based in Bilinear Systems. IFAC, 13th Triennial World Congress, pp. 151-156, 1996. [41] A. Ye Shumsky. Failure Detection and Isolation in Nonlinear Systems Based on Robust Observer Approach. TOOLDIAG, France, pp. 524530, 1993. [42] A. Ye Shumsky. Failure Detection Filter for Diagnosis of Nonlinear Dynamic Systems. IFA C Sa.feprocess, Finland, pp. 335-340, 1994. [43] F. E. Thau. Observing the State of Non-linear Dynamic Systems. International Journal of Control, Vol. 17, pp. ~71-~79, 1973. [44] C. Thybo and M. Blanke. Industial Cost-Benefit Assessment for Fault Tolerant Control Systems. International Conference on Control, Wales Swansea, United Kingdom, pp. 1151-1156, 1998.
422
3. Nonlinear Observers for Fault Detection and Isolation
[45] J. E. White and J. L. Speyer. Detection Filter Design: Spectral Theory and Algorithms. IEEE Tranactions on Automatic Control, Vol. 32, pp. 593-603, 1987. [46] A. S. Willsky. A Survey of Design Methods for Failure Detection in Dynamic Systems. Automatica, Vol. 12, pp. 601-611, 1976.
[47] J. Wfinnenberg. Observer-Based Fault Detection in Dynamic Systems. VDI-Fortschrittsbericht, VDI-Verlag, Reihe 8, Nr. 222, Germany. [48] H. Yang and M. Saif. Nonlinear Adaptive Observer Design for Fault Detection. American Control Conference, Seattle, USA, pp. 11361139, 1995. [49] D. Yu and D. N. Shields. Fault Diagnosis in Bilinear Systems - A Survey. European Control Conference, Italy, pp. 360-365, 1995. Systems Research. 12th IFAC Congress, Sydney, Australia, Vol. 3, pp. ~85-~88, 1993. [50] Z. Zhuang and P. M. Frank. Qualitative Observer and its Application to Fault Detection and Isolation Systems. Journal of Systems and Control Engineering, I MECH E, Vol. 211, Part I, pp. 253-262, 1997. [51] Z. Zhuang, G. Schreier and P. M. Frank. A Qualitative-Observer Approach to Generating and Evaluating Residuals. 37~h Conference on Decision and Control, Florida, USA, pp. 102-107, 1998.
A p p l i c a t i o n of N o n l i n e a r O b s e r v e r s to Fault D e t e c t i o n and I s o l a t i o n H. H a m m o u r i 1, M. Kinnaert 2 and E.H. E1 Yaagoubi 3 1LAGEP, University of Lyon 1, Lyon, France 2Department of Control Engineering, Universit~ Libre de Bruxelles, Brussels, Belgium 3LCPI, ENSEM, Casablanca, Morocco
1
Introduction
Fault detection and isolation (FDI) systems differ from classical a l a r m systems by the fact t h a t they give early warning of faults. Alarm systems essentially process measured signals separately by comparing t h e m to thresholds or by computing their trend. F D I systems take into account the correlation existing between those signals thanks to the use of a m a t h e m a t i c a l model of the supervised process. A typical FDI system is made of two parts, a residual generator and a decision module. The residual generator is a filter with the a c t u a t o r comm a n d s and the measured plant outputs as inputs, which generates a set of signals called residuals. The latter have zero mean in the absence of fault (after the filter transient has vanished), and the m e a n of some of t h e m becomes distinguishably different from zero upon occurrence of specific faults. The decision module processes the residuals in order to decide whether some of t h e m have a mean significantly different from zero (fault detection). Then, by analysing the p a t t e r n of non-zero m e a n residuals, it decides what is(are) the most likely faulty component(s) . This operation is called fault isolation. In this text, only the design of residual generators is considered. T h e r e is a vast literature on this topic, and our aim is not to provide a survey but rather to stress the basic principle behind one approach to residual generation, namely one type of observer based methods. For linear systems, observer based residual generation dates back to the work of Beard [2] and Jones [11]. In their approach, the residuals are the o u t p u t error of a Luenberger observer of which the gain is tuned in a very specific way. Indeed, the particular choice of the observer gain ensures t h a t the residuals take a fixed direction or lie in a specific plane upon occurrence of a given
424
4. Application of Nonlinear Observers to Fault Detection and Isolation
fault. This problem can be seen as a simultaneous assignment of eigenvalues and eigenvectors [21]. Eigensystem assignment has also been used to tackle robustness issues in detection filters [18]. Massoumnia [14] has considered the same problem in a geometric framework. Another approach to the synthesis of observer-based residual generators was developed in [15], [22], [4]. The basic idea on which it relies in order to design a residual which is only sensitive to a given fault is the following. One has to determine from the initial model of the plant a detectable s u b s y s t e m of which the state is not affected by unknown inputs or by faults except for the specific fault to be detected. Next, a Luenberger observer can be designed for this particular subsystem and the o u t p u t error of the observer is a suitable residual. The latter approach is reviewed here for linear systems. Next its extension to nonlinear systems is considered. In particular, the application of high gain nonlinear observers for residual generation is investigated. T h e theory is illustrated by a simulation study on a hydraulic process.
2 2.1
Residual Generation for Linear Systems Problem Statement
We consider the class of continuous time-invariant linear systems described by the following state space model :
x (t)
=
Ax(t) + Bu(t) +
y(t)
=
Cx(t)
_Flvl(t) q- F2v2(t)
where x(t) C X C R n , u ( t ) E L t C R m , y ( t )
(2.1) (2.2)
E Y C R p, v~(t) C ~
C
Rn"~, i = 1, 2. X, L/, y , ?i, i = 1, 2 denote linear vector spaces. In (2.1),(2.2), x(t) denotes the state of the system, u(t), the known input signals, y(t), the measured output signals, vl(t) and v2(t) are unknown functions of time which we call failure modes. In the ith failure mode, the following relations h o l d : vi(t) 7~ O, t > to and vj(.) -- 0 , j ~ i. A, B, F1, F2 and C are known matrices, and we assume without loss of generality t h a t F1 and F2 have full column rank. Different types of faults can be modelled in the framework of (2.1),(2.2). If the dynamics of the actuators are negligible with respect to the process time constants, an actuator failure such as a valve sticking can be described as follows. The j a m m i n g of the first actuator can be modelled with F1 = B.,1, vl(t) = ~ - u ~ ( t ) where B.,ldenotes the first column of matrix B, u~(t), the first component of vector u(t), and ~ is the value at which the control signal is stuck.
4. Application of Nonlinear Observers to Fault Detection and Isolation
425
A leak in an hydraulic system can also be modelled by an additive signal, namely the flow of the leaking fluid. Even a change in the dynamics of the plant could be considered as a fault of the type indicated in (2.1), (2.2), by choosing adequately Fi and vi(t). The simplest problem of residual generator design, called the fundamental problem of residual generation (FPRG) can be stated as follows, for system (2.1), (2.2): (FPRG) Determine a linear time-invariant system with inputs u(t) and y(t), and output r(t) E R q such that : 1) In the absence of fault (i.e. when vi(t) = 0, i = 1, 2),r(t) asymptotically decays to zero. 2)In the second failure mode (i.e. when v2(t) ~ O,t >_ to,to being the fault occurrence time), r(t) asymptotically decays to zero. 3) In the first failure mode (i.e. when vl (t) ~ 0, t _> to, to being the fault occurrence time), r(t) does not asymptotically decay to zero. A restatement of this problem using the terminology of linear system theory is instrumental in the determination of a solution, especially in the framework of geometric system theory. This is the object of the next subsection.
2.2
Second Problem F o r m u l a t i o n
The most general form of linear time-invariant (LTI) system with inputs u(t) and y(t) and output r(t) is:
w (t)
=
Arw(t) + B~u(t) + M~y(t)
(2.3)
r(t)
=
C~w(t) + D~u(t) + Nry(t)
(2.4)
where w(t) E ]/Y. Subsequently, nr and q denote the dimension of w(t) and r(t) respectively. Combining (2.1),(2.2) and (2.3), (2.4) yields:
[;(:/)]
=
o
+ EB F2
F1 (2.5)
v~(t) ] (2.6) Introducing the extended state xe(t) = [xT(t), wT(t)] T, which belongs to 2(e = X| and the extended control signal ue(t) = [uT(t), vT(t)] T, which
426
4. Application of Nonlinear Observers to Fault Detection and Isolation
belongs to/~e = L/~) ]22, (2.5), (2.6) can be w r i t t e n :
xe (t)
=
Aexc(t) + Bcu~(t) + F~vl(t)
(2.7)
r(t)
=
Cexe(t) + Deue(t)
(2.8)
T h e definition of the different matrices is obvious from (2.5), (2.6). We now restate the F P R G as a set of conditions to be fulfilled by (2.7), (2.8). Clearly 1) and 2) in the definition of the F P R G are equivalent to : 1') the map u~(t) --~ r(t) is zero 2') the observable modes of the pair (Co, A~) are asymptotically stable. Several criteria can be considered for condition 3) in the F P R G , as discussed in [15I. As in the latter reference, the requirement that the system relating Vl (t) to r(t) be input observable is imposed here. Remember that the map Vl (t) --* r(t) is input observable if the magnitude vl, of a step like fault vx(t) can be determined uniquely from r(t), t >_ 0 when x~(0) = 0. Subsequently, we consider that, input observability of the map vl(t) r(t) is sufficient to guarantee condition (3) of the F P R G in practice. This yields a new statement for the F P R G : (FPRG1) Determine a system of the form (2.3), (2.4) such t h a t : 1') the m a p u ~ ,
~riszero,
2') the observable modes of the pair (C~, A~) are asymptotically stable, 3') the map f ,
2.3
, r is input observable.
Principle o f the S o l u t i o n
As already announced in the introduction, the solution relies on the determination of a detectable system with Vl as only unknown input, from the original state space model of the plant,(2.1),(2.2). To this end, an output injection map L : y ~ X, and an o u t p u t mixing map H : y ~ y are introduced in order to define the following system class :
x (t)
=
(A + LC) x(t) - Ly(t) + Bu(t) + Flvl(t) + Fuv2(t) (2.9)
z(t)
=
HCx(t)
The major part of the design consists in determining the matrices L and H so that the pair (HC, A + LC) is unobservable, and ImF2 is included in the unobservable subspace of the pair (HC, A + LC). Let h denote the dimension of this subspace. Once such matrices are obtained, there exists a
4. Application of Nonlinear Observers to Fault Detection and Isolation
427
linear change of coordinates x = T 2 such t h a t s y s t e m (2.9) can be w r i t t e n in the s t a n d a r d form for a nonobservable s y s t e m [12] :
~ (t)
--
A l l T l ( t ) - Lly(t) + BlU(t) + FllVl(t)
72 (t)
=
~2171(t) + ~2272(t) -Z2v(t) + ~2~(t) + T21vl(t) + T22~2(t)
z(t)
z
~171(t )
(2.10)
(2.11)
(110 !
where 71 E R (n-h), 72 C ~ h , a n d
T - 1F2 ~_
~22
; T - 1L -
--
; H C T = CO1
O)
T h e first (n - h) rows of T-1F2 are null since ImF2 lies in the unobservable subspace of the pair (HC, A + LC). Moreover, the pair ( ~ 1 , ~ 1 1 ) is observable by construction. This last r e m a r k implies t h a t we can build a linear observer for e s t i m a t i n g 71(t) from (2.10),(2.11) when vl(t) ----O,t >_0 : w (t) = ~ l l W ( t ) - -ily(t) +-Blu(t)
_ +K(z(t) - Clw(t)),
(2.12)
We claim t h a t the o u t p u t r e c o n s t r u c t i o n error :
r(t) -- z(t) - C l W ( t ) : ~ l ~ ( t )
(2.13)
is a suitable residual provided some additional condition are fulfilled to ensure 3') in F P R G 1 . Indeed, notice t h a t E(t) is governed by: (t) = ("All - K ~ I ) c(t) + ~ l l V l ( t )
(2.14)
Hence r(t) a s y m p t o t i c a l l y decays to zero w h e n vl ----0. To fulfil F P R G 1 , the m a p vl --* r m u s t be input observable. It can be shown t h a t this condition is verified provided,
ImF1 A S(HC, A + LCIImF2) = 0 where $ ( H C , A+LCIImF2) denotes the unobservable subspace of ( H C , A+ LC) (containing ImF2). Notice t h a t $(HC, A + LCIImF2 ) is a (C, A) unobservability subspace (u.o.s.) containing ImF2. Indeed, a subspace 7r is a (C, A) u.o.s, if it is the unobservable subspace of a pair (GC, A + MC) for some p • p a n d n • p matrices G a n d M[14]. It can be shown t h a t the set of u.o.s, containing IrnF2 has an infimal element, S*. It t u r n s o u t t h a t necessary a n d sufficient conditions for F P R G 1 to have a solution can be expressed in t e r m s of this subspace, n a m e l y :
428
4. Application of Nonlinear Observers to Fault Detection and Isolation
Theorem
4.1 [15] FPRG1 has a solution if and only if
S* (7 ImF1 = 0
(2.15)
where 8" := in f S(C, A; ImF2) is the smallest (C, A)-unobservability subspace containing ImF2. Moreover, if (2.15) holds the dynamics of the residual generator, i.e. the eigenvalues o.f All - K C 1 in (2.12), can be assigned arbitrarily.
3 3.1
Residual Generation for Nonlinear Systems Introduction
T h e basic idea behind the design of residual generators for linear systems can be extended to nonlinear systems, provided the a p p r o p r i a t e nonlinear notions are used. One of the problems that arises in the extension is the design of an asymptotic observer for the nonlinear system from which the residual is deduced as the output reconstruction error. One has to restrict the considered systems to a specific class to ensure the existence of an asymptotic observer. Here observers for uniformly observable nonlinear systems will be used. Observers with linear error dynamics have been used by Seliger and Frank [19]. Other classes of nonlinear systems have been considered elsewhere, such as bilinear systems [23], [13], and state affine systems [7]. The remaining part of this section is organised as follows. A review of some basic notions from observability theory for nonlinear systems is presented. Next, nonlinear observers for uniformly observable systems are described. Finally those preriquisites are applied to design nonlinear residual generators, and the theory is illustrated by a simulation on a hydraulic process.
3.2
Basic Notions
For the sake of simplicity, we only consider control a n n e nonlinear systems :
=
f(x) +
gi(x)u~
(3.16)
i=1
y
=
h(x)
=
(hl(x),...,hp(x))
where x(t) E ll~n,u(t) -- ( u l ( t ) , . . . ,urn(t)) r e /7 a mesurable subset of 1Rm, y(t) E ~ P are respectively the state, the input and the o u t p u t of the dynamical system (3.16). System (3.16) is said to be observable if and only if, for every pair of initial states, (x, 2), x r ~, there exist an admissible control u : [0, T]
4. Application of Nonlinear Observers to Fault Detection and Isolation
429
/~ and a t i m e instant t 9 [0, T] such t h a t y ( x , u , t ) ~ y ( 2 , u , t ) , w h e r e y(x, u, t) = h(xu(t)), and xu(t) is the unique t r a j e c t o r y of (3.16) such t h a t xu(0) = x. If such an input u exists, we say t h a t u distinguishes (x, 2). A n input u : [0, T] ~ /~/, which distinguishes every ( x, 2), x ~ 2 is said to be universal on [0, T]. S y s t e m (3.16) is said to be uniformly o b s e r v a b l e if, for every T > 0, every admissible control u : [0, T] --* U is a universal i n p u t on [0, T]. T h e observation space O(h) of s y s t e m (3.16) is defined as t h e smallest vector space containing hi,. 9 , hp and closed under the Lie derivative L x , where X stands for the vector fields f, g l , . . . ,gin. This space allows to define a geometric notion of observability, n a m e l y the r a n k observability condition. S y s t e m (3.16) is observable in the sense of r a n k at a fixed x 9 ~ n if d i m dO(h)(x) = n where dO(h)(x) = {dT(x);T 9 O ( h ) } ( d is the classical differential operator). This notion extends to nonlinear s y s t e m s the K a l m a n r a n k observability condition for linear systems. For m o r e details on this topic see [9].
3.3
High Gain Observers for Uniformly Observable Systems
In [3] (for a short proof see [5]), single-output nonlinear s y s t e m s which are uniformly observable are characterized. To describe this result, let L I (h)(x) denote the Lie derivative of a scalar function h w.r.t, the vector field f , as already m e n t i o n e d above, and let L } ( h ) ( x ) = LI(Li] - l ( h ) ( x ) ) . If 9 : x --* (h(x), L f ( h ) ( x ) , . . . , L ~ - l ( h ) ( x ) ) T = z is a local diffeomorphism, and if s y s t e m (3.16), with y E /R, is uniformly observable, t h e n t r a n s f o r m s locally s y s t e m (3.16) into the following canonical form :
I
A z ~- ~(z) ~- E i : l ~ti~l[i(Z)
=
Cz
(3.17)
(010 0) y
where
m
~-
A =
"..'"
". ".
o
0
, C=(1,0,...
,0), ~ ' ( z ) =
0
1
...
o
"yn(Z)
(3.18) and ~ i ( z ) = [~/il(Zl), ~I/i2(zl, z2), 9 9- , ~in(Z)] T (i.e. q2ij(z) = q 2 i j ( z l , . " , zj)). Under the hypothesis t h a t ~, a n d the ~ i ' s are global Lipschitz, an observer for (3.17) can take the form :
m =
+
+
-
i=1
-
y)
(3.19)
430
4. Application of Nonlinear Observers to Fault Detection and Isolation
where Se is the unique solution of the algebraic Lyapunov algebraic equation : OSe -4- A T se + S e A = c T c
(3.20)
More generally, consider the triangular form : { ~ y
--=
(3.21)
Az+~(t,u,z) Cz
where A and C are defined by (3.18), and the i th component q2~(t,u, z) of ko(t, u, z) is such that ff2i(t, u, z) = ~ i ( t , u, Z l , . . . , zi). Moreover, assume that ~ fulfils hypothesis H1) below: H1) ko is global Lipschitz w.r.t, z, locally w.r.t, u and globally w.r.t, t, i.e. Va > 0; 37 > 0; Vz, z' E /Rn; Vt > 0; Vu E /R m, Null < a, the following inequality holds: I[uo(t, u, z) - ~(t, u, z')II < 711z - z'll Then, an observer for (3.21) can take the form : = A ~ + @(t, u, 2,) - S [ ~ c T ( c 2 ,
- y)
(3.22)
where Se is given by (3.20). More precisely, the following result, which is a slight extension of the work reported in [5], holds : T h e o r e m 4.2 Va > 0; 300 > 0; V0 _> 00; 3Ae > 0; 3#0 > Os.t. Jt2,(t) - z(t)ll _<
- z(0)Jf
.for every bounded admissible control with upper bound a. Moreover, #e --~ +oo as 0 --~ +oo.
This result can be extended to the more general triangular form [8]: { ~ y
= =
A(t)z+~2(t,u,z) Cz
(3.23)
where 0
al(t) "-
A(t) =
0 ".
... "-
0 0
"
".
a._,(t)
0
...
0
J
C = (1,0,.--,0),
and the i th component ~i(t, u, z) of ~(t, u, z) is such that ~i(t, u, z) = 9 i(t, u, z l , . . . , zl). To achieve this goal, let us introduce the following hypothesis :
4. Application of Nonlinear Observers to Fault Detection and Isolation
431
H2) hi, i = 1 , . . . , n - 1 are k n o w n differentiable functions w i t h u n k n o w n derivatives, a n d there exist e > 0, M > 0, M ~ > 0 such t h a t , for every t > 0, e < [ai(t)[ < M a n d Idai(t)l <_ M ' for i = 1 , . . . , n - 1. U n d e r hypotheses H1) a n d H2), our candidate observer for (3.23) takes the form ;
= A(t)~ + ~(t, u, ~) - A - I ( t ) S [ 1 c T ( c ~
- y)
(3.24)
where So is solution of (3.20) and A(t) is the n • n diagonal m a t r i x
diag(1,al(t),al(t)a2(t), . . . ,al(t) . . .a~_l(t)). T h e convergence p r o p e r t y of the estimation error for observer (3.24) can be s t a t e d as follows. 4.3 Assume that H1) and H2) hold, and consider an arbitrary constant ~ > O, then 30o > 0;V0 _> 00; 3Ae > 0 ; 3 # e > Os.t.
Theorem
Ilk(t) - z(t)l I <_ ),0e-"~
- 40)11
.for every admissible control u with upper bound IluLl~ _< P r o o f See [8]
3.4
The Fundamental Problem of Residual Generation for Nonlinear Systems
We consider the following class of nonlinear systems: (ENL)
~
=
I(X) + E g i ( x ) u i + el(x)vl + e2(x)v2
(3.25)
i=1 y
=
h(x)
where x(t) 9 K/n u(t) = ( u l ( t ) , . . . ,urn(t)) C U, an o p e n set of K/m, vi(t) 9 K/,i = 1,2, y(t) 9 K/P. f, gi,i = 1 , . . . ,m, e j , j = 1,2 a n d h are of class C ~ . u(t) a n d y(t) are the s y s t e m i n p u t a n d o u t p u t signals. T h e y are a s s u m e d to be known, while the functions vi(t),i = 1,2, are a r b i t r a r y u n k n o w n functions of time. W h e n the first (second) failure m o d e occurs, v, (t) (v2(t)) becomes non zero while v2(t)(vl (t)), remains equal to zero. Sensor failures can also possibly be modelled by v~(t) if the sensor d y n a m i c s is included in the m o d e l ~NL [6]. We restrict our developments to the case where vl a n d v2 are scalar signals for the sake of simplicity. In order to define a residual generator for the s y s t e m ~NL, we consider the class of s m o o t h systems :
{ ~ r
= =
fr(z,u,y) hr(z,u,y)
(3.26)
with inputs u a n d y, state z 9 K / ~ and K/q-valuated o u t p u t r(t); f~ a n d h~ are of class C ~ .
432
4. Application of Nonlinear Observers to Fault Detection and Isolation
Now, let L~176 ( or simply L~176 T])) denote the space of bounded measurable functions, and I] I]~ denote its classical norm. In the sequel, for the sake of simplicity, the notation L~([0, T]) is used whatever the dimension, N, of the considered signal may be. Set B~(v0, e) = {v E L~176 T]); I I v - v011oo < e} the ball of L~([0, T]). Using the fact t h a t f, gi, ej, fr, and hr are of class C~ w.r.t, their arguments, and noticing t h a t r(t) = r(x(0), z(0), u, vl, v2, t), we deduce the following statement. For every fixed x(0) C /R ~, z(0) C //~n~, u, Vl, v 2 in L~([0, w]), ~- > 0; there exist T E]0, 7], e > 0 such that : r : B ~ ( v l , e) --~ L~([0, T ] ) : +1 ~ r(x(O), z(0), ~t, Vl, v2, 0) is Frechet differentiable at Vl. We let (DFr)(Vl) denote the Frechet derivative of r at Vl. In a similar way, y(-) = y(x(0), u, vl, v2, .) is Frechet differentiable w.r.t. vl, and we let ( D f y ) ( v l ) denote its Frechet derivative at Vl. D e f i n i t i o n 4.1 (3.26) is a residual generator.for the detection and isolation of .failure vl in system (3.25) if there exists Lt C Llo~176the space of locally bounded measurable .functions such that : 1) Vu 9 U;Vx(0) 9 ~ n ; V z ( 0 ) e ~ ;Vv2 9 Lzo C : r
(DO
r(x(O),z(O),u,O, v2)(t) ---* 0 2) 3u 9 U; ~x(0) 9 t~n; ~z(0) 9 that ( D F r ) ( v l ) # 0
as
t ---* +oo
3T > 0;3vl,ve 9 L~([0,TI) such
The determination of a residual signal which fulfils 1) and 2) above is called tile fundamental problem of residual generation ( F P R G ) for system (3.25). Condition 1 in definition 4.1 is equivalent to 1) and 2) in F P R G 1 , for the linear case. R e m a r k 4.1 In the linear case, if we denote (C, A, B) a realization of the map vl ~ r, the above Frechet derivative is given by : (DFr)(vl) : ~ ~ C
/o
e('-~)AB~(s)ds
It is independent o f t , x(O), z(O), v~ and v2. Clearly, in this case, condition 2 of definition ~. 1 is equivalent to the existence of a left inverse for the transfer .function between vl and r. The latter requirement also appears in the statement o.f the F P R G for linear systems when only scalar failure modes are considered, as is the case here (see Section 2.2). Before giving sufficient conditions for the existence of a residual generator for (3.25), we introduce a few known notions [17]. O(h) denotes the observation space of system (3.25), with vl = v2 = O. dO(h) defines a codistribution on ~ denoted by the same letters.
4. Application of Nonlinear Observers to Fault Detection and Isolation
433
K e r d O ( h ) is the distribution spanned by all vector fields X such t h a t Lx(9-) = 0 for every ~- 9 O(h). It is invariant under f, g l , . . . ,gin (i.e. for each Y 9 K e r d O ( h ) and each X 9 {f, g i , . . . ,gm}, the Lie bracket [X,Y] belongs to Kerd(D(h)). Moreover, it is obviously involutive (i.e. VX, Y 9 K e r d O ( h ) , [X, Y] 9 K e r d O ( h ) ) . Finally, notice t h a t K e r d O ( h ) can be seen as the largest distribution invariant under f, g l , . . . , g m and contained in Kerdh. Now assume t h a t dim dO(h)(x) = dim dO(h)(xo) = k < n for every x in some neighbourhood of x0. T h e n it is well known that ENL, restricted to some neighbourhood Vx0 of x0, can be transformed by a change of coordinates into the following form:
~'1
=
~'2
=
m ]1(~1) ~_ Egli(~ )Ui ~_ ~li(~)Vi Jr- ~i2(~)V2 i=i m f2(~) ~- E g 2 i ( ~ ) U i ~- e21(~)V1 "~- e22(~)v2 i=i -
i
y = ~(~1) where ~1 E ~ k , ~ 2 E ~ n - k , ~
U =
= (~a T
(3.27)
(3.28) (3.29)
~2T)T, ~1 = ( ~ a , . . . , ~ k ) T and
{n)T,
( d { 1 , . . . , d{k) spans dO(h) and (O/O~k+l,.. , O/O~n) spans K e r d O ( h ) . In the sequel, the same type of diffeomorphism will be used on a different system. For the sake of simplicity, we shall only consider the case where the diffeomorphism x --~ ~(x) is a global one (When this situation does not hold, only initial conditions and inputs for which the associated trajectories lie into Vxo should be considered). Under this assumption we s t a t e the following proposition. P r o p o s i t i o n 4.1 The F P R G .for ENL has a solution i.f there exists an output map qJ = ~ o h, with : IR p -~ ~'~p' (p' < p) such that :
1) e2 9 KerdO(q2)
(3.30)
2) There exists a set U C LloC~c such that : 9 2.i)The system (3.27) with output map ~({1) = qy(x) admits an asymptotic observer for vl = v2 = 0 and u 9 U, of the classical .form : 2 ~1
:
^I U f l ( ~ l ) ~_ Egli(~ ) i ~-P(~I, G)((~(~I) -- ~(~1)) i~l
=
K(u, C, 1) (3.a)
434
4. Application of Nonlinear Observers to Fault Detection and Isolation w h e r e ~1 ~ j ~ k , ~ = dim d O ( ~ ) , p and K are smooth .functions
9 2-ii) 3u E U; ~x(0) ~ 1R'~; ST > 0; Sv~, v2 ~ L~176 T]) s.t. DFC~(~I)(vl) ~ O. 4.2 : In the case where U is such that U][0,T] = L~([0, T],L/) for some T > O, el ~ kerdO(ff2) implies condition 2-ii) (see proposition 4.1~
Remark
[17]).
P r o o f . Under hypotheses 1) and 2), there exists a residual generator of the form (3.31), with output r = ~(~1) _ ~(~1) t h a t solves the F P R G . Indeed, we show t h a t conditions 1), 2) of definition 1 are satisfied for the output r. First notice t h a t (3.30) implies ~12(~) = 0 in (3.27). Hence, by assumption 2, r(t) --~ 0 as t --* cx~ for every v2 E Llo ~ and vl = 0, and thus conditiion 1 of definition 4.1 holds. It remains to show condition 2) of the same definition. Assume t h a t Vu C L/,Vvl E Llo~ C Llo~ C ~ n , v ~ l E J~k,DFr(~(O),~l(O),u,., V2)(Vl) = 0. This means t h a t r does not depend on Vl. Hence the controlled dynamical system (3.31) does not depend of vl, and neither does t~(~l). Thus t~(~l) does not depend of vl or, equivalently, D F ~ ( ~ I ) ( v l ) = 0. This is in contradiction with 2. ii). 9
3.5
A p p l i c a t i o n o f N o n l i n e a r O b s e r v e r s to the F P R G
We now show how the observer (3.19) can be applied for fault detection and isolation. Consider again system (3.25), and suppose t h a t there exists a C ~ function ~ : ~ P --*/R satisfying the following assumptions: A1) There exists an integer k > 1 such t h a t the Jacobian of [~ o h , . . . , L ~ - l ( ~ o h)] T is of rank k at each x C V, where Y denotes some open set o f / R n. A2) dL~(~ o h) A d L ~ - l ( p o h) A ' .. A d(p o h) = 0
dLa~LJf(p o h) A dLJf(~ o h) A . - . A d(~ o h) = 0 for i = 1 , - - . , m , j =0,..-,k-l, where A means the exterior product of differential forms, and L~ (~ o h) = ~ o h . 4.2 Under the assumptions A1) and A2), consider system (3.25), with output ~ = (~ o h)(x). There exists an infinite choice of local systems of coordinates (~1,"" , ~n) in which system (3.25) takes the form (3.27),(3.28), (3.29) where
Proposition
(3.32)
4. Application of Nonlinear Observers to Fault Detection and Isolation
~/1i(~1) = [ g1,(r -1 1
with
-j
gli(~
1
.'
~1~,(~ 1) ]:"
435
(3.33)
-j
)---- ~1,(~1, "" 9 ' ~)
and h(~1)~-~1
(3.34)
P r o o f . Using assumption A1), we can construct a diffeomorphism (I) = [ ~ 1 , " " ,O,~]T from an open subset W C V such that r = L~(T o h) for j = 0 , . - - , k 1. Now set ( ~ l , " ' , ~ n ) = (Ol(X),--. ,On(X)), ~1 = (~1," ~k) T and ~2 = (~k+l," 9 9 , ~)r. Taking the derivative of ~j along trajectories of system (3.25), for j = 1,... , k - 1, yields :
~n ~j(t) = ~j+l(t) + Eui(t)Lg~(Oj)(x(t)) + viLe, (Oj)(x(t)) + v2L~.~(Oj)(x(t)) i=1 (3.351 Assumption A2) implies that dLg,(~j) A d~j A . - - A d~l = 0, which means that Lg~(~j) = g{i(~l,"" ,~j) for i = 1,.-. ,m. Hence, (3.35) becomes: m
~ j ( t ) : ~j+l(t) + E u i ( t ) O ~ i ( ~ l , ' '
" ~j) + Vle~l(~) + v2~J2(~)
(3.36)
i=1 for j = 1,-.. , k - 1 . For the k th component, ~k, we obtain :
~k ---- Lf(42k)(x(t)) m
+
Eu~(t)La,(q)k)(x(t)) + viLe, ((I)k)(x(t)) + v2Le2 ((I)k)(x(t)) i----1
Using again assumption A2), we get : { dLf(~k) Ad~k A - . . A d~l
dLa, (~k) A d~k A... A d~l
-=
0 0
Hence,
{ Lf(~k) L.,(r
~--- ~(~1,''" ,~k) = ?~k(~ 1)
= ~d~l,"" ,r
=~(r
(3.37)
436
4. Application of Nonlinear Observers to Fault Detection and Isolation
O t h e r choices of coordinates which bring s y s t e m (3.25) into the f o r m (3.27), (3.28), (3.29) with the particular s t r u c t u r e (3.32),(3.33), (3.34) can be obtained as follows : ~[ = ~1, ~ = ~i -~- #i(~1,-. 9 , ~i--1) for i = 2 , . . . , k, where #i : f~ i-1 ---+ /R are a r b i t r a r y C ~ functions. 9 Now consider the reduced controlled s y s t e m :
m y1(~1) _[_ E g l i ( ~ i=1 _
~'1
:
1
(3.38)
)Ui
= 5(1)
where f l , ~01i, a n d / t are given respectively by (3.32), (3.33), a n d (3.34). Let W1 denote the open set :
{~ ~ ~k;3~2 c ~n-ks.t.
~2
~(W)}
W C V, where W and V are given above. W i t h o u t loss of generality, we a s s u m e t h a t we are only concerned with a set of initial states a n d a class U of b o u n d e d admissible controls u such t h a t rlulloo _< M (M is a given constant) for which the trajectories of (3.38) lie into a d o m a i n W~ C W1, and such t h a t ~Tj, g l i , j =- 1,. 9 9 , k, i --- 1, 9 9 9 , rn can be e x t e n d e d to global Lipschitz functions ~]j,~]li,j : 1 , ' ' ' , k , i = 1 , . . . , m on Z@ (i.e. ~ j l w ; = ~ T j , g l i l w ( = g l i a n d ~,t)li are global Lipschitz functions on F~k). Under this a s s u m p t i o n , an observer of the f o r m (3.22) can be c o n s t r u c t e d in order to e s t i m a t e exponentially the c o n c e r n e d u n k n o w n trajectories of (3.38). More precisely, this observer can be w r i t t e n as follows (see Section 2):
;,1
rn ~ ^1 "t/, = / 1 ( ~ 1) -1- ~--~oOli(~ ) i -- S o l C T C ( ~ i=1
where f1(~1) = Adl +
~k--1(~1,...
,~k--1)
1)
)
1 - E 1)
(3.39)
, with A, C as in (3.18), a n d
S o given by (3.20).
In order to design a residual generator, the following corollary to p r o p o sition 4.1 can be used. C o r o l l a r y 4.1 C o n s i d e r s y s t e m ( 3 . 2 5 ) , a n d s u p p o s e t h e r e e x i s t s a Coo . f u n c t i o n ~ : t ~ p --* ~ s a t i s f y i n g a s s u m p t i o n s A 1 ) a n d A 2 ) . L e t U be as above, a n d s u c h t h a t U ][0,T]= Loo([0, T ] , b / ) . f o r s o m e T > O. M o r e o v e r , assume that :
4. Application of Nonlinear Observers to Fault Detection and Isolation
a) L~2L}(q2 ) : 0 for i = 0 , . . . , k b) 3i E { 0 , . . . , k - l }
437
1.
s.t. L ~ I L } ( 9 ) # 0 .
where ~ : ~a o h. Then system (3.39) with output r(t) = ~l(t) - 9(t) (~(t) : 9 ( x ( t ) ) ) is a residual generator which detects and isolates vl. P r o o f . It suffices to check conditions 1, 2-i) and 2-ii) of proposition 4.1. 2-i) is satisfied since (3.39) is an observer which converges for every u C U. Assumption a) is nothing but condition 1). Finally, assumption b) implies el ~ kerdO(q2), and by remark 4.2, U satisfies condition 2-ii). 9
4 Hydraulic System The considered system consists of a spool valve and a single rod piston acting on an inertial load (see figure 1). The external force Fe controls the flow entering the head side chamber of the piston from a pressure supply Pa. The rod side chamber is always connected to the r e t u r n pressure Pr.
Pr
FIGURE 1. Hydraulic system. Our aim is to detect and isolate two faults in this system : a drop of the spool control force F~, and an increase of the internal leakage of the piston (which is normally assumed to be negligible).
4.1
M o d e l l i n g o f the S y s t e m
The following notations will be used:x 1,displacement of the spool, x2,velocity of the spool, xa, displacement of the piston, xa, velocity of the piston, x5, pressure in the head side chamber, Vl, failure mode corresponding to the
438
4. Application of Nonlinear Observers to Fault Detection and Isolation
control force, v2, failure mode corresponding to the internal leakage of the piston, Ap, area of the piston, D, diameter of the spool, B, bulk modulus, Cd, discharge coefficient, p, density of the fluid, K s and Rs, respectively spring and damping coefficients associated to the spool, K p and Rp, respectively spring and damping coefficients associated to the load, M s a n d MR, respectively mass of the spool and mass of the piston together with the load. The model of the process can now be derived : 21 = x2
(4.40)
22 = - (Ksxl + Rsx2) ~Ms + (F~ - FF -- vl) ~Ms
(4.41)
23 = x4 2 4 ~-~ ( - K p x
3 - Rpx4 +
(4.42)
Apx5)/Mp
B 12 :i:5 - Apx3CdIIDxl (Pa - x5) - BX4x3
(4.43)
x5x4Bv2 x3 Ap x3
(4.44)
where FF = 2CdIIDxl ( P a - x5) represents the resultant flow force acting on the spool p. We assume that the available measurements are y =
Ix1 x3 IT.
[y,
All the state variables x i , i = 1 , . . . , 5 take values in closed intervals [ai, bi], i = 1 , . . . , 5. The position measurements are calibrated so t h a t the lower bound of the interval is positive, and thus the division by xa in (4.44) does not cause any problem.
3.2
Design o f a Residual G e n e r a t o r
First a residual generator to detect and isolate v2 is obtained by noting t h a t proposition 4.2 can be used with (~ o h) (x) = x3 = Y2, if we consider xl = Yl as a known input in (4.44). Indeed, it is easily checked t h a t conditions A1) and A2) are fulfilled with k = 3. Besides, the change of coordinates :
[ ~1 ~2 ~3 ~4 ~5 ]T
[ X3 X4 n p x 5 / M p
Xl
X2 ] T
transforms equations (4.42), (4.43) and (4.44) i n t o :
~2
~3 -- Kp~I/Mp - RP~2/MP
q-~3) ff-'~ppt~dIID ~Yl~'IV~ . .
~1 t" Mp
-
M~
-
.
MP~I
(4.45)
4. Application of Nonlinear Observers to Fault Detection and Isolation
439
which is in the form (3.27), with ?1 (~1) and gll (~1) respectively given by (3.32) and (3.33). Besides (~1) = ~1 = (~ o h) (x) = Y2
(4.46)
T h e functions ~72(~) = - K p ~ I / M p - R p ~ 2 / M p , ~ 3 ( ~ ) = ~-IA~B~ Mp +~3), and / /2 B dHD ~1 V - ~ _~Pa -- M__~ Ap ~3/ have bounded derivatives for any ~13(~) -~pt~ in
W~ = {~8.t.~l 9 [a3,b3],~2 9 [a4,b4],~3 e [Mpah, - ~Pb 5 -- e]}. T h e arbitrarily small number e appearing in the upper b o u n d for ~3 is introduced to take care of the fact t h a t ~3 = ~MP55 zeros the argument of the square root in g13(~) (indeed b5 = P~). This phenomenon cannot occur on the actual system, as there will always be a small leak around the piston which will prevent ~3 from reaching this extreme value. It is now straighforward to extend ~2(~), ~/3(~) and g13(~) to global Lipschitz functions on ~ 3 . Conditions a) and b) of corollary 4.1 ( in which Vl and v2 are interchanged as our aim is to detect v2 here ) are fulfilled for any input signal t h a t keeps the state trajectory of (4.45) into W~. Hence a residual generator of the form (3.39) with output rl = ~'1 - Y2 can be designed on the basis of model (4.45). This was verified by simulating the process described by equations (4.40) to (4.44) together with the residual generator. The numerical values were taken as described in Appendix A. vl (t) was chosen as a step-like failure signal between 10 s and 20 s, while v2(t) was non zero between 30 s and 40 s. Figure 2 shows that the resulting residual is indeed unaffected by vl and t h a t it allows to detect and isolate v2. We now illustrate the performance of an observer of the form (3.24) by considering the system m a d e of equations (4.40),(4.41),(4.43), (4.44), with output Yl = Xl. It turns out t h a t the output reconstruction error for this observer, r2, is a signal which is sensitive to vl and v2. Hence the combined monitoring of rl and r2 allows one to detect and isolate b o t h failure modes provided they do not occur simultaneously. The equations mentioned in the previous p a r a g r a p h can be put in the form of model (3.23) thanks to the following obvious change of coordinates: [ Z 1 Z2 Z3 Z4 Z5 I T = [ X l
X2 X5 x4(1 + Xh/B)
X3 ]T
which is indeed a diffeomorphism for any x in W = {x 9 /R 5 : x~ 9 [ai,b~],i = 1 , . . . ,5}. W h e n vl = v2 = 0, A(t) and ~ ( t , F~, z) can be written:
A=
0
1
0
0
0 o 0
0 o 0
pMs Y ~ J o 0
- y - -B~
0
0
440
4. Application of Nonlinear Observers to Fault Detection and Isolation o
,~
o
o
mq
N
o
I
/
E/J
of
';
a
7
;
g3~ 9~-~
N1
d-
.2H
o
c3 I
I0
20
T
30
40
5O
F I G U R E 2. Residual rl and failure modes vl and v2.
a n d ~ ( t , F~, z) =
0 -- 2CdIIDpMsYlP,a -
(-Kpy2+Apz3)(l+
(/(SYl
+ Rsz2)/Ms
@ Fe/Ms
z_~A1..2_ _ R r z ~ + z 4 f _ B_B~_ , BJMp Mp ~+z.3/B~ y ~ ~ + f ~ ( y l , y 2 , z3))
with f ~ (yl, Y2, z3) = A---~.CdHDyl J ~ (Pa - z3)) Py2 V ~ Notice t h a t Yl (t) and y2(t) belong to closed intervals with a positive lower bound, and the c o m p o n e n t s of 9 have b o u n d e d derivatives with respect to z on the d o m a i n W ' = { z E 1~ 5 : zl E [ a l , b l ] , z 2 C [a2,b2],z3 C [as, b5 - e],z4 9 [a4(1 + a s / B ) , b 4 ( 1 + b 5 / B ) ] , z 5 9 [a3, b3]}, where e is an arbitrarily small real introduced for the s a m e reason as above. Hence h y p o t h e s e s H1) a n d H2) can be fulfilled by e x t e n d i n g ~ ( t , Fe, z) to a global Lipschitz function w.r.t, z and t. An observer of the form (3.24) can thus be designed, a n d the second residual is defined by r2(t) -- 21 - y l . Figure 3 shows the evolution of r2(t) for faults Vl (t) a n d v2 (t) with relatively realistic values. T h e t i m i n g of the faults, and the numerical values used for the s i m u l a t i o n are the s a m e as for figure 2. T h e influence of v2 on r2 does not a p p e a r on this plot, because it is low in c o m p a r i s o n with the effect of vl.
4. Application of Nonlinear Observers to Fault Detection and Isolation o u~Q
441
o
,J
N u-~ u3
f
aJ
~ct
,L__
o tO
30
20
40
S0
T
FIGURE 3. Residual r2 and failure modes vl and v2.
5
Conclusions
T h e fundamental problem of residual generation for linear systems has been reviewed, and the principle behind its solution has been described. Next, a formulation of the the same problem for nonlinear systems which are affine in the control signals and the failure modes was derived. Sufficient conditions for the existence of a solution have been presented. T h e possibility to use high gain nonlinear observers in the design of a residual generator has been investigated. The theoretical results have been illustrated by simulations on a hydraulic process. T h e considered observers are based on a single measurement. Extension of the results to the multi-output situation is a topic for further research.
6
REFERENCES
[1] P. Alexandre and M. Kinnaert. Numerically reliable algorithm for the synthesis of linear fault detection and isolation filters, based on the geometric approach. Proc. of the 1993 I E E E Conference on Systems, Man and Cybernetics, vol.5, pp 359-364, 1993. [2] R. V. Beard. Failure accommodation in linear systems t h r o u g h selfreorganization. Ph.D. Dissertation. Dep. Aeronautics and Astronautics. Mass. Inst. Technol., Cambridge, MA, 1971. [3] J. P. Gauthier and G. Bornard. Observability for any u(t) of a class of bilinear systems. I E E E Trans. Automatic Control. 26, pp 922-926,
442
4. Application of Nonlinear Observers to Fault Detection and Isolation 1981.
[4] P. M. Frank. Fault diagnosis in dynamic systems using analytical and knowledge based redundancy. A survey and some new results. Automatica, 26(3), pp 459-474, 1990. [5] J. P. Gauthier, H. Hammouri and S. Othman. A simple observer for nonlinear systems, application to bioreactors. IEEE Trans. Automatic Control, 37 (6), pp 875-880, 1992. [6] J. P. Gauthier and I. Kupka. Observability and observers for nonlinear systems. SIAM J. Control and Optimization, 32(4), pp 975-994, 1994. [7] H. Hammouri, M. Kinnaert and E.H. E1 Yaagoubi. Fault detection and isolation for state affine systems. European Journal of Control, 4, pp 2-16, 1998. [8] H. Hammouri, M. Kinnaert and E.H. E1 Yaagoubi. Observer based approach to fault detection and isolation for nonlinear systems. IEEE Trans. Automatic Control, 1998. [9] R. Hermann and A.J. Krener. Nonlinear controllability and observability. IEEE Trans. on Automatic Control, 22(5), pp 728-740, 1977. [10] M. Hou and P.C. Muller. Fault detection and isolation observers. Int. J. Control, vol.60 (5), pp 827-846, 1994. [11] H. L. Jones. Failure detection in linear systems. Ph.D. Dissertation. Dep. Aeronautics and Astronautics, Mass. Inst. Technol., Cambridge, MA, 1973. [12] T. Kailath. Linear Systems. Prentice-Hall, Englewood Cliffs, N.J, 1980. [13] M. Kinnaert. Innovation generation for bilinear systems : application to robust fault detection. Proc. of the 1998 American Control Conference, Philadelphia, pp 1595-1599, 1998. [14] M. Massoumnia. Geometric approach to the synthesis of failure detection filters. IEEE Trans. Automatic Control, AC-31, pp 839-846, 1986. [15] M. Massoumnia, G.C. Verghese and A.S. Willsky. Failure detection and identification. IEEE Trans. Automatic Control, AC-34, pp 316-321, 1989. [16] R. Nikoukhah. Innovation generation in the presence of unknown inputs:application to robust failure detection. Automatica, 30(12), pp 1851-1867, 1994.
4. Application of Nonlinear Observers to Fault Detection and Isolation
443
[17] H. Nijmeijer and A.J. van der Schaft. Nonlinear Dynamical Control Systems. Springer-Verlag, 1990. [18] R. J. Patton and J. Chen. Robust fault detection using eigenstructure assignment: a tutorial consideration and some new results. Proc. of the 30-th IEEE Conference on Decision and Control, pp 2242-2246, 1991. [19] R. Seliger and P.M. Frank. Robust component fault detection and isolation in nonlinear dynamic system using nonlinear unknown input observers. Proc. of the IFAC/IMACS Symposium SAFEPROCESS, pp 313-318, 1991. [20] P. M. Van Dooren. The generalized eigenstructure problem in linear system theory. IEEE Trans. Automat. Control, vol. AC-26, pp 111-129, 1981. [21] E. White and J.L. Speyer. Detection filter design: spectral theory and algorithms. IEEE Trans. Automatic Control, AC-32, 7, pp 593-603. [22] J. Wunnenberg. Observer based fault detection in dynamic systems. VDI-Fortschrittsberichte. Reihe 8, Nr. 222, 1990. [23] D. Yu and D.N. Shields. A bilinear fault detection observer. Automatica, 32(11), pp 1597-1602, 1996.
Appendix A: Numerical Values used for the Simulation of the Hydraulic System Ms = 0.1 kg, R s = 2.10 Ns/m, Ks -= 103 N/m, D p = 0.2 m (Diameter of the piston), D -- 0.01 m, p -- 840 k g / m 3, B -- 109 N / m 2, Cd = 0.7 k g / m 3, Pa = 220 105N/m 2, M p --- 5 103kg, Rp = 104 Ns/m, K p = 5 105
N/m
Innovation Generation for Bilinear S y s t e m s with U n k n o w n Inputs M. Kinnaert and L. E1 Bahir D e p a r t m e n t of Control Engineering Universit~ Libre de Bruxelles Brussels, Belgium
1
Introduction
Fault detection systems are typically made of two parts : a residual generator and a decision system. The first module generates sequences, called residuals, from the sampled input and output signals of the supervised process. These sequences have nominally zero mean in the absence of fault (after a possible transient has vanished), and their m e a n becomes distinguishably different from zero upon occurrence of a fault. T h e decision system evaluates the residuals in order to determine whether their m e a n differs significantly from zero. This task can be efficiently performed by a p p r o p r i a t e statistical tests [2]. However, such tests are often based on s t a n d a r d hypotheses, such as whiteness of the evaluated sequence. To be able to know the properties of the distribution of the residual, the problem of residual generation has to be stated in a stochastic setting. For linear models with additive faults, the innovation sequence (associated with a K a l m a n filter) has been used as a residual. Indeed, it is a sufficient statistics, which means t h a t the information a b o u t the faults contained in the measurements is retained in the innovation. Moreover, it is zero-mean in the absence of fault, and it is white with known covariance. Hence, it can be evaluated, for instance, by cumulative sum (CUSUM) or generalized likelihood ratio (GLR) tests, depending on whether the fault magnitude is known or not [2]. Yet such a signal can only be obtained from a standard linear model without unknown inputs. T h e latter often arise due to the presence of unmeasured signals or unknown p a r a m e t e r s which cannot be modelled as r a n d o m processes with known statistics. To handle this problem, Nikhoukhah [16] has developed, for linear time invariant (LTI) systems, a m e t h o d to design a filter which, from the observed quantities (measured plant inputs and outputs), generates a signal with the following properties. It is zero m e a n and white in the absence of fault; it is decoupled
446
5. Innovation Generation for Bilinear Systems with Unknown Inputs
from the unknown inputs, and it preserves, as much as possible, information on the faults. The resulting residual is called innovation because, in the absence of unknown inputs, it coincides with the innovation of a Kalman filter designed for the plant model. Our aim in this chapter is to extend the known results for LTI systems to the class of bilinear systems. There are several reasons for the choice of this particular class of systems. First, when modelling systems using physical laws, one often obtains bilinear models. This is for instance the case for processes involving heat transfer for which energy balance equations yield products of temperature (state) and flow (input). Secondly, bilinear models can give a significantly better approximation of the behaviour of a nonlinear system over its working range, than what can be achieved using a linearized model around the nominal set point. This will be illustrated by an example below. Finally, observer theory, which is a fundamental ingredient for the design of residual generators, is well developed for bilinear systems [11], [5] [4]. The latter can actually be seen as a particular class of linear timevarying systems. Observer-based residual generators for bilinear systems have been developed and studied in a deterministic setting in [20], [21] and [15]. In those papers the state equation describing the residual generator is linear and time-invariant up to an output injection. This choice of structure restricts the set of systems for which the problem admits a solution [13]. A more general structure is considered in [19]. However the authors do not deal with robustness issues in that paper, since there are no unknown inputs beside the faults in the problem they consider. Moreover, they obtain an estimate of the fault signals, which is a nice feature, but requires stronger hypotheses on the class of monitored systems than what is needed in our work. Our formulation of the problem of innovation generation for bilinear systems is directly inspired by Nikoukhah's work [16]. However, the proposed solution does not resort to transfer function manipulation, like in [16], since this approach cannot be extended to time-varying systems. First a uniformly completely observable subsystem without unknown inputs is extracted from the original system. Next a Kalman filter is designed for the bilinear system resulting from the first step. The work reported here is built on previous results obtained in [9] and [14]. The first paper describes a geometric approach to the synthesis of residual generators for deterministic bilinear systems. No constructive design procedure is obtained in that work. In the second paper, an algebraic approach is considered to solve the same problem. The algorithm derived in that paper is, however, limited to a smaller class of bilinear systems t h a n the results presented here, if we translate them for deterministic systems. The chapter is organized as follows. The problem of innovation generator design for bilinear systems is stated in Section 2. A solution is presented in section 3. Next the issue of innovation monitoring by G L R test is addressed
5. Innovation Generation for Bilinear Systems with Unknown Inputs
447
in section 4. Finally the theory is applied to design and validate a fault detection and isolation system for a three-tank process.
2
P r o b l e m Statement
T h e following class of discrete-time bilinear systems is considered :
x(k + 1) = Aox(k) + E i % l u i ( k ) A i x ( k ) + ~p"_l(Edx(k) + Fd)d~(k)
-~Bu(]~) Jr- Ei=lnf ( E [ x ( k ) + F / ) fi(k) + G w ( k ) y(k) = C x ( k ) + Du(k) + g w ( k ) (2.1) where x(k) e R '~, u(k) e R m, y(k) e R p, d(k) 9 R '~, f ( k ) 9 R "~s, w(k) 9 R n'" are respectively the state, the known inputs, the measured outputs, the unknown inputs, the failure modes, and a zero m e a n Gaussian white noise sequence with covariance matrix E ( w ( k )w(T) T) ----In,,, 6k~ (where In,o denotes the n~-by-nw identity matrix and 5k~- = 1 if k = v and it is null otherwise), ui(k) denotes the i th component of u(k), and similar notations are used for the other variables. W i t h f ( k ) set to zero, (2.1) describes the normal (fault free) behaviour of the monitored system. We assume t h a t H has full row rank. Moreover, we let L/ denote the set of admissible input sequences. Consider the following class of systems :
{
~(k + 1) = A(k)5(k) + [~y(k)y(k) + [~u(k)u(k) r(k) = 5s(k) + D~y(k) + D~u(k)
(2.2)
where A(k),/~y(k) a n d / ~ u ( k ) are time-varying matrices of a p p r o p r i a t e dimensions, while C, b y and D~ have constant entries. D e f i n i t i o n 5.1 A system of the .form (2.2) is called an innovation filter
(or an innovation generator) for system (2.1) if there exists a set Ll in Lt such that, in the absence of fault, for all u E ~ , (2.2) i s uniformly asymptotically stable, and the output r is a zero mean white noise sequence which is invariant under u and d, once the transient due to initial conditions has vanished. Notice that, in the linear case, an e x t r a condition is added in the definition in order to ensure t h a t no useful information on f , contained in y, is lost. This notion is difficult to translate in a nonlinear framework. However the innovation filter resulting from our algorithm will be seen to offer certain guarantees on this point.
448
3
5. Innovation Generation for Bilinear Systems with Unknown Inputs
Design Procedure
The algorithm is based on the following two theorems of which the proofs yield a constructive design procedure. T h e o r e m 5.1 There exists an innovation filter of the form (2.2) for system (2.1) if the .following two statements hold true : A
A
1. There exist constant matrices P, Ai, Bi, i = 0, ..., m, L1 and L2, with P, L1 and L2 different from zero, such that : PA~
-
A~P = B i G
p [EdiF d] = 0
i = O, ..., m
(3.3)
i = l,...,nd
(3.4)
L , C - L2P = 0
(3.5)
2. There exists a set l]t in LI such that, .for all u C l~ the following system is unifo~nly completely observable and uniformly completely controllable : { ~(k + 1) = A(k)~(k) + G(k)w(k) q(k) = L2S(k)
(3.6)
where fit(k) = ,4(u(k)) - S ( k ) R - 1 L 2 , ft(u(k)) = rio + ~-~=1 ui(k)fii, R = L 1 H H T L T, S(_k) = ( P G - B ( u ( k ) ) H ) g T n T, B ( u ( k ) ) = Bo + ~-'~im=lui(k)Bi, and G(k) = P G - l~(u(k))H - S ( k ) R - 1 L I H . P r o o f . As there is no need to consider the term in f ( k ) for this proof, we let f ( k ) = 0 in (2.1). Define z(k) = Px(k). Then, from (2.1) and (3.4) m
z(k + 1) = P A o x ( k ) + ~ u i ( k ) P A i x ( k )
+ PBu(k) + PGw(k)
(3.7)
i=1
Now substituting (3.3) for PA~,i = 0 , . . . , m in (3.7), and taking the output equation of model (2.1) into account, (3.7) yields :
z(k + 1) = A ( u ( k ) ) z ( k ) + B ( u ( k ) ) ( y ( k ) - D u ( k ) ) + P B u ( k ) +(PG - B(u(k))H)w(k)
(3.8)
where A(u(k)) = Ao + ~m=l ui(k)Ai and B ( u ( k ) ) = Bo + }-'~-1 ui(k)Bi. (3.8) is a bilinear system up to output injection which is only affected by d via y. In order to build an innovation filter, we shall design a Kalman
5. Innovation Generation for Bilinear Systems with Unknown Inputs
449
filter for estimating z. To this end, we have to determine that part of the measurement y which depends on z, u and w only. This is achieved by defining the signal q as :
q(k) = Lly(k) = L1Vx(k) + L1Du(k) + L 1 g w ( k )
(3.9)
and, from (3.5),
q(k) -- n2z(k) + nlDu(k) + L1Hw(k)
(3.10)
The noise terms are correlated in (3.8) and (3.10). Yet one can easily transform (3.8), (3.10) into a system for which the state excitation noise and the observation noise are not correlated, by adding and subtracting S(k)R-lq(k) in (3.8), with S(k) and R as defined in the theorem statement [8]. This yields :
z(k + 1) = ~(k)z(k) + ~y(k)y(k) + ~(k)u(k) + ~(k)w(k)
(3.11)
where [~y(k) = B(u(k))+ S(k)R-1L1, B~(k) = P B - B ( u ( k ) ) D - S(k)R -1 L1D, and the other notations are as in the theorem statement. The Kalman filter for (3.11), (3.10) is: ~(k + 1) = A(k)~(k) + By(k)y(k) + [l~(k)u(k)
+r(k)(Lly(k) - L2~(k) - L1Du(k))
(3.12)
where the gain is obtained from F(k) = ft(k)II(k)L~(L2H(k)L T + R) -1
(3.13)
with II(k) given by H(k + 1) --- -fi(k)II(k)LT(n2H(k)n T + R)-ln2H(k)ffi(k) T +A(k)H(k)fi(k) T + Q(k) with II(0) = H0
(3.14)
where II0 is a positive definite matrix and Q(k) = G(k)G(k) T. The filter (3.12) is of the form of the first equation in (2.2) with: fi~(k) = f t ( k ) - r ( k ) L 2 , By(k) = By(k)+r(k)L1, and B~(k) = B~,(k)-F(k)LID. By the second condition in the theorem statement, the system (3.12), (3.13), (3.14) is uniformly asymptotically stable for all u C/2 [12]. Now set
r(k) = q(k) - L2~(k) - L1Du(k) = - L 2 ~ ( k ) + Lly(k) - L1Du(k) (3.15) (3.15) is of the form of the second equation in (2.2), and r(.) is a white noise sequence once the transient due to initial conditions has vanished, since it is the innovation of a Kalman filter. 9 The next theorem gives necessary and sufficient conditions for the existence of a solution to the equations (3.3), (3.4), (3.5). The proof of the
450
5. Innovation Generation for Bilinear Systems with Unknown Inputs
result is constructive and directly yields a solution for P, L1, L2, Ai, Bi, i ----
0,''" ,m. For the clarity of the developments, before stating the theorem, we rewrite (3.4) as follows : PK = 0
(3.16)
where K -- [E 1d ...E,~ /~d1 " " F rtdJ" d] d The singular value decomposition (SVD) of several matrices will be used. For a matrix M r (the exponent indicates the iteration number in the recursive method resulting from the proof below), with rank rM~, the different factors of this decomposition are denoted in the following way:
M r = ~M~MVM -VTqr'
17"~T
_-- [U~41 U)~/2]
y~/~2
where E ~ is a r M ~ - b y - r M , diagonal matrix containing the singular values of M r, and U~4 , V~/ are orthogonal matrices. T h e o r e m 5.2 : Equations (3.3)-(3.5) and (3.16) have a solution such that P ~ O, L1 ~ 0 and both matrices have full row rank if and only if there exists an integer o~ > 0 such that K s = 0 or Q(~-I) has full column rank, and none of the matrices K ~ K 1 , . . . , K (a-1), Q0, Q 1 , . . . , Q(a-1) has .full row rank. The matrices K j and QJ, j -- O, ... , ~ are given by the .following recursive formulas : 9 WhenQJ ~0:
n(J+l) i
rTjTAj{ITj j j j z V K 2 , , i k ~ , K 2 -- U K I V ~ I ( E Q )
-1
jT j j U~IC UK2 )
i ~- 0 , . . .
,m
(3.18) C (j+l) = u ~ T c J u ~ 2 K(J+I) = UJKT[AJoUJK1V~2 A{UJK1V~2 . . .AJmUJK1V~21 Q(i+l) = C ( j + I ) U ~ + ' )
(3.19) (3.20) (3.21)
W h e n QJ -- O: A~ j+l) = uJKT A{uJK2
i ----0 , . . . , m
(3.22)
C (j+l) = CJUJK2
(3.23)
K(J+ 1) --- TTJ TF AjTTj J J J J vg2L "'0 ~ K1 AIU~:I 99. AmUJgl]
(3.24)
Q(j-{-1) __--c(J+l)U~I+1)
(3.25)
5. Innovation Generation for Bilinear Systems with Unknown Inputs
451
where A ~ = A~, i = 0 , . . . , m , K ~ = K , C O = C, QO = C O u O 1 = C U K 1 "
Proof. If part The sufficiency part of the proof is constructive and it can be divided into two sections : first the computation of P, Ai, Bi, i = 0 , . . . , rn t h a t fulfil (3.3) and (3.4), next the computation of L1 and L2 that fulfil (3.5) for the matrix P obtained in the first step. C o m p u t a t i o n o f P, Ai, B~, i = 0 , . . . , m Set p0 _ p and B i - Bi. Then (3.3) and (3.16) can be written as follows, given the notations introduced for the initialization of the recursive formulas : A
A
A
P 0 Ai0 - A l P ~ = B^o iC o
i = 0,...,m
P~176= 0
(3.26)
(3.27)
Equation (3.27) is fulfilled if and only if (3.28)
pO = p 1 u O T
for some appropriate matrix p1. (3.28) can only be written when r a n k K ~ < n (and hence K ~ has not full row rank), which holds by hypothe-
sis. Substituting (3.28) into (3.26), and multiplying the resulting expression on the right by U ~ ---[U~ U~ yields:
p l tZK2~itzK1 t r OnOtrO T plu~
~Ot-',OtrO
(3.29)
~ J~'i~ U'K1
A ~ 1 7 62 - A i P1 ~ B ? C ~ 1 7 62
(3.30)
for i = 0 , . . . , ra. Define Q0 as in the theorem statement, and introduce its SVD in (3.29). Three situations are considered successively: 9 Q0 # 0 and Q0 has not full column rank Equation (3.29) yields, after multiplication on the right by [V~I V~21: p1TTOTAOTTO l]'O ~0rr0 x-0 ~'~K2":~i ~K1 vQ1 ~ ~'~it~QlZ~Q
i=0,...
(3.31)
,m
p I uOT2A oUO 1V~ 2 = 0
(3.32)
For a fixed p1, any/~0 that solves (3.31) has the form : ~0 : p1uOTAOUOlV21(~)-Iu~T
+ ~Iu~T
(3.33)
452
5. Innovation Generation for Bilinear Systems with Unknown Inputs for some m a t r i x / ~ of appropriate dimension, and for i = 0 , . . . , m. Indeed, U~2 exists since Q0 has not full row rank by hypothesis. Substituting (3.33) for ~o into (3.30) yields: 0
plrrOTAOfTTO ~K2~i~vg2 =
0
0
-- U ~ I V ~ I ( E Q )
--1
OT
U~lC
~ lrr0T~0rr0 "t-~i ~ Q 2 ~ J ~ K 2
0
0
U~2) - ~ i p 1 i = O,.. . , m,
(3.34)
which can be written as follows using the recursive formulas (3.18), (3.19) : p1Ail
--
~ i p 1 = B~C 1
i = 0,...,m.
(3.35)
Moreover, the set of equations (3.32) can be written : p1K1
= 0
(3.36)
thanks to (3.20). Notice that (3.35), (3.36) have the same form as (3.26), (3.27). If K 1 = 0, one directly finds a solution for (3.35), for instance p1 = I,.~i = A~,~/1 = 0, i -- 0 , . . . ,m. If K 1 ~ 0, one repeats the above procedure, starting from (3.28), with adequate changes of exponents. 9
Qo~_o
(3.29) yields: p1TTOTAOTTO ~ K 2 .r~i t.J K 1
---- 0
(3.37)
Hence defining K 1, A 1 and C 1 respectively as in (3.24), (3.22) and (3.23), and setting/~} = ~ 0 , i = 0 , . . . , m , (3.30) and (3.37) take the form of (3.35) and (3.36). The iterative procedure continues if K 1 ~ 0; otherwise it stops since the solution for p l , . ~ , / ~ , i = 0 , . . . m can be obtained as above . Q0 has full column rank In this case, V~ = V~I and (3.33) is obtained from (3.31) as a solution for/~o. Substituting this expression in (3.30) yields (3.35). T h e difference with the first case (Q0 ~ 0 and Q0 has not full column rank) is that no equation like (3.32) arises. Hence one can directly solve (3.35) (p1 = I, A~ = A ~ , / ~ = 0, i = 0 , . . . , m). The combination of the above three cases defines a recursive procedure which stops either when K ~ = 0 or Q(~-D has full column rank, for some integer c~ > 0. At that stage, one is left with the equations : P"A?
- AiP~
= B~C"
i : O, ..., m
(3.38)
5. Innovation Generation for Bilinear Systems with Unknown Inputs
453
for which P ~ ---- I, Ai = A~ a n d / ~ -- 0, i -- 0 , . . . , m is a solution. For P ~ = I, the m a t r i x P resulting from the recurrence based on (3.28), say P*, is easily seen to be: P* = U ( 2 - 1 ) T . . . u I T UOKT
(3.39)
Moreover, /~i, i ---- 0 , . . . , m can be c o m p u t e d by a p p l y i n g backward, from j -- c~ - 1 to j = 0 the following recursive formulas d e d u c e d from (3.33) :
{ BJi ---- r ( J T 1 ) r r J T z i J r r J l'rj (v'J ~ - l r r J T ~(J+l)rlJT ~j ./~J+li '-'K2~'~i"K1 "Qlk~-"Q] VQ1 -}- --i "-'Q2
QJ whenWhen QJ =7~O0 (3.40)
with B~ = 0, i = 0 , . . . , m ,
a n d by r e m e m b e r i n g t h a t /~o = /~i,i =
0,... ~Trt. C o m p u t a t i o n of L1 a n d L2 It now remains to solve (3.5). A particular solution c o r r e s p o n d i n g to P* is : L~ : U ~ - I ) T
. . . U ~ T U ~ T2
(3.41)
L~ = U ~ - 1 ) T . . . ugoT2cuO2 . . . U(K2-1)
(3.42)
One can check t h a t (3.5) is fulfilled as follows. Let us c o m p u t e L I P * , with P* given by (3.39) : L~P* = U;2 -1)T . ..u~Tcu~ Noticing t h a t rri
rriT = I -- rri
'JK2"~K2
2 ,..U(K~-I)U(~-I)T...U rriT i = 0,
' ~ K I ~ K I ,
. . .
,
~
(3.43)
c~ -- 1, a n d s u b s t i t u t i n g
successivelyU(K2-1)U~2- I)T, U(2-2)U(a2-2)T..., uO2uOT in terms of these expressions in (3.43), one gets :
Lip*
.~_ U ~ - I )T . . . u ~ T c - V ~ 2 - 1 ) r . . . u ~ T C U O l UOT1 - Ec~--: U ~ - - I ) T . . . U ~ T 2 C U 0 2 . . . U ~ 2 U ~ f + l l ) U ~ 2 + l l ) T u ~ f T . . . N
~
(3.44) T h e first t e r m on the right h a n d side of (3.44) is n o t h i n g b u t L ~ C . Hence, to conclude this p a r t of the proof, it suffices to show t h a t all the other terms are equal to zero. Let us consider an a r b i t r a r y t e r m of the s u m (the same reasoning also applies to the second t e r m of the right h a n d side of (3.44)). Using the recursive formula (3.19), one easily deduces :
U ~ - I )T . . . s ~ T c s ~ : U~-I)T...
. . . U~72U(I~Wl l ) U}~Wl l )T u~TT . . . S Or U ~ ; 1 ) T c ( i + I ) U ~ + l l ) U ~ + 1 1)Tu~(T2... GOT
=U~-I)T...U~+I)TQ(i+i)u(~+I)Tu~T...uOT
(3.45)
454
5. Innovation Generation for Bilinear Systems with Unknown Inputs
where the definition of Q(~+I), (3.21), was used to obtain the last expression. "r i Finally notice that, in the right hand side of (3.45), U Q2 (i+l)Trl(~+l) ~ ( ~ ( i + I ) T . . ( i + 1 ) ~-~(/q-1). , ( i + I ) T
,~
.
2 %1 Z~Q v~}1 ----u, a n a hence any t e r m in the s u m is null in 3.44). This also holds for the second t e r m of the right hand side of (3.44) as already mentioned. In the expressions for L~ and L~, one should set U~2 = I when QJ = O, which corresponds to what is done in the definition of C (j+l) (compare (3.23) and (3.19)). It remains to verify t h a t P* and L~ are non-zero. As all the matrices U~T, i = 0 , . . . , ( a - 1) have full row rank, one easily checks t h a t r a n k P * = .rr((x--1)T ran~uR2 . The latter is non-zero as K (~-1) has not full row rank by hypothesis. Hence P* r 0, and P* has full row rank. On the other hand, as Q i , i = 0, .. . , a -- 1 have not full row rank, U~2 iT ,i = 0 , . . . a 1 exist. B y a similar argument as for P* one concludes t h a t L I has full row rank and is different from zero. O n l y if p a r t Assume t h a t equations (3.3)-(3.5) and (3.16) have a solution with P ~ 0 and L1 # 0, b o t h matrices having full row rank. Let the dimensions of P and L1 be denoted e x n and s • respectively. From (3.16), K cannot have full row rank. Hence P = p 0 has the form (3.28) for some m a t r i x p1 t h a t fulfils (3.29), (3.30). Equivalently, p1 must fulfil (3.35) for some matrices / ~ , i = 0 , . . . , m, linked t o / ~ via (3.40) and it must also fulfil (3.36) when Q0 has not full column rank. p1 has dimensions e x (n - r K ) where r g denotes the rank of matrix K . Now, two situations can be distinguished : 1. if Q0 has full column rank or K 1 = 0, then a = 1, and p1, -4i, i = 0 , . . . , m must be a solution of (3.35) for s o m e / ~ , i = 0 , . . . , m. 2. if none of the conditions in 1. hold, K 1 cannot have full row rank by (3.36), and p1 must be of the form p1 = p 2 u I T where p 2 is an • (n - r a n k K 1 - r a n k K ) matrix. P2 must fulfil equations of the form (3.29), (3.30) with all the exponents increased by 1. One can repeat for p2 the same procedure as for p1, and so on. This corresponds to an iterative procedure. One realizes t h a t there must exist a finite integer a, for which either Q(~-I) has full column rank or K s = 0. Indeed, as the number of columns of P J , j = 0, 1 , . . . keeps decreasing when the number of iterations increases, this iterative procedure must stop, otherwise P cannot have full row rank. It remains to show t h a t the existence of a solution implies t h a t Q0, Q1, ... , Q(~-I) cannot have full row rank. To this end, let us consider equation (3.5). Introduce (3.28) into (3.5), and multiply the resulting equation on the right by [U~ U~ This yields: LICU~
- L 2 - p l ---- 0
(3.46)
5.
Innovation
Generation
for Bilinear
Systems
with Unknown
Inputs
455
(3.47)
L 1 C U ~ = LI Q ~ = 0
(3.47) implies t h a t Q0 cannot have full row rank, as L1 ~ 0. Now, substituting p1 for its value in terms of p2 in (3.46), and multiplying the resulting expression on the right by [ U l l U~2], one d e d u c e s : L1CU~
- L2P 2 = 0
(3.48)
-- 0
(3.49)
L1CU~ T h e latter equation can be written : 0
0T
0
1
0
0T
0
1
L,U~21U~21CU~2Uk1 + L , U ~ 2 U ~ 2 C U ~ 2 U k l ----0
(3.50)
By (3.47), L1 is necessarily of the form L1 ~--- ~lVQ2Flrr0T for some non zero • (p - r a n k Q ~ matrix L~. Substituting this value for L1 in (3.50), and taking (3.19) and (3.21) into account, one obtains : L~Q 1 -- O. Hence Q1 cannot have full row rank. Proceeding in the same way as above, one gets L{QJ -- 0, 0 < < ~ - 1, where L ~ ~ L1, L{ -1 ~ rJH(J--1)T and _ j __ ~lVQ2 L~ ~ 0. Hence QJ, 0 < j _< ~ - 1 have not full row rank. 9 T h e matrices P* and L~ resulting from T h e o r e m 2 have full row rank, and moreover P* and L~ have the largest possible rank among the set of solutions to (3.3)-(3.5) . Indeed, from the necessity part of the proof of T h e o r e m 1, one notices that any pair of matrices P and LI t h a t make a solution of (3.3)-(3.5) (together with adequate matrices L2, A~, B~,i --or 1 , . . . , m ) must be of the form P = P ~ P * , L 1 = L1L1 for some matrices P~, L~' of appropriate dimensions. Hence z = P*x and q = L~y have the largest possible dimension, which is a normal requirement for avoiding loss of information on f . The design method resulting from Theorems 1 and 2 can be s u m m a r i z e d as follows: A
*
1. determine a solution to (3.3),(3.4) by applying the recursive formulas (3.18)-(3.21) or (3.22)-(3.25) for j = 0 , . . . , a . If one of the matrices K 0 , K 1 , . . . , K ( ~ - I ) Q 0 , Q 1 , . . . , Q ( ~ - I ) has not full row rank, the procedure stops : the algorithm does not give a solution. Otherwise, compute P* according to (3.39); set Ai = A{, i -- 0 , . . . , m and comp u t e / 3 i , i = 0 , . . . , m from (3.40). 2. Solve (3.5), with P = P*. A solution is given by (3.41), (3.42). 3. I m p l e m e n t the innovation filter (3.12)-(3.14). To be able to use the innovation filter described above for fault detection, one should monitor on-line its output, or ~/function of its o u t p u t , b y adequate statistical tests. This issue is discussed in the next section.
456
4
5. Innovation Generation for Bilinear Systems with Unknown Inputs
Innovation Monitoring
4.1
Introductory Remark
Two situations must be distinguished depending on whether E[ -- 0, i = 1 , . . . , n f or not in (2.1). In the first case, the faults are additive, n a m e l y they only change the m e a n of the innovation. Then, the latter is known to be a sufficient statistic for the faults f , and it can be monitored by the generalized likelihood ratio (GLR) test, for instance [18], [2]. If some of the matrices E[,i = 1 , . . . , n f are non zero, the faults are not additive, and the innovation is not a sufficient statistic for f anymore [2]. Monitoring the innovation could still allow one to detect the faults, but it is not the best solution. One potential approach could be to work with the leastsquares-score associated to the innovation filter [1]. However, some issues still have to be clarified for that method. Therefore, only additive faults will be considered in the remaining p a r t of Section 4. T h e distribution of the residual in the absence and in the presence of faults is first determined, before presenting a review of the G L R test.
4.2
Innovation in the Presence of Additive Faults
To be able to apply the G L R test, step-like faults will be considered, namely f(k) = #l{k_>to}, where # is a constant vector, and l{k_>to} is equal to 1 when k _> to and it is null otherwise. It is straightforward to c o m p u t e the signature of the fault on the innovation (also called the dynamic profile of the fault). Indeed, with f(k) non zero, (3.11) can be written :
z(k + 1) = fi.(k)z(k) + B~(k)y(k) + [~(k)u(k) + P F f #l{k>_to} (4.51) By subtracting (3.12) from (4.51), and by substituting (3.10) for (3.15), one deduces : ez(k + 1) =
(fi(k) - r(k)L2)Ez(k) + PFS~l(k>to} +(G(k) - F(k)LiH)w(k) r(k) = L2ez(k) + L1Hw(k)
q(k) in (4.52)
(4.53)
where ez(k) = z(k) - ~(k). Hence the innovation can be written :
r(k) = ro(k) + p(k, to)#
(4.54)
where to(k) is the innovation for the fault free system, and p(k, to)# is the signature of the fault. T h e latter is null for k < to and it can be obtained
5. Innovation Generation for Bilinear Systems with Unknown Inputs
457
by simulating (4.52),(4.53) with w ~- 0 and ez(t0) = 0 for k _> to. Since the noise w(k) is assumed to have a normal distribution (see model (2.1)), r(k) is also Gaussian. More precisely,
= N(0,
when no fault has occurred
(4.55)
after occurrence of a fault
(4.56)
= •(p(k, to)., r ( k ) )
where N'(~, O) denotes the normal distribution with mean ~ and variance O, E(k) = L2II(k)L T + L1HHTL T, and II(k) is given by (3.14).
4.3
Generalized Likelihood Ratio Test
For the sake of simplicity, # is assumed to be a scalar (see [18] for the nonscalar case). The GLR test is aimed at choosing between two hypotheses: 9 H0 : no fault has occurred : a fault of unknown magnitude, #, has occurred at an unknown time instant, to _< k, where k denotes the present time instant.
9 H1
To explain the idea behind this test, let us first assume that # is known. Classical tests between both hypotheses rely on the log-likelihood ratio of H1 versus H0 for the residual sequence, namely:
s(k)
= in
p•(r(k))
(4.57)
where p , (7"(k)) (P0 (r (k))) is the probability density of r (k) under hypothesis H1 (H0). This quantity has the following fundamental property:
E.(s(k)) > 0
E0(s(k)) < 0
where E~ and E0 denote expectation under the distributions associated to p , (.) and Po (') respectively. Therefore, the cumulative sum of log-likelihood ratios, Sk = ~ i =k l s(i) has a negative drift in the absence of fault, and a positive drift when a fault has occurred. Hence the maximum likelihood estimate of the fault occurrence time, t0, can be computed by maximizing, w.r.t, to, the log-likelihood ratio of H1 versus H0 for the residual samples from time to to k, namely :
k
p,(r(i))
(4.58)
i=to
This yields the cumulative sum (CUSUM) test which amounts to computing the function: gCUSUM(k)---- max S~o
l
458
5. Innovation Generation for Bilinear Systems with Unknown Inputs
and to generating an alarm when gcusuM(k) > e, where e is a user defined threshold. Now, when # is not known, a similar reasoning still holds. Yet the loglikelihood ratio is a function of #, Stko(#). In order to evaluate this function, # is replaced by its maximum likelihood estimate, t~; hence the name generalized likelihood ratio. The maximum likelihood estimates of to and # are obtained by solving the following double maximization problem: (to,Yt)
=
arg{ max supS~ (#)} l~to<_k
p
o
(4.59)
When the probability density functions appearing in (4.57) correspond to Gaussian distributions, it is possible to find an explicit expression for /hk(t0), the maximum likelihood estimate of # at time k assuming the fault occurred at time to. This expression is then used for computing the loglikelihood ratio of H1 versus H0 given r ( 1 ) , . . . ,r(k), and it yields the following G L R test function: 1 gOLR(k) ---- max sup Stko(#) = ~ max
l<_to<_k
it
k (~-~'~i=to p(i, to)TE(i)-lr(i)) 2 k
l<_to
(4.60) As previously, when gGLR(k) exceeds a suitable threshold, hypothesis H1 is considered to be true, and an alarm is generated. The maximization problem in (4.60) is performed over all possible past time instants. This increasing time span yields an increasing search duration for finding the optimum. To avoid that problem, the fault occurrence time is estimated using a window of fixed size, M, in practice. Thus (4.60) becomes : gcLR(k) =
max
k--M<_to<_k
supS~ (#) p o
(4.61)
Upon occurrence of a fault, the G L R test provides an estimate of the fault magnitude and the fault occurrence time. This allows one to keep monitoring the system in its faulty behaviour, to detect whether further changes take place. The procedure used to u p d a t e the innovation generator upon occurrence of a fault is described in [7]. It requires the introduction of an additional tuning parameter, namely a tolerance on the estimated fault magnitude, #tol. The system is considered to be in faulty mode when the GLR test has been triggered and the estimated fault magnitude is greater than #tol.
5. Innovation Generation for Bilinear Systems with Unknown Inputs
459
Design and Validation of a FDI System for a three Tank Process
5 5.1
Process Description
The flowsheet of the three tank process is given in Figure 1. It consists of three tanks, R1, R2 and R3, which serve to supply water from the reservoir R0. The capacity of the latter reservoir is much greater than the capacity of R1, R2 and R3, so that its level remains practically constant during the operation. In the study reported here, tanks R1 and R3 play the role of buffers for supplying R2. The level in tank R1 is controlled by manipulating the speed of pump P1, while the level in R3 is adjusted by manipulating the aperture of valve Vh. P u m p P2 runs at constant speed. Two PI controllers are used to drive the levels in R1 and R3 at their desired reference value.
:::--:::[::U::-:-: ::
I
:-: :-=-.
[] L
_
V4
V5
FIGURE 1. Flowsheet of the three-tank process. The plant can be modelled by a nonlinear state space model of the form:
X(t) = f(Z(t), U(t))
(5.62)
Y(t) = X(t) + N(t)
(5.63)
where X(t) = [ hi(t) h2(t) h3(t) ]T is the state vector consisting of the levels in tanks R1, R2 and a3. U(t) = [ ~ l ( t ) Sh(t) ]T is the command input vector made of the speed of rotation of p u m p P1 and the position of the stem of the continuous valve Vh. Y(t) are the measured outputs, and N(t) is a white noise process. The explicit form of (5.62) can be found in [6], where the model parameters have been identified from experimental data taken on the actual plant. In the same reference, a simulator that reproduces the process behaviour under faulty and fault free situations is
460
5. Innovation Generation for Bilinear Systems with Unknown Inputs
described. Here, only three faults will be considered, namely a leak in tank R1, emulated by opening valve V3, a clog in the branch with pump P2, and a bias on the measurement of the level in R3, h3.
5.2
Design and Validation of the Innovation Generator
A bilinear model of the following form has been identified from d a t a obtained by running the simulator around a set point (w~ Sh, 0 hi, 0 h 0, h 03), without introducing any noise in the simulations:
x(k + 1) = Aox(k) + wl(k)Alx(k) + sh(k)A2x(k) + Bu(k) +fl(k)Elx(k) + Flfl(k) + f2(k)E2x(k) + F2f2(k) y(k) =
+ E3f3(k)
(5.64)
(5.65)
where x(k), y(k),wl(k) and Sh(k) denote deviations from the nominal set point, and u(k) = [ wl(k) Sh(k) ]T. fx(k), f2(k) and f3(k) correspond to the faults mentioned in the previous section, in the same order (leak, clog, bias). The aim is to detect and isolate the faults. To this end, one approach consists in generating three innovation signals, each of them being only sensitive to a single fault. Due to a lack of space, only the design of the innovation filter aimed at detecting fault f3 will be considered here. The reader is referred to [7] for the complete design of the FDI system. To generate an innovation sequence sensitive to the sensor fault f3 only, the two other faults must be considered as unknown inputs (vector d in (2.1)). Thus, equation (3.4) can be written P[ E1 F1 E2 F2 ] = PK12 -- 0. In the present case, K12 has full row rank, and thus this equation has no solution. To alleviate this problem, an approximate decoupling with respect to fx and f2 will be achieved using the method of [17]. The procedure consists in computing a matrix K~"2 which has not full row rank and which is close to K12 in the sense that the following Frobenius norm is minimum: tiE12 - KI*211F. Then, one substitutes KI*2 for K12 in (3.4), and the design is pursued by following the procedure presented at the end of Section 3. The variance of the measurement noise was evaluated from experimental data, and the variance of the state excitation noise was chosen in such a way that the transient of the innovation upon occurrence of a fault is sufficiently fast while keeping a reasonable sensitivity to the fault. A linear time-invariant innovation filter has also been designed for the same application. It is based on a linearized model of the plant (around the same set point that was used for model (5.64), (5.65). The innovation sequences obtained with both types of filters are now compared by inputting simulation data in faulty and fault free conditions in the innovation generators. Constant faults f3, f l and f2 are simulated respectively in the time
5. Innovation Generation for Bilinear Systems with Unknown Inputs
461
intervals [600s, 800s], [1850s, 2050s], and [4600s, 4800@ T h e simulations reported here were performed in open-loop with input signals covering a large working range. The c o m m a n d signal deviations from their nominal value are plotted in Figure 2. Figure 3 represents the corresponding o u t p u t signal deviations. The effect of the three faults is indicated. T h e innovation sequences aimed at detecting fault f3, and obtained from the linear and the bilinear models, are plotted in Figures 4. The following points can be
-
z~oo
~oo
~oo
FIGURE 2. Command signal deviations from their nominal value.
20
hi
10 0
'~
r
-20
-30 0
100
200
300
E, to. 400
,500
FIGURE 3. Output signal deviations from their nominal value.
o.4[
/
..... ~ e a r mo~l
FIGURE 4. Innovation sequences obtained from linear and bilinear models noticed : 9 The innovation filter based on the bilinear model generates a signal with lower magnitude t h a n its counterpart based on the linear model, when the c o m m a n d signal deviations from their nominal value are large, in the absence of faults. 9 The error introduced by the approximate decoupling procedure of [17] is acceptable since each innovation sequence is significantly more
462
5. Innovation Generation for Bilinear Systems with Unknown Inputs affected by the fault to which it must be sensitive, namely f3, t h a n by the other faults.
T h e same conclusions remain valid when the system operates in closed-loop.
5.3
Evaluation of the Innovation Sequence
To tune the G L R test, the following heuristic rules might be helpful : 9 The test window M should be approximately equal to the duration of the transient in the innovation upon occurrence of a fault. 9 The test threshold should be fixed beyond the typical test o u t p u t in the absence of fault in order to avoid most false alarms. Indeed the innovation is not perfectly zero m e a n due to modelling imperfections, and to the effect of K a l m a n filter transients caused by a b r u p t changes in the c o m m a n d signals. The innovation filter for detection and isolation of fa, followed by a G L R test, was applied on actual plant data. T h e obtained results are plotted in figure 5, were the innovation sequence, the test function, the e s t i m a t e d fault magnitude, and the actual and estimated fault occurrence times are successively presented. The fault occurrence and disappearance times are quite well estimated. One notices t h a t the residual m e a n is not exactly zero after the initialization stage, which lasts for 150 seconds. This is due to modelling uncertainties. Yet, reasonable results are obtained although the innovation generator and G L R test were designed from model (5.64), (5.65), which was identified from plant simulator data. T h e robustness of
io
.
. ~
2
OlS [
0,
(b)
I
(c)
i ............................... i ! ~ o
o
~
IOOO
FIGURE 5. (a)Innovation for detection of f3, (b) GLR test function, (c) Estimated fault magnitude, (d) True fault durations for f3 and fl (dashed line) and estimated fault duration for f3 (solid line) the G L R test with respect to modelling uncertainties (including errors in
5. Innovation Generation for Bilinear Systems with Unknown Inputs
463
the state excitation noise variance) is illustrated by the above results. It was already noticed in several other applications, as reported in [3] for instance.
6
Conclusions
A m e t h o d to design an innovation filter for bilinear systems with unknown inputs has been presented. Further work is needed to ensure t h a t no information on the fault is lost in the innovation generation. The issue of sensitivity of the residual to the fault has not been addressed here. Some results on this point were obtained in [10] in a deterministic framework. The innovation monitoring by the G L R test has been reviewed for additive faults. In the case of non-additive faults further work is needed to combine innovation filters with adequate statistical detection methods. Simulation results have shown that, for some nonlinear systems, the use of a bilinear model instead of a linear model, for innovation generation, can make the residual magnitude small over a larger working range.
7
REFERENCES
[1] M. Basseville. On-board component fault detection and isolation using the statistical local approach. Automatica, 3 4 ( 1 1 ) , pp. 1391-1415, 1998. [2] M. Basseville and I.V. Nikiforov. Detection of Abrupt Changes, Theory and Application, Prentice-Hall, Englewood Cliffs, N.J., 1993. [3] M. Basseville and A. Benveniste. Design and comparative study of some sequential jump detection algorithms for digital signals. IEEE Trans. Acoustics, Speech and Signal Processing, A S S P - 3 1 ( 3 ) , pp. 521-535, 1983. [4] G. Besanqon and H. Hammouri. On uniform observation of nonuniformly observable systems. Systems and Control Letters, 2 9 ( 1 ) , pp. 9-19, 1996. [5] G. Bornard, N. Couenne and F. Celle. Regularly persistent observers for bilinear systems, in New Trends in Nonlinear Control Theory, Lecture Notes in Control and Information Science, 122, Springer-Verlag, pp. 130-140, 1988. [6] Dolanc G., D. Juricic, A. Rakar, J. Petrovcic and D. Vrancic. T h r e e - t a n k benchmark test. Report COPLOOTR.for Copernicus Project CT9~-0237, 1997.
464
5. Innovation Generation for Bilinear Systems with Unknown Inputs
[7] L. E1 Bahir and M. Kinnaert. Fault detection and isolation for a three tank system based on a bilinear model of the supervised process. Proceedings of the UKACC International Conference on CONTROL'98, pp. 1486-1491, 1998. [8] G. C. Goodwin and K.S. Sin. Adaptive Filtering Prediction and Control, Prentice-Hall, 1994, page 251. [9J H. Hammouri, M. Kinnaert and E.H. E1 Yaagoubi. Residual generator synthesis for bilinear systems up to output injection. Proceedings of the 33rd IEEE Conference on Decision and Control, Orlando, December 14-16, pp. 1548-1553, 1994. [10] H. Hammouri, M. Kinnaert and E.H. E1 Yaagoubi. Fault detection and isolation for state affine systems. European Journal of Control, 4, pp. 2 16, 1998. [11] S. Hara and K. Furuta. Minimal order state observers for bilinear systems. International Journal of Control, 24(5), pp. 705-718, 1976. [12] A. H. Jazwinski. Stochastic Processes and Filtering Theory, Academic Press, 1970, page 240. [13] M. Kinnaert. Robust fault detection based on observers for bilinear systems. Internal Report. Universit~ Libre de Bruxelles, 1998. [14] M. Kinnaert, Y. Peng and H. Hammouri. The fundamantal problem of residual generation for bilinear systems up to output injection. Proceedings of the 3rd European Control Conference, pp. 3777-3782, 1995. [15] C. Mechmeche, S. Nowakowski and M. Darouach. A failure detection procedure for bilinear systems based on a new formulation of unknown input bilinear observers.Proc, o.f SAFEPROCESS'9~, pp. 64-68, 1994. [16] R. Nikoukhah. Innovation generation in the presence of unknown inputs: application to robust failure detection. Automatica, 30(12), pp. 1851 1867, 1994. [17] R. J. Patton and Jie Chen. Optimal unknown input distribution matrix selection in robust fault diagnosis. Automatica, 29(4), pp. 837-841, 1993. [18] A. S. Willsky and H.L. Jones. A generalized likelihood ratio approach to the detection and estimation of jumps in linear systems. IEEE Trans. Automatic Control, AC-21, pp. 108-112, 1976. [19] H. Yang and M. Saif. State observation, failure detection and isolation (FDI) in bilinear systems. Proc. of the 3~th IEEE Conference on Decision and Control, pp. 2391-2396, 1995.
5. Innovation Generation for Bilinear Systems with Unknown Inputs
465
[20] D. Yu and D.N. Shields. A bilinear fault detection observer. Automatica, 32(11), pp. 1597-1602, 1996. [21] D. Yu, D.N. Shields and S. Daley. A bilinear fault detection observer and its application to a hydraulic drive system. Int. J. Control, 64(6), pp. 1023-1047, 1996.
Synchronization Through Extended Kalman Filtering C~sar C r u z 1 and Henk Nijmeijer 2
1Department of Electronics ~z Telecom., Scientific Research and Advanced Studies Center of Ensenada (CICESE), M~xico 2Faculty of Mathematical Sciences, University of Twente, Enschede and Faculty of Mechanical Engineering, Eindhoven University of Technology, The Netherlands
1
Introduction
Synchronization is a fascinating phenomenon and has been observed in many diverse systems. Synchronization in chaotic systems may bring many interesting possibilities in practical applications. For example, it is believed that synchronization plays a crucial role in information processing in living organisms and could lead to important applications in speech and image processing (Ogorzalek [17]). Another area where synchronization may play an important role, is (secure) communication. Due to the fact that chaotic signals are noise-like and unpredictable in nature, such signals can possibly be used as potential carriers for secure communication ([20], [12], [18], [6], [10]). Moreover, the synchronization property of chaotic circuits has revealed potential applications to secure communications (see, e.g., [5], Kocarev et al., [12]; Parlitz et al., [18]; Cuomo et al., [6]; Halle et al., [10]). However, the problem of obtaining two chaotic systems oscillating in a synchronized way is a nontrivial question. This is because any small difference in initial conditions would be exponentially amplified and thus the motions of the systems rapidly become uncorrelated (Ogorzalek [17]). Despite this fact, Pecora and Carroll discovered (see [5], [19]), that in particular cases it is very well possible that synchronization between a chaotic transmitter and driven receiver is possible. Much further work about synchronization can be found in the special issue [24]. In this paper we will concentrate on synchronization for systems in discrete time. Mostly, synchronization has been studied for systems in continuous time but many ideas go through in discrete time. On the other hand, the theory is less far developed for discrete time systems -even despite the fact that in many cases the actual implementation is done in discrete time. Consider an autonomous, discrete-time, dynamical system x (k + 1) = f (x (k)),
(1.1)
470
1. Synchronization Through Extended Kalman Filtering
where x (k) is a n-dimensional vector. The Pecora-Carroll scheme for synchronization can be described as follows. Divide the system into two subsystems via x ( k ) = ( X l ( k ) , x2 ( k ) ) T, where Xl (k) is nl-dimensional and x2 (k) is n2-dimensional, with nl + us = n (often n I : 1, s e e [5], [6], [19]). We refer to xl (k) as the drive signal. So, the transmitter system (1.1) can be written as Xl (k ~- 1)
:
fl (Xl (]g),x2 (k)),
(1.2)
x2(k+l)
=
f2(xl(k),x2(k))
(1.3)
where f (x) = (fl (xl, x2), f2 (xl, x2)) T. The driven replica subsystem is described by x2n ( k + 1) = f2 (Xl ( k ) , x ~ (k)),
(1.4)
the so-called receiver dynamics. In (1.4), x2n (k) is the response variable and (1.4) is known as the response subsystem. The receiver system (1.4) synchronizes with the transmitter system (1.2) and (1.3), if lim [[ x2 (k) - x~ (k) I[ = 0,
(1.5)
no m a t t e r which initial values x 1 (0), x2 (0) and x2R (0) have. In the context of communication the drive signal is transmitted from the transmitter system to the receiver system. The full state x (k) of the transmitter is unknown at the receiver. By driving with the known signal xl (k) to a replica response subsystem (1.4), we can then obtain x2 (k), if the copy synchronizes with the full system (1.2)-(1.3) (according to (1.5)). In general, no matter if fi is chaotic or complex, no complete general answer exists to the problem whether a replica system (1.4) would achieve synchronization according to (1.5). For that reason several attempts for achieving synchronization of signals like x2 (k) and x~ (k) have been proposed. The above idea of synchronization by decomposition into subsystems (by using a replica (1.4)) has first been given in Pecora and Carroll [19]; the ocurrence of such synchronization is conditioned on whether the conditional Lyapunov exponents for (1.4) are negative. In such case, the system (1.1) is said to possess a self-synchronizing property. Note however, the negativity of the conditional Lyapunov exponents is not a guarantee for the successful synchronization, cf. Badola et al. [2]. However, recent research has shown, that one is not necessarily constrained to the use of a replica (1.4) when choosing a system to achieve synchronization (at least when one is partly free to choose the receiver dynamics-driven by the transmitter's drive signal), i.e., it is also possible if, instead of (1.4), we utilize a different response system
x y (k + 1) = f ~ (X 1 (k), x2R ( k ) ) .
(1.6)
1. Synchronization Through Extended Kalman Filtering
471
This may leads to considerably more flexibility in applications like secure communication. This added flexibility may facilitate potential improvement in synchronization, e.g., Ding and Ott [8] have obtained synchronization using (1.6) in case a replica (1.4) does not synchronize. Another adventage of using (1.6) is to improve the convergence of the synchronization. In particular, we like to recall the (reduced) observer viewpoint advocated by Nijmeijet and Mareels in [15] which basically admits -under suitable assumptionsthe construction of dynamics (1.6) such that (1.5) holds, whatever initial conditions (1.2), (1.3) and (1.6) have. A possible extension of [15] in discrete time has been given by Huijberts et al. [11]. On the other hand, for communication purposes (specifically, additive signal masking and recovery), it is required that the synchronization is not affected by some noise in the synchronizing drive signal. In other words, the use of synchronized chaotic systems for secure communications relies on the robustness of the synchronization to perturbations in the drive signal and in the system dynamics, [6], see also [20]. For certain synchronized chaotic systems, the ability to synchronize is robust. In the Lorenz system, for example, one can observe such property (Cuomo et al. [6]) for an exact replica (1.4). On the basis of these considerations, we propose a system of the form 2 ( k + 1) = f R (21 (k),~:(k)),
(1.7)
where 21 (k) is the drive signal corrupted by noise from the transmitter dynamics i.e., 21 (k) = xl (k) + v (k), with v (k) the noise signal, and 2 (k) is the state estimate for the original system (1.1), provided by an extended Kalman filter (EKF). In the next section, we give the particular equations for the E K F and further background on it. Among some advantages of using (1.7) as receiver/estimator for (1.1) are: 9 The E K F possesses some natural robustness property to additive Gaussian noise in the drive signal (see Cuomo et al. [6]). 9 The E K F is easily implemented. 9 Flexibility in applications since a decomposition into two subsystems (1.2)-(1.3) is not necessary. The idea of using an E K F to estimate the states of a chaotic system is given perhaps for the first time by Fowler [9]. Cuomo et al. [6] shown that the E K F estimates of a Lorenz system approach its true states. Recently in [21[, Sobiski and Thorp describe a method of synchronizing two chaotic systems by implementing an E K F for continuous-time systems. Motivated by these papers and by its implication in secure communications, we study synchronization between the discrete-time systems (1.1) and (1.7) when the E K F is used as receiver driven by a noisy drive signal. Besides synchronization per se, we also study the utility of using the E K F
472
1. Synchronization Through Extended Kalman Filtering
for reconstruction of a binary message. The idea is t h a t some p a r a m e t e r in the t r a n s m i t t e r can be used as the (binary signal) message carrier. In the E K F we include in this case as extra state a p a r a m e t e r - e s t i m a t o r , see also [21] where a similar idea was used in continuous time. At the same time, we mention some differences with [21]: 9 The context we use here is discrete-time. This on the one hand makes the presentation even simpler t h a n in continuous time. On the other hand, to the best of our (knowledge) using the E K F for synchronization in discrete time is new. 9 By means of extensive simulations we make an a t t e m p t to give an evaluation of the performance of the E K F by changing filter-parameters (covariance noise, initial error, etc.). 9 For the given examples a rigorous result that ensures the local (optimal) convergence of the E K F is given. The organization of this paper is as follows: in Section 2 we present our approach to achieve synchronization of discrete-time systems via an extended K a l m a n filter which is driven by a noisy drive signal from the transmitter. By using computer simulations, the approach used in this s t u d y is illustrated with two examples in Section 3.1, while in Section 3.2 an application of these results to secure communication is given. Finally, Section 4 contains some concluding remarks.
2
An E x t e n d e d K a l m a n Filter as Receiver
Briefly, given a stochastic (linear) model description, the K a l m a n filtering problem is to produce an estimate 2 (k) of the state model x (k), using measurements till time k so as to minimize the mean-square error between estimate and state. From this viewpoint, the K a l m a n filter is the optimal linear filter since it produces an estimate minimizing a mean-square error. For application to nonlinear models, the so-called extended K a l m a n filter is often used in practice, but, in general, no guarantee in this case of producing a good (optimal) state estimate can be given. In this case the nonlinear system is linearized by employing the best estimates of the s t a t e as the reference values used at each stage for the linearization, i.e., the E K F consists of using the classical K a l m a n filter equations for the first-order approximation of the nonlinear system a b o u t the last estimate. As a direct consequence of taking this approximation, the E K F is no longer linear or optimal. For further details we refer the reader to Anderson and Moore [1] and references therein. We consider transmitter dynamics of the form x(k+l)=f(x(k))+w(k),
x(0)=x0,
(2.8)
1. Synchronization Through Extended Kalman Filtering
473
with transmitted signal y ( k ) = h ( x ( k ) ) + v (k).
(2.9)
Typically, x (k) is an n-dimensional vector and y (k) is a scalar signal (although much of what follows can be extended to a vector signal y (k)). In (2.8) w (k) represents the noise in the dynamics of the transmitter which is assumed to be a zero mean noise process with E [w (k) w T (l)] = QSkz > 0, with Ski the Kronecker delta function. Also v (k) is a zero mean noise process with E [v (k)v (l)] = R6kl > 0; v (k) and w (k) are assumed to be independent. R e m a r k 1.1 Although it m a y not be necessary to introduce the d y n a m i c s noise w (k) in (2.8), we .find it convenient and more flexible to do so. In .fact .from certain perspective it can be argued that an error free d y n a m i c s would be an over-idealization, which m a y prohibit a successful synchronization through the E K F , see Sorenson [23]. In the theoretic developments to .follow one might think about the covariance m a t i x Q, as being reasonably small, m i m i c k i n g at least a very accurate modeling. For convenience we have assumed in the examples o.f Section 3 that w (k) = 0 .for all k. The receiver dynamics we propose is a filter that will produce an estimate for the state x (k) given the measurements y (k) according to (2.9). The E K F that we use here as the receiver dynamics for (2.8) and (2.9) is given as follows, cf. [1]. 1) Measurement update equations: ~:(k) = ~:(k/k - 1) + K ~ ( k ) [y (k) - h (~: ( k / k - 1))],
(2.10)
the vector 2 (k) is referred to as the .filtered estimate of the state x (k). The covariance of the error in 2 (k) is given by P~(k) = [~ - K~(k)H~(k)] P ~ ( k / k - 1).
(2.11)
2) Time update equations: The (one-step ahead) predictor of x (k + 1) is given by ~(k + l / k ) -- f (~ (k)),
(2.12)
the covariance matrix of the prediction error is
P~ (k + l/k) = F~ (k) P~ (k) F [ (k) + Q,
(2.13)
where K s ( k ) = P ~ ( k / k - 1)H~(k) [ H ~ ( k ) P ~ ( k / k - 1)HT(k) + R ] - I
(2.14)
474
1. Synchronization Through Extended Kalman Filtering
is known as the Kalman gain matrix, and
& (k) = o f (x (k)) Ox (k)
He (k) =
(2.15)
Oh (x (k))] cox (k) x(k)=e(k/k-1)"
(2.16)
R e m a r k 1.2 In practice it might be impossible to determine x (0) exactly. In this case, x(O) is assumed to be a Gaussian random variable of known mean value E { x ( 0 ) } = 2(0) and known covariance matrix E ~[x (0) - 2 (0)] [x (0) - 2 (0)] T} = P (0), and it is independent of w (k) %
and v (k). The filter is initialized by setting 2 (0) = 20 and P (0) = P0 = PoT > 0. Thus, x (0) is given and we choose arbitrarily 20, P0 = PoT > 0. The definition of synchronization given in the introduction, i.e., equation (1.5), can be extended to include approximate or noisy synchronization to accommodate inaccurate parameters and non-ideal signal transmission. In this case, the receiver (2.10)-(2.13) can not synchronize with the transmitter (2.8) in the way that condition (1.5) is fulfilled so we need to replace it by a weaker condition lim I I x ( k ) - k ( k ) I I
k ----*r
<- P,
Vk>7,
(2.17)
where p should be related to R and is a constant of the synchronization error. If for a given p there exists a time instant 7 (to be called the synchronization time) such that condition (2.17) is fulfilled, then the transmitter (2.8) and the EKE receiver (2.10)-(2.13) are approximately synchronized. R e m a r k 1.3 Also one might consider as an adequate condition .for approximate synchronization in the noisy context lim
fl E ( x ( k ) -
s
ll ~ p,
V k > T.
In particular this may be a more relevant requirement if w (k) is not necessarily bounded. Since we will later on assume that both v (k) and w (k) are bounded it suffices to take (2.17) as the definition .for approximate synchronization. We define for ei ( k ) = xi ( k ) - ~:i ( k ), i = 1, 2 , . . . , n 7-~(p)=min{lei(k)l
k=T,~'+l,...},
(2.18)
1. Synchronization Through Extended Kalman Filtering
475
and the synchronization time by ~-=max(T~),
i=1,2,...
,n.
(2.19)
R e m a r k 1.4 F r o m (2.17) note that there exists a c o m p r o m i s e b e t w e e n the quantities p and T, since if p increases t h e n T decreases, a n d vice versa.
The E K F is often used to design observers (to deal with state estimation) for forced or non forced nonlinear systems. In spite of the fact that only local convergence is ensured, this method is widely used in practice and often gives convincing results (for a summary of the theory and applications see, e.g. Boutayeb et el. [4], Bards et al. [3], La Scala et al. [13], Ljung [14], Song and Grizzle [22] and references therein). The convergence aspects of the E K F when it is used as a deterministic observer for discrete-time system, are analyzed through a Lyapunov approach in Boutayeb et al. [4], and Song and Grizzle [22]. We follow the approach proposed in La Scala et al. [13], for the establishing the convergence of the E K F when applied to a stochastic, discrete-time nonlinear system with a linear output map. To this end, define the error in the filtered state as e (k) =a x (k) - ~ (k).
(2.20)
From (2.10) we have e (k) = [I - K~ (k) H~ (k)] e ( k / k - 1) - K~ (k) v (k), where e ( k / k - 1) a= x (k) - ~ ( k / k - 1) is the error in the predicted state estimate, thus, e (k + l / k ) = o f (x (k))- e (k) - O f (x (k) - e (k)) + w (k) Of is the remainder term from the Taylor series expansion of f , i.e.,
of
O f (a, b) = f (a + b) - f (a) - -~a ( a ) . b.
So, for the (corrupted) drive signal (2.9), we have the error dynamics equation e(k)
=
[I-K~(k) H~(k)]F~(k-1)e(k-1)
(2.21)
- [I - K~ (k) H~ (k)] O f (x (k - 1) , - e (k - 1))
+ [I - K~ (k) H~ (k)] w (k - 1) - K~ (k) v (k). From the last equation, we see that the dynamics for the filtering error of the E K F driven by a noisy drive signal from the transmitter, is composed by the sum of the error dynamics for the deterministic case (neglecting
476
1. Synchronization Through Extended Kalman Filtering
linearization errors), and nonlinear perturbation terms driven by the noise processes and remainder term from the Taylor series expansion of f. Consider the time-varying linear system
(k+l) v (k)
=
F~(k)~(k)+w(k),
=
H i ( k ) + v (k),
(2.22)
with Fz (k) = (Of/Ox) (r (k) - r (k)). Define the observability Gramian of Fz, R - : H along a trajectory {z (k)} of (2.22) as k
O(k,N)
q~T (i, k) H T (i) R - 1 H (i) 9 (i, k)
: i=k--N
for some N _> 0 and for all k > N, where O(k2, kl) = Fz ( k 2 - 1) Fz (k2 - 2)-.. Fz (kl). Similarly, the controllability Gramian of [F~, Q] along a trajectory {z (k)} of (2.22) as k-1
C(k,N)=
E
O(k'i+l)QOT(k'i+l)"
i=k- N
A system is said to be controllable (observable) along a trajectory {z (k)} if there exists N such that for all Rx > 0 there exist 0 < er < Rx, ai (Rx,er,N) and bi (Rx,e~,N), i = 1,2, such that for some arbitrary sequence {~(k)}, ]l~ (k)ll < R~, and for all {r such that I1r (k)[] < ~
alI >_C (k, N) > a2I,
0 < a2 ~_ al < 00,
(2.23)
b l I < O ( k , N ) <_b2I,
O
(2.24)
where these Gramians are evaluated along the trajectory z (k) = r (k) r (k) of (2.22). The following assumptions are needed, cf. [13]: 1) The transmitted signal (2.9) is a linear in x, i.e., y (k) = H x (k) + v (k).
(2.25)
2) f e C a (JR'~, Nr'), (Of /Ox) (X) is invertible for all x e N'~, and 3) for all k (see Remark 1.3)
IIx(k)ll ~ IIw(k)ll < IIwI] < c~
and
Rx,
IIv(k)l[ <_ I[v!] < cx~.
(2.26) (2.27)
1. Synchronization Through Extended Kalman Filtering
477
From Assumption 2), we can find p/ > 0, i - 1, 2, 3 such t h a t
"~x (x) Of <_Pl, for all ]]x[] < Rz + r P4 > 0 such t h a t
02f -g~x2 (x) < p2,
0 3 f (x) Ox3 <_P3
Furthermore, by the continuity of f, there exists a
0 f (Xl)
Of (x2)
- ~xx
~ P4
]Ix1
-
x2l[
for all ]l:rlII, ]Ix21] < R~, ~nd Of (Zl) - Of 02I z2) ~xx ~xx (x2) - ~ (x2)-(xl -
1 _< ~P4 Ilxl - x2[[ 2
(see, La Scala ~t aZ. [13] ). Let
__1 +b2, s = - - +1 b 2 + p ~ 5 {
1
p=al+b-~l'
q = a2
i.
a2
Since Pz (k) and Pz (k + 1/k) are defined by means of the linear system (2.22), we find, using [7] the bounds (depending on er, P~ and N)
q-'I q-lI
< Pz(k) <_ pI, < Pz(k + l/k) < sI.
Let 1
z=min
{ ( i ()) v/2 r'74
-Pl +
P21 + q1
1
'
(2.28)
1 2 IIz (k)[I 4 -< where 0 < 3' < 1/sp 2, provided t h a t -1/sp2+plp4q I[z (k)l [2 +~P4q -0'- Define a, 3 > 0 via k
~3ak = (pq)l
1
-
(2.29)
and consider 5 (k + 1) = A (k) ~"(k) + ]2 (k, 2, (k)) where A (k) = [I - K~ (k + 1) H] F~ (k),
?2 (k, 2, (k)) =
cgK~(k + 1) HF~ (k) z (k) 2, (k) Oz (k)
(2.30)
478
1. Synchronization Through Extended Kalman Filtering
and
=x(k)-z(k). Note that (2.30) is the linearized, undriven component of the E K F error dynamics, neglecting linearization errors. In La Scala et al. [13] an explicit expression for an upper bound on f2 is given as
IJs (k,
(k))lJ <
=
(2.31)
where 5k =- 5pp55~ 1 (1 + sp25~1), Ilgl[ < p5 and 5p = 2plp2p. Let
~l =
max
{(,,wll-I- ' 0 pq
-~p4c r 1
-F pp5t521
Ilvll,
(2.32)
2
5k (Ps ]'w]] + ['v[' + -~P4P5er) + 2pqPl } , and
= 2plp55k + PlPh~k2r
(2.33)
with 5k2 (Pl, P2, P3, Ph, (~2,P) > 0 such that
OUKs (k + 1)
_< 5k2.
Assume there exist ~, ~ > 0 as defined in (2.32) and (2.33) such t h a t II[I - K s (k + 1) H] [w (k) - O/(x (k),e (k))] - K s (k) v (k)l I < , e r , and 02 ~ z 2 [I - Ks (k + l) g ] F x ( k ) . z ( k )
_i
for all IIz(k)l I < er and for all k > 0. T h e o r e m 1.1 [EKF Stability] (La Scala et el. [13]) Consider the error
dynamics of the E K F given in (2.21) when the E K F is applied to a signal model (2.8)-(2.25) which satisfies the standing assumptions 1)-3). Select N and 0 < e~ < Rx such that the controllability and observability conditions (2.23) and (2.24) are satisfied. Then if fl (~ + ~) + a < 1,
(2.34)
~{s + (~ < 1,
(2.35)
1. Synchronization Through Extended Kalman Filtering lle(O)ll < ~
(pq)-89
479 (2.36)
where cz, ~ and 13, ~ , ~ , and ~ are given in (2.28), (2.29), (2.31), and (2.32), and (2.33), respectively, we have that the error dynamics satisfies
(2.37) for all O ~ k < T, and
ile(k)l[
< f l ~ ( R x - f l c z ) < ~r 1-(~+r -
(2.38)
.for all k >_ ~-.
It is interesting to observe that the above theorem implies that under suitable technical conditions on the system dynamics f we obtain the 'practical stability' condition (2.37)-(2.38). It is clear that in the given noisy context this is the best we can hope far; convergence to zero is obviously impossible. The constant er will play in the next section the approximate synchronization constant, see (2.17). Similarly the synchronization time T in (2.19) is related to the integer N.
3 Examples 3.1
Synchronization
E x a m p l e 1 (see Badola et al. [2]) Consider two coupled logistic maps as the transmitter dynamics x , (k + 1)
=
(1 - e) #Xl (k) (1 - x 1 (k)) --~ Ex2 ( k ) ,
x2(k+l)
=
( 1 - e) # x : (k) ( 1 - x2 (k)) + ex, (k) .
(3.39)
Treating y (k) = x2 (k) as the (ideal) drive signal (nl = n2 = 1), Badola et al., in [2] investigated the synchronization of x~ (k) and the receiver signal x~ (k) of which the dynamics were taken as x~ (k + 1) = (1 - ~) #x R (k) (1 - x~ (k)) + ex2 (k).
(3.40)
In Badola et al. [2] it turned out that only for particular initial conditions synchronization between (3.39) and (3.40) occurs. We therefore reconsider (3.39) in the frame of Section 2. To do this, we use an E K F presented in the previous section as receiver dynamics for the noisy transmitter xx (k + 1)
=
(1 - ~) , x l (k) (1 - xl (k)) + ~x2 (k) + Wl (k), (3.41)
x2(k+l)
=
(1-e)#x2(k)(1-x2(k))+eXl(k)+w2(k)
480
1. Synchronization Through Extended Kalman Filtering
and the (corrupted) drive signal y ( k ) = x2 (k) + v ( k ) .
(3.42)
T h e E K F will yield the state estimates 21 (k) and 2 2 (k) for the signals xl (k) and x2 (k). T h e structure of the E K F is given by =
21 (k -~- 1)
(1 - c) #21 (k) (1 - 21 (k)) -{- s
(k)
(3.43)
-~- k 1 (k) [y (k) -- 22 (k)],
22 (k + 1)
=
(1 - e) #k2 (k) (1 - 22 (k)) + s
+
(k) [y (k) -
(k)
(k)]
where the gain vector (kl (k), k2 (k)) T is given via equations (2.10)-(2.14). We investigate the evolution of the estimation process created by the E K F with the assumption t h a t the initial m a t r i x P0 is of the form P0 = diag{p0i}, i = 1, 2. Also, the variance of the noise R was fixed at 0.00005 and P01 -- P02 = 100. To simplify the presentation we realized the dynamics noise as being identical zero or, which is equivalent, the covariance Q of (wl, w~)T was supposed to be extremely small of the form Q -- diag{qi}, i = 1,2, with q~ = 10 -s. For the p a r a m e t e r values of e = 0.09 and p = 3.7 and initial conditions in [0, 1] • [0, 1], we have t h a t IIx (k)l I < R~ = x/T-fi for all k > 0. Although Of (x)/Ox is not invertible everywhere, it turns out t h a t when initializing (3.41) at x C0) = (0.4, 0.7) the Jacobian remained nonsingular along the trajectory. T h e controllability and observability conditions hold for all k _> 1. Finally, we take ~r = 1, Pl = 4.76, P2 --~ P 3 : f14 --~ 3.36 and p5 = 1 to satisfy the T h e o r e m 1.1. Initial conditions x (0) = (0.4, 0.7) have been used for the subsequent simulations. For the above p a r a m e t e r values of # and e, the t r a n s m i t t e r (3.41) is apparently chaotic. Following [2], Xl (k) and x R (k) do not synchronize for these p a r a m e t e r values and initial conditions xln (0) = 21 C0) = 0.65 while we obtain synchronization (according to (2.17)) using the E K F as receiver. Figure 1 shows the synchronization error between t r a n s m i t t e r and receiver dynamics. We see that, after some transient behavior, the approximate synchronization is clearly visible; according to (2.19) it is obtained when 7- = 3 when p = 0.04 is considered (see Figure 2). To evaluate the performance of the E K F from the point of view of sensitivity to initial errors, twenty different Monte Carlo runs were taken in order to obtain root-mean-square error statistics. T h e results are s u m m a rized in Table 1.1; where sdi (i ---- 1, 2) is the sum of square errors given by N
sdi=E i~1
(xi(k)-2i(k)) 2,
k=0,1,...,N
(3.44)
1. Synchronization Through Extended K a l m a n Filtering 0.1
,
0.0
:
0
,
,
.....
:
,
,
,
: .....
:
'
:
:
[
:
:
! '
!
!
481
-0.0s
"~ -o.1 -o.15
o.01 ~
~
0
-O.01H J
:
:
, "0,02~' . . . . . . . . . -0.031
!
i
..........
'
~
. . . . . . . . . .
...... . . . . . . . . . . . . . ......
-0.04
!
-
....
~
....
:
.....
........ : . . . . . : ......
. . . . . . . . . . . . . . . . . . . . -0 oe i , , , , ,
........ .:.: .....
:
1/ I 9
~
.....
-0.05
" 0
10
20
30
40
50
60
;
70
i
80
;
90
100
F I G U R E 1. Synchronization errors e~ (k) = x~ (k) - & ~ (k) (i = 1, 2) for transmitter (3.39) and E K F receiver (3.43) for Example 1: e(0) = ( - 0 . 2 5 , - 0 . 0 5 ) , R -- 0.00005, # = 3.7, and e = 0.09. w h e r e xi (k) a n d 2~ (k) a r e t h e t r u e a n d e s t i m a t e d s t a t e s , r e s p e c t i v e l y , a n d N t h e n u m b e r of t i m e steps. T h u s , t h e m e a n s q u a r e e r r o r ( M S E ) is o b t a i n e d b y ~dg . W i t h t h e p u r p o s e t o k n o w t h e s a m e s t a t i s t i c s , w h e n t h e t r a n s i e n t h a s d i e d o u t we define t h e t r u n c a t e d m e a n - s q u a r e e r r o r as N
TMSE-
N + 1I - - T E
(x (i) - k (i)) 2 .
(3.45)
T h e r e s u l t s in T a b l e 1.1 show t h e g o o d p e r f o r m a n c e of t h e E K F for t h e s y s t e m (3.41). I n p a r t i c u l a r , it s h o u l d b e o b s e r v e d t h a t even w i t h l a r g e r init i a l errors, t h e t r u n c a t e d m e a n - s q u a r e e r r o r s r e m a i n w i t h i n s i m i l a r ranges. Example
2 (see H u i j b e r t s et al. [11]) C o n s i d e r t h e t h i r d o r d e r t r a n s m i t t e r
x~ (k + 1)
=
(1 - e) # x l (k) (1 - Xl (k)) + 5x 2 (]~),
x2 (k + 1)
=
(1 - E) px2 (k) ( 1 -
x~ (k + 1)
=
(1 - ~) px3 (k) (1 - x3 (k)) + Exl (k)
(3.46)
x2 (k)) + cx3 ( k ) ,
as a n e x t e n s i o n of t h e s y s t e m (3.39). S i m i l a r l y t h a t t h e l a s t e x a m p l e we consider the noisy transmitter x l (k + 1)
=
(1 - e) # x l (k) (1 - x l (k)) + ex2 (k) + w l ( k ) , (3.47)
x2 (k + 1)
=
(1 - s [tx 2 (k) (1 - x 2 (k)) -J- s 3 (k) -+- w 2 (]g),
x3 (k + 1)
=
(1 - ~) ~ x 3 (k) (1 - x3 (k)) + ~xl (k) + w3 ( k ) ,
482
1. Synchronization Through Extended Kalman Filtering
0.0
~o -0.02
~
-0.
5
10
~" 0.0
15
........................:
0
25
..............
~
-0.02
-0.
20
~
......................................
)
5
10
15
20
25
F I G U R E 2. Approximate synchronization when p = 0.04 is considered for Example 1: R = 0.00005, /z = 3.7, and e = 0.09. in this case the (corrupted) drive signal is y (k) = x2 (k) + v ( k ) .
(3.48)
T h e equations for the E K F (receiver s y s t e m for (3.46)) are
~l(k+l)
=
(1-e)#~,(k)(1-~l(k))+e22(k)
(3.49)
+ kl (k) [y (k) - ~2 (k)], + k2 (k) [y (k) - ~ (k)], ~a (k + 1)
=
(1 - ~) ~ a (k) (1 - ~3 (k)) + ~ 1 (k) + ks (k) [y (k) - ~ (k)]
with gain vector (k 1 (k), k 2 ( k ) , k 3 (k)) T given via equations (2.10)-(2.14). In the following simulations we take x (0) = (0.2, 0.4, 0.6), R = 0.0001, P0 = diag{p0i}, p0i = 100, i = 1,2,3, # = 3.7, a n d e = 0.35 (in this case, we take this value since e s t i m a t i n g the signals xl (k) a n d x3 (k) is m o r e difficult because Xl (k) is only indirectly influenced via x3 (k)) have been used. Again, to simplify the p r e s e n t a t i o n we realized t h e d y n a m i c s noise as being identical zero or, which is equivalent, the covariance Q of (wl, w2, w3) T was s u p p o s e d to be e x t r e m e l y small of the f o r m Q = diag{qi}, i = 1,2,3, qi = 10 - s . For the above p a r a m e t e r values of e a n d # and initial condition in [0, 1.2] x [0,1.2] x [0, 1.2], we have t h a t [Ix(k)[} < R~ = ~ for all k > 0. Again, we have t h a t f ( x ) / O x is not invertible everywhere, it t u r n s out t h a t w h e n
1. Synchronization Through Extended Kalman Filtering
483
x (o)
~ (o)
x (o) - ~ (o)
sdl
sd2
~
tsdl
tsd2
(0.4,0.7)
(0.4,0.7) (0.39,0.69) (0.37,0.67) (0.35,0.85) (0.35,0.90) (0.35,0.95) (o.35,1) (0.30,0.85) (0.25,0.85) (0.20,0.85) (0.65,0.75) (0.70,0.75) (0.75,0.75)
(0,0) (0.01,0.01) (0.03,0.03) (0.05,-0.15) (0.05,-0.20) (0.05,-0.25) (0.05,-0.30) (0.10,-0.15) (0.15,-0.15) (0.20,-0.15) (-0.25,-0.05) (-0.30,-0.05) (-0.35,-0.05)
0.0019 0.0021 0.0022 0.0052 0.0057 0.0062 0.0082 0.0214 0.0745 0.1697 0.0666 0.1026 0.1870
0.0002 0.0003 0.0003 0.0225 0.0400 0.0625 0.0900 0.0225 0.0226 0.0226 0.0025 0.0025 0.0025
0 0 0 2 2 4 6 3 8 8 3 3 8
0.0019 0.0021 0.0022 0.0027 0.0030 0.0032 0.0035 0.0028 0.0036 0.0046 0.0023 0.0024 0.0056
0.0002 0.0003 0.0003 2.4625e-05 2.7650e-05 2.8954e-05 3.3452e-05 2.7802e-05 4.3795e-05 5.0678e-05 2.6513e-05 4.1756e-05 7.0948e-05
TABLE 1.1. Dependence of the synchronization time on the initial condition and truncated mean-square error according to (3.45) for Example 1: p0~ ---- 100, i = 1, 2, R ----0.00005, # ----3.7, e = 0.09, p = 0.04, N ----100. initializing (3.47) at (0.2, 0.4,.0.6) the J a c o b i a n r e m a i n e d nonsingular along the trajectory. T h e controllability a n d observability conditions hold for all k > 1. Finally, we take ~r = 1.2, Pl = 5.94, P2 = P3 = P4 4.2 a n d P5 = 1 in order to satisfy the T h e o r e m 1.1. Figure 3 shows the synchronization error evolution between (3.47) a n d (3.49) for ~ (0) = (1.3, 6.4, 0). Notice t h a t we have a s s u m e d here a r a t h e r large initial error as to see the clear effect t h a t the E K F needs m o r e time for a p p r o x i m a t e synchronization. Again, we can see, after some transient behavior, t h a t a p p r o x i m a t e s y n c h r o n i z a t i o n is achieved; according to (2.19) it is o b t a i n e d at T = 8 when p ----0.04 is considered (see Figure 4). T w e n t y different Monte Carlo runs were taken in order to o b t a i n r o o t m e a n - s q u a r e error statistics. T h e results are s u m m a r i z e d in Table 1.2 (sd~ a n d T) a n d Table 1.3 (tsd~). :
3.2
Secure C o m m u n i c a t i o n
Finally, in this subsection, we w a n t to present an illustration of t h e p o t e n t i a l use of synchronized systems t h r o u g h the E K F in secure c o m m u n i c a t i o n s . P a r a m e t e r switching is the simplest form of chaotic p a r a m e t e r m o d u l a tion. In this m e t h o d the message s (k) is s u p p o s e d t o be binary, a n d is used to m o d u l a t e one or more p a r a m e t e r s of the (switching) t r a n s m i t t e r , i.e., s (k) controls a switch whose action changes the p a r a m e t e r values of t h e t r a n s m i t t e r . Thus, according to the value of s (k) at a n y given i n s t a n t k, the t r a n s m i t t e r has either the p a r a m e t e r set value p or t h e p a r a m e t e r set value p. At t h e receiver, s (k) is decoded by using the s y n c h r o n i z a t i o n error t o
484
1. Synchronization Through Extended Kalman Filtering
(o) (0.2,0.2,0.6)
(o)
x (o) - ~ (o) (0,0,0) (0.21,0.21,0.61) (-0.01,-0.01,-0.01) (0.22,0.22,0.62) (-0.02,-0.02,-0.02) (0.23,0.23,0.63) (-0.03,-0.03,-0.03) (0.24,0.24,0.64) (-0.04,-0.04,-0.04) (0.25,0.25,0.65) (-0.05,-0.05,-0.05) (0.26,0.26,0.66) (-0.06,-0.06,-0.06) (0.28,0.28,0.68) (-0.08,-0.08,-0.08) (0.3,0.3,0.7) (-0.1,-0.1,-0.1) (0.4,0.4,0.8) (-0.2,-0.2,-0.2) (-0.05,-0.05,0.35) (0.25,0.25,0.25) '(0.2,0.4,0.6) (1.1,2.2,0.1) (-0.9,-1.8,0.5) (1.1,3.9,0) (-0.9,-3.5,0.6) (1.3,6.4,0) (-1.1,-6,0.6) (1.3,7.4,-0.1) (-1.1,-7,0.7)
(0.2,0.2,0.6)
sdl sd2 0.0025 0.0010 0.0028 0.0012 0.0030 0.0014 0.0035 0.0020 0.0051 0.0026 0.0078 0.0035 0.0112 0.0044 0.0191 0.0072 0.0277 0.0108 0.1130 0.0439 0.6244 0.0825 0.7098 3.2463 0.8112 12.2512 1.3486 36.0032 1.4776 49.0023
sds ~" 0.0037 0 0.0040 0 0.0044 0 0.0049 0 0.0057 3 0.0089 3 0.0107 5 0.0126 5 0.0158 5 0.1377 10 0.3338 17 0.3684 6 0.3984 6 0.4523 8 0.7087 11
TABLE 1.2. Dependence of the synchronization time on the initial condition for Example 2: poi = 100, i ---- 1,2,3, R = 0.0001, # --- 3.7, 9 -- 0.35, p ---- 0.04, N -- 100.
x (o) (0.2,0.2,0.6)
~ (o)
(0.2,0.2,0.6)
(0.21,0.21,0.61) (0.22,0.22,0.62) (0.23,0.23,0.63) (0.24,0.24,0.64) (0.25,0.25,0.65) (0.26,0.26,0.66) (0.28,0.28,0.68) (0.3,0.3,0.7) (0.4,0.4,0.8) (-o.o5,-o.o5,o.35) (0.2,0.4,0.6) (1.1,2.2,0.1) (1.1,3.9,0) (1.3,6.4,0) (1.3,7.4,-0.1)
x (o) - ~ (o) (0,0,0) (-0.01,-0.01,-0.01) (-0.02,-0.02,-0.02) (-.03,-.03,-.03) (-0.04,-0.04,-0.04) (-0.05,-0.05,-0.05) (-0.06,-0.06,-0.06) (-0.08,-0.08,-0.08) (-0.1,-0.1,-0.1) (-0.2,-0.2,-0.2) (0.25,0.25,0.25) (-0.9,-1.8,0.5) (-0.9,-3.5,0.6) (-1.1,-6,0.6) (-1.1,-7,0.7)
tsdl 0.0025 0.0028 0.0030 0.0035 0.0010 0.0013 0.0017 0.0019 0.0030 0.0017 0.0100 0.0005 0.0005 0.0018 0.0024
tsd2 0.0010 0.0012 0.0014 0.0020 0.0004 0.0005 0.0007 0.0009 0.0011 0.0016 0.0059 0.0005 0.0007 6.0014 0.0015
tsda 0.0037 0.0040 0.0044 0.0049 0.0017 0.0020 0.0022 0.0024 0.0025 0.0028 0.0062 0.0010 0.0011 0.0020 0.0026
TABLE 1.3. Truncated mean-square error according to (3.45) for Example 2: po~ = 100, i = 1,2,3, R---- 0.0001, #--- 3.7, 9 -- 0.35, p - - 0.04, N = 100.
1. Synchronization Through Extended Kalman Filtering
485
02
iiiiiiiiiiiiiiiiiiiii -1 .-*
0
o.2
.....
"%
iiiiiiiii
i{iiiiiiii
iiiii .... iiii 84
10
20
30
40
50
60
70
80
90
100
10
i 20
, 30
i 40
i 50
, 60
"/0
80
L 90
100
....
....
i
,'o
:
~o
......
~o
~o
io
......
~o
i
;o
.....
~o
;o
~
F I G U R E 3. Synchronization errors e~ (k) ----x~ (k) - 2~ (k) (i ---- 1,2, 3) for transmitter (3.46) and EKF receiver (3.49) for Example 2: e-(0) = ( - 1 . 1 , - 6 , 0 . 6 ) , R = 0.0001, # --- 3.7, and e = 0.35. decide w h e t h e r the received signal corresponds to one p a r a m e t e r value, or the other (it can be i n t e r p r e t e d as a zero or one). T h e usefulness of this simple idea has been d e m o n s t r a t e d by Parlitz et aL [18] and C u o m o et el. [6] for a replica. In our case, to t r a n s m i t s (k) via p a r a m e t e r m o d u l a t i o n scheme, the E K F is modified to e s t i m a t e the value of this p a r a m e t e r . Thus, a c o m b i n e d s t a t e a n d p a r a m e t e r e s t i m a t i o n is m a d e by the extension of the s t a t e vector with the u n k n o w n p a r a m e t e r . Let # be the p a r a m e t e r to be m o d u l a t e d in t h e t r a n s m i t t e r d y n a m i c s (3.41) a n d (3.47); in b o t h examples, the p a r a m e t e r value e was fixed. If no other apriori i n f o r m a t i o n is available, a n additional s t a t e # (k) is used to e x t e n d the original s t a t e vector by t r e a t i n g # as a function of t i m e according to p (k + 1) = # (k). So, the noisy t r a n s m i t t e r d y n a m i c s (3.41) and (3.47) b e c o m e for E x a m p l e 1: 3:1 (k ~- 1)
=
(1 - e) # (k) xl (k) (1 - xl (k)) + ex2 (k) + Wz ( k ) , (3.50)
x2 (k + 1) = (1 - c ) # ( k ) x 2 (k) (1 - x2 (k)) + exl (k) + w2 ( k ) , #(k+l)
= #(k)+w3(k),
a n d for E x a m p l e 2: xl (k + 1) ---- (1 - e) # (k) x I (k) (1 - x 1 (k)) Jr- Ex 2 (k) ~- w 1 (k), (3.51) x2 (k + 1) = (1 - e) # (k) x2 (k) (1 - x2 (k)) + Ex 3 (k) + w 2 ( k ) , x3 (k + 1) = (1 - e ) # ( k ) x 3 (k) (1 - x3 (k)) + eXl(k) + w3 ( k ) , # (k + 1) = # (k) + w4 (/g),
486
1. Synchronization Through Extended Kalman Filtering 0.
!
0.0
...........
-0.132 -0
.
.
.
0
00~
~.oo.oi! ........
!
~
.
.
..............................
.
.
.
.
.
.
. 10
15
20
,
,
,
,
............
i.............. i i ..............
........ :~ .........
i o
-0.02
. . . . . . . . . . . . . . . . . . . . .
$
~ o. ~
:
. . . . . . . . . . .
25
............... ................ i .............. . . . . . . . . . . . . .
........
FIGURE 4. Approximate synchronization when p = 0.04 is considered for Example 2: R -= 0.0001, # = 3.7, and e -- 0.35. respectively, with the (corrupted) t r a n s m i t t e d signal v (k) = x2 (k) + v ( k ) .
(3.52)
We note that b o t h examples satisfy the conditions of T h e o r e m 1.1; for communication purposes we take values of Rx smaller t h a n v/1-fi and x/3.15 for Examples 1 and 2, respectively. We use a 'modulation rule' to m o d u l a t e s (k) in the p a r a m e t e r # of the t r a n s m i t t e r (3.50) and (3.51). T h e n the E K F used as receiver maintains synchronization by estimating the changes in the modulated p a r a m e t e r tt (while the p a r a m e t e r e is fixed at the same value as in the transmitter). So, s (k) can be recovered by the estimation given through the EKF. The modulation rule is given by
#(k) =/t+a-s(k),
~(k) =#+a-~(k)
(3.53)
where a is a constant and a (k) is the recovered message. T h e message is defined as follows
s(k)=
0, 1, 0, 1, 0,
0
An illustration for the binary communication of E x a m p l e 1, via modulation and estimation of p a r a m e t e r # with a = 0.08, i.e., when # is switched between # (0) = 3.7 and # (1) -- 3.78 is shown in Figure 5. Figure 6 shows
1. Synchronization Through Extended Kalman Filtering
487
3.79
3.7~
3.74
3~72
37
13
100
200
300
4~
~00
600
700
~
900
1~
FIGURE 5. Estimated and true value # for Example 1: x(0) = 5(0) = (0.4,0.7,3.7), R=0.00005, and a = 0.08. binary communication for E x a m p l e 2, via modulation and estimation of p a r a m e t e r p with a --- 0.1, i.e., when # is switched between # (0) = 3.7 and # (1) -- 3.8.
4 Concluding Remarks We have discussed the use (in discrete-time) of an extended K a l m a n filter (EKF) as receiver system for chaotic synchronization purposes. Synchronization is obtained between t r a n s m i t t e r and receiver dynamics when the E K F is driven by a noisy drive signal from the transmitter. T h e computer simulation results presented, show t h a t our chaotic synchronization approach is robust to additive Gaussian noise. Besides synchronization per se, we have presented the utility of using the E K F for reconstruction of a binary message. In this case, by modulating a p a r a m e ter in the t r a n s m i t t e r and estimating this p a r a m e t e r via a modified E K F . Thus, we expect t h a t it can be possibly applied to experimental systems, especially, for secure communication systems based on signal masking and p a r a m e t e r modulation. C o m p l e m e n t a r y simulations showed that synchronization and binary communication are also possible in case x (0) r ~ (0). Although, the synchronization time depends on the initial conditions and is different to the case x (0) = 2 (0). Obviously, the synchronization time is smaller for x (0) ~ ~ (0) and for smaller noise variance.
488
1. Synchronization Through Extended Kalman Filtering
3.7,~
3.76
317~
3=72
FIGURE 6. Estimated and true value tt for Example 2: x(0) = 5(0) = (0.2, 0.4, 0.6, 3.7), R=0.0001, and a = 0.1.
Acknowledgements This chapter was realized during a postdoctoral stay of the first authorat the Univ. of Twente, supported by CONACYT (M6xico) under Grant 973093. 5
REFERENCES [1] B.D.O. Anderson and J.B. Moore, Optimal Filtering, Prentice-Hall, INC. Englewood Cliffs, New Jersey, 1979. [2] P. Badola, S.S. Tambe, and B.D. Kulkarni, "Driving systems with chaotic signals", Physical Review A, 46(10), pp. 6735-6737, 1992. [3] J.S. Baras, A. Bensoussan and M.R. James, "Dynamic observers as asymptotic limits of recursive filters: special cases", S I A M Journal on Applied Mathematics, 48(5), pp. 1147-1158, 1988. [4] M. Boutayeb, H. Rafaralahy and M. Darouach, "Convergence Analysis of the Extended Kalman Filter Used as an Observer for Nonlinear Deterministic Discrete-Time Systems", IEEE Trans. Automat. Contr., 42(4), pp. 581-586, 1997. [5] T.L. Carroll and L.M. Pecora, "Synchronizing chaotic circuits", IEEE Trans. Circuits Syst., 38, pp. 453-456, 1991. [6] K.M. Cuomo, A.V. Oppenheim and S.H. Strogratz, "Synchronization of Lorenz-based chaotic circuits with application to communication", IEEE Trans. Circuits Syst. II, 40(10), pp. 626-633, 1993.
1. Synchronization Through Extended Kalman Filtering
489
[7] J.J. Deyst, Jr. and C.R. Price, "Conditions for asymptotic stability of the dicrete minimum-variance linear estimator", IEEE Trans. Automat. Contr, 13(6), pp. 702-705, 1968. [8] M. Ding and E. Ott, "Enhancing synchronism of chaotic systems", Physical Review E, 49(2), pp. 945-948, 1994. [9] T.B. Fowler, "Application of Stochastic Control Techniques to Chaotic Nonlinear Systems", IEEE Trans. Automat. Contr., 34(2), pp. 201205, 1989. [10] K.S. Halle, C.W. Wu, M.Itoh and L.O. Chua, "Spread spectrum communication through modulation of chaos", International Journal of Bifurcation and Chaos, 3(2), pp. 469-477, 1993. [11] H.J.C Huijberts, T. Lilge and H. Nijmeijer, 1999 "A control perspective on synchronization and the Takens-Aeyels-Sauer Reconstruction Theorem", To appear in Phys. Rev. E. [12] Lj. Kocarev, K.S. Halle, K. Eckert and L.O. Chua, "Experimental demostration of secure communications via chaotic synchronization", International Journal o.f Bifurcation and Chaos, 2(3), pp. 709-713, 1992. [13] B.F. La Scala, R.R. Bitmead and M.R. James, "Conditions for Stability of the Extended Kalman Filter and Their Application to the Frequency Tracking Problem", Mathematics of Control, Signals, and Systems, 8, pp. 1-26, 1995. [14] L. Ljung, "Asymptotic behavior of the extended Kalman filter as a parameter estimator for linear systems", IEEE Trans. Automat. Contr., 24, pp. 36-50, 1979. [15] H. Nijmeijer and I.M.Y. Mareels, "An Observer Looks at Synchronization", IEEE Trans. Circ. Syst. I, 44(10), pp. 882-890, 1997. [16] H. Nijmeijer, "On Synchronization of Chaotic Systems", Proc. 36th IEEE Conf. on Decision and Control, San Diego, USA, pp. 384-388, 1997. [17] M.J. Ogorzalek, "Taming Chaos-Part I: Synchronization", IEEE Trans. on Circuits and Systems-I: Fundamental Theory and Applications, 40(10), pp. 693-699, 1993. [18] U. Parlitz, L.O. Chua, Lj. Kocarev, K.S. Halle and A. Shang, "Transmission of digital signals by chaotic synchronization", International Journal of Bifurcation and Chaos, 2(4), pp. 973-977, 1992.
490
1. Synchronization Through Extended Kalman Filtering
[19] L.M. Pecora and T.L. Carroll, "Synchronization in chaotic systems", Phys. Rev. Let., 64, pp. 821-824, 1990. [20] G. P@rez and H.A. Cerdeira, "Extracting Messages Masked by Chaos", Physical Review Letters, 74(11), pp. 1970-1973, 1995. [21] D.J. Sobiski and J.S Thorp, "PDMA-I: Chaotic Communication via the Extended Kalman Filter", IEEE Trans. Circuits and Systems-I: Fundamental Theory and Applications, 45(2), pp. 194-197, 1998. [22] Y. Song and J.W. Grizzle, "The Extended Kalman Filter as a Local Asymptotic Observer for Discrete-time Nonlinear systems", Journal of Mathematical Systems Estimation and Control, 5(1), pp. 59-78, 1995. [23] H.W. Sorenson, "Least-squares estimation: from Gauss to Kalman", IEEE Spectrum, 7, pp. 63-68, 1970. [24] Special Issue on Chaos Synchronization and Control: Theory and Applications of the IEEE Trans. Circuits and Systems I, 44(10), 1997.
Nonlinear Discrete-Time Observers for S y n c h r o n i z a t i o n Problems T. Lilge Institut fttr Regelungstechnik University of Hannover Hannover, Germany
1
Introduction
In recent years there has been an increasing interest in the synchronization of two coupled dynamical systems. Pecora and Carroll presented in their fundamental work [15] the synchronization of two identical Lorenz systems with different initial conditions if the second systems was driven by a state variable of the first system. In the following, many other works considered the synchronization of continuous-time systems, e.g. [14, Chapter 15], [18], [17] or [16], but also of discrete-time systems, e.g. [1], [11] or [19]. One possible use of synchronizing two systems can be found in secure communications where the receiver has to reconstruct the desired information from the transmitted state variable(s) of the transmitter system regardless of the initial conditions of transmitter and receiver. In many examples found in the literature, the corresponding receiver system is an identical copy of the transmitter driven by the transmitted signals. There is often no theoretical proof given for the synchronization of transmitter and receiver. Moreover, the time until the systems synchronize can not be influenced and can be very long. Finally, synchronization of some systems can not be achieved for arbitrary initial conditions, cf. [1]. > F r o m a control perspective, the problem of synchronization can be regarded as an observer problem, cf. [13] for continuous-time and [5] for discrete-time systems. This work focuses on a design of nonlinear observers for discrete-time systems (transmitters) of the form
xk+l
=
f(xk),
xk e ~
Yk
=
h(xk),
Yk C J~
(1.1)
where the o u t p u t y can be interpreted as the transmitted signal and f : ~ n __, ~ n , h : ~ --* ~ a r e smooth functions with f(O) = 0 and
492
2. Nonlinear Discrete-Time Observers for Synchronization Problems
h(0) = 0. For reasons of readability the step variable k is written as an index, i.e. xk = x(k). A design procedure for receiver systems (observers) of the form
Xk+l
---~
f(:?k,Yk,Yk-:, ...,Yk-n+l),
Xk E IR '~
(1.2)
is presented. System (1.2) exploits the transmitted signal Yk and n - 1 buffered past measurements Yk-:, ...,Yk-n+: in order to compute an estimate for the states of system (1.1) such that
lim tIik - xk][ = 0
(1.3)
k--*oo
and the convergence rate can be determined by an appropriate choice of the observer parameters. An observer design via nonlinear observer form with linearizable error dynamics allows to choose the eigenvalues of the error dynamics in observer form as known from linear observers. The discrete-time nonlinear observer form is given by
zk+l = A z k + f ~ ( y k ) ,
yk=h~(z~,k),
(1.4)
where z G ~ n , zn,k is the n-th component of zk, f~ : ~ ---* ~'~, h~ : /R --*/R is the invertible output function and
A
0
---
0
0 /
1
-.-
0
0
.
o..
"
.
0
..-
1
0
This observer form has been obtained by Lee and Nam in [8] but also by Chung and Grizzle in [3]. The main drawbacks of their approaches are that the dynamics f in (1.1) has to be a diffeomorphism and that a system of partial differential equations has to be solved in order to obtain the coordinate transformation into canonical form. In addition, the output function hz has to be linear (Yk = zn,k). An extension to a nonlinear output function hz and some simplification for calculating the transformation is given by Ingenbleek in [6] and [7]. Brodmann in [2] and Lin and Byrnes in [10] presented a transformation of system (1.1) into nonlinear observer form (1.4) via the so called nonlinear observability form. In a first step, the system is transformed into observability form. The calculation of the transformation map is straightforward. A crucial point is computing the inverse map in order to obtain the system
2. Nonlinear Discrete-Time Observers for Synchronization Problems
493
representation in observability form. However, this inversion problem also appears when system (1.1) is directly transformed into observer form as presented in [8, 3, 7]. In the second step, the system is transformed from observability form into observer form, which is only possible if the Hessian matrix of the nonlinear function showing up in observability form is diagonal. Since the transformation map between these two forms is always invertible, every system in observer form also has a representation in observability form. This implies that an existing representation in observer form can always be found from observability form. In addition, there are no restrictions for the system function f and the calculation of the transformation does not require solving a set of partial differential equations. However, the necessity that the Hessian matrix of a nonlinear function is diagonal is very restrictive. In this chapter, an extended observer form depending on past measurements of the system output Yk is considered (see also [9] for systems with input). This considerably enlarges the class of systems for which a nonlinear observer with linearizable error dynamics can be designed. Once, a system is given in observability form, the extended observer form is also available. Furthermore, the transformation into this extended observer form offers some degrees of freedom which allow to design several observers with different characteristics. Necessary and sufficient conditions are derived for (1.1) to be equivalent to a system in extended observer.form (EOF)
Zk+ 1
0
-..
0
0 /
1
.-.
0
0
. . . . 0
Yk
.,.
zk § 1
0
fz,o(Yk) fz,l(Yk-l,Yk) '
(1.5)
fz,,~-1 (Yk-n+l, ..., Yk)
hz(Zn,k),
with z E 1Rn, fz,, ://~tt+l---+ J~, and h~ : ~ - - * ~ is the invertible o u t p u t function of the system in EOF, i.e. zn,k can be calculated by z,~,k = h~ 1(Yk). This canonical form is obtained by state and output transformation and results in a linear structure with additive nonlinearities just depending on the actual and n - 1 buffered past output measurements. Therefore, an observer design based on (1.5) with linear observer error dynamics is possible. The rest of this chapter is organized as follows. In the next section, necessary and sufficient conditions are given for system (1.1) being equivalent to a system representation in extended observer form. Section 3 focuses on the observer design via observer form and includes a discussion of some degrees of freedom in the coordinate transformation. Three alternative observers via E O F with different structure and characteristics are presented in Section 4. Finally, the observer design is applied to a synchronization problem
494
2. Nonlinear Discrete-Time Observers for Synchronization Problems
in the field of communication in discrete-time and to a synchronization problem of the continuous-time R6ssler system.
2
S t a t e E q u i v a l e n c e to a S y s t e m in E x t e n d e d Observer Form
After a short review of results concerning the observability of system (1.1) and its equivalence to a representation in observer form as presented in [2] and [10], the necessary and sufficient condition for system (1.1) being equivalent to a system in E O F is presented in this section. Theorem
2.1 ([12, 2]) I f the so-called observability map
hof~ )
h o f l (x) ~(x)
=
.
(2.6)
ho ff~-l(x) has .full rank at x = O, then the system (1.1) is strongly locally observable atx =0. Since the observability m a p r of a strongly locally observable system (1.1) has full rank at x = 0, the state transformation s = r with s k = (Yk, yk+l, ..., Yk+n-1)T, leads to a new local representation of system (1.1) around x = 0, the so called nonlinear observability form. D e f i n i t i o n 2.1 ( N o n l i n e a r o b s e r v a b i l i t y f o r m ([12, 2])) The local representation in nonlinear observability form of a strongly locally observable system (1.1) at x = 0 has the f o r m
8k+ 1
, Yk+n--1
8n,k
Yk+n
f~(Sk)
with
sk -- r
and
fs(sk) = h o f ~ o r
Yk ---- Sl,k~
(2.7)
r xk = r
I f r is a global diffeomorphism the state transformation s = r to a global system representation in observability form.
leads
2. Nonlinear Discrete-Time Observers for Synchronization Problems
495
As shown in [2] and [10], system (1.1) is equivalent to a system in nonlinear observer form (1.4) if and only if the observability form (2.7) exists and there exists analytic functions f~,~, : / R ~ ~ , # = 0, 1, . . . , n - 1, and h~ : N - - * ~ such t h a t the Hessian matrix of hzl(f~(sk)) is diagonal, i.e. n-1
hzl(fs(Sk)) = E
fz,,(S,+l,k).
(2.8)
tt=0
Then, the observer form (1.4) contains the functions fz,~ as the n components of f~ and hz as the o u t p u t function. Equation (2.8) represents a very restrictive condition for a system being equivalent to a representation in nonlinear observer form. As presented in the following theorem, the conditions for a system (1.1) being equivalent to a representation in extended observer form (1.5) are less restrictive and the class of systems for which an observer with linearizable error dynamics can be designed, is considerably enlarged. 2.2 A discrete-time system (1.1) with single output is state equivalent to a system in extended observer form (1.5) via state and output transformation i.f and only i.f the observability map (2.6) is a global diffeomorphism, i.e. a global system representation in observability .form (2.7) exists.
Theorem
Proof. The transformation from observability form (2.7) into E O F (1.5) can be found by consideringr the structure of the E O F and taking into account sk = (Yk, .. , y k + ~ - l ) 9 Then, z,~ k = h-/~(Yk) where yk is replaced by sl,k. From the last component of the system function in E O F , it follows z,~-l,k = Z n , k + l -- f z , n - 1 (Yk-,~+l, ..., Yk) where zn,k+l can be replaced by h~-1 (yk+l) and yk+~, # = 0, 1, by s~+l,k. Continuing in this way, the state variables zi,k, i = 1, 2, ..., n in E O F result from si,k and Yk-n+l, ..., Yk- 1 in the form
Zr~ , k
=
h;l(Sl,k)
Zn--l,k
=
h;l(s2,k)
Zn--2,k
Zl, k
-
f ....
l(Yk-n+l,...,Yk-l,sl,k)
:
Zn--l,k+l
=
h2~(sa,k)
-- fz,n-2(Yk-n+2,
~-
z2,k+l - fz,l(Yk-1, Yk)
...,Yk)
f~,~-l(Yk-n+2,...,Yk-1, sl,k, s2,k) --f~,~-2(Yk-n+2, ..., Yk-1, sl,k) -
(2.9)
h_Sl(sn,k) -- ~t,=l n-1 f~,,(Yk-t, sl,k,..., s,,k). Since h~ is invertible, the inverse transformation exists and can easily be calculated:
496
2. Nonlinear Discrete-Time Observers for Synchronization Problems
81, k S2,k
: =
hz(Zn,k) hz(Zn-l,k + f . . . . l ( Y k - n + l , . . . , Y k - l , S l , k ) )
:
Sn, k
(2.10)
:
hz(z1, k -[- E ~ - ~ fzd~(Yk-l,81, k, ...,8p,k)).
For the rest of the proof it has to be shown how the functions h, and f , . , with p = 0, 1, ..., n - 1 in E O F depend on the function f , in observability form. Equation (2.10) and Yk+i-1 = si,k for i = 1, 2, ..., n lead to the last component of the system representation in observability form
Sn,k+l
z
hz
rt--1 ) Zl,kq-1 -J- ~/~=lfz,,(Yk, Sl,k+l, ...,S/~,k+l)
= hz(fz,o(sl,k)-~E~Ifz,#(81,k,s2,k,...,8tz-l-l,k)) =
h z ( E . =n-1 ofz,.(Sl,k,
8 2 , k , ...,
S~+l'k ) .
Since sn,k+l = f~(sk), it follows
n--1 hzl (fs(sk)) = E fz'tz(S1'k' S2'k' "'" 8#+1'k)' /t=0
(2.11)
which is the counterpart of (2.8). The functions f~,u and hz are showing up in the system representation (1.5) in EOF. In contrast to (2.8), which is very restrictive, (2.11) can always be fulfilled. Setting h ; 1 (fs(sk))) = f~(sk) leads to
n--1 L(81c) = E fz'l~(81'k' 82'k' "'" 81~d-l'k)" ~z=0
(2.12)
Since fs and fz,,~-i depend on the same arguments, it is always possible to fulfil (2.11) by an appropriate choice of fz .... 1. The transformation between observability form (2.7) and extended observer form (1.5) is always invertible and the inverse transformation can easily be calculated. Therefore, each system representation in E O F can be transformed into observability form which implies t h a t the transformation of a system into E O F via observability form includes all existing solutions. [] As a consequence, there are no more conditions for the system representation in observability form (2.7). A remaining drawback is finding the inverse x -- @-1 (s) of the observability map.
2. Nonlinear Discrete-Time Observers for Synchronization Problems
497
Remark
2.1 I f the observability m a p (2.6) is a local d i f f e o m o r p h i s m at x = 0 s y s t e m (1.1) is locally equivalent to a s y s t e m i n extended observer .form (1.5) and only a local observer can be obtained. Equation (2.11), which describes the relation between observability and extended observer form, leads to some freedom for the system representation in E O F because the selection of functions f~,u and h~ which are matching this equation is not unique. The influence of these functions on the observer design is considered in the next section.
3
Observer Design via Extended Observer Form
/o oo/(
)
The dynamical system
~k+l
=
1 .
... ..
0 .
0 .
0
,..
1
0
+(
/)k
=
h~(~n,k),
fz,l(Yk-l,Yk) .
~k +
fz,n-l(Yk-n+l,...,Yk)
)T
qo
k > n-
1,
z,,,k
(3.13) is a nonlinear observer for a system in E O F (1.5). It is easy to check that, for k > n - 1, this observer leads to the linear error dynamics in E O F
ek+l
=
zk+l - zk+l =
0
.-.
0
-qo
1
9 9 9
0
-ql
.
-.
.
.
0
...
1
-qn-1
) ek.
(3.14)
The parameters q0, ql, ..., q,~-l, are the coefficients of the characteristic polynomial which allow to assign the eigenvalues of the error dynamics in EOF. Since system and observer equations in E O F are depending on the selection of functions hz and fz,~, it is of interest to know how the observer equations in x-coordinates depend on these functions. As shown in [9], the observer equation (3.13) in E O F transformed back into x-coordinates, in which the system (1.1) is described, does not depend on the selection of functions fz,0, f ~ , l , . . . , f~,n-1. As a consequence, the selection process of fz,~ for a given hz is not suitable to affect the behavior of the observer in x-coordinates and the simple choice
498
2. N o n l i n e a r D i s c r e t e - T i m e O b s e r v e r s for S y n c h r o n i z a t i o n P r o b l e m s
fz,t,(sl,k, ..., s~+l,k)
=
0
f z , n - l ( 8 1 , k , ..., 8n,k)
:
hz 1 (fs(sk))
for
# = 0, 1, ..., n -- 2
(3.15) (3.16)
does not require any computational effort and leads to the same observer equations in x-coordinates like other selections. However, it is often more suitable to calculate the observer equations in EOF and afterwards transform the observer state zk back into x-coordinates because this leads in general to shorter computation times. Since in this case the calculation of zk requires all past measurements appearing in the EOF the selection of functions fz,~ has an influence on the number of past measurements having to be buffered for the online calculation of the observer. In [2], the nonlinear output function hz in observer form could help in some cases to fulfill (2.8). Here, the nonlinear output function hz in extended observer form can therefore help in some cases to reduce the number of necessary past measurements. In addition, h~ considerably affects the transformation between original form and EOF (x- and z-coordinates). The transformation from xcoordinates into EOF is given by substituting si,k by h(fi-1(xk)) in the components of the transformation map (2.9) from observability form into EOF. This leads to expressions of the form h-~1(h(.)) which in turn are added to the functions fz,~. The structure of f~,~ follows from (2.11) which left hand side is of the form h~1(fs(.)). Substituting fs by h(fn(xk)) also results in an expression of the form hzl(h(.)). Therefore, the selection of hz I or hz respectively is rather important for the structure and the nonlinearities of the transformation from x-coordinates into EOF. Especially, a choice of h~ being of the same structure as h is mostly beneficial because in this case, cancellations of nonlinear expressions may occur in hz~(h(.)) leading to a simpler transformation rule. In what follows, this general requirement is explained in more detail for a special form of the system output function h. If h depends on a linear combination of state variables, i.e. h(xk) h(vTxk) with v T being a constant n-dimensional row vector, the transformation into EOF leads to hz(zmk ) = [t(vTxk). It is advisable to choose hz(zn,k) = ~t(zn,k) which leads to a linear component zn,k = vTxk of the transformation from x-coordinates into EOF. Nonlinearities resulting from h and hz in the other n- 1 components are also cancelled which can be seen by inserting (2.6) in (2.9). This ensures, that the dynamic of the observer error in x-coordinates does not considerably differ from the one assigned in EOF.
2. N o n l i n e a r D i s c r e t e - T i m e O b s e r v e r s for S y n c h r o n i z a t i o n P r o b l e m s
4
499
Alternative Observer Structures via EOF
4.1
Observer Equations
T h e results presented in this subsection are based on the choice (3.15) with fz,, = 0 for # = 0, 1, ..., n - 2. This choice represents a very special case because iterating s y s t e m (1.5) in E O F with a r b i t r a r y initial conditions leads to zi,k = 0
for
k > i
and
i = 1,2, ..., n - 1.
(4.17)
Since Z~-l,k = 0 for k _> n - 1, the last c o m p o n e n t of the s y s t e m function in E O F results in zn,k+l = fz,n-l(Yk-n+l, ...,Yk) a n d f r o m hz(z,~,k+l) = Yk+l, it follows
fz,n-l(Yk--n+l, ...,Yk) --~ h z l ( Y k + l ) for k > n -- 1.
(4.18)
Besides observer (3.13) (in the following called o b s e r v e r 1), these results offer a design of three other observers in E O F . Replacing fz,n-l(Yk-~+l,...,Yk) by hz-1 (Yk+l) in (3.13) with f z , , = 0 for # = 0, 1, ..., n - 2 leads to o b s e r v e r 2. Using (4.17), the resulting o b s e r v e r 3 is given by
Zk+l
~
( ~ / 0
1
9A l Z l ' k
+
%n-lzn-l,k
fz,,~-I (Yk--n+l,..., Yk) ~)k = hz(~,~,k), k > n - 1 ,
An(~n,k --
h21(yk))
(4.19) where Ai are the eigenvalues of the observer error d y n a m i c s in E O F for
k>n-1
ek+l = ~k+l - Zk+l --
(
A1
...
0 /
"
"-.
"
0
...
),~
ek.
(4.20)
T h e convergence rate for the i-th c o m p o n e n t of the observer error can be assigned by )~. Finally, replacing f~,n-l(Yk-n+l,...,Yk) in (4.19) by h-~l(yk+l) leads to o b s e r v e r 4 with error d y n a m i c s (4.20) 9 T h i s observer is similar to the one presented by Ciccarella et al. in [4]9 Since the selection of the functions f z , , according to (39 represents a necessary condition for the three observers p r e s e n t e d in this section, the only remaining degree of f r e e d o m in the t r a n s f o r m a t i o n into E O F is the choice of the o u t p u t function hz.
500
2. Nonlinear Discrete-Time Observers for Synchronization Problems
3.2
M a i n C h a r a c t e r i s t i c s o f the O b s e r v e r s
A comparison of the different observers derived in the previous section is done for the special case hz(zn,k) = zn,k because it leads to a straightforward investigation of the observer characteristics. It is only necessary to consider the observer equations and error dynamics in EOF: Observers 2, 3 and 4 are based on the same system equations in E O F because the transformation from x-to z-coordinates is always identical (the degrees of freedom in the transformation from observability form into E O F are chosen following (3.15)). Since observer 1 in x-coordinates does not depend on the choice of the functions fz,a, it is also possible to choose the transformation from observability form into E O F in compliance to (3.15) without affecting the characteristics of this observer. Then, the transformation between x-and z-coordinates is identical for all four observers which implies that differences can only have their cause in EOF. Taking into consideration that h~(zn,k) = z,~,k, the following characteristics for the observers in E O F can be found: 9 Since the observer equations of observers 2 and 4 do not depend on system parameters, these observers are robust to parameter uncertainties. Observers 1 and 3 are depending on parameters. 9 Since the observer equations of observers 2 and 4 depend on Yk+l, the observer state zk+l can not be calculated before step k + 1. Using observers 1 or 3, the estimation zk+l can already be calculated within step k. 9 Using observers 3 and 4, it is easy to check from error dynamics (4.20) that Hei,k+lll < I]ei,kll, i = 1,2,...,n, if the eigenvalues of the error dynamics are chosen within the unit circle. This does not hold for the error dynamics (3.14) of observers 1 or 2. 9 Assuming the measured output signal is ~]k = Yk + ry,k where ry,k is an additive noise, the resulting error dynamics for observer 4 and k > n - 1 are given by
/ 0 ek+l =
0 r y , k + l -- )~nry,k
+
"
".
"
0
...
)~n
ek.
(4.21)
e n _ 1 = 0 and k > n - 1, ek = ( 0 ... 0 r y , k )T is a solution of this difference equation. This also holds for observer 2. Therefore, using observer 2 or 4, noise is not filtered regardless of the chosen eigenvalues whereas observers 1 and 3 are able to filter noise. This especially holds for observer 1, which, designed for a linear system For
2. Nonlinear Discrete-Time Observers for Synchronization Problems
xk+l = A xk,
Yk ~- c T Xk
501
(4.22)
with A a matrix and c a vector of appropriate dimensions, leads to the well known linear observer
(4.23)
2ck+1 = A 2 k + g ( c T x k - - Y k )
with the gain vector g. Choosing all eigenvalues of the observer error near the system eigenvalues (if possible) the gain and therefore the influence of noise at the output measurements on the observer is small. It even vanishes if system and observer eigenvalues are identical. Assuming small noise and system states near the operating point, the results from the linear case can be adopted to nonlinear systems. Therefore, good robustness of observer 1 to noise at the o u t p u t measurements designed for a nonlinear systems is achieved by eigenvalues in E O F near to those of the linearized system. To summarize, table 2.1 shows the main characteristics of the observers in E O F with hz(zn,k) ---- z,~,k. Supposing a smooth t r a n s f o r m a t i o n m a p between E O F and x-coordinates, it is probable t h a t the characteristics of the observers are also found in x-coordinates. TABLE 2.1. Characteristics of the observers in EOF (hz(z,~,k) = z,~,k). Observer Characteristic Filtering of measurement noise Robustness to model uncertainties Transient behavior C o m p u t a t i o n for step k + 1 at step
5
1
2
_
k
3
4
+
-
+
_
+
k+l
k
+
k+l
An Example in the Field of Communication
As an example for reconstructing a desired information from the transmitted signal, the presented observer design is applied to the second order system
xk+l Yk
= ~
(1 - e)#Xl,k(1 - Xl,k) + ~X2,k "~ (1 e)#x2,k(1 x2,k) +wkx2,k J ' Xl,k
(5.24)
502
2. Nonlinear Discrete-Time Observers for Synchronization Problems
where # a n d e are constants, Yk = Xl,k is the t r a n s m i t t e d signal a n d wk contains the desired information. T h e signal wk is a discrete-time signal with 0.06 _< wk _< 0.12 and a step w i d t h of 5, i.e. w takes a new value for k = 0, 5, 10, 15, ... a n d remains c o n s t a n t for other values of k. For w = e, s y s t e m (5.24) is identical to the one presented in [1]. A n observer design via E O F for w = e was already considered in [5]. T h e observer design via E O F for s y s t e m (5.24) is based on the t h i r d order model
x2,k+l
=
(1 - E)#x2,k(1 -- x2,k) + W k X l , k
Wk+l
,
(5.25)
Wk Yk
=
Xl,k.
T h e signal w is assumed to be constant. Since this only holds for t h e d u r a t i o n of five steps, this a s s u m p t i o n requires an observer t h a t converges fast enough. For xl,k > 0, the representation of s y s t e m (5.25) in observability form exists and an observer design via E O F is possible. T h e observer equations are o m i t t e d for reasons of space. 1
I
I
I
60
80
0.9 "~
0.8
~
0.7
.~
0.6
L
=~ 0.5 0.4 0.3 0
20
40
100
F I G U R E 1. Transmitted signal yk ----xl,k Figure 1 shows the t r a n s m i t t e d signal yk --- xl,k for initial conditions xl,0 = 0.4, x2,0 --- 0.2 and w k as presented in Figure 2, which also shows the r e c o n s t r u c t e d signal w~,k using observers 1, 3 and 4 via E O F w i t h 21,0 = 0.5, 52,0 = 0.5 and w~,0 ---- 0. T h e eigenvalues of the observer error d y n a m i c s in E O F were chosen to A1 = A2 = ,k3 -- 0 . 1 . T h e reconstruction of w k using observer 1 has a delay of 3 steps a n d reaches satisfactory a c c u r a c y just before the observer starts to converge to the next value of wk (A d e a d - b e a t design with "~1 : )~2 : "~3 : 0 considerably improves the behaviour). Since observer 3 shows b e t t e r transient behaviour t h a n observer 1 (see Section 4.2), the r e c o n s t r u c t i o n of wk is quite
2. Nonlinear D i s c r e t e - T i m e Observers for Synchronization P r o b l e m s 0.121-
o. 9I
0.08 I- I 0.07 F 0.06 i 0
0.11 0.1 0.09 0.08 0.07 0.06
0
'
'
'
'
'
503
' ~
~1
i 20
i 40
i 60
80
1O0
' 20
' 40
' 60
' 80
1O0
I 60
, 80
100
' 60
' 80
100
I
k t.,r
0.11
0
0.08 0.07 0.06 0
I 20
I 40 k
"~" 0.11 .~ 0.08 0.07 0.06
0
20
40 k
F I G U R E 2. Information signal wk and the e s t i m a t e d signal w~,k using observers 1, 3 and 4 with A1 = )~2 = A3 = 0.1
g o o d a l t h o u g h it also h a s a d e l a y of 3 s t e p s . B e s t r e s u l t s c a n b e a c h i e v e d u s i n g o b s e r v e r 4. F o r k > 6 t h e r e c o n s t r u c t i o n of wk is e x a c t e x c e p t for a d e l a y of 2 s t e p s , i.e. we,k = w k - 2 . T h i s o b s e r v e r e x p l o i t s t h e a c t u a l m e a s u r e m e n t Yk for e s t i m a t i n g wk w h i c h o b v i o u s l y l e a d s t o t h i s g o o d t r a n s i e n t b e h a v i o u r w h e n wk t a k e s a n e w v a l u e .
6
Observer Design for the R6ssler S y s t e m
T h i s s e c t i o n f o c u s e s o n t h e d e s i g n a n d r e a l i s a t i o n of a n o b s e r v e r for t h e continuous-time R6ssler system which has the form
504
2. Nonlinear Discrete-Time Observers for Synchronization Problems
5:(t)
--x2(t) -- x3(t) x l ( t ) -t- ax2(t)
=-
)
,
y(t) = x3(t)
(6.26)
c + x 3 ( t ) ( x 1 ( t ) -- b) with the coefficients a, b, c > 0. As shown in [13] the initial condition x3(0) > 0 leads to y(t) = x3(t) > 0 for all t _> 0 a n d the observer p r o b l e m is well posed. 15
i
i
i
i
,
20
25
30
20
25
30
i
i
5 0
......................................................................
-5 -10 -15
0
5
10
15 t
10 t",l
0 -5 -10 -15
.
0
5
.
.
10
.
.
15 t
.-.
25 20 15 10 5 0 -5
i
0
1
,
I
I
I
I
I
5
10
15
20
25
t
/
d 30
FIGURE 3. State variables of the R~ssler system Figure 3 shows the trajectories of the s y s t e m states for initial conditions x(0) = ( 5 5 5 ) T and coefficients a = c ----0.2 a n d b = 7.5. T h e r e exists two approaches for the realisation of an observer for a continuous-time system: 1. T h e observer is designed in continuous-time. T h e c o m p u t a t i o n of this observer in a real-time p r o g r a m with c o n s t a n t sampling time requires the discretization of the observer equations. Since the o u t p u t signal y between the samplings is u n k n o w n between two samplings, t h e discretization can not be exact even in the linear case. In addition, t h e sampling time has to be chosen w i t h respect to the s y s t e m s a n d t h e observer dynamics.
2. Nonlinear Discrete-Time Observers for Synchronization Problems
505
2. The system can be, at least approximately, discretized which allows an observer design for the discrete-time model. The obtained observer can directly be implemented in a real-time program. Whereas a linear system can always be exactly discretized, the discretization of a nonlinear system is in general an approximation. In what follows, a discretized continuous-time observer and the discretetime observer 4 via E O F are designed for the R0ssler system and implemented in a real-time program which reads all T -- ~ s s the o u t p u t y of the simulated R0ssler system. The two observers are compared for initial conditions ~(0) = ( 2 2 2 )T and different error dynamics.
6.1
Observer
Design in Continuous-
Time
The transformation
z2(t)
=
z3(t)
x2(t)
(6.27)
In(x3(t))
leads to a system representation in new coordinates (cf. [13])
~(t) y(t)
=
=
0 1 1 e z:~(t)
lO) (ez) a
0
0
0
z(t) +
0
,
ce -z:~(t) - b
(6.28)
which allows to design an observer with linearizable error dynamics. T h e resulting observer has the form
Q
~(t)
=
--:~2(t) -- y ( t ) ~- Ol(/~t(:~3(t)) -- l r t ( y ( t ) ) ) 5:l(t) + a~:2(t) + 0 2 ( l n ( 5 ~ 3 ( t ) ) -- l n ( y ( t ) ) ) :~3(t) (:~1 (t) ~- y--~t) -- b ~- 03 (/n(:~3(t)) - / n ( y ( t ) ) ) )
) (6.29)
with
03 = --(a ~- q2),
O1 = 1 Jr- oaa -- ql,
02 = qo + 03 -- o l a
(6.30)
and q0, ql, q2 the coefficients of the desired characteristic polynomial of the observer error dynamics. For the realisation in a real-time program, the differential equations are numerically c o m p u t e d using the R u n g e - K u t t a algorithm.
506
6.2
2.
N o n l i n e a r D i s c r e t e - T i m e O b s e r v e r s for S y n c h r o n i z a t i o n P r o b l e m s
Observer Design in D i s c r e t e - T i m e
A continuous-time system of the form
2(t) = f(x(t)),
y(t) = h(x(t))
(6.31)
can be discretized by a Taylor-Series expansion
9 (t + T)
+
T 2 ..
=
z(t) + T
x(t) + . . .
=
x(t) + T f(x(t)) + - T ~ f ( x ( t ) )
T 2 Of x t
+...
(6.32)
with sampling time T. Setting x(t) = x(kT) =: xk, x(t + T) = x((k + 1)T) =: xk+l leads to the discrete-time system representation
xk+l
=
xk + T f(xk) + - ~ - ox~: J~ ~J
yk
=
h(xk)
"'"
(6.33)
In practice it is necessary to neglect higher order terms of the Taylorseries expansion. The higher the order of the first neglected term, the more accurate is the discretization. However, the complexity of the obtained system representation also increases with the number of considered terms. For the R6ssler system, all terms with order higher t h a n one must be neglected. Otherwise, the inverse of the observability m a p is not found. Therefore, the observer design is based on the discrete-time system model
Xk+l
Yk
z
:
Xl,k - T(x2,k + za,k) ) x2,k + T(xl,k + ax2,k) x3,k + T(c + x3,k(xl,k -- b))
(6.34)
X3,k.
For this system, observer 4 via E O F with linearizable error dynamics was designed and compared to the continuous-time observer.
6.3
Observer Errors f o r S l o w E r r o r D y n a m i c s
First the eigenvalues of the error dynamics were chosen to Ai,c = - 1 0 (continuous-time) and Ai,d = e T~'c ~ 0.925 (discrete-time) with i = 1,2, 3. Figure 4 shows the observer errors ei,k = ~i,k - xi,k for observer 4 via E O F . During the peaks of x3 (t), there are considerable observer errors. Especially ]e2,kl reaches values up to 1 whereas the observer errors of the continuoustime observer are always smaller t h a n 0.003 for t > 1.5s which can be seen in Figure 5.
2. Nonlinear Discrete-Time Observers for Synchronization Problems |
9
507
I
0.8 0.6 0.4 0.2
el - e2 --e3 ---
i ~
0
r,
i!
,1
,Ib
-0.4 -0.2 -0.6 -0.8
i
-1
0
,
,-
ij
f J
I
I
5
10
15
20
25
30
t
FIGURE 4. Observer error using discrete-time observer 4, A~,a = e -l~
0.003
el - e2 ...... e3 .......
0.002 0.001 o
?
-0.001 -0.002 -0.003 0
5
10
I
I
I
15
20
25
30
t
F I G U R E 5. Observer error using discretized continuous-time observer, )%c = - 1 0
6.#
Observer Errors for Fast Error Dynamics
W h e r e a s the eigenvalues of a discrete-time observer can be chosen to Ai,d = 0 (dead-beat), the error d y n a m i c s of the continuous-time observer can not be chosen arbitrarily fast for a given sampling time. Otherwise, t h e errors of the numerical c o m p u t a t i o n of the differential equations c o n s i d e r a b l y increase. T h e observer errors using observer 4 with d e a d - b e a t design are s h o w n in Figure 6. For t > 2, t h e y are nearly identical to the case w i t h slow d y n a m i c s whereas the continuous-time observer with Ai,c = - 1 2 0 leads to higher observer errors (see Figure 7). In addition, the c o r r e s p o n d i n g discrete-time eigenvalue is e -12~ = 0.391 a n d therefore the convergence rate is smaller c o m p a r e d to a d e a d - b e a t design.
508
2. Nonlinear Discrete-Time Observers for Synchronization Problems
0.8
1t
0.6 t-r 9
113
'
,~
O.4 0.2 t ~ 0 -O.2 -0.4
i
' ~t
i!
'
Ii
'
i'i i
' !
~
-0.6
'II~!
-0.8
V 0
el - e2 e3 ...... ....... '
5
10
',e
15
20
t
25
FIGURE 6. Observer error using discrete-time observer 4 with
30
Ai,d = 0
1 0 er
-1 -2 -3
i!
-4
ii
el - -
ii
ii
e2 ......
ii
-5 -6
0
I
I
I
I
I
5
10
15
20
25
t
30
FIGURE 7. Observer error using discretized continuous-time observer with Ai,c : -120
6.5
Concluding Remarks
T h e simulations have shown that the continuous-time observer leads to better results if the error dynamics are 'slow'. However, since the sampling time has to be chosen with respect to the observer dynamics, the eigenvalues of the observer dynamics must lie within a certain range or the sampling time has to be decreased. A discrete-time observer even allows a dead-beat design because the sampling time has to be chosen only with respect to the system dynamics. This point is very i m p o r t a n t if high convergence rates are desired. However, a crucial point is finding a discrete-time representation of the system with satisfying accuracy. In the considered case, only a coarse discretization was possible. ghrther simulations have shown t h a t the discrete-time observer for this
2. Nonlinear Discrete-Time Observers for Synchronization Problems
509
example is much more sensitive to noise but less sensitive to parameter uncertainties.
7
Discussion and Conclusions
In this chapter a discrete-time observer design via nonlinear observability form and extended nonlinear observer form using additional past output values is presented. Like the design procedure presented in [2] and [10], it does not require a diffeomorphic system function and the transformation can be calculated without solving a system of partial differential equations. In addition, the main problem of [2] and [10], the restrictive condition that the Hessian matrix of the nonlinear function appearing in observability form has to be diagonal, does not occur. Every strongly locally observable system is state equivalent to a system in EOF. The observer design was applied to a problem in the field of communication in discrete-time and to a synchronization problem in the continuoustime context. For the latter, the performance of the discrete-time observer depends on the accuracy of the system discretization. 8
REFERENCES [1] P. Badola, S. S. Tambe, and B. D. Kulkarni. Driving systems with chaotic signals. Physical Review A, 46(10):6735-6737, 1992. [2] M. Brodmann. Beobachterentwurf.fiir nichtlineare zeitdiskrete Systeme. Number 416 of line 8 in VDI-Fortschrittberichte. VDI-Verlag, Dt~sseldorf, 1994. Dissertation, Universit~it Hannover. [3] S.-T. Chung and J. W. Grizzle. Sampled-data observer error linearization. Automatica, 26(6):997-1007, 1990. [4] G. Ciccarella, M. Dalla Morn, and A. Germani. A robust observer for discrete time nonlinear systems. Systems ~ Control Letters, 24:291300, 1995. [5] H. Huijberts, T. Lilge, and H. Nijmeijer. A control perspective on synchronization and the takens-aeyels-sauer recontruction theorem. Accepted for publication in Physical Review E, 1998. [6] R. Ingenbleek. Beobachtbarkeit und Beobachterentwur.f fiir zeitdiskrete nichtlineare Systeme. Number 03/93 in Forschungsberichte. Universit,it Duisburg, Duisburg, 1993. [7] R. Ingenbleek. Zustandsbeobachter fiir zeitdiskrete nichtlineare Systeme - Geometrische Analyse und Synthese. Number 527 of line 8 in
510
2. Nonlinear Discrete-Time Observers for Synchronization Problems VDI-Fortschrittberichte. VDI-Verlag, Dttsseldorf, 1996. Dissertation, Universit~tt Duisburg.
[8] W. Lee and K. Nam. Observer design for autonomous discrete-time nonlinear systems. Systems ~ Control Letters, 17:49-58, 1991. [9] T. Lilge. On observer design for nonlinear discrete-time systems. European Journal of Control, (4):306-319, 1998. [10] W. Lin and C. I. Byrnes. Remarks on linearization of discrete-time autonomous systems and nonlinear observer design. Systems ~ Control Letters, 25:31-40, 1995. [11] M. Loecher and E. R. Hunt. Control of high-dimensional chaos in systems with symmetry. Physical Review Letters, 79(1):63-66, 1997. [12] H. Nijmeijer. Observability of autonomous discrete time nonlinear systems: A geometric approach. Int. Journal of Control, 36(5):86774, Nov. 1982. [13] H. Nimeijer and I. Mareels. An observers look to synchronization. IEEE Transactions on Circuits and Systems, 44(10):882-90, 1997. [14] E. Ott, T. Sauer, and J. A. Yorke. Coping with Chaos. John Wiley & Sons, Inc., New York, 1994. [15] L. M. Pecora and T. L. Carroll. Synchronization in chaotic systems. Physical Review Letters, 64(8):821-824, 1990. [16] K. Pyragas. Generalized synchronization of chaos in directionally coupled chaotic systems. Physical Review E, 51(2):980 994, 1995. [17] K. Pyragas. Weak and strong synchronization of chaos. Physical Review E, 54(5):R4508-R4511, 1996. [18] T. Stojanovski, U. Parlitz, L. K., and R. Harris. Exploiting delay reconstruction for chaos synchronization. Physics Letters A, 233:355360, 1997. [19] Y. Zhang, M. Dai, W. Hua, Y. Ni, and G. Du. Digital communication by active-passive-decomposition synchronization in hyperchaotic systems. Physical Review E, 58(3):3022-3027, 1998.
Chaos Synchronization Ulrich Parlitz 1, Lutz Junge I and Ljupco Kocarev 2 1Drittes Physikalisches Institut, Universit~tt G6ttingen, Bttrgerstr. 42-44, D-37073 G6ttingen, Germany 2Department of Electrical Engineering, St Cyril and Methodius University, Skopje, P O Box 574, Macedonia
1
Introduction
Synchronization is a phenomenon of interest in many scientific areas ranging from celestial mechanics to laser physics, from electronics to communications, and from biophysics to neuroscience [1]. In particular, synchronization of chaotic dynamics [7, 4, 24] has attracted much attention during the last years because of its role in understanding the basic features of man-made and natural systems. Thus, for example, optical communication using chaotic waveforms demonstrated experimentally [8, 37, 38] and theoretically [3], is possible because of chaos synchronization between transmitter and receiver. On the other hand, the evidence of chaotic behavior in the brain [34] and the importance of synchronization in perceptive processes of mammals [33] indicate a possible role of chaos synchronization in neural ensembles [32] as well. The phenomenon of synchronization also occurs for uni-directionally coupled systems and in this case the driven system (or response system) may be viewed as a nonlinear observer of the driving system. Or, conversely, nonlinear observer theory may be used to construct pairs of uni-directionally coupled synchronizing systems. Such pairs may then be used for system and parameter estimation or for potential applications in communication systems, see [1] and [31]. In all these cases, the two coupled systems are (almost) identical and therefore identical synchronization occurs which means that the difference of drive and response state vectors converges to zero for t --~ oc. If two different systems are coupled, more sophisticated types of synchronization [5] may occur like generalized synchronization or phase synchronization that will be discussed in Sections 3 and 4. We shall begin with a presentation of synchronization phenomena of spatially extended systems that are given in terms of (chaotic) partial differential equations. For more information about chaos synchronization and related problems the reader is refered to the article collections in [31].
512
3. Chaos Synchronization
2 Synchronization of Spatially Extended Systems In the following we shall discuss identical synchronization of uni-directionally coupled spatially extended systems t h a t are described by a partial differential equation in the form
Ou F ( u , OU 02u O---t = cgx' Ox 2 ' ' ' " )'
x 9 [o,n]
(2.1)
with spatial length L. In this chapter we consider only one dimensional P D E ' s . Generalizations for higher dimensional systems are straightforward. For this class of systems we define: Two spatially extended systems are called synchronized, if their states u(x, t) and v ( x , t ) converge to each other in the whole spatial domain, i.e. if Vx E [0, L] : l i m t ~ I[u(x,t) - v(x,t)[[ = 0 As in the case of low dimensional systems there exists an invariant m a n ifold u = v (also called synchronization manifold), whose stability properties determine the occurence of stable synchronization. If the transverse system w • = ~1 ( u - v) has an asymptotically stable fixed point at the origin, then this manifold is asymptotically stable and synchronization occurs. Indeed, all known techniques (see [1], [31]) for verifying synchronization such as necessary criteria like negative conditional Lyapunov exponents or sufficient criteria like Lyapunov functions and stability of unstable periodic orbits, can be generalized and can in principle be applied to spatially extended systems, too. On the other hand, the generalization of the coupling techniques used for low dimensional systems is not so straightforward. A coupling along the whole spatial axis is possible for numerical simulations but m a y turn out to be impractical or even impossible for experiments. A similar argument holds for local pinning coupling schemes that are used for synchronizing coupled m a p lattices (CMLs)[19, 15, 16, 11, 9]. These schemes use coupling in points which is not only practically impossible but also in some sense useless for PDE's. An alternative is the sensor coupling scheme, introduced in [20], which generalizes the pinning schemes to systems with continuous space variables. The idea is, t h a t typical experimental measurement devices have a finite resolution l and measure local spatial averages of the desired quantity. T h e left plot of Figure 1 shows the concept of the sensor coupling scheme and the right plot illustrates the notion of a measured sensor signal. According to [20] we want to call these elements sensors. Each sensor measures a scalar time-series of the form
3. Chaos Synchronization --1.-
,
d
513 ,
L
111
I12
V
U3
space
FIGURE 1. Principle of the sensor coupling scheme. Left: sketch, right: visualization of three sensor time series measured from spatio-temporal chaos.
~.(t)
=
1 fnd+l/2 u ( x , t ) d x ,
7 Jn~-u2
n=l,...,N
(2.2)
which is averaged over a width 1. Because of the exponential decrease of spatial correlations in spatially extended chaotic systems, we need several but a finite number N of coupling signals that contain all the necessary information to reconstruct the whole state in the synchronization process. Therefore the N sensors are distributed with equal distance d = L (for periodic b o u n d a r y conditions) along the spatial axis. Numerical investigations have shown t h a t the equidistant arrangement is nearly optimal for systems with extensive chaos [13]. Now we have to choose a coupling scheme t h a t will be applied locally using the sensor signals as driving forces. To do this we measure in the driven syst e m N sensor signals at the same positions and apply a diffusive coupling t e r m with coupling strength c f ( u , v) = { e(~'~ 0- V~)
:: elsend - I/2 < x < nd + I/2
(2.3)
at each sensor position. As an example we shall examine now the one dimensional complex Ginzburg-Landau model 0U
0--/= #u - (1 - ia)lul2u + (1 + i/3)Au,
u 9 [0, L]
(2.4)
with periodic boundary conditions. This equation possesses uniform travelling wave solutions. For 1 - a/3 < 0 they become unstable and different
514
3. Chaos Synchronization
I!
1.04
1.03
400
400
1.04
1.03
1.02
1.02
300
300 1.01
200
1.01
2OO 1.00
100 0
0.99
~il
1.00
:iii
100
0.99
I
o
20 4 0 6 0 8 0 1 0 0
0
20
40
60
80
100
v
F I G U R E 2. Synchronization of two Ginzburg-Landau equations in the phase turbulent regime (Left: drive, right: response) using N = 15 sensors with width I = 3 and coupling strength e = 0.2. The amplitudes of drive and response P D E are grey scaled. t y p e s of t u r b u l e n c e occur. I n t h e following we will c o n s i d e r two p a r a m e ter sets, # -- 1.0, a -- 2.0,/3 = 0.7 c o r r e s p o n d i n g to p h a s e t u r b u l e n c e a n d tt = 1.0, a = 2.0,/3 = 1.2 which yields defect t u r b u l e n c e . In b o t h cases extensire chaos is o b s e r v e d a n d tile L y a p u n o v d i m e n s i o n DL of t h e u n d e r l y i n g a t t r a c t o r increases w i t h t h e s y s t e m size L like DL ~ 0.102L for p h a s e t u r b u l e n c e a n d w i t h DL ~ 0.332L for defect t u r b u l e n c e . In o r d e r to achieve s y n c h r o n i z a t i o n we d r i v e an i d e n t i c a l c o p y of (2.4) using N sensor signals (2.3) t h a t are a p p l i e d in i n t e r v a l s of w i d t h 1.
Ov
0--t = # v - (1 - ia)lvl2v + (1 + il3)Av + f ( u , v)
(2.5)
N o t e t h a t this is a local c o n t r o l t e c h n i q u e a n d t h e d r i v e n s y s t e m (2.5) evolves freely b e t w e e n t h e sensor locations. F i g u r e 2 shows t h e s y n c h r o n i z a t i o n of d r i v e (left) a n d r e s p o n s e (right) in t h e p h a s e t u r b u l e n t r e g i m e . F o r this e x a m p l e we u s e d N -- 15 e q u a l l y s p a c e d sensors w i t h w i d t h l = 3 to s y n c h r o n i z e two G i n z b u r g - L a n d a u e q u a t i o n s w i t h l e n g t h L = 100. A t t = 170 t h e c o u p l i n g is s w i t c h e d on a n d t h e r e s p o n s e s y s t e m q u i c k l y converges to t h e s y n c h r o n i z e d s t a t e . I n t h e b e g i n n i n g of t h e c o u p l i n g t h e p e r t u r b a t i o n i n t r o d u c e d t h r o u g h t h e sensors signals i n d u c e s a p e r i o d i c p a t t e r n , which d e c a y s very fast d u e t o t h e s y n c h r o n i z a t i o n . S i m i l a r results have b e e n o b t a i n e d for defect t u r b u l e n c e . If t h e s e n s o r c o u p l i n g is a p p l i e d o n l y l o c a l l y one m a y o b s e r v e local synchronization as it is shown in Fig. 3. R e p l a c i n g t h e d r i v i n g sensor signals b y v a n i s h i n g sig-
3. Chaos Synchronization
0
20
40
60
80
100
x
0
20
40
60
80
515
100
x
FIGURE 3. Local synchronization of defect turbulence of the Ginzburg-Landau equations. The left figure shows the dynamics of the response system and the right figure the synchronization error between drive and response that vanishes (dark areas) in those intervals where sensors are placed (N = 2*6, l = 2, e = 2.0). rials one m a y also suppress (locally) the chaotic oscillations of the response system and stabilize the homogeneous state. Furthermore, the sensor coupling has also been succesfully applied to a pair of Kuramoto-Sivashinski equations and provides nonlinear observers that can be used for estimating p a r a m e t e r s of P D E s from time series [13].
3
Generalized Synchronization
If a pair of very similar or even identical systems is coupled one m a y observe identical synchronization (IS) where the difference of the state vectors of b o t h systems converges to zero, even in the case of chaotic dynamics. This kind of synchronization, however, cannot be expected for coupled systems that are of completely different origin (e.g., an electrical circuit coupled to a mechanical system). W h a t does "synchronization" mean in such a more general case? Periodic systems are usually called synchronized if either their phases or frequencies are locked. For chaotic systems, however, the notions of "frequency" or "phase" are in general not well defined and can thus not be used for characterizing synchronization (except for some class of chaotic systems where a phase variable can be introduced to quantify chaotic phase synchronization t h a t will be discussed in the next section). In this Section we present different notions of generalized synchronization (GS) t h a t have been proposed during the last years [4, 29, 2, 17, 26, 6, 12, 35, 21]. Basically two types of generalized synchronization of uni-directionally coupled systems have been investigated in the literature so far. In its strongest form GS leads to the existence of a function t h a t m a p s (asymptotically for t --* oc) states of the drive system to states of the response
516
3. Chaos Synchronization
system. In this case the chaotic dynamics of the response system can be predicted from the drive system. W h e t h e r such a function exists and whether it is continuous or even smooth depends on the features of the drive and response system [6, 12, 35]. As an example consider a m-dimensional (chaotic) dynamical system x n+l = f ( x n)
(3.6)
t h a t drives the following one-dimensional system
yn+l = byn + cos(27rx~)
(3.7)
with b < 1. Is is easy to see that 0<3
y,~
1~
bi cos(2~x~_i) '
(3.8)
i=1
For invertible drive dynamics x n1- - i can be replace by f l ~ ( X n) and since the cosine function is bounded and continuous, the state of the response system yn is a continuous function of the state of the drive system: oo
=
g(x
=
1E
bi c~
(3.9)
i=1
In order to study the smoothness feature of this function we c o m p u t e its first derivative
Og 27r oo l " Oxj (x) = - - - ~ E b i / ) x . (x) sin(27rf~-i(x)). i=1
(3.10)
3
For g being differentiable this sum has to converge, or in other words the coefficients ci = bi ~Oxj (x) have to be less than 1 for large i. This is the case when the contraction of the response system is stronger t h a n t h a t of the drive, or in terms of Lyapunov exponents, if In b < Am where Am denotes the smallest Lyapunov exponent of the drive. To illustrate this point we shall use as drive system Arnold's c a t - m a p
x~ -kl
=
(x~-~-x~)
x~ +1
---- (x~ + 2 x ~ )
rood 1 rood 1
(3.11) (3.12)
which is an ergodic m a p on the torus with a constant Jacobian matrix. T h e eigenvalues of this matrix are #1 -- (3 + x/5)/2 and #2 = (3 - vf5)/2. T h e Jacobian matrix D f - i ( x ) of the iterated inverse m a p is given by A - i = U. d i a g ( # l i, # ~ i ) . U t~ where U is an orthogonal 2• matrix. Since #1 > 1 > #2 the coefficients ci can for large i by approximated by ci ~ c o n s t . . (b/#2) ~. Therefore, the first derivative of g diverges for b > #2 (or in t e r m s of
3. Chaos Synchronization (a)
517
i O.B
0.8
0.6
0.6 89 0.4-
x2
0.4
0.2 0
0 0
0,2
0.4
0.6 x 1
0.8
1
0
0.2
0.4
0.g
0_8
1
x I
FIGURE 4. Grey scaled plot of the response state y vs. the (xl, x2) coordinates of the driving cat map. (a) b = 0.01 (b) b = 0.4. Both figures have been computed by iterating the dynamical systems (3.7), (3.11) and (3.12) and transients have been discarded.
Lyapunov exponents In(b) > ln(/~2) = A2). In this case the function g is essentially a Weierstrass function. Figure 4 shows how the function g looses its smoothness when b is increased from 0.01 = b *2 = (3-v~)/2 = 0.382 to 0.4 = b > #2. Typically a function exists if the response system is asymptotically stable when driven by the coupling signal and no subharmonie entrainment occurs [21]. If, for example, a periodic orbit of the drive entrains a stable periodic orbit of the response with twice the period (i.e. T o : TR = 1 : 2) then any point on the attractor of the drive is m a p p e d to two points on the response orbit and in this case there exists a relation but not a function. This multivaluedness always occurs for subharTnonic periodic entrainment with TD < TR. Note t h a t identical synchronization implies GS in any diffeomorphic equivalent coordinate system. On the other hand, if GS is observed between two dynamical systems with a diffeomorphic function this diffeomorphism can be used to perform a change of the response coordinate system such t h a t in the new coordinate system the response system synchronizes identically with the drive system. To find evidence for the existence of a (continuous) function relating states of the drive to states of the response one m a y apply nearest neighbors statistics [29]. This approach for identifying generalized synchronization can be applied to uni- and bi-directionally coupled systems if the original (physical) state spaces of drive and response are accessible. If only (scalar) time series from the drive and the response system can be sampled, t h e n delay embedding [30] m a y be used to investigate neighbourhood relations in the corresponding reconstructed state spaces. In this case, however, only generalized synchronization of un/-directionally coupled systems can be detected by predicting the (reconstructed) state of the response system using
518
3. Chaos Synchronization
a time series from the drive system. A prediction of the evolution of the drive system based on data from the response system is always possible (i.e. with and without generalized synchronization), because (almost) any time series measured at the response system may also be viewed as a time series from the combined systems drive ~z response and may thus be used to reconstruct and predict the dynamics of drive and response. In this sense a time series based test provides no information about GS in the case of bi-directionally coupled systems. In such a case where drive and response are not related by a function, a second, weaker notion of GS may apply that assumes only asymptotic stability of the response system but not the existence of a function mapping states of the drive to states of the response system [2, 21]. This type of GS can be verified using the so-called auxiliary system m e t h o d where two identical copies of the response system are driven by the same driving signal. Identical synchronization of both response systems indicates GS in the weaker sense. Note that using the auxiliary systems approach one may also observe nonidentical (i.e. generalized) synchronization of identical systems that fail to synchronize identically. Current research in the field of generalized synchronization focuses on the question whether the different phenomena and approaches for characterizing (generalized) synchronization can be unified in a mathematically rigorous sense using the notion of non'really hyperbolic invariant manifolds that are smooth and persistent under perturbations of the system(s) [39, 10].
4
Phase Synchronization
Another generalization of the notion of identical synchronization is the phenomenon of phase synchronization (PS) [36, 27, 22, 25, 18]. It can easily be observed when a well defined phase variable can be identiffed in both coupled systems. This can be done heuristically for strange attractors that spiral around some particular point (or "hole") in a twodimensional projection of the attractor, like the R~ssler attractor shown in Fig. 5. In such a case, a phase angle r can be defined that de- or increases monotonically. Phase synchronization of two coupled systems occurs if the difference [r (t) --r between the corresponding phases is bounded by some constant. A more general definition includes rational relations Inr - rnr ] ( const for arbitrary integers n and m. Using the phase angle r one may define a mean rotation frequency = limt_~ r and in the case of PS, this mean rotation frequencies of the drive and the response system coincide, i.e., also for chaotic systems PS leads to the frequency entrainment known from coupled periodic oscillations. The amplitudes of both systems remain in this case completely uncorrelated [27].
3. Chaos Synchronization
519
This phenomenon may be used in technical or experimental applications where a coherent superposition of several o u t p u t channels is desired. In more abstract terms PS occurs when a zero Lyapunov exponent of the response system becomes negative. This leads to a reduction of the degree of freedom of the response system in the direction of the flow. For systems where a phase variable can be defined the direction of the flow coincides in general with the coordinate t h a t is described by the phase variable. A zero LE t h a t becomes negative reflects in this sense a restriction t h a t is imposed on the motion of the phase variable. If the zero LE t h a t decreases is the largest LE of the response system then phase synchronization occurs together with GS [22]. If there exist, however, in addition to the formerly zero LE, other LEs which are and remain positive, PS occurs but no GS. This scenario for the onset of PS m a y be observed for a sinusoidally driven RSssler system [36]:
21
=
0 . 4 -~ X l ( X 2 - 8 . 5 )
22
=
--Xl -- X3 + a cos(t)
X3
"~-
X2
(4.13)
+ 0.15x3.
Figure 5 shows the onset of PS when the driving amplitude a exceeds some critical value of ac ~ 0.4. The solid gray lines belong to the chaotic attractor of the driven system (4.13) and the black dots are plotted at times tn = n27~ yielding a stroboscopic phase portrait. As can be seen in Fig. 5a these dots are scattered on the chaotic a t t r a c t o r if the driving foice is too weak, indicating no fixed phase relation of the chaotic oscillation with respect to the driving signal. Figure 5b shows the distribution near the onset of PS where the dots already s t a r t to form a cluster. If the amplitude a is sufficiently high, phase sychronization occurs as can be seen for an amplitude of a = 0.7 in Fig. 5c. This transition can also be studied in terms of the Lyapunov exponents of the response system. Figure 6a shows the two largest exponents A1 and A2 of the RSssler system (4.13) plotted in dependence on the coupling p a r a m e t e r a. For a > 0.4 the zero exponent starts decreasing while A1 remains positive. The driven system thus looses a degree of freedom although it stays chaotic. This degree of freedom is associated with the zero Lyapunov exponent A2, i.e., with the (tangential) direction of the trajectories. This direction, however, is exactly the direction of the spiraling motion around the "hole" in the attractor t h a t was used for introducing a phase variable.
520
3. C h a o s S y n c h r o n i z a t i o n
(a) I0
0 x3
-10
'-lOl ....
Oi . . . . x2
irO '
,
(b) I0
0 x 3
-10
'
t
.
.
.
-I0
.
i
.
.
0
.
.
!
,
,
1o
x 2
(o) 10
0 x 3
-10
'-10' . . . .
O~ . . . .
1'0 '
'
'
x 2
F I G U R E 5. P h a s e s y n c h r o n i z a t i o n of a p e r i o d i c a l l y d r i v e n R0ssler s y s t e m (4.13) (a) a = 0.1, no PS; (b) a = 0.5, o n s e t of PS; (c) a = 0.7, full PS.
3. Chaos Synchronization
521
(a) 0.1 0.05 Ak 0 --0.05 --0.1 &
(b)
o:6
o:8
150 e
100
50
0
0:2
0:4
a
F I G U R E 6. (a) The two largest Lyapunov exponents A1 and A2, and (b) the mean synchronization error e (4.14) of a pair of identical R~ssler response systems (4.13) vs. the driving amplitude a.
Figure 6b shows the m u t u a l averaged s y n c h r o n i z a t i o n error e
e=
1 Y ~ ~=111x(tn) - ~(tn)ll2
(4.14)
of two identical R6ssler systems t h a t are driven by the same sinusoidal signal and sampled with t n : n " 27r/25. For GS such a c o m p a r i s o n with an auxiliary s y s t e m would result in an a s y m p t o t i c a l l y vanishing error, b u t here b o t h response systems are chaotic a n d t h e PS leads only to a decrease of e by a factor of a b o u t two. In this sense PS leads to a constructive interference of chaotic response signals t h a t has also been observed in m e a n field variables of arrays of slightly different response systems which were driven by a c o m m o n signal [25, 18]. A n o t h e r p h e n o m e n o n t h a t is closely related to PS is lag synchronization t h a t was observed recently by R o s e n b l u m et al. [28] a n d leads to synchronization with some time delay between drive a n d response.
522
5
3. Chaos Synchronization
Conclusions
In this chapter we have addressed specific topics and examples of "chaos synchronization": synchronization of spatially extended systems (PDEs), generalized synchronization and phase synchronization. In particular synchronization phenomena of uni-directionally coupled identical systems (here: pairs of PDEs) are very closely related to questions of observability and observer design. If the underlying dynamics is chaotic, intermittent breakdown of synchronization may occur if not not all of the unstable orbits which form the skeleton of the chaotic attractor are synchronized (for details see [1] and references cited therein). This phenomenon can be excluded if proper (global) stability conditions can be established (for example, in terms of Lyapunov functions). Another topic that is worth mentioning are potentim applications of (chaos) synchronization like system identification and data encryption. For system identification or parameter estimation a model equation is varied until it synchronizes with a given time series. This approach has been successfully applied to maps, coupled ODE's and coupled PDEs in order to estimate some free parameters [23, 14]. The advantage of chaotic dynamics for this task is the fact that a larger part of the state space is explored compared to periodic solutions. The second potential application mentioned above, "chaos communication", was actually for m a n y researchers the main motivation to study synchronization mechanisms of uni-directionally coupled systems. The basic idea is to transmit a modulated chaotic signal and to use synchronization for recovering the message at the receiver (see [37, 38] for a recent fast optical implementation and [1] for other examples). Whether this approach can really compete with standard cryptographie is still an open question but some special applications seem possible.
Acknowledgments This work was supported by the German Science Foundation (DFG grant Pa 643/1-1) and a binational German-Macedonian grant (MAK-004-96). 6
REFERENCES [1] Chaos 7(4), pp. 509-826, IEEE Trans. Circuits and Systems, part I 44, 1997. [2] H.D.I. Abarbanel, N.F. Rulkov and M.M. Sushchik. Generalized synchronization of chaos: The auxiliary system approach, Phys. Rev. E 53(5), pp. 4528-4535, 1996. [3] H.D.I Abarbanel and M. Kennel. Synchronizing High-Dimensional
3. Chaos Synchronization
523
Chaotic Optical Ring Dynamics, Phys. Rev. Lett. 80(14), pp. 31533156, 1998. [4] V.S. Afraimovich, N.N. Verichev and M.I. Rabinovich. Stochastic synchronization of oscillations in dissipative systems, Radiophys. Quantum Electron. 29, pp. 795-803, 1986. [5] I.I. Blekhman, A.L. Fradkov, H. Nijmeijer and A.Yu. Progromsky. On self-synchronization and controlled synchronization, Syst. Contr. Lett. 31(5), pp. 299-306, 1997. [6] M.E. Davies and K.M. Campbell. Linear recursive filters and nonlinear dynamics, Nonlinearity 9, pp. 487-499, 1996. [7] H. Fujisaka and T. Yamada. Stability Theory of Synchronized Motion in Coupled-Oscillator Systems, Progr. Theor. Phys. 69(1), pp. 32-47, 1983. [8] J. Goedgebuer, L. Larger and H. Porte. Optical Cryptosystem Based on Synchronization of Hyperchaos Generated by a Delayed Feedback Tunable Laser Diode, Phys. Rev. Lett. 80(10), pp. 2249-2252, 1998. [9] R. O. Grigoriev, M. C. Cross and H. G. Schuster. Pinning Control of Spatiotemporal Chaos, Phys. Rev. Lett. 79(15), pp. 2795-2798, 1997. [10] M. Hirsch and C. Pugh. Stable Manifolds and Hyperbolic Sets, In Global Analysis, AMS Proc. Symp. Pure Math. 14, 1970. [11] G. Hu and Z. Qu. Controlling Spatiotemporal Chaos in Coupled Map Lattice Systems, Phys. Rev. Lett. 72(1), pp. 68-71, 1994. [12] B.R. Hunt, E. Ott and J.A. Yorke. Differentiable generalized synchronization of chaos, Phys. Rev. E 55(4), pp. 4029-4034, 1997. [13] L. Junge, U. Parlitz, Z. Tasev and L. Kocarev. Synchronization and control of spatially extended systems using sensor coupling, To appear in the Int. J. Bi.f. Chaos, 1999. [14] L. Junge and. U. Parlitz. Control and synchronization of spatially extended systems, Proceedings of the 1998 International Symposium on Nonlinear Theory and its Applications - NOLTA '98, Le R~gent, Crans-Montana, Switzerland, Sept. 14-17, pp. 303-306, 1998. [15] L. Kocarev, Z. Tasev, T. Stojanovsk and U. Parlitz. Synchronizing spatiotemporal chaos, Chaos 7(4), pp. 635-643, 1997. [16] L. Kocarev, Z. Tasev and U. Parlitz. Synchronizing Spatiotemporal Chaos of Partial Differential Equations, Phys. Rev. Lett. 79(1), pp. 51-54, 1997.
524
3. Chaos Synchronization
[17] L. Kocarev and U. Parlitz. Generalized Synchronization, Predictability, and Equivalence of Unidirectionelly Coupled Dynamical Systems, Phys. Rev. Lett. 76(11), pp. 1816-1819, 1996. [18] G. Osipov, A. Pikovsky, M. Rosenblum and J. Kurths. Phase synchronization effects in a lattice of nonidentical R6ssler oscillators Phys. Rev. E 55(3), pp. 2353-2361, 1997. [19] N. Parekh, S. Parthasarathy and S. Sinha. Global and Local Control of Spatitemporal Chaos in Coupled Map Lattices, Phys. Rev. Left. 81(7), pp. 1401-1404, 1998. [20] U. Parlitz and L. Kocarev. Synchronization of Chaotic Systems, In H.-G. Schuster (Ed.) Handbook of Chaos Control, Wiley-VCH, 1998. [21] U. Parlitz, L. Junge and L. Kocarev. Subharmonic entrainment of Unstable Periodic Orbits and Generalized Synchronization, Phys. Rev. Lett. 79(17), pp. 3158-3161, 1997. [22] U. Parlitz, L. Junge, W. Lauterborn and L. Kocarev. Experimental observation of phase synchronization, Phys. Rev. E 54(2), pp. 21152117, 1996. [23] U. Parlitz, L. Junge and L. Kocarev. Synchronization-based parameter estimation from time series, Phys. Rev. E, 54(6), pp. 6253-6260, 1996. [24] L.M. Pecora and T.L. Carroll. Synchronization in chaotic systems, Phys. Rev. Lett. 64(8), pp. 821-824, 1990. [25] A.S. Pikovsky, M.G. Rosenblum and J. Kurths. Synchronization in a population of globally coupled oscillators, Europhys. Letters 34(3), pp. 165-170, 1996. [26] K. Pyragas. Weak and strong synchronization of chaos, Phys. Rev. E 54(5), pp. R4508-R4511, 1996. [27] M.G. Rosenblum, A.S. Pikovsky and J. Kurths. Phase Synchronization of Chaotic Oscillators, Phys. Rev. Lett. 76(11), pp. 1804-1807, 1996. [28] M.G. Rosenblum, A.S. Pikovsky and J. Kurths. From phase to lag synchronization in coupled chaotic oscillators, Phys. Rev. Lett. 78(22), pp. 4193-4196, 1997. [29] N. Rulkov, M. Sushchik, L. Tsimring and H.D.I. Abarbanel. Generalized synchronization of chaos in directionally coupled chaotic systems, Phys. Rev. E 51(2), pp. 980-994, 1995. [30] T. Sauer, J.A. Yorke and M. Casdagli. Embedology, J. Stat. Phys. 65(3,4), pp. 579-616, 1991.
3. Chaos Synchronization
525
[31] H.-G. Schuster (Ed.). Handbook of Chaos Control, Wiley-VCH, 1998. [32] S. J. Schiff, P. So, T. Chang, R.E. Burke and T. Sauer. Detecting dynamical interdependence and generalized synchrony through mutual prediction in a neural ensemble, Phys. Rev. E 54(6), pp. 6708-6724, 1996. [33] W. Singer. Synchronization of cortical activity and its putative role in information processing and learning, Ann. Rev. Physiol. 55, pp.349374, 1993. [34] C. Skarda and W. J. Freeman. How brains make chaos to order to make sense of the world Behav. Brain Sci. i0, pp. 161-195, 1987.
[35] J. Stark. Invariant graphs for forced systems, Physica D 109, pp. 163179, 1997. [36] E.-F. Stone. Frequency entrainment of a phase coherent attractor, Phys. Lett. A 163, pp.367-374, 1992. [37] G.D. Van Wiggeren and R. Roy. Communication with Chaotic Lasers, Science 279, 1998. [38] G.D. Van Wiggeren and R. Roy. Optical Communication with Chaotic Waveforms, Phys. Rev. Lett. 81(16), pp. 3547-3550, 1998. [39] S. Wiggins. Normally Hyperbolic Invariant Manifolds in Dynamical Systems, Springer-Verlag, New York, 1994.
Lecture Notes in Control and Information Sciences Edited by M. Thoma 1993-1999 Published Titles:
Vol. 186: Sreenath, N. Systems Representation of Global Climate Change Models. Foundation for a Systems Science Approach. 288 pp. 1993 [3-540-19824-5]
Vol. 195: Liu, D.; Michel, A.N. Dynamical Systems with Saturation Nonlinearities: Analysis and Design 212 pp. 1994 [3-540-19888-1] Vol. 196: Battilotti, S.
Vol. 187: Morecki, A.; Bianchi, G.;
Jaworeck, K. (Eds) RoManSy 9: Proceedings of the Ninth CISM-IFToMM Symposium on Theory and Practice of Robots and Manipulators. 476 pp. 1993 [3-540-19834-2]
Noninteracting Control with Stability for Nonlinear Systems 196 pp. 1994 [3-540-19891-1] Vol. 197: Henry, J.; Yvon, J.P. (Eds) System Modelling and Optimization 975 pp approx. 1994 [3-540-19893-8]
Vol. 188: Naidu, D. Subbaram
Aeroassisted Orbital Transfer: Guidance and Control Strategies 192 pp. 1993 [3-540-19819-9] Vol. 189: Ilchmann, A. Non-Identifier-Based High-Gain Adaptive Control 220 pp. 1993 [3-540-19845-8] Vol. 190: Chatila, R.; Hirzinger, G. (Eds) Experimental Robotics I1: The 2nd International Symposium, Toulouse, France, June 25-27 1991 580 pp. 1993 [3-540-19851-2] Vol. 191: Blondel, V. Simultaneous Stabilization of Linear Systems 212 pp. 1993 [3-540-19862-8] Vol. 192: Smith, R.S.; Dahleh, M. (Eds) The Modeling of Uncertainty in Control Systems 412 pp. 1993 [3-540-19870-9] Vol. 193: Zinober, A.S.I. (Ed.)
Variable Structure and Lyapunov Control 428 pp. 1993 [3-540-19869-5]
Vol. 198: Winter, H.; NQE,er, H.-G. (Eds) Advanced Technologies for Air Traffic Flow Management 225 pp approx. 1994 [3-540-19895-4]
Vol. 199: Cohen, G.; Quadrat, J.-P. (Eds) 1lth Intemational Conference on Analysis and Optimization of Systems Discrete Event Systems: Sophia-Antipolis, June 15-16-17, 1994 648 pp. 1994 [3-540-19896-2] Vol. 200: Yoshikawa, T.; Miyazaki, F. (Eds)
Experimental Robotics II1:The 3rd Intemational Symposium, Kyoto, Japan, October 28-30, 1993 624 pp. 1994 [3-540-19905-5] Vol. 201: Kogan, J. Robust Stability and Convexity 192 pp. 1994 [3-540-19919-5] Vol. 202: Francis, B.A.; Tannenbaum, A.R.
(Eds) Feedback Control, Nonlinear Systems, and Complexity 288 pp. 1995 [3-540-19943-8] Vol. 203: Popkov, Y.S.
Vol. 194: Cao, Xi-Ren Realization Probabilities: The Dynamics of Queuing Systems 336 pp. 1993 [3-540-19872-5]
Macrosystems Theory and its Applications: Equilibrium Models 344 pp. 1995 [3-540-19955-1]
Vol. 204: Takahashi, S.; Takahara, Y. Logical Approach to Systems Theory 192 pp. 1995 [3-540-19956-X] Vol. 205: Kotta, U. Inversion Method in the Discrete-time Nonlinear Control Systems Synthesis Problems 168 pp. 1995 [3-540-19966-7] Vol. 206: Aganovic, Z.; Gajic, Z. Linear Optimal Control of Bilinear Systems with Applications to Singular Perturbations and Weak Coupling 133 pp. 1995 [3-540-19976-4] Vol. 207: Gabasov, R.; Kirillova, F.M.; Pdschepova, S.V. Optimal Feedback Control 224 pp. 1995 [3-540-19991-8] Vol. 208: Khalil, H.K.; Chow, J.H.; Ioannou, P.A. (Eds) Proceedings of Workshop on Advances inControl and its Applications 300 pp. 1995 [3-540-19993-4] Vol. 209: Foias, C.; Ozbay, H.; Tannenbaum, A. Robust Control of Infinite Dimensional Systems: Frequency Domain Methods 230 pp. 1995 [3-540-19994-2] Vol. 210: De Wilde, P. Neural Network Models: An Analysis 154 pp. 1996 [3-540-19995-0]
Vol. 211: Gawronski, W. Balanced Control of Flexible Structures 280 pp. 1996 [3-540-76017-2] Vol. 212: Sanchez, A. Formal Specification and Synthesis of Procedural Controllers for Process Systems 248 pp. 1996 [3-540-76021-0] VoI. 213: Patra, A.; Rao, G.P. General Hybrid Orthogonal Functions and their Applications in Systems and Control 144 pp. 1996 [3-540-76039-3]
Vol. 214: Yin, G.; Zhang, Q. (Eds) Recent Advances in Control and Optimization of Manufacturing Systems 240 pp. 1996 [3-540-76055-5] Vol. 215: Bonivento, C.; Marro, G.; Zanasi, R. (Eds) Colloquium on Automatic Control 240 pp. 1996 [3-540-76060-1]
Vol. 216: Kulhav~, R. Recursive Nonlinear Estimation: A Geometric Approach 244 pp. 1996 [3-540-76063-6] Vol. 217: Garofalo, F.; Glielmo, L. (Eds) Robust Control via Variable Structure and Lyapunov Techniques 336 pp. 1996 [3-540-76067-9] Vol. 218: van der Schaft, A. L2 Gain and Passivity Techniques in Nonlinear Control 176 pp. 1996 [3-540-76074-1] Vol. 219: Berger, M.-O.; Deriche, R.; Herlin, I.; Jaffr6, J.; Morel, J.-M (Eds) ICAOS '96: 12th International Conference on Analysis and Optimization of Systems Images, Wavelets and PDEs: Pads, June 26-28 1996 378 pp. 1996 [3-540-76076-8] Vol. 220: Brogliato, B. Nonsmooth Impact Mechanics: Models, Dynamics and Control 420 pp. 1996 [3-540-76079-2]
Vol. 221: Kelkar, A.; Joshi, S. Control of Nonlinear Multibody Flexible Space Structures 160 pp. 1996 [3-540-76093-8] Vol. 222: Morse, A.S. Control Using Logic-Based Switching 288 pp. 1997 [3-540-76097-0] Vol. 223: Khatib, O.; Salisbury, J.K. Experimental Robotics IV: The 4th International Symposium, Stanford, Califomia, June 30 - July 2, 1995 596 pp. 1997 [3-540-76133-0]
Vol. 224: Magni, J.-F.; Bennani, S.; Terlouw, J. (Eds) Robust Flight Control: A Design Challenge 664 pp. 1997 [3-540-76151-9] Vol. 225: Poznyak, A.S.; Najim, K. Learning Automata and Stochastic Optimization 219 pp. 1997 [3-540-76154-3] Vol. 226: Cooperman, G.; Michler, G.; Vinck, H. (Eds) Workshop on High Performance Computing and Gigabit Local Area Networks 248 pp. 1997 [3-540-76169-1]
Vol. 227: Tarbouriech, S.; Garcia, G. (Eds) Control of Uncertain Systems with Bounded Inputs 203 pp. 1997 [3-540-76183-7] Vol. 228: Dugard, L.; Verriest, E.l. (Eds) Stability and Control of Time-delay Systems 344 pp. 1998 [3-540-76193-4]
Vol. 234: Arena, P.; Fortuna, L.; Muscato, G.; Xibilia, M.G. Neural Networks in Multidimensional Domains: Fundamentals and New Trends in Modelling and Control 179 pp. 1998 [1-85233-006-6] Vol. 235: Chen, B.M. Hoo Control and Its Applications 361 pp. 1998 [1-85233-026-0] Vol. 236: de Almeida, A.T.; KhatJb, O. (Eds) Autonomous Robotic Systems 283 pp. 1998 [1-85233-036-8] Vol. 237: Kreigman, D.J.; Hagar, G.D.; Morse, A.S. (Eds) The Confluence of Vision and Control 304 pp. 1998 [1-85233-025-2] Vol. 238: Ella, N. ; Dahleh, M.A. Computational Methods for Controller Design 200 pp. 1998 [1-85233-075-9]
Vol. 229: Laumond, J.-P. (Ed.) Robot Motion Planning and Control 360 pp. 1998 [3-540-76219-1]
Vol. 239: Wang, Q.G.; Lee, T.H.; Tan, K.K. Finite Spectrum Assignment for Time-Delay Systems 200 pp. 1998 [1-85233-065-1]
Vol. 230: Siciliano, B.; Valavanis, K.P. (Eds) Control Problems in Robotics and Automation 328 pp. 1998 [3-540-76220-5]
Vol. 240: Lin, Z. Low Gain Feedback 376 pp. 1999 [1-85233-081-3]
Vol. 231: Emel'yanov, S.V.; Burovoi, I.A.; Levada, F.Yu. Control of Indefinite Nonlinear Dynamic Systems 196 pp. 1998 [3-540-76245-0]
VoL 241: Yamamoto, Y.; Hara S. Learning, Control and Hybrid Systems 472 pp. 1999 [1-85233-076-7]
Vol. 232: Casals, A.; de Almeida, A.T. (Eds) Experimental Robotics V: The Fifth International Symposium Barcelona, Catalonia, June 15-18, 1997 190 pp. 1998 [3-540-76218-3] Vol. 233: Chiacchio, P.; Chiavedni, S. (Eds) Complex Robotic Systems 189 pp. 1998 [3-540-76265-5]
Vol. 242: Conte, G.; Moog, C.H.; Perdon A.M. Nonlinear Control Systems 192 pp. 1999 [1-85233-151-8] Vol. 243: Tzafestas, S.G.; Schmidt, G. (Eds) Progress in Systems and Robot Analysis and Control Design 624 pp. 1999 [1-85233-123-2]