Applications of Space-Time Adaptive Processing Edited by Richard Klemm
The Institution of Electrical Engineers
Published by: The Institution of Electrical Engineers, London, United Kingdom © 2004: The Institution of Electrical Engineers This publication is copyright under the Berne Convention and the Universal Copyright Convention. All rights reserved. Apart from any fair dealing for the purposes of research or private study, or criticism or review, as permitted under the Copyright, Designs and Patents Act, 1988, this publication may be reproduced, stored or transmitted, in any forms or by any means, only with the prior permission in writing of the publishers, or in the case of reprographic reproduction in accordance with the terms of licences issued by the Copyright Licensing Agency. Inquiries concerning reproduction outside those terms should be sent to the publishers at the undermentioned address: The Institution of Electrical Engineers, Michael Faraday House, Six Hills Way, Stevenage, Herts., SGl 2AY, United Kingdom While the authors and the publishers believe that the information and guidance given in this work are correct, all parties must rely upon their own skill and judgment when making use of them. Neither the authors nor the publishers assume any liability to anyone for any loss or damage caused by any error or omission in the work, whether such error or omission is the result of negligence or any other cause. Any and all such liability is disclaimed. The moral right of the authors to be identified as authors of this work have been asserted by them in accordance with the Copyright, Designs and Patents Act 1988.
British Library Cataloguing in Publication Data Klemm, Richard Applications of space-time adaptive processing 1. Adaptive signal processing 2. Adaptive antennas 3. Radar 4. Sonar I. Title II. Institution of Electrical Engineers 621.3'848 ISBN 0 85296 924 4
Typeset in India by Newgen Imaging Systems (P) Ltd., Chennai, India Printed in the UK by MPG Books Limited, Bodmin, Cornwall
Preface
I have been asked frequently which is the more difficult task, to write a book on your own or to edit a multiauthor book such as the one at hand. I have tried both and found that each of the two kinds of project has its own charming facets. It is a pity that the work is done now - 1 enjoyed so much working on these books. My first book 'Space-time adaptive processing - principles and applications' (IEE, 1998) contains mainly a summary of my own work in this fascinating area, specialising in the most popular application: clutter suppression for airborne radar. The book has been so well received that a second extended edition 'Principles of space-time adaptive processing' appeared in 2002. While working on the second edition it came to my mind that this book contains only a subset of the broad field of space-time adaptive processing (STAP) and, moreover, reflects only my personal view of the subject. In particular, aspects of STAP operation on real clutter data are missing. Therefore, I proposed to the IEE to edit another book on STAP comprising a large variety of contributions by different distinguished authors so as to cover the entire area of space-time processing as much as possible. In contrast to my first book, applications of STAP are emphasised in this volume. The publisher kindly agreed to this ambitious plan, and I approached a large number of scientists well known in the STAP field and asked them for cooperation. I am amazed that almost all individuals I contacted immediately agreed to contribute. The total number of contributors amounts to 45! Waves are by nature functions of space and time. Whoever deals with the interpretation of waves has to apply space-time processing techniques. The fundamental paper by Brennan and Reed 'Theory of adaptive radar' (IEEE Trans. AES, 9, (2), March, 1973, pp. 237-252) has been formulated already in space-time notation, thus addressing the effects of broadband array antennas. In this paper by 'time' the fast (range equivalent) time was meant. Three years later the same authors extended their ideas to the use of array antennas in the space-slow time domain (pulse-to-pulse) for clutter rejection in moving radar systems. This was the first publication on what most people in the radar community understand by STAR The book is subdivided into two main sections: A 'Suppression of clutter in moving radar' and B 'Other space-time processing applications'. Each main section is divided in different parts dedicated to specific aspects of space-time processing.
Section A consists of four parts which deal with various aspects of the traditional STAP in GMTI (ground moving target indication) applications for moving radar platforms such as an aircraft or a satellite. Here the reader may find detailed information on topics such as STAP and SAR, space-based MTI, specific antenna configurations, STAP performance in real, heterogeneous clutter, specific (e.g. non-linear) algorithms and processor architectures, robust signal detection techniques, non-adaptive space-time clutter filters, effect of range ambiguous clutter etc. Section B includes specific applications of space-time techniques in various disciplines such as fast time STAP for broadband radar (jammer cancellation, superresolution), tracking of ground targets with STAP radar, interference reduction in over-the-horizon radar (with reference to terrain scattered jamming). Another part is dedicated to applications in seismics and acoustics. The last part deals with spacetime techniques as proposed for communication systems, including mitigation of mutual interference in cellular phones, reduction of multipath effects in underwater communications, interference suppression for GPS and space-time coding. At the end of each chapter a brief summary is given in which the major insights are highlighted. Moreover, each chapter concludes with a list of references which helps the interested reader to find in-depth background literature. The total number of references amounts to about 900. I hope that the reader will enjoy reading this unique book and will appreciate the effort made by 45 leading experts in the space-time processing field in order to bring their individual expertise to the reader's attention. In particular, by having a look 'over the fence' in other fields I expect some cross-fertilisation between different but somehow related disciplines. It is intended that scientists working in different disciplines may learn from each other, and that new ideas based on the fruit harvested already in a neighbour's garden are stimulated. If this expectation comes true the team of authors has reached its goal. I want to express my gratitude to all the authors who did a tremendous work in contributing to this unique book and encouraging the editor in his ambitious undertaking. It was a real pleasure to work with all of them. I am grateful to K. Kriicker and J. Ender of FGAN for supporting this work. I want to thank the IEE personally and on behalf of all the authors for the excellent job done. Specifically I would like to thank the anonymous reviewers for their revision of the manuscript, the commissioning editor Sarah Kramer and the editorial assistant, Wendy Hiles, for the excellent cooperation and the high quality of the final product. Richard Klemm
Glossary
a a ABF ACP ACE ADC AEP AEW AIC ALQ AMF AoA Ar ARMA ASB ASEP ASFF ATI AWACS AWGN
auxiliary channel vector noise-to-clutter ratio adaptive beamforming auxiliary channel processor adaptive coherence estimator analogue-to-digital conversion auxiliary eigenvector processor airborne early warning akaike information criterion adaptive linear quadratic adaptive matched filter angle of arrival received signal amplitude autoregressive moving average adaptive sidelobe blanker auxiliary sensor/echo processor auxiliary sensor FIR filter processor along-track interferometry Airborne Warning and Control System additive white Gaussian noise
b B B BASS-ALE Bc #D /3 BER BF BICM BK
beamformer vector bandwidth number of beamformer elements broadband signal subspace spatial-spectral estimation clutter bandwidth Doppler bandwidth look angle relative to array axis bit error rate beamforming bit interleaved coded modulation backward method
bk BLAST BLE BS Bs BW
beamformer weights Bell Labs layered space-time transceiver block linear equaliser basestation system bandwidth beamwidth
c C c cF Cr CALC CCD CCI CDMA CE CFAR CGM CIG CMP CMT CNR Coho COMET CPI CRB CRP CRS CSM CSST CW
light velocity number of space-time channels vector of clutter echoes vector of clutter spectral components transformed vector of clutter echoes constrained averaged likelihood ratio concealment, camouflage and deception cochannel interference code division multiple access capacity efficient constant false alarm rate conjugate gradient method common image gathers common midpoint covariance matrix taper clutter-to-noise power ratio coherent oscillator covariance matching estimation techniques coherent processing interval Cramer-Rao bound common reflection point common reflection surface cross spectral metric coherent signal subspace transformation continuous wave
d dimSS D(O) D{(p) ds DSW DUM dx dy dz D3LS AR
sensor spacing dimension of signal subspace vertical sensor directivity pattern horizontal sensor directivity pattern subarray displacement direct subarray weighting direct uniform manifold model sensor spacing in ^-direction sensor spacing in y-direction sensor spacing in z-direction direct data domain least-squares width of range bin
DF DFB DFT DL DMO DoA DoF DPCA DS
decision feedback Doppler filter bank discrete Fourier transform downlink dip moveout direction of arrival degrees of freedom displaced phase centre antenna direct sequence
e/ E{} ESPRIT
unit vector (7-th column of unit matrix) expectation estimation of signal parameters by rotational invariance techniques envelope of transmitted waveform
E(t) f F F (p FAP FB cpc /c /D FD FDFF FDSP FFT FIR O^ FL <^L F-LAS /Ny /PR /r FREQUENCY /s F-SAS FT O* FW
vector of D F T or D F B output signals D F T matrix, Doppler filter bank normalised target Doppler frequency azimuth (optimum) fully adaptive processor forward-backward method clutter angle of arrival carrier frequency Doppler frequency frequency domain frequency domain FIR filter frequency dependent spatial processing fast Fourier transform finite impulse response spatial phase term forward looking look angle fading-large angle spread Nyquist frequency pulse repetition frequency, PRF relative clutter Doppler frequency, normalised by maximum clutter Doppler frequency frequency associated with TIME (fast radar time) spatial frequency fading-small angle spread Fourier transform temporal phase term forward method
G(.) GAA GER GIP GLRT GSLC GMTI
transmit directivity pattern Gaussian angle of arrival generalised eigenrelation generalised inner product generalised likelihood ratio test generalised sidelobe canceller ground moving target indication
H HRR
platform altitude above ground high range resolution
I ICI ICM IF IF IID HR IMM int{} ISAR ISI
identity matrix intercell interference internal clutter motion improvement factor intermediate frequency independent and identically distributed infinite impulse response interacting multiple models next integer number inverse synthetic aperture radar intersymbol interference
j J J JD JDL JDL-GLR ji jik JNR Joint-STARS
jammer vector number of jammers Fisher information matrix joint detection joint domain localised joint-domain localised generalised likelihood ratio /-th jammer signal component /fc-th element of the Fisher information matrix jammer-to-noise ratio joint strategic target attack radar system
K KASSPER
reduced number of antenna channels knowledge aided sensor signal processing and expert reasoning
L
number of FIR filter taps, temporal dimension of space-time FIR filter reflectivity function local area network wavelength linear dispersion
L(.) LAN A LD
LMI LMS LO LoS
lean matrix inversion least mean square low observable line of sight
M MAC MAI MCARM MDL MDV MEM MF MFR MGLRT MIMO MISO ML MLSE MMSE MRC MS MSAR MTD MTDI MTI /x MUSIC MVE
number of echo pulses or temporal samples media access layer multiple access interference multichannel airborne radar measurement minimum description length minimum discernable velocity maximum entropy method matched filter matched filter response modified generalised likelihood ratio test multiple input, multiple output multiple input, single output maximum likelihood maximum likelihood sequence estimator minimum mean square error maximum ratio combining mobile station multichannel SAR moving target detector moving target detection and imaging moving target indicator factor determining the number of Doppler channels multiple signal classification minimum variance estimator
N n N TVe np NIC NIP NIR NMO iix Nt
noise covariance matrix noise vector number of antenna elements number of eigenvalues noise spectrum vector network interface card normal incidence point noise-to-interference ratio normal moveout transformed noise vector number of transmit elements
O OAP
zero matrix optimum adaptive processor
OC OFDM coc coj) OPP OUS
optimum combining orthogonal frequency division multiplexing carrier frequency angular Doppler frequency orthogonal projection processor overlapping uniform subarray configuration
PAM Pc PCI PD PDA PDF PDOF PDR PEP PHY Pj Pn Pr PRF PRI PRT Ps PSD PSF x/s PSK PST Pt
pulse amplitude modulation clutter power principal components inverse pulse Doppler probabilistic data association probability distribution function product of the spatial and temporal degrees of freedom pulse Doppler radar pairwise error probability physical layer jammer power noise power received signal power pulse repetition frequency pulse repetition interval pulse repetition time signal power power spectral density point spreading function crab angle phase shift keying power-selective training transmitted power
q Q Q QAM Qc, Q (c) QF Qi Qj Qn QoS QoT QPSK Qs QT
interference vector number of data vectors, number of normal modes interference + noise covariance matrix quadrature amplitude modulation clutter covariance matrix interference + noise power spectral matrix interference covariance matrix jammer covariance matrix noise covariance matrix quality of service quality of transmission quaternary phase shift keying signal covariance matrix transformed interference + noise covariance matrix
R R R /?2 w r9o RCS RF Rg p RL-STAP RMS Rs R(t) Rx
number of range increments range covariance matrix of signal + interference + noise two-way slant range quarter wave sampling interval radar cross section radiofrequency ground range autocorrelation Rome Laboratory space-time adaptive processing root-mean-square slant range range receive
s signal vector S signal + clutter covariance matrix s(-) steering vector SAR synthetic aperture radar SAS symmetric auxiliary sensor configuration SC single carrier SCNR signal-to-clutter + noise ratio SCR signal-to-clutter power ratio SD/TD/CDMA space division/time division/code division multiple access SDM spectral density matrix SDMA space division multiple access SER symbol error rate Sp signal spectrum vector E DPCA shift operator SIMO single input, multiple output sine (X) sin(x)/x SINR signal-to-interference-plus-noise ratio SIRP spherically invariant random process SISO single input, single output SL sidelooking SM spatial multiplexing SM signal match SMI sample matrix inverse SNIR signal-to-noise + interference ratio SNR signal-to-noise power ratio SoI signal of interest s r (t) received signal ST space-time ST transformed signal vector STAP space-time adaptive processing
STBC STFT STTC SVD
space-time block code short-time Fourier transform space-time trellis code singular value decomposition
t T T r xt TCM TDD TDMA 0 TIME time tr truAM Ts TSI Tx
time pulse repetition interval space-time transform matrix echo delay round trip delay trellis coded modulation time division duplex time division multiple access depression angle echo delay time, range time (fast time) pulse-to-pulse time (slow time) trace of a square matrix true array manifold spatial transform matrix terrain scattered interference transmit
UAV UESA UL ULA UMTS
unmanned air vehicle U H F electronically scanned array uplink uniform linear array universal mobile telecommunications system
vc Vp f ra d VSAR vt i>tan
radial clutter velocity platform velocity (x-direction) radial target velocity velocity SAR target velocity tangential target velocity
WAVES WNSF WSF WVD
weighted average of signal subspaces weighted noise subspace fitting weighted subspace fitting Wigner-Ville distribution
x xp Xt Xx
vector of received echoes spectral vector of received echoes jc-coordinate of /-th sensor transformed vector of received echoes
y yc yi
output signal correction pattern y-coordinate of i-th sensor
ZF H ZO *
zero forcing z-coordinate of/-th sensor zero offset conjugate complex or conjugate complex transpose
* 0 O
convolution Kronecker product zero vector
List of Contributors
Yuri I. Abramovich CSSIP, SPRI Building, Technology Park Adelaide, Mawson Lakes, South Australia 5095
Russell D. Brown Department of Electrical and Computer Engineering, Syracuse University, Syracuse, New York 13244-1240, USA
Stuart J. Anderson CSSIP, SPRI Building, Technology Park Adelaide, Mawson Lakes, South Australia 5095
Jeffrey T. Carlo AFRL/SNRD, 26 Electronic Parkway, Rome, New York 13441-4514, USA. e-mail:
[email protected]
Stephan Benen ATLAS ELEKTRONIK GmbH, Sebaldsbriicker Heerstr. 235, D-28305 Bremen, Germany Steffen Bergler Geophysical Institute, University of Karlsruhe, Hertzstr. 16, 76187 Karlsruhe, Germany R. S. Blum ECE Department, Lehigh University, 19 Memorial Drive West, Bethlehem, PA 18015-3084, USA. Tel: (610) 758-3459; Fax: (610) 758-6279; e-mail:
[email protected]. Johann F. Bohme Ruhr-Universitat Bochum, Fakultat Fiir Elektrotechnik, 44780 Bochum
Pei-Jung Chung Ruhr-Universitat Bochum, Fakultat Fiir Elektrotechnik, 44780 Bochum Fabiola Colone Dept. INFOCOM, University of Rome 'La Sapienza', Via Eudossiana 18, 00184 Rome, Italy. Tel: +39-06-44585472; Fax: +39-06-4873300 Eric Duveneck Geophysical Institute, University of Karlsruhe, Hertzstr. 16, 76187 Karlsruhe, Germany Alfonso Farina AMS (Alenia Marconi Systems) Chief Technical Office Scientific Director, Via Tiburtina km. 12.400, 00131 Rome, Italy. Tel: +39-6-41502279;
Fax: +39-6-4150-2665; e-mail:
[email protected] Christoph H. Gierull Defence R&D Canada, Ottawa (DRDC-O), 3701 Carling Ave., Ottawa, ON, Canada, KlA 0Z4. e-mail:
[email protected] Dhananjay Gore Qualcomm Inc., 9940 Barnes Canyon Road, San Diego, CA 92121, USA. e-mail:
[email protected] Alexei Y. Gorokhov CSSIP, SPRI Building, Technology Park Adelaide, Mawson Lakes, South Australia 5095 Peter Hubral Geophysical Institute, University of Karlsruhe, Hertzstr. 16, 76187 Karlsruhe, Germany
Yung P. Lee Science Applications International Corporation, 1710 SAIC Drive, McLean, VA 22102, USA. Tel: 703-676-6512; Fax: 703-893-8753; e-mail:
[email protected] Chuck Livingstone Defence R&D Canada, Ottawa (DRDC-O), 3701 Carling Ave., Ottawa, ON, Canada, KlA 0Z4 Pierfrancesco Lombardo Dept. INFOCOM, University of Rome 4 La Sapienza', Via Eudossiana 18, 00184 Rome, Italy. Tel: +39-06-44585472; Fax: +39-06-4873300; e-mail:
[email protected] .it,
[email protected] .it Dirk Maiwald ATLAS ELEKTRONIK GmbH, Sebaldsbrucker Heerstr. 235, D-28305 Bremen, Germany
Richard Klemm FGAN-FHR, Neuenahrer Str. 20, D 53343 Wachtberg, Germany. Tel: ++49 228 9435 377; Fax:++49 228 348 618; e-mail:
[email protected]
Jiirgen Mann Geophysical Institute, University of Karlsruhe, Hertzstr. 16, 76187 Karlsruhe, Germany
Wolfgang Koch FGAN-FKIE, Neuenahrer Strasse 20, D 53343 Wachtberg, Germany. Tel: +49-(0)228/9435-529; Fax: -685; e-mail:
[email protected]
K. F. McDonald MITRE Corporation, 202 Burlington Road, Bedford, MA 01730-1420, USA. Tel: (781) 271-7739; Fax:(781)271-7045; e-mail:
[email protected]
Stephen M. Kogon MIT Lincoln Laboratory, 244 Wood Street, Lexington, MA 02420-9108, USA
William L. Melvin Georgia Institute of Technology, Georgia Tech Research Institute (GTRI), Atlanta, GA, USA
Wilbur L. Myrick SAIC, 4501 Daly Drive, Chantilly, VA 20151,USA. e-mail:
[email protected] Rohit Nabar ETF El 19, Sternwartstrasse 7, Zurich CH8092, Switzerland, e-mail:
[email protected]
Tapan K. Sarkar Department of Electrical Engineering and Computer Science, Syracuse University, 123 Link Hall, Syracuse, New York 13244-1240, USA. e-mail:
[email protected]; http ://web. syr.edu/~tksarkar
Ulrich Nickel FGAN-FHR, Neuenahrer Str. 20, 53343 Wachtberg, Germany
Helmut Schmidt-Schierhorn ATLAS ELEKTRONIK GmbH, Sebaldsbriicker Heerstr. 235, D-28305 Bremen, Germany
Tim J. Nohara Sicom Systems Ltd., 67 Canboro Rd., 2nd Floor, RO. Box 366, Fonthill, Ontorio, LOS IEO
Richard A. Schneible Department of Electrical and Computer Engineering, Syracuse University, Syracuse, New York 13244-1240, USA
Arogyaswami Paulraj Smart Antennas Research Group, Packard 272, Stanford University, Stanford, CA 94305, USA. e-mail:
[email protected]
Nicholas K. Spencer CSSIP, SPRI Building, Technology Park Adelaide, Mawson Lakes, South Australia 5095
Peter G. Richardson QinetiQ Malvern, Malvern Technology Centre, St. Andrews Road, Malvern, Worcs., UK, WR14 3PS. Tel/Fax: 01684 894316/01684 894185; e-mail:
[email protected] Magdalena Salazar-Palma Dpto. Senales, Sistemas y Radiocomunicaciones, ETSI Telecommunicacion, Universidad Politecnica de Madrid, Ciudada Universitaria, s/n 28040, Madrid, Spain, e-mail:
[email protected] Sumeet Sandhu Intel Corporation, M/S RNB 6-49, 2200 Mission College Blvd, Santa Clara, CA 95052, USA. e-mail:
[email protected]
L. Timmoneri Technical Directorate, Radar & Technology Division, Alenia Marconi Systems, Via Tiburtina km. 12.400, 00131 Rome, Italy. Tel: +39-6-41502279; Fax: +39-6-41502665; e-mail:
[email protected] Christoph M. Walke COHAUSZ & FLORACK, Patent- und Rechtsanwalke, Kanzlerstrasse 8a, 40472 Dusseldorf H. Wang Department of Electrical and Computer Engineering, Syracuse University, Syracuse, New York 13244-1240, USA
R Weber Sicom Systems Ltd., 67 Canboro Rd., 2nd Floor, P.O. Box 366, Fonthill, Ontario, LOS IEO
Rolf Weber Ruhr-Universitat Bochum, Fakultat Fiir Elektrotechnik, 44780 Bochum
Michael C. Wicks AFRL/SN, 26 Electronic Parkway, Rome, New York 13441-4514, USA. e-mail:
[email protected]
Michael Zatman MIT Lincoln Laboratory, 244 Wood Street, Lexington, MA 02420-9108, USA Y. Zhang Department of Electrical and Computer Engineering, Syracuse University, Syracuse, New York 13244-1240, USA Michael D. Zoltowski School of Electrical Engineering, Purdue University, West Lafayette, IN 47907-1285, USA. e-mail:
[email protected]
Contents
Preface ...............................................................................
xix
Glossary .............................................................................
xxi
List of Contributors ............................................................. xxxi Section A. Suppression of Clutter in Moving Radar Part I. 1.
Space-slow Time Processing for Airborne MTI Radar
Space-time Adaptive Processing for Manoeuvring Airborne Radar ..................................................................................
5
1.1
Introduction .............................................................
5
1.2
STAP Fundamentals ...............................................
6
1.3
Clutter Angle-Doppler Relationships ....................... 1.3.1 Straight and Level Flight ............................ 1.3.2 Effect of Variations in Platform Orientation .................................................
9 9 11
1.4
Clutter Suppression in Forward-looking Radar ....... 1.4.1 Mainlobe Clutter Suppression ................... 1.4.2 Sidelobe Clutter Suppression ....................
12 12 18
1.5
Slow Moving Target Detection under Conditions of Manoeuvre .............................................................. 1.5.1 Effects of Platform Manoeuvre .................. 1.5.2 Motion Compensation ...............................
23 23 24
Jammer Rejection under Conditions of Manoeuvre .............................................................. 1.6.1 Mainlobe Clutter Filtering Requirements ...
27 27
1.6
This page has been reformatted by Knovel to provide easier navigation.
v
vi
Contents 1.6.2
Advantages of Using STAP .......................
27
Summary ................................................................
33
Non-linear and Adaptive Two-dimensional FIR Filters for STAP: Theory and Experimental Results ...........................
37
2.1
Introduction .............................................................
37
2.2
Adaptive Linear Filters ............................................
38
2.3
AR-based FIR Filters ..............................................
45
2.4
Non-linear Combination of Non-adaptive Filters ..... 2.4.1 Filter Bank Design ..................................... 2.4.2 Detection Threshold and Performance ...... 2.4.3 AR-based Non-linear Detector ..................
51 52 55 56
2.5
Non-linear Combination of Adaptive AR-based Two-dimensional FIR Filters ...................................
61
2.6
Conclusions ............................................................
66
2.7
Acknowledgments ...................................................
69
2.8
Appendix: ML Estimation of Two-dimensional AR parameters ..............................................................
69
Space-time Techniques for SAR ........................................
73
3.1
Summary ................................................................
73
3.2
Description of the Problem and State of the Art .....
73
3.3
Model of MSAR Echoes .......................................... 3.3.1 Aberrations Due to Target Motion ............. 3.3.2 Space-time-frequency Representation ......
76 76 77
3.4
Processing Schemes .............................................. 3.4.1 Taxonomy of Processing Schemes for MSAR ........................................................ 3.4.2 MTI + PD ................................................... 3.4.3 DPCA ........................................................ 3.4.4 Along-track Interferometry (ATI)-SAR ....... 3.4.5 Processor in the Space-time-frequency Domain ...................................................... 3.4.6 Optimum Processing for MSAR .................
82
1.7 2.
3.
This page has been reformatted by Knovel to provide easier navigation.
82 87 94 95 98 107
Contents
4.
3.5
Conclusions ............................................................
119
3.6
Acknowledgments ...................................................
120
Σ∆-STAP: an Efficient, Affordable Approach for Clutter Suppression .......................................................................
123
4.1
Definition of the Difference (∆) Beams ....................
123
4.2
Σ∆-STAP Algorithms ...............................................
125
4.3
Analytical Performance Formulas of Σ∆-STAP ....... 4.3.1 SINR Potential ........................................... 4.3.2 Probabilities of Detection and False Alarm .........................................................
129 129
4.4
A Real-data Demonstration of Σ∆-STAP ................
131
4.5
Desired ∆-beam Characteristics ............................. 4.5.1 Mathematical Equivalence of Subarray and Σ∆-STAP .............................................
135
Summary ................................................................ 4.6.1 Advantages of the Σ∆-STAP Approach ..... 4.6.2 Limitations of Σ∆-STAP ............................. 4.6.3 Potential Applications of Σ∆-STAP ............
143 143 145 146
STAP with Omnidirectional Antenna Arrays .......................
149
5.1
Introduction ............................................................. 5.1.1 Preliminaries on STAP Antennas .............. 5.1.2 The Circular Ring Array Concept ...............
149 149 151
5.2
Array Configurations for 360° Coverage ................. 5.2.1 Four Linear Arrays ..................................... 5.2.2 Displaced Circular Rings ........................... 5.2.3 Circular Planar Array with Randomly Distributed Elements ................................. 5.2.4 Octagonal Planar Array .............................
152 153 156
5.3
Discussion .............................................................. 5.3.1 Directivity Patterns .................................... 5.3.2 Range-ambiguous Clutter .........................
164 164 165
5.4
Effect of Array Tilt ...................................................
167
4.6
5.
vii
This page has been reformatted by Knovel to provide easier navigation.
130
142
157 160
viii
Contents 5.4.1
Side-looking Linear and Rectangular Arrays ........................................................ Omnidirectional Arrays ..............................
167 168
5.5
Conclusions ............................................................
169
Part II.
Space-slow Time Processing for Space-based MTI Radar
5.4.2
6.
7.
SAR-GMTI Concept for RADARSAT-2 ...............................
177
6.1
177 177
Introduction ............................................................. 6.1.1 Background ............................................... 6.1.2 Addition of MTI Modes to Spaceborne SAR ........................................................... 6.1.3 RADARSAT-2 Moving Object Detection Experiment ................................................
179
6.2
Analysis of SAR-GMTI Modes for RADARSAT-2 ... 6.2.1 Background ............................................... 6.2.2 Statistical Models of Measured Signals ..... 6.2.3 SCNR Optimum Processing ...................... 6.2.4 SAR Displaced Phase Centre Antenna ..... 6.2.5 SAR Along-track Interferometry .................
180 181 184 188 193 194
6.3
SAR-STAP Scheme for RADARSAT-2 ................... 6.3.1 Detection ................................................... 6.3.2 Parameter Estimation ................................
196 196 201
6.4
Conclusions ............................................................
202
6.5
List of Symbols .......................................................
203
STAP Simulation and Processing for Spaceborne Radar ..................................................................................
207
7.1
Introduction .............................................................
207
7.2
Spaceborne Radar Applications and Design .......... 7.2.1 Spaceborne MTI Radar Applications ......... 7.2.2 Spaceborne MTI Radar Design .................
208 208 209
7.3
STAP Processing for SBR ......................................
212
This page has been reformatted by Knovel to provide easier navigation.
178
Contents 7.3.1 7.3.2 7.3.3
8.
ix
Typical GMTI Signal Processing ............... Extension to Other Modes ......................... Other Issues ..............................................
212 215 216
7.4
Simulation and Processing for SBR ........................ 7.4.1 User Interface ............................................ 7.4.2 Model the Radar ........................................ 7.4.3 Model the Environment ............................. 7.4.4 Generate the Signals ................................. 7.4.5 Model the Processing ................................ 7.4.6 Evaluate the Results .................................
217 218 224 225 227 228 229
7.5
Discussion and Conclusions ...................................
231
Techniques for Range-ambiguous Clutter Mitigation in Space-based Radar Systems .............................................
235
8.1
Introduction .............................................................
235
8.2
Moving Target Detection with SBR ......................... 8.2.1 STAP for SBR Systems .............................
236 238
8.3
Clutter Characteristics of Pulse-Doppler Waveforms in SBR ..................................................................... 8.3.1 Clutter Doppler Ambiguities ....................... 8.3.2 Clutter Range Ambiguities .........................
240 241 242
Impact of Range-ambiguous Clutter on STAP Performance ...........................................................
244
Range-ambiguous Clutter Mitigation Techniques with Pulse-Doppler Waveforms .............................. 8.5.1 PRF Diversity ............................................ 8.5.2 Aperture Trade Offs ...................................
247 247 249
8.4 8.5
8.6
8.7
Long Single Pulse Phase-encoded Waveforms ..... 8.6.1 Properties of Long Single Pulse Phaseencoded Waveform (LSPW) ...................... 8.6.2 Integrated Sidelobe Clutter Levels ............ 8.6.3 STAP Simulations .....................................
250
Summary ................................................................
260
This page has been reformatted by Knovel to provide easier navigation.
252 254 257
x
Contents
Part III. 9.
Processing Architectures
Parallel Processing Architectures for STAP .......................
265
9.1
Summary and Introduction ......................................
265
9.2
Baseline Systolic Algorithm ....................................
265
9.3
Lattice and Vectorial Lattice Algorithms ..................
269
9.4
Inverse QRD-based Algorithms ..............................
271
9.5
Experiments with General Purpose Parallel Processors ..............................................................
272
9.6
Experiments with VLSI-based CORDIC Board .......
273
9.7
Modern Signal Processing Technology Overview and Its Impact on Real-time STAP .........................
275
9.8
Processing of Recorded Live Data ......................... 9.8.1 Systolic Algorithm for Live Data Processing ................................................. 9.8.2 Data Files Used in the Data Reduction Experiments .............................................. 9.8.3 Performance Evaluation ............................ 9.8.4 Detection of Vehicular Traffic ....................
278 280 284
9.9
Concluding Remarks ..............................................
285
9.10
Appendix A: Givens Rotations and Systolic Implementation of Sidelobe Canceller ....................
286
9.11
Appendix B: Lattice Working Principle ....................
288
9.12
Appendix C: the CORDIC Algorithm .......................
289
9.13
Appendix D: the SLC Implementation via CORDIC Algorithm .................................................................
292
Appendix E: an Example of Existing Processors for STAP .................................................................
293
9.14
Part IV.
277
Clutter Inhomogeneities
10. STAP in Heterogeneous Clutter Environments .................. 10.1
277
Introduction ............................................................. 10.1.1 Adaptivity with Finite Sample Support .......
This page has been reformatted by Knovel to provide easier navigation.
305 305 307
Contents
xi
10.1.2 STAP Performance Metrics ....................... 10.1.3 Covariance Matrix Errors ...........................
308 311
10.2
Classes of Space-time Clutter Heterogeneity ......... 10.2.1 General Simulation Characteristics ...........
312 315
10.3
Amplitude Heterogeneity ........................................ 10.3.1 Clutter Discretes ........................................ 10.3.2 Range-angle Varying Clutter RCS ............ 10.3.3 Clutter Edges .............................................
315 315 320 322
10.4
Spectral Heterogeneity ...........................................
325
10.5
CNR-induced Spectral Mismatch ............................
327
10.6
Targets in the Secondary Data ...............................
330
10.7
Joint Angle-Doppler Mismatch and Clutter Heterogeneity .........................................................
337
10.8
10.9
Site-specific Examples of Clutter Heterogeneity ..... 10.8.1 Measured Multichannel Airborne Radar Data ........................................................... 10.8.2 Site-specific Simulation .............................
339 339 342
STAP Techniques in Heterogeneous Environments .......................................................... 10.9.1 Data-dependent Training Techniques ....... 10.9.2 Minimal Sample Support STAP ................. 10.9.3 Clutter Discretes ........................................ 10.9.4 Targets in Training Data ............................ 10.9.5 Covariance Matrix Tapers ......................... 10.9.6 Knowledge-aided Space-time Processing .
344 344 348 350 350 351 352
10.10 Summary ................................................................
353
10.11 Acknowledgments ...................................................
353
11. Adaptive Weight Training for Post-Doppler STAP Algorithms in Non-homogeneous Clutter ............................
359
11.1
Introduction .............................................................
359
11.2
Training of STAP Algorithms ..................................
361
11.3
Post-Doppler STAP Algorithms ..............................
364
This page has been reformatted by Knovel to provide easier navigation.
xii
Contents 11.4
Phase and Power-selected Training for STAP .......
365
11.5
Experimental Results .............................................. 11.5.1 Example of Phase/Power Selection .......... 11.5.2 STAP Results ............................................ 11.5.3 Experimental Versus Theoretical STAP Performance ..............................................
367 368 369
Summary ................................................................
372
12. Application of Deterministic Techniques to STAP ..............
375
11.6 12.1 12.2 12.3 12.4
Introduction .............................................................
372
375
3
Direct Data Domain Least-squares (D LS) Approach, One Dimension ...................................... 3
D LS Approach with Main Beam Constraints .........
379 385
3
A D LS Approach with Main Beam Constraints for Space-time Adaptive Processing ............................ 12.4.1 Space-time D3LS Eigenvalue Processor .................................................. 12.4.2 Space-time D3LS Forward Processor ........ 12.4.3 Space-time D3LS Backward Processor ..... 12.4.4 Space-time D3LS Forward-backward Processor ..................................................
387 389 390 392 393
12.5
Determining the Degrees of Freedom ....................
394
12.6
An Airborne Radar Example ................................... 12.6.1 Simulation Setup ....................................... 12.6.2 Case I: Single Constraint Space-time Example ..................................................... 12.6.3 Case II: Multiple Constraint Space-time Example .....................................................
396 396
12.7
Conclusions ............................................................
408
12.8
List of Variables ......................................................
408
13. Robust Techniques in Space-time Adaptive Processing .... 13.1 Introduction ............................................................. 13.1.1 Initial Development of Space-time Adaptive Processing (STAP) Algorithms ..................
413 413
This page has been reformatted by Knovel to provide easier navigation.
398 403
414
Contents
xiii
13.1.2 Hypothesis Testing Problem ......................
417
13.2
Real-world Detection Environments .......................
418
13.3
Non-homogeneity – Causes and Impact on Performance ........................................................... 13.3.1 Signal Contamination ................................ 13.3.2 Non-homogeneity Detection ...................... 13.3.3 Knowledge-based Signal Processing ........ 13.3.4 Analysis of Degraded Performance Due to Non-homogeneity ..................................
420 423 425 428 428
13.4
Antenna Array Errors ..............................................
430
13.5
Deviation from Gaussian Assumption .....................
431
13.6
Jamming and Terrain Scattered Interference ......... 13.6.1 Constraining Detection Schemes .............. 13.6.2 Two-stage Processors ............................... 13.6.3 Three-dimensional STAP ..........................
433 434 434 436
13.7
Reduction in Computational Complexity ................. 13.7.1 Reduced-rank Methods and Covariance Matrix Tapers ............................................ 13.7.2 Techniques Implementing Limited Reference Cells ......................................... 13.7.3 Low Complexity Approaches to STAP .......
437
Conclusions ............................................................
443
13.8
437 439 441
Color Plates: Applications of Space-time Adaptive Processing .............................................................. 463a
Section B. Miscellaneous Space-time Processing Applications Part V.
Ground Target Tracking with STAP Radar
14. Ground Target Tracking with STAP Radar: the Sensor .....
467
14.1
Introduction .............................................................
467
14.2
Properties of the STAP Radar Sensor .................... 14.2.1 Processing Techniques .............................
467 468
This page has been reformatted by Knovel to provide easier navigation.
xiv
Contents 14.2.2 Array Properties ......................................... 14.2.3 Summary of the Data Output Provided by the STAP Radar ........................................
473
14.3
The Scenario .......................................................... 14.3.1 SNIR and Pd of a Moving Target ............... 14.3.2 System Aspects .........................................
474 474 480
14.4
Degrading Effects ................................................... 14.4.1 Bandwidth Effects ...................................... 14.4.2 Doppler Ambiguities .................................. 14.4.3 Range Ambiguities .................................... 14.4.4 STAP Radar under Jamming Conditions ...
486 486 488 489 492
14.5
Issues in Convoy Tracking ...................................... 14.5.1 Convoy Detection by Range-only Information ................................................. 14.5.2 Convoy Detection by Azimuth Variance Analysis .....................................................
494
Summary ................................................................
499
15. Ground Target Tracking with STAP Radar: Selected Tracking Spects ..................................................................
501
14.6
472
495 496
15.1
Introduction ............................................................. 15.1.1 Discussion of an Idealised Scenario .......... 15.1.2 Summary of Observations .........................
501 502 505
15.2
Tracking Preliminaries ............................................ 15.2.1 Coordinate Systems .................................. 15.2.2 Target Dynamics Model ............................
507 507 509
15.3
GMTI Sensor Model ................................................ 15.3.1 GMTI Characteristics ................................. 15.3.2 Convoy Resolution .................................... 15.3.3 Doppler Ambiguities .................................. 15.3.4 Measurements ...........................................
510 510 512 513 513
15.4
GMTI Data Processing ........................................... 15.4.1 Prediction .................................................. 15.4.2 Data Processing ........................................
514 514 515
This page has been reformatted by Knovel to provide easier navigation.
Contents
xv
Filtering Process ........................................ Realisation Aspects ................................... Discussion ................................................. Retrodiction ............................................... Effect of Doppler Ambiguities ....................
517 518 519 522 524
15.5
Road Map Information ............................................ 15.5.1 Modelling of Roads .................................... 15.5.2 Densities on Roads ...................................
528 529 530
15.6
Quantitative Discussion .......................................... 15.6.1 Simulation Parameters .............................. 15.6.2 Numerical Results .....................................
533 533 534
15.7
List of Variables ......................................................
537
15.4.3 15.4.4 15.4.5 15.4.6 15.4.7
Part VI.
Space-fast Time Techniques
16. Superresolution and Jammer Suppression with Broadband Arrays for Multifunction Radar ......................... 16.1
Introduction .............................................................
16.2
Broadband Array Signal Model and Beamforming .......................................................... 16.2.1 Received Signal and Notation ................... 16.2.2 Digital Beamforming with Subarray Outputs ...................................................... 16.2.3 Influence of Channel Imperfections ...........
16.3
16.4
543 543 544 545 548 553
Superresolution with Broadband Arrays ................. 16.3.1 Spatial-only Processing of Broadband Data ........................................................... 16.3.2 Space and Time Processing Methods ....... 16.3.3 Conclusions on Broadband Superresolution .........................................
559
Jammer Suppression with Broadband Arrays ........ 16.4.1 General Principles of Adaptive Interference Suppression .......................... 16.4.2 Spatial-only Adaptation .............................
582
This page has been reformatted by Knovel to provide easier navigation.
561 566 581
583 589
xvi
Contents 16.5
Part VII.
16.4.3 Space and Time Adaptation ......................
590
Final Remarks .........................................................
595
Over-the-horizon Radar Applications
17. Stochastically Constrained Spatial and Spatio-temporal Adaptive Processing for Non-stationary Hot Clutter Cancellation ........................................................................ 17.1
Overview .................................................................
17.2
SC STAP Fundamentals and Supervised Training Applications .............................................. 17.2.1 SC STAP Algorithm: Analytic Solution ...... 17.2.2 SC STAP Algorithm: Operational Routines .................................................... 17.2.3 SC STAP Algorithm: Efficiency Analysis by Simulation Results ................................ 17.2.4 SC STAP Algorithm: Efficiency Analysis by Real Data Processing ........................... 17.2.5 Summary ...................................................
17.3
17.4
603 603 604 611 624 626 638 642
SC STAP Unsupervised Training Applications ....... 17.3.1 Operational Routine for Unsupervised Training ..................................................... 17.3.2 Operational SC STAP Algorithm: Simulation and Real Data Processing Results ...................................................... 17.3.3 Summary ...................................................
647
SC STAP Convergence Analysis ............................ 17.4.1 Introduction ................................................ 17.4.2 Conditional Loss Factor η1 Analysis: LSMI Versus SMI for SC SAP ................... 17.4.3 Conditional Loss Factor η1 Analysis: LSMI for SC STAP .................................... 17.4.4 Conditional Loss Factor η2 Analysis: Exact PDF for a Single Stochastic Constraint .....
665 665
This page has been reformatted by Knovel to provide easier navigation.
649
656 664
667 678 681
17.5
Part VIII.
Contents
xvii
17.4.5 Conditional Loss Factor η2 Analysis: Approximate PDF for Multiple Stochastic Constraints ................................................
685
List of Variables ......................................................
690
Applications in Acoustics and Seismics
18. Space-time Adaptive Matched Field Processing (STAMP) .............................................................................
701
18.1
Introduction .............................................................
701
18.2
Adaptive Matched Field Processing (MFP) ............
703
18.3
Wideband-narrowband Feedback Loop Whitenoise-constrained Method (FLWNC) ......................
705
18.4
MFP Examples .......................................................
707
18.5
Space-time Adaptive Matched Field Processing (STAMP) .................................................................
709
Forward Sector Processing Simulation Geometry ................................................................
711
Summary ................................................................
713
19. Space-time Signal Processing for Surface Ship Towed Active Sonar .......................................................................
715
18.6 18.7
19.1
Introduction .............................................................
715
19.2
Narrowband Multiple Ping Processing .................... 19.2.1 Data Model ................................................ 19.2.2 Fully Adaptive CW Processing .................. 19.2.3 Partially Adaptive Processing Techniques ................................................
720 720 721
19.3
FM Processing ........................................................ 19.3.1 Image Processing Background .................. 19.3.2 Echogram Image Enhancement ................ 19.3.3 Automatic Echogram Detection .................
724 726 726 726
19.4
Experimental Results .............................................. 19.4.1 Sonar System Description .........................
727 727
This page has been reformatted by Knovel to provide easier navigation.
723
xviii
Contents 19.4.2 19.4.3 19.4.4 19.4.5
CW Pulse Sea Data Analysis .................... Echogram Sea Data Analysis (ACTAS) .... Echogram Enhancement ........................... Automatic Echogram Detection .................
728 729 730 730
20. EM and SAGE Algorithms for Towed Array Data ...............
733
20.1
Introduction .............................................................
733
20.2
Signal Model ...........................................................
734
20.3
EM and SAGE Algorithms ...................................... 20.3.1 EM Algorithm ............................................. 20.3.2 SAGE Algorithm ........................................
736 736 739
20.4
Fast EM and SAGE Algorithms ..............................
741
20.5
Recursive EM and SAGE Algorithms ..................... 20.5.1 Recursive EM Algorithm ............................ 20.5.2 Recursive SAGE Algorithm .......................
742 743 745
20.6
Experimental Results .............................................. 20.6.1 EM and SAGE Algorithms ......................... 20.6.2 Recursive EM and SAGE Algorithms ........
746 747 749
20.7
Conclusions ............................................................
751
21. The Common Reflection Surface (CRS) Stack – a Datadriven Space-time Adaptive Seismic Reflection Imaging Procedure ...........................................................................
755
21.1
Introduction .............................................................
755
21.2
Seismic Reflection Imaging .................................... 21.2.1 The Seismic Wavefield .............................. 21.2.2 Acquisition of Reflection Seismic Data ...... 21.2.3 Seismic Reflection Processing ..................
756 756 758 762
21.3
Common Reflection Surface Stack ......................... 21.3.1 Classic Data-driven Approaches ............... 21.3.2 Second-order Traveltime Approximations .. 21.3.3 Physical Interpretation of the Coefficients ... 21.3.4 Implementation .......................................... 21.3.5 Practical Aspects .......................................
766 767 768 769 771 772
This page has been reformatted by Knovel to provide easier navigation.
Contents
xix
21.3.6 A Synthetic Data Example .........................
773
21.4
CRS Attributes and Velocity Model Estimation .......
775
21.5
Conclusions ............................................................
777
21.6
Glossary .................................................................. 21.6.1 List of Variables ......................................... 21.6.2 Specific Terminology .................................
778 778 779
Part IX.
Space-time Techniques in Communications
22. STAP for Space/Code/Time Division Multiple Access Systems ..............................................................................
785
22.1
Introduction .............................................................
785
22.2
System Model .........................................................
789
22.3
Time Domain Linear Joint Detection ....................... 22.3.1 Zero Forcing Block Linear Equalisation ..... 22.3.2 Minimum Mean Square Error Block Linear Equalisation ....................................
791 792
22.4
Frequency-domain Linear Joint Detection .............. 22.4.1 Block-diagonal FD System Model ............. 22.4.2 FD ZF-BLE and MMSE-BLE ......................
793 793 796
22.5
Performance of FD Joint Detection ......................... 22.5.1 Exploitation of Spatial and Frequency Diversity ..................................................... 22.5.2 Intracell Interference Cancellation ............. 22.5.3 Intra- and Intercell Interference Cancellation ...............................................
797 798 804
22.6
Conclusions ............................................................
821
22.7
List of Variables ...................................................... 22.7.1 Variables with Roman/Calligraphic Letters ....................................................... 22.7.2 Variables with Calligraphic Letters ............ 22.7.3 Variables with Greek Letters .....................
822
This page has been reformatted by Knovel to provide easier navigation.
793
813
822 823 823
xx
Contents
23. Underwater Communication with Vertical Receiver Arrays .................................................................................
827
23.1
Introduction .............................................................
827
23.2
The Underwater Acoustic Channel ......................... 23.2.1 Transmission Loss and Ambient Noise ..... 23.2.2 Sound Speed Variability ............................ 23.2.3 Multipath Propagation ............................... 23.2.4 Doppler Effect ............................................ 23.2.5 Summary ................................................... Underwater Acoustic Communications – a Brief Overview ................................................................. 23.3.1 Incoherent Digital Receivers ...................... 23.3.2 Coherent Digital Receivers ........................
828 828 829 830 831 832
23.4
Spatial-temporal Receiver Architecture .................. 23.4.1 Communication Over Channels with ISI .... 23.4.2 Multichannel Digital Receiver .................... 23.4.3 Signal Model .............................................. 23.4.4 Multichannel Equalisation ..........................
834 834 835 837 839
23.5
Multichannel Constant Modulus Algorithm ............. 23.5.1 Blind Stochastic Gradient Descent Algorithms .................................................. 23.5.2 The Constant Modulus Algorithm .............. 23.5.3 Experimental Results ................................
841 841 842 844
23.6
Super-exponential Blind Equalisation ..................... 23.6.1 Iterative Shalvi-Weinstein Algorithm .......... 23.6.2 Recursive Shalvi-Weinstein Algorithm ....... 23.6.3 Adaptive Implementation ........................... 23.6.4 Experimental Results ................................
847 847 849 850 853
23.7
Concluding Remarks ..............................................
853
23.3
24. Reduced-rank Interference Suppression and Equalisation for GPS and Downlink CDMA ............................................. 24.1 Reduced-rank Interference Suppression and Equalisation ............................................................ This page has been reformatted by Knovel to provide easier navigation.
832 832 833
857 857
Contents 24.1.1 Motivation for Reduced-rank MMSE Processing ................................................. 24.1.2 Understanding the Multistage Wiener Filter .......................................................... 24.1.3 Lattice Structure of the MSWF .................. 24.1.4 MSWF Related to Wiener-Hopf Filter Weights .....................................................
xxi 857 858 861 862
24.2
Application of MSWF to CDMA Downlink ............... 24.2.1 Introduction ................................................ 24.2.2 Data and Channel Model ........................... 24.2.3 Edge of Cell/Soft Hand-off ......................... 24.2.4 Chip-level MMSE Estimator ....................... 24.2.5 Performance Examples .............................
24.3
Application of MSWF to GPS Jammer Suppression ............................................................ 24.3.1 Introduction ................................................ 24.3.2 Power Minimisation and Joint Spacetime Preprocessing .................................... 24.3.3 Space-time Filter Characteristics ............... 24.3.4 Data and Channel Model ........................... 24.3.5 Dimensionality Reduction Techniques ...... 24.3.6 Performance Examples .............................
871 872 873 875 876
Summary of Concepts Involving Reduced-rank Filtering ...................................................................
879
25. Introduction to Space-time Coding .....................................
883
24.4
864 864 865 866 866 868 871 871
25.1
Introduction .............................................................
883
25.2
Multiple Antenna Channel Model ............................
885
25.3
Benefits of Smart Antenna Technology .................. 25.3.1 Array Gain ................................................. 25.3.2 Diversity Gain ............................................ 25.3.3 Multiplexing Gain ....................................... 25.3.4 Interference Reduction ..............................
887 887 888 891 893
This page has been reformatted by Knovel to provide easier navigation.
xxii
Contents 25.4
Background on Space-time Codes ......................... 25.4.1 Space-time Trellis Codes .......................... 25.4.2 Linear Space-time Block Codes ................
894 895 897
25.5
New Design Criteria ................................................ 25.5.1 Error Performance ..................................... 25.5.2 Capacity Performance ............................... 25.5.3 Unified Design ...........................................
898 899 900 901
25.6
Receiver Design ..................................................... 25.6.1 Modulation and Coding for MIMO .............
905 905
25.7
Concluding Remarks ..............................................
906
Index .................................................................................. 909
This page has been reformatted by Knovel to provide easier navigation.
Section A
Suppression of clutter in moving radar
Parti
Space-slow time processing for airborne MTI radar
Chapter 1
Space-time adaptive processing for manoeuvring airborne radar Peter G. Richardson
1.1
Introduction
Since the early 1970s, STAP (space-time adaptive processing) methods have been actively considered for look down airborne radar where target signals have to compete with strong ground clutter returns. Most previous STAP research has been devoted to SLAR (sideways looking airborne radar) applications where the plane of the receiving antenna is coaligned with the direction of travel. For this type of antenna configuration, a linear relationship between the angular location and Doppler frequency of the clutter can be exploited to allow clutter rejection and enhanced signal detectability via twodimensional filtering in the spatial and temporal frequency domains. Appropriate filters can be realised via the sampling of a coherent pulse train with a phased array antenna. Non-adaptive filter weight solutions which theoretically achieve full clutter suppression in SLAR correspond to the well known DPCA (displaced phase centre antenna) technique, e.g. see References 1 and 16. STAP offers the advantage over DPCA that the filter weights are calculated adaptively. This leads to robustness in the presence of errors, e.g. amplitude and phase mismatch errors between channels, or drift in the platform velocity. STAP also offers the capability to simultaneously suppress jamming and clutter. Over the last ten years there has been growing interest in applying STAP techniques for clutter suppression in antenna orientations other than sideways looking. These include forward-looking array geometries where the plane of the array is transverse to the direction of travel [2-^], inclined looking array applications [5,6] and circular arrays [7]. In these cases, the plane of the receiving antenna is not coaligned with the direction of travel, and hence there is no longer a linear dependence between the clutter Doppler and spatial frequencies. The implication of this is that STAP cannot provide the full clutter cancellation that is theoretically possible in SLAR. However,
a major advantage of STAP over conventional MTI (moving target indication) and Doppler filtering approaches is that it offers the capability for detecting slow moving targets, i.e. targets lying within the Doppler bandwidth of mainlobe clutter. Analysis and simulations performed in Reference 2 demonstrated that a sideways looking geometry is not a prerequisite to achieving this. Further improvements in detection of targets, that are masked by either mainlobe or sidelobe clutter, can be achieved by adapting in a range-dependent, or Doppler-dependent manner. Examples of rangedependent STAP can be found in References 4 and 7. Examples of Doppler dependent, or post-Doppler, STAP can be found in Reference 7 and Chapter 9 of Reference 8. SLAR related STAP research is often directed at applications where the airborne radar platform is very large and stable, and the effects of platform manoeuvre can be assumed to be small. Typical examples are AEW (airborne early warning) radar [9-11 ] and SAR (synthetic aperture radar) [12]. In contrast, in forward-looking applications the array antenna is likely to be located in the nose of a relatively small and highly manoeuvrable airborne platform, e.g. as in AI (airborne intercept) radar. In these cases, the radar may be required to function while the platform is performing a steep dive or rolling at a significant rate, and any assumption that platform manoeuvre effects are negligible is less easy to justify. In this chapter we will assess the benefits of using STAP for simultaneously suppressing clutter and jamming in forward-looking phased array geometries. We will initially analyse the clutter suppression problem and show how significant rejection of both mainlobe and sidelobe clutter is achievable using appropriate STAP approaches. This will include a brief consideration of the effects of variations in the platform orientation (rather than the array orientation) on clutter suppression and slow moving target detection. The effects of manoeuvre (e.g. platform yaw, pitch or roll) on clutter and jammer suppression will then be examined and the relative merits of various approaches for compensating for platform manoeuvre will be assessed.
1.2
STAP fundamentals
Throughout this chapter we will consider a pulse-Doppler airborne radar where a coherent burst of M pulses are transmitted at a pulse repetition interval r. We will assume a phased array receive antenna consisting of N elements. The data received by the array at a time t can then be represented by the space-time snapshot vector: x(t) = [x\ (t), x2(t), X 3 (O,-.., xM(t)]T = [xi(t),xi(t
- r),xi(t - 2 r ) , . . . 9xx(t - (M - l ) r ) f e CMNxl
(1.1)
where xm(t) e CNxl is used to denote the spatial snapshot of data corresponding to the (M — m + l)th pulse repetition interval (PRI). In large phased array applications, the cost and complexity associated with digital adaptive processing at element level leads to the need to reduce the number of spatial channels. This reduction is commonly achieved by analogue beamforming of subarrays of elements prior to digitisation, e.g. see References 2, 4 and 8, Chapter 6.
For a system with K subarrays, the spatial channel reduction can be described mathematically by the NM by KM transformation matrix T:
(1.2)
where IM is the M by M identity matrix, Ts is the N by K spatial transformation matrix (K < N) and (g) represents the Kronecker matrix product. The transformed (KM by 1) space-time snapshot vector is then: xT(t) = THx(t)
(1.3)
where the superscript H denotes the Hermitian transpose operator. It is well known that the subarray level STAP filter weights that maximise the output signal-to-noise-plus-interference ratio (SNIR) are given by: w = kQ^sT(0,(/>,fD) = k[THQT]-lTHs(6,(f>,fD)
(1.4)
KMxKM
where k is an arbitrary scalar, QT e c is the covariance matrix of the transformed (subarray level) interference plus noise data, Q e cNMxNM is the corresponding covariance matrix for element level data, sj(#, 0, /b) is the subarray level steering vector for the target signal Doppler frequency / D and direction of arrival (0,0), and s(0,0, / b ) is the corresponding element level steering vector for the target signal. The covariance matrix QT must be estimated from the incoming data and this is usually achieved by forming the maximum likelihood estimate: (1.5) where S is the number of weight training data samples employed. It is usual to collect training data from range samples (sometimes referred to as fast time samples) that neighbour the range (or ranges) at which the adaptive weights are applied. Assuming data that are jointly Gaussian, independent and identically distributed, the value of S must exceed 2KM — 3 ~ 2KM to ensure that the expectation of the ratio of the adaptive SNIR to optimum SNIR is greater than 0.5 (i.e. the loss due to covariance matrix estimation is less than 3 dB) [13]. To determine the optimal subarray weights in equation (1.4), we clearly need to collect data from all M pulses. However, for large values of M it is unlikely that there will be sufficient weight training data available to support the adaptive weight calculation. In addition, the estimation and inversion of a KM by KM covariance matrix may not be possible in real time. It is therefore almost always necessary to reduce the number of temporal channels within the STAP architecture. This can be achieved using pre-Doppler STAP, where a space-time adaptive filter with a small number of tap delays is used as a prefilter to conventional Doppler processing, or by using post-Doppler STAP, where STAP with a small number of temporal or Doppler channels is employed after Doppler processing. Further details of these approaches
can be found in References 8 and 14 and will not be repeated here. It should be noted that in Reference 8 the pre-Doppler STAP approach is referred to as 'space-time FIR filtering'. Although pre-Doppler STAP is usually the more computationally efficient of the two methods, the post-Doppler STAP approach is often found to provide the more effective clutter suppression. In this chapter, most consideration will be given to pre-Doppler STAP implemented at subarray level in a large circular planar array in a forward facing geometry. With this architecture, clutter and jamming suppression can be achieved simultaneously by adaptively combining the outputs of the K spatially separated subarrays and L (with L <$C M) temporally separated pulse returns. However, a further stage of conventional range-Doppler processing over M-L +1 pulses is necessary to recover the coherent processing gain for the desired signal. In pre-Doppler STAP, the adaptive weight solution is given by equation (1.4) with QT now being the KL by KL covariance matrix of the data and Sj(O9 0, / D ) now being the corresponding KL by 1 signal steering vector. The anticipated signal direction is known to within a transmit beamwidth accuracy, but in many practical cases information on the target Doppler frequency is unavailable. In these cases, the STAP calculation must either be repeated for a large number of trial signal Doppler frequencies or the calculation must be modified to exploit the a priori knowledge of the signal direction alone. The latter can be achieved using a linearly constrained power minimisation weight calculation where linear constraints that utilise the available information on the signal direction of arrival are specified. If the constraint equations are given in matrix form by: CHw=f
(1.6)
where C is the constraint matrix a n d / is the vector composed of the right-hand side values of the equations, then the optimal solution for w becomes: (1.7) Note that if we have a single constraint matched to the signal Doppler frequency and angular location, then equation (1.7) is identical to the maximum SNIR solution given in equation (1.4) with
In the absence of signal Doppler information, it is appropriate to use a constraint that exploits knowledge of the transmit beam look direction. This can be achieved using the approach described in Reference 15 whereby the spatial beamforming weights at one tap of the space-time filter are constrained using a constraint vector of the form: (1.8) where ep is the L by 1 constraint vector with unity value at the /7th element and zeroes elsewhere and S5(O9 (J)) is the A' by 1 spatial steering vector. This type of clamped tap constraint tends to bias the solution towards one which provides a uniform response
in Doppler at the beam look direction. However, it cannot prevent signal suppression in cases where the desired signal power is significantly above the background noise level. If the weights at the first tap are constrained, the approach is mathematically equivalent (apart from an arbitrary scaling factor) to the least-squares space-time FIR (finite impulse response) filtering approach described in Chapter 7 of Reference 8.
1.3
Clutter angle-Doppler relationships
1.3.1 Straight and level flight The shape and extent of the clutter domain within the multi-dimensional sampling space associated with space-time filtering provides a useful insight into the magnitude of the clutter rejection problem. For space-time filtering implemented in a planar array, the clutter spectrum is confined to a three-dimensional sampling space defined by the temporal (Doppler) frequency component, / D , the azimuthal spatial frequency component, / u and the elevation spatial frequency component / v . Consider a regular planar array of elements in the airborne geometry shown in Figure 1.1. The radar platform is travelling parallel to the ground at constant velocity v with the velocity vector at an azimuth angle \j/ relative to the x-axis. The receiving array is located in the y-z plane with the antenna normal parallel to the x-axis. The angular location of the clutter scatterer S is defined by the azimuth angle 0 and elevation angle (declination in this case) 0. Assuming the receiving array is regular with horizontal interelement spacing du and vertical spacing dy, we can then define normalised Doppler and spatial frequencies for S:
(1.9)
Figure 1.1 Airborne radar geometry - straight and level flight
where normalisation has been carried out with respect to the sampling frequencies. Using equation (1.9), it is easily shown that the relationship between the Doppler frequency / b and spatial frequencies / u , / v is given by: (1.10) where /3U = 2vr/du and /3y = 2vr/dv. Equation (1.10), which defines an ellipsoid, holds for all clutter scatterers, regardless of the scatterer location S. It therefore follows that clutter collected from all ranges and all angles is confined to the surface of an ellipsoidal bowl with the major axis, in general, cutting diagonally through the three-dimensional sampling space. Maximum eccentricity arises as the geometry tends towards the sideways looking case (i.e. ^r -> 90°), with the bowl degenerating to a plane in the limit (\fr = 90°). In this case, the simple linear relationship between / u and / D can be exploited to achieve full clutter suppression. Minimum eccentricity arises in the case of a forward-looking array (i.e. \j/ = 0°). The clutter surface that results for a forward facing array geometry with clutter unambiguous in Doppler is shown in Figure 1.2. Note that there is an implicit narrowband assumption here (i.e. k can be assumed to be constant). For finite bandwidths the clutter bowl has finite thickness. In Figure 1.2 far range returns (shallow elevation angle) correspond to the rim of the clutter bowl, whereas short range returns correspond to the bottom of the bowl. It has been assumed that the antenna has a negligible backlobe response, and hence only that part of the bowl corresponding to returns emanating from forward of the array is shown. The bowl corresponds to all clutter returns (both mainlobe
Figure 1.2
Clutter bowl -forward-looking geometry, straight and level flight
and sidelobe) received by the array. For cases where a narrow beam is scanned away from array broadside at a shallow grazing angle to ground, the mainlobe clutter corresponds to a small patch on the bowl surface as shown in Figure 1.2. Slow moving targets are likely to lie within the spatial and Doppler bandwidth of mainlobe clutter, but will usually be distinct from the bowl surface as also shown in Figure 1.2. Two-dimensional filtering is therefore appropriate for separating the targets from the clutter.
1.3.2 Effect of variations in platform orientation The effect of changing the orientation of the radar platform, relative to the ground, is merely to cause a rotational transformation of the coordinate system shown in Figure 1.2. Providing that the velocity vector remains in the same fixed position relative to the array face, the shape and total extent of the clutter is unaffected by the transformation, i.e. the clutter is still confined to the surface of an ellipsoidal bowl. This is of significance for a manoeuvring platform as it implies that any change in the platform orientation that takes place between adaptive processing intervals will not significantly change the clutter suppression problem. However, it should be noted that stationarity is assumed within the adaptive processing and any changes in the clutter angle-Doppler characteristics within this interval will have a detrimental effect. In addition, the distribution of sidelobe clutter power does change with platform orientation, e.g. clutter returns that previously corresponded to the backlobes of the antenna could be moved into the beam sidelobe region. An example is shown in Figure 1.3 for the extreme case where the radar platform is diving directly at the ground (i.e. a 90° dive). All of the ground is now visible to the receiving antenna, and hence the clutter covers the full surface of the bowl.
Figure 1.3 Clutter bowl - diving platform
1.4
Clutter suppression in forward-looking radar
1.4.1 Mainlobe clutter suppression Ideally, to achieve full clutter suppression, we require a single STAP weight solution which places a null response over the entire surface of the clutter bowl and maintains a reasonable gain at the target location in angle and Doppler. Although a STAP solution of this type does not exist, adequate control of sidelobe clutter can often be achieved by maintaining low sidelobe levels on both transmit and receive. The problem that then remains is detection of slow moving targets possessing a low radial velocity (i.e. targets which have only a small component of velocity in the direction of the receiving radar) as these are often indistinguishable in Doppler from the mainlobe ground clutter returns. Scenario geometries involving crossing targets, e.g. as shown in Figure 1.4, are particular cases where the slow moving target detection problem occurs. In many practical applications, mainlobe clutter corresponds to only a small patch on the bowl surface as depicted in Figure 1.2. In these circumstances it is usually possible to produce a null response over the mainlobe clutter patch, while maintaining a high response to slow moving targets, using STAP with small numbers of spatial and temporal degrees of freedom. This is extremely important as slow moving targets that lie within the Doppler and spatial extent of mainlobe clutter cannot be detected using conventional Doppler processing approaches or by maintaining low beampattern sidelobe levels. Details of a mechanism by which the mainlobe clutter cancellation can be achieved using a space-time filter implemented in a forward-looking linear array of sensors are given in Reference 2. For a linear array geometry, the clutter rejection problem becomes two-dimensional and it is appropriate to consider the projection of the clutter bowl onto the / u , / D plane. At shallow grazing angles, the mainlobe clutter becomes confined to an arc close to the perimeter of the resulting elliptical clutter domain within the two-dimensional sampling space. An approximate linear relationship between
platform velocity v
tncget velod^
Figure 1.4 Forward-looking radar - slow moving target geometry
/ u and / D can then be exploited (see Reference 2) to produce a null along the mainlobe clutter arc while maintaining an adequate response to slow moving targets possessing the same Doppler frequency as the clutter. In the analysis in Reference 2, it is assumed that the sensors have virtually identical responses (to within a scaling factor) for clutter scatterers within the main beam footprint. Using the additional assumption that the transmit beam is at a shallow grazing angle to the ground, it is then shown that the space-time filter response can be rendered virtually independent of the mainlobe clutter scatterer location if the sensor spacing d satisfies: d = 2vrtan(f)o
(1.11)
where 0o is the beam scan direction in azimuth. It then follows that a null response to mainlobe clutter can be achieved using a very simple space-time filter comprising two sensors with two taps per spatial channel by subtracting the return from one sensor from the return received one PRI earlier at the other sensor. In Reference 2 the sensor spacing condition given by equation (1.11) is referrred to as a 'localised DPCA condition' as it is analogous to the condition d = 2vx required for range-ambiguous DPCA clutter cancellation in SLAR and downwards looking radar [16]. An important difference is that, in the forward-looking airborne radar case, the condition is dependent on azimuth scan angle implying that, for a fixed antenna, the PRI must be changed with the beam steer direction. However, in STAP the two phase centre locations required to satisfy equation (1.11) can be synthesised adaptively and hence the PRI restriction no longer exists. It should be noted that for broadside steer (i.e. 0 = 0°) equation (1.11) leads to a required sensor spacing of zero, thus implying that conventional MTI (moving target indication) filtering is virtually optimal. It is in scenarios where the beam is at large scan angles that STAP offers the most significant advantages over conventional MTI clutter rejection, as it is in these cases that the mainlobe clutter encompasses a large Doppler spread and is most likely to mask targets of interest. The potential benefits of using STAP for slow moving target detection in forwardlooking airborne radar are illustrated in Figures 1.5 and 1.6. The surface plots show the range-Doppler detection maps that resulted from simulations of a radar operating at an altitude of 5 km in a medium PRF (pulse repetition frequency) mode where data are both range and Doppler ambiguous. The receiving antenna was a circular planar array consisting of 2029 elements. The radar beam was steered at a declination of 5° and scanned off at an angle of 60° from broadside to the array. A slow moving target, lying within the Doppler bandwidth of mainlobe clutter, was located at the beam steer direction. The range-Doppler map that resulted from applying conventional beamforming and Doppler processing to a burst of 16 pulses is shown in Figure 1.5. The slow moving target is masked by mainlobe clutter which is evident as a ridge centred at zero Doppler. The result obtained using pre-Doppler STAP with M = 16 (pulses), L = 2 (taps) and K = 16 (subarrays), with the clutter covariance matrix estimated from all range samples available within coherent processing interval, is shown in Figure 1.6. In this case, the mainlobe clutter has been suppressed leaving the slow moving target clearly evident. For the STAP result, the vertical stave subarray scheme shown in Figure 1.7 was employed. The result therefore confirms that elevation degrees of
range Doppler
Figure 1.5
Range-Doppler map - conventional output, shallow grazing angle scenario (from [17J)
target
range Doppler
Figure 1.6
Range-Doppler map - 2 tap STAP, 16 vertical stave subarrays, shallow grazing angle scenario (from [17J)
freedom are unnecessary to achieve slow moving target detection in scenarios where the beam is at a shallow grazing angle to the ground. In radar geometries where the beam is at a steep grazing angle to the ground, the mainlobe clutter patch is close to the bottom of the bowl and the Doppler
Figure 1.7
Circular planar array partitioned into 16 vertical stave subarrays
frequency, / D , becomes dependent on both azimuth and elevation spatial frequencies /u> / v Consequently, to achieve slow moving target detection, a planar array offering azimuth and elevation degrees of freedom is required [17]. Elevation degrees of freedom are also important for rejection of range-ambiguous clutter [3]. The advantage to be gained through having elevation degrees of freedom in forward-looking airborne radar is illustrated in Figures 1.8 and 1.9. The results are again for a medium PRF scenario, but now with the radar beam steered at a declination of 30° and azimuth of 54°. The simulated data was processed using pre-Doppler STAP with three taps per subarray channel. Figure 1.8 shows the range-Doppler map obtained using the vertical stave scheme shown in Figure 1.7. The lack of vertical degrees of freedom has, in this case, led to poor mainlobe clutter rejection and residual clutter is evident in the range-Doppler map. Figure 1.9 shows the corresponding result for the array partitioned into 16 subarrays in the chequerboard type configuration shown in Figure 1.10. In this case the combination of both elevation and azimuthal degrees of freedom has led to the clutter being successfully rejected leaving the slow moving target clearly visible. Simulation results illustrating that STAP slow moving target detection performance is not sensitive to variations in the platform orientation are shown in Figures 1.11 and 1.12. These results correspond to the case of a radar platform diving directly at the ground (i.e. a 90° dive). The radar beam was again scanned to a direction of 60° away from broadside, thus causing the mainlobe clutter to be extended in Doppler. The range-Doppler map that resulted from conventional beamforming and Doppler processing is given in Figure 1.11. In comparison with the previous
range Doppler
Figure 1.8
Range-Doppler map - 3 tap STAP, 16 vertical stave sub arrays, steep grazing angle scenario (from [17J)
target
range Doppler
Figure 1.9
Range-Doppler map - 3 tap STAP, 16 chequerboard subarrays, steep grazing angle scenario (from [17J)
Figure 1.10
Circular planar array partitioned into 16 chequerboard subarrays
range Doppler
Figure 1.11 Range-Doppler map - conventional output, steep dive scenario (from [17])
target
range Doppler
Figure 1.12
Range-Doppler map - 3 tap STAP output, steep dive scenario (from [17])
results, the mainlobe clutter is foreshortened in range due to the extremely steep grazing angle of the beam. However, the slow moving target is obscured because it is range-ambiguous with the mainlobe clutter and lies within the mainlobe clutter bandwidth Doppler. Figure 1.12 shows the result of applying pre-Doppler STAP using the chequerboard subarray configuration shown in Figure 1.10 with three taps per subarray channel. The mainlobe clutter has again been rejected to reveal the slow moving target.
1.4.2 Sidelobe clutter suppression In low altitude scenarios, the interference suppression afforded by the transmit and receive beam sidelobes may be insufficient to allow detection of targets that compete in both range and Doppler with sidelobe clutter. Short range clutter, emanating from the nadir region, can be particularly dominant and may mask targets of interest in range-ambiguous modes of operation. There is therefore considerable interest in STAP techniques that can provide suppression of both mainlobe and sidelobe clutter. To achieve full clutter suppression using space-time filtering in forward-looking airborne radar requires a weight solution that can place a null over the full surface of the clutter bowl. Although this is not generally possible, significant levels of rejection of both mainlobe and sidelobe clutter can be achieved with planar arrays offering very high levels of digitisation. In these cases, STAP solutions providing a low response over large regions of the clutter bowl are realisable. Simulation results that demonstrate this for a small 61 element hexagonal array are shown in
range
Doppler
Range-Doppler image - conventional output, 61 element planar array
range
Figure 1.13
Doppler
Figure 1.14
Range-Doppler image - 4 subarray, 3 tap STAP output, 61 element planar array
Figures 1.13, 1.14 and 1.15. The results are for a medium PRF scenario with the radar platform at a height of 300 m and the beam scanned to an angle 45° away from broadside. Two adaptive array architectures were considered in the simulations. In the first, the array was partitioned into four quadrant subarrays and the processing was carried out at subarray level. In the second architecture, the adaptive processing was performed at element level. Figure 1.13 shows the range-Doppler image obtained from conventional beamforming and coherent integration of 64 pulses. A slow moving target is masked by a band of mainlobe clutter that covers all ranges. There are also significant levels of
range
Doppler
Figure 1.15
Range-Doppler image - 61 spatial channel, 3 tap STAP output, 61 element planar array
sidelobe clutter evident in the range-Doppler image and this has masked a target with a higher relative velocity. Figure 1.14 shows the result of using three tap pre-Doppler STAP applied at subarray level with the adaptive weights calculated using a clamped tap constraint (as described in Section 1.2) and data collected over the full ambiguous range interval. In this case STAP has just rendered the slow moving target visible, but the fast target is still masked by residual sidelobe clutter. It is also apparent that adapting to reject the mainlobe clutter has caused a raising of the thermal noise level. The slow moving target SNIR value estimated directly from the range-Doppler detection map using a simple cell averaged CFAR (constant false alarm rate) scheme was 17.3 dB. The result illustrates that slow moving target detection is possible using STAP with very small numbers of spatial channels. Figure 1.15 shows the result of applying three tap pre-Doppler STAP at element level (i.e. using all 61 spatial channels). In this case, significant rejection of both mainlobe and sidelobe clutter has been achieved and both targets are now clearly visible. Target SNIR values estimated directly from the range-Doppler detection map were 17.8 dB for the slow moving target and 19.5 dB for the fast target. It should be noted that this level of performance has been achieved without resorting to range or Doppler-dependent STAP approaches. The clutter suppression performance of computationally efficient STAP techniques for large element digitised phased arrays is compared in Reference 18. Sidelobe clutter can be widely spread in angle and Doppler, and is difficult to suppress using subarrayed adaptive processing. Although it is not possible to form extended regions of low response in the angle-Doppler domain, the clutter rejection can be improved by sequentially adapting over different regions of the clutter bowl. This can be achieved by adapting in either a range dependent or Doppler dependent manner. The disadvantage is that fewer training data samples are available for
Figure 1.16
Clutter bowl partitioned into range intervals
covariance matrix estimation, i.e. S is reduced in equation (1.5). It is also possible to use a combination of range and Doppler-dependent adaptation, but this further limits the amount of available training data. The principles of the range and Dopplerdependent approaches are briefly described below. Figure 1.16 shows the clutter bowl partitioned into regions associated with different range intervals. It is apparent that adapting to data corresponding to each interval in turn leads to a reduction in the angle-Doppler region over which clutter rejection is required. For example, at long ranges the clutter from a particular range interval corresponds to a very narrow annular ring that is elliptical in the / u , / D domain. The clutter can then be adaptively rejected using spatial filtering alone, e.g. see Reference 19, or by using STAP to form an elliptical null in the corresponding / u , / D plane, e.g. see References 3 and 4. It should be noted that the former approach relies on a planar array with elevation degrees of freedom, which allow spatial nulls to be formed at particular elevation angles, and is not appropriate for detecting targets masked by mainlobe clutter. The potential benefits of using range-dependent STAP can be deduced from examining the way that the ground clutter returns are sampled. Figure 1.17 shows a linear array of sensors, with interelement spacing, d, in a forward-looking geometry. The radar platform is moving at velocity v parallel to the x-axis. The returns from the ground are sampled using a space-time filter with M taps per spatial channel with the tap delay equal to the radar PRI r. If we now consider the sampled ground returns (i.e. ground clutter) alone, then the space-time filter is equivalent to a virtual planar array with y-dimension spacing d and x-dimension spacing 2ur as shown in Figure 1.17. Returns from clutter at a single range lie at a constant cone angle from the array centre. Range-dependent clutter suppression therefore requires nulls to be formed at a fixed cone angle, y, from the equivalent planar array. It is well known that nulls of this type can be synthesised using circular array geometries. Hence rangedependent clutter suppression is possible using space-time filter weights that lead to
rangs iing
clutter
Figure 1.17
Equivalent spatial planar array, STAP in forward-looking airborne radar
the equivalent planar array being circular. The configuration of the two-dimensional array will, in general, be different for moving targets and hence desired signals will not be attenuated to the same degree as clutter. Only azimuthal and temporal degrees of freedom are employed in the space-time filtering approach shown in Figure 1.17. However, in practice, the nulls required for clutter suppression will have a finite width due to the limited range resolution associated with the transmitted pulse width. In low-altitude scenarios, the ground region associated with the shortest range returns can have a particularly large extent in elevation, and clutter suppression is therefore rendered difficult with a twodimensional filter operating in the / u and / D domain. Additional complications arise where the clutter is range ambiguous, as a circular null is then required for each ambiguous range ring. Some of the far range returns will be ambiguous with short range clutter covering significant areas of the angle-Doppler domain. Although it is possible to form concentric nulls in cases where there are large numbers of azimuthal and temporal degrees of freedom, the extended region over which rejection is required is likely to also lead to target suppression. Better performance is therefore achievable with range-dependent STAP implemented in a planar array offering elevation degrees of freedom. It is also interesting to note that for STAP implemented in a planar array, the equivalent spatial array shown in Figure 1.17 becomes a volume array. The implication of this is that nulls covering extended range regions are realisable, and hence the number of range intervals requiring independent adaptation can be greatly reduced. Figure 1.18 shows the clutter bowl partitioned into regions associated with different Doppler intervals. In this case the clutter for each Doppler interval corresponds to an annular ring that is elliptical in the / u , / v plane. Isolation of clutter into distinct Doppler bins can be achieved by carrying out full Doppler filtering prior to, rather than after, STAR Doppler-dependent STAP is usually referred to as post-Doppler STAP as it entails carrying out STAP after Doppler filtering.
Figure 1.18
Clutter bowl partitioned into Doppler intervals
If we assume idealised Doppler filtering with no sidelobe leakage of clutter into adjacent Doppler bins, then the clutter in each Doppler bin can be rejected using a spatial filter operating in the / u , / v domain. Clutter at a single Doppler frequency will be at a fixed cone angle from broadside to the array, and it follows that for clutter suppression, we require an elliptical null (semi-elliptical for straight and level flight) in the / u , / v domain. However, in practice, the Doppler filter sidelobe responses must be controlled using carefully chosen Doppler windowing functions. This leads to poorer Doppler resolution, which may necessitate the use of space-time filtering solutions in each Doppler bin. In addition, Doppler ambiguous clutter leads to a requirement for concentric spatial nulls. This can be mitigated using additional temporal (or Doppler) degrees of freedom in an analogous manner to the way elevation degrees of freedom can be used to counter the effects of range ambiguities in range-dependent STAR The simplest post-Doppler STAP approach of spatial adaptive beamforming after Doppler processing is often referred to as factored STAR It should be noted that for linear arrays in forward-looking geometries, clutter corresponding to a small Doppler interval can cover a significant extent in the spatial frequency (/ u ) dimension, and hence the benefits of using the post-Doppler STAP approach are unclear. In the following sections we will concentrate mainly on the effects of platform motion on slow moving target detection and jammer rejection. Range-dependent and Doppler-dependent STAP techniques, designed to enhance the rejection of strong sidelobe clutter, will therefore not be considered further.
1.5
Slow moving target detection under conditions of manoeuvre
1.5.1 Effects of platform manoeuvre In forward-looking radar applications, it is often necessary to detect and track targets while the radar platform is undergoing high rates of manoeuvre. The platform
ep 'ssoi >IINS 2 tap STAP 3 tap STAP
roll rate, degs/PRI
Figure 1.19
Loss in SNlR as a function of roll rate, slow moving target scenario (from [17])
motion may typically take the form of rapid yaw, pitch or roll. If there is no motion compensation, the mainlobe clutter extent within the three-dimensional STAP sampling space is increased. The spreading is due to both the movement of the receiving array and the transmit beam. Simulation results showing the effects of platform roll on STAP slow moving target detection have previously been presented in Reference 17. The results, showing the loss in the output SNIR for a forward-looking airborne radar operating at an altitude of 5 km, are summarised in Figure 1.19. The curves shown are forpre-Doppler STAP architectures with two and three taps per subarray channel implemented in the chequerboard subarray scheme shown in Figure 1.10. The scenario geometry and radar waveform parameters were identical to those used to produce Figure 1.6. The adaptation was carried out using all available range samples with the STAP weights updated every PRI in an attempt to compensate for the effects of motion. The SNIR values were estimated directly from the range-Doppler maps using a simple cell averaged CFAR scheme. The curves in Figure 1.19 show the loss in SNIR (relative to the optimal value) as a function of roll rate. The results indicate that roll rates as small as 0.005° per PRI have significantly reduced slow target detection performance in this case, despite the fact that the adaptive weights were updated at the PRI rate. However, it should be noted that the effects of motion have been accentuated here by the simplified quantised transmit beampattern employed within the clutter model.
1.5.2 Motion compensation The effects of the platform motion on mainlobe clutter rejection can be reduced by motion compensating the beam steering to ensure that the same region of ground is illuminated by the radar main beam throughout the coherent processing interval. This ensures that both the transmit and receive beams are motion stabilised. The technique can be applied effectively in both pre-Doppler and post-Doppler STAP approaches.
target
range Doppler
Figure 1.20
Range-Doppler map - 3 tap STAP output, 0.01° /PRI platform yaw, PRI weight updates
Simulation results showing the effectiveness of the approach for a radar platform yawing at a rate of 0.010° per PRI are shown in Figures 1.20 and 1.21. For adaptive suppression of the clutter, a pre-Doppler STAP algorithm with three taps per subarray channel was implemented in the chequerboard subarray scheme shown in Figure 1.10. Figure 1.20 shows the range-Doppler map produced when the adaptive weights were updated every PRI. The effect of adapting to clutter and the platform rotation has significantly reduced the SNIR from the optimal level of 32 dB for this scenario. The output SNIR estimated from the range-Doppler map in this case was 15.4dB. The result obtained using motion-compensated beam steering with the beam resteered at the PRI rate is shown in Figure 1.21, and it is clear that more effective clutter suppression has been achieved. The SNIR estimated from the range-Doppler map shown in Figure 1.21 was 28.9 dB, an improvement of 13.5 dB. It should also be noted that this result was achieved using a single adaptive weight vector, calculated from data collected over the full coherent processing interval, rather than by updating the adaptive weights at the PRI rate. Motion-compensated beam steering is only effective for compensating for the effects of motion on signals within the mainlobe of the radar beam. The motioninduced angle-Doppler spreading of sidelobe clutter is therefore not mitigated with this approach. Another drawback of the method is that very fine control of the beam steer direction may be required. In conventional phased array beamforming, beam steering accuracy to very small fractions of a beamwidth is not usually necessary, and the scanning is typically achieved using analogue phase shifters employing small numbers of bits. The phase settings for the array elements are therefore heavily
target
range Doppler
Figure 1.21 Range-Doppler map - 3 tap STAP output, 0.01°/PRI platform yaw, motion-compensated beam steering quantised. In the simulation results above, the ability to steer the beam to an accuracy of greater that 1/200th of a beamwidth was assumed. For the array size considered, this leads to a requirement of more than 8 bits resolution for the element level phase settings. Time-dependent adaptive weighting methods have previously been suggested for countering motion effects in adaptive arrays [21]. In this approach, motion compensation is achieved by assuming that the optimal adaptive weight vector is a function of time, and expanding the weight vector as a Taylor series. Assuming that (k •+- l)th order terms and above can be neglected, the adaptive weight vector then becomes: w(t) = w(0) + tw(l) + • • • + tkw{k) + <9O ( * +1) )
(1.12)
where t is the sample time of the data to which the weights are applied. Typically, a first-order expansion is used, and hence the adaptive weights are a linear function of time. This leads to a requirement for weight solutions for w^ and w^\ and a doubling of the dimensions of the covariance matrix to be inverted in equation (1.4). For a fixed number of subarrays and taps, this implies a factor of eight increase in the computational load associated with the adaptive weight calculation. Fast time, i.e. range sample, dependent adaptive weighting has previously been suggested as a technique for compensating for the effects of range dependency of clutter in nonsidelooking [7] and bistatic radar applications [22]. For motion compensation in STAP, it is the platform movement that takes place from pulse-to-pulse that is of greatest relevance and a single fast time-dependent model is unlikely to be valid for modelling both the platform motion and range dependency of the clutter. To model
for platform motion effects alone, a slow time (PRI rate) model is therefore most appropriate. The weights are then given by: w(m) = w{0) + mw(l) + • • • + mkw{k) + O(wiM))
(1.13)
where m is the PRI of the data to which the weights are applied. It should be noted that in the simulation examples considered within this section, there are enough training data samples available to allow the adaptive weights to be recalculated every PRI. The main advantage of using the time-dependent weighting approach rather than rapid updating of the adaptive weights (e.g. recalculation of the adaptive weights every PRI) in these cases is therefore computational efficiency. For example, if we assume 32 pulses in a burst, then for matrix inversion we can expect of the order of 327V3M3 floating point operations to be required. In contrast, if a first-order time-dependent weighting model is adequate, the inversion requires 8 A/"3 M 3 operations, a reduction by a factor of four. Time-dependent weighting will therefore produce similar results to those shown in Figure 1.20, but with reduced computation. The main deficiency of time-dependent weighting of the receive beam is that it cannot fully compensate for the fact that the region of ground illuminated by the transmit beam is changing with time. If uncorrected, this causes a spreading of mainlobe clutter within the range-Doppler detection map.
1.6 1.6.1
Jammer rejection under conditions of manoeuvre Mainlobe clutter filtering requirements
It is well known that spatial adaptive beamforming methods can be used to reject sidelobe jamming or jamming in the skirts of the mainlobe of the receive beam. In look down airborne radar, it is important to remove mainlobe clutter returns from the sensor data before attempting jammer rejection [2]. Failure to do this leads to the adaptive beamformer attempting to reject the mainlobe clutter, which in turn leads to a significant loss of main beam gain and high adaptive sidelobe levels. The mainlobe clutter Doppler frequency is usually known to a high enough accuracy to allow the clutter to be removed using conventional MTI filtering prior to adaptive beamforming. We will refer to the approach of cascading conventional MTI clutter filtering with spatial adaptive beamforming as MTI/ABF. The obvious disadvantage of using MTI/ABF, rather than STAP, is that it cannot provide detection of targets lying within the mainlobe clutter Doppler bandwidth [2]. Within this section we will analyse the potential advantages of the pre-Doppler STAP approach under conditions of manoeuvre. 1.6.2
Advantages of using STAP
Pre-Doppler STAP techniques can provide simultaneous suppression of clutter and jamming. The PRI tap delay spacing employed in STAP architectures designed for clutter rejection is usually large in comparison to the inverse of the instantaneous (or processed) bandwidth of the received signals. Wideband noise jamming signals
are therefore uncorrelated from tap to tap in the processor and, because of this, STAP usually offers little advantage over spatial adaptive beamforming in terms of jammer rejection performance. The jammer rejection performance of pre-Doppler STAP and MTI/ABF approaches has previously been compared in Reference 20 where analysis and simulation results demonstrated that the STAP approach is relatively robust under conditions of manoeuvre. The advantage of using STAP rather than MTI/ABF can be deduced from examination of the structure and rank of the jamming covariance matrix in both cases. Any motion-induced increase in the rank of the matrix is likely to lead to a larger number of adaptive degrees of freedom being required for jammer rejection, higher adaptive sidelobe levels and a decrease in SNIR. Consider a pre-Doppler STAP configuration comprising K spatial channels with L taps per channel. If we assume that the jamming, clutter and thermal noise are mutually uncorrelated, the covariance matrix can be expressed as: Q = Qj + Qc + Qn
(1-14)
where Qj, Qc and Qn are covariance matrices of the jamming, clutter and background noise, respectively. With the additional assumption of wideband noise jamming that is uncorrelated from tap to tap in the STAP architecture, the jamming covariance matrix can be expressed as: 'Qx
Qj=
0
...
&
° 0
0
0"
; QL
(U5)
where Qm denotes the K by K spatial covariance matrix of the jamming data at the mth tap of the processor. It therefore follows that the rank of Qj is given by: L
PiQj)=YsP(Qm)
(1.16)
m=\
where p(A) denotes the rank of matrix A. Under the assumption of wide sense stationarity, where there is no relative motion of the jammers, we have g j = Q2 = - • • = QL. Assuming that the propagation time across the receiving array is negligible compared with the reciprocal of the receiver bandwidth, it then follows that p(Qj) = Lp(Q1) = LJ where J is the number of jammers. Under conditions of manoeuvre, we have Qx ^ Q2 / • • • ^ QL in equation (1.15). If we now assume that the platform is manoeuvring at a constant rate we have p(Qx) = P(Q2) = • • • = P(Qi) and it follows that we again have: P(Qj) = Lp(Qx)
(1.17)
It should be noted that, in theory, the relative jammer motion now leads to the rank of the spatial jamming covariance matrix, Qx, being K, and hence it also follows that the rank of Qj is KL. However, in practice, the relative motion during covariance matrix estimation will be small and many of the eigenvalues of Qx will be below background
noise level. It then follows that the apparent rank of Q1, i.e. the number of dominant eigenvalues, will be closer to J than to K. Equation (1.17) is therefore significant as it predicts that both with and without platform motion, the apparent rank of the space-time jamming covariance is L times the apparent rank of the spatial jamming covariance matrix. From the structure of Qj in equation (1.15) it is also apparent that complete jammer rejection can be achieved by applying spatial jammer rejection weights at each column of taps within the pre-Doppler STAP architecture. In the presence of platform motion, differing jammer rejection weights can be applied at each tap, therefore compensating for the effect of the apparent jammer motion between PRIs. For the MTI/ABF processor, the jammer covariance matrix Qj becomes a K by K matrix. The effect of including the clutter rejection filters is to cause the multichannel sensor data to contain components spread over a time period corresponding to the impulse response length of the filters. For example, if we assume that the clutter rejection is achieved using FIR filters with L taps, each filter output sample will contain contributions spread over a duration of L — 1 PRIs. It therefore follows that, if the covariance matrix estimation in equation (1.5) is carried out using samples collected over a duration T, the clutter filtering causes the matrix to contain jamming data components spread over a time interval (L — l)r + T, where r is the PRI, rather than T. Under conditions of wide sense stationarity, the rank of the covariance matrix Qj will clearly be unaffected, but if there is relative jammer motion the clutter filtering will increase the apparent rank of Qj, with the increase depending on the ratio of the PRI, r, to the covariance matrix estimation interval T. The jammer rejection performance of MTI/ABF is therefore more sensitive to platform motion effects than is pre-Doppler STAR It should be noted that, in contrast to pre-Doppler STAP, post-Doppler STAP is likely to offer poorer jammer rejection than MTI/ABF under conditions of manoeuvre. This is because MTI clutter rejection is achieved by combining small numbers of pulses, whereas post-Doppler STAP architectures involve coherent integration of a full burst of pulses prior to the adaptive processing. Hence, in post-Doppler STAP, the prefiltering causes jammer data components spread over a time interval (M — 1)T rather than (L- l)r with M ^> L. The effects can be reduced by applying time-dependent weighting [21], but this is at the cost of a significant increase in computation. Confirmation of the validity of the theoretical analysis can be obtained from examination of the eigenvalues of the covariance matrix, Q, used in the adaptive weight calculation. The number of eigenvalues significantly above the thermal noise level gives an indication of the number of adaptive degrees of freedom required to achieve jammer and clutter suppression. If the number of dominant eigenvalues approaches the total number of adaptive channels, then poor SNIR is likely to result. In Figure 1.22, pre-Doppler STAP covariance matrix eigenspectra are plotted for varying numbers of taps for a simulation scenario with the radar platform rolling at a rate of 0.36° per PRI. The receive antenna used in the simulation was the 2029 circular planar array partitioned into 16 subarrays shown in Figure 1.10. The radar was operating in the presence of three strong (60 dB above noise at the element level) sidelobe jammers.
1 tap 2 taps 3 taps 4 taps
eigenvalue no.
Figure 1.22
Pre-Doppler STAP jamming plus thermal noise covariance matrix eigenspectra, roll rate 0.36°/PRI, sidelobe jamming scenario
The eigenspectra in Figure 1.22 were obtained from the covariance matrices of the jamming plus thermal noise (i.e. with clutter excluded). The important point to note is that, as the number of taps is increased, the proportion of the total spectrum affected by jamming remains the same, as predicted by equation (1.17). This implies that the SNIR output should remain relatively unaffected as the number of taps in the processor is increased. In contrast, the eigenspectra shown in Figure 1.23 reveal that for the MTI/ABF covariance matrix the proportion of dominant eigenvalues increases with the number of taps. Simulation results showing the loss in SNIR as a function of the number of clutter rejection filter taps for the sidelobe jamming scenario are summarised in Figure 1.24. The results show the effect of adapting to jamming and thermal noise data alone, and hence spatial adaptive beamforming can potentially provide optimal performance. Exclusion of clutter allows the impact of the MTI filtering on the sidelobe jamming rejection to be examined in isolation. The adaptive weights were calculated from data collected over a PRI and the SNIR values were calculated from the average of ten simulation runs. The loss in SNIR is given as the output SNIR relative to the optimal level. It is evident from Figure 1.24 that there is only a slight loss in SNIR as the number of taps is increased in the STAP approach. This loss is mainly due to the increase in number of adaptive channels relative to the amount of data used in the adaptive weight calculation. In contrast, the MTI/ABF result shows higher losses in SNIR as the number of taps in increased. However, the difference in performance between STAP and MTI/ABF is always small (i.e. always less than 1 dB). This is due to the fact that the total number of spatial degrees of freedom (1.16) is far in excess of the number of jammers (1.3), and hence, when using the MTI/ABF approach, wide nulls can be formed to counter the apparent spatial spread of the jammers. The penalty
ABF 2 tap MTI/ABF 3 tap MTI/ABF 4 tap MTI/ABF
eigenvalue no.
SNIR loss, dB
Figure 1.23 MTI/ABF jamming plus thermal noise covariance matrix eigenspectra - roll rate 0.36° /PRI, sidelobe jamming scenario
STAP MTI/ABF
no. of taps
Figure 1.24
SNIR loss as a function of number of taps - sidelobe jamming scenario, no clutter, roll rate 0.36°/PRI (from [20])
for doing this is an increase in mean sidelobe levels, as shown in the results plotted in Figure 1.25. The sidelobe levels were calculated from the spatial beampattern evaluated at the target Doppler frequency. Manoeuvring adaptive arrays are extremely sensitive to mainlobe interference, e.g. see Reference 23. This is because extended nulls within the mainlobe of the beam are required to account for the relative jammer motion and this can lead to significant loss in gain in the desired signal direction. The relative performance of
mean sidelobe level, dB
STAP MTI/ABF
no. of taps
Mean sidelobe level as function of number of taps - sidelobe jamming scenario, no clutter, roll rate 0.36°/PRI
SNIR loss, dB
Figure 1.25
STAP MTI/ABF
no. of taps
Figure 1.26
SNIR loss as afunction ofnumber of taps - mainlobejamming scenario, roll rate 0.036°/PRI (from [20])
STAP and MTI/ABF architectures in a mainlobe jamming scenario is therefore of particular interest. Figure 1.26 shows the SNIR loss (relative to the optimal value) as a function of the number of clutter rejection filter taps for a mainlobe jamming simulation scenario where the radar platform was rolling at a rate of 0.036° per PRI. The receive array was again the chequerboard subarrayed configuration shown in Figure 1.10. The jammer was initially located at the —3 dB point of a beam steered off at an angle of 60° from broadside to the array. Clutter was not included in the
simulation. Figure 1.26 shows that there is a significant loss in SNIR as the number of taps is increased in the MTI/ABF architecture. In comparison, STAP is relatively robust to the manoeuvre effects, and for four taps the output SNIR is more than 10 dB higher than that achieved with MTI/ABF. The losses when using MTI/ABF are significantly above those obtained in the sidelobe jamming scenario despite the fact that the platform rotation rate is a factor often less. In Section 1.5.2, it was shown that motion compensation of the beam steer direction helps to reduce the effects of platform motion on mainlobe clutter rejection when using STAR In mainlobe jamming scenarios, motion-compensated beam steering also helps to correct for the relative jammer motion, and hence it is clearly of benefit to apply this technique in scenarios containing mainlobe clutter and jamming. The motion-compensated beam steering approach is appropriate to both MTI/ABF and STAP architectures, but does rely on the phase quantisation being fine enough to allow accurate resteering of the beam. It should also be noted that there are cases where the approach will offer little or no benefit in terms of interference rejection. The case where the beam is at broadside to the array and the platform is rolling is an obvious example. Fortunately, the effects of the relative jammer motion are likely to be insignificant in this geometry, as the mainlobe jammer angular displacement from the roll axis is small. In contrast to the mainlobe jamming case, motion-compensated beam steering tends to accentuate the relative motion of sidelobe jammers. In situations where the number of spatial degrees of freedom far exceeds the number of sources of interference, the effects on adapted sidelobe levels and jammer rejection are likely to be small.
1.7
Summary
STAP techniques can provide simultaneous rejection of jamming and clutter in airborne radar. Although, in the past, STAP has been considered mainly for SLAR applications, there has been growing interest in applying the technique to non-side-looking radar geometries. In this chapter we have considered STAP for forward-looking airborne radar where the array of sensors is orientated transversally to the direction of travel. In forward-looking applications (e.g. AI radar), the effects of platform manoeuvre can be of greater significance than in typical SLAR applications (e.g. AEW and SAR). For example, detection and tracking of targets may be required in situations where the radar platform is performing a steep dive or rolling at a rapid rate. One of the greatest advantages of STAP over conventional signal processing methods is the potential it provides for detecting targets which possess the same Doppler as mainlobe clutter returns (i.e. slow moving targets). In this chapter, we have paid particular attention to the effect of variations in platform orientation and manoeuvre on slow target detection performance. In the scenarios involving constant platform velocities, it has been demonstrated that the slow target detection capability is not sensitive to the platform orientation. In particular, it has been shown that slow target detection can be achieved when the radar platform is performing a steep dive.
The effects of platform manoeuvre on STAP slow target detection performance have also been examined. Simulation results indicate that performance can be sensitive to the effects of roll and yaw. The most effective way of countering the effect of the platform motion is to motion compensate the beam steering to ensure that the target and the same region of ground are illuminated by the radar main beam throughout the coherent processing interval. Techniques involving application of time dependent weighting, or rapid updating of the adaptive weights are only partially effective as they cannot fully compensate for the effects of the transmit beam motion. It has been demonstrated that, under conditions of manoeuvre, pre-Doppler STAP techniques can provide better jammer rejection performance than architectures which cascade conventional clutter filtering and spatial adaptive beamforming. In cases where there is no compensation of the beam steer direction, differences in performance are most apparent in the presence of mainlobe jamming. The effects of platform motion on mainlobe clutter and jammer rejection can be reduced by motion compensation of the beam steering direction, but this approach is likely to accentuate the relative motion of sidelobe jammers.
References 1 RICHARDSON, R G.: 'Relationships between DPCA and adaptive space time processing techniques for clutter suppression'. Proceedings of the international conference on Radar, Paris 1994, pp. 295-300 2 RICHARDSON, R G. and HAYWARD, S. D.: 'Adaptive space-time processing for forward looking radar'. Proceedings of IEEE international Radar conference, Alexandria, VA, USA, May 1995, pp. 629-634 3 KLEMM, R.: 'Adaptive airborne MTI: comparison of sideways and forward looking radar'. Proceedings of IEEE international Radar conference, Alexandria, VA, USA, May 1995, pp. 614-618 4 KLEMM, R.: 'Adaptive airborne MTI with tapered antenna arrays', IEE Proc, Radar Sonar Navig, 1998,145, (1), pp. 3-8 5 WANG, Y.-L., PENG, Y-N., and BAO, Z.: 'Space-time adaptive processing for airborne radar with various array orientations', IEE Proc. Radar Sonar Navig., 1997,141, (6), pp. 330-341 6 BOSARI, G. K.: 'Mitigating effects on STAP processing caused by an inclined array'. Proceedings of IEEE national Radar conference, Dallas, TX, 1998, pp. 135-140 7 ZATMAN5M.: 'Circulararray STAP',IEEETrans. Aerosp. Electron. Syst, 2000, 36, (2), pp. 510-517 8 KLEMM, R.: 'Principles of space-time adaptive processing' (The Institution of Electrical Engineers, London, UK, 2002) 9 WANG, H., ZHANG, Y, and ZHANG, Q.: 'A view of the current status of space-time processing algorithm research'. Proceedings of IEEE international Radar conference, Alexandria, VA, USA, May 1995, pp. 635-640
10 BROWN, R. D., WICKS, M. C , ZHANG, Y., ZHANG, Q., and WANG, H.: 'A space-time adaptive processing approach for improved performance and affordability'. Proceedings of IEEE national Radar conference, Ann Arbor, Michigan, 13-16 May 1996, pp. 321-326 11 FARINA, A., SAVERIONE, A., and TIMMONERI, L.: 'MVDR vectorial lattice applied to space-time processing for AEW radar with large instantaneous bandwidth', IEE Proc, Radar Sonar Navig., 1996, 143, (1), pp. 41-46 12 ENDER, J. H. G.: 'Space-time processing for multichannel synthetic aperture radar', Electron. Commun. Eng. J., 1999, February, pp. 29-38 13 REED, I. S., MALLETT, J. D., andBRENNAN, L. E.: 'Rapid convergence rate in adaptive arrays', IEEE Trans. Aerosp. Electron. Syst, 1974,10, (6), pp. 853-863 14 WARD, J.: ' Space-time adaptive processing for airborne radar'. Technical report no. 1015, MIT Lincoln Laboratory, December 1994 15 HERBERT, G. M. and RICHARDSON, P. G.: 'A constrained adaptive pattern synthesis technique for space-time filtering architectures'. Proceedings of the DGON international Radar symposium, 1998, Munich, Germany, pp. 857-866 16 TAM, K. and FAUBERT, D.: 'Displaced phase centre antenna clutter suppression in space-based radar applications'. Proceedings of Radar '87^ IEE Conf. Publ. 281, pp. 385-389 17 RICHARDSON, P. G.: 'Space-time adaptive processing for manoeuvring airborne radar', Electron. Commun. Eng. J., 1999, February, 77, (7), pp. 57-63 18 PAINE, A. S.: 'Comparison of partially adaptive STAP techniques for airborne element digitised phased array radar'. Proceedings of the IEE international Radar conference 2002, Edinburgh, UK, October 2002, IEE Conf. Publ. 490, pp. 181-185 19 REES, H. D. and SKIDMORE, I. D.: 'Adaptive attenuation of clutter and jamming for array radar', IEE Proc, Radar Sonar Navig, 1998, 145, (4), pp. 193-199 20 RICHARDSON, P. G.: 'Effects of manoeuvre on space-time adaptive processing performance'. Proceedings of the Radar '97 conference, Edinburgh, October 1997, IEE Conf. Publ. 449, pp. 285-290 21 HAYWARD, S. D.:' Adaptive beamforming for rapidly moving arrays'. Proceedings of the CIE international conference on Radar (IEEE Press), Beijing, China, October 1996, pp. 480^83 22 MELVIN, W. L., CALLAHAN, M. J., and WICKS, M. C : 'Adaptive clutter cancellation in bistatic radar'. Record of 34th Asilomar conference on Systems, signals and computers, IEEE 2000, pp. 1125-1130 23 BALLANCE, W. P. and MILLER, T. W.: 'Impact of mainlobe interference angular extent on adaptive beamforming'. Conference record of 25th Asilomar conference on Signals, systems and computers, CA, November 1991, pp. 989-993
Chapter 2
Non-linear and adaptive two-dimensional FIR filters for STAP: theory and experimental results Pierfrancesco Lombardo and Fabiola Colone
2.1
Introduction
A significant challenge for the effectiveness of STAP techniques against real data is presented by the operation against severe and non-homogeneous interference environments. In particular, an airborne early warning (AEW) surveillance radar platform, whose mission is to detect low radar cross section targets, must contend with high levels of undesired clutter returns from both land and sea surfaces. This must operate with a large number of degrees of freedom (DoF) to be able to cancel strong clutter echoes accurately. However, it is impossible to use such a large number of degrees of freedom adaptively, since this would yield unacceptable adaptivity losses [I]. Moreover, the real-time implementation requirements demand the use of filters with a low computational cost. The clutter environment also includes returns from clutter of various sea states, terrain types (i.e. desert, hills, mountains) and large discretes and becomes particularly severe in regions encompassing varying ground surfaces such as regions connecting land and sea. This clutter non-homogeneity limits the amount of homogeneous secondary data available for the adaptive algorithms. Moreover, the presence of interfering targets as well as intense, high-power coherent jamming also affects radar system performance, by contaminating the estimations of the clutter characteristics. Thus the techniques to be applied in practice should be robust to their presence. Many reduced DoF techniques have been proposed to achieve at the same time a reduced computational cost and the requirement for a limited set of homogeneous secondary data without suffering very large adaptivity losses. However, when operating in a highly non-homogeneous environment or in the presence of interfering targets, the robustness of most linear approaches can be gained only at the expense of
a large increase in the computational load. In this chapter, we describe three possible solutions for robust and effective STAP of radar data: (i) adaptive two-dimensional FIR filters with small support (ii) non-linear non-adaptive schemes (iii) non-linear combination of adaptive two-dimensional FIR filters. The performances are evaluated and compared both by a theoretical analysis and by application to a set of recorded radar data. In summary, the non-linear adaptive detector promises remarkable detection performance in a non-stationary clutter background containing interfering targets.
2.2
Adaptive linear filters
We consider a radar system with K spatial channels. Each of these receives M echoes from a transmitted train of M coherent pulses with a pulse repetition interval (PRI) of T seconds. Let xmj be the radar echo at the /th spatial channel (/ = 1 , . . . , K) in response to the rath pulse (ra = 1 , . . . , M). The KM echoes can be arranged into the ATM-dimensional column vector x = [x* . . . x^]*, xm being the column vector of the K echoes received at the rath pulse. The corresponding vector for the echoes from a target with Doppler frequency F (normalised to PRF = 1 /PRI), direction of arrival (DoA) (p and complex amplitude A is defined as: sf(F,(p) = As = A[s*(F,
] j
% X]
(2.1)
s*Q~ s being Q = \/Q YlH=X ^ ^l anc^ ^ t n e detection threshold. This detector can also be interpreted as the comparison of the output power of the SMI (sample matrix inverse)
filter, normalised to the peak of the signal, Z(x), to the MSMI adaptive detection threshold, AMSMI: S+Q-1X
Z(I) = — 7 -1— ^ S^Q- S
rj
AMSMI
= — r — 1— (s*Q~ s)
(2.2)
Slightly better performance is usually provided by Kelly's GLRT (generalised likelihood ratio test), obtained from the maximisation of the likelihood function of the joint set of the Q + 1 data vectors (primary and secondary data) over the parameters A and Q, independently under the hypotheses Ho and H\. Taking the ratio of the maximised likelihood functions obtained under the two hypotheses yields the same detection statistic T(±) in equation (2.1) divided by (1 + x * Q - 1 x ) [2]. This compares Z(x) to a modified adaptive threshold AGLRT = ^GLRT(X) = ^MSMI • P -h X + Q -1 X],
which is data dependent. Unfortunately, to have good performance both tests require a number Q of secondary data at least equal to twice the number of DoF, namely 2KM9 while the probability of detection (P^) degrades very fast below such a number [3]. Therefore, as the number of DoF increases, the improved clutter cancellation capability goes together with the need for larger areas of homogeneous clutter and a high computational load for weight calculation and filtering. When these are not available, reduced DoF techniques should be used. A careful review of the many different reduced DoF algorithms can be found in Reference 1. Among them, we consider the joint domain localised (JDL) technique as our reference adaptive linear detector [4]. The technique basically consists in taking the bidimensional fast Fourier transform (FFT) of the received data vector and discarding all the transformed data but KpMp of them, localised around the position of the Doppler angle plane, which is tested for detection. Either the MSMI or Kelly's GLRT is then applied to the selected KpMp data. Notice that only the KpMp DoF that are localised on a small region of the two-dimensional joint spectral domain around the target position are used adaptively. The remaining DoF, instead of being discarded, are used non-adaptively (i.e. by the non-adaptive transformation). The computational cost of the adaptive portion is largely reduced; however, the weight calculation must be repeated for any Doppler channel of interest, where a new reduced DoF covariance matrix must be estimated. Even with very few adaptive DoF, as for Kp = 3, Mp = 3, the JDL keeps good performance. This yields a drastic reduction in the size of the required homogeneous clutter area; however, at least 2 = 18 range cells must still be homogeneous and free from interfering targets. We refer to the case of Q = 32 to have negligible losses. The performance degradation due to the presence of an interfering target in the secondary cells is easily verified from Figure 2. Ia, which shows the P& as a function of the interference to noise ratio (INR) for the JDL (dashed lines). We consider, as a reference for the analysis in this chapter, the case of a side-looking array of K — 3 spatial channels, M — 64 temporal pulses and Q = 32 secondary data, operating against a clutter with Gaussian shaped temporal decorrelation [5] and clutter-to-noise ratio, CNR = 16 dB at the input of the individual channel. The target is assumed to have a signal-to-noise ratio, SNR = —2 dB, DoA cp = 45°, and normalised Doppler
JDL MSMI Q=32 JDL GLRT Q=32 JDL MSMI Q= 16 JDL GLRT Q=16 AR Q=32 ARQ-8 ARQ=4
INR, dB
JDL MSMI Q=32 JDL GLRT Q=32 JDL MSMI Q=16 JDL GLRT Q=I6 AR Q=32 ARQ=8 ARQ=4
b
Figure 2.1
SNR, dB
Performance of linear schemes (JDL with Kp = Mp = 3 and AR with L = 3) for K = 3, M = 64, and a target at
frequency F = —0.156, which is close to the clutter main beam for the spectral folding due to the low PRP value used in this analysis. Moreover, the threshold is set so as to yield a probability of false alarm Pfa = 10~3 in the absence of interfering targets. As a worst case, the interfering target is assumed to share the same DoA and Doppler frequency as that of the desired target. Its presence in the secondary data affects the estimate of the covariance matrix; therefore, the resulting adaptive detection scheme tends to equalise the space-time spectrum, thus putting a notch in the DoA and Doppler frequency of the target. As the INR increases, the notch becomes deeper and the desired target is quickly nulled. Obviously, using less secondary data reduces the probability that the secondary data are affected by interfering targets, therefore we consider both the cases of Q = 32 and Q = 16 secondary data. Notice that, when an interfering target is present in the 16 cells closest to the CUT, the JDL plots with Q = 16 start decreasing by about 5 dB before the corresponding plots with Q = 32 secondary data. This is because the undesired contribution in the sample covariance matrix is averaged with a smaller number of uncontaminated data. It must also be noticed that the JDL plots with Q = 16 secondary data show a lower probability of detection than the corresponding plots with Q = 32 secondary data even for very low INR values. This can be verified from Figure 2.1b, representing the P& as a function of the SNR under the same conditions in the absence of interfering targets. As apparent, the JDL with Q = 16 secondary cells yields about 3 dB of loss with respect to the case of Q = 32. Moreover further reductions of Q would show a drastic fall in the detection performance. Therefore, with the JDL, there is an intrinsic limit in the possibility of reducing the probability of interfering targets by reducing the size of secondary data, unless accepting the presence of significant detection losses even in the absence of interferences. Finally, it is interesting to observe from Figures 2.1a and 2. Ib that, with both 2 = 1 6 and Q = 32 secondary cells, the plots of MSMI and GLRT are very close to one another. Thus, in the following, only the JDL MSMI is considered. The detection performance against a non-homogeneous environment is demonstrated in this chapter with reference to a data set provided by the Naval Research Laboratory (USA), which consists of land, sea and land-sea interface scenarios of various terrain types and sea states. The data have already been processed by using systolic-based adaptive algorithms [6] and various STAP filters [7,8]. These data were recorded by a modified AN/APS-125 radar at UHF on a P-3A aircraft and include flight tests taken in 1979 and 1980 over the Appalachian mountain regions, Maryland, Virginia and midwestern areas of the United States. The receiving system consists of eight identical channels giving / and Q signals at 10 bits within a baseband of 5 MHz. For the purpose of our analysis, a non-adaptive digital beamforming is applied to the spatial channels, which combines, with appropriate phases, K = 3 overlapped uniform subarrays (OUS) of four adjacent channels. The processed file (DL075) refers to littoral clutter; the transition between ground and sea echoes is well apparent in Figure 2.2a, which shows the power received at the first channel and first PRI versus range. The data refer to 18 PRIs, each one containing 896 range cells for each channel. The first 450 samples refer to sea clutter and have a power level of about 1OdB;
power, dB output power, dB
range bin
test statistic, 101ogl0 (T)
range bin
c
Figure 2.2
range bin
Results of JDL against recorded data by AAFTE a Power level versus range for first antenna element and first pulse b JDL MSMI output power versus range for 1 target injected and Q = 32 c JDL MSMI test statistic versus range for 1 target injected and Q = 32
from sample 550 to 896 ground clutter echoes are received, which have a level of about 30 dB; samples in between are characterised by a high non-homogeneity, since they are reflected from the sea coast region which is primarily sandy beaches with small hills and cliffs near the water. To show the clutter cancellation performance of the JDL, a fictitious point-like target is injected at range cell 480 at
of the signal at the output of the normalised filter, Z(x), of the JDL detector (see equation (2.2)), with Mp = Kp = 3 and Q = 32, is portrayed in Figure 2.2b. It is noted that about 25 dB of clutter cancellation has been obtained on the sea area and about 35 dB on the land area and the target appears well above the clutter level. The behaviour in terms of detection is easily verified from the test statistics, T(x) (see equation (2.1)), in Figure 2.2c, which shows that using the JDL an appropriate threshold can easily be selected to detect the target without having false alarms. To show the performance of the JDL in the presence of interfering targets, we proceed as follows. Nine sets of two or three fictitious point-like targets have been added to the data set at the same target DoA 90° and Doppler —213 Hz, at range cells and power levels described in Table 2.1. Sets (A, C, E, F, I) named Group I, have two targets with the same power level; sets (B, D, G, H), named Group II, have targets with different power levels. The proposed STAP schemes have been applied to these data to assess their comparative performance. Notice that almost all targets are well below the clutter level (compare Figures 2.3a and 2.2a). Figures 2.3b,c show the CFAR statistic T(\) of the JDL MSMI versus range, with Q = 32 and Q = 20 secondary data, respectively. A few considerations apply: (i)
The JDL with Q = 32 correctly detects the targets in Group I (italic labels) but looses the smallest targets in Group II (bold labels). When the smallest target of a configuration from Group II is in the CUT, one or two stronger targets affect the estimation of the covariance matrix and the threshold setting. Following the above discussion - and recalling that all targets have been considered at the same DoA and Doppler frequency - this implies that a notch is set in the target space-time location so that the small target is not detected. For targets of the same power level, the same notch operates on the target in the CUT. However, due to its original power level, this still exceeds the threshold despite the notch in its space-time location. (ii) Strong false alarms appear at range cells 507, 577, 603; namely, when setting a threshold to detect most of the targets, a detection is declared also in these range cells, however no targets were injected there. This is independent of the presence of interfering targets. The high values of the test statistic, which are not Table 2.1 Synthetic targets test configuration Set Group Range bin Power (dB) Set Group Range bin Power (dB)
A I 90,93 35,35 F I 540,543 15, 15
B II 180,183,186 0,15,15 G II 630,633,636 15,0,25
C D E I II I 270,273 360,363 450,453 0,0 0,15 15,15 H / II I 720,723 810,813 25,0 0,0
input power, dB test statistic, 101og10 (T)
range bin
I i
test statistic, 10 log 10 (T)
range bin
c
range bin
Figure 2.3
Results of JDL against recorded data by AAFTE for the target test configuration of Table 2.1 a Synthetic targets test configuration b JDL MSMI test statistic versus range for Q = 32 c JDL MSMI test statistic versus range for Q = 20
related to injected targets, come from the non-homogeneous area of transition between sea and land. Such values were also present in Figure 2.2c; however, in the case of sequences of closely spaced targets (with various power levels and partial cancellation of each one due to the presence of the neighbours in the sample estimate of the covariance matrix) they directly yield false alarms. In fact, to detect all targets, we need to set a lower threshold than that in Figure 2.2c which is also exceeded by all these values.
(iii) The JDL with Q = 20 shows similar behaviour to the JDL with Q = 32, but there are significantly more false alarms. This shows that reducing the number of secondary data increases the fluctuation of the test output. The undesired effect from adjacent targets is removed only if the target distance is larger than Q/2 = 1 0 cells; however, it is not removed in the case of sequences of targets more closely spaced (as considered in our test configuration). A possible approach to reduce the effect of the interfering targets in the secondary data consists in devising adaptive filtering techniques that intrinsically need a very limited amount of secondary data, Q, so that the probability of incurring in the performance degradation due to interfering targets is largely reduced. In particular, only targets with distance less or equal to Q/2 show an interfering effect. Obviously, the JDL scheme with Mp = 3, Kp = 3 does not allow us to work with a number of secondary data Q lower than 18, unless accepting large adaptivity losses. Therefore, alternative approaches must be considered to reduce Q further. A two-dimensional FIR filtering scheme is presented in the next section that is able to operate with a very small number of secondary data, without significant adaptivity losses.
2.3
AR-based FIR filters
To derive the AR-based FIR (finite impulse response) filter, the space-time disturbance echoes (clutter-plus-noise) are represented with a two-dimensional autoregressive model: AR(L — 1, A, R) of order L — 1 and matrix parameters A and R: (2.3) d (p) being the ( ^ x I ) echoes vector at the /7th PRI, w(/?) is a sample from a white vectorial complex Gaussian process with zero mean value and (K x K) covariance matrix R, and A = [A^ - 1 , A £ _ 2 , . . . , A*]* the (K(L — 1) x K) matrix of the two-dimensional AR parameters. By assuming that the AR poles are not close to the unit circle, and extending to the two-dimensional domain the approximation in Reference 9, the joint PDF of the data vector in the CUT can be approximated under the hypothesis of target absence (HQ):
(2.4)
To simplify the notation of equation (2.4), we define the (K x KL) matrix H = [—A* 1^], where I^ is a ^-dimensional identity matrix. By rearranging the data into the (KL x (M - L + I)) matrix X:
(2.5) which implicitly defines the vectors x ^ , ra = l , . . . , M — L + l,it yields: (2.6) Similarly, the approximate PDF, /?i(x), under hypothesis H\ (namely assuming also the presence of a target with complex amplitude A and space-time steering vector s), is obtained by replacing in equation (2.6), X with (X — AS), where S = [S^1JL S ^ • • • s^jT L + 1 ) ] is the target matrix obtained by applying the same reordering operation as for X. By assuming known disturbance characteristics, namely known A and R matrices, the generalised likelihood ratio test is easily obtained by maximising the ratio p\(x)/po(x) with respect to the target complex amplitude, A. The maximisation yields the maximum likelihood (ML) value for the target complex amplitude A = fr{X*H*R-1HS}/^r{S*H*R-1HS}, and finally the logarithmic GLRT: (2.7) where XAR is the detection threshold. The denominator of equation (2.7), Tr{S*H*R~1HS} = E{jl}, is the expected value of the average clutter power jx at the output of the filter, estimated over the Q secondary data, thus the filter is intrinsically CFAR. As apparent, the denominator is a constant independent of the data matrix X in the CUT, so that the numerator defines the filter shape. By exploding the trace into the sum of its components, and using the identity s ^ = e^2nF^n~V)sA)F it is rewritten as:
(2.8)
where the adaptive weight vector w = H*R 1 Hs^] 7 is implicitly defined. Equation (2.8) shows that the GLR test is given by the application of the twodimensional adaptive filter w to the data with L ^-dimensional support, followed by a bank of Doppler filters (each one centred on a different target speed). It is important to observe that only a very limited number of coefficients (L taps for each of the K spatial channels) is required to implement this filter. Further - unlike the JDL - only the target steering vector depends on the Doppler frequency, therefore the adaptive portion (H*R~ 1 H) of the filter weight vector is independent of the considered Doppler channel and can be evaluated and applied to the data only once. This is a interesting advantage of the AR-based FIR filter, that can be classified as a pre-Doppler technique, where the optimisation is independent of the subsequent Doppler filter bank. These characteristics yield a low computational cost. Moreover, the space-time FIR filter is especially appealing since it was shown to be coincident with the optimum adaptive processor (OAP) in References 10 and 11, under specific assumptions on the clutter characteristics. In the more general case under analysis, the improvement factor (IF) obtained with this AR-based two-dimensional FIR filter is compared in Figure 2.4 with both the OAP and the JDL detector, for the case of a radar operating with K = 3 spatial channels, M = 64 PRIs, a Gaussian-shaped temporal clutter correlation, CNR = 16 dB and beam pointing at
Figure 2.4
Comparison of IF curves for K = 3, M = 64,
for AR filters with L = 3 and L = 6 taps while the JDL operates with Kp — 3 and Mp = 3. As apparent, the considered clutter model does not follow exactly the two-dimensional AR model; however, the two-dimensional FIR filter shows only a limited performance loss close to the notch with respect to both OAP and JDL, despite the large reduction of the computational load. Moreover, as the number of taps, L, increases from L = 3 to L = 6, the performance of the AR-based FIR filter close to the clutter notch also gets closer to the JDL, which shows a trade off between computational cost and performance. Further increases of the number of taps L would not yield better performance close to the clutter notch. A different behaviour is present far from the clutter notch, where filters with more taps show larger losses. This performance loss far away from the clutter notch is due to the fact that a FIR filter with L taps uses a shortened coherent CPI of (M — L + 1) samples, because the filter does not slide over the edges of the data record. It is to be noticed that the use of FIR filters deriving from a two-dimensional AR model of the clutter returns was first introduced by Klemm and Ender in References 12 and 13. In contrast to our GLRT approach, based on the approximate PDF in equations (2.4)-(2.6), they applied the minimum mean square error criterion at the filter output, obtaining slightly different expressions for the filter parameters. The performance of the two filters is largely comparable, thus similar considerations as above apply. For the practical application of the AR detection scheme, the clutter parameters A and R cannot be considered known, and must be estimated from the data themselves. To this purpose, we use the independent echo vectors received from Q adjacent range cells that are considered homogeneous and target free. As for the primary data, the KM echo vector y^ in each secondary range cell is rearranged into the matrix Y^, for k = 1 , . . . , <2; then these matrices are collected into the global (KL x (M — L + 1) Q) matrix of the secondary data Y = [Yi Y2 • • • Yg]-As shown in the Appendix (Section 2.8), the ML estimates of the matrix parameters A and R are obtained from the sample covariance matrix My = YY*. For convenience, we decompose the My in blocks of size K(L — 1) and L, respectively: (2.9) Therefore, we have the ML estimates:
(2.10)
where [M F 1]L,L is the last KxK block in the main diagonal of the inverse of My. These values are used inside the practical detection scheme to replace the known values. Further consideration is in order for the practical application of the AR-based two-dimensional FIR filter. The denominator in equation (2.7) is coincident with the normalising CFAR term only if the data follow exactly the two-dimensional AR model. For the general case, where real or simulated data with unknown spectral
shape are fed into the detector, the denominator must be replaced by the estimated clutter power /i(Y):
(2.11) The performance of the AR-based FIR filter with L = 3 taps is compared with the JDL with Kp — 3 and Mp = 3 in Figure 2.1b (solid lines). A few comments are in order: (i) the FIR filter with Q = 32 cells has the best performance (ii) the FIR filter with 2 = 8 cells shows a loss of 2 dB with respect to the FIR filter with Q = 32 and less than 1 dB with respect to the JDL with Q = 32, but it largely outperforms the JDL with 2 = 16 (iii) the FIR filter with Q = A cells is comparable to the JDL with 2 = 16. Comparatively, the JDL scheme suffers high adaptivity losses that largely compensate the little advantage shown in the IF. The case of L = 6 could achieve better performance, but it also requires slightly more secondary data (larger Q) corresponding to the larger number of filter parameters to estimate. An important point to be noticed is that the AR-based FIR filters take great advantage from a much more accurate estimate of the clutter covariance matrix. This is obtained by using the sample matrix My, which contains the average made inside the CPI (other than on the range cells), and follows from the exploitation of the AR properties. The possibility of operating with 2 = 8, instead of Q = 32, secondary data largely reduces the probability of having interfering targets in the secondary data. Moreover, clutter edges have very little influence on the detector performance. On the other hand, when interference is present, its effect is more devastating than with Q = 32, since it is not largely averaged with uncontaminated data. This is apparent from Figure 2.1a, where we consider the unfavourable case of an interfering target present in the secondary data of the AR with Q = 32, 2 — 8 a n d 2 = 4. As expected, the reduction of Q implies a faster decrease in the detection capability. This is the price to be paid to reduce the probability of having interfering targets in the secondary data. Figure 2.5 shows the application of the adaptive FIR filters to the AAFTE data under the same condition as Figure 2.2. Figure 2.5a shows the output power versus range, when operating with L = 3, 2 = 8 for the same case as for Figure 2.2b; a comparable (slightly better) clutter cancellation is obtained with the AR-based FIR filter. Figure 2.5b depicts the corresponding test statistic that clearly shows the effective detection of the target. Figures 2.6a-c show the detection statistic for the target test configuration of Table 2.1, respectively with Q = 8, Q = 20 and Q = 32 secondary data.
output power, dB test statistic, 10 log , 0 (T)
range bin
b
Figure 2.5
range bin
Results of the AR-based two-dimensional FIR filter against recorded data by AAFTE for 1 target injected and Q = 8 a Output power versus range b Test statistic versus range
The following considerations are in order: (i)
The AR-based FIR filter with Q = 8 is slightly worse than the JDL with Q = 20 (Figure 2.3c). (ii) The AR-based FIR filter with Q = 20 detects all targets of Group I except for set A, while it fails to detect the smaller targets in Group II, but has less false alarms than does the JDL with Q = 20. (iii) The AR-based FIR filter with Q =32 is comparable to the JDL with Q =32. Notice that it also loses set A, which has higher power than the clutter and could be detected before cancellation. However, the number of false alarms is largely reduced with respect to the JDL (namely, setting a threshold to detect the same targets, it yields a lower number of false alarms). Therefore, the use of two-dimensional FIR filters with short temporal support and limited adaptivity losses does not solve all of the problems. A number of techniques have been investigated in the literature to cope with the problems deriving from the presence of interfering targets, by properly selecting or discarding the range cells of secondary data (see for example References 14 and 15). However, these techniques appear to have a very high computational cost. Alternative techniques are investigated below, based on a non-linear combination of filters.
test statistic, 101og10(T) test statistic, 101og10 (T)
range bin
test statistic, 101og10 (T)
range bin
c
range bin
Figure 2.6
2.4
Test statistic versus range for the AR-based two-dimensional FIR filter against recorded data by AAFTE for the targets test configuration of Table 2 J a Q=S b Q = 20 c Q = 32
Non-linear combination of non-adaptive filters
When considering the expected characteristics of the space-time cancellation filter, the following considerations are in order. Whichever the radar and clutter characteristics are, the clutter spectrum mainly appears as a ridge with known shape in the Doppler frequency-angle plane that is modulated by the transmitting antenna
pattern. The slope of the F-coscp clutter trajectory (namely the footprint in the F-cos
a bank of P fixed filters is designed, with notches of different widths, and normalised so as to give the same output when fed with the expected target (ii) the average power at the filters' output is evaluated on Q range cells (iii) the filter with minimum output power is selected; notice that, due to the normalisation, at point (i), the filter with the widest notch does not necessarily yield the minimum output power (iv) the output of the selected filter corresponding to the CUT is scaled by the corresponding normalising CFAR term and compared with the detection threshold to test for the target presence. The scheme of this filter is depicted in Figure 2.7 for a bank of P = 3 filters: filter A, filter B and filter C. This non-linear combination of non-adaptive filters has apparently a very limited level of flexibility, consisting in the automatic selection of the filter with minimum output power for each range cell [17]. However, if the filters are carefully designed, this detection scheme can yield very convincing results. Its computational cost is P times higher than for a fixed weights linear filter, but much less than for the OAP filter, which requires the real-time inversion of the KM x KM estimated clutter covariance matrix.
2.4.1 Filter bank design To design the filter bank, it is useful to recall that the clutter covariance matrix Q is a Toeplitz block matrix with blocks related to the spatial channels. We assume a spatially homogeneous scene and consider a ratio a = d/(2vT) between spatial and temporal sampling intervals, being d the antenna spacing and v the platform velocity [1,18]; a is also referred to as the space-time slope. In general, for side-looking configurations, Q is obtained from the usual space-time correlation coefficient expression [19]: (2.12)
where pt(At) is the temporal correlation coefficient, which depends on the ICM and is a function of the temporal displacement At = mT of the two clutter echoes. ps[At + As/(2v)] is the spatial correlation coefficient, which is independent of the ICM and encodes the decorrelation due to the combined effect of the radar platform motion (and thus of the different view angle over the scene) and the antenna pattern. Since this term depends on the spatial position of the receiver, it is a function of both the temporal displacement At (which implies a different position of the platform) and the spatial displacement of the antennas in the array As. Let us assume that: (i) the nominal clutter returns describe a diagonal of the Doppler-angle plane with slope a, (ii) the target DoA is cp, and (iii) the expected normalised target Doppler frequency is F; then a suitable two-dimensional filter has the main beam at (
clutter filter null COS(T)
Figure 2.7
Sketch of the non-linear space-time processor a Filter A b Filter B c Filter C d Detector scheme
being 8(x) the Kronecker delta function, and appropriate pt(At) correlation function. In particular, a Gaussian filter bank is obtained by considering a Gaussian shaped Pt(At) with one-lag correlation coefficient pt. To this purpose, we select P = 3 values: pt\ = 0.9, ptB = 0.99 and ptQ = 0.999, and we build, respectively, the covariance matrix QA, of filter A, QB of filter B and Qc of filter C, corresponding to clutter echoes with different intensity of internal motion. The optimum processor is obtained by a linear combination of the KM echoes with weights w = Q - 1 S. Therefore, a Gaussian filter bank is easily obtained by considering P = 3 optimum filters corresponding to the three covariance matrices above. To properly select the best filter in the bank, each filter is normalised so that it produces unity output when fed with the target steering vector, s. As mentioned above, this normalisation guarantees that the non-linear scheme does not always select the filter with the widest notch, thus yielding a performance degradation of the very slow targets. The instantaneous power for the CUT at the output of filter A is:
(2.13) The CA-CFAR normalising term for this filter, evaluated over Q adjacent range bins, is an estimate of the clutter power at its output and is given by: (2.14) where the explicit dependence on x refers to the selection of the samples y&, adjacent to the CUT with echo vector x. Similar expressions are valid for filters B and C. The non-linear detector selects the filter with the minimum estimated clutter output power on the Q adjacent range cells:
(2.15) Finally, the non-linear scheme compares it to the CA-CFAR detection threshold corresponding to the selected filter. In analogy to the case of the GLRT, it can be written in terms of test statistic:
(2.16)
which is to be compared to the threshold G, to test for detection. The set of three twodimensional filters is repeated for the different values of the expected target Doppler frequency.
2.4.2 Detection threshold and performance A closed-form expression of the false alarm probability (as well as of the P& for Swerling I targets) has been obtained for P = 2 filters [7]. Define the clutter power at the output of the two filters as o\ = E{Z2A},G^ = £{Z|}, the output cross correlation coefficient, p = \E{ZAZ^}\/(GA<JB), and the target power, erf. When selecting the filter based on the Q adjacent range cells, Pd = PA + PB, where: (2.17) with GA 0
=
GB
V W B + 1 + (1 -
=
G, aA 2
P )MA1
2
= 2
G2JaI - [1 + (1 - P 2 )MAL £A
=
2
- 4 p a i / ^ , and /xA = G A /(1 + crf/a A). PB is
obtained from PA by exchanging suffix A with B. The Pfa is easily obtained by setting at = 0. Even in the presence of P > 2 filters, assuming that the clutter background is approximately homogeneous, the non-linear filter has to choose essentially between the two filters which are closest to the ideal optimum, while the others give a negligible contribution to the detection performance. When evaluating the performance, the theoretical values of the filters are known, so that we are able to choose the two correct filters and apply the theoretical formula. Also, the detection threshold can be easily set using equation (2.17). Figure 2.8a shows the power at the output of the non-linear scheme, Z(x) (see equations (2.13) and (2.15)), when operating with Q = 20 against the AAFTE data file with an injected target as in Figure 2.2b. It is now apparent that the output power has a low fluctuation level and performs at least as well as the JDL with Q = 32, even if it only operates selecting the most appropriate filter out of a set of three. Moreover, the target is now clearly visible. Obviously better cancellations could be achieved by using full adaptivity and larger regions of secondary data, at the expense of higher computational cost. However, when dealing with small regions of data we show that the behaviour of the non-linear non-adaptive detection scheme is comparable to the best results obtained by the reduced DoF techniques. The behaviour in terms of detection is easily verified from the test statistics, 7\x) (see equation (2.16)), in Figure 2.8b, which shows that the non-linear processor properly detects the target. The filter selection made by the non-linear non-adaptive detector is portrayed in Figure 2.8c. The narrowest filter (C) is selected only for a fraction of returns along range and mainly inside the land area. In contrast, in the sea area, there are large regions where the widest filters (A) and (B) are selected (bins 50 to 180 and 290 to 390). Moreover, in the highly non-homogeneous areas, the selection largely oscillates among all three filters; this certainly applies to the sea to ground transition (bins about 450 to 480) and to the land region in cells 810 to 890, containing probably a river bank.
output power, dB test statistic, 101og10(T)
range bin
filter selection
range bin
c
Figure 2.8
range bin
Results of the non-linear non-adaptive Gaussian-based filter versus range against recorded data by AAFTE for 1 target injected and Q = 20 a Output power b Test statistic c Non-linear filter selection
2.4.3 AR-based non-linear detector Although the use of filters based on Gaussian-shaped autocorrelation functions is a simple and effective way to obtain a set of similar filters with notches of different width, two considerations are in order: (i) the Gaussian filters are generally not an optimal choice, since the Gaussian model is arbitrary, and (ii) the filters have the
full implementation cost of MK complex products, since they have MK non-zero coefficients. A significant reduction of the implementation cost can be achieved by replacing them with FIR filters with a small temporal support. A non-adaptive version of the two-dimensional AR-based FIR filters presented in Section 2.4 can be used to this purpose. To derive the filter coefficients for this case, we proceed as follows. From the theory of the stationary two-dimensional-AR processes, it is known that the relationship between AR parameters, A and R, and the covariance matrix Q can be written as H • QAF = [0K • • • 0K R], where QAF = £ { x ^ x ^ T } for any value of m and is coincident with the generic rath (KL x KL) block on the main diagonal of Q. Therefore, H + R" 1 = Q ^ [ O * • • • O^ I*]* so that H* R" 1 is the last (KL x K) block column of Q ^ , and R" 1 is the last (K x K) block in the main diagonal of Q^f. An AR-based FIR filter bank can be obtained by using the relationships above to approximate the two-dimensional Gaussian filters with twodimensional AR-based FIR filters with L taps. Figures 2.9a,b show, respectively, the output power and the test statistic for the same case as Figure 2.8a,b. As apparent, with L = 3 taps the clutter cancellation is very similar to the case of the Gaussian filters and the target is easily extracted by the filter. The corresponding filter selection is reported in Figure 2.9c. In this case, the selection shows fewer fluctuations than in the case of the Gaussian filters and the wider filter A is selected most of the time. The filter selection is still largely fluctuating in most non-stationary clutter areas. The performance of the non-linear non-adaptive detection scheme based on the bank of AR FIR filters against the target test configuration of Table 2.1 is portrayed in Figure 2.10, to investigate the robustness of this detection scheme to the presence of interfering targets in the secondary data. Notice that the secondary data are only used here to select the best filter in the bank and to provide the CFAR normalisation. Figure 2.10a shows the test statistic as a function of the range cell. As apparent, the targets of Group I are properly detected, but the smallest targets of Group II are not detected by the filter. Notice also that set I is not clearly extracted; this is probably due to its closeness to a highly homogeneous area of ground clutter (probably containing the river banks). The filter selection is expected not to be affected by the presence of the interfering targets in the secondary data, since all filters in the bank are normalised so as to yield unity gain for the desired target. Therefore, the degraded performance of the technique must be mainly attributed to the interfering targets affecting the CA-CFAR normalisation. To counteract this effect the following scheme is proposed: (i) (ii) (iii)
(iv)
select the best filter as before (the filter yielding the minimum output power, as estimated on all the secondary data) split the secondary data into P blocks of Q cells evaluate the P normalised filter outputs, obtained by dividing the filter output from the CUT by the mean filter output on each block of Q range cells (this is the CA-CFAR output obtained by considering only the secondary cells of the individual block) take the median of the P normalised filter outputs.
t a
I o
range bin
P O
i O
range bin
I 1 c
Figure 2.9
range bin
Results of the non-linear non-adaptive AR-based filter versus range against recorded data by AAFTE for 1 target injected and Q = 20 a Output power b Test statistic c Non-linear filter selection
The median operator is selected in order to reject the anomalous outputs that are corrupted by the presence of the interfering target. This is obviously based on the assumption that only a small number of blocks is expected to be affected by the interfering targets. The resulting test statistic is shown in Figure 2.10b for P = 5 blocks with Q = 4 cells, for a total of 20 secondary data. As apparent, almost all targets are clearly extracted and a threshold could be selected to detect all of them without causing false alarms. This is not quite true for target set I, due to the above mentioned
test statistic, 101og10(T) test statistic, 10 log 10 (T)
range bm
block selection
range bin
c
Figure 2.10
range bin
Results of the non-linear non-adaptive AR-based filter versus range against recorded data by AAFTE for the targets test configuration of Table 2.1 and Q = 20 a Test statistic b Test statistic with block selection c Block selection around target set D
non-homogeneity. Notice also that the strong effect of the target in the CFAR scaling factor, yielding very low returns around the target positions in Figure 2.10a, is not present in Figure 2.10b. To demonstrate the effectiveness of this median selection for the CFAR, Figure 2.10c shows the selected CFAR block versus range cells, around the targets of set D. The small circles represent the blocks affected by interfering targets,
and the solid line represents the block selected by the median operator. As apparent, the blocks affected by interfering targets are never selected. As a final step in the non-adaptive non-linear detector, we notice that the selection of the best filter of the bank can be done without using the secondary data. The non-linear selection strategy can operate directly by choosing the filter yielding the minimum output power on the CUT, after normalisation as in equation (2.13), namely: Z(x) = min{ZA(x), Z B (x), Z c (x)}
(2.18)
Test statistic, 10 log.0 (T)
The same CA-CFAR scheme above is then applied, consisting of the normalisation by the median of the average output powers on the P blocks of Q range cells. The resulting test statistic is reported in Figure 2.11a, which is largely comparable with Figure 2.10b. Similarly Figure 2.11b, shows the correct selection of a block of secondary data not affected by interfering targets. As apparent, this filter yields a further reduction in computational cost, since only one of the three filters must be applied to all range cells. Moreover, it yields a very effective detection scheme that is robust to the presence of interfering targets. However, target set I is still not detected which is due to the clutter non-homogeneity.
block selection
range bin
b
range bin
Figure 2.11 Results of the non-linear non-adaptive AR-basedfilter with block selection and test on the CUT versus range against recorded data by AAFTE for the targets test configuration of Table 2.1 and Q — 20 a Test statistic b Block selection around target set D
2.5
Non-linear combination of adaptive AR-based two-dimensional FIR filters
From the analysis of the previous sections, we observe that both AR-based adaptive FIR filters and non-linear combinations of non-adaptive filters are able to work with a limited amount of secondary data, moreover: (i) The non-linear bank of non-adaptive filters performs very well against the nonhomogeneous background and in the presence of interfering targets, but it largely depends on the appropriate selection of the shape of the filters in the bank. The filters should be changed in accordance to the platform flight conditions (pitch, bank, roll, crab, wind speed, ground speed), and the radar parameters (antenna pointing, PRF). Moreover, it might happen that none of the used filters is really appropriate to clutter cancellation, with the consequent performance degradation. (ii) The AR-based adaptive FIR filters are able to adapt to any kind of mismatch, such as the variations in platform flight conditions and radar parameters. However, their performance is largely degraded by the presence of interfering targets in the secondary data. To obtain a robust detection scheme that at the same time copes with the changing radar and platform conditions and is not largely affected by the presence of interfering targets in the adjacent range cells, non-linear adaptive schemes (in contrast to the nonlinear non-adaptive schemes of Section 2.4) are presented. In these adaptive schemes, a set of P AR-based FIR filters is obtained by estimating the covariance matrix on P different blocks of Q range cells around the CUT and estimating the corresponding matrices A and R to be used inside equation (2.8). Two non-linear adaptive schemes are obtained by applying a non-linear selection among the outputs of P linear adaptive filters, according to two possible criteria [20]: Minimum residual power (MRP) Each filter is normalised so as to give the same output to the received target, as for the non-linear combination of non-adaptive filter of Section 2.4. Thus, for filter A:
(2.19) where HA and RA are estimated from the first block of Q secondary range cells; similarly ZB(X), ZC(X), . . . are obtained from the second, third, . . . block of Q secondary data. Then, each of the P filters is applied to the CUT and the non-linear scheme selects the filter that yields the minimum output power. Finally, the selected
filter is normalised by the CFAR scaling factor computed on the range cells of the corresponding block; namely, for filter A:
(2.20)
before comparing to the detection threshold. In the presence of interfering targets in one or more blocks of secondary data, the filters obtained from the estimates on the contaminated blocks yield a worse clutter cancellation than the filters based on the non-contaminated blocks. Therefore, the latter filters are selected, which makes the non-linear adaptive detector robust to the presence of interfering targets. Median test output (MTO) The output of the CFAR test statistic 7} = Z///x r , i = A, B, . . . is evaluated for each of the P filters (see Figure 2.12 for P = 5 blocks yielding filters A to E). The median value of the P outputs on the CUT is selected and compared to the detection threshold. Assuming that interferences don't affect more than (P — l)/2 blocks of secondary data, namely (P — l)/2 filter outputs, the median output should correspond to a non-contaminated AR filter. The performance of these non-linear adaptive detectors that use the AR-based FIR filter are reported in Figures 2.13a and 2.13b, respectively, for Q = 4 and Q = 8 cells in each of the P blocks. In both Figures the P& of the linear AR-based FIR filter with global number of cells 2 = 4, 8,32 is reported for comparison. We notice that the MRP detector operating with blocks of Q cells always provides a P$
median
CUT
Figure 2.12
Scheme of the MTO detector with P = 5 blocks of Q secondary cells
ARQ = 32 ARQ = 8 ARQ = 4 MRPP = 2Q = 4 MRPP = 4Q = 4 MTOP = 3Q = 4 MT0P = 5O = 4
SNR, dB ARQ = 32 ARQ = 8 ARQ = 4 MRPP = 2Q = 8 MRPP = 4Q = 8 MTOP = 3Q = 8 MTOP = 5Q = 8
b
Figure 2.13
SNR, dB
Comparison of non-linear adaptive schemes to the linear AR-based filter with Q = 32,8,4. Pd versus SNR for K = 3, M = 64, target at (p = 45° and F = -0.156 a MRP and MTO with blocks of Q = 4 b MRP and MTO with blocks of Q = 8
comparable with the linear FIR filter with the same total number of cells Q; this is almost independent of the value of P. In contrast, the MTO detector operating with P blocks of Q cells, is approximately comparable with the linear AR-based FIR filter operating with P • Q cells. Figures 2.14a and 2.14b show, respectively, Pfa and P& for the case of one interfering target in the secondary data, as a function of the INR, using blocks with Q = 4
JDL MSMI Q = 32 ARQ = 32 ARQ = 4 MRPP=2Q=4 MRPP=4Q=4 MTOP = 3 Q = 4 MTOP = 5 Q = 4
MRPP=2Q=4 MRPP = 4 Q = 4 MTOP=3Q = 4 MTOP = 5 Q = 4
INR, dB
INR, dB
MRP P=2 Q=4 MRP P=4 Q=4 MTO P=3 Q=4 MTO P=5 Q=4
MRP P=2 Q=4 MRP P=4 Q=4 MTO P=3 Q=4 MTO P=5 Q=4
probability of correct selection
probability of correct selection
JDL MSMI Q=3; AR Q=32 ARQ=4 MRP P=2 Q=4 MRP P=4 Q=4 MTO P=3 Q=4 MTO P=5 Q=4
MRP P=2 Q=4 MRP P=4 Q=4 MTO P=3 Q=4 MTO P=5 Q=4
INR, dB
Figure 2.14
Comparative performance of linear and non-linear adaptive schemes using AR-based FIR in the presence of interfering targets a Pfa versus JNR for one interference b Pd versus JNR for one interference c Probability of correct filter selection for one interference d Pfa versus JNR for two interferences e Fd versus JNR for two interferences / Probability of correct filter selection for two interferences
data. Notice that the Pfa of the standard JDL, as well as of the linear AR-based FIR filter, shows a fast decrease, which also implies a corresponding loss of the detection performance (as observed from Figure 2.1a). As apparent, all non-linear adaptive techniques show a Pfa fairly constant and independent of the interference power.
The Pd of the MTO detector both with P = 3 and P = 5 clearly outperforms the MRP detector with both P = 2 and P = 4. Moreover, the MTO detector is almost unaffected by the presence of the interference; MRP shows performance degradation when the INR is not large enough to allow a clear identification of the corrupted block of secondary data. As expected, a larger P yields better performance for both schemes, which is paid in terms of an increase of computational cost. Figure 2.14c shows the probability of correct filter selection, namely the probability of selecting a filter with coefficients computed from a block not affected by the interfering target. As apparent, the MTO non-linear adaptive detector always selects uncorrupted blocks, whereas the MRP detector is unable to perform a correct selection before the ESFR is high enough; this causes its performance degradation. Figures 14d-f show similar plots for the case of two interfering targets, displaced so as to affect two different blocks of secondary data. As apparent, only the filters with P > 4 are able to maintain a false alarm rate approximately constant (Figure 2.14d); moreover, only the MTO detector with P = 5 yields a high detection performance (Figure 2.14e). This is also confirmed by the probability of correct filter selection in Figure 2.14f. Notice that with this new non-linear adaptive filter, globally P • Q secondary data are used, so that the probability of including interfering targets is P times larger than when using a single block of Q secondary data. However, in the presence of closely spaced target sequences these interferences cannot be avoided and the presented non-linear adaptive filters are effective despite their presence. This can be easily verified from Figures 2.15 and 2.16 which show the results of the nonlinear adaptive AR schemes against the real data set, with the test configuration of Table 2.1. The non-linear adaptive MRP detector with P = 2 blocks of Q = 16 cells and the non-linear adaptive MRP detector with P-A blocks of Q = 8 are shown in Figures 2.15a and 2.15b, respectively. They yield a very similar detection capability except for set G, where all targets are extracted when operating with P = 4, and the central target is lost when operating with P = 2. In fact, in the latter case both adjacent blocks of secondary data are affected by the strong interfering targets, while the central target of set G is small. Figure 2.16a shows the test statistic of the MTO detector with P = 3 and Q = 10. As apparent, all targets are extracted; when setting a threshold to detect all of them, only a single false detection cannot be avoided. Figure 2.16b shows the result obtained using the MTO detector with P = 5 blocks of Q = 4 cells. As apparent, the targets are extracted better from the clutter and they can be very easily detected by setting a threshold, without suffering any false alarm. Finally, Figure 2.16c shows that increasing the size of the individual blocks does not yield any further performance improvement. Figures 2.17a and 17b show the filter selection around target set D, corresponding to the results of Figures 2.15b and 2.16b, respectively. Notice that in this case there are some errors in the filter selection; however, these errors never appear in a cell containing a real target to be detected (cells 360 and 363). Moreover, the test statistic always has a low value in correspondence of such errors, implying that the filter operates correctly and no targets are detected in those range cells. This justifies the good performance of the non-linear adaptive detectors. It is also interesting to observe that both non-linear adaptive MRP and MTO detectors are
test statistic, 10 log 10 (T) test statistic, 101Og10 (T)
range bin
range bin
Figure 2.15
MRP test statistic versus range against recorded data by AAFTE for the targets test configuration of Table 2.1 a P = 2 and Q = 16 b P = 4 and 0 = 8
also able to correctly extract target set I, which was lost by the non-linear non-adaptive schemes. This can easily be explained recalling that this target set is close to a largely non-homogeneous area, containing a river and its banks. Under such conditions, the bank of adaptive filters (MTO with P = 5 blocks of Q = 4 cells) clearly outperforms the bank of non-adaptive filters (AR-based non-linear with median CFAR on P = 5 blocks of Q = 4 cells).
2.6
Conclusions
In the practical application of adaptive STAP filters against real data, one of the main problems is represented by non-homogeneity of the clutter background and the possible presence of interfering targets among the secondary data used to derive the filter parameters. The latter problem was recognised to be critical for the possibility of properly extracting sequences of closely spaced target echoes. Since the probability that interfering targets affect the secondary data is directly proportional to the number of secondary range cells, a first way to mitigate the problem can be to use STAP techniques that are effective even when operating with a very limited amount of secondary data. A two-dimensional FIR filtering scheme, based on the AR model for the disturbance, was considered to this purpose. The filter was shown to perform
test statistic, 101og10(T) test statistic, 101og10(T)
range bin
test statistic, 10 log 10 (T)
range bin
c
Figure 2.16
range bin
MTO test statistic versus range against recorded data by AAFTE for the targets test configuration of Table 2.1 a P = 3 and Q = 10 b P = 5 and Q = 4 c P = 5 and Q = 6
effectively against the real environment. Specifically for our case even with Q = S secondary data such a filter was shown to yield a detection performance very close to the JDL with Q = 32 secondary data. Moreover, the AR-based FIR filter is a pre-Doppler technique, where the optimisation is independent of the subsequent Doppler filter bank. This also requires a largely reduced computational cost, which is especially important for real-time applications. Unfortunately, when using only few secondary data, the possible presence of an interfering target among them produces a much more devastating effect. This is because the undesired contribution in the sample covariance matrix is averaged with
filter selection filter selection
range bin
b
Figure 2.17
range bin
Filter selection versus range against recorded data by AAFTE for targets test configuration of Table 2.1 (zoom around target set D) a MRPP = 4 a n d £ = 8 b MTO for P = 5 and Q = 4
a smaller number of uncontaminated data. Therefore, the use of two-dimensional FIR filters with short temporal support and limited adaptivity losses does not solve all of the problems, and alternative techniques were considered. In particular, a nonlinear non-adaptive scheme was considered, which is based on the selection of the most appropriate non-adaptive filter out of a bank of filters with different cancellation characteristics. The scheme was shown to yield a good clutter cancellation against the real data and to select the most appropriate filter even in the presence of interfering targets in the secondary data. However, the subsequent CA-CFAR normalisation is still largely affected by the presence of the interference. To solve the problem, a new non-linear non-adaptive filter was introduced that first selects the most appropriate non-adaptive two-dimensional cancellation filter and then selects a block of secondary data that is likely to be uncontaminated. This is obtained by splitting the secondary data in P blocks of range cells and applying the median operator to their mean power, based on the assumption that only a few range cells are expected to be contaminated by other targets. Moreover, to reduce the computational cost of the non-linear filter, the Gaussian filters are replaced with AR-based non-adaptive FIR filters and the filter selection is performed using only the output power corresponding to the CUT. This scheme is shown to be effective against the real data even when a few sets of closely spaced targets are injected. The only weakness of this filter consists in the limited capability of such a scheme to adapt to different and changing clutter conditions as well as to mismatches between assumed and real platform motion parameters and to potential calibration errors in the antenna array.
Finally, two new non-linear adaptive STAP schemes have been presented for the target detection against a non-homogeneous environment. These filters are obtained by selecting the most appropriate filter among a set of two-dimensional AR-based adaptive FIR filters, obtained by estimating the AR parameters on P different blocks of secondary data. Their effectiveness against the presence of interfering targets has been shown both by means of simulated analysis and by application to a real data set. The adaptive nature of the individual filters ensures that the non-linear adaptive scheme is able to cope with any mismatch in system and environment conditions. Moreover, the low computational cost of the proposed schemes, together with their robustness to non-homogeneity and interference, makes them appealing for the practical real-time application. 2.7
Acknowledgments
The authors gratefully thank Dr Alfonso Farina (AMS) for many fruitful discussions on these topics and for his encouragement in this work. They also acknowledge the possibility to use the AAFTE data set collected during the work of Dr Fred Staudaher and Dr Fred Lee and their group at the Radar Division of NLR. Finally, they acknowledge the collaboration with FIAR Spa that originated some of the theoretical and simulated results presented in Sections 2.3 and 2.5. 2.8
Appendix: ML estimation of two-dimensional AR parameters
Using the matrix Y, the joint PDF of the secondary data is simply written as:
(2.21) By defining the matrix L so that R it yields:
x
= LL* and M^y (2.22)
where B =
(2.23)
For convenience, we decompose the matrix MLY in blocks of size K(L — 1) and K, respectively:
(2.24)
Using equation (2.24), the value of B that maximises the exponent of equation (2.22), under the constraint of having the last ^-dimensional block equal to the identity matrix, is:
(2.25)
Therefore, we have the ML estimate A = M^00MyOi • Using this value, we have for the trace: tr{K*KMLY} = ?r{-M* y o l M-^ 0 0 M L y 0 i + M L n i } = f r f - L L ^ M ^ 1 ] ^ } (2.26) where [My ]^,L is the last KxK block in the main diagonal of the inverse of My. Therefore, the maximum of the PDF with respect to A can be rewritten as: max{/?0(Y)} =
(TT-K\R\)^M-L+1)
exp[-;r{R~ 1 PV^ 1JZ i l l
A
By maximising equation (2.27) with respect to R, 1/(2(M-L +I))[M"1]^.
(2.27)
'
it yields R
=
References 1 KLEMM, R.: 'Principles of space-time adaptive processing-principles and applications' (IEE, London, 2002, 2nd edn.) 2 KELLY, E. J.: 'An adaptive detection algorithm', IEEE Trans. Aerosp. Electron. Syst., 1986, AES-22, (1), pp. 115-127 3 BRENNAN, L. E., MALLET, J. D., and REED, I. S.: 'Adaptive arrays in airborne MTI radar' IEEE Trans. Antennas and Propagation, 1976, AP-24, (5), pp. 607-615 4 WANG, H. and CAI, L.: 'On adaptive spatial-temporal processing for airborne surveillance radar systems', IEEE Trans. Aerosp. Electron. Syst., 1994, AES-30, (3), pp. 660-670 5 LOMBARDO, P., GRECO, M. V., GINI, R, FARINA, A., and BILLINGSLEY, J. B.: 'Impact clutter spectra on radar performance prediction', IEEE Trans. Aerosp. Electron. Syst., 2001, AES-37, (3), pp. 1022-1038 6 FARINA, A., GRAZIANO, R., LEE, F., and TIMMONERI, L.: 'Adaptive space-time processing with systolic algorithm: experimental results using recorded live data'. Proceedings of international conference on Radar, Radar 95, Washington D.C., 8-11 May 1995, pp. 595-602 7 FARINA, A., LOMBARDO, R, and PIRRI, M.: 'Non-linear space-time processing for airborne early warning radar', IEEProc, Radar Sonar Navig., 1998,145, (1), pp. 9-18 8 FARINA, A., LOMBARDO, P., and PIRRI, M.: 'Non-linear STAP processing', Electron. Commun. Eng. J., 1999, 11, (1), pp. 41-48
9 KAY, S. M. and NAGESHA, V.: 'Maximum likelihood estimation of signals in autoregressive noise', IEEE Trans. Signal Process., 1994, SP-42, (1), pp. 88-101 10 LOMBARDO, P.: 4DPCA processing for SAR moving target detection in the presence of internal clutter motion and velocity mismatch'. Microwave sensing and synthetic aperture radar, EUROPTO, 2958, September 1996, Taormina, Italy, pp. 50-61 11 LOMBARDO, P.: 'Optimum multichannel SAR detection of slowly moving targets in the presence of internal clutter motion'. CIE-ICR'96, international Radar conference, Beijing, China, 8-11 October 1996, pp. 321-325 12 KLEMM, R. and ENDER, J.: 'Multidimensional digital filters for moving sensor arrays'. Proceedings IASTED on Signal processing and digital filtering, June 1990, Lugano, Switzerland, pp. 9-12 13 KLEMM, R. and ENDER, J.: 'Two-dimensional filters for radar and sonar applications'. Signal Processing V EUSIPCO, September 1990, Barcelona, Elsevier Science Publisher, B.V., pp. 2023-2026 14 WICKS, M. C , MELVIN, W. L., and CHEN, P.: 'An efficient architecture for nonhomogeneity detection in space-time adaptive processing airborne early warning radar'. IEE international Radar conference, Radar'97, October 1997, Edinburgh, UK, pp. 295-299 15 ADVE, R. S., HALE, T. B., and WICKS, M. C : 'Practical joint domain localised adaptive processing in homogeneous and nonhomogeneous environments. Part 2: Nonhomogeneous environments', IEEProc, Radar SonarNavig., 2000,147, (2), pp. 66-74 16 FARINA, A.: 'Linear and non-linear filters for clutter cancellation in radar system', Signal Process., 1997, (59), pp.101-112 17 FARINA, A., LOMBARDO, P., and CARAMANICA, F.: 'Non-linear nonadaptive clutter cancellation for airborne early warning radar'. IEE international Radar conference, Radar'97, Edinburgh, October 1997, pp. 420^24 18 KLEMM, R.: 'Adaptive clutter suppression for airborne phased array radar', IEE Proc. F and H, 1983,130, (1), pp. 125-132 19 LOMBARDO, P.: 'Echoes covariance modelling for SAR along-track interferometry'. IEEE international symposium IGARSS '96, Lincoln, Nebraska, USA, May 1996, pp. 347-349 20 LOMBARDO, P. and COLONE, F.: 'Non-linear STAP filters based on adaptive 2D-FIR filters'. IEEE Radar conference, Alabama, USA, May 2003, pp. 51-58
Chapter 3
Space-time techniques for SAR Alfonso Farina and Pierfrancesco Lombardo
3.1
Summary
This contribution describes the application of STAP (space time adaptive processing) to synthetic aperture radar (SAR) systems. SAR is a microwave sensor that allows us to have a high resolution mapping of electromagnetic (EM) backscatter from an observed scene. A two-dimensional image is provided in the radar polar coordinates, i.e. slant range and azimuth. High resolution in slant range is obtained by transmitting a coded waveform, with a large value of the time-bandwidth product, and coherently processing the echoes in a filter matched to the waveform. High resolution along the transversal direction is achieved by forming a synthetic aperture. This requires us to: (i) put the radar on board of a moving platform, e.g. an aircraft or a satellite (ii) record the EM signals from each scatterer which is illuminated by the moving antenna beam in successive instants of time (iii) coherently combine the signals - via a suitable azimuthal matched filter - thus focusing the sliding antenna pattern in a narrower synthetic beam [I]. The advantage of combining SAR and STAP is evident: the detected moving target is shown on top of the SAR image of the sensed scene. This Chapter describes in detail the multichannel SAR (MSAR). 3.2
Description of the problem and state of the art
In many applications (e.g. surveillance) of SAR, it is desirable to detect and possibly produce focused images of moving objects. A moving low RCS object is not easily detectable against strong echoes scattered from an extended fixed scene. When detected, its resulting image is smeared and ill positioned with respect to the stationary background. These shortcomings are a direct consequence of the SAR image
formation process. The cross-range high resolution in an SAR is obtained by taking advantage of the relative motion, supposed known, between the sensor and the scene. If, however, there is an object moving in an unpredictable manner, the image formation process does not function properly. Basically, the main degradations due to the target motion are: (i)
(ii)
The range migration through adjacent resolution cells (due to the radial velocity of target with respect to radar) causes a reduction of the signal-to-clutter power ratio, which can seriously impair the detection capabilities. Furthermore, range migration causes a decrease in the integration time and a consequent loss of resolution. Even in the absence of range migration, or after its correction, the phase shift induced by the motion causes: an ill-positioning (along track) of the target image with respect to ground, mainly owing to the range component of the relative radar-target velocity; a smearing of the image due to the uncompensated cross-range velocity and/or range acceleration.
An initial possibility that has been studied for discriminating the moving target signals from the fixed scene returns is on the basis of their different Doppler frequency spectra; see, for instance, Reference 2 and Reference 37. In fact, the target spectrum has a Doppler centroid approximately linearly proportional to the along-range velocity of target and a spectrum width depending on the azimuthal velocity and the radial acceleration components of target. Assuming that the radar pulse repetition frequency (PRF) is high enough to make available a region in the Doppler frequency domain not occupied by the stationary scene, the method works as follows (see Section 3.4.2 for details). First, transform a sequence of radar target returns to the frequency domain. Second, locate spectral bands, outside the narrowband frequency around the origin corresponding to stationary scene, and determine the centre frequencies of such bands. Third, translate each outlying spectral band to the origin, convert the resulting signal back to the time domain and correlate with the reference function of the conventional SAR. The correlator output will show the peaks in the correct locations of the targets. A refinement of this basic technique aims at the image formation of each target: it is obtained by also matching the width, not only the mean value, of the spectral band outside the stationary scene Doppler spectrum. The method suffers, however, from three shortcomings: (i)
It requires the use of a high PRF which causes a corresponding reduction of the SAR swath width and an increase of the data throughput. (ii) It does not correctly focus the image of a target having a quite arbitrary path. The spectrum of the target echoes alone is not sufficient; we need to know the instantaneous phase law to form the synthetic aperture with respect to the moving target. (iii) It does not succeed with a target whose motion has a small range velocity component, so that its spectrum is superimposed on the clutter (i.e. on the stationary scene) spectrum.
A distinct advantage of this system relates to the possibility of using it with conventional, non-multichannel phased array radar antennas. More powerful methods have been conceived to overcome these drawbacks; they are based on the use of more than one antenna, on board the moving platform, to cancel the clutter and detect the slowly moving targets. The radar system uses an array of antennas, mounted on the platform along the flight direction, and corresponding receiving channels. This makes available a certain number of space samples (echoes received from different antenna elements) and time samples (echoes collected at different time instants). These echoes are coherently combined, with proper weights, in a space-time processor to cancel the echo backscattered from the ground and enhance the target echo. Space-time effectively reduces the lower bound on the minimum detectable target velocity that would be established by using only frequency filtering. It measures the relative phase between two or more coherent signals, received from different antennas, rather than the Doppler frequency shift within a single receiving channel. Instead of using mono-dimensional filtering, clutter cancellation is the result of a powerful two-dimensional (in the temporal frequency, i.e. Doppler, and in the spatial frequency, i.e. azimuth angle) filtering. Furthermore, this method does not require necessarily to work with high PRF values. This is the STAP approach: see References 3, 4 and 5 for details. However, the conventional STAP techniques have to be extended to be made compatible with the MSAR case, which is intrinsically characterised by a long integration time, during which both target and clutter Doppler and direction of arrival (DoA) change. These characteristics are reviewed in Section 3.3, and Section 3.4.1 presents a taxonomy of a number of MSAR processing techniques proposed in the literature to jointly exploit the effectiveness of SAR and STAP techniques. The way to fully-fledged STAP wasn't immediate: after leaving the single-channel approach (Section 3.4.2) it passed through ATI applied to SAR (described in Section 3.4.4) and DPCA (see Section 3.4.3); both techniques being essentially based on the use of the echoes captured by two antennas. A refinement of ATI-SAR was the VSAR (velocity SAR) method [6]. In a way similar to the progression from a two-pulse canceller to a bank of Doppler filters to reject clutter and detect moving targets in a conventional search radar, the technique can be generalised to a linear array of identical antennas. For each channel a complex SAR image is focused. A Fourier transform along the physical aperture (i.e. the channel number) is applied to each pixel, and this corresponds to a multiple beam former. For each Fourier cell the related image shows the scene for a certain range of radial velocities (velocity SAR image [6]). This method wasn't, however, originally designed to suppress clutter, since no attempt is made to subtract signals. The requirement to cancel the echoes from stationary scatterers leads directly to adaptive space-time filtering. Two classes of advanced STAP techniques are considered in particular for the MSAR case: (i) time-domain reduced DoF (degrees of freedom) adaptive two-dimensional finite impulse response (FIR) filters (Sections 3.4.5.2, [7-11] and Section 3.4.6) (ii) frequency-domain reduced DoF adaptive processing [12].
Finally, Section 3.4.6 considers the optimum MSAR approach in a special case for the clutter characteristics, to gain insight into the properties of the optimum filter and its effectiveness. This can be used for comparison of the different filters and yields the upper bound for achievable performance. Once the presence of a moving target has been detected, we have to estimate its phase modulation law to be able to form a high resolution image of it. A method for providing the estimate is by means of time-frequency analysis of the received signal (Section 3.4.5.1). This, combined with STAP, brings us to the joint spacetime-frequency processing presented in detail in Section 3.4.5.2. The time-frequency representation is obtained by evaluating the Wigner-Ville distribution (WVD) of the signal. This distribution is a signal representation consisting in the mapping of the signal onto a plane whose coordinates are time and frequency. The WVD, in particular, produces a mapping such that the signal energy is concentrated along the curve of the instantaneous frequency. This frequency is obtained as the centre of gravity of the WVD; the instantaneous phase is derived by integration of the instantaneous frequency. The clutter echoes are cancelled by the adaptive space-time processor, where the space-time covariance matrix of clutter is estimated online and used to evaluate the optimal weights of the two-dimensional filter. The time-frequency analysis provides an estimation of the instantaneous frequency of the possibly present moving target, and - by integration - the original instantaneous phase. The phase is used to compensate for the shift due to the relative target-radar motion.
33
Model of MSAR echoes
3.3.1 Aberrations due to target motion There are some interesting effects that occur with moving targets. A moving target with a radial component of velocity vr results in a Doppler shift on each echo of: /o = ^
(3.1) C
where /o is the radar carrier frequency and c is the velocity of propagation. Thus the Doppler history of the sequence of echoes is shifted in frequency (solid line of Figure 3.1), and is matched filtered with a reference chirp (dashed line of Figure 3.1). This produces a shift in the azimuth position, which is given by the product of the target Doppler shift and the slope of the Doppler history
yb-£ = ^
0.2)
where v is the platform speed, r is the platform-target range and d is the along-track dimension of the antenna (the real aperture). It is well known that an image from an aircraftborne SAR of a moving train having a component of velocity in the range direction appears shifted, so it appears to be travelling not along the railway track, but displaced to the side! As another example, a ship in a satellite SAR image with a radial velocity of 10 m/s at a range of 1200 km would experience an azimuth shift of
5 CN
Il ^S
JO
I I ex
a O
1 synthetic aperture length = rl/d
Figure 3.1 A target with a radial velocity (solid line) is matched filtered with the SAR azimuthal reference chirp (dashed line) 1.7 km. On the other hand, the ship wake (which is stationary) appears in the correct position. Thus the ship appears not at the tip of the wake, but displaced in azimuth. This effect is visible in a number of spaceborne SAR images. From knowledge of the geometry, and of the azimuth shift, it is possible to estimate the target velocity.
3.3.2 Space-time-frequency representation The MSAR allows Doppler and angular information to be decoupled by means of the along-track interferometric processing. The derivation of optimum clutter cancellation schemes for moving target detection requires the definition of a model for the covariance of the MSAR space-time clutter echoes to take full advantage of this potentiality. Moreover, a proper representation space for the resulting MSAR space-time model is required to adequately interpret the results. Both models and representations are available separately for single-channel SAR, where the long observation time and the consequent non-linear phase history of the echoes are the main issues, and for multichannel systems [4, Chapters 2 and 3], which assume, on the contrary, a short integration time and focus on the angular characteristics. In this section, a simplified closed-form model for the MSAR echoes is described, which takes into account both non-linear phase modulation and angular position at the same time and defines a proper representation space for it. Even though simple to handle, the model encodes all the main characteristics of the scattering from the observed scene [13]. 3.3.2.1 Echoes from homogeneous correlated clutter A simplified condition is assumed in the following, for which the radiation patterns of transmitting and receiving antennas have the same width. This allows us to reduce again the analytical complexity of the analysis, while considering the effect of parameters. The simplified model is used in the present section to include the effect
TX/RX array antenna
Figure 3.2
The multichannel SAR observation geometry (PRT:pulse repetition time)
of the terrain reflectivity correlation. With reference to the slant plane geometry in Figure 3.2, assume that the radar transmitting antenna moves, with constant velocity v, along a straight trajectory at a fixed distance Ro from the g-axis (crossing the r-axis at time t = 0) and has a fixed pointing, orthogonal to the flight path. The sequence of echoes relative to the TV pulses, transmitted with pulse repetition time PRT, is received by a uniform linear array of K antennas, parallel to the radar trajectory. The K receiving antennas have separate receiving channels and phase centres at distance Sk = (k — (K — l)/2) A, k = 0 , . . . , K — 1 from the transmitter. A pointlike scatterer T is assumed to be for t = 0 on the g-axis at the position xq and to move with velocity components vq and vr along the two axes q and r, named along-track and cross-track. It is further assumed that: (i) the range cell migration is negligible or has been corrected for (ii) the two-way antenna pattern, for the channel k, has the Gaussian shape gk [y] = \/{y/nXq) exp[— (y — sk/2)2/X^\, Xqy being the along-track position of the scatterer relative to the phase centre of the transmitter (iii) quadratic terms in the receiving antennas displacements can be neglected and the Fresnel approximation is valid for the distance of the scatterer from the transmitter (narrow antenna pattern). Thus the echo received by the &th receiving antenna at time tn = (n — (N — l)/2) PRT, n = 0 , . . . , Af — 1 can be modelled as:
(3.3)
where Ao and O are constant amplitude and phase terms and fy = 2vr/X, the Doppler frequency due to the cross-track velocity for the wavelength X. Equation (3.3) takes into account the central Doppler frequency and DoA of the scatterer, which are the usual model for space-time processing techniques based on a short integration time. Moreover, the quadratic term encodes the time varying Doppler and DoA, observed in the long SAR integration time. The clutter echo Ck (t), received at time t and receiving element Jc, is obtained by integrating the echoes from the infinitesimal clutter patches of dimension dxq, modelled as equation (3.3) with instantaneous reflectivity Ao = IJi(xq, t)dxq. The correlation of the clutter echoes received at times tn x = t,tn2 = t+r and receivers in Sk1, Sk2 = Sk1 + 2rj is:
(3.4) the symbol (•) standing for the statistical expectation. The model for homogeneous correlated clutter is obtained in the hypothesis of factorisation of the spatial and temporal correlations, px(x) and pt(t), of the clutter reflectivity: (3.5) Using a Gaussian spatial correlation, with variance o2,px(x) = (\f2nGc)~x 2 exp[—x /(2 a*)], the double integration in equation (3.4) becomes a Gaussian function of the space-time displacement, z = (v — vq)r + rj9 of the two-way phase centre and is thus stationary in both space and time:
(3.6) where E 2 = 1/[1/X2 + (2nXq/(XRo))2] is the square of the maximum SAR resolution. By taking the bidimensional Fourier transform of equation (3.6) with respect to time: t —> f, and space: r\ —> sin(0)/X ^ 6/X, we obtain the clutter power spectral density (PSD) as a function of frequency / and angle 0: (3.7) where Ft [/], is the Fourier transform of pt(r). In the absence of temporal decorrelation (pt (r) = 1), P(f, 0) reduces to a sheet for / = fd + (v — vq)0/X, whose Gaussian shaped amplitude is controlled by the variance £ 2 + o2. This reduces to the square of the SAR resolution for uncorrelated reflectivity and grows with the spatial correlation a2 otherwise. This can be especially important for the new generation of VHF SAR systems, to which the reflectivity of a natural scene can appear more correlated than
to microwave sensors. It applies also in a number of remote sensing applications over smooth surfaces, such as sea or ice covered regions and in some planet exploration missions, where the correlation of the surface scattering is not negligible and also a coherent contribution can be present in the echo. This is evident from the behaviour of the PSD that for a\ —> oo behaves as a Dirac delta for 0 = 0. As expected, the mean cross-track velocity of the observed surface (as, for instance, the sea surface) produces a constant frequency shift, whereas a mean along-track velocity changes the slope of the clutter ridge. The presence of temporal decorrelation spreads the PSD around the straight line above, according to the function FVf/], and Xq controls its amplitude, affecting the angle over which the transmitted power is spread.
3.3.2.2 Representation of MSAR echoes Adequate representation strategies are available separately for single-channel SAR, where the long coherent integration interval (CPI) and the consequent non-linear phase history of the echoes are the main issues, and for multichannel MTI systems, which assume, on the contrary, a short CPI and focus on the angular characteristics. In the present section it is shown that the representation planes used for the systems above are inadequate to fully represent the MSAR echoes. The need for a proper representation space for the MSAR echoes is thus illustrated, which takes into account both non-linear phase modulation and angular position at the same time. The angle-frequency plane (O9 f) in Figure 3.3 is the preferred representation space for the clutter covariance in multichannel systems [4, Chapters 2 and 3]. A short observation time is assumed, so that Doppler frequency and DoA of the single scatterer are constant and the array of receivers is used to exploit the scatterers' angle. The PSD in equation (3.7) is directly mapped into this plane, which seems thus appropriate for homogeneous clutter. However, when the observation time is long, both DoA and Doppler frequency of the clutter echo from the single surface scatterer depend on time (equation (3.3)). This can't be represented in the (O, f) plane, where the contribution from each patch of an homogeneous clutter moves in time along a line with constant slope (see arrows in Figure 3.3). Since the scatterers that quit the antenna beam are replaced by new ones entering, on average, the same PSD is observed. Moreover, since the time-varying characteristic of the echo's DoA and Doppler spreads its power over a line in the (O9 / ) plane, the gain of the SAR coherent integration over the noise can't be related directly to Doppler or angular filtering in this plane. This applies also for the detection of targets, with a motion different from that of the clutter patches. The time-frequency plane (t, / ) in Figure 3.4 is the preferred representation space for single channel SAR, where a long observation time is considered. It allows the correct representation of the time-varying characteristic of the Doppler frequency but doesn't contain angular information. Thus the Doppler frequency / of the generic scatterer, shown in this plane, relates to a combination of cross-track velocity and angular position: this ambiguity cannot be resolved. This is shown in Figure 3.4, where solid and dashed lines represent the echo from the same homogeneous clutter
Figure 3.3 Frequency angle plane: typical of STAP (PRT: pulse repetition time)
Figure 3.4
Frequency-time plane: typical of SAR (PRT: pulse repetition time)
patch, received by different channels. The clutter edge is localised in time in the (t9 / ) plane and the SAR coherent integration gain is now clear. From the analysis above it can be seen that, to interpret the MSAR echoes properly, a higher dimensional space is required, which allows both Doppler frequency
Figure 3.5
Space-frequency-time volume: typical of MSAR (PRT: pulse repetition time)
and DoA to be represented as functions of time [13]. The space-time-frequency space (0, t, / ) , in Figure 3.5, is thus introduced. Its integration over the temporal axis collapses it back into the (O, f) plane, and integration over the angular axis collapses it into the (t, / ) plane. In this space the motion of the scatterer echoes in the Doppler angle limits of the antenna beam are correctly represented for both homogeneous and non-homogeneous scene: ( ). Moreover, the generic target echo: (_._), with velocity components different from the clutter background can be effectively represented in the volume of Figure 3.5 and discriminated from the clutter echoes. This representation can be used to get inside the characteristics of the different processing schemes for the processing of MSAR data and fully understand the effect of their application.
3.4
Processing schemes
In the following several processing schemes for the detection and imaging of moving targets are described.
3.4.1
Taxonomy of processing schemes for MSAR
A wide range of processing architectures has been derived in the last few years to detect and focus moving targets using SAR systems with K antennas, based on the digitised echoes received during the long sequence of N PRI from the Q range cells under analysis. The above mentioned architectures are characterised by different approaches to deal with the three main parameters of the received echo signals: alongtrack velocity (Doppler rate), cross-track velocity (Doppler), and DoA. A brief review of the different processing architectures is reported below.
3.4.1.1
1st processing architecture: optimum filter with very long integration time (due to the extended synthetic aperture required to achieve high spatial resolution) With respect to the conventional optimum STAP filter [4, Chapter 4], this optimum filter (see Figure 3.6) is characterised by a target steering vector with Doppler frequency and DoA that cannot be taken constant during CPI. As apparent from Section 3.3.2, with MSAR these quantities might change considerably during long CPI. This filter has the obvious advantage of yielding an optimum approach, however, it has the full computational complexity due to the three-dimensional filter bank (Doppler rate, Doppler, DoA). Only in special cases can this optimal solution be easily derived and interpreted [10, H]. Moreover, the adaptivity of the filters (ideally the filter weight vector is w = s^R" 1 where H stands for complex conjugate transpose) requires very many taps for each filter of the bank and a very large training area with homogeneous clutter, to estimate the clutter covariance matrix R.
3.4.1.2
2nd processing architecture: STAP filtering + coherent integration for SAR processing [9] This suboptimum scheme (see Figure 3.7) is obtained by using a STAP cancellation filter which performs the clutter cancellation and collapses the K channels into a single ideally clutter-free channel. A two-dimensional filter bank (Doppler rate, Doppler) follows to detect the target. This is based on the consideration that cross-track and speed both contribute to modify the linear phase term (Section 3.3.1). Moreover, the filter effect of STAP is related independently to cross-track speed and DoA. The effect of the STAP filter on the second stage is also analysed in Reference 14. This architecture is less computationally demanding and yields a reasonable suboptimum processing scheme.
K channels
Figure 3.6
target detection & imaging
Optimum processing scheme of MSAR
K channels
Figure 3.7
STAP: 3D filter bank: •DoA •Doppler •Doppler rate
STAP Cancellation filter
1 channel
2D filter bank: •Doppler •Doppler rate via Wigner-Ville
Space-time-frequency processing
target detection & imaging
3.4.1.3
3rd processing architecture: formation of SAR image & detection of moving targets This processing scheme requires the application of a conventional MTI clutter canceller on each channel separately. Then the formation of K SAR images is performed (see Figure 3.8), with the corresponding phase compensation (as it happens for Interfermetic SAR (InSAR)). This is followed by a one-dimensional (Doppler) filter bank for the final target detection. The reduced computational cost of this processing architecture yields largely degraded performance. It is especially difficult to compensate for the along-track speed of the target different from the clutter. This technique does not exploit the spatial processing dimension of the multichannel antenna for clutter cancellation. It is therefore (except for the signal to noise power ratio (SNR)) equivalent to a single-channel MTI SAR. 3.4.1.4
4th processing architecture: joint domain localised (JDL) for MSAR [15] The scheme is depicted in Figure 3.9. First a time-varying transformation is applied to the space-time data to compensate for the time-varying Doppler frequency of the clutter echoes. Therefore, a large reduction of degrees of freedom is performed in the Doppler-DoA plane, after two-dimensional FFT. This is followed by an adaptive optimum generalised likelihood ratio test applied in the reduced domain [16]. This
MTI
SAR
MTI
SAR
MTI
SAR
ID filter bank: •Doppler
K channels
Figure 3.8
Channelised MTI&SAR processing time-varying transformation
NK samples
Figure 3.9
2DFFT
discard data
Yes target GLRT
No target
to compensate for cluster of 2D beams adaptivity on generalised Doppler frequency in Doppler & spatial a reduced number likelihood and DoA variation frequencies domain OfDoF ratio test during CPI
Joint domain localised (JDL) on MSAR
RX-K
SAR processor 1
SAR processor 2
TX
SAR processor K
velocity processor
Figure 3.10
Velocity SAR
scheme has low computational complexity, close to optimum detection performance, and reduced size of required homogeneous clutter area. However, it requires a proper time-varying transformation and is less intuitive for targets with non-negligible alongtrack velocity component. 3.4.1.5 5th processing architecture: velocity SAR (VSAR) [6] As sketched in Figure 3.10, the set of complex video signals is processed by SAR processors to produce complex images: Yk(x, y), k = 0 , . . . , K — 1 with x, y azimuth, range coordinates. Phase of formed images is preserved. The post-processing of complex images brings out the velocity dimension. This step involves processing of each pixel for all images. Consider the whole set of images as a single threedimensional image: Yk(x9 y, k); the velocity processing applies along the rc-axis and provides the resulting three-dimensional image: Z(;t, y, v) in azimuth, range and velocity of the observed scene. Velocity processing in its simple form involves Fourier transformation of K points along the A>axis. Finally, each velocity plane is shifted by an amount proportional to its velocity to correct the motion-induced azimuth displacement. All the planes are summed to produce the final undistorted VSAR. The method is not designed to suppress clutter, since no attempt is made to subtract signals. The requirement to cancel the echoes from stationary scatterers leads directly to adaptive space-time filtering. 3.4.1.6 6th processing architecture: processing in the frequency domain This method finds its mathematical foundation in the stochastic stationary characteristic of the clutter processes that have natural description in the Fourier domain via the spectral density matrix. The Fourier transforms (FT) of the process for fixed
FFT
FFT
FFT receiving channels
spatial null for each frequency bin target match in space
target match in frequency Threshold
Figure 3.11 Frequency domain processing for MSAR [12] frequencies are asymptotically independent when the time base tends to infinity. Thus cross-correlations between frequency channels are negligible. Therefore, frequency processing is valid asymptotically, since the spectral representation provides a time signal extending from — to + infinity. For SAR we have in the limit an infinite sequence of equispaced samples along the pulse-to-pulse time (slow time). This is in contrast to the finite pulse train of airborne early warning (AEW) search radar. For azimuth and range compressions the FT is performed in any case. Moderate time base (= synthetic aperture length) is sufficient to make negligible the cross-terms between the frequency channels. In Doppler domain processing, the clutter energy is collected in one-dimensional subspace. This is the key to using projection methods (see Figure 3.11) with the advantage that the clutter is cancelled perfectly without sensitivity to its amplitude variations. A good estimate of the covariance matrix is achievable since there is a large number of range bins. Even though clutter to noise power ratio (CNR) varies with range, the structure of underlying clutter subspace remains unchanged. The computational load is affordable if only a few spatial channels (three is the minimum) are used for long sample sequences. SMI (sample matrix inverse) and eigendecomposition of 3 by 3 matrices are performed very fast. For details see Reference 12. 3.4.1.7
7th processing architecture: displaced phase centre antenna (DPCA) for SAR The concept of DPCA requires an antenna divided in two subarrays and the transmission of two pulses. Due to the aircraft movement, the transmitting and receiving
antennas form two bistatic configurations for the two pulses. If the subarrays' displacement, the platform speed and the radar PRF are properly selected, the global two-way phase centres of the bistatic antenna configurations are coincident. The stationary clutter is seen at two different time instants from the same radar location and it can be cancelled by subtracting the corresponding echoes. The DPCA corresponds to a kind of spatial MTI: returns from stationary objects cancel, while moving target echoes are maintained. The extreme simplicity of this approach is paid for by very strong constraints on the antenna, PRF and platform speed, which are often unacceptable. Moreover, the technique has no ability to compensate for any mismatch from this idealised condition by means of adaptivity. 3.4.1.8
8th processing architecture: along-track interferometty SAR (ATI-SAR) This operates similarly to DPCA using two displaced antennas connected to two receiving channels. An SAR image is generated for each channel; thereafter, the time delay between the azimuth signals is compensated for using two different azimuth reference signals. By multiplying the first image by the complex conjugate of the second, the remaining phase is zero for stationary objects and non-zero otherwise. Basically this technique is closely related to DPCA and STAR It may suffer from constraints imposed on the processing according to SAR requirements (SAR requires low PRF values to have a wide along-range swath width, and STAP needs large PRF to detect also fast moving targets).
3.4.1.9 9th processing architecture: pulse Doppler and MTI [2] This technique is derived from the well known moving target detector (MTD) processing widely used in conventional ground-based radar. After range compression, an azimuth processor is developed to detect the presence of a moving point-like target in each range cell and to measure its velocity components. The method is straightforward and simple to implement under the conditions that the target velocity is bounded within a minimum detectable value and the unambiguous value, and that range migration is negligible. In the following we will describe in some detail the following processing schemes: MTI + PD (Section 3.4.2), DPCA (Section 3.4.3), ATI-SAR(Section 3.4.4), joint space-time-frequency (Section 3.4.5), optimum filter for a special case (Section 3.4.6). We prefer to follow the historical derivation rather than the optimum approach from which all the suboptimum schemes can be derived.
3.4.2 MTI+ PD The material of this section is derived from Reference 2 which describes the work done by colleagues of the authors. The processing technique, referred to as moving target detection and imaging (MTDI), has been derived by the well known MTD processing widely used in conventional ground-based radars. A range-Doppler SAR processing is adopted; the raw data are first processed along the range direction and then along the azimuth. An azimuth processor is developed to detect the presence of
beam footprint
Figure 3.12
Geometry of SAR and moving target
a moving point-like target in each range cell and to measure its velocity components. The method is valid under the following hypotheses, namely: • •
the target velocity is bounded within minimum detectable and maximum unambiguous values (i.e. PRF/2) the SAR system and the moving target cause a negligible range migration.
It should be noted that with a one-channel antenna no detection of endoclutter targets (i.e. targets with Doppler frequency within the clutter spectrum) is possible. Increasing the PRF may not be desirable because of an increase in range ambiguities, and likewise for SAR applications. The mathematical model of the echo received by the radar antenna during the synthetic aperture time interval is reported in this section. Figure 3.12 sketches the geometry of an SAR system and a moving object of interest. The antenna, on board an aircraft, moves along the azimuth direction x. The antenna beampattern is directed orthogonal to the flight path; 0 (the off-nadir angle) is the angle formed by the normal to the ground and the line from the radar to the central point of the scene; Vx and vy are the velocity components of the target along the reference coordinates. Along the azimuth, the radar transmits pulse trains with repetition frequency PRF, each pulse having a linear frequency modulation (chirp). To account for some unpredictable changes in the environment wherein the transmitter operates, an unknown initial phase 0o» modelled as a random variate uniformly distributed in (0,2n), is assumed in the transmitted signal. At the nth azimuth position of the antenna, the transmitted pulse is expressed as: Tn(t) = A cos(2jtf0t + at2 + 0O),
~
X
-
(3.8)
where A and r are the pulse amplitude and the pulse width, respectively, /o is the carrier frequency, a = 2JTB/T is the chirp rate, and B is the chirp bandwidth. Consider now a point-like scatterer on a completely absorbing background; the echo received by the antenna at the nth azimuth position, after down frequency conversion, can be modelled as: Sn(t) = AlGOg0(O,
v sin 0 Fdc = - 1 Y -
F =
( 3 - 10 >
2(V - vx)2
* —5i*T~
(3 11}
-
where V is the platform speed, Vx and vy are the target velocity components respectively along the x- and y-axes, and Ro is the range between the platform and the scene centre. The above approximation leads to a useful expression for the bandwidth of the azimuth chirp, i.e.: Bd =
2{V Vx)
~
= FjrTi
(3.12)
LJ
with L being the along-track antenna length and 7} the integration time, i.e. the time during which the target is illuminated by the antenna main lobe. The signal Sn (t), sampled at the proper frequency Fc, is stored as a column in the holographic matrix S. The dimensions, Nsr and Nsa, of the matrix S represent the number of range and the azimuth samples, respectively. The entry Smn of S can be reparameterised to elicit the dependence on the Doppler centroid Fdc and the Doppler rate F^ r , namely: Smn = Alcrogo($,
(3.13)
where tm = m/Fc is the range time (fast time) and tn is the azimuth time (slow time). A conventional SAR processor, designed to provide the image of a stationary scene, works separately along the range and the azimuth directions. Specifically, at the first stage each column of the holographic matrix is convolved with the impulse response function (IRF) hr of the range filter yielding the range-compressed data. At the second
stage, each row of the resulting matrix is convolved with the IRF ha of the azimuth filter, thus providing the final image. Useful expressions for hr and ha, derived from equation (3.13), are: M*m) = expOa£),
m = 1,2,...,JV,
(3.14)
ha(tn)
n = l,2,...,Na
(3.15)
= exp(j27tFdr0tl),
where Fjro is the Doppler rate (equation (3.11)) for Vx = 0. The conventional processing fails when the convolutions are implemented by means of equations (3.14) and (3.15) on the signals coming from moving objects for which F& ^ F^rO a n d F^c 7^ 0. In MTD technique the Doppler frequency shift is used as a means of detecting moving targets embedded in a strong ground backscatter (clutter). A delay-time canceller behaves as a filter to reject the low Doppler components associated with the clutter and to preserve the high Doppler components of the moving targets. The canceller is cascaded with a bank of narrowband filters (channels), uniformly spread in the PRF interval. When matched filtering is performed, only that Doppler channel containing the target echo will supply a significantly non-zero output. These concepts are amenable to extension to SAR application: in fact, for sensing and imaging purposes, the stationary scene resembles clutter and the point-like moving object represents the target. It appears convenient to refer to this approach as MTDI, since it is the natural extension of MTD to SAR. However, some novel problems, arise when we deal with SAR. First, the representation of the signal returned by a scatterer results in an intrinsically two-dimensional problem. In some airborne cases, however, the dwell time is usually short and the range cell migration is avoided. Whenever the SAR system parameters do not induce range migration by themselves, a moving target could still be displaced over more range lines if its radial velocity is high enough to encompass more range lines during the integration time, i.e. if: (3.16) where Afrcm is the number of range migration cells and 8r is the range resolution of the SAR system. Assuming an airborne SAR with 7} = 0.2 s, 6 = 45°, 8r = 4 m, and substituting the numerical values in equation (3.16), a maximum target speed of 30m/s along the range direction is allowed to avoid range migration (Nrcm < 1). Under these assumptions the pattern is not truly bidimensional and the processing can be performed by a separate filtering of the rows and the columns of the holographic matrix. Another problem which arises when dealing with moving targets is the aliasing effect due to the broadening and shifting of the signal spectrum. Assume, in fact, that PRF = Bd'. in this case a radial velocity component of the target causes a Doppler shift according to equation (3.10) and a significant aliasing cannot be avoided. Furthermore, if the target velocity Vx is opposite to that of the platform, an increase of the nominal bandwidth occurs according to equation (3.12). As a consequence of these concurrent phenomena, the azimuth compression cannot be perfectly achieved and the target is not correctly imaged. If instead the
sampling frequency PRF is ^> Bd, then the discrete and the analogue representations can be considered equivalent. More precisely, PRF must be constrained by the inequality: (3.17) where F j c m a x and F^ r m a x are available from equations (3.10) and (3.11) on substituting the quantities Vx and vy with their maximum values. The basic considerations leading to the MTDI algorithm as a means of detecting moving targets are discussed below. When the system PRF complies with equation (3.17), the ground echo and the echoes of the moving targets are spectrally separated: the former is concentrated in [-Bd/2, BdIT) and the latter are allocated in (-PRF/2, -Bd/2) U (Bd/2, PRF/2). The scheme of the MTDI processor is shown in Figure 3.13. The raw data are stored in the holographic matrix and fed into the range processor. Then, the spectrum of the azimuth line is computed via a fast Fourier transform (FFT) and split throughout the Ndc channels, each one encompassing a bandwidth as large as Bd. The rejection of the ground echo is performed leaving out the channel centred on the zero Doppler frequency, after phase compensation of the platform speed V. The processing that follows performs azimuth compression on each stream of data. The procedure consists in evaluating the product between the data coming from a single channel and a suitable reference function, making an inverse Fourier transform and comparing the maximum value attained by the signal with a threshold. Finally, if a moving point is detected, a fine estimate of the chirp parameters - as discussed in
reference chirp FFT M/N pt.
to Doppler estimator channel 1 holographic matrix
range processing
reference chirp
range-compressed transformed data
IFFT
threshold
IFFT
threshold
Fin Fie, estimation image formation estimation
channel 2 Mpt. FFT
channel N IFFT
Figure 3.13 Processing scheme of MTDI [2]
threshold
Fdr,
F^
estimation
Reference 2 - is provided to improve the performance of the image-forming algorithm. The range of detectable velocities is limited between a minimum detectable and a maximum unambiguous values. The minimum and maximum velocities correspond to a Doppler displacement equal to the first and the last available Doppler channel centre frequencies, respectively. Conservative values (with respect to the off-nadir angle) are: (3.18) (3.19) For airborne SARs operating at millimetric wavelengths, values of 1 m/s for vm[n and 20 m/s for vmax are found using a PRF of 10 kHz. In Reference 2 the problem of probing the presence of a chirp signal embedded in a disturbance environment (backscattering by fixed scene and thermal noise) is modelled as a binary decision test; also the detection performance is found. Details are in the quoted Reference. A high quality image of moving targets calls for suitable procedures for estimating the chirp parameters. Here we introduce two algorithms for Doppler centroid and Doppler rate estimation. The algorithms are suited for airborne SARs and pointlike targets. The chirp parameters are related to the moving point velocities through equations (3.10) and (3.11), the knowledge of the former allows one to recover the latter. 3.4.2.1 Doppler centroid estimate The target radial velocity causes a Doppler frequency shift given by equation (3.10). The algorithm discussed here involves a correlation between the power spectrum of the azimuth signal and that of a reference signal, namely a chirp signal with no Doppler shift and nominal Doppler rate. The correlation peak is centred on the searched Doppler shift. More formally, the algorithm consists of the following steps: (a)
Discrete Fourier transformation (DFT) of the azimuth signal: (3.20)
where Nt,a is the azimuth FFT block length, and computation of the power spectrum: Sx(m) = \X(m)\2 (b)
(3.21)
Extraction of a subset of samples S(m) from Sx(m) according to the following rule: once the channel where detection occurs is selected, pick up all samples encompassing a bandwidth as large as 2 Bj centred around the channel reference frequency. So we are ensured that only a small amount of the signal energy is left.
(c)
Evaluation of the cross-correlation Rs(m) between Sim) and reference spectrum Sr(m)\ Rs{m) = S(m) ® Sr(-m)
(d)
(3.22)
where
where Q\ indicates the set of samples belonging to the lower frequencies (m = 1, . . . 5 ni, say) and ^2 indicates the corresponding set belonging to the upper frequencies {m = n\ + 1 , . . . , np, say) (np is the total number of points of Rs(m)). We select n\ so that the difference \e\ —£21 is minimum. The knowledge of n\ can be used for the evaluation of the Doppler centroid. 3.4.2.2 Airborne SAR auto foe us The Doppler rate is the other parameter required to perform a sharp focusing of the target echo. A widely used algorithm for Doppler rate estimation is based on the multilook technique; unfortunately, this technique does not work well if applied to airborne SARs. Consider, in fact, the Doppler rate resolution achievable by this algorithm [2]: (3.24)
where: Ni is the number of looks, A / is the centre-looks distance. Table 3.1 reports the values assumed by the above-specified parameters, together with the Doppler resolution, for typical spaceborne and airborne SAR systems. It is easy to observe that the resolution is good for Seasat SAR but poor for the airborne SAR. In Reference 2 a new algorithm is introduced which performs an accurate estimation of the Doppler rate for both spaceborne and airborne SARs provided that the Doppler centroid has been previously corrected. Consider the modulus of the mutual energy € between the received signal x(t) and the reference signal xr(t): (3.25) where /S is the Doppler rate to be estimated. The received chirp x(t, fi), after Doppler centroid correction, can be simply expressed as: X(t,Fdr)=exp{jFdrt2}
(3.26)
Table 3.1
fdr0 A/ Bd NL 8fdr
Some typical parameters for spaceborne and airborne SARs SEASAT SAR
Airborne SAR
520Hz/s 967Hz 1389Hz 4 0.8Hz/s
3112Hz/s 300Hz 400Hz 4 292Hz/s
If we choose: xr(t,l3) = exp{jl3t2}
(3.27)
and collect all inessential terms in a complex constant K, we obtain: (3.28) which can be approximated as: (3.29) Expression (3.29) can be regarded as the integration of Na pulses having the same modulus, but time-varying phase. The integration process leads to the maximisation of the cross-energy modulus if and only if (Fdr — /3) is equal to zero or, in other words, if the estimated Doppler rate is equal to the true one. Thus, the estimation rule amounts to evaluating the maximum of expression (3.29), with respect to j6, namely: (3.30)
where Qp is a suited domain for /3. After the estimation of Fdc and Fdr, the azimuth signal can be correctly compressed by means of a classical SAR azimuth processing to obtain the position and the reflectivity of targets. Moreover, vx and vy can be evaluated by applying equations (3.10) and (3.11). The algorithms above have been simulated to check their ability to generate correctly focused images of moving and stationary point-like targets over a fixed background; the successful results are reported in Reference 2.
3.4.3
DPCA
The concept of DPCA, displaced phase centre antenna, was conceived by F. R. Dickey, F. M. Staudaher and M. Labitt; in 1991 they received the IEEE-AESS Pioneer award for this invention [17]. One major application of DPCA is in the Joint-STARS (joint
two-element DPCA array position when the 1st pulse is transmitted subarray used for receiving echoes of 1st Tx pulse subarray used for receiving echoes of 2nd Tx pulse
'platform y ' platform
array position when the 2nd pulse is transmitted position of Tx/Rx phase centres for the two pulses
Figure 3.14
Working principle of DPCA
strategic target attack radar system) [18]. Figure 3.14 shows the working principle of the system; it depicts the radar antenna moving along track when two pulses are transmitted and corresponding echoes received. The antenna is divided into two subarrays; the whole antenna transmits two successive pulses, the corresponding echo of the first pulse is received by the fore subarray, the echo of the second pulse is received by the aft subarray. Due to the aircraft movement, the transmitting and receiving antennas form two bistatic configurations for the two pulses. However, if the antenna dimension, the platform speed and the radar PRF are properly selected, the phase centre of the bistatic antenna configuration for the first transmitted-received pulse coincides with the phase centre of the second transmitted-received pulse. Thus, the clutter is seen at two different time instants from the same radar location: the platform motion has been compensated, consequently the Doppler spectrum of clutter is not spread due to platform motion. The clutter is cancelled by subtracting the echoes of the first pulse received by the fore subarray from the echo of the second pulse received by the aft subarray; DPCA corresponds to a kind of spatial MTI. Stationary targets should cancel, but echoes from moving targets will give a non-zero result from the subtraction, so they should remain. Coe and White [19] have carried out a theoretical and practical evaluation of this technique. They have found that cancellation of the order of 25 dB is possible. The major DPCA sources of limitations are: loss of receiving aperture, PRF-velocity constraint, performance limited by subarray mismatch of radiation patterns, two degrees of freedom (DoFs) only, the system is not adaptive. STAP is the natural generalisation of DPCA, in fact it has the following features: • • • • •
generalisation of adaptive array of antennas for jammer nulling generalisation of DPCA for clutter cancellation freedom in shaping the null fewer constraints on the spacing of the antenna elements compensation of platform motion along (as DPCA) and orthogonal to the array thus avoiding clutter spectral spreading.
3.4.4 Along-track interferometry (ATI)-SAR In a similar manner to DPCA (see Section 3.4.3), along-track interferometry (ATI) uses two displaced antennas connected to two receiving channels. For each channel
an SAR image can be generated. The time delay between the azimuth signals can be compensated for during the azimuth compression using two different reference signals, incorporating the azimuth chirps generated in the two channels by a common point scatterer. If the first image is multiplied by the complex conjugate of the second, the remaining phase is zero for stationary objects and non-zero otherwise. If the two receivers are spatially separated by the distance J, the interferometric phase is approximated by tp = —(27t/X)d(vr/va), where vr is the radial velocity of the target. An example of parameters for ATI-SAR is: d = 1 m, va = lOOm/s, A = 3 cm, vr = 1 m/s, cp = 120°. ATI-SAR was originally developed to measure the speed of ocean surfaces; in Reference 12 the capability of SAR to detect slowly moving targets is shown on live data. In Reference 20 an in-depth analysis of the capabilities of ATI-SAR to detect moving targets is presented. In this section some analytical and simulation results concerning detection of a moving target are presented. ATI-SAR has the following limitation: to have a high phase sensitivity, the two antennas have to be widely separated, but this leads to a comb of blind velocities Vbhnd = kvah/d, where k is an integer: this limits the useful target velocity interval; moreover, the above mentioned distortions in the SAR image of moving target (Section 3.3.1) remain. The probability density function (PDF) of the interferogram phase (p for just stationary clutter and thermal noise has been found to be [21]: (3.31) where
Clutter and noise are independent random processes with zero mean and Gaussian PDF. Radar echoes i\, /2 received by the two antennas at the same time have a correlation coefficient y which depends on the endogenous correlation coefficient of clutter Yc and the clutter-to-noise power ratio CNR = Pc/Pn: thus the presence of noise reduces the clutter coherence. In practice, the interferometric phase is estimated by averaging L looks as follows: (3.32)
The corresponding PDF becomes:
(3.33)
where 2^i( # ) denotes the Gaussian hypergeometric function. For L = I equation (3.33) reduces to equation (3.31). Recently, a mathematical expression of the PDF has been presented in Reference 20 also for the case of target presence. In the following we illustrate some simulation results concerning the PDF of an interferogram in the presence of target, clutter and noise. The mathematical expression (3.33) plays an important role in finding the detection threshold for a prescribed value of false alarm probability (P/ fl ). By numerical integration the following numerical values for the threshold Th have been found: y = 0.9 -* Th = 1.0505; y = 0.98 -> Th = 0.3802; y = 0.99 -> Th = 0.2301; the prescribed false alarm probability, the clutter-to-noise power value and the number of averaged looks are: P/a = 10~4, CNR = 2OdB, L = 4. The variation of the detection threshold with the number of averaged looks for CNR = 2OdB, y = 0.98 and pfa = 10- 4 is as follows: L = 1 -> Th = 2.8414; L = 2 -> Th = 0.9505; L = 4^ Th = 0.3802;« = L -> Th = 0.3002; L = 10 -> Th = 0.2201. The target has been modelled as Swerling I. Figure 3.15 depicts the histogram of a phase interferogram for the following parameters: two antennas, L = 4, clutter Doppler phase = 0, CNR = 2OdB, y = 0.98, target Doppler phase: target = 1.3 rad. It is noted that when the signal-to-clutter power ratio (SCR) is very small in dB, the histogram coincides with the one predicted by equation (3.33); with the increase of the SCR the peak of the interferogram migrates on the target Doppler phase. Figure 3.16
Figure 3.15
Histogram of phase interferogram
SCR, dB
Next Page
cp, rad
Figure 3.16
Contour curves with constant Pd versus target Doppler phase and signal to clutter power ratio
illustrates the contour curves with constant detection probability versus target Doppler phase and SCR. The relevant parameters are: two antennas, CNR = 20 dB, number of averaged looks L = 4, correlation coefficient: y = 0.98, Pfa = 10~4 (detection threshold = 0.3802 rad). The following comments are in order: the contour curves are aliased in the interval cp = [—Tt9 Jt]; detection is practically zero for low values of the target Doppler phase because of the numerical value of the detection threshold; the detection increases with the increase of the SCR value. The next areas of research are in the field of removal of aliasing by using different values of carrier frequency [22]. In References 23 and 24 the possibility to do ATISAR with a one-bit coded SAR signal is demonstrated for moving target detection: this is another lively area of research; it has a bonus in terms of simplicity of processing implementation.
3.4.5 Processor in the space-time-frequency domain In this section a method for detecting and imaging objects moving on the ground and observed by an SAR is described. The method is based on the combination of two processing steps: space-time processing which exploits the motion of an antenna array for cancelling the echo from background, and time-frequency processing which exploits the difference in time allocation of the instantaneous spectrum corresponding to echoes from the ground or from moving objects, for an adaptive time-varying filtering and for the estimation of target echo instantaneous frequency, necessary for producing a focused image of it. The design and performance of the space-time
SCR, dB
Previous Page
cp, rad
Figure 3.16
Contour curves with constant Pd versus target Doppler phase and signal to clutter power ratio
illustrates the contour curves with constant detection probability versus target Doppler phase and SCR. The relevant parameters are: two antennas, CNR = 20 dB, number of averaged looks L = 4, correlation coefficient: y = 0.98, Pfa = 10~4 (detection threshold = 0.3802 rad). The following comments are in order: the contour curves are aliased in the interval cp = [—Tt9 Jt]; detection is practically zero for low values of the target Doppler phase because of the numerical value of the detection threshold; the detection increases with the increase of the SCR value. The next areas of research are in the field of removal of aliasing by using different values of carrier frequency [22]. In References 23 and 24 the possibility to do ATISAR with a one-bit coded SAR signal is demonstrated for moving target detection: this is another lively area of research; it has a bonus in terms of simplicity of processing implementation.
3.4.5 Processor in the space-time-frequency domain In this section a method for detecting and imaging objects moving on the ground and observed by an SAR is described. The method is based on the combination of two processing steps: space-time processing which exploits the motion of an antenna array for cancelling the echo from background, and time-frequency processing which exploits the difference in time allocation of the instantaneous spectrum corresponding to echoes from the ground or from moving objects, for an adaptive time-varying filtering and for the estimation of target echo instantaneous frequency, necessary for producing a focused image of it. The design and performance of the space-time
filter is described in References 25-27 and 9. Some of the theoretical aspects of STAP of interest for this chapter are discussed also in a number of publications; Reference 11 discusses the optimum processing scheme for a special case and an approximation of it realised with a two-dimensional FIR (finite impulse response) filter. Other relevant References are 28-30. Here we concentrate on the time-frequency step (Section 3.4.5.1) and on the joint space-time-frequency (Section 3.4.5.2). 3.4.5.1 Joint time-frequency analysis by Wigner-Ville distribution (WVD) The aim of this section is to show how time-frequency representation by WVD of the echoes received by an SAR provides a useful tool for detection of moving objects and estimation of instantaneous phase shift induced by relative radar-object motion [27]. The phase history is then used to compensate for the received signal and to form a synthetic aperture with respect to the moving object, necessary to produce a high resolution image. A method for extracting the instantaneous phase is based on the time-frequency (TF) distribution of the received signal. The WVD has been chosen here because it presents some important features concerning detection and estimation issues. There are simple methods for analysing signals in the TF domain, such as the short-time Fourier transform (STFT), but they do not exhibit the same resolution capabilities in the TF domain as does the WVD. In particular, since the STFT is based on an FT applied to a time-windowed version of the signal, with the window central instant varying with time, the frequency resolution is inversely proportional to the window duration. The narrower the window, the better is the time resolution, but the worse is the frequency resolution and vice versa. Conversely, the WVD does not suffer from this shortcoming. The WVD provides a higher concentration of signal energy in the TF plane, around the curve of signal instantaneous frequency (IF). This allows a better estimation of IF in the presence of noise and this information is exploited for the synthesis of the long aperture with respect to a moving object. On the other hand, the WVD poses other problems since it is not a linear transformation. This causes the appearance of undesired cross-products when more than one signal is present. Mapping of the received signal in the TF plane provides a tool for the synthesis of the optimal receiver filter without a priori knowledge of the useful signal, provided that the signal-to-noise ratio is sufficiently large. The TF representation provides a unique tool for exploiting one of the most relevant differences between useful signals and disturbances in the imaging of small moving objects, namely the instantaneous frequency and the bandwidth. In fact, it can be shown that, although the bandwidth occupied by a target echo during the observation interval necessary to form the synthetic aperture mainly depends on radar-object motion, the instantaneous bandwidth is proportional to object size. Therefore, the echo corresponding to a small target can occupy a large band during the overall observation time, but its instantaneous bandwidth is considerably narrower (i.e. the echo backscattered by a point-like target has zero instantaneous bandwidth but it may exhibit a large overall bandwidth). Conversely, the echo from the background and the receiver noise have a large instantaneous bandwidth. Therefore, even if the useful signal and the disturbance may have a large total band, the possibility of tracking the instantaneous bandwidth,
made available by the TF representation, allows a discrimination of the useful signal from the disturbance not possible by conventional processing. Another important and unique advantage related to use of the WVD is that it allows the recovery of the echo phase history even in the case of undersampling, as shown in Reference 31. This is particularly important in SAR applications since it allows us to work with a PRF lower than the limits imposed by the signal bandwidth occupied during the observation interval. Owing to the target motion, this bandwidth may be considerably larger than the bandwidth occupied by the background echo. According to conventional processing, we should then use a correspondingly higher PRF. Conversely, if the useful signal has a large total bandwidth, but a narrow instantaneous bandwidth, the TF representation prevents superposition of spectrum replicas created by undersampling because, even if the replicas occupy the same bandwidth, they occur at different times. This property allows us to recover the desired information even from undersampled signals. Since the PRF value imposes a limit on the size of the monitored area, due to time - and then range - ambiguities, the possibility of using a low PRF prevents the reduction of the region to be imaged, as well as an increase of the data rate. The WVD of a signal is defined as: (3.34)
where s(t) represents the analytic signal. The estimation of the instantaneous frequency of the signal is done as follows. Express the signal in terms of its envelope and phase: s(t)=a(t)exp{j
(3.35)
It can be shown that the local or mean conditional frequency of the WVD distribution, defined as: (3.36) is equal to the signal IF: (3.37) The estimate of the mean conditional frequency of the WVD then provides the information about the signal IF. This is exactly the information we need for rephasing the received signal in order to produce a focused image. We will explain now the rationale for using the WVD to detect a moving target and estimate its parameters with SAR data. The received signal is the sum of echo from the moving object, whose phase modulation is unknown, plus clutter (the echo from fixed background), plus noise. The echoes from the fixed scene represent, in our case, a disturbance. The aim is to detect the presence of moving objects and to estimate their motion parameters. The detection and parameter estimation performance depend on the signal-to-clutter power ratio (SCR). If this ratio is small, we have to first process the received signal in order to improve it as much as possible.
This operation can be carried out by matched filtering. However, the matched filter can be defined only if the shape of the useful signal is known, and this is not the case. It turns out that the two operations, detection and parameter estimation, cannot be separated: estimation of the useful signal parameters can be carried out only after having detected the presence of a useful signal; a reliable detection, on the other hand, requires the knowledge of the signal parameters, in order to carry out a proper matched filtering, before the detection itself. It is then necessary to carry on these two kinds of operation contemporaneously. The time-frequency analysis of the received signal, in particular the WVD, provides a powerful tool for achieving the aforementioned requirements and extracting the desired information, namely the energy and the phase history. These two pieces of information are what we need for our purposes: the detection of the presence of a moving target is made by comparing the energy with a suitable threshold; the instantaneous phase is used to phase-compensate the received signal for a correct coherent integration, necessary to imaging purposes. As regards the effect of disturbances in the received signal, it is useful to recall that matched filtering can always be performed by cascading: a clutter cancellation followed by a coherent integration. The first operation does not require knowledge of the useful signal shape, whereas the second operation does. In particular, a correct coherent integration requires knowledge of the signal phase history. The time-frequency analysis aims to facilitate extraction of the signal phase history. Therefore, the sequence of operations to be performed on the received signals is the following: a clutter cancellation is performed first and the output of the cancellation filter is analysed in the TF domain. If the clutter has been reduced to a power level sufficiently smaller than the useful signal, the parameters estimated by the WVD are correct, within an error depending on the achieved SCR. These parameters provide the information necessary to set up a correct coherent integration. This rather intuitive reasoning for using the WVD in detection and estimation problems is formalised into the framework of the matched filter theory in Reference 27; here we give a summary of the main results concerning the application to SAR problems. The signal received by an SAR is given by the sum of the echoes from the ground, the echoes from a possible moving object and receiver thermal noise. The echo from the ground can be modelled as a correlated random process, whose power spectral density is proportional to the antenna power radiation pattern. The echo from a moving object is characterised by an unknown modulation, induced by the relative motion between the radar and the object. In SAR imaging, we are interested in the phase modulation induced by the relative radar-target motion. If the transmitted signal is: p(t)=a(t)Qxp(j27tf0t)
(3.38)
where a(t) is the analytic signal and /o is the carrier frequency, the echo received by a point-like scatterer at a distance r(t) from the radar is proportional to: (3.39) During the observation interval, the amplitude of each backscattering coefficient can be assumed constant since the aspect angle does not vary by an amount such as
to justify a change in the reflectivity characteristics (this assumption underlies all SAR signal processing). The only modulation of interest then is phase modulation. In the formation of the synthetic aperture, we are interested in the last term of the equation (3.39). The conventional techniques for autofocusing extended scenes compensate only linear and quadratic phase shifts. This means that only slow fluctuations (with respect to the integration time) are compensated. Conversely, the TF approach allows estimation and compensation of any kind of phase history. The distance r(t) can be modelled as the sum of slow and rapid fluctuations, with respect to the observation interval. Slow variations can be expressed as a low-order polynomial, whereas fast variations follow a sinusoidal behaviour, whose period is shorter than the time observation interval. The change in the distance causes a phase modulation, which must be compensated for to obtain a correct image. The variation of distance can cause migration of the received echo over different range cells, depending on the ratio between the amount of variation and the range resolution. The range migration problem can be faced according to the double resolution strategy outlined in the Section 3.4.5.2. According to the strategy, all the phase estimations are carried out on data whose range resolution is such as to consider the migration negligible. According to the formulation of matched filtering in terms of the WVD, the processing scheme for detecting and estimating the parameters of the echoes from moving objects is sketched in Figure 3.17. The received signal is first range compressed. Then it is processed by an MTI filter to reduce the clutter contribution; in Section 3.4.5.2 we explain how to use STAR The analysis of the signal in the TF domain allows estimation of
MTI
estimate instantaneous frequency
WVD
range compressed data
integration
generate reference signal
FFT
modulator
display
detection threshold
Figure 3.17
Processing scheme which uses the Wigner—Ville distribution
the signal instantaneous frequency. Some smoothing can be applied on the WVD to improve the estimation accuracy. The instantaneous frequency is then integrated to obtain the phase modulation to be used for phase compensation of the received signal. At this point an FFT is sufficient to provide the high cross-range resolution image. A threshold is then applied to the envelope signal to check for detection. An example of an image relative to a moving point-like target superimposed on a fixed extended scene is reported in Reference 27; the simulation results convey a positive feeling on the performance of the processing scheme of Figure 3.17. 3.4.5.2 Joint space-time-frequency analysis Space-time and time-frequency processing can be combined to form a processor which allows both the cancellation of the clutter echoes and the compensation of the target motion, necessary for the formation of a high cross-range resolution image. The overall space-time-frequency processing is shown in Figure 3.18. The echo from the ground is cancelled by means of space-time processing. Having cancelled the background echo, we proceed to the estimation of the target instantaneous frequency which is done by resorting to the WVD. Before proceeding to the estimation, however, some care must be devoted to the range migration problem. In fact, the relative radartarget motion causes not only a phase shift, but also a range migration which cannot be neglected if it overcomes the range resolution. The radar echoes are sampled and arranged in a matrix, whose columns are relative to successive transmitted pulses and the entries of each column contain the echoes coming from different distances. Given a certain point, if its distance (during the time on target) from the radar does not vary by an amount bigger than the size of a range resolution cell, the echoes from that point are all stored on one column. If all the points composing the observed scene satisfy this condition, no range migration occurs and the samples can be first processed in range,
adaptive space-time processor on range compressed data
corner turning
compute Wigner-Ville distribution estimate instantaneous frequency
FFT
generate reference signal
integration
Figure 3.18
Processing scheme of joint space-time-frequency
display
column by column, and then in cross range, row by row. If, however, the echoes from some point occupy more than one row, cross-range processing cannot be performed directly on the collected data, row by row. Some algorithm for compensating for range migration must be applied in order to realign the data, before cross-range processing. Range migration compensation techniques have already been examined in the literature in the imaging of stationary scenes. The problem is harder in the imaging of moving targets, however, since the radar-target distance is not known. Two main problems arise when dealing with moving targets, in presence of range migration: the migration causes a spreading of the target energy through many range cells, therefore the signal-to-noise ratio, for each range cell, decreases and this causes a loss of detectability, and due to the lack of knowledge of the target range migration law, it is not known a priori which samples of the matrix of the collected data are to be taken for carrying out the estimation of the phase history. Here we follow a two-step approach for compensating for the migration problem: a coarse range resolution, and a fine range resolution analysis (see Figure 3.19). The rationale of the approach is based on the fact that the migration problem can be neglected if the amount of migration does not exceed the range resolution. Therefore, we can initially work in a low-resolution mode, where the range resolution has been degraded by an amount such as to make the migration negligible. A first detection is performed on these data: if a detection occurs, the processing chain for the estimation of the target echo phase history is enabled. The estimation is carried out in the time-frequency domain, according to the criterion described previously. Having once estimated the phase 0(0, we can evaluate the law of variation of the distance by exploiting the relationship between phase shift and distance:
(3.40)
high-resolution channel
fine range alignment
compute distance
low-resolution range compression
full resolution compression
generate compensation signal
estimate instantaneous phase
Low-resolution channel
Figure 3.19
Two-step approach to account for range migration problem
By using d(t), we can compensate for the range migration by properly rearranging the data in the matrix containing the received samples. These data can now be phase shifted to compensate for the phase shift induced by the target motion. The data then undergo a full range and cross-range compression for obtaining a high-resolution image. The price paid by adopting the two-step resolution approach is the SNR loss inevitably related to the degradation of the range resolution in the coarse resolution mode. It is important to point out that, even if the cause of both phase shift and migration problems is the same, namely the variation of the radar-target distance, the phase shift is much more sensitive to range variation than to range migration. In fact, for example, in an X-band radar (X = 3 cm), having a range resolution of 1 m, an error on the distance of 7.5 mm causes a phase shift of n radians, while the corresponding range migration is absolutely irrelevant. This means that, having once estimated the phase history by the described procedure, the resulting value for the distance variation d(t) is estimated with a good accuracy. The performance of the space-time-frequency processing scheme has been evaluated by a simulation program which generates the echoes from an extended surface and from a point-like target and then applies the proposed algorithm [9]. The target has been supposed moving on the terrain (shadowing effects have been neglected) at a constant velocity, in a direction oblique with respect to the radar motion. The velocity parameters have been chosen to make evident the presence of range migration and of cross-range smearing of the target image. The ground reflectivity has been assumed equal to the target reflectivity (this is a quite pessimistic assumption, because in many cases of practical interest the target reflectivity is higher). The thermal noise, 4OdB below the target return, has also been assumed for the received signal. The ground echo is first cancelled, by using a STAP with a two-element antenna and two time samples. The two antennas are spaced by d = vT. An SAR image is then formed by conventional techniques. The result is shown in Figure 3.20 (from Figure 14 of
* * * *
range
Figure 3.20
Image of moving target, after clutter cancellation (reproduced with permission from Figure 14 of Reference 9)
range
Figure 3.21 Image of moving target, after range migration andphase compensation (reproduced with permission from Figure 15 of Reference 9) Reference 9). The smearing of the moving point-like target is evident. Given the motion parameters, the target has migrated over six range cells. This is the result of the broadening of the target image even in range, as well as in cross range. Anyway, the target echo causes a detection and initialises the motion estimation channel. The high resolution data are smoothed in range to decrease the range resolution. Then the processor looks for the range cell with the maximum energy content and computes the WVD of that cell only. The frequency history, and then the phase history, are evaluated, according to the procedure outlined previously. The phase history is then used for compensating for the range migration and the phase shift on the high-resolution range data. The final image is shown in Figure 3.21 (from Figure 15 of Reference 9). The sharpening of the target image is quite evident. The method above corrects only the (translation) motion of a point-like target. If a focused image of a three-dimensional object is required, for recognition purposes, the motion parameters must be estimated to compensate correctly for the relative signals [32, 33]. Since the motion of any rigid body can be decomposed as the sum of the translation of a point plus the rotation of the other points around that point, in some cases, depending on the target dimension and kind of motion, compensation for only the translation motion is not sufficient and we still have to compensate for the rotational motion. This second compensation can be efficiently performed in the frequency domain, by applying, for example, the method proposed in Reference 32. Algorithms have been conceived to produce fine resolution images of moving targets having any translation or rotation motions. They require the presence of multiple prominent points in the target image. The echo from a first point is initially analysed. Its phase is computed and subtracted pulse-to-pulse from the phase of the incoming signal. This operation removes the effect of the target translation motion and makes this first point effectively the new centre of the scene. At this point, if the rotational motion is negligible, conventional SAR processing yields the focused image. If,
conversely, the rotation cannot be neglected, two other prominent points are required to estimate the rotation parameters. These parameters are used to compensate for the rotational motion in the frequency domain and to help in applying the SAR processing correctly. The algorithm requires that the prominent points be separable and that their phases be estimated without interfering with each other. In some cases, however, this assumption might not be met: at high resolution, separability is more likely to occur, but the range migration could complicate the phase estimation problem; at low resolution, the range migration could be negligible, but it is more likely that some prominent points might occupy the same range resolution cell. Further analyses are necessary in the imaging of extended targets when more dominant scatterers occupy the same resolution cell. In this case, in fact, the bilinearity of the WVD creates undesired cross-product terms which can seriously impair the estimation of the instantaneous frequency. A reduction of these undesired terms could be achieved by resorting to other time-frequency representations, such as the Choi-Williams distribution [34]. Another approach to reduce the appearance of undesired crossproduct terms is described in Reference 35. It is shown that it is still possible to estimate the signal parameters, even for multicomponent chirp signals embedded in noise, by combining WVD with the Hough transform.
3.4.6
Optimum processing for MSAR
This section focuses on the optimum MSAR detection of slowly moving targets (SMT), whose Doppler frequency is well inside the clutter spectrum, by means of space-time processing [4, Chapters 4 and H]. Clutter cancellation is easily performed by space-time processing assuming the idealised condition of DPCA condition (PRF matched to the ratio between antenna displacement and platform speed) and absence of the intrinsic temporal decorrelation of the clutter echoes, known as internal clutter motion (ICM). However, it is well known that the cancellation can be highly affected by the presence of ICM which has the strongest impact on the detection of the SMT, where the MSAR provides a large performance improvement over conventional multichannel systems. Moreover, the exact matching of DPCA condition is difficult to achieve in practice. Therefore, to obtain a significant analysis of its effectiveness we consider separately two cases of mismatch from the idealised conditions: (i) the presence of ICM and (ii) the velocity mismatch. A complete analysis is derived for a special case [10,13]. A space-time exponential model is considered for the clutter covariance matrix, related to a Cauchy antenna pattern (i.e. antenna gain G(O) = Go • (O2 + 0J))"1, 0 being the azimuth angle, #o a parameter that controls the beamwidth and Go a suitable coefficient to encode the antenna gain) which yields a double exponential Doppler spectrum for the fixed clutter and a single pole description of the internal motion process. The simplified model allows closed form expressions to be identified, both for optimum detector and detection performance. In the following, we first consider the different possibilities for ordering the long sequence of MSAR echoes, then we discuss the special structure of the clutter covariance matrix in the two above mentioned mismatch conditions. The special structure
allows us to obtain a formal expression for the space-time cancellation filter and its properties, and finally to discuss its performance. 3.4.6.1 Echoes ordering strategy Initially, we assume, as in Reference 36, that the DPCA condition is verified. With reference to the slant plane geometry in Figure 3.22, and assuming that: (i) the effect of the antenna pattern modulation on the target echoes in the observation time is negligible, and (ii) the far field approximation can be used for the antenna element displacements and the Fresnel approximation for the distance of the target from the transmitter, the target echo received by the A:th (k = 0 , . . . , K — I5) element at Xn = (n-(N - l)/2) PRT, n = 0 , . . . , N - 1 can be rewritten as:
(3.41) where Ao is the amplitude of the echo, O is a constant phase term and A the carrier wavelength. xq = xq/(vPRT) is the normalised target along-track position, fy = 2vrPRT/X is the normalised Doppler frequency due to the cross-track velocity component and sq = vq/v is the relative along-track velocity. Also, d — A/(vPRT) is the ratio between spatial and temporal sampling, being A the antenna element separation, and /x = 2(vPRT)2/(XRo) is the rate of the linear frequency sweep for
time
along-track spatial axis
Figure 3.22
Bidimensional space-time data collection plane (K = 4, N = 10)
a fixed target. The terms in the first line of equation (3.41) take into account the central target Doppler frequency and DoA. They are the usual model for space-time processing techniques based on a short integration time. The terms in the second and third lines are characteristic of the SAR and encode the time-varying Doppler and DoA observed in the long SAR integration time. Assuming that the clutter echoes can be modelled as a zero mean Gaussian random process, derived from a spatially independent Gaussian distributed scene reflectivity (see Section 3.3.2.1) with temporal correlation (ICM) pt(At), the correlation of the echoes received at times ^i, t2 by elements ^ i , Ski is: (3.42)
where Pc is the total clutter power. Since the NxK echoes arise from the space-time sampling, they are by their nature bidimensional, and have not any natural unidimensional order. This is illustrated in Figure 3.22, where the space-time location of the phase centres relative to the combination of transmitting and receiving antenna for each echo is sketched. As in Reference 36, the DPCA condition is assumed, namely PRT and platform speed are matched so that d = 2. It is useful to arrange these echoes in a vector, which is an ordered sequence of blocks s = [sj, sj9... ]T. Three strategies can be devised to build the blocks [11], keeping fixed a parameter, which correspond to the three directions in which the points in Figure 3.22 can be aligned: (A)
fixed-time blocks (Figure 3.23a):
(B)
fixed element blocks (Figure 3.23b):
(C) fixed spatial position blocks (Figure 3.23c):
The rotation matrices T ^ , Tca and Tcb are defined, to convert different ordering strategies: SB = T ^ S A , SC = TCflSA, Sc = T ^ S B - It is worth recalling that all the elements of the matrices above are zero except for one element equal to one in each row and column and have the properties T ^ T ^ = T ^ T ^ = I, T ^ T ^ = T^T Cfl = I, TcfcT^ = TlbTcb = I and Tcb = TcaTla, being I the NK x NK identity matrix (see also Reference 30).
Figure 3.23
Strategies for collecting and processing the SAR echoes
3.4.6.2 Covariance matrix properties: a special case for ICM An interesting condition, which allows us to study the detection performance and the interpretation of the optimum detection structure, is obtained under the hypothesis that the transmitting antenna pattern has a Cauchy shape [H]. Antenna sidelobes are not taken into account but this is not essential here, since targets competing with the mainlobe clutter are considered. Thus an exponential spatial correlation is obtained: ps(i) = p[l\ where ps is the correlation coefficient at a spatial distance equal to half the inter-element spacing. The worst case arises when the temporal correlation, which encodes the ICM, also has an exponential shape, pt(i) = p\ , being pt the one-lag correlation coefficient, since it corresponds to the maximum entropy spectrum. With reference to ordering strategy (A), the normalised covariance matrix is symmetric, due to the broadside antenna pointing and has a Toeplitz-block-Toeplitz structure:
(3.43)
where the K x K block P ; is given by:
(3.44)
For the interpretation of the optimum detector, this case is especially appealing, since the covariance matrix in equation (3.44) has the properties below. We define the KxK matrices H and H / :
(3.45)
which are related via a rotation around their centre, and are apparently not full rank. Under the hypotheses above, the blocks P1- can be written as the product of the ith power of one of the matrices times the first block Po as P; = FTPo = Po(H7*) . Thus RA = TA diagyyjPo,..., Po), where:
(3.46)
and diag^fPo,..., Po) is a block diagonal matrix, with TV blocks Po- It can be shown that FA can be factored as
(3.47)
where the blocks Q; are square matrices of order /, with exponentially decaying elements with one-lag correlation coefficient pt and U is a N x N upper triangular matrix, obtained from an exponential square matrix, with one-lag correlation coefficient ps, pt, nulling its lower diagonal part. From equations (3.46) and (3.47) and from the symmetry of RA we have R^ 1 = diag^fP^ 1 ,..., P^ 1 }T~^1
(3.48)
Moreover, using HH'H = H and H H H ; = H', it can be shown that the inverse matrix F^ 1 is block tridiagonal. Thus, R^ 1 is block tridiagonal and symmetric:
(3.49)
Moreover each of its blocks is itself a tridiagonal band matrix: Y = (1 — P^)P 0 1 + P^P-'CHH' + H'H); Z = P Q 1 H ; Y 0 = P^ 1 + p^1HH; % = Y~x + p , 2 P ^ H H ' . Moreover, the decomposition R^ = LeLg applies, where:
(3.50) and blocks
(3.51)
Interpretations of the optimum detector Assume that the data vector x (NK x 1) is ordered according to strategy (A). As well known, the optimum detector for the target vector s, known aside for amplitude and initial phase, against a Gaussian clutter with covariance matrix RA, is obtained comparing the modulus of s ^ R ^ x to a threshold. Since we focus on the detection of SMT, whose power spectrum competes with the main clutter return, thermal noise is neglected, assuming the ICM to be the dominant disturbance. Using the decomposition of the matrix RA, the following interpretations of the optimum filter s^R^ x
data in space only filter a rotation unidim. space-time filter b rotation time only filter c rotation match to signal
Figure 3.24
Decomposition of the optimum detector in a sequence of three uni-dimensional filters
yield [H]: (i)
The bidimensional space-time cancellation filter in the optimum detector, corresponding to R^ l , can be thought of as the cascade of three mono-dimensional filters along the three data ordering directions. This follows from the decomposition in equation (3.48) for R^ 1 , which can be implemented as depicted in Figure 3.24: first the matrix diag^fP^" 1 ,... ,P^"1} is applied to the data, ordered with strategy (A). It operates separately on its blocks, performing a spatial filtering only. The data are then rearranged by T ^ into ordering (B), where the filter diag^{U~ lr , I , . . . , I, U" 1 } along the diagonal of the space-time plane is performed. Thereafter, the filtered data are rotated to ordering (C), where the pure temporal filtering B d i a g A ^ + ^ _ i ( Q ^ 5 Q - 1 , . . . , Q " 1 , . . . , Q " 1 , . . . ,Q^ 1 ,Q^ 1 J is applied. Finally, the filtered data are rotated back to the initial ordering; this can be avoided by storing s with strategy (C). Thus, the three ordering strategies combine in the detector. It is instructive to observe that the filtering along the diagonal of the space-time plane affects only the data in the first and last antenna elements, corresponding to a border effect only. It is required to compensate for the absence of the adjacent element samples. This is interesting, since the selected model is a separable random Markov field, for which in theory it should be possible to split the filter into the cascade of two monodimensional filters. Unfortunately, the data are collected on an area of this plane, which is not aligned with the main correlation axes. Thus the physical sampling process induces in the optimum filter a coupling of the two directions even in this ideal case of decoupled exponential correlations, (ii) The filtering operated in R^ x is a shift invariant space-time FIR filter with nine taps, which uses the nine echoes collected at the nine most adjacent space and time positions, except for the border effects. This is a consequence of the block-tridiagonal structure of the R^ 1 , with blocks being three-element band matrices, as outlined in equation (3.49). Thus the inverse covariance matrix operates combining the data received at a given
time and element with only the data received one element and one PRT apart, as illustrated in Figure 3.23d (a). It is to be noted that the shift invariance is not available for the first and last blocks, where border effects must be compensated for. Moreover, the filtering is non-causal, since both preceding and following samples are used even though this is not an issue for the FIR characteristic, (iii) S77R^1X is obtained filtering the data with a shift invariant causal space-time filter with four taps, which uses only the data already received at the closest space-time sampling positions, matching them to an equally filtered reference signal. This is a similar interpretation to (ii), and derives from the structure of the decomposition in equations (3.50) and (3.51): s ^ R ^ x = (LgS)77LgX. This is illustrated in Figure 3.23d (fi). This one-lag space-time memory is obviously a consequence of the double exponential model, which can be regarded as being a worst case analysis. In fact, the exponential autocorrelation is the one characterised by the maximum entropy, once the one-lag decorrelation is constrained. Specifically, using only adjacent space-time samples prevents the whole length of the receiving array from being used to obtain a narrow null along the clutter ridge. The width of the null is instead controlled by the interelement spacing. The three interpretations are of high interest for the long sequence of SAR echoes, since the batch processing is replaced with a space-time FIR filter, with a major reduction in the computations. In the general correlation case the smallest rectangular region around each space-time sample is not a sufficient support for the optimum filter. 3.4.6.3 Covariance matrix properties: a special case for velocity mismatch Using the exponential spatial autocorrelation function, another case of departure from the ideal conditions can also be analysed in closed form: the presence of a mismatch between the radar platform velocity and the PRF, so that the DPCA condition is not any more exactly matched. We assume, for the sake of simplicity, the absence of internal clutter motion and a small mismatch. Under these conditions, the ordering strategies above can still be used with obvious extensions from the case of perfect DPCA. Assuming that the velocity mismatch is a fraction y of the theoretical velocity, we have for the actual one-lag correlation coefficient pse = ps • pe, where pe = pYs. Under the hypothesis that \K • y\ < 1, namely that the mismatch is smaller than the antenna spacing, using ordering strategy (C), the covariance matrix Rc takes the form:
(3.52)
time
along-track spatial axis
Figure 3.25
Bidimensional space-time data collection plane (K = 4, N = 10). SAR echoes selection strategy in the presence of velocity mismatch, v ^> 0
where the KxK
block M / is given by:
(3.53)
for y > 0 (see Figure 3.25) and by its transpose for y < 0 (see Figure 3.26). It is worth observing that the matrix M/ is not just obtained from the matrix Qh by replacing pt with pe. In particular, the elements of M/ can have modules larger than one, since the velocity error causes either reduction or increase to the ideal spatial correlation, depending on the positions of the elements. This was not the case in the presence of internal clutter motion, which was always decreasing the correlation between samples. The structure of the matrices in equations (3.52) and (3.53) is evident from the sketch of observation geometry and correlations in Figures 3.25 and 3.26. In the following we proceed with reference to the case of y > 0, which can be extended to the case y < 0. To get a decomposition of the covariance matrix similar to the one obtained for the internal clutter motion [10], we define the K x K
along-track spatial axis
D
Figure 3.26
Bidimensional space-time data collection plane (K = 4, N = 1O)SAR echoes selection strategy in the presence of velocity mismatch, y < 0
matrices He and Hfe:
(3.54)
which are related via a rotation around their centre; namely it yields the property H'e — &KHe&K, where $ „ is the (« x n)-dimensional rotation matrix:
(3.55)
Thus the ith block can be rewritten as M1- = pleHeVe = p^VeH^ = pe(UfeVe)T, being Ve a K x K exponential matrix with one-lag correlation coefficient pe. Thus the covariance matrix can be decomposed as Rc = Fc • diag^{P^,..., Ve}, and:
(3.56)
The special structure of the matrix Fc yields a closed-form inverse as a block tridiagonal matrix. By setting be = pse/{\ — p2sepj ), the inverse covariance matrix R^ 1 is also block tridiagonal:
(3.57)
where Ye0 = V;1+bepsep-K+l?JlUe*K;Y'eo = Vjl+bepsepJK+lVjl Ufe
K = 1 l K+l 1 1 *KYeo*K; Ze = PJ H*; \ e = V~ + bePsepJ (VJ H6 + VJ H.'e)*K are themselves tridiagonal matrices. Moreover, a triangular block decomposition applies, similar to equations (3.50) and (3.51). These properties of the covariance matrix Rc with ordering strategy (C) - obtained in the case of DPC A velocity mismatch, but in the absence of ICM - yield an optimum detector with very similar interpretations to the case of covariance matrix RA with ordering strategy (A) - obtained in the case of matched DPCA speed and presence of ICM: (i)
The bidimensional space-time cancellation filter in the optimum detector, corresponding to R^ 1 , can be thought of as the cascade of three mono-dimensional filters along the three data ordering directions. (ii) The filtering operated in R^ x is a shift invariant space-time FIR filter with nine taps, which uses the nine echoes collected at the nine most adjacent space and time positions, except for the border effects. (iii) s^R^ x is obtained filtering the data with a shift invariant causal space-time filter with four taps, which uses only the data already received at the closest space-time sampling positions, and matching them to an equally filtered reference signal. Unfortunately, it is to be noticed also that the structure of the clutter covariance matrix does not maintain a similar pattern when both internal clutter motion and velocity mismatch are present. In particular, its inverse is no longer a block-tridiagonal matrix. Therefore these exact closed-form interpretations do not strictly apply. However, a
suboptimum detection scheme can be obtained by extending such a support region, so as to maintain the FIR characteristic, while achieving detection performance close to the optimum. It is interesting to compare the scheme above with those derived in References 7 and 8, which have been the first to the author's best knowledge to introduce FIR filters for SAR detection. Those filters are similar to the one in (iii). The main difference is that in References 7 and 8 the filter LB is derived by using the criterion of the minimum mean square error for clutter cancellation, whereas here it is derived, for a special case, from a maximisation of the detection probability. This study of a special case allows the interpretation of the optimum detector in closed form. This shows that the scheme of References 7 and 8 has the same structure as that of the optimum filter. It is also important to recall that for long integration times (large covariance matrix), the DFT matrix gets very close to the matrix of the eigenvectors, of the clutter covariance matrix, so that the clutter sample at the different frequencies tends to have a low correlation. In this case, the Doppler processing is also very close to the optimum filter, so that both classes of STAP techniques for the MSAR case: (i) time domain reduced DoF (degrees of freedom) adaptive two-dimensional FIR filters (ii) frequency domain reduced DoF adaptive processing, can be interpreted as low-cost approximations of the optimum STAP filter for MSAR. 3.4.6.4 Performance analysis Thanks to the simple form of the inverse covariance matrix the signal-to-clutter ratio of the optimum filter SCR = S^R -1 S can be evaluated in closed form. The analysis is shown with reference to the case of the presence of ICM [H]. For simplicity, only the condition with xq = sq = O (see Figure 3.2) is considered, this is a broadside target with no along-track velocity component:
(3.58) where /3= sin(n\iN)/(N sin(nfi)) and it has been considered that sm(nfjiK)/ (K sin(niJi)) & 1, assuming N ^$> K. The SCR is shown as a function of the normalised Doppler frequency fy in Figure 3.27 for different integration times corresponding to N = 128, 256 and 512 samples, K = A and setting AQ/PC = 1. Three temporal correlation coefficients pt = 0.9,0.99 and 0.999 and a fixed ps = 0.5 are considered. The figure shows that pt has a strong influence on the SCR, which grows with it at almost all Doppler frequencies. However, assuming equal total clutter power, an opposite behaviour appears for the stationary targets, since some clutter power spreads away from the diagonal of the space-time plane and a fixed target
SCR, dB
Figure 3.27
Optimum SCR as a function of fN: (*)p, = 0.999, (+)p r = 0.99, (o) pt = 0.9, ( )N = 512, ( - - -)N = 256, {....)N = 128
very close to this diagonal competes with a reduced clutter power. It also appears, from both equation (3.58) and Figure 3.27, that the SCR grows with the integration time N. In fact, only one adjacent temporal and spatial sample is necessary to whiten the clutter spectrum, as required by the optimum detector. The whitening implies also a partial target cancellation, which strongly affects the performance especially in the SMT region (see the sinusoidal term of equation (3.58)). The longer sample sequence of the SAR allows the coherent integration to gain in SCR over the whitened clutter. This extra gain can be essential in the region of the SMT, where the SCR is low, to permit detection. As a final consideration, it is interesting to observe that both the scheme of References 7 and 8 and the scheme of Reference 12 share the same structure for the optimum filter in this special case, and have in general also very similar detection performance and are therefore attractive for the practical implementation.
3.5
Conclusions
In this chapter processing techniques have been described to combine the SAR and STAP functions; the goal is to obtain a well focused target image in the right place on the image of the stationary scene. A taxonomy of different processing schemes has been presented, with different characteristics in terms of computational cost,
detection performance and operational limitations. Among the presented techniques, some are recognised to be better suited to providing a proof of concept (DPCA, ATI-SAR), while others can be only exploited for interpretation purposes (optimum filter). However, some approaches are recognised as being appropriate for practical use. Two classes belong to this group of techniques: (i) the cascade of a space-time FIR filter and a two-dimensional filter bank detector (space-time-frequency approach and approximation of optimum filter), and (ii) the processing in a confined Doppler plane (Doppler processing and MSAR JDL). As considered in detail in Section 3.4.6, these two schemes share the same structure as the optimum filter in special cases, and have in general also very similar detection performance, resulting therefore in being attractive for the practical implementation. When imaging is a main issue the use of time-frequency analysis combined with the STAP techniques appears to be applicable to the case of a very general target motion trajectory and to give good detection and imaging performance.
3.6
Acknowledgments
Moving target detection and imaging with SAR has been the subject of R&D work for a number of years in Selenia/Alenia (now AMS). The authors acknowledge with thanks the cooperation of a number of colleagues, notably Professor S. Barbarossa (today with the University of Rome 'La Sapienza') and Dr. E. D'Addio. More recently Dr. A. Gabrielli (AMS) has cooperated in the development of ATI-SAR work.
References 1 AUSHERMAN, D., KOZMA, A., WALKER, J., JONES, H., and POGGIO, E.: 'Developments in radar imaging', IEEE Trans. Aerosp. Electron. Syst., July 1984, AES-20, pp. 363-400 2 D'ADDIO, E., Di BISCEGLIE, M., and BOTTALICO, S.: 'Detection of moving objects with airborne SAR', Signal Process., 1994, 36, pp. 149-162 3 WARD, J.: 'Space-time adaptive processing for airborne radar'. MIT Lincoln Laboratory, technical report TR-1015, December 13, 1994 4 KLEMM, R.: 'Principles of space-time adaptive processing' (IEE Publishers, London, 2002, 2nd edn.) 5 KLEMM, R.: 'Introduction to space-time adaptive processing', Electron. Commun. Eng. J., February 1999,11, (1), pp. 5-13 6 FRIEDLANDER, B. and PORAT, B.: 'VSAR - a high resolution radar system for detection of moving targets', IEE Proc, Radar Sonar Navig., August 1997, 144, (4), pp. 205-218 7 KLEMM, R. and ENDER, J.: 'Multidimensional digital filters for moving sensor arrays'. Proc. IASTED on signal processing and original filtering, June 1990, Lugano, Switzerland, pp. 9-12
8 KLEMM, R. and ENDER, J.: 'Two-dimensional filters for radar and sonar applications'. Signal Processing V EUSIPCO, September 1990, Barcelona, Elsevier Science Publisher, pp. 2023-2026 9 BARBAROSSA, S. and FARINA, A.: 'Space-time-frequency processing of synthetic aperture radar signals', IEEE Trans. Aerosp. Electron. Syst, April 1994, AES-30, (2), pp. 341-358 10 LOMBARDO, P.: 'DPCA processing for SAR moving target detection in the presence of internal clutter motion and velocity mismatch'. Microwave sensing and synthetic aperture radar, EUROPTO series, 2958, September 1996, Taormina, Italy, pp. 50-61 11 LOMBARDO, P.: 'Optimum multichannel SAR detection of slowly moving targets in the presence of internal clutter motion'. CIE-ICR '96, international Radar conference, Beijing, China, 8-11 October, 1996, pp. 321-325 12 ENDER, J. H. G.: 'Space-time processing for multichannel synthetic aperture radar', Electron. Commun. Eng. J. (Special issue on STAP), February 1999 11, (1), pp. 29-40 13 LOMBARDO, P.: 'Echoes covariance modelling for SAR along-track interferometry', IEEE international symposium IGARSS '96, Lincoln, Nebraska, USA, May 1996, pp. 347-349 14 LOMBARDO, P.: 'Estimation of target motion parameters from dual-channel SAR echoes via Time-Frequency analysis'. IEEE national Radar conference, NatRad'97, Syracuse, NY, May 1997, pp. 13-18 15 LOMBARDO, P.: 'A joint domain localized processor for SAR target detection' . Presented at European conference on Synthetic aperture radar, EUSAR'98, Friedrichshafen, Germany, 25-27 May 1998, pp. 263-266 16 WANG, H. and CAI, L.: 'On adaptive spatial-temporal processing for airborne surveillance radar systems', IEEE Trans. Aerosp. Electron. Syst, July 1994, 30, (3), pp. 660-670 17 'IEEE AESS 1991 award to F. R. DICKEY, F. M. STAUDAHER, and M. LABITT', IEEE Aerosp. Electron. Syst. Mag., 32, May 1991, p. 32 18 SHNITKIN, H.: 'Joint STARS phased-array radar antenna', IEEE Aerosp. Electron. Syst. Mag., October 1994, pp. 3 4 ^ 1 19 COE, D. J. and WHITE, R. G.: 'Experimental moving target detection results from a three-beam airborne SAR'. Eusar 96, Konigswinter, Germany, 1996, pp. 419-422 20 GIERULL, C : 'Moving target detection with along-track SAR interferometry'. DREO Technical Report 2002-000, 09 January 2002 21 LEE, J. S., HOPPEL, K. W., MANGO, S. A., and MILLER, A. R.: 'Intensity and phase statistics of multilook polarimatric and interferometric SAR imagery', IEEE Trans. Geosci. Remote Sens., 1994, 5, pp. 1017-1028 22 PASCAZIO, V. and SCHIRINZI, G.: 'Estimation of terrain elevation by multifrequency interferometric wide band SAR data', IEEE Signal Process. Lett., 2001, 8, pp. 7-9 23 PASCAZIO, V., SCHIRINZI, G., and FARINA, A.: 'Along track interferometry by one bit coded SAR signals', in POSA, F. and GVERRIERS, L. (eds): SAR Image Analysis, Modeling and Techniques III, SPIE, 2000, 4173, pp. 259-266
24 PASCAZIO, V., SCHIRINZI, G., and FARINA, A.: 'Moving target detection by along track interferometry'. Proceedings of IGARSS 2001, 1 Sidney, Australia, July 2001, pp. 3024-3026 25 BARBAROSSA, S. and FARINA, A.: 'A novel procedure for detection and focusing moving objects with SAR based on the Wigner-Ville distribution'. Proceedings IEEE international Radar conference, Arlington, VA, May 7-10, 1990, pp. 44-50 26 BARBAROSSA, S. and FARINA, A.: 'Detection and imaging of moving objects with SAR by a joint space-time-frequency processing'. Proceedings of the Chinese international conference on Radar, Beijing, China, October 22-24,1991, pp. 307-311 27 BARBAROSSA, S. and FARINA, A.: 'Detection and imaging of moving objects with synthetic aperture radar. Part 2: Joint time-frequency analysis by Wigner-Ville distribution', IEE Proc. F, Radar Signal Process., 1992, 139, (1), pp. 89-97 28 FARINA, A. and TIMMONERI, L. 'Space-time processing for AEW radar'. Proceedings of IEE international conference on Radar 92, Brighton, UK, 12-13 October 1992, pp. 312-315 29 LOMBARDO, P. and FARINA, A.: 'Dual antenna baseline optimisation for SAR detection of moving targets'. ICSP - international conference on Signal processing, Beijing, China, 14-18 October 1996, pp. 431^34 30 LOMBARDO, P.: 'Data selection strategies for radar space time adaptive processing'. Presented at IEEE Radar conference, Radar'98, Dallas, Texas, USA, May 12-13, 1998, pp. 201-206 31 BARBAROSSA, S.: 'Parameter estimation of undersampled signals by WignerVille analysis'. IEEE international conference on Acoustics, speech and signal Processing, ICASSP '91, Toronto, May, 1991, pp. 3253-3256 32 WERNESS, S., CARRARA, W., JOYCE, L., and FRANCZAK, D.: 'Moving target imaging algorithm for SAR data', IEEE Trans. Aerosp. Electron. SySt., 1990, AES-26, (1), pp. 57-67 33 FIENUP, J. R.: 'Detection of moving targets in SAR imagery by focusing', IEEE Trans. Aerosp. Electron. SySt9 July 2001 AES-37, (3), pp. 794-809 34 COHEN, L.: 'Time-frequency distributions - a review', Proc. IEEE, 1989 77, (7), pp. 941-981 35 BARBAROSSA, S. and ZANALDA, A.: 'A combined Wigner-Ville and Hough transform for cross terms suppression and optimal detection and parameter estimation'. Proceedings of the IEEE international conference on Acoustics, speech and signal processing, ICASSP '92, March 1992, San Francisco, pp. 173-176 36 BARILE, E. C , FANTE, R. L., and TORRES, J. A.: 'Some limitations on the effectiveness of airborne adaptive radar', IEEE Trans. Aerosp. Electron. Syst, October 1992, AES-28, (4), pp. 1025-1032
Chapter 4
EA-STAP: an efficient, affordable approach for clutter suppression Hong Wang, Richard A. Schneible, Russell D. Brown and Yuhong Zhang
We now focus on adaptive clutter suppression with sum (E) beam and difference (A) beams. It is assumed for this chapter that spatial adaptive presuppression of jammers has been applied as necessary prior to STAR The most well known applications of A-beams are monopulse tracking [1] and platform motion compensation [2], with the latter being also called the EAimplementation of displaced phase centre antennas (DPCA). Griffiths [3] uses E A-beams for wideband adaptive beamforming of non-pulsed systems. Brown et al. [4, 5] propose STAP with E A-beams for clutter suppression of airborne surveillance systems, which is called EA-STAP. The most important feature of E A-STAP stems from the fact that antenna engineers have excelled in the design of low-sidelobe E A-beams, whether it is for a phased array or reflector-feed antenna. In fact, analogue beamforming techniques for E A-beams are so well developed that the more expensive digital beamforming method for E A-beams does not seem necessary. In other words, STAP with E A-beams may well be an affordable approach to high-performance airborne surveillance radars.
4.1
Definition of the difference (A) beams
Consider a planar aperture of either a phased array or a reflector-feed antenna which has, in addition to the sum-beam channel, at least one more receiver channel. Let G(0a, 0e) be the response pattern of the E-beam with 0a and 0e being the azimuth and elevation angles, respectively. Let F(0a, 0e) be the response pattern of the second channel. For every given steering angle (Oao,Oeo) of the E-beam, we assume: G(OaO9Oe0) = maxG(0a,0e)
(4.1)
If correspondingly we have for any 6e: F(9a0,ee) = 0
(4.2)
the second channel is said to produce an azimuth difference beam to be denoted as Aa. Similarly, an elevation difference beam, Ae, is defined by: F(O09OeO) = O
(4.3)
and an azimuth-elevation difference beam, Aae, by: F(OaO9Se) = O and
F(O09Oe0) = 0
(4.4)
The above definition is much idealised and simplified only to preserve the most basic aspect of the A-beams for signal processing oriented readers. More general descriptions can be found in antenna books such as Reference 6. Figure 4.1 shows an example of Aa-9 Ae- and Afle-beams along with the E-beam, where the steering (E-beam pointing) angle is assumed at (Oao9 6eo) = (O90). Usually, all three A-beams can be obtained simultaneously from a single aperture, resulting in a system with four receiver channels.
beam norm, gain, dB
norm, gain, dB
beam
I Figure 4.1
A ae -beam norm, gain, dB
Ae-beam PQ
An example of E- and A-beams of a planar array with tapering: H-azimuth 35 dB Taylor, Y-elevation of 25 dB Taylor, A-azimuth of 30 dB Bayliss, and A-elevation of 25 dB Bayliss
Z-beam A-beam Taylor(35)/Bayliss(30) normalised antenna pattern, dB
Nc=l6
sin 6
Figure 4.2
An example of null-peak aligned H A-beams with tapering: 35 dB Taylor for E and 3OdB Baylissfor A
Figures 4.2 and 4.3 are two examples of different A^-beams with their respective E-beams, where the idealised planar array is assumed to have fixed column combiners and thus does not have Ae- or Aae-beams. The purpose of these two Figures is to show the null alignment of the E- and A-beams of different tapering without any array error, which affects the performance of EA-STAP.
4.2
XA-STAP algorithms
Figure 4.4 illustrates the block diagram of general EA-STAP where the dimension of the processor's spatial DoF, Nps, is determined by the number of A-beams. Under the assumption of no redundant A-beams, Nps is equal to the number of A-beams. Much of this block diagram follows the general STAP configuration. Let Xs, Nt x 1, be the E-channel data of a range cell before temporal DoF reduction, where Nt is the number of the pulses in the CPI. Let XA, NpsNt x 1, be the stacked A-channel data of the same range cell before temporal DoF reduction. We can express the data after DoF reduction as: (4.5) (4.6)
normalised antenna pattern, dB
Z-beam A-beam Hanning/full-cycle sine Nc=16
Figure 4.3 An example of null-aligned T^A-beams with tapering: Hanningfor E and full-cycle sine for A where T, Nt x (Npt +1), is the temporal DoF reduction matrix, and Npt represents the processor's temporal DoF whose specification is determined by the clutter suppression need and the available sample support, and I(^W is the Nps by Nps identity matrix. The estimated correlation matrix can be written as: (4.7) where (4.8)
(4.9)
(4.10)
(4.11) with y-£,k, yAk, k = 1,2,3,..
.,K being the selected/conditioned samples.
From phased array beamformer or reflector antenna with X-A feed system Z (sum) channel
A (difference) channel(s)
receiver and A/D
receiver and A/D
temporal DoF reduction
temporal DoF reduction
joint space-time domain adaptive processing with Nps=no. of A-channels and Npt = the required and supportable temporal DoF
clutter suppressed outputs at all Doppler bins and all range cells for further processing
Figure 4.4
Block diagram of general E A-STAP where Nps (Npt) is the processor s spatial (temporal) DoF
Let S/, Nt x 1, be the temporal steering vector of a chosen Doppler bin, and denote: (4.12) (4.13) and (4.14) For EA-STAP, the test statistic is computed using modified sample matrix inversion (MSMI) or generalised likelihood ratio (GLR). The MSMI and GLR tests are, respectively: (4.15)
and (4.16) where the filtering weight vector, (Nps + l)(Npt + 1) x 1, is given by: (4.17) We note that equations (4.15) and (4.16) differ from the general MSMI and GLR expressions only because of the deep central nulls of the A-beams. As an example we now consider a typical S A-STAP algorithm of Figure 4.5 where a single A-beam is employed and the temporal DoF reduction is the Doppler-domain localised processing approach (DDL). Under the assumption of no zero padding and a uniform pulse train, the matrix T in equations (4.5) and (4.6) is just the standard DFT From phased array beamformer or reflector antenna with Ir-A feed system E (sum) channel
A (difference) channel
receiver and A/D
receiver and A/D
1-D DFT of min. Nt points
1-D DFT of min. Nt points
modified SMI or GLR algorithm of 6th order (as shown) for theyth Doppler bin with Nps = 1 and Npt = 2
clutter suppressed outputs at they'th Doppler bin and all range cells for further processing
Figure 4.5
Block diagram of a typical HA-STAP with a single A-channel and DDL(2)
matrix for Npt + 1 frequency bins, and st of equation (4.12) becomes a vector of Npt zeros centred around the only non-zero entry of 1.0. Obviously, further computation reduction can be achieved with such st in equation (4.15) or (4.16). This algorithm with Npt = 2 has been applied to measured airborne radar data [4] and the results are presented in section 4.4.
4.3 4 J.I
Analytical performance formulas of EA-STAP SINR potential
The SES[R potential is defined as the output SINR of the filtering portion of the STAP when the correlation matrix is exactly known. The filtering weight vector for the known correlation matrix Q is:
(4.18)
(4.19)
(4.20)
(4.21) (4-22) (4.23) (4.24)
(4.25)
with (4.26) (4.27) (4.28) (4.29) The SINR potential can be found by: (4.30) (4.31) where Ps is the power of the input signal.
4.3.2 Probabilities of detection andfalse alarm 4.3.2.1 MSMI The test of equation (4.15) leads to the probability of false alarm [7]: (4.32) where (4.33) (4.34) (4.35) The probability of detection is given by: (4.36) where, for the case of non-fluctuation target:
(4.37) with (4.38) being the output SINR potential, and a is the amplitude of the input signal.
For the case of the Swerling I target fluctuation model where the target amplitude is assumed to be Gaussian with zero mean and variance o^: (4.39) with (4.40) 4.3.2.2
GLR
The probability of false alarm [7] is: (4.41) The probability of detection is: (4.42) where fp is defined in equation (4.33), and for the case of non-fluctuation target:
(4.43)
with fi of equation (4.38). For the case of the Swerling I target, we have: (4.44) where /3 is given in equation (4.40).
4.4
A real-data demonstration of DA-STAP
To illustrate its simplicity and yet good performance, we will use airborne radar data for a performance demonstration of I] A-STAP against a popular STAP approach, and the conventional non-adaptive approach involving the E-channel only. The airborne radar referenced here was developed by the Air Force Research Laboratory, Rome NY site, and was called the MCARM (multichannel airborne radar measurement) system. The L-band MCARM system has sixteen half-wavelength spaced columns each with a receiver channel, i.e. Nc — Ns = 16. Separately, an analogue beamformer delivers a E-beam of an over —50 dB sidelobe level (Figure 4.6) and a A^-beam with a
—25 dB central null (Figure 4.7). One should note that the above analogue beamforming performance with such a small array is rarely achieved by digital beamforming with current hardware. The data set is part of the Flight 2 data collected over a rural area in the eastern shore region south of Baltimore, MD in early 1995.
magnitude, dB sat
14.4 (is, RxBeam 7, Pos 64, 1270 MHz, RG 70 cells, IPP #1.4
Points @ 0.2 deg. Beam W on Edge, **51.3 dB RMSide**, & 42.1 deg N width 16 az by 8 el Elements, File; 25053011, Time: 5:16:25; 7/25/94
angle, deg
Figure 4.6
Measured E receive pattern at boresight of MCARM system
magnitude, dB sat
14.4 |is, RxBeam 7, Pos 64, 1270 MHz, RG 70 cells, IPP #1.4
PonjtS^O.2 deg. Beam W on Edge, **43.3 dB RMSide**, & 55.9 deg N width 16 az by 8 el Elements, File; 25053091, Time: 5:16:47; 7/25/94
angle, deg
Figure 4.7
Measured A receive pattern at boresight of MCARM system
The three processing approaches compared are: 1
EAa-STAP with DDL(2) and MSMI(6): (see Figure 4.5) Doppler domain localised adaptive processor and modified sample matrix inversion 2 FA-STAP with MSMI( 16): factored approach STAP (FA-STAP) [8] with modified sample matrix inversion and Blackman temporal tapering 3 Conventional PD with CA(18): the conventional pulse-Doppler processor (twopulse canceller) followed by a Doppler filter bank with Blackman temporal tapering and a cell average CFAR using eighteen adjacent cells (CA-CFAR[18]).
test statistic, rjMSm
For all the three approaches given above, a known target signal of —40 dB input SINR is injected at range bin 290, and the thresholds for the same false alarm probability of 10~6 are calculated and marked in the respective output range-Doppler plots of Figures 4.8-4.10. For each range bin, adjacent secondary range bins provide samples for its correlation matrix and/or CFAR threshold estimation. Reliable detection of the same injected target under the given conditions can be achieved only by the EA^-STAP among the three, even though the FA-STAP uses more receiver channels, more samples for correlation matrix estimation and more computation time than does the EA^-STAP. One should note that the above real data demonstration does not involve any effort to optimise the performance of each approach through data conditioning or sample selection, other than excluding the cell under test from the secondary data. Even without such optimisation, the SINR potential of EA^-STAP compares favourably to factored approach STAP and low-sidelobe aperture pulse-Doppler processing. The results shown in Figure 4.11 show the advantage of the E Aa-STAP over the competing processors in the mainlobe clutter region, under conditions similar to the MCARM experiment.
injected target:
CTXTD —
range bin index
Figure 4.8
normalised Doppler frequency
MCARM range-Doppler plot: Y>ka-STAP with DDL(2) and MSMI(6)
test statistic, rjMsm
injected target:
SINR =
range bin index
Figure 4.9
normalised Doppler frequency
MCARM range-Doppler plot: FA-STAP with MSMI(16)
test statistic
injected target:
SINR = PD(CA-CFAR: 18 cells)
range bin index
Figure 4.10
normalised Doppler frequency
MCARM range-Dopplerplot: conventional PD with CA-CFAR(18)
One may also notice that the MCARM system has a very good, low sidelobe, SVbeam to begin with; thus it is interesting to see whether EA^-STAP can still offer a significant improvement over the conventional PD approach if the implementation of the E-beam is not as good. Figures 4.12-^1.14 serve to answer this question, where we have omitted the effects of the temporal DoF reduction in the SINR potential calculation to concentrate on the effects of the S A^-beams. For the three pairs
SINR potentials, dB
input: INR = 50 dB SNR = 0 dB
Bound Hanning full-cycle sine FA (Blackman) PD (Blackman)
normalised Doppler frequency
Figure 4.11 SINR potentials: E A-STAP versus FA-STAP and PD of EA a -beams of —30, —40, —50dB peak E-beam sidelobe levels, respectively, the E A^ can provide significant detection performance in the mainlobe clutter region, outperforming conventional techniques. As anticipated, the SES[R improvement in the sidelobe clutter region is significant, even if the sidelobe clutter level using the chosen transmitter and E-beam is relatively high to begin with. EA^-STAP places nulls in the adapted antenna pattern without significantly changing the existing sidelobe level of the E^-beam.
4.5
Desired A-beam characteristics
Based on fundamental principles of interference cancellation, we may hypothesise the following two desired characteristics of A-beams for EA-STAP: 1
Except the central null, all other nulls of the A-beam should be located at the same angles as the E-beam nulls, at which the interference components in the E-channel have already been significantly attenuated. Null alignment is required since the A-beam cannot be used to improve performance at angles where nulls occur. 2 At each non-null angle of the E-beam, the magnitude of the A-beampattern should be sufficiently larger than that of the E-beampattern. This characteristic reduces errors in adaptive weight estimation, and ensures that the resulting weight for the A-channel is reasonably small so as to minimise degradation of output SINR.
normalised antenna pattern, dB
E-beam A-beam
Nc=l6
SINR potentials, dB
sin 6
SINR bound XA-STAP MTI-DFT with 7OdB cheb.
normalised Doppler frequency
Figure 4.12 a E A-beams b comparison of SINR potentials of EA-STAP (Npt = Nt - 1) and non-adaptive £-only processor: peak sidelobe level (PSL) of E-beamat -3OdB
normalised antenna pattern, dB
X-beam A-beam
Nc=l6
SINR potentials, dB
sin 6
SINR bound IA-STAP MTI-DFT with 7OdB cheb.
Nc=\6 N = 32 L-.PSL =-4OdB (T : 0.22 m/s
normalised Doppler frequency
Figure 4.13 a E A-beams b comparison of SINR potentials of E A-STAP (Npt = Nt - 1) and non-adaptive S-only processor: PSL of E-beam at —40 dB
normalised antenna pattern, dB
Z-beam A-beam
SINR potentials, dB
Nc=\6
SINR bound SA-STAP MTI-DFT with 70 dB cheb.
N = 16 N = 32 Z : PSL =-5OdB <Jv: 0.22 m/s
normalised Doppler frequency
Figure 4.14 a E A -beams b comparison of SINRpotentials of EA-STAP (Npt = Nt — l) and non-adaptive E-only processor: PSL of E-beam at —50 dB
radar platform
ground clutter patch
Figure 4.15
Radar observation geometry and coordinates
The analysis of this section is provided to confirm the above conditions. To simplify the analysis as much as possible, we will use the output SINR potential with the omission of the temporal DoF reduction effects. We will examine the SINR potential as a function of the angle of arrival of a single scatterer, and of the H A^-beampatterns which are formed by a phased array. Consider a ground scatterer located at the angle ^r relative to the platform velocity, as shown in Figure 4.15. Let u = cos V^, and ft(u) = 2Vpu/(XFpr). The correlation matrix due to the single scatterer and receiver noise is: (4.45) where (4.46) with the assumption (4.47) and (4.48) Equation (4.45) can be rewritten as: (4.49)
Denote: (4.50) (4.51) and %c(u)=a2c(u)/cr2n
(4.52)
To avoid lengthy expressions, we temporarily drop the argument u in the following derivation. Denoting:
and
we have: (4.53) (4.54) (4.55) We note that x/> and Q^ differ from x and Q by the temporal DoF reduction. Using the Sherman-Morrison formula [9], we arrive at: (4.56) Further assume that Wj is even symmetric and WA odd symmetric so that: wgw A = 0
(4.57)
Under the above condition, we obtain: (4.58) (4.59) and (4.60)
Therefore we have from equations (4.26) and (4.31): (4.61) and (4.62) where as = W^s5. This is the output SINR potential for a single scatterer with angle of arrival \j/ under the stated assumptions. If F^(u) ^ 0, it is a monotonically increasing function of |F A (w)| 2 , and its minimum is at F&(u) = 0 with F^(u) ^ 0. First of all, one should notice that F^ (u) and FA (M) of equations (4.50) and (4.51) are equivalent to the E- and A-beampatterns, respectively. If F^ (u) = 0 for some u = cos yjr, equation (4.62) becomes: (4.63) which does not depend on F&(u). In other words, the output SINR potential of equation (4.62) will not decrease due to F&(u) = 0 only if F^(u) = 0 at the same u = cos \fs. This confirms our first hypothesis stated at the beginning of this section. Figure 4.16 illustrates the null-alignment effect for a linear array.
SINR potentials, dB
SINR potentials of ZA^-STAP
SINR bound DPCA patn-taylor (30 dB) Hanning/full-cycle sine Taylor (40 dB)/Bayliss (30 dB) Tx antenna: uniform
Nc=\6 Nt=\6 Npt=Nr\ av=0mJs
normalised Doppler frequency
Figure 4.16
Effect o/SA -patterns on S A -STAP: null-aligned patterns versus null peak aligned patterns
The second observation follows from equation (4.62). Moreover, one should note that equation (4.63) is also the highest SINR potential. We can thus define the loss of the SINR potential as the ratio of equation (4.62) over equation (4.63), denoted by L. If we require L > Lj, the specified tolerable SINR potential loss, then we must have: (4.64) For large Nt, strong clutter, or large \F^(u)\ (i.e., in the mainlobe region), we would like to have: (4.65) For example, Lj = 0.5(—3 dB) is corresponding to |FA(W)| 2 > |Fs(w)| 2 and Lj = 0.8(-ldB)tol01og(|F A (w)| 2 /|F E (w)| 2 ) > 6 dB. Equation (4.64) or equation (4.65) can serve as a convenient guideline for EA-beam design.
4.5.1 Mathematical equivalence ofsubarray and T1A-STAP We now consider the case of two subarrays of Figure 4.17 whose beampatterns, E\ (0) and E2(0), are not necessarily equal. If one applies STAP directly to the subarray outputs x, its SINR potential would be: SINR= PssHQ~ls
(4.66)
where the correlation matrix is Qx = E(xxH), and the steering vector s is: (4.67)
array
Now consider applying STAP to: x = Tf x
Figure 4.17
Subarray ing and I] A-STAP
(4.68)
with (4.69) where I ^ is the Nt by Nt identity matrix. Then: (4.70) (4.71) with the new steering vector: s = Tf s
(4.72)
Therefore, the SINR potential remains the same, i.e.:
(4.73)
We note that this equivalence still holds even if the A'-beam is not a A-beam. Although the above is a special case of the general SINR-invariance under the orthogonal transformation, HA-STAP provides the following advantages: 1 2
4.6
Using a real A-beam eliminates the need to calibrate and store the steering vector information as the subarray STAP does with equation (4.69). Even if both subarrays are designed to have low sidelobes, combining the two does not ensure a E-beam of the same low sidelobe level, which may even result in a need to have a separately designed transmit beam and thus increase the overall system cost. The sidelobe level (and gain) of the transmit beam has a direct impact on the overall system performance.
Summary
We summarise this chapter in terms of the advantages of the EA-STAP approach and its limitations.
4.6.1 Advantages of the S A -STAP approach With only two channels, we have shown that the EA-STAP approach can lead to a clutter suppression performance potential as high as that of other STAP approaches that require many more channels. The following are its advantages over others: 1 Affordability - affordability has long been an issue with STAP-based systems. Although progress has been made in reducing the expense of front-end analogue electronics for phased array transmit-receive (TR) modules, the associated digital electronics for A/D conversion, matched filtering, and channel equalisation
remain part of the onboard processing system. Analogue beamforming and minimisation of the number of digitised receiver channels provides a substantial payoff in total system cost. Reliability is increased and maintenance costs are reduced by simplifying system interconnects. In these areas, EA-STAP greatly reduces system cost. 2 Data efficiency - correlation matrix estimation for EA-STAP can be performed with training sets of fewer than 20 secondary data vectors. This feature provides good performance in severely non-homogeneous environments where other STAP approaches break down, regardless of how high their performance potentials are with known a priori clutter statistics. With an integrated non-homogeneity detector, the performance of EA-STAP can be further improved by selection of the secondary data vectors to be used among those in the neighbourhood of the test cell. 3 Channel calibration - channel calibration is a problem issue for many STAP approaches. In order to minimise performance degradation, the channels with other STAP approaches must be matched across the signal band, and steering vectors must be known to match the array. Considering the fact that channels generally differ in both elevation and azimuth patterns (magnitude as well as phase) even at a fixed frequency, the calibration difficulty has long been underestimated, as revealed by recent STAP experiments. The so-called element-space approaches, with an adaptive weight for each error-carrying element which hopefully can be modelled by a complex scalar, offer one possible solution to the calibration problem. This comes at a significantly increased system implementation cost, since each element then needs a digitised receiver channel. Unfortunately, such a solution can rarely be realised for an AEW system with a practical aperture size operated in non-homogeneous environments. With spatial DoF reduction required to bring down the number of adaptive weights to a sample-supportable level, the element errors are no longer directly accessible by the adaptive weights, and thus the embedded robustness of the element-space STAP approaches is lost. In contrast, EA-STAP uses two well structured channels to begin with, and its corresponding signal (steering) vector remains well calibrated for target detection as long as the central null of the A-beam is correctly placed (a reasonable property of good antenna design). Therefore, EA-STAP greatly simplifies the calibration issue in practice. 4 Response pattern - STAP has long been known to exhibit adapted spatial response patterns that are undesirable in some applications, e.g. very high sidelobe levels in some interference-free regions, loss of the mainlobe gain and significantly shifted mainlobe peak. These undesirable beam characteristics can be largely attributed to excessive spatial DoF and associated unconstrained estimation errors. With its spatial DoF equal to one and with the critical null location of the A-beam, EA-STAP offers much more desirable and predictable response patterns than do other STAP approaches with excessive DoF. This property can be maintained even when spatial presuppression of jammers is performed ahead of the EA processor.
5
Computation load - although the trend is toward more affordable computing hardware, STAP processing still imposes a considerable processing burden which increases sharply with the order of the adaptive processor and radar bandwidth. In this respect, EA-STAP reduces computational requirements in matrix order N^ adaptive problems. Moreover, the signal vector characteristic (sparse) can be exploited to further reduce test statistic numerical computations. 6 Applicability to existing systems - with other approaches, the application of STAP almost always requires completely new system hardware, from an expensive phased array to multichannel receivers which may well lead to a demand for a new platform. A significant advantage of SA-STAP, in contrast to other STAP approaches, is its applicability to existing radar systems, both phased array and continuous aperture. Adaptive clutter rejection in the joint angle-Doppler domain can be incorporated into existing radar systems by digitising the monopulse difference channel, or making relatively minor antenna modifications to add such a channel. Such a relatively low cost add-on can significantly improve the clutter suppression performance of an existing airborne radar system, whether its original design is based on low sidelobe beamforming or EA-DPCA. In fact, it is not difficult to see from the discussions of this chapter that the EA-STAP is a very natural combination of the traditional antenna design-based approaches and modern signal-processing-based approaches, making the best use of the strength of each and avoiding the weakness of each. For example, the traditional low sidelobe design and EA-DPCA have been operating at their hardware limits for the very difficult tasks of further lowering the sidelobe level or matching the DPCA patterns. With EA-STAP, one can still take whatever the traditional approaches can offer and at the same time use adaptive processing to improve performance towards the theoretical limit, as well as lifting the PRF-V^ constraint of the EA-DPCA. In this regard, EA-STAP does not attempt to do the whole job alone, and utilises the inherent desirable properties of the traditional non-adaptive aperture. It avoids many of the pitfalls of other adaptive techniques such as the dependence of quality sample support and uncontrolled response patterns with excessive DoF. EA-STAP also offers a new look at the so-called beam-space class of STAP approaches which usually involve identically shaped beams only. By starting with a set of well developed, relatively low-cost beams, a new class of practical STAP system concepts can emerge and may serve to fulfil different system application needs.
4.6.2 Limitations ofllA-STAP We will concentrate on the EA^-STAP for discussion herein. The results can be generalised to EA^- and EA^-STAP. 1 Spatial ambiguities - any two-channel system will lack sufficient spatial DoF for clutter suppression whenever spatial ambiguities occur due to the use of a very low PRF and/or high clutter Doppler induced by a very fast moving platform. The E Aa-STAP is not exceptional. Although the PRF selection for STAP-based
systems requires design attention, the spatial ambiguity problem may limit the application of EA43-STAP without the use of additional adaptive beams. 2 Beampattern mismatch - another limitation comes from the design of the E A^-beams. A deep null of the E A^-beam in the sidelobe region of the E-beam can cause poor clutter suppression performance, unless it is aligned with a null of the E-beam or the transmit beam has a sufficiently low sidelobe level. Before one applies the EA^-STAP, therefore, the S A^-beampatterns should be carefully measured and optimised if possible. 3 Jammer cancellation -jammer suppression is not a feature of EA^-STAP. However, possible application of spatial presuppression of jammers with additional auxiliary channels can make EA^-STAP suitable for such applications. 4 Limited monopulse capability - E A-STAP, as formulated here, does not provide a clutter-free difference channel for monopulse tracking. This capability still exists, however, in regions where clutter can be separated from the target by conventional Doppler processing.
4.6.3 Potential applications of E A-STAP As a large inventory of various airborne radars already exists worldwide, the potential market of a E A-STAP-based add-on upgrade can be huge. It is important to demonstrate the performance gain of such an upgrade, in order for system users to make informed and reasonable decisions. In many cases, a new multichannel system may have a performance gain that is smaller than that which can be achieved by a much lower cost EA-STAP upgrade of an existing system.
References 1 SHERMAN, S. M.: 'Monopulse principles and techniques' (Artech House, 1980) 2 STAUDAHER, F. M.: 'Airborne MTF, in M. I. Skolnik (Ed.): 'Radar handbook' (McGraw-Hill Inc., 1990) chapter 16 3 GRIFFITHS, L. J.: 'Adaptive monopulse beamforming'. Proceedings of IEEE, August 1976, pp. 1260-1261 4 BROWN, R. D., WICKS, M. C , ZHANG, Y, ZHANG, Q., and WANG, H.: 'A space-time adaptive processing approach for improved performance and affordability'. Proceedings of IEEE national Radar conference, Ann Arbor, MI, May 13-16, 1996, pp. 321-326 5 BROWN, R. D., SCHNEIBLE, R. A., WANG, H., WICKS, M. C , and ZHANG, Y.: 'STAP for clutter suppression with sum and difference beams', IEEE Trans. Aerosp. Electron. Syst., April 2000, 36(2), pp. 634-646 6 ELLIOTT, R. S.: 'Antenna theory and design' (John Wiley & Sons, Inc., Hoboken, New Jersey, 2003) p. 142 7 CAI, L. and WANG, H.: 'Performance comparisons of modified SMI and GLR algorithms', IEEE Trans. Aerosp. Electron. Syst, May 1991, AES-27(3), pp. 487^91
8 DiPIETRO, R. C. 'Extended factored space-time processing for airborne radar systems'. Proceedings of 26th Asilomar conference on Signals, systems, and computers, Pacific Grove, CA, November 1992, pp. 425-430 9 GOLUB, G. H. and VAN LOAN, C. F. 'Matrix computations' (John Hopkins University Press, London, 1996) p. 50
Chapter 5
STAP with omnidirectional antenna arrays Richard Klemm
5.1
Introduction
Airborne radar is an essential tool for military surveillance and reconnaissance. There are two stringent requirements related to surveillance and reconnaissance radar: •
•
GMTI (ground moving target indication) capability for detection of moving targets on the ground. The detection of low Doppler targets, i.e. targets whose Doppler frequency appears within the clutter Doppler band requires space-time processing techniques. 360° azimuthal coverage is an important operational requirement to obtain a complete picture of the area of interest. Existing systems such as AWACS use a mechanically rotating antenna. Joint-STARS has GMTI capability, but is based on side-looking antennas which permit only the view in a limited sector.
In this chapter we discuss possibilities of replacing the mechanically steered antenna by a phased array. The phased array antenna offers various advantages such as look agility through electronic beam steering, adaptive beamforming including jammer cancellation and superresolution, and slow target detection by STAP-based GMTI techniques. The major part of this chapter has been reported at a RTO IST/SET symposium [2]. In the following we discuss the possibility of detecting low Doppler targets with an array antenna with 360° azimuthal coverage. Is this contradictory?
5.1.1 Preliminaries on STAP antennas Basically, one can conduct STAP with any kind of antenna array configuration if the array is fully digitised, i.e. all the individual array elements have their own receive channels including an analogue-to-digital converter. The associated optimum STAP
processor exploits the echo data received by all the array elements. Obviously, for realistic array apertures having several hundreds to thousands of elements, this may cause unsolvable problems in various respects: • • • •
processing power computing time, particularly in view of real-time processing requirements availability of a sufficient amount of training data for the adaptive process numerical accuracy.
Suboptimum techniques with the potential of real-time processing for adaptive beamforming and STAP for fully digitised arrays have been proposed by Mather etal. [15]. The linear equispaced array has unique properties among all possible array configurations. With a linear equispaced array suboptimum processor architectures can be designed which achieve near-optimum clutter rejection performance while using a very limited number of weighting coefficients. This can be explained as follows. A linear equispaced array can be subdivided in displaced subarrays of uniform shape, see the top of Figure 5.1. Overlapping or disjoint subarrays are possible. The degree of overlap determines the locations of the subarray phase centres or, in other words, the spatial sampling. It can be shown that best STAP performance can be achieved by choosing the PRF so that spatial and temporal sampling are equal. All subarray elements are combined by subarray beamformers which altogether point in the same look direction. If the shapes of the subarrays are uniform they have the same directivity pattern, which in turn receives the same clutter Doppler spectrum. There is only a spatial phase between the individual subarray output signals due to the spatial displacement of the subarrays. Therefore, clutter cancellation can be done just by subtracting two of these spectra from each other after weighting one of them with the spatial phase factor. In other words, only one weighting coefficient is required. For more details on subarray forming for STAP the reader is referred to Klemm [8, 9, chapter 6] and the literature quoted there. If the idea of uniform subarrays is combined with some technique to reduce the temporal dimension [9, chapters 7, 9] almost optimum clutter rejection can be achieved by using a very small number of spatial and temporal weighting coefficients. Reduction of the number of degrees of freedom requires, of course, that the tolerances and errors in the receive channels are small.1 The disadvantage of linear (or planar) arrays is that their field of view is limited to about —60° . . . 60° in azimuth because of the directivity patterns of the individual array elements. In the following we discuss a few alternative array configurations. These concepts are based on an omnidirectional radiating element as has been used in the crow's nest antenna (Wilden and Ender [16]), combined with the principle of forming uniform subarrays.
If you want to reach the optimum at low expense you have to pay for it in terms of accuracy
beamformers
inverse of beam time covariance matrix
subarray combination (secondary beamformer)
Doppler filter bank
test function
Figure 5.1 STAP processor with overlapping subarrays
5. L 2
The circular ring array concept
The UESA radar (UHF electronically steered array) is an experimental array radar with 360° azimuthal coverage and electronic beam steering. Figure 5.2 shows the geometry of the UESA array. The elements are placed on a circle which lies in a horizontal plane. One third of the elements are active whereas the others are idle. This limitation is caused by the horizontal directivity of the radiating elements. Several authors (Bell et al. [3], Fuhrmann and Rieken [6], Li et al. [14], Zatman [18]) have discussed issues arising in STAP processing for such an array configuration. Some properties of ring arrays with a moving active sector should be mentioned: •
Power budget - the SNR (signal-to-noise ratio) achieved by an active array is the product of the transmitted power times transmit antenna gain times receive
excited elements idle elements
Figure 5.2
Circular ring array
gain. Each of these quantities is proportional to the number of array elements N, Therefore: SNR (X N3
•
•
5.2
(5.1)
This means that the SNR obtained by an array that uses only N/3 elements is 14.3 dB lower than that achieved by an array with N elements. Since the minimum detectable velocity (width of the clutter notch) depends strongly on the SNR this results in a degradation in slow target detection. Array curvature - the curvature of the active sector does not permit the design of identical subarrays looking in the same direction. It is, therefore, not easily possible to design subarray beams with uniform beampatterns in order to reduce the number of antenna channels which means a reduction of the signal vector space in the spatial dimension. This aspect has not been discussed in the literature quoted above. Directivity pattern - the array curvature will affect the directivity pattern. Some broadening of the main beam and some rise of the sidelobes can be expected.
Array configurations for 360° coverage
In this chapter three array configurations are discussed which offer both the aptitude for STAP and 360° azimuthal coverage. Other than the UESA array these concepts are based on a magnetic ring dipole as shown schematically in Figure 5.3. This dipole has been developed for the crow's nest antenna (Wilden and Ender [16]) which is
Figure 5.3
Circular dipole
a spherical array with 360° coverage. The two-way gain of this dipole is slightly above 1 dB whereas a directive dipole on the front of a metal plate as used in linear or vertical planar arrays as well as in the UESA ring array has about 6 dB two-way gain. We assume in the subsequent discussion that the directive dipole in front of a metal plate has 6 dB gain over the omnidirectional ring dipole. The three concepts are: • • •
displaced ring arrays randomly thinned circular planar array octagonal array.
The three concepts will be analysed in terms of the achievable SNR and the directivity pattern. Moreover, these concepts are compared with the concept of four linear (or rectangular planar) arrays. If not denoted otherwise the radar parameters listed in Table 5.1 have been used in this investigation.
5.2.1 Four linear arrays We start the discussion with the concept of four linear arrays arranged in a horizontal plane as depicted in Figure 5.4. One of the arrays is active while the other three
Table 5.1 Radar parameters Platform velocity Range Number of elements Element spacing Number of processed echoes Clutter-to-noise ratio, single element, single pulse Signal-to-noise ratio, single element, single pulse Wavelength ^^Nyquist
vp = 240 m/s 10 km N = 960 X/2 (Nyquist) 24 20 dB — 10 dB A = 0.03 m 32 000Hz
idle elements
active elements
Figure 5.4
Four linear (or rectangular) arrays
are idle. In practice rectangular or trapezoidal planar arrays are used so as to focus the transmitted energy and the receive gain in the vertical dimension. There are operational radar systems existing which follow this design concept. Compared with an array that uses all transmit and receive elements simultaneously, the achievable SNIR is reduced according to equation (5.1) by 4 3 (about 18 dB). Figures 5.5 and 5.6 show the SCNR plotted versus the target velocity for a sidelooking (array aligned with the flight path) and a forward-looking (array perpendicular to the flight path) linear uniformly spaced array. The number of transmit and receive elements was assumed to be 240 (one fourth of the total number of array elements according to Table 5.1). Both arrays look in broadside direction, that means, the sidelooking array is steered in the cross-flight direction whereas the forward-looking array looks in the flight direction. In both examples four curves have been plotted
SNlR, dB
Linear array, side-looking. Nc: 2, 3, 4, 6 (lowest to highest curve)
SNIR, dB
Figure 5.5
Figure 5.6
Linear array, forward-looking. Nc: 2, 3, 4, 6 (lowest to highest curve)
for different numbers of array channels Nc = 2,3,4,6 which have been generated by forming overlapping uniform subarrays. The subarray displacement has been chosen so that the spatial (subarray displacement) and temporal sampling (PRF) are both at Nyquist frequency. That means, the element displacement is chosen to be half the wavelength, and the PRF is chosen so that the phase advance between any two
azimuth, deg
Figure 5.7
Directivity pattern of the linear array versus azimuth (deg)
adjacent elements is equal to twice the distance the array moves during one pulse repetition interval (PRI). This is also called the DPCA condition. A very narrow clutter notch can be noticed in both examples. It is remarkable that for the given set of parameters the SNIR curves are independent of the number of subarrays. This follows from the earlier discussion on subdividing a linear array in uniform subarrays. For the side-looking array the clutter notch appears at zero velocity (tangential motion of the clutter relative to the radar). For the forward-looking array the clutter velocity is determined by the platform velocity times the cosine of the depression angle. Figure 5.7 shows the associated directivity pattern. Notice that the array uses 'directive elements; therefore, no backlobes show up. The picture shows the far sidelobes. The near sidelobes can be seen in Figure 5.21a.
5.2.2
Displaced circular rings
The concept of displaced circular rings has been evolved from a discussion of the properties of the ring array with active sector, see Section 5.3.2. We replace the directive sensors in the ring array by omnidirectional ring dipoles according to Figure 5.3 and use all of them simultaneously. As mentioned above the crow's nest antenna [16] has been designed in this way. However, a circular array has obviously no aptitude for forming subarrays with uniform shape according to Figure 5.1. How can we use STAP with a circular ring array, avoiding the unrealistic option of using a fully adaptive processor. The solution might be an array configuration as shown in Figure 5.8. A number of identical circular ring arrays are arranged in such a way that a group of identical subarrays displaced by
quadrapacks
circular subarray
Figure 5.8
Displaced circular ring arrays
a certain distance is obtained. In practice, such a configuration can be generated by designing groups of sensors (for instance, quadrapacks as in the example Figure 5.8). All elements belonging to a certain ring form a subarray and have to be combined by a subarray (or primary) beamformer. All subarray beams have the same look direction. Now we have a sequence of uniform subarrays displaced by a constant distance in the flight direction. In essence we designed a linear side-looking array with circular ring-shaped radiating elements. In Figure 5.9 the SNIR for different numbers of channels are plotted. We notice that, in contrast to the linear array of Figures 5.5 and 5.6, the width of the clutter notch depends strongly on the number of subarrays. The more subarrays that are formed the narrower the clutter notch is. Notice that the gain outside the clutter notch is 12 dB higher than that obtained by the linear array. Recall from Section 5.2.1 that the reduction of the number of elements by a factor of four results in a loss of 18 dB. Taking into account the gain of the individual element antenna (we assume that the gain of the ring dipole according to Figure 5.3 and Section 5.3.2 is 6 dB less than that for a directive dipole in front of a metal plate) we come up with a difference in SNIR of 12 dB between the displaced ring arrays and four linear arrays concept. Directivity patterns of the displaced ring antenna are shown in Figure 5.10 for different numbers of antenna channels. The maximum of the far sidelobes is about —20 dB. The near sidelobes are depicted in Figure 5.21b. In comparison with Figure 5.21a we notice that the displaced ring antenna produces a broader mainlobe and higher sidelobes than the linear array using 960/4 = 240 array elements.
5.2.3
Circular planar array with randomly distributed elements
Randomly distributed elements have been used for radar antenna design in the ELRA system (Groger et al. [7], Wirth [17, pp. 390]) and in the crow's nest antenna
SNIR, dB
Figure 5.9
Figure 5.10
Displaced ring arrays. Nc: 2, 3, 4, 6 (lowest to highest curve)
Directivity pattern of the displaced ring array versus azimuth (deg) a Nc = 2 b Nc = 3 c Nc = 4 d Nc = 6
(Wilden and Ender [16], Wirth [17, pp. 77]). By distributing the array elements in a random fashion thinned arrays can be generated which offer a narrower beam than an array with half-wavelength spacing, however at the expense of raised sidelobes. We consider now a horizontal planar circular array with random distribution of omnidirectional elements according to Figure 5.3. It is obvious that such an array cannot be subdivided into a number of uniform subarrays. We apply therefore again the technique used already for the displaced ring arrays. At the position of each element a doublet, triplet or quadruplet etc. of sensors is placed. Of course, as in the previous section, the sensors within such a group have to be aligned in the flight direction and displaced by a constant amount so as to design a linear side-looking array with uniform circular planar subarrays as elements. Examples of such nested arrays are shown in Figures 5.11 to 5.14. Notice that the number of elements has been kept constant. As before, uniform subarrays are obtained by combining, for example, all first, second etc. elements of all doublets, triplets etc. of the array. Comparing the four figures it is obvious that the shape of the total array is influenced by the number of subarrays. This was also the case in the displaced ring array concept. The SNIR curves in Figure 5.15 are very similar to those obtained with displaced ring array concept in Section 5.2.2. Again we notice that, in contrast to the linear array, the width of the clutter notch depends on the number of array channels. As can be seen from Figures 5.16 and 5.21c one gets a very narrow beam. The maximum sidelobe is at about —20 dB (lower than for the 240 elements linear array), the average sidelobe level is about —30 dB.
Figure 5.11 Randomly thinned array using sensor doublets (axes in m)
Figure 5.12
Randomly thinned array using sensor triplets (axes in m)
Figure 5.13
Randomly thinned array using sensor quadruplets (axes in m)
5.2.4
Octagonal planar array
The octagonal array has been proposed already in Klemm [9, p. 193]. In the example in Figure 5.17 it is shown how the octagonal array is subdivided into four overlapping uniform subarrays. In this example the subarray displacement is equal to the element
Randomly thinned array using sensor sextuplets (axes in m)
SNIR, dB
Figure 5.14
Figure 5.15
Randomly thinned array. Nc: 2, 3, 4, 6 (lowest to highest curve)
spacing. If the element displacement is half the wavelength then this kind of subarray formation is in agreement with the PRF chosen to be Nyquist of the clutter Doppler bandwidth (in our examples: PRF = 32 000 Hz), i.e. the spatial frequency is equal to the Doppler frequency.
Figure 5.16
Directivity pattern of the randomly thinned array versus azimuth (deg) a Nc = 2 b Nc = 3 c Nc = 4 d Nc = 6
Figure 5.17
Formation of overlapping subarrays in an octagonal array
Octagonal array with 976 elements (axes: element numbers)
SNIR, dB
Figure 5.18
Figure 5.19
Octagonal array. Nc: 2, 3, 4, 6 (lowest to highest curve)
In Figure 5.18 we find a realistic design example with 976 elements which is about the same size as the arrays discussed before. In Figure 5.19 we find the SNIR curves for the four chosen numbers of subarrays (Afc = 2,3,4,6). As can be seen, these SNIR curves are again much more similar to
azimuth, deg
Figure 5.20
Directivity pattern of the octagonal array versus azimuth (deg)
those obtained with linear arrays (Figures 5.5 and 5.6). The clutter notch is narrow compared with those of the displaced ring and random planar arrays. The directivity pattern (Figure 5.20) exhibits a relatively broad beam which is a consequence of the small aperture. Notice that for this array the elements are displaced by half the wavelength. The diameter of this array is about 50 cm whereas the randomly thinned array has a diameter of about 4 m. Of course, if the budget permits, much larger octagonal arrays can be designed, leading to narrower beamwidth, lower sidelobes and even narrower clutter notches. In addition, because this is a regularly half-wavelength spaced array, it can be tapered so as to reduce the sidelobes even further. For irregular or thinned arrays tapering does not give any advantage.
5.3
Discussion
5.3.1 Directivity patterns In Figure 5.21 we compare the directivity patterns of the four array architectures discussed above in the near neighbourhood of the mainbeam. The apertures of the four antennas are approximately: a = 3.6m; b = 2m; c = 4m; d = 0.55m. The different sizes of the apertures result from the requirement that for a given number of elements (960) no spatial ambiguities in the directivity patterns (grating lobes) must occur. The randomly distributed array appears to achieve the best compromise between beamwidth and the level of the near sidelobes.
Figure 5.21
Comparison of main lobe patterns a 4 linear arrays b displaced circular rings c randomly thinned array d octagonal array
5.3.2 Range-ambiguous clutter If the PRF is chosen so that the radar is range ambiguous the received clutter is a superposition of the clutter in the actual range bin and clutter components coming from ambiguous range bins. In the case of a linear side-looking array the Doppler frequency of clutter returns is range independent. Therefore, the ambiguous clutter returns coming from different range bins exhibit altogether the same Doppler frequency. This means, all ambiguous clutter arrivals contribute to the same clutter notch, the clutter notch is practically the same as in the case of range-unambiguous operation. For all other array configurations ambiguous clutter returns exhibit Doppler frequencies different from the Doppler in the actual range bin. Therefore, the ambiguous clutter returns cause a broadening of the clutter notch or even additional clutter notches, which results in degraded moving target detection performance. Even the performance of a side-looking linear array can suffer from aircraft crabbing caused by cross wind [H]. Let us compare the behaviour of one of our planar arrays, for example the randomly thinned array, with the four linear arrays concept described in Section 5.2.1. If there is no aircraft crabbing, ambiguous clutter returns do not alter the clutter notch
SNIR, dB
Figure 5.22
Effect of range ambiguities. Comparison of linear (thick) and randomly spaced planar (thin) array. Nc = 4
as long as those arrays parallel to the flight path are used. However, if one of the arrays arranged in the cross-flight direction is in operation the ambiguous clutter returns have a strong influence on the SNIR curve. For illustration, consider Figure 5.22. The thick curve has been calculated for a linear forward-looking array (orientation in the cross-flight direction) whereas the thin line belongs to a randomly thinned array as treated under Section 5.2.3. Both curves were calculated for PRF = 32 000 Hz. For the assumed geometry 40 ambiguous range bins occur. The influence of the ambiguities on the forward-looking array is dramatic (compare with Figure 5.6 where the ambiguous clutter returns were ignored). It is, furthermore, remarkable that the SNIR curve of the randomly distributed circular array according to Figure 5.13 is not affected by the range ambiguities (compare with the third lowest curve in Figure 5.15). The reason for this beneficial behaviour is the fact that by composing the array of doublets, triplets, etc. arranged in the flight direction, linear arrays with Nc highly directive elements (subarrays with beamformers) are generated whose axes are in the flight direction. This is the architecture of a side-looking array which, as is well known, is not affected by ambiguous clutter returns because the clutter Doppler is range independent. If the linear forward-looking arrays in Figure 5.4 are replaced by vertical planar arrays with vertical degrees of freedom (subarrays in the vertical array dimension) ambiguous clutter returns can be cancelled by placing vertical nulls in the directions of the ambiguous clutter returns. Of course, when considering a constant total number of elements, this means reducing the horizontal aperture by a certain factor, with the natural consequences, such as a broadened mainbeam and a broadened clutter notch.
5.4
Effect of array tilt
So far we have assumed that the various array configurations, i.e. the displacements of the subarrays, are aligned with the flight path. In the following we briefly discuss the impact of a horizontal tilt angle on the STAP performance of the above described array configurations. Horizontally tilted geometries may be generated intentionally, for example, by mechanical steering of space-based radar systems [12], or may be caused by wind drift (aircraft crabbing [H]). We assume a range-ambiguous mode such as medium PRF (MPRF) or high PRF (HPRF) which are both ambiguous in range, Doppler and, hence, produce ambiguous clutter returns.
5.4.1
Side-looking linear and rectangular arrays
In Figure 5.23 the impact of a horizontal tilt angle on the improvement factor achieved by the ASEP ([9], Chapter 9) is illustrated for a side-looking linear array in a rangeambiguous radar mode. The upper curve shows the performance without tilt angle, the lower curve has been calculated for a tilt angle of 5°. For the parameters chosen the number of ambiguous clutter returns between the radar and the line of sight amount to 40. The effect of the tilt angle on the chosen look direction (pi = 0° has been corrected by beam squint. As has been shown in Reference 9, chapter 3, for a side-looking linear array without tilt the clutter Doppler is range independent. Therefore, ambiguous returns assume the same Doppler frequency so that all the ambiguous clutter arrivals fall into the same clutter notch of the STAP filter.
Figure 5.23
Side-looking linear array, horizontal tilt angle 0°, 5°
Figure 5.24
Side-looking planar array, horizontal tilt angle 20°. Rectangular planar array; number of columns c = 24\ number of rows r = +7; o 2; * 4; x 6
Comparing the two curves in Figure 5.23 one can notice that even for a small tilt angle (5°) the clutter notch is broadened considerably by the ambiguous clutter returns which leads to losses in the detection of slow targets. For larger tilt angles the losses would increase. As mentioned above, crab angles up to 17° due to cross wind have been observed during a flight campaign with the AERII SAR instrument [5]. In Figure 5.24 a vertical rectangular array of radiating elements was assumed. The horizontal tilt angle is 20°. The curves have been plotted for different numbers of rows of radiating elements, i.e. different numbers of vertical degrees of freedom of the STAP processor. As can be seen, the ambiguous clutter returns are perfectly cancelled if the number of rows in the array is equal to four or larger. Obviously the vertical degrees of freedom of the planar array serve for vertical nulling of ambiguous clutter arrivals which appear at different depression angles.
5.4.2
Omnidirectional arrays
Let us now consider the effect of range-ambiguous clutter returns on the clutter rejection performance of the above described array configurations with 360° azimuthal coverage. In Figure 5.25 the effect of array tilt in the presence of range-ambiguous clutter on the improvement factor achieved by a displaced ring configuration (see Figure 5.8) is shown. The upper curve has been calculated for zero tilt angle and serves as a reference. It can be noticed that some degradation occurs on the left-hand side of the clutter notch (negative Doppler) similar to the effects observed for side-looking arrays (see Figure 5.23). However, much stronger losses can be observed at positive
Figure 5.25
Effect of antenna tilt on STAP performance (displaced horizontal circles), tilt angle: tilt angle = o 0°, * 5°, x 10°, + 20°
Doppler frequencies. Even for small tilt angles (e.g. 5°) dramatic losses show up. These losses come through the array backlobe due to the fact that we used omnidirectional radiating elements. In contrast, for the linear array (Figure 5.23) cos2-shaped directivity patterns were assumed. For the other two omnidirectional array concepts (displaced randomly thinned array, octagonal array) similar results are obtained as can be seen in Figures 5.26 and 5.27. Following the result given in Figure 5.24 it appears that the only remedy against the degrading effect of an array tilt is to use a horizontal array with two-dimensional degrees of freedom, i.e. two-dimensional subarrays. Such an array is shown schematically in Figure 5.28. Originally this kind of array was designed for the nose radar of a fighter aircraft, such as the AMSAR project [I]. In Figure 5.29 improvement factor curves are shown for the circular array shown in Figure 5.28. In contrast to the other array configurations which are in essence linear arrays with complex subarray structures, this array has a two-dimensional subarray structure and has, therefore, degrees of freedom in the azimuth and elevation dimensions. This array is obviously capable of nulling the ambiguous clutter arrivals in the vertical dimension. Therefore, the improvement factor curves for 0° and 20° coincide.
5.5
Conclusions
Several array architectures with 360° azimuthal coverage and STAP aptitude have been compared. Most of the concepts are based on circular dipoles. An important feature of such array antennas is that all radiating elements are always active.
Figure 5.26
Randomly thinned circular horizontal array, tilt angle = o 0°, * 5°, x 10°, + 20°
Figure 5.27
Octagonal horizontal array tilt angle = o 0°, * 5°, x 10°, + 20°
The results can be summarised as follows: 1
Four horizontal linear arrays • Only one fourth of the elements is busy (in our example 240 out of 960 elements). About 12 dB loss in SNIR has to be taken into account.
Figure 5.28
Circular planar array with checkerboard subarrays
Figure 5.29
Checkerboard circular array, tilt angle = o 0°, * 20°
• •
Very narrow clutter notches (corresponding to the MDV) are produced by the STAP filter. The width of the clutter notch does not depend much on the number of array channels.
•
The directivity pattern exhibits a relatively narrow beam and the usual sidelobe pattern. Since half-wavelength spacing is used, tapering for reducing the sidelobe is possible. • The performance of the forward-looking arrays is affected by ambiguous clutter returns which cause additional clutter notches. • Compensation for range-ambiguous clutter returns can be accomplished by replacing linear arrays by vertical planar arrays with vertical degrees of freedom. The adaptive STAP processor will cancel the ambiguous returns by spatial nulling. 2 Displaced ring arrays concept • Clutter notches are broader than for a linear array. • This disadvantage is offset by the fact that the achievable SNIR is 12 dB higher than for the 240 elements linear array. • The width of the clutter notch increases with the number of subarrays (array channels). • Compared with the linear 240 elements array the beamwidth is enlarged and the sidelobes are raised. • The displaced rings form a side-looking linear array with directive elements generated by the ring-shaped subarrays. Therefore, range-ambiguous clutter returns fall altogether on the same Doppler frequency and thus do not cause any distortion of the clutter notch. • Reduction of the sidelobes by tapering is probably not possible. 3 Randomly thinned circular planar arrays • Clutter notches are broader than for a linear array (similar to the displaced rings concept). • As before, this disadvantage is offset by the fact that the achievable SNIR is 12 dB higher than for the 240 elements linear array. • The width of clutter notch increases with the number of subarrays (array channels). • The achievable beamwidth is comparable to linear 240 elements array, and the near sidelobes are lower. • The randomly thinned circular planar array form a side-looking linear array with directive elements generated by the shifted uniform subarrays. Rangeambiguous clutter returns fall altogether on the same Doppler frequency and thus do not cause any distortion of the clutter notch. • Reduction of the sidelobes by tapering is not possible. 4 Octagonal array. All the subsequent aspects are also valid for a quadratic or rectangular array. It is expected, however, that the octagonal shape approximates better a circular array than does a quadratic array. It is expected that the beam shape does not vary with azimuth angle as much as for a quadratic array. • Clutter notches are narrow. • The width of the clutter notch is almost independent of the number of subarrays. • The achievable SNIR is 12 dB higher than for the linear 240 elements array. Excellent clutter rejection performance is obtained with two subarrays only.
•
The achievable beamwidth is much larger than for the linear array because all elements are distributed in a plane instead of a line. A narrow beam can be obtained only by increasing the number of elements. • The randomly thinned circular planar array form a side-looking linear array with directive elements generated by the shifted uniform subarrays. The subarray configuration has the properties of a side-looking array and, hence, receives clutter returns with Doppler frequencies constant with range. The clutter notch is not affected by range-ambiguous clutter returns. • Effects of aircraft crabbing on the detectability of slow moving targets can be mitigated by subdividing the array in the cross flight dimension. • Reduction of the sidelobes by tapering is possible because we deal with a half wavelength spaced array. 5 Effect of array tilt • Linear arrays as well as omnidirectional arrays based on a linear subarray structure are very sensitive to horizontal tilt if the radar is operated in a rangeambiguous mode. • Among those omnidirectional arrays considered the octagonal array appears to be the preferable configuration. For small tilt angles (about 5°) the losses are tolerable. • Due to the two-dimensional nature of the clutter returns (azimuth, elevation) the array needs a two-dimensional subarray structure in order to spatially cancel with range-ambiguous clutter returns. Perfect cancellation of the range-ambiguous clutter returns can be achieved with an array with a two-dimensional subarray structure. • The price for the robustness against array tilt by using an array with twodimensional subarray structure is an increase in required degrees of freedom in two array dimensions (array channels). The minimum amount of degrees of freedom required is a topic for further studies. • The geometry of range-ambiguous clutter arrivals is range dependent. Therefore, the STAP processing has to be carried out in a range-dependent fashion (as has been proposed in Reference 15) which involves high computational workload. • Doppler compensation techniques as used in References 13 and 4 may be a future way to avoid range-dependent STAP processing. Currently no technique for Doppler compensation of range-ambiguous clutter data is available.
References 1 ALBAREL, G., TANNER, J. S., and UHLMANN, M.: 'The trinational AMSAR programme: CAR active antenna architecture'. Radar '97, 14-16 October 1997, Edinburgh, Scotland, pp. 344-347 2 KLEMM, R.: 'Adaptive antennas for ground surveillance radar'. Proceedings of RTOIST/SET symposium, 7-8 April 2003, Chester, UK
3 BELL, K. L., VAN TREES, H. L., and GRIFFITHS, L. J.: 'Adaptive beampattern control using quadratic constraints for circular arrays'. ASAP 2000 workshop, 13-14 March 2000, MIT Lincoln Laboratory, pp. 43-48 4 BICKERT, B., ENDER, J., and KLEMM, R.: 'Verification of adaptive signal enhancement algorithms based on STAP techniques using four channel AER-II radar data in a forward looking air-to-ground GMTI mode'. Proceedings of TIWRS 2003, Elba, Italy, 15-18 September 2003 5 ENDER, J.: 'Experimental results achieved with the airborne multi-channel SAR system AER-II'. Proceedings of EUSAR'98, 25-27 May 1998, Friedrichshafen, Germany, pp. 315-318 6 FUHRMANN, D. R. and RIEKEN, D. W.: 'Array calibration for circulararray STAP using clutter scattering and projection matrix fitting'. ASAP 2000 workshop, 13-14 March 2000, MIT Lincoln Laboratory, pp. 79-84 7 GROGER, L, SANDER, W., and WIRTH, W. D.: 'Experimental phased array radar ELRA with extended flexibility'. Radar '90, Arlington, VA, 1990, pp. 286-290 8 KLEMM, R.: 'Antenna design for airborne MTI'. Proceedings of Radar 92, October 1992, Brighton, UK, pp. 296-299 9 KLEMM, R.: 'Principles of space-time adaptive processing' (IEE Publishing, London, UK, 2002) 10 KLEMM, R.: 'A planar antenna for GMTI radar with 360° coverage'. Proceedings of IEE Radar, 2002, Edinburgh, Scotland 11 KLEMM, R.: 'Effect of aircraft crabbing on sidelooking STAP radar'. Proceedings of EUSAR, 2002, Cologne, Germany, pp. 203-208 12 KOGON, S. M. and ZATMAN, M.: 'Techniques for range-ambiguous clutter mitigation in space-based radar systems', in: KLEMM, R. (Ed.): 'Applications of space-time adaptive processing' (IEE Publishing, London, UK, this volume) 13 KREYENKAMP, O. and KLEMM, R.: 'Doppler compensation in forward looking STAP radar', IEEProc, Radar Sonar Navig., 2001,148, (5), pp. 253-258 14 LI, T., SIDIROPOULOS, N. D., and GIANNAKIS, G. B.: 'PARAFAC-STAP for the UESA radar'. ASAP 2000 workshop, 13-14 March 2000, MIT Lincoln Laboratory, pp. 49-54 15 MATHER, J. L., REES, H. D., and SKIDMORE, I. D.: 'Adaptive clutter and jammer cancellation for element-digitised airborne radar'. Proceedings of 33rd Asilomar conference, Pacific Grove, CA, 24-27 October 1999, pp. 92-97 16 WILDEN, H. and ENDER, J.: 'The crow's nest antenna - experimental results'. IEEE international Radar conference, Arlington, VA, 1990, pp. 280-285 17 WIRTH, W. D.: 'Radar techniques using array antennas' (IEE Publishing, Stevenage, Herts., UK, 2001) 18 ZATMAN,M.: 'Circular array STA?\ IEEE Trans. Aerosp. Electron. Syst., April 2000, 36, (2), pp. 518-527
Part II
Space-slow time processing for space-based MTI radar
Chapter 6
SAR-GMTI concept for RADARSAT-2 Christoph H. Gierull and Chuck Livingstone
6.1
Introduction
6.1.1 Background The transportation infrastructure of today's world has been readily observable from space as road systems, seaports and airports since the early days of earth observation satellites. As the spatial resolution of spaceborne sensors becomes finer, increased infrastructure detail can be observed. The construction, maintenance, management and evolution of transportation routes and facilities are major financial investments for all levels of government in all nations. The data used for infrastructure planning, design and operation has been based on observations of traffic flow and local spot tests. Up to the present time, there has been no economically feasible way to provide velocity and density measurements of traffic from remote sensing systems. The fundamental technology needed to map moving vehicles is, in fact, well known. Due to the urgent information requirements and high risks associated with military activities, extensive research into motion measurement radars (for example Reference 1) has resulted in the development of operational airborne facilities for detecting and mapping the movement of vehicles on the earth's surface (ground moving target indication or GMTI radars). The Joint-STARS radar ground surveillance aircraft [2,3], developed in the United States in the late 1980s, is a system of this type. Although Joint-STARS has been shown to be effective in its military role, the effective coverage area is relatively small and the cost of operation is high (even by military standards). Manned, airborne GMTI systems will not provide economically viable data sources for civilian applications. Many studies [4-7] have shown that space-based GMTI radar systems can solve the area coverage problem at spatial scales suitable for both civilian and military requirements. To date, the technology needed to implement the GMTI space-based radars (SBRs) has not been sufficiently mature and the estimated capital costs were
prohibitive. Recent advances in efficient, compact, satellite design and in active array antennas have changed the economics of SBR GMTI to a point where the development of a custom designed SAR GMTI demonstrator satellite is being planned. The payoff of proposed applications of low-cost satellite technologies that include active array antennas to commercial SAR satellites has made investment more appealing. At present (2004), no civilian spaceborne radar system has a GMTI capability; however, several spaceborne SAR systems have been, are being and will be flown in the near future. Today's civilian spaceborne SARs were an outgrowth of applications developed for civilian airborne SAR data. The airborne SAR systems evolved from military SAR reconnaissance radars developed in the 1960s and 1970s. The demand for spaceborne SAR data has now grown to the point where significant commercial funding of spaceborne radar system development is feasible. Due to the large capital costs associated with the development and operation of airborne GMTI sensors, these sensors have not been used for transportation system monitoring or other uses. All development drivers exist only in the public sector and are defined by national (primarily military) needs. The air to space path followed by SAR systems is unlikely to occur for GMTI systems. The data to develop volume markets for GMTI measurements must come from space. A fully functional GMTI radar system searches large areas in a sector scan mode to detect targets of interest, dwells on selected measurement tiles several times during the tile access time to measure velocity vectors and compiles target tracks. These radars are much more complex than SARs and must acquire and process or acquire and downlink much more data than an SAR system in the same observation time. SAR modes can be incorporated as a subset of functions into a full GMTI radar. Design studies for spaceborne GMTI emphasise the need for large, real-time, onboard processors to reduce downlinked data volume. Although suitable processing capability can now be developed for ground installations at reasonable cost and weight, space qualified (radiation tolerant) versions are still in the future. Although all of the processes needed for GMTI have been developed for airborne systems, differences in platform velocity, range to target and accessible depression angles between airborne and spaceborne radars result in several unknown parameters for the optimisation of spaceborne GMTI sensors. Cost, complexity, available technology and design risk have all combined to preclude the construction and launch of a spaceborne GMTI.
6.1.2 Addition of MTI modes to spaceborne SAR Instead of deploying a fully functional spaceborne GMTI a polarimetric SAR system that has two parallel receiver channels and associated data recorders and downlinks can be easily converted into a two-channel displaced phase centre antenna (DPCA) radar if a full corporate feed is used for each of the polarisation channels. The essential conversion is a switch that replaces the first splitter link in the feed chain and allows the two halves of the antenna to be routed to the two receiver channels. If an active antenna (distributed transmit/receive (TR) modules) design is used, additional beamforming controls can be imposed to minimise azimuth sidelobes and to match the two subbeams. Provided that the radar is designed to operate as an interleaved
pulse (phase coherent) polarimeter, the SAR PRF will be in a range suitable for DPCA or DPCA-like STAP processing. When the GMTI operation is constrained to the SAR beams defined by the rest of the radar control, GMTI and SAR functions can be simultaneously performed in parallel streams in ground processors. Limited azimuthal beam steering may or may not be available depending on the SAR antenna design drivers. If, in addition to the multi-aperture antenna capability, the radar control is designed to generate the SAR operating parameters from look-up tables of actuator settings on board the spacecraft, the addition of a limited set of GMTI modes requires only additional memory space. Since the SAR is being built regardless of GMTI, this is the lowest cost approach to developing an experimental spaceborne GMTI radar. Although the subset of possible GMTI operating modes available from a radar of this type is small, it can be used to validate GMTI parameters and algorithms needed for more sophisticated radars (experimental functions). The radar can also be used to begin the investigation of possible high-volume applications of GMTI data.
6.1.3 RADARSA T-2 moving object detection experiment RADARSAT-2 is the second commercial Canadian SAR satellite and, like its predecessor, RADARSAT-I, represents a major advance in space-based SAR capability. Figure 6.1 shows an artistic conceptional view of the RADARSAT-2 system deployed in space, which will be owned and operated by MacDonald
Figure 6.1 Artist s conceptional view ofRADARSAT-2 (© Canadian Space Agency)
Dettwiler & Associates (MDA). The RADARSAT-2 system is being designed to provide extended access to the RADARSAT-I modes (with an added cross polarisation channel, HV). These are augmented by a set of new modes that include: an expanded RADARSAT-I mode set (VV and VH polarisations), polarimetric modes, high-resolution modes and an experimental GMTI mode. RADARSAT-2 increases terrain access opportunities by allowing the satellite to roll so that all modes can be used either to the right or the left of the satellite track. As the satellite traverses its orbit it will be mechanically steered in azimuth to coarsely compensate for earth rotation (yaw steering). RADARSAT-2 achieves its expanded capability by taking advantage of the inherent flexibility of an active array antenna design in which transmitter, receiver and control functions are integrated with, and distributed over, the antenna structure. Other functional features that arise from technical advances include increased attitude knowledge and control precision (star tracker attitude measurement), improved orbit knowledge (GPS receivers), programming flexibility (computer architecture advances and distributed control), increased recorder service life and input data rates (solid state recorders) and increased output data rates (dual data downlinks). Further details can be found in Reference 8. RADARSAT-2 MODEX will provide the first opportunity to routinely measure and monitor vehicles moving on the earth's surface from space. In historical terms this is a GMTI mode. The RADARSAT-2 SAR antenna design allows the antenna to be partitioned into two halves along the direction of flight and thus permits two closely spaced observations to be made of the same scene to observe temporal changes [9]. As the radar system is fundamentally a strip mapping SAR, the total observation time for any point on the earth's surface is limited to the time that the radar beam illuminates that point as it sweeps by. The dwell time for velocity measurements cannot exceed the real aperture time of the radar. Object motion will be measured with the following SAR-GMTI techniques: along-track interferometry (SAR-ATI), SAR-DPCA and space-time adaptive processing (SAR-STAP). The RADARSAT-2 MODEX capability is being built into the satellite system by the developer MDA, with collaboration and sponsorship from the Canadian Space Agency (CSA) and the Canadian Department of National Defence (DND). One of the key objectives of the DND GMTI project is to assess the strengths and weaknesses of space-based SAR GMTI systems for measuring cultural activities on the earth's surface. The development of the ground processing/analysis segment for RADARSAT-2 MODEX will be based upon models derived from theoretical understanding of the measurement process. These models are validated by simulations [10] and airborne SAR-GMTI experiments. The experimental airborne GMTI measurements use the two aperture ATI mode of the CV-580 SAR system operated by Environment Canada to provide experimental data for RADARSAT-2 resolution and incidence angles [H].
6.2
Analysis of SAR-GMTI modes for RADARSAT-2
The aim of this section is to evaluate RADARSAT-2's potential detection performance for the different SAR-GMTI techniques based on a statistical description of the
detection problem. In the subsequent analysis, we limit the model complexity (degrees of freedom (DoF)) to the number of antenna elements (phase centres) rather than the complete dimension of space-time samples, commonly used in STAR In fact, the spatial snapshot vectors are considered as mutually stochastically independent along the time (azimuth) or frequency (Doppler) direction, respectively, therefore reducing the model complexity to the space dimension only. The underlying reason for this is the practical execution of such techniques for combined SAR-GMTI purposes. On one hand, there are the classical methods such as DPCA and ATI which are applied on the processed SAR images. Adjacent SAR image pixels are independent as sampling meets the Nyquist criterion. On the other hand, there are raw data-based techniques like raw DPCA [12] or STAP which are efficiently implemented in the Fourier or Doppler domain, where the frequency bins are asymptotically independent. The statistical dependence between the samples is negligible when the time base for the Fourier transform becomes sufficiently large, which is easily satisfiable in case of SAR with a large number of transmit pulses [13].
6.2.1
Background
6.2.1.1 Classical GMTI radar The classical airborne or spaceborne GMTI radar is not an imaging system. The radar outputs are the position coordinates and radial velocity vector components of moving targets. When the radar is designed to observe a scene containing moving targets from different azimuth angles (angle measured from the direction of travel of the radar) the velocity vector of the targets can be inferred. For both airborne and spaceborne radar systems, the motion of the radar and the angular width of the radar beam are combined to embed a component of the radar velocity into the radar returns from stationary scene components. From the radar's viewpoint, the moving scene elements are embedded in a moving scene. The Doppler spectral lines associated with the moving targets are embedded in the scene Doppler spectrum. One effective technique for separating the spectra of moving targets from that of the static scene is to partition the radar antenna into two or more spatial subapertures distributed along the radar's direction of motion. Each spatial subaperture is coupled to its own receiver channel and defines a spatial DoF for adaptive processing of the channel output set. Proper synchronisation of the radar sample frequency (PRF) and the along-track position of the phase centres allows the use of a DPCA background (clutter) cancellation technique for enhancing the moving target signal. However, the alignment of the receiver channels with the flight direction is not imperative because STAP, for instance, works also for forward-looking arrays. 6.2.1.2 Combining SAR and GMTI Because it is not an imaging sensor, a classical GMTI radar only needs to detect and extract measurements from a relatively small number of points (target and potential target locations) within the illuminated scene. Appropriate choices of beam scanning and sampling parameters (PRF) allow the radar to accommodate range (or azimuth)
ambiguous measurements and to search large areas for moving targets by rapidly stepping the beam position to dwell on scene tiles for predefined dwell times. When the radars operate in these modes the acquired data does not meet SAR imaging criteria and the GMTI and SAR modes of radar operation are incompatible. When the radar beam orientation is fixed for long periods of time and when the radar PRF, range gate delay and swath are selected to minimise range and azimuth ambiguities, the same data stream can be processed, through separate processing paths, to produce SAR images, temporal SAR interferometer velocity images and GMTI outputs. In this case, the opportunity exists to apply knowledge gained through one process to the results of another with a net increase in information output. The measurement of object motion using SAR requires two operations. These operations are: the detection of the movers in the SAR data, and the estimation of their velocity vectors and azimuthal locations in the ground-range plane. Target detection and parameter estimation can either be performed incoherently, with a single SAR aperture, or coherently (with much higher fidelity), with two or more apertures [13-20]. 6.2.1.3 SAR imaging of moving targets A conventional range/azimuth coordinate system is assumed in which the azimuth direction on the imaged surface is taken to be parallel to the motion of the radar. Assuming range compression, the imaging geometry model for any target point (x, v, z) on the imaged surface can be expressed in terms of the systematic phase history. If the radar position is given by Rp(O = [xp, yp, zp]T, and the target position is given by Rr(O = [XT, yr, ZTY\ a n d t is defined to be zero when the radar is broadside to the target, then the relative systematic phase history is given by:
te {-|, | J
(6.1)
where range R(O = Rp(O - Rr(OFor an SAR operation, the duration, T9 is the maximum usable synthetic aperture time determined by the two-way antenna pattern.
assumptions. They will generally be displaced in azimuth from their proper image position and will be superimposed on correctly positioned radar returns from unrelated terrain. Signals from badly mismatched objects may not be discernible at all easily in the SAR image. The image-domain representation of mismatched moving targets will be dominated by four effects: range walk, azimuth impulse response broadening, azimuth smearing and azimuth displacement. Depending on the characteristics of target motion, any combination of the above four effects may be apparent [22-24]. 6.2.1.4 SAR measurement of target motion Once a mover is located, the range walk, azimuth displacement, azimuth smear and defocusing effects all provide non-coherent clues to target motion in SAR data. Single aperture techniques can be used to provide a coarse estimation of target velocity for sufficiently strong, sufficiently fast targets. Successful velocity estimation will be contingent upon reliable separation of target size effects and motion spreading effects. If the SAR signal spectrum is filtered into overlapping subbands in the azimuth direction, each subband will then correspond to a subdivision of the radar's azimuth illumination beam into overlapping subbeams. When each subbeam data set is focused to an SAR image, each of the images will have observed the same scene elements at different times. Features whose range location has shifted from one subbeam scene to the next are moving target candidates. The shift divided by the difference in observation time provides a first, rough estimate of radial velocity. Moving target candidates that are not visible in an SAR image processed with a SWMF can, for instance, be found by processing the scene data set for several hypothetical radial velocities that are selected to be in a reasonable range. Image features that become more prominent at non-zero hypothetical speeds are identified for further investigation. A combination of subbeam centroid shift and target enhancement as a function of matched filter velocity hypothesis provides a refined radial velocity estimate. The sensitivity of this non-coherent approach and the resulting accuracy of the inferred target velocities are dependent on the fundamental resolution of the radar mode used to generate the images. If the radar antenna is partitioned into two or more apertures that are distributed along the direction of motion of the radar, and if the design allows each aperture to be assigned to a separate, physical receiver channel, then coherent processing of simultaneously received signals can be used to detect moving targets and estimate their velocities and positions. The fundamental principle of this process is that each aperture observes the scene from the same point in space at a different time. In consideration of sampling ambiguity effects, the azimuth sampling of the radar (PRF) must be increased to adequately sample each aperture. Partition of the antenna into two apertures results in each aperture having a larger azimuth beamwidth than the combined antenna. The synthetic aperture time T will increase and thus the observation time available for the incoherent (impulse response-based) estimation processes discussed previously will increase. For an aperture phase centre separation of distance d the time lapse between terrain observations from the same point in space is: r = d/1|\p ||, where T « J1 and \p is the radar platform velocity. Typically r
has values between 0.5 ms and a few ms. When relatively calibrated SAR data from each aperture are focused to a complex image (using the same matched filter for each image) and then spatially registered to each other, the image content will be common except for scene changes over time r. For land (as opposed to ocean) scenes, 'fixed' elements are stationary over time r and their image representations will be very nearly identical. Moving objects, however, will be displaced between the two scenes and these displacements can be measured to fractional wavelength accuracies as phase shifts. When a so-called three-beam DPCA process is used, monopulse processing techniques can be employed between beam pairs to refine estimates of the azimuth position of observed targets. Existing, operational GMTI radars use three physical phase centres to provide robust DPCA clutter suppression and monopulse target position measurement operation. So-called adaptive DPCA clutter suppression algorithms use both spatial and temporal degrees of freedom to detect and measure moving targets. These algorithms are special cases of space-time adaptive processing (STAP) [25,26]. Its successful application to multichannel SAR data has, for instance, been described in References 13 and 14.
6.2.2 Statistical models of measured signals As long as the SWMF is strictly a linear operator on the received data, the shown statistical properties are valid for both the raw data and the processed SAR images. Even though in reality one has to deal with a variety of clutter properties, particularly heterogeneous terrain or water surfaces, the following performance analysis concentrates for clarity on homogeneous clutter. Extensions to the heterogeneous case can be found elsewhere, e.g. Reference 27. 6.2.2.1 Stationary clutter The quadrature components of a radar channel for the stationary world (clutter) are often modelled as zero-mean Gaussian processes which implies Rayleigh distributed magnitude. The model is warranted for rough (on the scale of the wavelength) but homogeneous backscatter magnitude terrain because the sum of independent (rough), identically (homogenous) distributed scatterers, will be complex Gaussian distributed by the central limit theorem. In a typical SAR image, the model results in granularity commonly denoted as speckle. The model has been validated over homogeneous agricultural and natural areas, but breaks down over more heterogenous or smooth surfaces such as urban or sea surfaces, see e.g. Reference 28. For a multichannel SAR system, the one-channel analysis has to be extended to several channels. As a starting point consider Af channels and define the column vector Q = [ Q i , . . . , QNV with elements Qt = y ^ J Z / , where Z^ is a standard complex normal distributed variable, i.e. Z/ ~ M (0,1). The mean power level of the /th channel is denoted by Prj for i = \,...,N and T is the transpose operator. The vector Q is modelled as a multivariate complex Gaussian random vector with density /q(q) Z=Z ^-N det(Q)""1 exp(—q*Q -1 q), where Q denotes the clutter (plus noise)
covariance matrix, the superscript * means complex conjugate transpose and det(Q) is the determinant of Q [29]. The following matter considers RADARSAT-2 where only two channels are involved, i.e. the dimension of Q is chosen to be N — 2. The covariance matrix can be written to:
(6.2) where Prt = Pcj -f Pn, p = y/Pc,\ Pc,i/(Pr,\ Pr,2), I is the identity matrix and p exp (jO) is the complex correlation coefficient between the two channel outputs. The magnitude of the complex correlation coefficient p will be referred to as coherence for simplification of notation. Cross-channel correlation is a potential source of information in different areas of remote sensing. It is a composite measure of all effects which lead to decorrelation of the signals, for example, temporal decorrelation due to surface backscatter variation and the unavoidable receiver or sensor noise. Typical values for solid surfaces in single-pass across-track interferometry as well as for along-track interferometry are larger than 0.95 but can be significantly lower for water surfaces p < 0.8. In order to reduce the speckle, the data are multilook processed which requires averaging several independent one-look interferograms. The n-look sample covariance matrix is given as: (6.3) where n is the so-called number of looks and q^ the &th one-look or single-look sample. Even though the following theoretical derivation of the PDFs is based on the assumption of statistically independent samples, it has been shown that this PDF can still be applied if certain dependence between samples occurs. In such a case, compensation with an effective number of looks (which is smaller than the averaged sample size) can account for the statistical dependence [30,31]. The random matrix A = nQ is well known to be complex Wishart-distributed, A ~ Wp(n,Q) [29], with probability density:
/A (A)
= ,r ( n)rt ( -\Vdet( Q r exp(-tr{Q"'A})
(6 4)
-
For across-track interferometry the interferometric phase 6 in equation (6.2) depends on the illumination geometry and the terrain elevation, for along-track interferometry this value is often assumed to be zero (0 = 0), i.e., the clutter is assumed to be stationary. In cases of internal clutter motion, such as a sea surface current, 0 is a function of the current and wave surface velocities.
6.2.2.2 Moving targets Deterministic target model - As a first intuitive model, a deterministic moving target signal s(#) = \/T^[l,e~J^]T can be considered as superimposed upon the stationary clutter returns in each channel. Without loss of generality, the Doppler phase of targets in the fore channel can be set to zero. The real amplitude *J~FS determines the signal-to-clutter ratio (SCR). Since n single-look cells in equation (6.3) are summed, the resulting statistics depend on both the number of moving targets (included in the entire multilook cell) and their dimension compared to the multilook cell size. In the most general case, each single-look cell contains a moving point scatterer with different RCS and different radial velocities. This scenario might happen when observing either urban areas with a potentially large number of movers or extended targets where different scattering centres may have different radial velocity components, such as large ships rolling and pitching in the sea. In such cases, the /th channel output is:
Xi(Jc) = s(k,») + qiik) = y[pMe>m
+ qt(k\
k = 1,2, . • • ,ft
(6.5)
The composite random vectors x(k) = [x\(k),X2(k)]T are mutually independent identical complex normal distributed with expectation s(fc, #) = ^ Ps(k){\,e^k)Y fork = 1 , . . . , n and covariance matrix Q, i.e. X(A:) ~ Af?r(s(k), Q). In contrast to equation (6.3), the random matrix A = YHc=\ X(A;)X(Zc)* is now non-central complex Wishart distributed, A ~ Vl^p (ft, Q, Q)9 with non-centrality matrix Q = Q - 1 MM*, where: (6.6) The density function of the positive definite A is [32]:
As this density function contains a hypergeometric function of matrix arguments, analytical analysis is exceedingly complicated. However, for two practical cases statistical evaluation becomes possible. First, regarding SAR systems with resolutions of a few metres to submetre range, the assumption of a single moving target within a multilook cell for a reasonable number of looks (MO) seems adequate, i.e. s(k, 0) = S(1O-). This mover can either be a single-point scatterer or an extended target consisting of several scattering centres with approximately the same amplitudes and radial velocities, such as a large truck etc. Under the assumption that the target covers only / < n single-look cells, the matrix of expectation vectors in equation (6.6) is given as:
where the vector e/ has / components equal to one and the rest zeros. In this case the matrix product MM* (and therewith also the non-centrality parameter
Q = Q 1 MM*) has rank one. The non-centrality matrix can be written as £2 = lQ~ls($)s(&)*. The density function of A now involves hypergeometric functions of scalar arguments:
(6.7) The equivalent density function for real variables (when the non-centrality matrix has rank one) was originally derived by Anderson and Girshick [33] where they applied the identity 0F\(a;X) = V(a)/(VX)a~lIa~\(2VX) for the confluent hypergeometric function. / v ( ) denotes the modified Bessel function of order v. Second, when the number of looks becomes sufficiently large, the second moment matrix A can be simplified because the cross terms (of independent random variables with zero mean) tend to zero for increasing n. Hence it can be rewritten as a superposition of the target signal matrix on the clutter covariance matrix: (6.8) Gaussian target model - In the previous moving target model it was assumed that the single mover amplitude was constant from one-look cell to the next one-look cell. In practise this will not be the case because varying propagation conditions, system instabilities and, primarily, variation of the target RCS caused by changing aspect angles are reasons for amplitude fluctuation. If a target consists of many single reflectors, then the backscattered signal is composed of many single contributions with quasi random relative phases. Accordingly, it can be approximated as complex normal distributed. Over short periods of time, e.g. corresponding to the fast time or range direction in SAR, the aspect angle changes only a little and the amplitude can be assumed to be constant. This model is commonly named the Swerling I case. In the flight direction of the SAR, i.e. slow-time or azimuth direction, the pulses may be far enough apart that the amplitudes of the single returns can be considered as stochastically independent. This so-called Swerling II case is investigated in the subsequent sections. Depending on the geometric resolution compared with the target dimensions, two Swerling II applications have to be distinguished. If the mover dimension is of the order of the multilook cell size the resulting covariance matrix for signal plus interference (clutter and noise) can be modelled analogously to equation (6.5), but where the deterministic target signals have to be replaced by the random variables 5/ (k) for / = 1,2. As mentioned above, the composite random vectors S(Jc) = [S\(k)9S2(k)]T are assumed to be mutually independent complex normal distributed S ~ TV^(O, Qs) with expectation zero and covariance matrix:
where ps denotes the coherence of the moving target signals. It is important to note that in the absence of temporal decorrelation, the coherence between the two channel outputs will be exactly identical for the target and the clutter. With temporal decorrelation, the target coherence can either be larger or smaller than the surrounding clutter coherence. A perfectly steady vehicle, for instance, smoothly moving over a terrain of vegetation might have a larger coherence than the vegetation. The contrary, where the vehicle is rolling and bouncing over terrain which is otherwise perfectly stationary, leads to a lower coherence for the target compared with the surrounding terrain. However, this motion-induced decorrelation will not change the principle form of the optimum filter (equation (6.12)). Therefore, by setting ps = 1, one can concentrate on the deterministic signal vector s(#) = ^fP~s[l,exp(—j$)]T. Since the clutter and the target are mutually stochastically independent, the sum of them will still be complex normal distributed, i.e. X ~ A/^ (O9 Qs + Q)- The resulting sample covariance matrix: (6.9)
with \(k) = q(ifc) + s(ifc) is therefore complex Wishart distributed A ~ W*p(n,R) with n degrees of freedom. The composite covariance matrix is given as R = Qs + Q which has the same structure as in equation (6.2). In cases where the moving target dimension is smaller than the multilook cell size, the model in equation (6.9) is no longer adequate. Instead, we get for the sample covariance matrix: (6.10)
when it is assumed that the mover is only contained in the first / G {2,..., n] onelook cells. Therefore, the random matrix A consists of a sum of two independent complex Wishart distributed matrices B ~ W*p(Z,R) and C ~ W ^ (n - /,Q). To the best knowledge of the authors the corresponding density function has not yet been derived in closed form.
6.2.3 SCNR optimum processing 6.2.3.1 Known covariance matrix Let the signal output, i.e. the test statistics on which the actual detection is based, be the magnitude of y = b*X, where b is the filter or beamformer vector and X = [Xi, X2]T is the sensor outputs. Due to one of the most basic facts in detection theory, the probability of detection of a signal in noise depends mainly on the SNR. Therefore, one criterion for adjusting the beamformer vector b could be maximisation of the signal-to-clutter-plus-noise ratio (SCNR) K of the signal output when the processor
is matched to Doppler phase #: (6.11) where s(#) = y^[l,exp(—jtf)] 7 ' is the given target signal vector at this Doppler phase. It is easy to verify that the optimum filter is: bopt(#) = K T 1 / 2 s ( # ) =
YQ~1S(#)
(6.12)
The optimum filter can be interpreted as a two-step process, the decorrelation (whitening) of the clutter with Q" 1 / 2 and a matched filtering with the adapted signal s = Q~1//2s. Applying the optimum filter (e.g. for y = 1) to the measured data vector x, yields for the signal output: y= sWQ-1!
(6.13)
where the data vector x contains only clutter plus thermal noise, i.e. X = Q - A/2 (0, Q), the beamformer output (equation (6.13)) will be a linear combination of normal variables which is also complex normal distributed with zero mean and variance: a 2 = E|v| 2 = S(^) + Q" 1 EXX*Q~1s(#) = s(^)*Q~ 1 sW
(6.14)
Therefore, the magnitude \y\/cry is Rayleigh distributed and the detection threshold can be calculated via the cumulative distribution function of the Rayleigh distribution for any given false alarm rate a. If, for instance, a deterministic target signal s(#) is included in x, such that the corresponding random vector X = s(#) + Q, the magnitude \y\/cry will be Rice distributed with parameter s(^)*Q - 1 s(#)/<7y, from which the probability of detection Pd for any arbitrary Doppler phase # can be directly calculated. If the target signal S itself can be considered as a Gaussian random variable, i.e. X — A/2 (0, s(#)s(#)* + Q) see subsection 6.2.2, the output y is zero-mean complex normal distributed with variance: a] = JWQ- 1 S(O)(I + 8(O)+Q-1S(O))
(6.15)
and the magnitude \y\/cry follows a Rayleigh distribution. Figure 6.2 illustrates the histogram of \y\ for the deterministic target and Figure 6.3 for the Gaussian target with # = 0.5 rad over varying SCR based on simulated data. The superimposed solid curves are the corresponding Rayleigh and Rice density functions; a perfect match can be recognised. Since the random target model performance is generally worse in terms of lower probability of detection, it will serve as the worst case scenario for the remainder of this chapter. Depending on the spatial resolution of the SAR system, one possibility for increasing Pd is to average several independent time samples (snapshots) xm for m = 1 , . . . , n in order to reduce the noise component. However, applying the average on the complex data, i.e. \y\ = 1/M| YTm=\ s (#)*Q~ lx ml does not yield the desired effect, because the expectation and variance of the Rayleigh and Rice distribution are reduced
Figure 6.2
Histograms and theoretical PDFs of beamformer output for SCNR-optimum processing over varying SCR with deterministic target model
in the same way. In other words, this average only leads to a rescaling of the abscissa in Figures 6.2 and 6.3 and hence Pd remains unchanged. Instead, it is advantageous to average the magnitudes, i.e. \y\ = 1 / M ^ = 1 |s(^)*Q~ 1 x m |, therefore reducing the variance by a factor of \/n while keeping the expectation constant. However, assuming that the moving vehicle does not cover the entire multilook cell (which is likely for the rather moderate resolution of spaceborne SAR systems), the effective SCR will decrease by roughly a factor of l/n if / denotes the number of single-look cells covered by the target. As a consequence, there is a trade off in terms of reduction of noise versus reduction of SCR. Figure 6.4 shows the resulting probability of detection Pd for the Gaussian target model versus radial velocity for varying SCR for a number of temporal samples n — 9, when only / = 3 samples contain the target. Since the PDF in this case is unknown (see equation (6.10)), a numerical study has been performed instead. The Doppler phase, which is approximately given as:
(6.16)
Figure 6.3 Histograms and theoretical PDFs of beamformer output for SCNR-optimum processing over varying SCR with Gaussian random target model
was determined in accordance to RADARSAT-2 system parameters: wavelength X = 0.0556 m, sensor spacing d = 7.5 m and platform velocity vp = 7500 m/s. The false alarm rate was Pf — 10~4. One can see, for instance, that a target moving with a radial velocity of 18 km/h (5 m/s) and an SCR of 10 dB can be detected with a 97 percent probability, which corresponds to an improvement of about 17 percent compared with the single-look detector (due to limited space, the corresponding single-look result is not shown here). 6.2.3.2 SMI, LSMI and EVP In practice the covariance matrix is not known and is usually estimated from a set of sample vectors with the same clutter statistics, but which have to be free of moving target signals. Using the sample covariance matrix of equation (6.3) instead of the asymptotic (Q in equation (6.13)) is commonly called the sample matrix inversion (SMI) technique. Its loss compared with the optimum solution was first studied in Reference 34. If only a very few samples n are available for computing Q, a loading of the diagonal of Q prior to inversion can reduce the SCNR loss significantly. This method called loaded SMI (LSMI) has been studied, e.g. in Reference 35.
SCR = O SCR = 5 SCR=IO SCR= 15 SCR = 20 SCR=25
Figure 6.4
Probability of detection of the SNCR optimum processing for the Gaussian target model versus radial velocity for varying SCR (dB) after 9-look averaging. Only three single-look cells contain the target, Pf = 10~4
However, for side-looking SAR-GMTI the entire range dimension can support the covariance matrix estimation, assuming a sufficiently broad antenna pattern in elevation. Therefore the estimation loss can be neglected in most practical cases. Due to range dependence or heterogeneity of clutter returns, this is not the case, for instance, for squinted or forward-looking geometries [36]. Another interesting class of techniques, the so-called subspace or projection methods are a special case of the inverse of the covariance matrix when the CNR tends to infinity. Rewrite equation (6.2) as: (6.17) Use the matrix inversion lemma to get: (6.18)
Equation (6.18) tends for infinite CNR, P«/||p|| 2 = Pn/(Pc,i + Ptf) -* 0, towards a multiple of the projection PjJ- = (I — pp*/||p|| 2 ) onto the complement of the clutter subspace spanned by the vector p. The importance of such techniques for practical applications stems from the fact that projections suppress the clutter theoretically down to zero (independently of the actual clutter power) whereas the amount of clutter suppression achieved by SMI and LSMI is inversely proportional to the average clutter power level within the training data [26,37]. The arguably best known approach to estimate the clutter subspace basis vector p for finite sample size is based on the eigenvector decomposition of the sample covariance matrix. This eigenvector projection method (EVP) has been intensively investigated in the past by many authors for different radar applications such as superresolution (here it is traditionally called MUSIC) or interference suppression in large phased arrays, e.g. Reference 38. However, the EVP features an extensive computation oftentimes preventing its use in practical applications. Faster and computationally much less intensive techniques possessing similar fidelity have, for instance, been proposed [37] which will also be considered for RADARSAT-2-GMTI.
6.2.4 SAR displaced phase centre antenna The classical DPCA two phase centre clutter suppression can be regarded as a special case, or more precisely as an approximation, of the SCNR optimum processing. The inverse of the 2 x 2 covariance matrix of equation (6.2):
(6.19)
If one now assumes that the power in the two channels is identical, Pr?i = Pr\ = Pr and that 0 = 0, i.e. perfect calibration (removal of any channel imbalances), equation (6.19) yields: (6.20) Instead of scanning for a variety of values of ft in the matched filter, it has been found to be sufficient to use a fixed beamformer vector s(#o) = V^* P» exp(—y^o)]7, i.e. a single mismatched steering vector. Choosing, for instance, fto = ^ 5 which corresponds in the terminology of conventional MTI radar to the optimum speed, the signal output will be y = const.* (JCI — #2). In other words, for #0 = n, the optimum filter simplifies to the difference between the channel outputs regardless of the coherence [16,18,19]. Recalling that X ~ A^(O 5 Q), the magnitude of the difference \Y\/ay = \X\ X2I is analogous to the SCNR optimum processor which is Rayleigh distributed for the clutter-only case and Rice distributed, with parameter 2y/7^ sin(ft/2)/ay, for the deterministic target case. The variance is given as ai — Pr,i + Pr,2 — 2p/Pr~JP~^ COS(O).
In case of the Gaussian target model, i.e. X ~ №2 (Q9 R) with:
(6.21) the resulting beamformer output \Y\/ay is again Rayleigh distributed with &2y = 2(Pr + P5)(I - pcosO), where:
(6.22)
The receiver operating characteristics calculated on the basis of this PDF look similar to Figure 6.4, except that a loss of 2-3 dB compared with the optimum solution, i.e. when the optimum filter is optimally matched to the signal s(#), can be found. In particular, the probability to detect the aforementioned target moving with 18 km/h and a SCR of 1OdB is reduced to 93 percent from 97 percent for the optimum processing.
6.2.5
SAR along-track interferometry
In contrast to the previously introduced techniques, SAR along-track interferometry (ATI) exploits only the interferometric phase, i.e. the phase difference between the channels and ignores the magnitude information. After coregistering the channels and under ideal conditions the two channel signals are identical for stationary terrain (clutter). Hence, it can be cancelled out by computing the phase difference (i.e. the interferogram), leaving only the moving targets in the differential data. In practice, the unavoidable phase noise limits the degree of cancellation [17,32]. The argument of the off-diagonal elements A12 = A^1 in equation (6.3) describes the phase of the multilook interferogram for the clutter: (6.23)
The integration of the PDF of equation (6.4) with respect to the diagonal elements An and A22 then leads to the joint density function of the multilook interferogram's phase and magnitude [39,40], and further integration with respect to the magnitude
r\ leads to the marginal phase density:
(6.24) for — TV < r/r < it and where 2F\{a^b\c\y) is Gauss's hypergeometric function. It should be noted that, although n is an integer in the derivation of the complex Wishart density function, the function in equation (6.24) is still a density function when n is any real number n e IR greater than one [31]. The statistics of the interferometric phase when a moving target signal is superimposed upon the clutter have first been investigated in Reference 32. For a deterministic target signal s(#) and a sufficiently large number of looks n, i.e. when equation (6.8) holds, the joint density function of the interferometric phase \jr and normalised magnitude rj was determined to be:
(6.25) where <5 denotes the SCR 8 = Ps/^Pr,\Pr,2> Unfortunately, integrating equation (6.25) with respect to rj to get the marginal phase PDF seems to be intractable. Alternatively, numerical integration can be used. For the Gaussian target model, the PDF of x/r has the same form as equation (6.24) except that the correlation coefficient p has to be replaced by p and the phase 0 by 6 given in equation (6.22). For increasing SCR values, the phase density becomes wider than the original clutter PDF because the factor p is always less than one if # ^ 0, assuming that ps = p. At the same time the peak of the PDF migrates towards the target phase value #. When the SCR tends to infinity (Ps -> oo), p tends to p and 0 towards ^, i.e. the phase density function in that case is identical to the original clutter PDF except for the Doppler shift towards the target frequency. This behaviour of the PDF is illustrated in Figure 6.5, where the perfect agreement between theory and simulation can be seen. The target Doppler phase was chosen to 1.3 rad. Figure 6.6 shows the corresponding probability of detection as a function of target radial velocity for n = 9 and / = 3. It is found that the probability of detection is always less than that of DPCA. Particularly, the example target moving with 18 km/h and possessing a SCR of 10 dB can only be detected with 60 per cent chance compared with 93 per cent for multilook DPCA. However, even though the performance of DPCA seems to be significantly better than that for ATI, one has to bear in mind that this is only true for homogeneous
ATI phase, rad
Figure 6.5
Histograms and theoretical PDFs ofinterferometricphase over varying SCR with Gaussian random target model True phase \jr = 1.3 rad, n = 1
terrain. It was shown in numerous publications (e.g. References 27 and 40) that the interferometric phase is invariant against inhomogeneity of the clutter, hence preserving its detection capability in fairly heterogeneous composite terrain. In contrast, the PDF for DPCA will show larger tails in such cases, i.e. a higher probability of larger amplitudes. In order to keep the false alarm rate constant, the detection threshold must inevitably be increased which in turn decreases the probability of detection. A quantitative comparison of the performance loss caused by extremely heterogeneous terrain, such as urban areas, is currently under way, e.g. Reference 41.
6.3
SAR-STAP scheme for RADARSAT-2
6.3.1 Detection This section gives a brief summary of the theoretical background and the resulting algorithms for adaptive space-time (or more precisely space-frequency) processing in conjunction with SAR for RADARSAT-2. An elaboration of this topic and its successful application to experimental airborne SAR data can be found in References 13 and 14 and also in Chapter 3 in this book.
SCR = 0 [dB] SCR = 5 [dB] S C R - 1 0 [dB] SCR= 15 [dB] SCR = 20 [dB] SCR = 25 [dB]
v rad , m/s
Figure 6.6
Probability of detection of ATI for the Gaussian target model versus radial velocity for varying SCR (dB) after 9-look averaging, Pf = 10~4
Although for the statistical performance analysis in Section 6.2.3 it was sufficient to look at the problem in a static way, such as using two processed SAR images, STAP working on the raw data requires consideration of the time history of the received clutter and moving target echoes. Let x(/,S) be the received echo consisting of a moving target signal s(t,%) superimposed upon the stationary clutter qO) and the unavoidable thermal receiver noise n(t). The parameters describing the moving target, such as velocity in the along-track direction va, velocity in across-track or range direction vr and its location xo at time t = 0 are combined in the vector S = [va,vr,xo]T. In the following, the Fourier transform of the signal will be denoted as X(&>,§) = .^{x^S)}. Even though the correlation time r c of the clutter q(t) in the time domain is rc = l/BC9 where Bc = 2up@3dBA is the clutter bandwidth constrained by the antenna beamwidth @3dB> the frequency bins in the Doppler domain are asymptotically mutually independent. For a sufficiently long time base of the Fourier transform, the receiver output vector can be considered as being stochastically independent and complex normal distributed, i.e. X(O)9 S) - Np(S(CQ9 S), Q((o)), where S(ty, S) is the Fourier transform of s(t, S) and Q(co) the so-called spectral power density matrix of the clutter plus noise. Under this premise, the SCNR optimum filter for a particular frequency is given, analogously
to equation (6.14), y(oo) = S(co, ^TQ'1 (co)X((o) with the optimum SCNR: /CoPt(Co) = S(co, §)*Q~1 (a>)S(G>, §)
(6.26)
Figure 6.7 shows an experimental data example of this SCNR optimum processing in the Doppler range domain for an airborne two-channel side-looking SAR system [11], with S (co, £) = S chosen as fixed to S = [ 1, — 1 ] T . The left-hand image shows y before any clutter cancellation, i.e. y(co) = S*X(co) (the clutter energy spread over the entire clutter bandwidth can be clearly recognised) and the right-hand one after suppression with SMI, y(co) = S*Q~1X(ct>). The clutter covariance matrix was estimated over all the range bins and hence possesses a very low variance. In the right-hand image, one can clearly recognise the enhanced moving target signals which were partly or fully covered by the clutter. The fast moving cars on the highway (upper part) are at least partly outside of the clutter bandwidth, whereas the slow moving vehicle in the bottom part was completely covered. Since the target energy in S(&>, £) spreads over a certain bandwidth, depending on the parameter vector £, the optimum filter is modified as an integration over the target bandwidth:
range bins
range bins
(6.27)
Doppler frequency, Hz
Figure 6.7
Doppler frequency, Hz
Doppler range image of two-channel SAR data before (left) and after STAP processing (right)
To analyse the potential performance of SAR-STAP it is advantageous to have a closer look at the model for the received target signal: Si(t,$)
= Di(ute))e*PU2PRite))
(6-28)
where R(t) denotes the slant range distance and ui(t) the direction history (directional cosine) from the /th antenna to the moving target on the ground. D1-(M) describes the two-way antenna pattern of the /th channel. In the far-field assumption, u\ (t) = u \ (t) and the distances can be written as R((t) = 2R(t) + u\(t)d, where d is the spacing between the receivers and R(t) the slant range distance from any reference channel to the scatterer. The two-dimensional signal vector becomes: (6.29) In array processing terminology the vector a(w) is called the steering or direction of arrival (DoA) vector. Using a special property of chirp signal possessing a large time bandwidth product, the Fourier transform of equation (6.29) can be written in analytical form to: S(CD,!;) = y(coMu(co,i;))
(6.30)
[13], i.e. as a multiple of the DoA vector in direction:
(6.31)
For non-moving objects this dependency tends to a straight line u(co) — —co/(2fivp) where the slope is determined by the platform velocity vp. For spaceborne systems with vp ^$> va, equation (6.31) can further be simplified to u(co, £) = —co/(2fivp) + vr/Vp for any ground moving vehicle. Inserting the normalised vector of equation (6.30) into the optimum SCNR of equation (6.26) leads to the so-called space-time characteristics (STC) or transfer function: (6.32) which is an excellent tool for the examination of moving array systems for GMTI [13]. Figure 6.8 shows the anticipated STC of the two-channel RADARSAT-2 antenna when the signal is transmitted from one half of the antenna and received on both halves, resulting in a phase centre separation of about 3.75 m. For simplicity, the single-element pattern was chosen of exponential form with a beamwidth 0 3 dB = 0.42° (horizontal dashed lines) corresponding to the 7.5 m subaperture.
directional azimuth, deg
Doppler frequency, Hz
Figure 6.8
Space-time characteristics of the two-channel RADARSAT-2 antenna, d= 7.5m
The vertical dashed lines represent RADARSAT-2's maximum PRF of 3.8 kHz and combined with the horizontal lines indicate the GMTI-relevant Doppler direction plane. Bright white indicates perfect clutter suppression, i.e. a detection capability as it would be without any clutter present. Obviously, a fully white plane would be desirable. The adaptive filter (even though it is only of dimension two) forms a sharp notch along the clutter trajectory u(co) = —co/(2pvp) (white solid line). Target motion will create a deviation from this straight line, see equation (6.31), i.e. will more or less fall in the white area and hence become detectable. Most significantly, two ambiguity zones (notches) caused by the spatial undersampling of the array at either side of the plane are visible. Targets with a Doppler direction trajectory getting close to these notches will be partly suppressed as well and will be less detectable. However, the first ambiguous target velocity Vy can be seen to be at the corresponding direction u(v^) = 0.42°, resulting in vay = 0.007vp = 200km/h, which might, at least for ground moving vehicles, only create problems in rare circumstances such as fast traffic on a German autobahn. If, in contrast, the signals are transmitted from the entire aperture of 15 m and received on both halves, the effective two-way antenna beamwidth decreases to 03 ^B = 0.27° (the phase centre separation remains the same)
and the ambiguity notches are (although still existing) less pronounced. However, the clutter-free Doppler zone significantly increases (target detection is straightforward in this area) because of the narrower clutter bandwidth caused by the smaller antenna beamwidth.
6.3.2 Parameter estimation The entire slow-time range of the scene under consideration can be divided into segments of equal duration which are individually transformed in the Doppler domain. The detection and parameter estimation is then done in the Doppler range domain separately for each segment, for instance, by comparing the clutter-suppressed pixels with a predefined CFAR threshold. The duration (length) of the time segments can be chosen as the maximum time stationary targets stay in one Doppler cell. Under the assumption of statistical independency between the Doppler bins, the clutter suppression can be done individually for each frequency bin. For a side-looking SAR, the sample covariance matrix can be averaged over the entire range dimension, usually providing a very low variance of the estimate. Applying the inverse (or eigenvector projection) to the data in this Doppler bin means clutter suppression (with a performance specified by the STC of the moving array), leaving only the moving targets in the data. Having the Doppler frequency ft and the slant-range position Rt determined at this stage, the only remaining unknown parameter is the azimuth location xt or equivalently the target direction ut = xt/Rt. For a multichannel SAR system with N > 2, Ender [42] has proposed a very elegant way to estimate that direction. He used the measured array manifold (estimated via the first eigenvector of the sample covariance matrix) to yield high resolution azimuth spectra for DoA estimation. Unfortunately, this approach cannot be applied to a two-channel system like RADARSAT-2 which has only one spatial degree of freedom (DoF). This DoF is spent suppressing the clutter and hence there is none left to retrieve the direction information. One possible way to overcome this dilemma is the use of several space-time (or frequency) samples to increase the dimensionality. For instance, taking M subsequent time segments, i.e. time segments of same length but staggered by 1 , . . . , M PRIs, transformed into the Fourier domain will all cover exactly the same Doppler frequencies.1 For each Doppler bin the M, Af-element vectors are concatenated to form the NM-dimensional space-time vector X(&>) = [Xi (&>),... ,XM(CO)]T. The estimation of the corresponding space-time covariance matrix is done analogously along the entire range direction. As a main difference, the clutter will not be compressed into one large eigenvalue but spread over N + /3(M — 1) eigenvalues, where /3 denotes the number of half interelement spacing traversed by the platform during one PRI [25,26,37]. For example, if we chose M = 3, and recalling N = 2 and p = 0.52 (PRF = 3.8 kHz) for RADARSAT-2, the clutter rank will be roughly three, i.e. another three DoF would theoretically be available to estimate 1
In classical terminology for airborne MTI this technique is known as PRI-staggered STAP
the target direction. One disadvantage is that many slower targets (which are close to the clutter subspace) will have significant power distributions in the second and third eigenvalues. Hence, adapting and suppressing two or three eigenvalues would also suppress target energy. In other words, there is a trade off between improved DoA estimation and reduced detectability. However, as a compromise the detection could preferably be done without staggering (keeping also the computational load smaller) and the enlarged dimensionality only be used to estimate the location of the target. A drawback of this approach is the increased computational complexity due to the order of the space-time covariance matrix. The inversion or decomposition of such matrices is numerically very intensive. However, many fast methods which avoid these intensive operations but possess almost optimum fidelity are known from classical radar array applications such as jammer suppression or superresolution [26,37,43].
6.4
Conclusions
RADARSAT-2 is a two-aperture SAR interferometer. When used for GMTI measurements, RADARSAT-2 will use beams in the 40° to 50° incidence angle range to maximise the radial velocity component of vehicle motion. The airborne experimental SAR reported here was designed to replicate the RADARSAT-2 GMTI mode resolution and observation geometry as closely as possible and to test data processing algorithms that will be migrated to the RADARSAT-2 GMTI processor. The greatest difference between the airborne and space-based SAR/GMTI capabilities arises from the relationship between the platform velocity and the along-track velocities of moving targets. In the airborne case, the target speeds are a significant fraction of the radar speed and reasonably accurate azimuthal target speed estimates can be made. This is not true for space-based radars. The RADARSAT-2-GMTI processor, which is currently under development, will likely be composed of all presented techniques, i.e. ATI, DPCA and STAR Each technique has strengths and weaknesses. For instance, ATI and DPCA are computationally simple and robust but, because they are based on the processed SAR images, are strongly dependent on the stationary world processing fidelity. On the other hand STAP, working in the raw data domain, circumvents the processing problem, but is computationally much more demanding, particularly for SAR where often several thousands of pulses are used to form the image. Another essential criterion is the robustness against varying degrees of heterogeneity of the underlying terrain in order to provide similar GMTI performance even in challenging environments such as sea surface or urban areas. Feeding the received echoes into parallel GMTI processing chains will result in an estimated target parameter pool, where the redundant information can be used to reject doubtful hits and enhance stable detections and estimates. Last but not least, RADARSAT-2 Modex is designed as an experimental rather than operational/commercial mode with the goal of identifying the potential and weaknesses of single-pass spaceborne SAR-GMTI.
6.5
List of symbols
a b ft d Di K k n N PCi Pn Pri Ps \/r q Q Q Qs pe-i9 R(t) R s S ft u va Vp vr frad jco x X £ y
direction of arrival (DoA) vector beamformer weights wave number sensor spacing horizontal sensor antenna pattern signal-to-clutter-plus-noise ratio wavelength number of independent samples (looks) number of sensors (N = 2 for RAD ARS AT-2) clutter power at /th sensor noise power received echo power at ith sensor signal power interferometric phase clutter-plus-noise vector, clutter-plus-noise covariance matrix estimated clutter-plus-noise covariance matrix signal covariance matrix complex correlation coefficient between sensors range signal-plus-clutter-plus-noise covariance matrix signal vector signal spectrum vector Doppler phase directional cosine along-track component of target velocity platform velocity across-track (range) component of target velocity radial target velocity along-track target location at t = 0 received echo vector spectrum of received echo vector signal parameter vector beamformer output, output signal
References 1 STONE, M. L. and INCE, W. J.: 'Air-to-ground MTI radar using a displaced phase center phased array'. Proceedings of IEEE international Radar conference, 1980, pp. 225-230 2 BROADBENT, S.: 'Joint-stars: force multiplier for Europe', Janes Defense Weekly, April 1987, (19), pp. 729-731
3 COVAULT, C.: 'Joint-stars patrols Bosnia', Aviation Week and Space Technology, February 1996, (9), pp. 4 4 ^ 9 4 TSANDOULAS, G.: 'Space based radar', ScL, 1987, (237), pp. 257-262 5 BIRD, J. and BRIDGEWATER, A.: 'Performance of space-based radar in the presence of earth clutter', IEE Proc. F, Commun. Radar Signal Process., 1984, 131, (5), pp. 491-501 6 CANTAFIO, L. J.: 'Space-based radar handbook', (Artech House, Dedham MA, 1989) 7 CURRY, G. R.: 'A low-cost space-based radar system concept', IEEE Aerosp. Electron. Syst. Mag, September 1996, pp. 21-24 8 Canadian Space Agency and MacDonald Detwiler & Associates: 'RADARSAT-2 programme', http://www.space.gc.ca/radarsat-2, http://radarsat.mda.ca, 2002 9 THOMPSON, A. A. and LIVINGSTONE, C. E.: 'Moving target performance for RADARSAT-2'. Proceedings of IGARSS, July 2000 10 CHIU, S. and LIVINGSTONE, C : 'A simulation study of RADARSAT-2 GMTI performance'. Proceedings of IGARSS, Sydney, Australia, 2001 11 LIVINGSTONE, C , SIKANETA, L, GIERULL, C. H., CHIU, S., BEAUDOIN, A., CAMPBELL, J., BEAUDOIN, J., GONG, S., and KNIGHT, T.A.: 'An airborne SAR experiment to support RADARSAT-2 GMTI', Can. J. Remote Sens., December 2002, 28, (6), pp. 1-20 12 GIERULL, C. H. and SIKANETA, I. C : 'Raw data based two-aperture SAR ground moving target indication'. Proceedings of IGARSS, Toulouse, France, 2003 13 ENDER, J. H. G.:' The airborne experimental multi-channel SAR system AER-II'. Proceedings of EUSAR '96 conference, Konigswinter, Germany, 1996, pp. 49-52 14 ENDER, J. H. G.: 'Space-time processing for multichannel synthetic aperture radar', Electron. Commun. Eng. J., February 1999, pp. 29-38 15 BARBAROSSA, S.: 'Detection and estimation of moving objects with synthetic aperture radar; part 1: optimal detection and parameter estimation theory', IEE Proc. Ff Radar Signal Process., 1992,1, (139), pp. 79-88 16 SUN, H., SU, W., GU, H., LIU, G., and NI, J.: 'Performance analysis of several clutter cancellation techniques by multi-channel SAR'. Proceedings of EUSAR conference, Munich, Germany, 2000, pp. 549-552 17 YADIN, E.: 'Evaluation of noise and clutter induced relocation errors in SAR-MTI'. Proceedings of IEEE international Radar conference, 1995, pp. 650-655 18 YADIN, E.: 'A performance evaluation model for a two port interferometer SAR-MTI'. Proceedings of IEEE national Radar conference, 1996, pp. 261-266 19 STOCKBURGER, E. F. and HELD, D. N.: 'Interferometric moving ground target imaging'. Proceedings of IEEE international Radar conference, 1995, pp. 438^43 20 SIKANETA, I. C , GIERULL, C. H., and CHOUINARD, J. -Y: 'Metrics for SAR-GMTI based on eigen-decomposition of the sample covariance matrix'. Proceedings of international Radar 2003, Adelaide, South Australia, 2003
21 FRANCESCHETTI, G. and LANARI, R.: 'Synthetic aperture radar processing' (CRC Press, 1999) 22 RANEY, R. K.: 'Synthetic aperture imaging radar and moving targets'. IEEE Trans. Aerosp. Electron. Syst., 1971, 7, (3), pp. 499-505 23 FREEMAN, A.: 'Simple MTI using synthetic aperture radar'. Proceedings of IGARSS, 1984, pp. SP-215 24 ENDER, J. H. G.: 'MTI-SARprocessing'. Carl-Cranz-Gesellschaft, course notes SE 2.06 on SAR-principles and applications, 1997 25 WARD, J.:'Space-time adaptive processing for airborne radar'. Technical report TR-1015, MIT Lincoln Laboratory, December 1994 26 KLEMM, R.:' Space-time adaptive processing' (IEE Publishing, Stevenage, UK, 1998) 27 GIERULL, C H . : ' Statistical analysis of multilook SAR interferograms for CFAR detection of ground moving targets: IEEE Trans. Geosci. Remote Sens., April 2004,42, (4), pp. 691-701 28 CONTE, E., LONGO, M., and LOPS, M.: 'Modelling and simulation of nonRayleigh radar clutter'. IEE Proc. F, Radar Sonar Navig., April 1991, 138, (2), pp. 121-130 29 GOODMAN, N. R.: 'Statistical analysis based on a certain multivariate complex Gaussian distribution (an introduction)', Ann. Math. Stat, 1963 (152), pp. 152-180 30 JOUGHIN, I. R. and WINEBRENNER, D. P.: 'Effective number of looks for a multilook interferometric phase distribution'. Proceedings of IGARSS9 Pasadena, CA, 1994, pp. 2276-2278 31 GIERULL, C. H. and SIKANETA, L: 'Estimating the effective number of looks in interferometric SAR data', IEEE Trans. Geosci. Remote Sens., August 2002, 40, (8), pp. 1733-1742 32 GIERULL, C. H.: 'Moving target detection with along-track SAR interferometry - a theoretical analysis'. Technical report TR 2002-084, Defence R&D Canada, Ottawa, Canada, 2002 33 ANDERSON, T. W. and GIRSHICK, M. A.: 'Some extensions of the Wishart distribution', Annals of Mathematical Statistics, December 1944, 15, (4), pp. 345-357 34 REED, L. S., MALLETT, J. D., and BRENNAN, L. E.: 'Rapid convergence rate in adaptive arrays', IEEE Trans. Aerosp. Electron. Syst, 1974, AES-10, (6), pp. 853-863 35 CHEREMISIN, O. P.: 'Efficiency of adaptive algorithms with regulized sample covariance matrix (in Russian)', Radiotechnology and Electronics (Russia), 1982, 2, (10), pp. 1933-1941 36 KREYENKAMP, O. and KLEMM, R.: 'Doppler compensation in forwardlooking STAP radar', IEE Proc. F, Radar Sonar Navig., 2001, 148, (5), pp. 253-258 37 GIERULL, C. H. and BALAJI, B.: 'Minimal sample support space-time adaptive processing with fast subspace techniques', IEE Proc. F, Radar Sonar Navig., October 2002,149, (5), pp. 209-220
38 NICKEL, U.: 'On the application of subspace methods for small sample size', AEUInt. J. Electron. Commun., 1997, 51, (6), pp. 279-289 39 LEE, J.-S., HOPPEL, K. W., MANGO, S. A., and MILLER, A. R.: 'Intensity and phase statistics of multilook polarimetric and interferometric SAR imagery', IEEE Trans. Geosci. Remote Sens., September 1994, 32, (5), pp. 1017-1028 40 JOUGHIN, I. R., WINEBRENNER, D. R, and PERCIVAL, D. B.: 'Probability density functions for multilook polarimetric signatures', IEEE Trans. Geosci. Remote Sens., May 1994, 32, (3), pp. 562-574 41 SIKANETA,I. C. andGIERULL, C. H.: 'Parameterestimationforphase statistics in interferometric SAR'. Proceedings of IGARSS, Toronto, ON, Canada, 2002 42 ENDER, J. H. G.:' Signal processing for multi-channel SAR applied to the experimental SAR system AER'. Proceedings of international Radar conference, Paris, 1994 43 WIRTH, W.: 'Radar techniques using array antennas' (IEE Publishing, Stevenage,UK,2001)
Chapter 7
STAP simulation and processing for spaceborne radar Tim J. Nohara and Peter Weber
7.1
Introduction
Spaceborne radar (SBR) has been proposed for various military and civilian applications [I]. Military applications include wide-area surveillance (WAS), theatre defence and disarmament functions. Civilian ones include remote sensing, air-traffic control, space exploration and law enforcement. The 1970s saw the first spaceborne synthetic aperture radars (SAR) with GEOS-C launched in 1975 and SEASAT in 1978 [2]. These provided remote sensing of the earth, including data such as high-resolution topography and ocean dynamics. Spaceborne SAR has matured and today includes RADARSAT-2, which is due to launch in 2005. SBR for WAS and theatre defence, however, is still in the experimental domain. In the 1980s and early 1990s, research focused on WAS and airborne moving target indication (AMTI) designs [ 1,3-6]. More recently, focus has shifted to theatre defence applications where ground moving target indication (GMTI), and combined SARGMTI modes are of interest [7]. MTI (i.e. GMTI and AMTI) radars exploit space-time adaptive processing (STAP) techniques to detect targets that would otherwise be buried in clutter. In Canada, an experimental GMTI mode is being developed for RADARSAT-2. This programme, along with earlier programmes such as the United States' Discoverer-II programme, are intended to provide much needed design and performance data, which are necessary to take spaceborne MTI and SAR-MTI radars from experimental to operational systems in the next ten years. To help mitigate the exorbitant cost of fielding and testing spaceborne radar prototypes, engineers have come to appreciate the key role that simulation technologies can play in the development process. Recently, many texts have been published that are dedicated to simulating electronic systems such as radars [8-10], and there are now organisations dedicated to promoting the use of simulation technologies through
the development of standards and other support [H]. In addition, there are technical conferences focused entirely on modelling and simulation [12]. Today, good mathematical models are available to represent the entire spaceborne radar, along with sufficient, general-purpose computing power to implement and exercise these models in reasonable times. As a result, spaceborne MTI radar designs can be accurately modelled, their performance evaluated and trade-offs can be carried out over a wide range of design parameters and operating conditions, all before a prototype is built and launched for experimental validation. The principal objective of this chapter is to discuss the design of computer simulation tools suitable for modelling and evaluating the performance of spaceborne MTI radars employing STAP techniques. This objective is met by first reviewing spaceborne MTI radar applications and radar design. This is followed by a review of the STAP techniques typically considered for spaceborne MTI radar. With this background, the design of spaceborne radar (SBR) simulation tools is examined in detail.
7.2
Spaceborne radar applications and design
Reviews of both spaceborne radar MTI applications and typical MTI radar designs are presented. These allow the extraction of modelling and processing requirements for computer simulation tools that can be used for design trade-offs and performance assessments of an SBR.
7.2.1
Spaceborne MTI radar applications
Two key advantages of SBR are a greatly increased field of view (FoV), and global coverage without the political, strategic or geographic issues associated with surfacebased and airborne radars. The portion of the earth visible to a spaceborne platform is referred to as the FoV. As much as one third of the earth's surface can be within the FoV at one time. As a result, large search rates can be achieved and a constellation of SBRs can be designed to meet coverage requirements. WAS is used to protect nations with large geographical areas from threats such as ICBMs or enemy aircraft. Such nations would create search fences surrounding their territories wherein threats are detected. The fence acts as a tripwire when crossed. SBR would detect the intrusion, and report the location to friendly forces to respond. WAS requires radars with airborne moving target indication (AMTI) capability. Theatre defence is used when conflict occurs in some region, and timely intelligence concerning ground-troop movement is needed. During the Persian Gulf War (1990-91), the Joint-STARS and AWACS airborne radars were able to provide long-range air-to-ground and air-to-air surveillance. Such intelligence is only available while assets are deployed in the region. An SBR constellation can provide continuous surveillance in several regions, and can also be quickly deployed to new trouble spots. Theatre defence requires radars with ground moving target indication (GMTI) capability. Spaceborne MTI radar applications are illustrated in Figure 7.1. A radar waveform is transmitted from a transmit antenna whose beam is steered towards the desired
transmit waveform characteristics
receiver front-end SBR motion
external noise antenna characteristics: • main • auxiliaries
ionospheric effects Faraday rotation scintillation attenuation
tropospheric effects: volumetric clutter attenuation refraction
target characteristics
jammers
ianu
sea discrete earth's rotation
Figure 7.1 Spaceborne MTI radar applications footprint on the ground. The signal interacts with the ionosphere and is affected to some degree by Faraday rotation, scintillation and attenuation. The signal then interacts with the troposphere and is affected by volumetric clutter (e.g. rain), further attenuation and refraction. In an AMTI application, the signal of interest reflects off airborne targets, and in a GMTI application, the signal of interest reflects off ground targets, before propagating back towards the receive antennas. Only a single receive antenna is shown, but in practice at least two are needed for MTI operation using STAP techniques. Auxiliary antennas may be available for other functions such as electronic counter counter measures (ECCM), to deal with sidelobe jammers, for example. The receive antennas have moved with respect to the transmit location due to the high speed of the orbiting satellite (typically several km/s) and the long distance to the ground. The clutter signals reflected off the earth's surface and the signals transmitted by jammers towards the SBR are interference signals, which must be suppressed by the radar signal processor, so that the desired targets can be reliably detected. The large FoV is evident in the Figure.
7.2.2
Spaceborne MTI radar design
Several radar and system elements are designed and traded-off in order to meet particular mission requirements. The SBR orbit altitude and inclination are two key design considerations. Circular orbits are preferred for SBR because of the uniform coverage
iiilllillilii iiliiiliilili
||||li||p;||;| iiiiiiiiii
nw
Figure 7.2
|||||JjJ|||i||
SH||:iipi|j||||ii
iiwilliiililiil l^^llllHI ;||||||||||||||||||
Electronic steering geometries for SBR
provided and because an elliptic orbit has large variations in power-aperture requirements. Circular orbits also minimise the constellation size for global coverage. For weight and cost reasons, altitude is constrained by the Van-Allen belts. Higher radiation levels need more (heavier) shielding to protect onboard electronics and thus have higher launch costs. Altitudes of interest are thus limited to approximately 2800 km. Spacecraft at higher altitudes provide a larger field of view, and require fewer satellites for efficient global coverage. Below 800 km, the number of satellites needed to provide a given level of global coverage increases rapidly. However, coverage of limited regions can be accommodated at lower altitudes. Lower altitude SBRs are cheaper because of reduced power-aperture and shielding requirements. For a given altitude, the orbit inclination determines the geographical coverage provided. An equatorial orbit will only cover regions within a band on either side of the equator; a highly inclined orbit will cover all of the earth's surface. Antenna design is one of the key elements that influences radar performance. Whereas rotating antennas have been proposed for some MTI radars, electronicallysteerable antennas are preferred (see Figure 7.2 for illustrations of the features described below). Conventional SAR systems (e.g. RADARSAT-I) usually only provide elevation beam steering. With this type of antenna, stripmap SAR imaging can be performed. The antenna beams can be electronically scanned in elevation to move the beam inward or outward from the satellite track. However, azimuth scanning, which would allow the beams to move (almost) instantly forward and backward, is not supported. Combined azimuth and elevation scanning is needed for MTI SBRs and represents a significant engineering challenge. In surveillance applications, scanning a specified volume, where beams are steered and tiled on the ground to interrogate the specified region, is often required (rather than a stripmap). Certain look directions may be revisited for confirmation or tracking dwells. Information from other sensors may be used to cue the radar to look at a given area at a certain time. If the dwell time
for a given look is long enough, the antenna may require spotlighting, which involves steering the antenna during the dwell so that its boresight remains fixed on the area of interest. All of these scanning features require electronic steering to work well. Active phased-array antennas provide the ability to quickly look to arbitrary directions, and to reconfigure the aperture for STAP operation (i.e. forming multiple simultaneous subapertures on receive). Usually the array is aligned with its face either down (nadir-pointing) or inclined to the side (side-looking) and with its longer dimension parallel to the flight direction. In this case, phase shifts that vary across the length of the array cause the beam to steer forward and backwards (i.e. azimuth steering), while phase shifts varying across the width of the array cause the beam to shift up and down (i.e. elevation steering). In practice, the FoV available for surveillance is between the grazing angles 3° and 50°. The lower limit is due to atmospheric effects such as attenuation, whereas the higher limit is due to stronger clutter returns and lower target Doppler. Furthermore, the small antenna footprint at higher grazing angles makes surveillance inefficient. The circular region bounded by 50° grazing angle is referred to as the nadir hole (in surveillance coverage). WAS applications often favour the lower grazing angles due to the longer footprint and higher search rates achievable, whereas theatre defence applications usually work with the higher grazing angles (lower incidence angles). For SBR, the choice of RF is one of the most complex and important design decisions. Low-frequency systems are generally simpler and cheaper, but some hardware is bulkier and heavier. Higher frequencies offer better angular resolution and thus lower minimum detectable velocity (MDV) (which is important for MTI) for a given aperture size. Low frequencies suffer less atmospheric and rain attenuation, but have more problems with the ionosphere. Clutter CTQ tends to increase with RF. Proposed SBRs have varied from UHF to X-band. A typical WAS AMTI system provides long-range surveillance, where high power is needed, which is cheaper at lower frequencies. Since AMTIMDV requirements are not too stringent, L-band provides a reasonable overall compromise. Theatre defence GMTI systems, on the other hand, require more stringent MDV, which is more easily attainable at X-band. Rain is less problematic because of the higher grazing angles. STAP techniques are needed to provide suitable MDV with SBRs. Consider the mainbeam clutter bandwidth, which is nominally 2vp/L Hz, where vp is the spacecraft velocity in m/s and L is the antenna length in m. Given a typical low-earth orbit satellite speed of 7350 m/s and an antenna length of say 10 m, the mainbeam clutter will occupy a Doppler bandwidth of 1470Hz. This translates to mainbeam clutter velocities spreading ± 110 m/s at L-band and ± 11 m/s at X-band. Sidelobe clutter can easily fill the remaining spectrum as illustrated in Figure 7.3. As a result, subclutter visibility is needed for all small targets, and for large targets that are not fast enough (in radial velocity) to shift in Doppler out of the mainbeam clutter. STAP techniques filter or cancel the clutter in two dimensions (space and time) in order to provide the required MDV for targets of interest. In Reference 6, a simple expression for the MDV achievable is derived, which is given by MDV = 0.23 * Vp * X/L. This expression says that an MDV of 5 m/s (for a GMTI application) can be achieved with a 10 m antenna at X-band using STAR STAP
jammer mainbeam clutter
jammer interference sidelobe clutter SBR antenna pattern
small target large slow target
large fast target
SBR direction of motion Doppler
Figure 7.3
Typical SBR clutter spectrum
requires a radar system with a minimum of two receive antennas, suitably spaced, each with dedicated and well matched receivers.
7.3 7.3.1
STAP processing for SBR Typical GMTI signal processing
A review of signal processing techniques typically considered for spaceborne MTI radars is presented next. In this section, both adaptive and non-adaptive MTI signal processing algorithms will be briefly examined. Although this examination is not exhaustive, it illustrates the kinds of signal processing typically considered. Later in this chapter, signal processing requirements for computer simulation tools will be derived, based on this discussion. Space-time adaptive processing techniques are effective for cancelling clutter viewed from fast-moving platforms. STAP techniques operate adaptively on time samples collected from several spatially distinct receive antennas, in order to suppress unwanted clutter. An MTI filter that operates in only one dimension (time) does reduce the mainbeam clutter power. Unfortunately, with moving platforms, it also annihilates targets. STAP filters that operate in both dimensions (time and space) attenuate mainbeam clutter while not overly harming moving targets. Possible STAP domains are time and aperture (space), time and angle, Doppler and aperture, and Doppler and angle. In all cases, STAP is applied quasi-independently over the range dimension. In the time and aperture domain, adaptive clutter cancellation weights are computed from and applied to the pulsed (slow-time) signals from each aperture. MASR (multiple antenna surveillance radar) DPCA [16] and subCPI STAP
are algorithms in this domain. In the Doppler and aperture domain, pulsed-Doppler processing is performed before STAP, which works on the Doppler signals from each aperture. ASAR (arrested synthetic aperture radar) DPCA [5], PRI-staggered STAP and factored STAP [17] are examples of Doppler aperture domain algorithms. A variety of two-dimensional STAP processors that operate simultaneously in time and space are described in Reference 18. Non-adaptive space-time processing can also be applied successfully, using the displaced phase centre antenna (DPCA) technique. The DPCA technique is a form of STAP that works well with only two receive apertures. The whole antenna aperture is used on transmit. On receive, the aperture is divided into halves, each of which feeds its own receiver. The DPCA condition assumes that the phase-centre separation between the two is twice the distance that the platform moves in one PRI. If the condition holds, then on any pulse, the ranges to clutter scatterers for the leading aperture are the same as those for the trailing aperture one pulse later. Therefore, clutter can be cancelled by a conventional (non-adaptive) MTI filter operating on the pulse streams from the two subapertures (shifted by one pulse with respect to each other). Since a target moves over the PRI, its range changes, and it is not cancelled by the filter. The DPCA technique works for clutter from any look direction, and can be generalised to three or more receive apertures. Adaptive cancellation can also be performed effectively with DPCA. A candidate signal processing baseline suitable for GMTI operation is shown in Figure 7.4. If DPCA operation is desired, then the delays shown must correspond to one pulse repetition interval (T) or a multiple thereof. If an algorithm such as PRI-staggered STAP is employed, then a few delays (not shown) would have to be provided for each antenna channel. If a factored STAP algorithm is employed, then the delays are not needed. A minimum of two antenna channels is needed for MTI operation; three channels are shown in Figure 7.4, which allows monopulse estimation to be carried out concurrently with MTI operation. Pulse Doppler (PD) and pulse compression (PC) operations are performed on each channel; these provide the required coherent integration against noise. (Often, PC precedes PD; however, the ordering shown in Figure 7.4 supports the use of non-Doppler tolerant waveforms.) Each STAP filter combines its set of input channels into a single clutter-cancelled signal. The signals from multiple bursts are integrated noncoherently by the NCI function. The detection function thresholds the data and assembles a detection list, which includes position and velocity information, to complete the processing. Adaptive nulling (ECCM) is also shown in Figure 7.4. This function suppresses sidelobe and mainbeam jammers. Sidelobe nulling uses signals from the auxiliary antennas to cancel the jammer signals in the main antenna channels. To illustrate typical processing steps associated with the STAP filtering, the PRIstaggered STAP technique is described in some detail. For each antenna channel, PRI-staggered STAP uses the signals from a number of temporal taps. All temporal taps consist of the same number of pulses, but they differ in their starting pulse. For example, tap 1 's signal consists of pulses 1 to ND, while tap 2 has pulses 2 to No -f-1. A particular tap from a particular antenna is referred to as a space-time channel,
Bogpfef
STAP
auxiliaries
Figure 7.4
Typical MTIprocessor architecture
and corresponds to an adaptive degree of freedom in the STAP algorithm. With PRIstaggered STAP, the number of degrees of freedom is KN, where K is the number of taps and N is the number of channels. The total number of pulses range-compressed is M = No + K — 1, in order to handle all of the taps. Each space-time channel is Doppler processed after range compression. The input to the STAP processing is ^Vrange-Doppler arrays, each with NR range gates and ND Doppler bins, or NRNO resolution cells. For each range-Doppler cell, which has a length KN vector of samples spanning the space-time channels, STAP processing is applied. This involves collecting the vector of samples from the given cell (x) and a size KN by Ns matrix of snapshot vectors from neighbouring cells (X). The snapshot vectors are from the same Doppler bin as the current cell, at ranges surrounding it. The snapshot region specifications (Ns, guard region) are algorithm design parameters. Outer products of the snapshot vectors are averaged to form a covariance matrix:
which is an estimate of the clutter covariance across the space-time channels. R is then inverted. Two options are available for forming the weight vector used to cancel clutter: STAP without steering vectors and STAP with steering vectors. STAP without steering vectors forms the product xHR~ 1X, which cancels the clutter while taking the power of the target plus residual noise. The product, arranged as a range-Doppler array, is the STAP output signal, and is the input to the CFAR detection processing. Optimal detection matches the received vector to all possible steering vectors, and then picks the best one. A steering vector is the response across the space-time channels to an ideal target with a specified position and velocity. To generate the STAP output signal in each range-Doppler cell, the statistic xHR~ls is computed for all steering angles, and the one with maximum amplitude is kept. This is maximum likelihood (ML) processing. The operation, like STAP without steering vectors, uses R~l to cancel the clutter components in x. The advantage with steering is in the suppression of the noise components in x by matching with s. The steering vectors may be scaled by (sHR~ls)~1/2 as described in Reference 13; this is required for proper angle estimation.
7.3.2 Extension to other modes It is unlikely that an SBR would only provide a single mode of operation such as GMTI. Other modes, such as AMTI, pulse Doppler (PD), SAR and SAR-MTI, would also be considered. Trade-offs concerning the performance of any given mode against the other modes is necessary. Therefore, simulation tools used to assist in these trade-offs would need to support a variety of modes. AMTI operation, for example, requires similar processing to that shown in Figure 7.4, but would typically use a waveform optimised for faster targets and greater search rates (i.e. lower resolution). Additional processing considerations, such as range ambiguity removal, would be required. The pulse Doppler mode is suitable for fast targets that are clear of clutter, as well as for the detection of very large targets such as ships. Only a single antenna channel is needed. PC and PD operations are performed, followed by NCI and detection. SAR operation is similar to pulse Doppler in that a single antenna channel is required. PC and PD operations are replaced by an SAR algorithm, which includes range processing similar to PC, range cell migration correction to account for range walk due to the longer dwells and azimuth processing to form the synthetic apertures. NCI is replaced with multilook processing to reduce image speckle, and automated detection, if present, is usually image based. SAR-MTI operation combines features of MTI and SAR to allow moving targets to be detected and overlaid onto an SAR image. At least two channels are needed to support the MTI operation. In a conceptually simple case, SAR processing is performed first on each channel, followed by MTI processing and automated detection. Delay taps can be used as illustrated in Figure 7.4 to implement a DPCA condition, or to support MTI algorithms such as PRI-staggered STAR
7.3.3 Other issues There are a few system and environment issues that impact STAP performance and hence require special modelling considerations. The following issues are discussed below before leaving this section: (i) (ii) (iii) (iv)
effect effect effect effect
of internal motion of random sidelobes of the earth's rotation of jammers and rain.
Clutter internal motion is an inherent characteristic of clutter that fundamentally limits the ability of STAP algorithms to cancel clutter. Sea scatterers are moved about by sea swell, waves and wind, and sea clutter can have a spectral spread of the order of 1 m/s. Land clutter (e.g. vegetation and trees) vibrates due to the wind to a lesser degree, and can have a spectral width of the order of 0.1 m/s. This random motion introduces random phase shifts that limit the amount of cancellation otherwise achievable. Other system elements also impact cancellation (e.g. how well matched and calibrated the antennas and receivers are). Therefore, clutter internal motion effects should be considered when evaluating STAP performance. As alluded to above, the level of matching associated with the receive antennas impacts STAP performance. Consider an active phased array that is an ideal candidate for an SBR antenna. The beam pattern is formed by summing the element responses, suitably weighted and phase shifted. Due to imperfections in manufacturing, there are random variations in the spacing and gains associated with the elements. These variations have two effects. First, they result in the presence of random sidelobes in the beam pattern of a given antenna. Second, the patterns (main beam and sidelobes) of different antennas will be different. As a result, perfect clutter cancellation will not be possible due to these imperfections. In order to mitigate STAP performance degradations that would otherwise result due to random sidelobes, STAP algorithms adaptively compute separate weights for each Doppler bin, which has the effect of making piecewise gain corrections to the antenna patterns. Random sidelobe effects should be considered when evaluating SBR designs or trading-off STAP performance. Since an SBR orbits around it, the earth's rotation imparts different radial velocities on clutter scatterers, depending on their look directions to the radar. As a result, earth's rotation causes clutter spectral spreading in addition to that caused by the motion of the radar platform. To mitigate the effects of earth's rotation, the radar antenna can be mechanically slewed so that the receive antennas align with a certain vector: the sum of the satellite orbital motion vector and the earth's rotational vector (which varies with latitude). Alternatively, if a programmable active phased array is used, receive antennas can be properly aligned electrically, by controlling their aperture shading functions. Earth's rotation and slewing (mechanically or electrically) should be taken into account when modelling and evaluating the performance of STAP radars. The presence ofjammers or rain has the effect of requiring more adaptive degrees of freedom to maintain the same STAP performance. In the case of rain, this is because
the total clutter spectrum becomes more complex in the Doppler dimension. Rather than a simple notch, a more elaborate filter response is needed. Thus extra temporal degrees of freedom (e.g. more taps from each subaperture) may be needed to mitigate rain clutter. Dealing with jammers requires spatial diversity in the radar antennas, so that angular nulls (regions of low gain) can be steered toward the jammer. By varying multiplicative weights on the antenna signals before summing their signals, interferometric nulls can be moved to arbitrary directions. Differing strategies are needed for dealing with either mainbeam or sidelobe jammers. Because the gain towards a sidelobe jammer is low, the interferometer can be set up with a low gain auxiliary antenna and the main aperture. Main beam jammers are more problematic. Nulling them requires similar procedures as for clutter cancellation, namely dividing the main aperture into a small number of large subapertures. Then the interferometer is set up with apertures having similar gains towards the jammer. Combining clutter cancellation with main beam jammer nulling requires more spatio-temporal degrees of freedom than for either individually. It also requires a STAP algorithm designed to handle both forms of interferes
7.4
Simulation and processing for SBR
SBR simulators are used to estimate the performance of space-based radar systems and their signal processing algorithms. They are also used to design space-based radar systems by trading-off and optimising design parameters to achieve a specified performance. In the previous sections, it is seen that GMTI radar detection concepts rely on the cancellation of strong clutter signals, in order to allow the detection of weak target returns. The clutter signals that are combined for cancellation by the STAP filter originate from different apertures at different times. To draw meaningful conclusions, it is imperative that proper correlation (in space and time) of both clutter and target signals be modelled. It is also important to model the effects of real hardware because, as discussed earlier, it impacts radar performance. A simulator must properly model platform motion, range and Doppler ambiguities, clutter internal motion, antenna patterns (differing between apertures) and earth's rotation. It should support a myriad of design choices, with selectable radar, antenna, platform and signal processing parameters so that different radar designs can be traded-off or optimised. It should also be able to simulate arbitrary scenarios, with selectable targets, clutter, and jamming so that particular applications or scenarios of interest can be evaluated. All of these requirements point to a baseband signal simulation, where the radar return signal is generated as samples in range for each pulse. The simulator models the full, expanded, transmit pulse, and then implements pulse compression, rather than modelling an effective compressed pulse. This approach results in greater fidelity and also provides the mechanism for efficiently implementing receiver response mismatch. Also, by starting off with an expanded pulse, imperfections in pulse compression operation can be embedded naturally. The full signal path must be modelled, including the environment, the antennas, the radar analogue and digital parts (for both
transmit and receive) and the signal processor. All of the effects in Figure 7.1 (motion, atmosphere, large FoV etc.) must be dealt with. As return signals are convolutions of the transmitted pulse with each scatterer's response function, range equation terms are needed for each scatterer (gains, range, Doppler, RCS) so that its sampled return signal can be generated. The difficulty with baseband signal simulation is in generating the returns from the many scatterers contributing to the return signal. Although the mathematics for computing their returns is (relatively) straightforward, the sheer volume of computations can easily render a simulator unable to generate the signals in an acceptable time frame. Clutter generation is the single most computationally intensive operation, and care must be taken to optimise its speed and fidelity. Efficiency in generation is a key requirement in radar simulator design. A well designed graphical user interface (GUI) is important to the user, who is the person using the tool to carry out design studies or performance evaluations. It eases the entry of parameters that describe the radar system and scenario, and the subsequent running of simulation experiments. The GUI also permits the quick derivation and cataloguing of results from the experiments, including the plotting of images and curves. The user should be able to specify the design of a radar system and scenario of interest, generate the corresponding complex baseband signals, and then process them with algorithms and parameters that he/she selects. In this section, the aforementioned high-level simulator requirements are broken down into a set of design elements, which describe a candidate baseband signal simulator of space-based radars. Quantities that are described as being modelled, computed, converted, transformed etc. are operations internal to the simulator code. The presentation below organises the design elements into functional areas, beginning with the GUI and followed by discussion of the models needed for the radar, the environment, the baseband signal generation, the GMTI signal processing, and tools needed for evaluation of the results. Radarsim™ SBR [15] is a space-based radar simulator codeveloped by the authors that satisfies the requirements described herein. Selected screen-captures taken from this tool will be used to illustrate design concepts, where appropriate.
7.4.1
User interface
For convenient use of a simulator, a properly designed graphical user interface (GUI) is important, since radar systems and engagement scenarios have so many variables. GUI designs that intuitively manage related groups of parameters make the user's job easier. Below is sampling of logical parameter groupings needed to specify a typical space-based radar, followed by a description of how the environment might be conveniently defined via a GUI. 7.4.1.1 Simulation parameters Reference to the spaceborne radar section yields several logical groupings of radar system parameters that need to be specified by the user: orbit, waveform, receiver,
Table 7.1 Orbit parameters Parameter
Typical value
Altitude Inclination Subsatellite point
800 km 80° latitude less than inclination; any longitude north or south
Direction
Table 7.2
Waveform parameters
Parameter
Typical value
Peak power Carrier frequency Expanded pulse width Pulse bandwidth Burst length Nominal PRF Pulse modulation Fill pulse duration PRI compensation? ZRT compensation? Spotlighting?
5 kW 10 GHz 50 |xs 200 MHz 100 ms 2000Hz linear FM 5 ms yes yes yes
antenna, generation and environment parameters. These are summarised in the following series of tables. Circular orbits are preferred for GMTI radars. A parameter set suitable for specifying a circular orbit is shown in Table 7.1. The subsatellite point and direction relate the SBR to the environment scenario. Pulse Doppler waveforms are appropriate for space-based GMTI (and other modes). Such waveforms can be modelled using the parameters shown in Table 7.2. The pulse compression ratio is defined by the product of its expanded width and its bandwidth. Other forms of modulation include phase coding, non-linear FM and none. The fill pulse duration should be long enough to fill the footprint with ambiguous pulses. The effects of non-ideal receivers are important because of their impact on clutter cancellation. The receivers can be modelled according to the noise added, their frequency responses (including differences between channels) and their non-linearities. The IF filter parameters in Table 7.3 allow the nominal response for each channel
Table 7.3 Receiver parameters Parameter
Typical value
Noise temperature Radar system loss IF bandwidth IF filter type and order IF centre frequency Channel to channel mismatch Number of mismatch ripples A/D sampling rate A/D quantisation level Number of A/D bits Phase noise spectrum
10000K 3 dB 200MHz Chebyshev, 8 1500 MHz —40 dB 8 200 MHz — 120 dB m 8 levels (dB c) at a discrete set of frequencies
Table 7.4 Antenna parameters Parameter
Typical value
Receive aperture locations
+0.5 m azimuth +0 m elevation 4 m azimuth by 1 m elevation Taylor —45 dB azimuth; —30dB elevation to place footprint at desired location 30° elevation tilt; auto-yaw for minimum clutter spread — 50 dB 1.5 cm
Aperture sizes Aperture shadings Electronic steering Mechanical positioning Random sidelobe levels Element spacing
to be computed; the mismatch parameters are used to compute each error response. Additional parameters are used to model the A/D and phase noise characteristics. A phased array antenna system, with multiple subapertures on receive, is a part of most high-performance GMTI designs. Parameters to describe such a system are listed in Table 7.4. Aperture sizes and shadings are specified for the transmit and all receive apertures. Locations are of the given receive aperture's centre, relative to the transmit aperture centre. Random sidelobes are due to aperture errors, whose level can be derived from the entered sidelobe level. An auto-yaw capability is provided, which sets the positioning to minimise the effects of earth's rotation on clutter.
Table 7.5 Generation parameters Parameter
Typical value
Noise seed Clutter seed Aperture error seed Jammer seed
large large large large
integer integer integer integer
Table 7.6 Clutter patch parameters Parameter
Typical value
Position Size Scatterer spacing Backscatter statistics a0 Spectral width Mean radial velocity Height Rain rate
within range swath and main beam 5 km by 5 km 1 m range; 10 m cross-range log-normal, with 10 dB spread -20dBm 2 /m 2 0.1 m/s 2 m/s 3 km 5 mm/hr
Random number seeds (such as those in Table 7.5) for all statistical processes modelled should be user-set parameters, so that simulations can be repeated, and so Monte-Carlo experiments can be performed. Modelling realistic clutter returns is required in order to properly assess the target detection capability of SBR. Parameters to describe clutter patches and rain cells are listed in Table 7.6. (Land clutter should have zero mean radial velocity.) Height and rain rate are only appropriate for rain cells. Rain cells are modelled as sources of clutter as well as in terms of the attenuation they impart on propagating signals. Typical target and jammer parameters of interest are shown in Tables 7.7 and 7.8. These parameters are most easily obtained from the user using a dialogue window. Figure 7.5 shows one such window for some of the receive aperture parameters. 7.4.1.2 Environment GUI Although dialogue windows are good for obtaining parameters such as those discussed in Tables 7.1 to 7.8, they are not well suited to placement of targets and clutter.
Table 7.7 Target parameters Parameter
Typical value
Position Velocity Mean RCS Spectral width
within range swath and main beam 10 m/s; heading towards radar 10 m 2 0.1 m/s
Table 7.8 Jammer parameters Parameter
Typical value
Position Velocity ERP Centre frequency Bandwidth Type Modulation Start and end times
anywhere in FoV stationary 500W 9.5 GHz 1 GHz barrage not pulsed extends over duration of radar waveform
Editing parameters for receive subaperture 1 Select subaperture: | Rx aperture 1 Azimuth (X) taper: | taylor Az. sidelobe level (dB): | X width (m): [ X position (m): [
jjj
jjjt| Elevation (Y) taper: j taylor El. sidelobe level (dB): |
jf|
-40
j
T
|
Y width (m): [
-30 T
^3
]
Y position (m): ["""
0
Figure 7.5 Parameter window A graphical display tool for specifying the locations of clutter patches, rain cells, targets and jammers is a better design. It also allows the placement of the antenna beam and the range swath. The display can provide a map of the earth with latitude/longitude grid lines. Zoom and pan capability can also be provided. The display can also have km rulers as a guide to the distortions introduced by the projections and zooming. Objects can be placed in the environment using controls on the side of the display. After selecting the object to place, the mouse can be used to click and place the object
(or to click and drag out the region occupied by a patch). Once placed on the display, objects can be edited by first selecting them, and then raising a window for their parameters. The subsatellite point can be indicated on the display to assist the user in locating objects. Other useful indicators that can be shown on the display include the orbit ground track, a world map and the azimuth, ground range and slant range to any selected object. In addition, isodop and isorange contours (including the horizon) can be drawn. Surface scatterers on a given isorange line (circle) have the same range to the radar. Stationary scatterers on a given isodop line (hyperbola) have the same radial velocity with respect to the radar. Having these contours helps the user to design good test experiments by allowing him/her to easily place objects so that they appear at desired locations within the radar images. Land and sea patches can be indicated in the display by rectangles of a specified size at the indicated centre location. Targets can be indicated by an appropriate symbol placed at the location of the target, with an attached arrow denoting the direction of motion at that location. Figure 7.6 shows a sample environment display (zoomed in) for an example scenario. Targets are denoted by T symbols, and the solid square is a land patch. The range swath is between the dotted lines (the O symbol is the userdefined swath centre). Solid lines are isodops. The dashed curves are the antenna beam contours (the + symbol is the electronic scanned boresight).
environment editing area (axes in degrees) to add objects select object and press define
!attitude, degrees
Land Patch
axes dimensions (km): x-top: 16.3021 x-middle: 16.3954 x-bottom: 16.4886 y: 40.3316 instructions:
longitude, degrees object geometry from SSP (deg/km): azimuth, slant range, ground range zoom: state: 88.196 817.564 415.123
Figure 7.6
Environment display
zoom:
pan:
7.4.2
Model the radar
In the next two subsections (model radar and environment), suitable models for the platform, antenna, radar and environment are described which form the basis of a simulator for SBR applications. Model-related operations computed as initialisations before core generation operations are also discussed. 7.4.2.1 Platform geometry Standard circular orbits are modelled, and Kepler's laws are used to determine the orbital speed, given the altitude. The satellite orbit is computed in its plane given its inclination, altitude and subsatellite point at time zero. Orbiting platforms are first computed in an ECI (earth-centred inertial) frame, within which the earth rotates. Orbit positions are then transformed into the ECR (earth-centred rotating) frame, where the target and clutter scatterers are represented, by applying earth's rotation. The positions of the transmit and receive antennas are computed at time instants spanning the duration of the waveform. The antenna orientation is computed at the same instants; it consists of the three-dimensional rotation matrices required to convert scatterer ECR coordinates to antenna coordinates (which are azimuth (w), elevation (t>) and boresight (w)). Coordinate transformations are a fundamental aspect of the simulation of radar return signals. Since these transformations are executed many times, they are designed and implemented as efficiently as possible. The positions of the radar and the environment are both described in the ECR coordinate system, in order to difference them for range and bearing computations. They are then converted to antenna coordinates for gain computations. The conversion between the coordinate systems uses time-varying rotation matrices. The user enters the antenna boresight at the beginning of the waveform. It is entered either as its intersection with the earth's surface or as angles in the satellite FRD (forward-right-down) frame. The antenna is assumed to have the same FRD orientation throughout the waveform. The boresight orientation and the aperture positions are combined to determine the rotation matrices. 7.4.2.2 Antenna patterns The antennas are modelled according to the voltage distribution across their apertures. Aperture shadings are computed on two-dimensional x-y grids. Antenna errors (deviations from ideal in amplitude and phase for each element) lead to increased antenna sidelobes. The shadings include aperture errors that, for active phased arrays, are correlated for overlapping subapertures. (STAP processing needs the formation of multiple simultaneous subapertures on receive.) The errors cause the patterns to differ between otherwise identical subapertures. These pattern differences lead to reduced clutter cancellation (relative to ideally matched patterns) and therefore must be modelled. Antenna patterns (in azimuth and elevation) are computed as Fourier transforms of the shading functions. Electronic steering is modelled by applying phase shifts to the element shadings.
7.4.2.3 Receiver responses The receiver frequency responses are modelled as standard (Chebyshev, Butterworth) filter responses. Real receivers have imperfections in their responses that differ between channels. Channel-to-channel mismatch is introduced by multiplying each channel's nominal response with a different error response. The user specifies the level of error and mismatch. The channel responses are later applied to return signals (as filtering operations). 7.4.2.4 Waveform Pulse Doppler waveforms (identical pulses transmitted at an approximately constant repetition interval) are used for SBR. The pulse parameters (duration, bandwidth, RF, PRF etc.) are user-selectable. If selected, motion compensation (ZRT) attempts to keep a desired spot or track at the middle of the range swath. This is done by first computing the range to the spot/track for each pulse, and then delaying the start of sampling (i.e. zero range) appropriately. PRI compensation (if selected) has the pulses transmitted at constant azimuth separation, rather than at a constant time interval. The radar only samples and processes the signal returned from within a userdefined range swath. The swath width must be less than the PRI. The swath's position on the earth varies with time as the platform moves. The swath has range ambiguities, where returns from ranges within the swath plus or minus an integer number of PRIs are received. Range-ambiguous clutter returns can be more troublesome to cancel, thus modelling them is needed for a true assessment of performance. Fill pulses can be transmitted, to ensure that ambiguities of the swath have the same number of pulses returned to the radar. The transmit and receive times for each pulse are computed. Note that these times must be separated by the two-way delay to the swath. 7.4.2.5 Errors All forms of amplitude and phase mismatch between channels reduce the achievable cancellation of strong clutter signals. These include mismatches between antenna gains as a function of angle, those between receiver channels as a function of frequency and those between the I and Q channels of digitisers. All can be helped (but not eliminated) by various calibration and equalisation schemes. Short-term transmitter stability is also important for cancellation, which operates over different pulses. Each of these forms of error is modelled by applying the appropriate distortions to the received signals.
7.4.3
Model the environment
The environment includes models for targets, clutter, jammers and noise. 7.4.3.1 Positions of scatterers In order to properly model their returns across range and Doppler, distributed clutter patches are described as two-dimensional arrays of scatterers, located at the centres
of nominal resolution cells. A position is computed for each scatterer within a patch. Scatterer positions are laid out on a range/cross-range grid. This allows the scatterer spacing to account for the resolution available in each dimension. Target positions are computed as they move over the duration of the waveform, and are not represented on any grid. The atmospheric and rain attenuation are computed at each scatterer location. 7.4.3.2 Reflectivity of scatterers Land and sea clutter amplitude distributions (over area) are in general non-Rayleigh, especially with high range resolution. The distributions one should model include log-normal, K, Rayleigh and fixed. Consider a log-normally distributed patch as an example. Log-normally distributed variates are generated, and these become the mean powers of the scatterers within the patch. However, each scatterer is temporally Rayleigh amplitude distributed. That is, as time evolves, its amplitude will vary according to a Rayleigh distribution, with a user-specified spectral width. This is a realistic model, since a given ground scatterer is unlikely to be log-normal temporally. The target models can be chosen from the same distribution families, and also include a spectral width for their scintillation. The temporal variation models clutter internal motion; doing so properly is critical to deriving performance in realistic clutter. The scattering time sequences, which are their radar cross sections (RCS) described as a (complex) voltage for each pulse, are computed for each scatterer. This can be done in either of two ways. The first is to generate a random spectral sequence shaped by the amplitude spectrum, and then inverse Fourier transform. The second is to generate a white time sequence and then FIR filter with a response shaped by the correlation. The choice depends on the scatterer bandwidth relative to the PRF. These techniques are described in Mitchell [14].
7.4.3.3 Noise For each channel, a white noise signal is first computed, and then the receiver response is applied. This approach works for receiver channels originating from independent apertures. To implement proper receiver noise modelling, its channelto-channel correlation across overlapping subapertures of phased arrays must be included. This is important because adaptive algorithms can be impeded by this correlation. A selectable noise temperature completes the model. 7.4.3.4 Jammers Jammer generation is performed in a manner similar to noise generation. Extra steps include applying antenna gains, propagation loss (radar equation terms) and jammer modulation. When aperture separations and receiver bandwidths are both large, it is important to model jammer decorrelation between channels, since it leads to degradations in achievable cancellation.
7.4.4
Generate the signals
This part of the simulation generates the return signals from the scatterers. The received signal is the sum of scatterer returns, noise and jamming. It is of dimensions (number of receivers) by (number of pulses) by (number of samples in range swath), typically 2 by 128 by 4 K. The objective of STAP processing is to cancel the strong clutter return that originates from a large number of scatterers. It is thus important that the amplitudes and phases for all contributing range equation terms for each scatterer at each time instant be consistent. Any discontinuities in the models can lead to spurious clutter residue that really does not exist. In particular, the following terms must be simulated with high fidelity: • • • •
range and bearing (as functions of time) from each aperture to each scatterer cross section (as function of time) for each scatterer gain (as function of angle) for each aperture towards each scatterer distortions (as functions of time or frequency) for each channel.
7.4.4.1 Range equation terms On each pulse, the range, azimuth (w), elevation (i>), Doppler, transmit and receive gains, carrier phase and cross section are computed for each scatterer. These are obtained by differencing scatterer and satellite positions, rotating to antenna coordinates, differencing ranges at the two time instants, looking up and interpolating the antenna pattern grids, respectively. Ranges to each aperture are computed using their respective positions at the receive time. Accurate range computations are important to get the correct carrier phase relation between apertures, which is needed to accurately model the achieved cancellation. Doppler is needed for interpolations in the scatterer summation/convolution. Bistatic range computations are necessary for high fidelity modelling of spacebased radars. These account for motion between the transmission and reception of the radar pulses. The algorithm interpolates backwards from receive times to find reflection and then transmit times for each scatterer, and then interpolating the satellite position to the transmit time. For moving targets, their range computations also include interpolation of the target position to the reflection time. The range, transmit power, system loss, RCS, antenna gains etc. are combined together as range equation terms into a (complex) return signal voltage for each aperture. The return from each scatterer now conceptually consists of a delta function at its range delay, weighted with this voltage. The sum of delta functions is next convolved with the transmit pulse. The result of the convolution is the received reflected signal. Depending on whether there was a transmit pulse for a given scatterer's range ambiguity (i.e. if there were enough fill pulses), its returns may be suppressed for the first or last few pulses. 7.4.4.2 Scatterer summation The purpose of the scatterer summation is to generate the complex voltage signal that is the superposition of the pulse returns from all scatterers. For all of the scatterers, the
range delays t;(0), received voltages Vix and Doppler frequencies Di(O) have been computed (/ denotes scatterer and t denotes continuous time starting at the zero-range return for the given pulse):
T[ is the range to the /th scatterer and fc is the carrier frequency. The total voltage is the sum of the individual scatterer voltages convolved with the expanded pulse response p(t): VcumiO = J2
Vix
P(f ~ T 'W) exp(-j27r/ c r/(0)
i
Each scatterer pulse return requires a complex exponential (representing the expanded pulse) to be evaluated at every receiver sampling time, which is very expensive if done by brute force. The process of optimising this computation is most challenging in terms of maintaining both high fidelity and high execution rates. 7.4.4.3 Receiver responses Receiver responses and models for the non-linearities (including the A/D), phase noise and eclipsing are applied to received signals. Receiver responses are implemented as filtering operations, while the other effects are single-point functions applied to each received sample.
7.4.5
Model the processing
Simulating the processing is more straightforward, since most algorithms can be simulated exactly as they would be implemented. The challenge is in simulating most of the many potentially viable algorithmic alternatives. The user first selects one out of a number of candidate baselines, and then chooses a set of appropriate parameters. The baselines of interest for SBR in general consist of coherent integration followed by STAP and then by detection. 7.4.5.1 Coherent integration Range and Doppler processing form a bank of matched filters to the waveform over the span of resolvable target ranges and velocities. The user specifies which channels and how many pulses are required as part of the processing parameters, as well as window parameters to control range and Doppler sidelobes. Each pulse from each spatial channel is range compressed. Range compression produces NR range gates for each pulse. Doppler processing immediately follows range compression, and produces No Doppler bins from Np pulses, and is performed on each space-time channel.
7.4.5.2 Clutter cancellation STAP processing then filters across the space-time channels. Covariance matrix estimates are formed for each range-Doppler cell, and these are used to compute a weight vector. The reference cell parameters should be user entered. The weight vector is used for summing the signals over the channels, resulting in a single cluttercancelled matrix of ND by NR resolution cells. With this part of the processing, the numerical precision of the SBR target computer should be modelled, since it has important effects on adaptive algorithms that are attempting to cancel interference that is considerably stronger than target returns. 7.4.5.3 Non-coherent processing After clutter has been cancelled, detection, estimation and imaging operations are typically performed. CFAR processing typically includes either the cell-averaging (CA) or the ordered statistic (OS) algorithm in order to produce a list of target detections. The user selects the threshold and the CFAR reference cell region (including guard bins) spanning both the range and Doppler dimensions.
7.4.6
Evaluate the results
Important performance measures for a radar system relate to the detectability of targets: how small? how slow? at what range? in what clutter? A simulator needs tools to measure the detection (and estimation) performance of the radar system being modelled. It also needs the ability to analyse the signals after various processing steps, in order to determine their effectiveness. 7.4.6.1 Intermediate products During processing, intermediate products (the signals coming out of each processing stage) should be stored in data structures so that the user can analyse them. The twodimensional signal after each stage can be displayed as an image depicting amplitude as a function of range and Doppler (or pulse). Intensity/colour denotes amplitude and each signal dimension takes a Cartesian axis. The following images can be displayed, depending on the processing baseline and options: 1 2 3 4 5
input signal: amplitude versus fast-time and pulse range-compressed signal; amplitude versus range and pulse post PC/PD signal: amplitude versus range and Doppler clutter-cancelled signal: amplitude versus range and Doppler CFAR normalised signal: amplitude versus range and Doppler
For each image, the user selects the two-dimensional region to display and the dynamic range. For images 1 to 3, the channel number is also selected. Target positions can be overlaid with symbols at their true (known a priori) locations; this helps greatly in determining the effectiveness of various processing steps. Figures 7.7 and 7.8 show images of the post-PC/PD and of the clutter-cancelled signals for an example scenario.
Doppler, Hz
ASAR post PC-PD signal, channel 1
range, m
Figure 7.7 Signal before STAP processing
Doppler, Hz
ASAR post-STAP signal
range, m
Figure 7.8 Signal after STAP processing
In the second image, notice that the clutter has been significantly cancelled, and that targets now appear as bright regions surrounding their true locations. 7.4.6.2 Analysis products Target statistics can be extracted from (small) range-Doppler regions surrounding their true positions. Each target's level, SES[R and improvement factor is computed as the maximum over its respective region. Clutter statistics (level, cancellation ratio)
can be extracted from a larger range-Doppler region. All target regions are excluded from the clutter statistics. Statistics are extracted from the signals both before and after STAP processing. This allows improvement factors to be computed. Target statistics can be displayed versus target number, Doppler, range, RCS or ground velocity to provide additional information. The detection process also creates a detection list (Doppler bin, range bin and power). Clutter statistics can be displayed as histograms, or as scatter plots versus Doppler or range.
7.5
Discussion and conclusions
Computer simulation tools for modelling space-based radars employing STAP have been presented. Using these tools, the performance of SBR designs can be evaluated, and design parameters can be optimised. Spaceborne radar applications and design were reviewed. SBR has a large FoV and is capable of global coverage and large search rates. WAS is used to protect large geographical areas from airborne threats and requires AMTI. Theatre defence is used for monitoring a localised region and requires GMTI. STAP techniques for space-based radar were reviewed. Clutter signals must be suppressed, so that targets can be detected. With fast-moving platforms, STAP filters the clutter in space and time in order to cancel it. This provides the required MDV for targets. STAP requires a minimum of two displaced receive antennas with well matched receivers. Pulse compression and pulse Doppler provide the coherent integration. The STAP algorithm combines the channels into a single clutter-cancelled signal, where targets can be detected. Active phased array antennas with electronic steering are preferred for SBR, because they provide beam agility, spotlighting and the ability to form multiple subapertures on receive. Clutter internal motion, ambiguities, pattern and receiver mismatches, earth's rotation, rain clutter and jammers all limit the ability of STAP to cancel clutter with SBR. To mitigate, STAP algorithms adaptively compute weights, have extra degrees of freedom and have their antennas slewed. SBR simulators are used to estimate performance and to help design systems. STAP cancels large clutter signals in order to detect small target returns. To ensure accuracy, proper correlation of signals must be modelled, as must the above limiting factors. Thus baseband signal simulation is recommended, where the radar return signal is generated as samples in range for each pulse. The implementation of a baseband simulator is difficult because of the large number of scatterer returns that must be included. The simulator should also support many design options, parameters and scenarios. Because of the many parameters needed to describe SBR, a well designed GUI is important. It eases parameter entry and allows the running of experiments. To generate the return signals, there are many details to be modelled: positions and orientation of antennas, ranges and bearings of scatterers, aperture shadings with errors, antenna patterns, pulse Doppler waveforms, the range swath with its ambiguities, clutter patches, scatterer distributions over area, and scatterer temporal variation. These are
all used to derive range equation terms that are convolved with the transmit pulse and combined to form the return signal voltage. It is important that the amplitudes and phases of modelled functions be accurate and consistent. It is also important to optimise the signal generation routines, in order to run in reasonable time. Optimising the convolution with the expanded pulse is typically the most difficult of the speed-fidelity trade-offs. A simulator needs tools to measure performance. It also needs the ability to analyse the signals after various processing steps, in order to determine their effectiveness. The two-dimensional signals after each stage can be displayed as images. Target and clutter statistics can be extracted both before and after STAP processing, and be displayed as histograms, curves or scatter plots. The statistics from a series of experiments can be extrapolated to determine whether the SBR design can meet mission requirements. SBR simulators should become increasingly more useful to radar designers in the future, as long as the models within them are accurate and flexible, with appropriate care taken so that both fidelity and speed are achieved. Their implementation on general-purpose office computers, making them widely available, should result in future SBRs being designed and launched with significantly reduced development costs. Finally, it is worth noting that the simulation requirements for SBR are common to other applications such as airborne MTI radars; hence, simulators for them can be designed using similar strategies to those described herein.
References 1 CANTAFIO, L. J. (Ed.): 'Space-based radar handbook' (Artech House 1989) 2 SKOLNIK, M.: 'Radar handbook' (McGraw-Hill, 1990, 2nd edn.) 3 BIRD, J. S. and BRIDGEWATER, A. W. 'Performance of space-based radar in the presence of earth clutter' IEE Proc. F, Radar Sonar Navig., August 1984, 131, (5), pp 491-500 4 BROOKNER, E. and MAHONEY, T. F.: 'Derivation of a satellite radar architecture for air surveillance', Microw. J., February 1986, pp 173-191 5 NOHARA, T. J.: 'Design of a space-based radar signal processor', IEEE Trans. Aerosp. Electron. Syst, April 1998, 34, (2), pp 366-377 6 NOHARA, T. J., WEBER, P., and PREMJI, A.: 'Space-based radar signal processing baselines for air, land, and sea applications', Electron. Commun. Eng. J., October 2000, pp 229-239 7 NOHARA, T. J., WEBER, P., PREMJI, A., and LIVINGSTONE, C : 'SARGMTI processing with Canada's Radarsat 2 satellite'. Proceedings of the IEEE symposium 2000 on Adaptive systems for signal processing, communications, and control, Lake Louise, Alberta, Canada, October 1^4, 2000 8 YAKOV D. SHIRMAN: 'Computer simulation of aerial target radar scattering, recognition, detection and tracking' (Kharkov Military University, Editor, Artech House Publishers, 2002)
9 SERGEY A. LEONOV: 'Handbook of computer simulation in radio engineering, communications and radar' (Artech House Publishers, 2001) 10 BASSEM R. MAHAFZA: 'Radar systems analysis and design using MATLAB' (Chapman & HALL/CRC, 2000) 11 U.S. Defense Modeling and Simulation Office (DMSO), http://www.dmso.mil 12 Modeling and simulation conference, Association of Old Crows, U.S.A., Orlando, FL, July 17-18, 2002 13 KELLY, E. J.: 'An Adaptive Detection Algorithm', IEEE Trans. Aerosp. Electron. Syst, AES-22, (1), March 1986, pp. 115-127 14 MITCHELL, R. L.: 'Radar signal simulation' (Artech House, Dedham, MA, 1976) 15 Radarsim®SBR Space-based radar design and performance evaluation tool, Sicom Systems Ltd., www.sicomsystems.com 16 KELLY, E. J., and TSANDOULAS, G. N.: 'A displaced phase center antenna concept for space-based radar applications'. IEEE Eascon, Washington, 1983, pp.141-148 17 TIM J. NOHARA: 'Comparison of DPCA and STAP for space-based radar.' Proceedings of the 1995 IEEE international Radar conference, Arlington, VA, USA, May 8-11, 1995, pp. 113-119 18 KLEMM, R. and ENDER, J.: 'New aspects of airborne MTF. Proceedings of the 1990 IEEE international Radar conference, Arlington VA, USA, May 7-10, 1990, pp. 335-340
Chapter 8
Techniques for range-ambiguous clutter mitigation in space-based radar systems1 Stephen M. Kogon and Michael Zatman
8.1
Introduction
Space-based radar (SBR) systems provide several important capabilities for the detection of moving targets that are not possible from an airborne platform. These advantages include continuous access to important tactical areas as well as rapid surveillance of large regions on the ground [I]. Although SBR systems have been studied for some time now, it is only recently with the rapid development of computational and antenna array technology that these systems have come closer to realisation. As a result, a large research investment has been made into the development of future SBR systems [2]. SBR systems show great promise for the detection of ground moving targets, a mode commonly referred to as ground moving target indication (GMTI) which is enabled by ground clutter cancellation using space-time adaptive processing (STAP). A GMTI mode typically utilises a pulse-Doppler waveform, consisting of a series of pulses transmitted at a constant rate known as the pulse repetition frequency (PRF). The advantage of these waveforms is the approximate decoupling of range and Doppler that allows for efficient processing with fast Fourier transforms (FFTs). However, these pulse-Doppler waveforms are plagued by ambiguities in both Doppler and range [3, Chapter 17]. In SBR geometries, these ambiguities intercept clutter and cause problems for GMTI detection [I]. Doppler ambiguities arise due to the high platform velocity in an SBR but can be effectively managed through sidelobe control and STAP algorithm design. Range ambiguities, on the other hand, are a problem
1
This work was supported by the U.S. Air Force under Air Force Contract # F19628-00-C-0002. Opinions, interpretations, conclusions and recommendations are those of the authors and are not necessarily endorsed by the United States Government
radar pulses
Figure 8.1 Range-ambiguity problem for an SBR at low grazing angles or long ranges for an SBR at low grazing angles or equivalently long ranges when the area on the ground illuminated by the radar exceeds the radar receive window of each pulse. An illustration of this problem is shown in Figure 8.1. These ambiguities are much more difficult to handle. Mechanical steering of the SBR array to maintain boresight on an area of interest produces a variation in clutter angle-Doppler characteristics with range. This non-stationarity leads to multiple Doppler frequencies with SINR loss holes from range-ambiguous clutter and can severely compromise detection performance. The use of STAP in the presence of range ambiguities and strategies for coping with these ambiguities are the topics of this chapter. For this chapter, we focus on a notional SBR platform in low-earth orbit (LEO) at 1000 km altitude with an antenna 20 m in length (azimuth) and 2 m in height (elevation). First, notation and metrics are established for moving target detection from an SBR with STAP followed by a discussion of the unique characteristics of ground clutter returns in SBR. In particular, the problem of range-ambiguous clutter is described. Next, we demonstrate the impact of range-ambiguous clutter on STAP performance and describe two means of overcoming the range-ambiguous clutter problem for pulse-Doppler waveforms: PRF diversity and an increase in elevation aperture. Last, the use of an alternative new waveform, namely a long single pulse waveform, that eliminates ambiguities both in range and Doppler, is considered for an SBR system.
8.2
Moving target detection with SBR
An SBR is a very powerful asset for surveillance applications since it can observe a large area on the ground below due to its high platform altitude. This large observable area gives an SBR a great deal of flexibility in where it looks and offers the capability of rapid search rates. The difficulty that arises from a large observable area is the management of all the returns. A large azimuthal aperture or array length focuses the SBR to a particular azimuth or cross-range area. In addition, aperture in the azimuth dimension limits mainbeam clutter Doppler spread which in turn determines performance against slow moving targets. The other dimension of range has an extent starting at the subsatellite point out to the horizon. This large range extent can present problems for a pulse-Doppler waveform. Unlike airborne applications which have
platform velocity
aimpoint
mechanical steering angle
Figure 8.2
Mechanical steering of SBR with angles with respect to array
much smaller range extent, an SBR utilising a pulse-Doppler waveform typically cannot remain range unambiguous.2 Therefore, an SBR must control range by using elevation aperture in a two-dimensional antenna array to focus to a particular range relying on elevation sidelobe control to reject range-ambiguous clutter. Since the deployment of two-dimensional arrays in space can be very costly, a limit must be placed on the total area of an array aperture. Therefore, elevation and azimuthal aperture inevitably must be traded off. Increasing azimuthal aperture improves crossrange accuracy and GMTI performance against slow moving targets, while elevation aperture can be used to control range and in turn range-ambiguous clutter. Another unique aspect of an SBR is that in an effort to optimise accuracy, the array is mechanically steered to keep the array face pointing at the aimpoint. Maintaining array broadside to the aimpoint is commonly referred to as boresighting the antenna. Like rotating arrays in airborne applications, mechanical steering leads to a mismatch in the platform velocity heading and the orientation of the array. A depiction of a mechanically steered array is shown in Figure 8.2 where (/> is used as the cone angle with respect to the array and ^ is the cone angle with respect to the velocity vector. The difference in these two angles: #mech=0-^
(8-1)
For example, at an altitude of 1000 km the PRF to remain range unambiguous to the horizon would be 55.5 Hz
is the mechanical steering angle. The angle between the array axis and the velocity heading is often referred to as the crab angle in certain airborne systems [4, 5]. This mismatch of the array axis with velocity heading leads to a range dependence of clutter Doppler and has certain consequences on STAP both in terms of training adaptive weights [6, 7] and range-ambiguous clutter. Although electronic steering can also be employed,3 for the analysis in this chapter we will restrict ourselves to a mechanically steered array without any electronic steering, i.e. array broadside 0 = 90°.
8.2.1 STAP for SBR systems Space-time adaptive processing (STAP) has become a well established research area, in large part due to a heavy research effort over the past 15 years, most of which has been well documented [4, 5]. Although the cancellation of ground clutter in radar systems using STAP had been introduced back in the 1970s [8], STAP has matured to the point that a recent textbook has been devoted to the topic [5]. Although most STAP work to date has focused on airborne platforms, space-based platforms have been considered more recently [9, 10]. STAP is the two-dimensional adaptive filtering for ground clutter cancellation used in radar applications with a moving platform, airborne or space-based, for the detection of moving targets. The principle that STAP exploits is that clutter returns constitute interference from the non-moving ground. These returns have a unique spatial and Doppler structure determined by the platform velocity and the orientation of the array. Moving targets, on the other hand, have a different Doppler than ground clutter due to the additional Doppler shift from their own radial velocity i>t relative to the radar. The Doppler frequency of a moving target is given by: ft = -r1 + - ^ c o s V t A
(8.2)
A
where up is the radar platform velocity, A is the radar wavelength and i/rt is the cone angle between the platform velocity vector and the direction to the target.4 The Doppler frequency of clutter coming from the same angle as the target is: 2 Un
/ c = - ^ cos Vt
(8.3)
A
Hence, the use of the two dimensions of space and time makes it possible to separate moving targets from clutter that is not possible in either angle or Doppler alone. Although most STAP analyses assume an TV element uniform linear array with A/2 spacing, SBRs typically use a two-dimensional array that is so large that forming digital channels on each element is not practical. Thus, we consider an SBR twodimensional array with subarrays formed in two-dimensional panels to create an array of TV spatial channels, i.e. subarrays, with a uniform spacing of D metres. Note that the spatial channels are formed in one dimension along the azimuthal axis as the 3
Electronic steering is used to alleviate mechanical steering requirements or to simultaneously cover multiple areas on the ground 4 V = 0° is forward, \Jr = 180° is aft and f = 90° is broadside
entire elevation dimension has been beamformed to the aimpoint for each subarray.5 Consider the problem of detecting a moving target at a Doppler frequency / t and an angle with respect to the array 0t- The pulsed waveform uses M coherent pulses transmitted at the pulse repetition frequency (PRF) /PR. The time between pulses, known as the pulse repetition interval (PRI), is simply the reciprocal of the PRF. The returns from these M pulses make up a coherent processing interval (CPI). The space and time response vectors of this target are given by: (8.4) (8.5) respectively. The combined space-time response vector of the target is then simply the Kronecker product: v(0t,/t) = b ( / t ) ® a ( 0 t )
(8.6)
Let us consider a space-time snapshot containing a target signal and given by: x ( / i ) = a t v ( 0 t , / t ) + Xi+n(n)
(8.7)
where oft, 4>u and / t are the target amplitude, angle with respect to the array, and Doppler frequency, respectively. Xi+n is the interference-plus-noise signal and n is the snapshot index. Here, we only consider interference consisting of ground clutter returns from the transmitted radar signal. The optimum space-time weight vector6 steered to 0o a t Doppler frequency /o is given by [8]: (8.8) where Qi+n = E{xi +n (n)xj +n (n) // } is the interference-plus-noise covariance matrix due to clutter coming from all angles. This version of the optimum STAP weights has been normalised for unit gain in the look direction. The performance of the optimum space-time processor from equation (8.8) is measured via the output signalto-interference-plus-noise ratio (SINR). As the name implies, SINR is simply the ratio of output target signal and interference-plus-noise powers: (8.9) where orf is the target signal power in a subarray channel. Many times, we want to compare the SINR to the maximum SINR that could possibly be achieved. This upper limit is determined by the ideal matched filter for the interference-free case, i.e. thermal noise only. Normalising the SINR by the SNR of the ideal (interference-free) The use of two-dimensional array channels is not considered in this chapter; adaptation for threedimensional STAP with two-dimensional degrees of freedom requires more complicated algorithms and training due to the range-dependent nature of clutter in the elevation dimension ^ STAP weights are optimum when (pQ = (pt and /o = ft
matched filter SNRo yields: (8.10) which is known as SINR loss [4]. Many times SINR loss is computed across angles and/or Doppler frequencies. An SINR loss of unity (0 dB) indicates perfect interference cancellation. Although the SINR loss metric indicates the losses associated with the presence of clutter, a more meaningful metric for a GMTI radar is the slowest velocity it is able to reliably detect, known as the minimum detectable velocity (MDV). A common measure of MDV is the minimum velocity for an acceptable SINR loss, e.g. LSINR = - 5 dB.
8.3
Clutter characteristics of pulse-Doppler waveforms in SBR
In this section, we examine the characteristics of clutter in an SBR system using pulse-Doppler waveforms that lead to some unique issues for STAP ground clutter cancellation. These characteristics arise from the range-Doppler ambiguity function for a pulse-Doppler waveform shown in Figure 8.3. Doppler ambiguities occur at regular spacings of the PRF /PR, and range ambiguities occur at integer multiples of the PRI, i.e. inversely proportional to the PRF. Control of these two ambiguity types is clearly at odds with one another. As we will see, requirements on a minimum PRF for Doppler considerations can lead to range ambiguities and degraded STAP performance. In addition, the Doppler resolution is determined by the coherent integration time (CPI length) and the range resolution by the radar bandwidth as shown in Figure 8.3. The implications of this pulse-Doppler ambiguity surface for an SBR arise from some unique aspects of the SBR platform. These aspects include:
azimuth beamwidth
Doppler
1 large platform velocity 2 high altitude 3 full mechanical steering (360°).
elevation beamwidth 2*bandwidth
integration time
range
Figure 8.3 Range-Doppler ambiguity surface for a pulse-Doppler waveform
The high platform velocity of an SBR leads to large Doppler frequencies for clutter and a large Doppler spread of mainbeam clutter. As a result, the radar has a minimum PRF that must be maintained for effective clutter mitigation and MDV performance. On the other hand, a high platform altitude leads to a large range to the ground and therefore a large illuminated area on the ground. To remain range unambiguous over the illuminated area, the radar must operate at a maximum PRF corresponding to the range extent of this illuminated area. Finally, range-ambiguous clutter is complicated by the mechanical steering of the array which leads to misalignment with the platform motion. The clutter has range-dependent angle-Doppler characteristics resulting in non-stationary behaviour in range complicating the clutter mitigation problem. We detail these characteristics and their implications on STAP in the following section.
8.3.1
Clutter Doppler ambiguities
SBR systems, unlike airborne radars, have such a high platform velocity that it is not possible to operate at a PRF that does not result in ambiguities in Doppler. Recall that the Doppler frequency of clutter from any point on the ground is given by: /c = ^COSi/rc A
(8.11)
where \j/c is the angle to the clutter with respect to platform velocity. The total Doppler extent of clutter is then the difference between forward ^c = 0° and ^c = 180° and is given by: 4v A/c = - 1
(8.12)
A
Since the PRF is essentially the Doppler sampling frequency, it must be greater than the total clutter Doppler extent in equation (8.12) to avoid any aliasing, i.e. to be unambiguous. For example, for a low-earth orbit SBR with an altitude of 1000 km and a platform velocity of 7000 m/s operating at / = 10 GHz, the total Doppler extent is 933.3 kHz. Choosing a PRF to be Doppler unambiguous in this case is clearly not practical.7 The fact that can be exploited in order to manage Doppler ambiguous clutter is that clutter has a unique angle-Doppler correspondence. As a result, Doppler ambiguous clutter can be rejected spatially either with low two-way transmit/receive (Tx/Rx) azimuth sidelobes or with STAP spatial degrees of freedom. For low grazing angles, i.e. shorter range, received clutter power is stronger and spatial DoFs can be used to handle clutter that leaks through azimuth sidelobes. At smaller grazing angles (long ranges), clutter power is weaker and azimuth sidelobes should be used to suppress Doppler ambiguities. Since we can rely on sidelobes and STAP to handle Doppler ambiguities, the key requirement is that we do not allow any Doppler ambiguities 7
A PRP this high would yield an extremely low range extent and would be highly range-ambiguous. In addition, this PRF would be severely oversampled and therefore redundant in terms of potential target velocities placing an unnecessary processing burden on the SBR
to exist within the receive azimuth mainbeam. Satisfying this condition allows for effective nulling of the clutter Doppler ambiguity and maintains acceptable SINR loss and MDV performance.8 For an array with an azimuthal aperture L az , the PRF must be chosen to be larger than the mainbeam clutter extent to avoid mainbeam Doppler ambiguities. Clutter within the mainbeam already requires STAP to perform mainbeam nulling to detect targets within the mainbeam clutter spread. For an array of length L32, the nulls for an untapered beam fall at COs^11 ^ = A/L az and the null-to-null beamwidth is: Acos
(mb)^|i
(8.13)
Therefore, the mainbeam clutter spread measured from null to null of the beamwidth is found by substituting equation (8.13) into equation (8.11) and is given by:
A/ffi = ^
(8-14)
L az
Note that this Doppler spread is independent of frequency or wavelength. As a result, the minimum PRF for effective STAP clutter suppression and to maintain good MDV performance is:
/PR > ~ L
(8.15)
az
This PRF is commonly referred to as the displaced phase centre array (DPCA) PRF which is the PRF that has a pulse-to-pulse delay T equal to the time for equivalent monostatic phase centres from the front and back halves of the array to spatially align from consecutive pulses [4]. Using a time delay between pulses that spatially aligns the front and back half apertures results in matching clutter characteristics for the two spatial channels. In theory, this condition allows for the perfect cancellation of clutter by subtracting consecutive pulses from the two half-aperture phase centres.
8.3.2
Clutter range ambiguities
The amount of unambiguous time associated with a pulse in a pulse-Doppler waveform is the pulse repetition interval (PRI) given by: ^P = y -
( 8 - 16 )
/PR
and the amount of time associated with a range extent AR on the ground is: At=™*
(8.17)
C
Provided the coherent integration time is long enough to give sufficient Doppler resolution and to ensure STAP performance is aperture limited, e.g. r c o h > ^r 2 -
Therefore, the amount of unambiguous range associated with a PRI, considering two-way propagation from the radar to the ground, is:
A*=-f-
(8.18)
2/PR
We will strictly concern ourselves with elevation mainbeam clutter since remaining range unambiguous out to the radar horizon is not practical for SBR. Instead, we will rely on the elevation Tx/Rx sidelobes and proper radar management of clutter power to ensure that ambiguities in the elevation sidelobes can be ignored. The amount of range illuminated by an SBR antenna array is determined by the elevation aperture Lei and the range to the ground. Although, for a flat earth assumption the illuminated range can be approximated by R\\ ~ /?A0ei, SBR applications must account for earth curvature and such a simple expression for illuminated range is generally not possible. However, the illuminated range is easily computed by computing the angles of elevation beamwidth nulls and computing the grazing angle, and therefore range, associated with these elevation angles. From the illuminated range, we can compute the range-unambiguous PRF by substituting for AR in equation (8.18) and solving for /PR. Figure 8.4 shows the range-unambiguous PRF, i.e. the maximum PRF for which the radar does not have any range ambiguities in the elevation mainbeam, as a function of grazing angle for our SBR example at / = 10 GHz with a platform
range-unambiguous PRF, Hz
5 metre 2 metre 1 metre
grazing angle, deg
Figure 8.4
Range-unambiguous PRF versus grazing angle for SBR with platform altitude h = 1000 km, f = 10 GHz and elevation apertures of1 m (solid line), 2 m (dotted line) and 5 m (dashed line)
altitude of 1000 km for elevation apertures of 1, 2, and 5 m. Recall the PRF for effective clutter cancellation from equation (8.15) is a function of the azimuth aperture. Usually, azimuth aperture will be L32 > 10 m for MDV considerations and 1500 > /PR > 3000 Hz. This requirement for range-unambiguous PRF is in clear violation of our minimum PRF requirement to maintain MDV performance from Section 8.3.1, especially for low grazing angles (long range). For these grazing angles, an SBR must simply accept the fact that range-ambiguous clutter exists within the elevation mainbeam. As we also see from Figure 8.4, more elevation aperture allows an SBR to operate range-unambiguously out to lower grazing angles. We explore this option as a tool to combat range-ambiguous clutter in Section 8.5.2.
8.4
Impact of range-ambiguous clutter on STAP performance
In the previous section, we discussed ambiguities in range and Doppler. Doppler ambiguities of clutter can be effectively handled with proper sidelobe control and STAP spatial degrees of freedom. Range-ambiguous clutter, however, is much more difficult to alleviate since STAP does not have a dimension to discriminate ambiguities from one another. Here, we give examples of the impact of range-ambiguous clutter and show the two aspects that affect the impact of range-ambiguous clutter: grazing angle or range to the radar and mechanical steering of the array. As discussed in Section 8.3.2, range ambiguities arise at long ranges due to the range extent of the elevation mainbeam. Mechanical steering of the array, however, is the mechanism which makes range-ambiguous clutter a big problem for GMTI performance. Consider the depiction of range-ambiguous clutter for the cases with and without mechanical steering shown in Figure 8.5. For a mechanically steered array, the misalignment between the velocity vector and the array axis produces isoDoppler and isocone angles that no longer overlay. As a result, clutter will have angle-Doppler characteristics that vary with range. This problem arises for STAP training across range, but its impact is much more severe in the case of range-ambiguous clutter. The result is multiple clutter ridges for all ambiguities which result in multiple Doppler frequencies that have clutter for any given angle, as shown in Figure 8.5. As we will see, the result is multiple SINR loss notches that create several blind velocity zones. To illustrate the impact of range-ambiguous clutter on STAP clutter cancellation performance, we show SINR loss versus target velocity for a few SBR scenarios.9 Throughout this chapter, SINR loss is computed from true covariance matrices and is intended to provide an upper bound on STAP performance that could be achieved in an actual SBR for the given geometry. The results do not reflect any losses arising from estimated covariances or from training STAP weights with range-varying clutter snapshots. In both cases, we look at the performance of a 20 x 2 m array operating 9 SINR loss is plotted versus target velocity and not Doppler frequency. Clearly, for mechanical steering angles other than broadside # mec h = 0°, the Doppler shift at array broadside is /dOpp ¥= 0- These SINR loss curves versus target velocity reflect the Doppler contribution of the target only and can be thought of as having compensated for SBR platform-induced Doppler on the target
isocone isoDoppler ambiguity aimpoint ambiguity
Doppler
no mechanical steering (azimuth = 90°)
angle mechanical steering
Doppler
range ambiguities
angle
Figure 8.5
Non-stationarity ofclutter ridges in angle-Dopplerfor range-ambiguous clutter
at a 1000 km altitude, / = 10 GHz and a PRF of / P R = 2500 Hz at the two ranges of 1300 km and 2700 km (47° and 11° grazing angles). The number of subarray channels is N = 8 and the CPI length is M = 64 pulses. The clutter-to-noise ratio in this case is 25 dB. SINR loss is shown for target velocities between ±100 km/hr (Doppler/velocity unambiguous out to ±67 km/hr for /PR = 2500 Hz), a typical range of velocities to expect for ground moving vehicles.10 First, we examine the performance for the array mechanically steered to broadside of the radar <9mech = 0° shown in Figure 8.6a. We see that the only difference in the two ranges is that the SINR loss notch at i>t = 0 km/hr is deeper for the long range case with the range ambiguities. The deeper notch is due to the fact that the clutter range ambiguities aligned in Doppler and angle and therefore just have an additive effect on overall CNR. Since the elevation mainbeam contains three ambiguous ranges at 11° grazing angle, the SINR loss is approximately —4.5 dB deeper. Next we look at SINR loss for the case when the array is mechanically steered forward to #mech = 60° shown in Figure 8.6b. Now clutter range ambiguities no longer align in Doppler and there are multiple SINR loss notches. The effect is that in addition to low velocity targets several other target velocities are undetectable, commonly referred to as blind velocities. *l
10 Note that Doppler frequency of a moving target corresponds to the radial velocity of the target with respect to the radar platform 11 Blind velocities are typically associated with Doppler frequencies separated from the Doppler at array broadside by multiples of the PRF but in this case are due to range ambiguous clutter
target velocity, km/hr
Cl
Figure 8.7
b
target velocity, km/hr
U
SINR loss, dB
SINR loss, dB velocity, km/hr
mechanical steering, deg
SINR loss without range-ambiguous clutter (1300 km range) and with range-ambiguous clutter (2700 km range) for mechanical steering of (p = 0° (broadside to platform) and 0 = 60° (forward scanned) a mechanical steering = 0° b mechanical steering = 60°
mechanical steering, deg
Figure 8.6
SINR loss, dB
SINR loss, dB a
velocity, km/hr
SINR loss versus mechanical steering angle without range-ambiguous clutter (1300 km range) and with range-ambiguous clutter (2700 km range) a Range = 1300 km (grazing angle = 47°) b Range = 2700 km (grazing angle =11°)
These complete losses in coverage are clearly undesirable. To get a better feel for the full effect on the coverage of an SBR, we show the SINR loss performance at both ranges for all mechanical steering angles from platform broadside (#mech = 0°) to forward (#mech = 90°) in Figure 8.7. Here, the effect on SBR coverage is quite dramatic. The region 0° < #rnech < 40° is almost completely lost as the range ambiguities have separated in Doppler and are all within the azimuth mainbeam. STAP cannot place multiple mainbeam nulls and as a result performance is clutter
limited rather than noise limited. As the array is steered forward (#mech > 40°), it is able to resolve the clutter range ambiguities and at least restore performance between the resulting blind velocities from the range-ambiguous clutter. In contrast, for the shorter range without clutter range ambiguities, coverage is complete for all mechanical steering angles.
8.5
Range-ambiguous clutter mitigation techniques with pulse-Doppler waveforms
We have now outlined the problem of range-ambiguous clutter for pulse-Doppler waveforms and shown their effect on GMTI performance with STAR Examining the pulse-Doppler ambiguity function from Figure 8.3, we clearly have two means of controlling clutter range ambiguities: lowering the PRF to increase the unambiguous range extent for a PRI from equation (8.16) or reducing the elevation beamwidth of the SBR by increasing the elevation aperture. Since PRF can only be reduced to the limit imposed by mainbeam clutter spread, we instead will consider a scheme of using multiple range-ambiguous PRFs to cover all potential target velocities, referred to as PRF diversity. For increasing the SBR elevation aperture, we will consider the case of maintaining the same azimuth aperture for MDV purposes. The resulting twodimensional array, therefore, has a larger array total area and is more costly in terms of deployment. This analysis may be used to gain insights into the amount of improvement additional elevation aperture provides to determine if the performance gains justify the additional cost. Note that we do not consider a true two-dimensional array with spatial channels in both the azimuthal and elevation dimensions. We assume that the elevation dimension is increased to reduce the elevation mainbeam and do not consider the use of elevation degrees of freedom for clutter cancellation [5, Chapter 10].
8.5.1 PRF diversity As has been shown, range ambiguities have the effect of creating additional blind velocities other than v\ = Okm/hr as illustrated in Figure 8.6b. The location of these blind velocities is determined by the range-ambiguous PRF falling at the Doppler frequencies of the clutter range aliases. By changing the alias ranges, accomplished by changing the PRF, we can alter the Doppler frequencies of the aliases and therefore move the blind velocities. In this section, we explore the use of multiple PRFs, or PRF diversity to cover all target velocities of interest. Note that this technique results in a loss in terms of surveillance rate since now multiple CPIs with different PRFs must be devoted to each area on the ground. We will not attempt to quantify this loss here since its specifics are highly dependent on the radar system parameters. Wereturntothecaselookedatearlierwitha20x2marrayat/z = 1000 km altitude mechanically steered to #mech = 60°. The range of the SBR to the aimpoint is 2700 km (Bp = 11°). Recall the SINR loss for a PRF / P R = 2500 Hz in Figure 8.6b. This plot is repeated in Figure 8.8 as the dash-dot line. Here, we also consider the two other
SINR loss, dB
PRF =1500 Hz PRF = 2000 Hz PRF = 2500 Hz target velocity, km/hr
Figure 8.8
SINR loss for multiple PRFs at 2 700 km range and mechanical steering ofcp = 60°. Solid line is / P R = 1500 Hz, dotted line is / P R = 2000Hz and dash-dot line is /PR = 2500 Hz
PRFs of/PR = 2000 Hz and 1500 Hz shown in the dotted and solid lines, respectively. Note that both of these PRFs are above the minimum PRF for this array which from equation (8.15) is / ^ m ) = 1400Hz. Using all three PRFs, we see that there is no additional blind velocity zone (LSINR < —5 dB) other than the ft = 0 km/hr zone. Clearly, three PRFs are sufficient to accomplish coverage for ±100 km/hr, although with the resulting losses in surveillance rate associated with dwelling on the same area with three different PRFs. Next, we consider full mechanical steering for PRFs of 1800 Hz and 2500 Hz shown in Figure 8.9. The lower PRF has reduced the number of total range ambiguities and therefore has fewer blind velocities, especially noticeable as the array looks forward. Important to note is that the region of mechanical steering angles of 0° < #mech < 40° has very poor performance for both PRFs, lacking detectability for all velocities. The reason for the complete loss of performance is that range ambiguities are now separated in Doppler due to the mechanical steering, yet both ambiguities fall within the azimuth mainbeam. STAP cannot cope with multiple mainbeam nulls and performance therefore becomes clutter limited since clutter cannot be effectively nulled. Once the mechanical steering is beyond #mech > 40°, the clutter range ambiguities fall outside the azimuth mainbeam and only affect the velocity corresponding to their associated Doppler frequency. Again, all the STAP performance results shown here are with true covariance matrices and do not reflect various real-world effects, e.g. estimation/training or nonstationary clutter. The training of the STAP adaptive weights with range-ambiguous clutter is complicated by the non-stationary nature of clutter for any mechanical steering. With multiple clutter points that all vary differently with range, techniques such as
Figure 8.9
SINR loss, dB
mechanical steering, deg
SINR loss, dB
mechanical steering, deg
velocity, km/hr
velocity, km/hr
SINR loss versus target velocity and mechanical steering angle for 20 x 2m array at range of 2700km a PRF = 2500Hz b P R F = 1800Hz
Doppler warping [6, 7] will not be very effective. The best alternative is to have the weights vary with range and update rapidly in range or to use some technique that accounts for non-stationarity.
8.5.2 Aperture trade offs The complete solution to the range-ambiguous clutter problem for SBR is to increase the elevation aperture to the point where range ambiguities do not exist. Of course, in most cases this increase in aperture comes at a very large cost. We will not attempt to get into this aspect of larger SBR antenna arrays but instead will only attempt to show how the use of larger apertures can enable full coverage for an SBR. We again consider the case of an SBR at an altitude of h = 1000 km operating at / = 10 GHz and a range to the aimpoint of 2700 km (0& = 11°). We look at the STAP performance for the same PRFs (/ PR = 1800 and 2500 Hz) as in Figure 8.9 for the 20 x 2 m aperture. However, in this case, we consider a 20 x 5 m aperture. The larger elevation aperture reduces the range extent illuminated on the ground and therefore the radar can still be range unambiguous at higher PRFs. The performance of the 20 x 5 m aperture is shown in Figure 8.10 for mechanical steering angles from #mech = 0° to #mech = 90°. This aperture can operate range-unambiguously for /PR = 1800 Hz while still having two range ambiguities for /PR = 2500 Hz. Clearly, the /PR = 1800 Hz could operate at this range without any problems. Even the higher PRF /PR = 2500 Hz has much better performance, even for 0° < #mech < 50° when the ambiguities cause multiple mainbeam nulls. Overall, the increased aperture is the most attractive solution for the rangeambiguous clutter problem since it completely eliminates the ambiguities. It should be noted that we demonstrated performance for a 20 x 5m array at #gr = 11° or 2700 km range. Further ranges still have range ambiguity problems and the amount
Figure 8.10
SINR loss, dB
mechanical steering, deg
SINR loss, dB
mechanical steering, deg
velocity, km/hr
velocity, km/hr
SINR loss versus target velocity and mechanical steering angle for 20 x 5 m array at range of 2700km a PRF = 2500Hz b PRF = 1800Hz
of elevation aperture needed to remain range unambiguous for all ranges depends on the maximum range for the SBR application.
8.6
Long single pulse phase-encoded waveforms
It has already been established that more aperture and multiple coherent processing intervals with different PRFs can be used to overcome the ambiguous nature of the returns from pulse-Doppler waveforms. Another way of handling this problem is to use waveforms which are effectively unambiguous in both range and Doppler. In this section, the use of unambiguous long single pulse waveforms is discussed.12 Examples of appropriate modulation include random binary phase encoding [3, Chapter 10.6] and chaotic modulation [H]. In this section, we consider the binary phase-encoded waveforms for illustration purposes. The Doppler resolution for single pulse waveforms is simply the inverse of the pulse length, i.e. the total coherent integration time. For conventional monostatic radars, which are unable to transmit and receive simultaneously, the pulse length is limited to less than half the propagation time to the target, since there must be enough uneclipsed range samples to fill the pulse-compression filter. We use the following well known relationships of the propagation time Jpuise> Doppler resolution A/dOpP
The objective could be viewed as designing a waveform whose ambiguities, if they exist, do not coincide with clutter returns, i.e. ambiguities are moved in such a way that all returns from the earth are unambiguous. We focus merely on the concept and do not investigate waveform design
resolution, km/hr
range, km
Figure 8.11 Doppler (velocity) resolution versus pulse length for a 10 GHz radar
of the pulse, and Doppler frequency /dopp(8.19) where R is the range to the target, c is the speed of light (propagation velocity), v is velocity and k is the wavelength. From these relations, we can derive the Doppler (velocity) resolution: Aw=^
(8.20)
Figure 8.11 shows the limit on Doppler resolution as a function of range for a radar operating at 10 GHz. The Doppler resolution of a GMTI radar should be significantly better than the specified target MDV for the radar. Airborne GMTI radars typically look for targets at ranges of under 300 km, equating to Doppler resolutions which are worse than 25 km/hr and clearly inadequate for typical GMTI performance requirements. However, SBR systems have considerably longer ranges to the target. For the SBR example explored in this chapter with a 1000 km orbit and a minimum range of 1200 km, the Doppler resolution without simultaneous transmit and receive is better than 7 km/hr, which is adequate for most GMTI applications. Figure 8.12 pictorially summarises the concept of the long single pulse waveform for SBR.
amplitude
phase encoded waveform
time
Concept of long single pulse waveform for SBR
response, dB
velocity, km/hr
Figure 8.12
range (km)
Figure 8.13 Ambiguity surface for a 10 ms 5 MHz chip rate phase-encoded pulse at 10 GHz 8.6.1
Properties of long single pulse phase-encoded
waveform (LSP W)
Except for a few special cases [12-14], phase-encoded waveforms have mean range and Doppler sidelobes levels that are inversely proportional to the time-bandwidth product given by: SLL = — — (8.21) TxB For example, a time-bandwidth product of one million is needed to achieve mean range-Doppler sidelobes of -6OdB. This mean sidelobe level is combined with a point-like ambiguity function as shown in Figure 8.13. These waveforms are
unambiguous in range, and unambiguous in Doppler up to the sample or chip rate of the waveform. As we will discuss below, this mean sidelobe level is a limitation of the phase-encoded LSPW, and care must be used in designing the radar to cope with this limitation. In conventional pulse-Doppler radars that utilise chirp waveforms or short phase-encoded pulses, the large amount of range-Doppler coupling makes the pulsecompression and Doppler filtering operations approximately separable. Since the range-Doppler coupling of the phase-encoded LSPW described here is minimal, the pulse-compression and Doppler filtering operations of the radar are not separable, and a Doppler optimised pulse-compression filter is required for each Doppler bin. There are both pros and cons to having separate pulse-compression filters for each Doppler bin. The pro is that for constant velocity targets there are no range walk losses due to target motion over the length of the single pulse. The Doppler-dependent pulse-compression filters are already tuned to the motion. The con is the amount of computation required. For example, an X-band SBR system with a 3000 km (20 msec) pulse has a Doppler resolution of 2.7km/hr from equation (8.20). To cover target velocities of ±110km/hr, eighty separate pulse-compression filters are needed. For waveforms with large time-bandwidth products, the computational complexity of the pulse-compression and Doppler filtering becomes phenomenal with conventional FIR implementations of the pulse compressor. However, the use of overlap-add FFTs helps ameliorate the computational burden. One example of an LSPW STAP processing chain is shown in Figure 8.14. In this case, the Doppler-dependent pulse-compression filters naturally feed the adjacent bin post-Doppler STAP algorithm [15]. However, this is not the only STAP algorithm which may be applied to the LSPW. Application of an inverse-discrete-Fourier transform to the output of all the pulse-compression filters on a per-range-gate basis (Figure 8.15) converts the output into a pre-Doppler pulse-like space with an apparent
pulse comp. Doppler bin 1 N channels pulse comp. Doppler bin 2 digital receiver pulse comp. Doppler bin...
pulse comp. Doppler bin M
Figure 8.14
Adjacent-bin post-Doppler STAP for a long single-pulse waveform
Af channels pulse comp. Doppler bin 2 digital receiver
pulse comp. Doppler bin...
inverse Fourier transform
pulse comp. Doppler bin 1
M 'Pulses' (nyquist sampled for highest Doppler)
pulse comp. Doppler bin M
Figure 8.15
Pulse-Doppler matched filtering for LPSW radar with an inverse DFT across Doppler frequency allows LPSW radar to emulate a conventional pulse-Doppler radar
PRF of: (8.22) where / ^ a x and /^ 11 I are the Doppler frequencies of the highest and lowest Doppler bins used. With this transformation any STAP algorithm that can be applied to pulseDoppler waveforms may also be used with an LSPW. The LSPW advantages of no Doppler or range ambiguities are still retained.
8.6.2 Integrated sidelobe clutter levels One of the key issues with LSPWs is the integrated sidelobe clutter level. If the integral of all the clutter outside the range-gate and Doppler bin of interest is significant compared with the noise floor, then the radar will be desensitised. Therefore, a fundamental requirement on an LSPW radar is an average range-Doppler sidelobe level that is better than the integrated clutter level. The integrated clutter level is computed by integrating the radar range equation over the illuminated clutter [16]: (8.23) where Pt is the transmit power, G(4>az, R) is the antenna gain as a function of azimuth angle 0 ^ and range R, o~(R) is the clutter reflectivity as a function of range, t the pulse length, X the wavelength, k Boltzman's constant, TQ = 290 K, L the radar system losses and F the noise figure. In determining the integrated clutter level, the radar designer only has control over the first three terms in the equation, transmit power and transmit and receive gain.
By approximating the azimuth distribution of the antenna pattern by the Rayleigh beamwidth b and modelling the clutter reflectivity with the constant Gamma clutter model [17] where #gr represents the grazing angle, equation (8.23) becomes:
(8.24)
CNR, dB
The middle term of equation (8.24) represents the clutter reflectivity multiplied by the area illuminated between the ranges R and R + Rg (Rg is the range resolution of the radar). It is interesting to note that since the area illuminated by the radar is approximately inversely proportional to the antenna's gain, the integrated clutter level is approximately proportional to the power aperture of the radar. Since the received target power is proportional to the power aperture squared of the radar, for the same target SNR, radars with larger apertures (and hence less transmit power) will exhibit smaller integrated clutter levels. For the X-band SBR described earlier, with a peak transmit power of 30 dBw, a 20 x 2m aperture (gain of 57.5 dB), noise figure of 4dB, 1OdB of losses, range resolution of 1 m, pulse length equal to the propagation time and a 50 dB receive taper in elevation, Figure 8.16 depicts the clutter-to-noise ratio as a function of range for a beam illuminating the ground at a range of 2000 km with a clutter y of —12 dB.
range, km
Figure 8.16
Clutter-to-noise ratio versus rangefor an LSP W with the example radar system
level, dB
constant power CNR mean sidelobe level variable power CNR range, km
Figure 8.17
Clutter-to-noise ratios for constant and variable transmit power and mean sidelobe level versus range
The CNR is positive for a range extent of about 150 km, or 150000 range gates. Integrating over range gives an integrated clutter-to-noise ratio of 65.2 dB. Figure 8.17 shows both the integrated clutter level and mean range-Doppler sidelobe level (assuming the LSPW length is the same as the propagation time and a bandwith of 150 MHz) as a function of range. If the transmit power is kept constant irrespective of the range which is being illuminated, then the integrated CNR is larger than the mean sidelobe level at ranges less than 2565 km. This would make the radar's performance clutter limited rather than noise limited. However, it should be noted that the target SNR grows faster with decreasing range than does the unsuppressed clutter, so target detectability still improves. However, if the transmit power is varied so as to keep constant SNR on target, then the ratio of the integrated CNR to the mean sidelobe level drops with decreasing range. The effective transmit power to the target can be adjusted in two ways: either by reducing the gain of the transmit amplifiers, or by broadening the transmit beam and forming multiple high gain receive beams. The latter method has the advantage of maximising the radar's search area rate. If the radar is designed to meet the SNR on target and the integrated CNR requirements at maximum range, then performance will be better at all closer ranges. As an example, consider the design of a radar which needs to attain a target SNR of 7.5 dB per pulse at 3000 km. Figure 8.18 shows how the integrated CNR changes as a function of array area and transmit power at 3000 km for a bandwidth of 150 MHz. The integrated CNR is approximately proportional to the radar's power aperture. Contours
integrated CNR, dB
area, m2
7.5 dB SNR on target 63 dB ICNR contour 66 dB ICNR contour peak power, dBw
Figure 8.18
Integrated CNR as a function of area and peak power
of constant integrated CNR and a constant target SNR of 7.5 dB are also shown in the Figure. To design a 150 MHz bandwidth radar which would achieve 7.5 dB SNR per pulse and have the integrated CNR equal to the mean range-Doppler sidelobes requires an array area of 56 m2 and a peak power of 1.25 kW. A radar with the integrated CNR 3 dB below the mean range-Doppler sidelobes at 3000 km requires an area of 111 m2 and a peak power of 330 W.
8.6.3 STAP simulations Here, we look at STAP simulations using the 40 m2 aperture SBR described earlier, and an LSPW with -7OdB range-Doppler sidelobes. The target range is 2500 km and the mechanical steering angle is 30° from the velocity vector, i.e. #mech = 60°. Two hundred snapshots are selected from between 2490 km and 2510 km to estimate the clutter-plus-noise covariance matrix. There are eight spatial degrees of freedom (subarrays) and the pulse is 16 msec long. Three different processing schemes are considered: 1
Pulse-Doppler with a PRF of 2 kHz, Doppler warping [6] and PRI-staggered STAP [5, Chapters 10, 18 and 19] 2 LSPW with adjacent-bin post-Doppler STAP from Figure 8.14 3 LSPW with transformation back to pulse-space from Figure 8.15, Doppler warping and PRI-staggered STAR In case 1, the /PR = 2000 Hz PRF means that range-ambiguous clutter at ±75 km will also be above the noise floor. Doppler warping is applied in cases 1 and 3 to compensate for the change in the Doppler of the mainbeam clutter as a function of
SINR loss, dB
velocity, km/hr
Figure 8.19
STAP performance for processing case 1
range due to the Doppler. However, in case 1 the Doppler warping is only precisely correct for the clutter around 2500 km, not the range-ambiguous clutter, leading to further degradation of the STAP performance. Since the Doppler warping is applied across the pulses on a range gate by range gate basis prior to the Doppler filtering, it is incompatible with the processing in case 2. The PRI staggered processing is chosen for cases 1 and 3 since this algorithm is known to exhibit better performance than the adjacent bin algorithm. The results are plotted in Figures 8.19 to 8.21. In all cases optimum refers to full dimension STAP with exact knowledge of clutter covariance (impossible in practice). Figure 8.19 shows the SINR performance of processing case 1. The nulls at about —32 km/hr and 27 km/hr are due to the range-ambiguous clutter from 2425 km and 2575 km. In this case, the range-ambiguous clutter nulls are only slightly wider than the correct null at 0 km/hr, since the Doppler warping is close to being correct for these ambiguities. With higher CNR the broadening of these nulls would be more severe. Figure 8.20 shows the STAP performance of processing case 2. In this case there is only a single null, but it is broad. Since processing chain 2 is incompatible with Doppler warping, the change in the Doppler of the mainbeam clutter as a function of range cannot be compensated for in the training data. Figure 8.21 depicts the STAP performance of processing case 3. There is only a single narrow null. The combination of the LSPW with a transformation back into the time domain from the Doppler domain, Doppler warping and PRI staggered STAP combine to give near-optimal performance. The only losses with respect to optimum
SINR loss, dB
optimal adjacent bin
velocity, km/hr
STAP performance of processing case 2
SINR loss, dB
Figure 8.20
optimal PRI stagg.
velocity, km/hr
Figure 8.21
STAP performance of processing case 3
STAP are due to the finite sample support and low sidelobe weighting on the Doppler filters in the PRI staggered algorithm.
8.7
Summary
In this chapter, we have discussed the problem of range-ambiguous clutter and its effect on the performance of an SBR system used for GMTI. Range-ambiguous clutter is a problem that arises at long ranges when the array is mechanically steered forward or aft. The misalignment of the array with the velocity vector creates a change in the angle-Doppler characteristics of range-ambiguous clutter and results in multiple blind velocity regions for GMTI. Pulse-Doppler waveforms can utilise multiple PRFs in order to shift the Doppler of these ambiguities and cover all velocities. The cost of this PRF diversity approach is the loss in search rate associated with using multiple PRFs to cover each spot on the ground. Also, PRF diversity is only effective at overcoming clutter range ambiguities that fall outside the mainbeam. For mechanical steering angles from broadside #mech = 0° to #mech = 40°, the 20 x 2m array considered in this chapter could not place multiple mainbeam nulls and, as a result, suffered large losses for all velocities. A more effective means of mitigating range-ambiguous clutter for pulse-Doppler waveforms was to increase the elevation aperture, completely eliminating or reducing the number of range ambiguities depending on the grazing angle. More aperture is clearly the preferred method of overcoming range-ambiguous clutter, the problem becomes a matter of the large cost of deploying a large aperture for an SBR. Finally, we discussed the new, novel approach to range-ambiguous clutter that uses an alternate waveform consisting of a single, long pulse utilising phase encoding to remain range and Doppler unambiguous. The length of the pulse is determined by the range of the SBR to the aimpoint so that at longer ranges, longer pulses are possible. Recall that the problem of range-ambiguous clutter arises at longer ranges. One of the main issues with a long phase-encoded pulse waveform is integrated sidelobe levels which are a problem due to the range-Doppler sidelobe levels of this waveform. Another issue is the amount of computation associated with the matched filtering in range and Doppler of the phase-encoded waveform. Two efficient architectures for STAP were given along with simulations demonstrating the viability and benefit that a range and Doppler unambiguous waveform could provide. This alternative waveform shows promise for SBR applications and bears further investigation and experimentation. References 1 CANTAFIO, L. J.:' Space-based radar handbook' (Artech House, Norwood, MA, USA, 1989) 2 DAVIS, M. E.: 'Technology challenges in affordable space-based radar'. Proceedings of IEEE international Radar conference. Washington, USA, 2000, pp.18-23
3 SKOLNIK, M.: 'Radar handbook' (McGraw-Hill, New York, USA, 1990) 4 WARD, J.: 'Space-time adaptive processing for airborne radar'. MIT Lincoln Laboratory TR 1015, ESC-TR-94-109, 1994 5 KLEMM, R.: 'Principles of space-time adaptive processing' (IEE, London, England, 2002) 6 BORSARI, G. K.: 'Mitigating effects on STAP processing caused by an inclined array'. Proceedings of IEEE Radar conference, 1998, pp. 135-140 7 KREYENKAMP, O. and KLEMM, R.: 'Doppler compensation in forwardlooking STAP radar', IEE Proc, Radar Sonar Navig., October 2001, 148, (5), pp. 253-258 8 BRENNAN, L. E. and REED, I. S.: 'Theory of adaptive radar', IEEE Trans. Aerosp. Electron. Syst., March 1973, 9, (2), pp. 237-252 9 NOHARA, T. J.: 'Design of a space-based radar signal processor', IEEE Trans. Aerosp. Electron. Syst., 1998, 34, (2), pp. 366-377 10 KOGON, S. M., RABIDEAU, D. J., and BARNES, R. M.: 'Clutter mitigation techniques for space-based radar'. Proceedings of IEEE international conference on Acoustics, speech, and signal processing, 1999, pp. 2323-2326 11 OPPENHEIM, A. V. and CUOMO, K. M.: 'Chaotic signals and signal processing', in MADISETTI, V. and WILLIAMS, D. (Eds.): 'Digital signal processing handbook' (CRC Press, Boca Raton, USA, 1997) pp. 71.1-71.13 12 BARKER, R.H.: 'Group synchronizing of binary digital systems', in 'Communication theory' (Academic Press, 1953) pp. 272-287 13 ZIERLER, N.: 'Linear recurring sequences', /. Industrial Applied Mathematics, 1959, 7, pp. 31-48 14 WELCH, L. R.: 'Lower bounds on the maximum cross correlation of signals', IEEE Trans. Inf. Theory, 1914, 20, pp. 397-399 15 DIPIETRO, R.: 'Extended factored space-time processing for airborne radar systems'. Proceedings of Asimolar conference on Signals and systems, 1992, pp. 425-430 16 VAN TREES, H.: 'Detection, estimation, and modulation theory, part III (John Wiley & Sons, New York, USA, 2001, 2nd edn.) 17 BILLINGSLEY, J. B.: 'Low angle radar land clutter measurements and empirical models' (SciTech Publishing, Norwich, NY, 2002) 18 BRENNAN, L. E. and STAUDAHER, R M.: 'Subclutter visibility demonstration'. Adaptive Sensors Inc. technical report, RL-TR-92-21, 1992 19 WARD, J. and STEINHARDT, A. O.: 'Multiwindow post-Doppler space-time adaptive processing'. Proceedings of 7th workshop on Statistical signal and array processing, 1994, pp. 461-464
Part III
Processing architectures
Chapter 9
Parallel processing architectures for STAP Alfonso Farina and Luca Timmoneri
9.1
Summary and introduction
This chapter describes methodologies for online processing of received radar data by a set of N antennas and M pulse repetition intervals (PRIs) for the calculation of space-time adaptive (STAP) filter output. The numerically robust and computationally efficient QR-decomposition (QRD) is used to derive the so-called MVDR (minimum variance distortionless response) and lattice algorithms; the novel inverse QRD (IQRD) is also applied to the MVDR problem. These algorithms are represented as systolic computational flow graphs. The MVDR is able to produce more than one adapted beams focused along different angular directions and Doppler frequencies in the radar surveillance volume. The lattice algorithm offers a computational saving; in fact, its computational burden is 0(N2M) in lieu of 0(N2M2). An analysis of the numerical robustness of the STAP computational schemes is presented when the CORDIC (coordinate rotation digital computer) algorithm is used to compute the QRD and the IQRD. Benchmarks on general purpose parallel computers and on a VLSI (very large scale integration) CORDIC board are also presented.
9.2
Baseline systolic algorithm
The detection of low flying aircraft and/or surface moving targets, and the standoff surveillance of areas of interest require a radar on an elevated platform like an aircraft. The AEW (airborne early warning) radars pose a number of interesting technical problems especially in the signal processing area. The issue is not new: detect target echoes in an environment crowded with natural (clutter), intentional (jammer) and other unintentional radiofrequency (especially in the low region of microwaves, e.g. VHF/UHF bands) interference. The challenge is related to the large dynamic range of the received signals, the non-homogeneous and non-stationary nature of
the interference, and the need to fulfil the surveillance and detection functions in real time. One technique proposed today to solve the problem is based on STAP [2-4,9,10,14-18]. Essentially, the radar is required to have an array (for instance, a linear array along the aircraft axis) of Af antennas each receiving M echoes from a transmitted train of M coherent pulses. Under the hypothesis of disturbance having a Gaussian probability density function and a Swerling target model, the optimum processor is provided by the linear combination of the NM echoes with weights w = M - 1 S*, envelope detection and comparison with threshold. M is the spacetime interference covariance matrix, i.e. M = E{z*zT} where z (dimension NM x 1) is the collection of the NM disturbance echoes in a range cell, s, the space-time steering vector, is the collection of the NM samples expected by the target and (*) stands for complex conjugate. A direct implementation (via sample matrix inversion, SMI) of the weight equation w = M - 1 S* is not recommended. One reason is related to the poor numerical stability in the inversion of the interference covariance matrix especially when a large dynamic range signal is expected during the operation; another one is the very high computational cost. There is a need of extremely high arithmetic precision during digital calculation. Note that double precision costs four times as much as single precision. The situation would be different if, instead of operating on the covariance matrix M, we would operate directly on the data snapshots z(k), k = 1,2,... ,ft where n is the number of snapshots (i.e. range cells) used to estimate the weights w. It can be shown that the required number of bits to calculate the weights, within a certain accuracy, by inversion of M is two times the number of bits to calculate the weights operating directly on the data snapshots z(k). This is so because the calculation of power values is avoided, and thus the required dynamic range is halved. The algorithms that operate directly on the data are referred to as data domain algorithms in contrast to the power domain algorithms requiring the estimation of M. Figure 9.1 depicts both approaches; in this chapter we will develop the algorithms based on the data domain approach. The QRD is a numerical technique for solving least squares problems, like the one in STAP, that avoids direct computation and inversion of interference covariance matrix [1,5]. Indicate with Z the n x (NM)-dimensional matrix which collects the n data snapshots: (9.1) The weight equation can be written as follows: (9.2) where (m)H stands for the complex conjugate transpose. Taking the data matrix Z and operating on it with unitary (i.e. covariance preserving) matrix Q (with dimension ft x /i) we are able to transform the matrix Z in an upper triangular matrix R (with dimension NM x NM): (9.3)
data cube CUT linear combination of weights and signals from the cell under test (CUT)
antenna elements
adapted output
range cells PRT
covariance matrix estimation
weight calculation
power domain approach
data matrix triangulation data domain approach
Figure 9.1
The power and data domain approaches for STAP
thus equation (9.2) can be rewritten as: R^Rw = s*
(9.4)
which is now easily solved by forward and back-substitution steps as follows. Indicating by a new vector, t, the product Rw, equation (9.4) becomes: R H t = s*
(9.5)
that can be solved in t. Subsequently, the additional equation: Rw = t
(9.6)
is solved in w. A noticeable improvement of the basic technique allows us to calculate the STAP output without extracting the weights, i.e. without performing the two substitutions above (see, for instance Reference 1 at page 147, see also Section 9.10, Appendix A). In summary, either the weight vector w or the output signal of the STAP are obtained without forming and inverting any covariance matrix. By using a large number of bits the data domain algorithm provides the same results as the power domain algorithm which estimates the covariance matrix ZH(n)Z(n) and derives the weight vector by the conventional Cholesky factorisation of that matrix in equation (9.2). The noticeable result is related to the far superior performance of the data domain algorithm when using a limited number of bits; in fact, the data domain algorithm needs half the number of the bits required by the power domain method to reach good interference cancellation and target coherent integration. The triangularisation of the data matrix, see equation (9.3), can be done with the following known methods: Givens rotations, Householder reflections (a generalisation of Givens rotations) and Gram Schmidt [I]. Another method to obtain a sparse (actually a diagonal in lieu of triangular) data matrix is singular value
decomposition (SVD); the Jacobi and Hestenes are recursive parallel algorithms to efficiently obtain the SVD. The Lanczos is another numerically efficient candidate to solve our real-time STAP problem [6]. The preferred approach in this chapter is the one based on Givens rotations (see Section 9.10, Appendix A). All these methods enj oy the possibility of being mapped onto a parallel processor such as a systolic array. This means that the algorithm is readily transformed in a computer architecture; this is not the case for the equation (9.2) where a single processor computer has the task of performing the indicated operations. Today it is possible to implement a systolic array with custom VLSI technology thus providing compact processors requiring limited prime power. An additional advantage is related to the large data throughput of the parallel processor representing a suitable means of reaching real-time operation. A remarkable implementation of a systolic algorithm on VLSI chips is called MUSE; it was developed by C. Rader and colleagues at MIT-Lincoln Laboratory (USA) (see third entry of the table in Section 9.14, Appendix E). The baseline architecture considered for the STAP problem is the trapezoidal one depicted in Figure 9.2 [5]. This constitutes the generalisation of a method, which was originally developed for MVDR beamforming, by QRD. The TVM-dimensional triangular array ABC receives the snapshots of data from a set of range cells and outputs from the right-hand side the matrix R produced as the data descend through the array. The matrix is passed to the right-hand column of cells DE which serves to steer the main beam in the desired angular direction and Doppler frequency. Multiple beams can be formed simply by adding right-hand columns as depicted in Figure 9.2; they are constraint post-processors. The bulk of the computation, i.e. the QRD, is common to all of the separate beamforming tasks, and only needs to be performed once. The MVDR processor in Figure 9.2 is designed to operate in the following manner [5]. The triangular processor, in normal adaptive mode (selected by setting an input binary flag / = 1), is fed with sufficient data snapshots to form a good statistical estimate of the environment. The triangular array is then frozen (by setting the input binary flag / = 0) while a look-direction constraint is input as though it were a data vector z(n) emerging from the multichannel tapped delay line. This serves to calculate the vector a = ( R ^ ) " 1 s* which is captured and stored in the right-hand column (also operating in mode / = 0); this vector is needed to determine the STAP output e(n) = zT(n)R~l (RH)~l s*. Once the vertical columns have been initialised, the adaptive mode of operation ( / = 1) is selected for both the main triangular array and the right-hand columns and more data snapshots are presented to the processor. The processor then updates its estimate of the environment (via the stored quantities R and a) and simultaneously outputs the STAP signals from the bottom of the columns DE. The number of processing elements in the triangular systolic arrays is 0.5(MN + X)-MN. The MVDR algorithm has a noticeable computational advantage with respect to the SMI which requires O(N3 M3) arithmetic operations per sample time. Two types of processing element are needed within the triangular array: one calculates the sine and cosine of an angle between two input data values, the other rotates the remaining data of the same angle. The calculation of the rotation and the application of the rotation is repeated for each row of the triangular array. A third cell type is used in
triangular array
Figure 9.2
detection
detection
no target target present
no target target present
Baseline QRD based MVDR flow graph [5]
in the look-direction constraint columns. Every processing cell of the triangular array should perform on average 24 floating-point operations per data snapshot. Let d be the desired data rate, i.e. the snapshots per second to process, the systolic machine should perform HdM2N2 flops. As an example, let d be 1 MHz and NM = 1 0 0 the corresponding processing power needed is 100 Gflops approximately. By down sampling (see also Section 9.4) the radar data by a factor often, the required processing power is 10 Gflops.
9.3
Lattice and vectorial lattice algorithms
An advanced processing architecture referred to as the MVDR lattice processor requires 0(N2M) arithmetic operations per sample time; it is described in Reference 5. It takes advantage of the time-shift invariance1 associated with STAR 1 This requirement is fulfilled only if the PRI is constant (no PRI staggering) and no platform motion perturbations (e.g. rapid acceleration) occur
trapezoidal array
1st stage
delay 2nd stage
MVDR column final stage
beamformer residual
Figure 9.3 MVDR lattice processor [5]
The data entering the triangular array change very little from one PRI to the next which means that a large part of the computation is being repeated on successive PRIs albeit in different parts of the array. This repetition is eliminated in the lattice algorithm where the big trapezoidal array is decomposed in a lattice of smaller (i.e. of dimension N) trapezoidal arrays; the lattice has M stages (see Figure 9.3). The lattice-based MVDR operates in a similar manner to the big trapezoidal array; details are found in Reference 5 (see also Section 9.11, Appendix B). If M = N = 10 and the update rate is one tenth of 1 MHz, the required computational power is 1 GfIop. The lattice
algorithm has also been designed and tested with simulated data for wideband STAP [12]; this architecture is particularly useful: (i) to deal with wideband radar, (ii) to compensate for amplitude and phase mismatching between the receiving channels, and (iii) to combat the hot clutter. The processing architecture, named the vectorial lattice, operates on an array of Af antennas, M PRIs and P samples taken within the radar range cell. The lattice has again M stages, each having trapezoidal arrays of dimension NP. The computational complexity of the scheme is 0(MN2P2). In the above mentioned processing architectures, namely the MVDR, the lattice and the vectorial lattice, the common processing module is the triangular systolic array. In Sections 9.5 and 9.6 we report the results concerning the mapping of the triangular systolic array onto parallel processors.
9.4
Inverse QRD-based algorithms
A further improvement of the triangular systolic array for STAP processing is called IQRD (inverse QR decomposition) and promises an additional decrease of the required computational power. The need to reduce the computational requirements of the triangular systolic array was discussed in Section 9.2. The possibility of down sampling of a factor ten, say, the update of the triangular array was mentioned. This tacitly requires the extraction of the adaptive weights of the STAP at the low update rate and the application of the weights to the radar echoes at the natural rate of the data. This approach has the following practical problem. The MVDR systolic array of Figure 9.2 could extract the adapted weights via back substitution; however, pipelining the two steps of triangular update and back-substitution seems impossible. There are two possibilities to overcome this problem. The first is to use a triangular array in addition to the main one, the second triangular array being reversed with respect to the array which updates the matrix R [14, p. 332]. This approach requires more hardware to be integrated on the chip. The second approach exploits a recursive equation which updates ( R ^ ) " 1 instead of R. The update of ( R ^ ) " 1 serves the purpose of extracting the weights. This algorithm, referred to as IQRD, can be implemented with just one triangular systolic array, which has a specular orientation of the basic triangular array to update R. Figure 9.4 illustrates the processing scheme for the sidelobe canceller (an adaptive system with a main antenna and a number of auxiliaries); by resorting to the concept of generalised sidelobe canceller, it is possible to show that a slightly modified version of the scheme is applicable to STAR The complex valued set of data x e CN are received by the N auxiliary channels; y is the data collected by the main antenna; x, y are processed to give the set of adapted weights w. The quantity e = y — x r w is the adapted residue; the parameter 8/8* occurs to update the weights as more data are received, during time, by the radar; see Section 3.1 of Reference 14 for details. A limitation of this approach is related to the difficult schedule of the various processing steps. A detailed comparative analysis of the IQRD and QRD-based MVDR algorithms is presented in Reference 14. Also, an implementation of the corresponding systolic architectures, with the use of the CORDIC algorithm as a basic building block, is discussed.
Figure 9.4
9.5
RLS-IQR array [14]
Experiments with general purpose parallel processors
This section summarises the findings described in detail in Reference 8; today this study seems out of date for the advancement in signal processing hardware, nevertheless it is still very instructive. We study the use of parallel processors of MIMD (multiple instruction streams multiple data streams) and SIMD (single instruction stream multiple data streams) types available on the market (early 1990s). This approach is meant to be preliminary to the VLSI solution. In fact, it provides guidelines for the design of the processing architecture to be implemented on silicon. The problems of synchronisation of the whole systolic array by a master clock and the data transfer between processors can also be investigated. Additionally, an estimate of the computational performance of several candidate processing architectures is also possible. With reference to the MIMD machine, a reconfigurable transputer-based architecture (the MEIKO computing surface, using up to 128 T800 INMOS transputers) has been adopted and three solutions have been proposed. The first uses a ring of transputers. Then an improvement of performance is reached by diminishing the amount of communication required; such a result has been achieved by using a linear array of processors. The mapping of the algorithm onto a triangular array of processors has also been studied. This solution allows the use of an arbitrary number B of processors provided that B = p(p + l)/2, p being an integer number. This mapping shows performance better than does linear mapping. The investigation on MIMD computers is concluded with a comparison of the results achieved by using the nCUBE2 with 64 processing elements. With reference to SIMD machine, tests on the Connection Machine CM-200 and the MasPar MP-I have been performed. CM-200 is equipped with 8192 single bit processors, whereas MP-I has 4096 four bit processors. The QRD has been mapped onto a mesh architecture for both machines.
Table 9.1 Pros and cons of COTS Pros
Cons
programmable and flexible
complex infrastructure including I/O control and protocols high speed data buses high speed memory and memory control
robust to technology obsolescence reuse of previously developed software essential in design trajectory of VLSI custom architecture (search for trade-off between flexibility and modularity, parallelisation options)
multi-DSP infrastructure requires extra-overhead which brings to a decline of ideal linear increment of computational power
Without going into the details, which are described in Reference 8, the main conclusions of the work are the following. The experimental work done suggests mapping the systolic array for the QRD algorithm onto an MIMD machine configured as a triangular array. An achievable data throughput is in the order of 1OkHz for a STAP with MN = 16 and 120 PEs using the MEIKO Computing Surface. A data throughput of the order of 100 kHz should be feasible either with advanced transputers or with devices like the Texas TMS320C40. These conclusions, which date back ten years, should be reconsidered in the light of the more powerful COTS (commercial off the shelf) machine available today; see Section 9.7. A preliminary evaluation of the pros and cons of the hardware implementation of STAP based on COTS is summarised in Table 9.1.
9.6
Experiments with VLSI-based CORDIC board
To explore the possibility of achieving better computational performance and using compacter systems - for installation in an operational radar - a QRD algorithm has been mapped onto an application specific prototyping platform which contains four VLSI CORDIC ASICs (application specific integrated circuits) and some FPGAs (field programmable gate arrays) [H]; this work was done in cooperation with the Technical University of Delft (The Netherlands). For details on the CORDIC algorithm and its use in adaptive beamforming see Sections 9.13 and 9.14, Appendices C and D, respectively. The test-bed platform mainly consists of a large (modular) memory buffer that is connected to a Sun workstation via a VME (versa module eurocard) bus. The memory buffer stores data that flow through the application board, back into the buffer. The application board consists of four CORDIC processors which are mesh-connected. These four processors perform complex rotations on two-dimensional complex vectors. The CORDIC processor is a pipeline processor operating in block floating
point. The physical connections between the CORDIC have been implemented via Xilinx chips. In the benchmark described in Reference 11, the triangular systolic array was mapped onto the 2 x 2 CORDIC application board of the tested platform. This four CORDIC mesh corresponds functionally to one of the processing nodes constituting the triangular systolic array. However, as the CORDIC processors are pipelined processors, many of these rotations can be performed at a very high throughput rate (the clock rate of the pipe), provided a schedule can be found such that the pipe can be kept filled. Such a schedule can indeed be found for the QRD algorithm. The results of the benchmark may be briefly summarised as follows. With a 100 per cent pipeline utilisation of the CORDIC, the throughput can be computed simply as: throughput =
clockfreqCORDIC —\ number of rotations
(9.7)
where clockfreqCORDIC is the clock frequency of the CORDIC processor (only 5 MHz in the experiments, just to show that no extremal values are needed), and number of rotations is the number of rotations (vectorisations included) for the case where we simulate a system of MN degrees of freedom. For MN = 1 0 the throughput is approximately 8OkHz, i.e. 80000 input vectors could be processed per second, which is better than the results reported in Section 9.5 where larger computers and higher clock frequencies were used. In a non-experimental implementation of the CORDIC system described in this section, clock frequencies up to 40 MHz are easily achievable; this would improve the throughput even further within a factor of 8. Table 9.2 summarises the pros and cons of the hardware implementation of STAP based on custom VLSI. Selection between COTS and VLSI is still an open question; the specialised technical literature reports descriptions of experimental systems using both the two technologies: a consensus has not been found yet on which technology to use, even though the trend seems today in favour of COTS. Further considerations on this problem are reported in Section 9.7. Also Section 9.14, Appendix E, lists several implementation examples of STAP taken by the open technical literature; these testify the wide spectrum of technologies used.
Table 9.2 Pros and cons of VLSI Pros
Cons
extremely high throughput (bulk processing) limited size and power consumption
low degree of flexibility expensive for limited number of pieces to produce
9.7
Modern signal processing technology overview and its impact on real-time STAP
In the chapter we have mentioned the role of the VLSI custom chip and the relevance of chips that may implement the CORDIC algorithm. In this section we give attention to more commercial technology and evaluate its possible use for real-time STAR The impact of modern signal processing technology on real-time STAP is provided in this section to complement the algorithmic aspects of STAP described up to now in the chapter. We summarise the state of the art of relevant devices for signal processing and the way to design complex signal processing schemes which are of interest for the real-time implementation of STAR Perhaps one of the most significant advances in radar in the past 30 years has been the application of digital technology to allow the radar designer to make practical what in the past were only academic curiosities. An impressive drawing of the advancement of digital technology is in Figure 9.5. It illustrates Moore's law, named for Intel cofounder Gordon Moore, which predicts that transistor density on microprocessors will double every 18 months. This prediction so far has proven amazingly accurate. In recent years the processing technology adopted for radar systems has evolved along the following steps: • •
design and implementation of proprietary circuits which, however, have the following cons: high life cycle cost and obsolescence proprietary circuits exploiting the digital signal processing (DSP) devices available from the market pentium processor 1993 intel 486 microprocessor
number of transistors 10 million
286 microprocessor 1982 1 million 8086 microprocessor 1978 - i n t e l 386 microprocessor 1985 microprocessor 8080 microprocessor 1974
Figure 9.5
Moore s Law, named for Intel co-founder Gordon Moore, predicts that transistor density on microprocessors will double every 18 months. This prediction, so far, has proven amazingly accurate. If it continues, Intel processors should contain between 50 and 100 million transistors by the turn of the century (From IEEE Aerosp. Electron. Syst. Mag., October 2000, 15, (10), Jubilee Issue, p. 13 (©2000 IEEE))
•
•
a tremendous interest in COTS technology which, however, has the following cons: need of engineering resources to track technology evolution and developing tools the more recent system COTS (SCOTS) also embedding communication, operative system and developing tools.
Today modern radar systems, including those with adaptive features, require a wide spectrum of technologies; e.g. ASIC, FPGA, DSP, fibre optic communication channel to use each one matched to a suitable purpose. Examples of the application of heterogeneous technology in adaptive radar, and also STAP, are: • • • •
ASIC for analogue to digital converter (ADC), filtering and channel equalisation DSP for matrix algebra calculation reduced instruction set computer (RISC) for data processing fibre optic for communication channels to distribute/collect data and command.
A taste of figures for recent technologies are (this is a not exhaustive list and by no means a commercial indication, but just a sample taken from the specialised literature): •
• • •
Analog Device's very recent processing board is characterised by the following features: eight ADSP-TSlOl Tiger Shark give 9000MFlops, the processor is running at 250 MHz, has 64 bits at 66 MHz on compact PCI (peripheral computer interconnect), with three banks of memory with 64 Mbytes; it is programmable in C, with library and software tools in Sharklab linked to Matlab status of the art for FPGA (year 2002): 10 Mgates, they are reprogrammable via software status of the art for ADC technology: 14 bits at 100 MHz sampling rate, 10 bits at 1.5GHz, 8 bits at 3 GHz status of the art for FFT: DoubleBW 1 K complex points FFT with windowing in 10 iis [21].
These technologies are heterogeneous but need to be approached, in the design phase of the whole radar system, in an homogeneous fashion; this has prompted the so-called concurrent design technology, or system codesign. It is a methodological approach based on: tools for cost estimate and analysis of requirements (e.g. Rational Rose), algorithmic analysis (with Matlab and Simulink), functional design (Ptolemy II, System C, Handel C), core library etc. In more detail, the rationale of codesign technology is the following. Systems are becoming more and more complex in terms of both functionality and hardware architecture. The need to include interaction with other design domains, such as data processing, control processing, input/output etc. is also increasing. Therefore there is a need for a true system level design capability which not only allows the design/simulation of the constituent parts of the system but also their interaction. To bring together the different design domains, system design languages are being developed which allow a model of the behaviour of the entire system to be generated, thus speeding up the design of the entire system by finding problems early in the design cycle rather than at the system integration stage.
Codesign techniques permit the functional specification to be explicitly mapped onto a model of a candidate architecture, in terms of both functional processing, memory access and communication interfaces between hardware and software elements. The resulting partitioned design can then be analysed for performance and the suitability of the candidate architectures investigated. Modifications of the architectural structure and mapping of the functional specification onto the modified architecture can then be made until the design meets the requirements. The level of abstraction of the architecture model can then be increased until the designer is satisfied that an implementation of the design will be 'right first time'. In addition to this simulation capability, codesign methodologies and supporting tools need to be able to export the design information to hardware-software coverification environments and implementation tools.
9.8
Processing of recorded live data
The data recorded by the Naval Research Laboratory (NRL-USA) airborne multichannel radar system have been processed by a systolic trapezoidal array which implements the STAP [9]. The performance of the algorithm has been evaluated against ground clutter, littoral clutter and jammer. Vehicular traffic has also been detected. The systolic array processing has been emulated with a MATLAB software tool. The airborne radar system used by NRL for its STAP flight test program is a modified AN/APS-125 surveillance radar; the operating frequency is 420-450 MHz. The side-looking linear array consists often hooked dipole antennas spaced approximately a half wave length apart, mounted in a 90° corner reflector to provide elevation pattern shaping. The two outer dipoles are terminated yielding eight channels with roughly equivalent element patterns and — 3 dB beam widths of 80° for both azimuth and elevation. The array was energised with a high power corporate feed which applied a taper on transmit such that the maximum azimuth sidelobe level is 25 dB down with respect to the main beam. The receiving system consists of eight identical channels with each channel having a UHF preamplifier, mixer, VHF amplifier bandpass filter and a synchronous demodulator. The synchronous demodulator consists of two demodulators, one referenced to the coherent oscillator (COHO) and the other referenced to the COHO shifted by 90°. This yields two bipolar video channels, one in phase (I), the other quadrature phase (Q). Each I and Q signal is converted to digital by a 10 bit, 5 MHz ADC. The radar pulse repetition frequency (PRF) is 300/750 pps. The output of the receiving system is 16 digital channels for a total digital word width of 160 bits with a clock of 200 ns. This yields a data bandwidth of 800 Mbps which is buffered in real time and stored on magnetic tape.
9.8.1 Systolic algorithm for live data processing As indicated in Figure 9.2, the radar has an array of N = 8 antennas and receiving channels. Each of these receives M echoes from a transmitted train of M (up to 18 in the actual radar) coherent pulses with a PRI (pulse repetition interval) of T = Kz s
where r is the Nyquist sampling period (i.e. typically, the range cell duration) and K is the range cell number in the PRI. The STAP provides a two-dimensional filter in the direction of arrival (DoA) Doppler frequency (/b) plane with a main beam focused towards the target and a wide null in the regions of the ' D O A - / D ' plane containing the interference. QRD constitutes the fundamental component of the voltage-domain algorithm. It operates recursively by using each snapshot of data to update the online estimation of the disturbing environment without forming the interference covariance matrix and only requires 0(N2M2) arithmetic operations to be performed every sample time. The scheme of Figure 9.2 has been applied to the data recorded by the NRL radar.
9.8.2 Data files used in the data reduction experiments This section describes the data files, recorded by NRL radar, used for space-time processing experiments. The files refer to ground clutter, land-sea clutter interface and jamming. The following information has been extracted by the data files, namely: (i) echo power in a radar receiving channel versus range, (ii) the probability density function (PDF) of the amplitude and phase of the radar echoes, (iii) the eigenvalues spectrum, and (iv) the two-dimensional power spectral density of the clutter versus / D and DoA. In this chapter, just a subset of this information is enclosed. 9.8.2.1 Ground clutter Two data files were examined, namely DL050 and DL087. For these files we have calculated the amplitude and phase histograms of the radar echoes. The histograms have been estimated using 896 echoes along range. The amplitude histograms show visually a good fit with the Rayleigh PDF. One more test to verify whether the histogram adequately matches the Rayleigh PDF is to calculate the mean to median ratio. The estimated value is 1.115, and the exact value is 1.442. The histogram of phase is approximately uniform. For file DL050 the spectrum of eigenvalues of the interference covariance matrix is reported in Figure 9.6. The number of antennas is 8, and the number of PRIs is the parameter of the curves ranging from 1 to 18. The covariance matrix has been estimated by averaging 896 independent samples along range, and the maximum eigenvalue has been normalised to O dB. The minimum eigenvalue, corresponding to the curve labelled '18', gives a good estimate of the noise floor in each receiving channel; before normalisation this value is about 10 dB. The clutter-plus-noise power value amounts to 45 dB in each receiving channel; this value has been determined by averaging along range the received signal on the first antenna. Thus the input average clutter-to-noise power ratio has been assumed to be equal to 35 dB for each receiving channels, i.e. for each antenna and for each range sample. 9.8.2.2 Land-sea clutter Figure 9.7 portrays the power versus range of the echoes collected by the first antenna for the data DR075. At the 480th range cell the transition from sea to land is clearly visible. The sea clutter power, estimated along the first 200 range cells,
amplitude of eigenvalues, dB
number of eigenvalues
Eigenvalue spectrum for data file DL050 (ground clutter) [9]. The parameter of the curves represents the number of PRIs
power (1st ant.)
Figure 9.6
range
Figure 9.7
Power versus range of the radar echoes collected by the first antenna of the array [9], At the 480th range cell the transition from sea to land is clearly visible
amplitude of eigenvalues, dB
number of eigenvalues
Figure 9.8
Spectrum of eigenvalues of jamming interference. Curve a: N = 8 antennas and M = I PRI; curve b: N = 8, M = 2 [9]
amounts to 12.8 dB. The land clutter power estimated from 600th to 800th range cells measures 30.2 dB. 9.8.2.3 Jamming The data file DWO15 refers to jamming overland. The jammer appears at the end fire, i.e. DoA = 90°. Figure 9.8 reports the eigenvalues (normalised to OdB) of the estimated covariance matrix (over 300 range cells) for N = 8 antennas and M = 1 PRI (curve a) and N = S and M = I (curve b). The presence of one principal eigenvalue in curve a indicates the presence of one jamming source. We also estimate (over 300 range cells) from the data file that the jammer plus noise power is equal to 36.5 dB. The thermal noise, evaluated by the minimum eigenvalue of the interference covariance matrix is 30 dB. Thus the jammer-to-noise power ratio is 6.5 dB.
9.8.3 Performance evaluation The detection performance of the systolic array of Figure 9.2 depends upon the array parameters, the interference environment and the target signal features. The parameters defining the trapezoidal array are: (i) the dimension NM of the data snapshot vector which equals the number of input lines to the triangular systolic canceller, (ii) the forgetting factor (which controls the adaptation speed of the canceller) of the QRD canceller, and (iii) the number L of linear columns for constraints. Synthetic targets as well as signals injected into the receiver are used to determine the integration of target echoes. Performance during steady state is measured in
terms of: (a) improvement factor (IF) defined as the ratio of the signal-to-total disturbance power ratios at the output and input of STAP, (b) visibility curve, i.e. IF versus target / D sweeping across the PRF, and (c) the two-dimensional response of the adaptive system versus DoA and /D9.8.3.1 Performance against ground clutter Consider the file DL087. Assume a trapezoidal array with one antenna (N = 1), eighteen pulses (M = 18) and L = 3 linear columns (processing cells DE of Figure 9.2). The constraints in the three columns are set to detect a target having the following Doppler frequencies: 0.5 PRF, 0.25 PRF and 0 PRF. A synthetic target having a Doppler frequency value of 0.5 PRF was added at the 264th range cell. Figures 9.9a, b and c show the power in dB of the residue signals at the output of the three columns. Note that the target echo appears only in Figure 9.9a as expected (being the constraint set at / b = 0.5 PRF); the estimated IF is 35 dB. 9.8.3.2 Performance against sea-land clutter The file DR075 contains a test target, injected in the receiver at the 3547th range cell. The Doppler frequency of the target is 0.5 PRF and the DoA is 0°. Figure 9.10 portrays the power in dB of the residue signal obtained by adaptively processing the echoes received by N = S and M = 18 PRIs. The trapezoidal systolic array has one vertical column (L = 1) with the constraints / D = 0.5 PRF and DoA = 0° which are fully matched to the target signal. The spike appears at the 3691st cell which differs from the original target range due to the space-time filter delay which is equal to the total number of degrees of freedom, i.e. 144. The visibility curve for a fictitious target having DoA = 0° and Doppler frequency sweeping across the radar PRF is reported in Figure 9.11; the visibility curve is approximately flat except around / D = 0 which is the mean Doppler frequency of clutter after compensation of the platform speed. The maximum value of the clutter cancellation equals the clutter-to-noise power ratio which has been estimated to be 23.9 dB while the maximum gain in the target direction of arrival and Doppler frequency is equal to 21.5 dB having used all the 144 degrees of freedom; this results in an IF of 45.5 dB. From the visibility curve the maximum IF value amounts to 44 dB, while the optimum IF would be 45.5 dB which is just few dBs higher than the values shown in visibility curve. 9.8.3.3 Performance against jammer The improvement factor of an array of N = 8 antennas, one PRI (M = 1) and one column constraint (the constraint is set along the expected target direction of arrival) is shown in Figure 9.12 as a function of the DoA of a simulated target scanning the angular interval [-90°, +90°]. The jammer is that described in Section 9.8.2.3. It is noted that the maximum IF is about 13 dB, while the optimum IF value would be 17 dB. The 4 dB loss is due to the adaptation of the systolic arrays [9, p. 600]. We note that adaptation loss is higher for jamming than for clutter; possible explanations are the following: (i) the number of spatial degrees of freedom is 8, which is lower than
residue power, dB
target
residue power, dB
range cells
residue power, dB
range cells
c
Figure 9.9
range cells
Processing of ground clutter live data [9]
target
power, dB
initialisation of space-time filter
range
Power of residue signal for data file DRO 75 [9]
IF, dB
Figure 9.10
F
Figure 9.11
Doppler/PRF
Visibility curve for data file DR075 [9]
the number of temporal degrees of freedom (=18), (ii) the jammer-to-thermal noise power has been estimated as being equal to 6.5 dB [9, p. 598], considerably lower than the clutter-to-thermal noise power. This will cause higher loss due to the need for precise estimation of jamming direction of arrival.
degree
Figure 9.12 IF versus DoA of a simulated target against jamming [9] 9.8.4
Detection of vehicular traffic
The detection of vehicular traffic has been attempted along US route 50 (see, for details, Reference 9). Four points on the route have been selected (bearing angle relative to the array normal, with positive values coming from the right-hand side of the array): 1st point: range = 39268 m, azimuth = —5.8° 2nd point: range = 39268 m, azimuth = —3.4° 3rd point: range = 39429 m, azimuth = -0.6° 4th point: range = 39429 m, azimuth =1.0°. The systolic array processes the snapshots along the range cells received by 8 antennas and 18 PRIs (i.e. it works with the maximum number of adaptive degrees of freedom). The adapted residue along the range cells has been further processed by a constant false alarm rate (CFAR) thresholding device based on the cell average (CA) technique. The CFAR-CA has two guard range cells on each side of the range cell under test and twenty range cells on each side to estimate the detection threshold. The CFAR-CA has been set to guarantee a PFA of 10~4. Figure 9.13 depicts the adapted residue versus range when the receiving antenna pattern is focused at —5.8°, which is the azimuth value corresponding to the first point on the US route 50. The analysed Doppler frequency is 0.225 PRF which corresponds to a radial speed of 23.2 m/s (i.e. 83.52 km/h) compatible with vehicular traffic. A detection appears at the 932th range cell that (it can be shown) comfortably compares with the expected location of the target in the first point. Similar results have been obtained for the other three points on the US route 50 [9].
range cells
Figure 9.13 Adapted residue power and detection threshold curves versus range cell [9]
9.9
Concluding remarks
The research work described in this chapter and the enclosed references is also relevant for other radar applications, sometimes simpler than the STAP, namely (i) ground-based or ship-borne radars for clutter cancellation and (ii) ground-based or ship-borne radars equipped with a multichannel phased array antenna for jamming cancellation. The STAP reverts to the first application by setting Af = 1, while it becomes the second application for M = 1. Thus, the adaptive processing architectures described in this chapter are applicable also to the systems in (i) and (ii). In general, the number of degrees of freedom involved is one order of magnitude less than for the STAP case; this makes less critical the implementation of a VLSI-based systolic array. A practical application of systolic processing for classical ground-based or ship-borne radar is described in Reference 19 where it is shown how to combine in one systolic scheme the two functions of adaptive interference cancellation and sidelobe blanking. The application of STAP to synthetic aperture radar for detecting and imaging of slowly moving targets is discussed in References 7, 15 and Chapter 3 of this book. In this respect the procedure to form the SAR image by one-bit processing plays a role; this procedure is also applied in along-track interferometry (ATI)-SAR to detect moving targets [20]. It can be shown that this approach offers a considerable computational advantage; FPGA technology has been successfully applied to implementing one-bit SAR processing. The enormous progress made in signal processing technology is under our eyes; this progress is also exploited and, at the same time, motivated by STAR Today the
key words are: heterogeneous processing (i.e. based on VLSI, ASIC, FPGA, RISC, MEMS, photonic technology etc.), virtual and rapid prototyping, modularity and flexibility of processing architectures, reuse and porting of the same, COTS approach to software and hardware, software language (e.g. System C; Handel C for FPGA), design tools like Ptolemy. All these techniques and technologies are conceived to counteract the obsolescence which is one of the most important problems to face today in signal processing.
9.10
Appendix A: Givens rotations and systolic implementation of sidelobe canceller
The QRD, which mainly performs orthogonal rotations, may be efficiently implemented by a recursive application of Givens rotations. A complex Givens rotation is an elementary transformation of the form: (9.8) where /3 is a scaling factor. The rotation coefficients, c and s, satisfy: (9.9) These relationships uniquely specify the rotation coefficients, c and s: (9.10) (9.11) The QRD by Givens rotations may efficiently be mapped onto a systolic array computer. A systolic computer is an array of processing cells. Each cell has a local memory and is connected with its neighbouring cells in the form of a regular grid. The more common configurations of the systolic array are linear and triangular. The operations performed by the systolic array are synchronised by a clock. On each clock cycle, every cell receives data from its neighbouring cells and performs operations. The resulting data are stored within the cell and passed to the neighbouring cells on the next clock cycle. The triangular systolic array is shown in Figure 9.14 for the simple case of an SLC (sidelobe canceller) system equipped with one auxiliary antenna. The triangular systolic array comprises three types of computational cell: the boundary cell (circular cell in Figure 9.14), the internal cell (rectangular cell in Figure 9.14) and the final cell which is a simple two-input multiplier. The function of each computational cell is specified in the Figure. Each cell performs the specified functions on its input data and delivers the appropriate output values to the neighbouring cells. The least-squares residue constitutes the noise-reduced output signal from the adaptive beamformer.
boundary cell
otherwise
internal cell
radio frequency (RF), intermediate frequency (IF) and baseband (BB) receiving channels
residual
Figure 9.14
Implementation of an SLC with one auxiliary antenna by means of a triangular systolic array (Adapted from WARD et al., IEEE Trans Antennas Propag., AP-34, (3), March 1986, pp. 338-346, (©1986 IEEE))
The processing scheme can be applied to the STAP case as shown in Figure 9.15. One limitation of this scheme derives from the need to reinitialise the systolic array every time that the look direction for searching the target is changed; in fact, the processor of Figure 9.15 focuses one quiescent beam to gather the echoes from a target possibly present in a specific region of the space-time plane. Because the direction of arrival and the Doppler frequency values of the target are not known a priori, it needs to monitor a number of lines of residue power for target detection purpose. The above mentioned limitation is overcome with the scheme of Figure 9.2.
triangular systolic array
processing elements
Figure 9.15
9.11
residue
Calculation of the adapted STAP residue via a sequence of Givens rotations implemented with a triangular systolic array. Comparison of residue to a detection threshold X to check for the two alternative hypotheses: H\ (target presence) versus HQ (no target)
Appendix B: lattice working principle
The lattice systolic array requires three working phases to correctly process the data samples. Indicate with i — PRT (pulse repetition time: PRT = PRI) the data matrix having dimensions TV (rows) and k (columns) containing the data pertinent to the /th pulse; N denotes the number of antennas and k the number of data snapshots to process in the adaptation phase. Indicate also with (/ — j) PRT the data matrix (of dimension N by k), pertinent to the data of the ith PRT after the application (by means of the squared array) of the rotation coefficients computed in the triangular array starting from the data captured during the jth PRT. For the study case described here the number (M) of PRI is three. The first working phase of the lattice processor is shown in Figure 9.16. The triangular and squared systolic processors, also shown in Figure 9.3, have dimension N by N; the number of operations of this phase is k by O(N2). Following the above mentioned notations, the second lattice working phase is reported in Figure 9.17; note that this phase costs 3k O(N2) operations. Finally, in Figure 9.18 is presented the third lattice work phase. Note also that this phase costs 3k O(N2) operations. The total number of operations is Ik O(N2) which is approximately 2 by M (i.e the number of pulses) by k O(N2); thus, the number of operations k 0(M2N2) required by the full triangular array of Figure 9.2 of the text has been reduced.
1 st phase data matrix of 2 PRT
data matrix of 1 PRT
squared array of processing elements data matrix of (1-2) PRT total number of operations = k O(N2)
Figure 9.16
1st working phase of lattice
2nd phase
(2-3) PRT total number of operations = k O(N2)
(3-2) PRT total number of operations = k Q(N2)
(l-2)-(3-2)PRT total number of operations = k O(N2)
Figure 9.17
9Al
2nd working phase of lattice
Appendix C: the CORDIC algorithm
Since QR-based algorithms mainly perform orthogonal rotations (see Section 9.10, Appendix A), the CORDIC algorithm may be selected for implementing the processing cells of the above described systolic arrays. A further motivation comes from the fact that some CORDIC-based VLSI processor arrays have already been developed for radar and more general signal processing applications.
3rd phase 3 PRT fictitious PRT = O total number of operations = k 0(N2)
(2-3) PRT total number of operations = k O(N2)
(l-2)-(3-2)PRT total number of operations = k O(N2)
Figure 9.18
3rd working phase of lattice
The basic idea underlying CORDIC [22] is to decompose a desired rotation angle O into a weighted sum of a given number n (e.g. n > 6) of predefined elementary rotation angles Qf(O, such that the overall rotation can be carried out via a sequence of n shift-and-add operations, called /JL-rotations. More specifically, given the vector Xin of components [x in ,y in ], the CORDIC algorithm transforms it, by means of a sequence of /x-rotations, into a new vector xout = [xout, youtlThe rotations are attained through the following operations: Xout = *in COSQf(Z) -
)>jit SnIQf(O
yout = xin sin Qf(Z) - yin cos a(i) ' IgVd) = Pd)I-1 pd) = ±1 = sign(xin)sign(yin)
^AZ)
The CORDIC algorithm can operate either in rotating or in vectoring modes. In rotating mode, the rotation angle 6 encoded by the sequence a(i) is applied to the vector [xinyin\T to give a new rotated vector [xoutyoutY'• Conversely, in vectoring mode, the CORDIC algorithm computes the coding sequence of the rotations which when applied to [xinyin\T yields \xout 0 ] r . The input/output description of the CORDIC cell considered for implementation of systolic algorithms is given in Figure 9.19 where m is a control bit selecting either the vectoring or rotating modes, and a represents the /[x-rotations sequence. The CORDIC algorithm operates on real valued vectors while, in adaptive beamforming, complex valued vectors must be handled. In particular, for the vectoring
Figure 9.19
Schematic of CORDIC processing function
mode
mode 0-CORDIC
-CORDIC
CORDIC
Re(r)
ImO) Re
Figure 9.20
twl
Im[X0UtI
CORDIC supercell for circular rotation on complex valued numbers. Adapted from Figure 4 of: CM. Rader, Wafer-Scale Integration of a Large Scale Systolic Array for Adaptive Nulling', The Lincoln Laboratory Journal, 1991, 4, (1), pp. 3-29
mode, a unitary transformation Q must be computed, which annihilates the second component of a given vector [rx]T, with r e R (i.e. r is a real valued number) and x e C (i.e. x is a complex valued number). This step occurs in QR and inverse QR algorithms [14, equation 37]. It is possible to see that such a transformation can be represented as [14]: Q
Tcos0 -[-sin0
sin0iri 0 I cos0][o e-J°\
(9 13)
'
with 0 — arc ^g (Im Qt)/Re (x)) and 0 = arcfg(|;t|/r). Subsequently, in the rotating mode the transformation Q can be applied to a generic two-dimensional complex vector. An example of a processor realising rotations on complex valued vectors, and using the same structure for vectoring and rotating mode, is shown in Figure 9.20. The processor is obtained by interconnecting three CORDIC cells, of the type of
Figure 9.19, named O and 0 CORDICs and two registers to store the real and imaginary parts of r (r e C when the processor is working in rotating mode). For the vectoring mode, this architecture allows us to annihilate the second component of a given vector [rx]T. In more detail, with reference to Figure 9.20 (r e R, thus Im(r) = 0), the imaginary part of x is annihilated in the 0 CORX)IC cell and, subsequently, \x\ is annihilated in the left 0 CORDIC cell. The right 0 CORDIC cell is not operating. The Q transformation of equation 9.13 is therefore coded by the sequence (fie, / ^ ) and they will be applied in the rotating mode. In the rotating mode the rotation angle 6 is applied to the incoming vector [Re(jc) Im(x)] r in the O CORDIC and, subsequently, the rotation angle 0 is applied by the left and right 0 CORDIC cells, respectively, to the real and imaginary parts of r (now r 6 C, as requested by QR and IQR algorithms) and the rotated vector x.
9.13
Appendix D: the SLC implementation via CORDIC algorithm
The sidelobe canceller (SLC) may be implemented by means of the CORDIC cells described in Section 9.12. Figure 9.21 refers to the case of a main antenna and one auxiliary antenna (the same application example as in Section 9.10, Appendix A, Figure 9.14). On the left-hand side of the Figure, the SLC algorithm is realised by means of a systolic triangular array computing the Givens rotations. On the right-hand side of Figure 9.21, the same algorithm is implemented by means of the CORDIC arithmetic. The architecture is basically composed of four computational cells having as input the data from the main and the auxiliary antennas, plus a computational cell auxiliary channel
main channel
auxiliary channel
main channel
<j> CORDIC vectoring
(f> CORDIC rotating
9 CORDIC vectoring
6 CORDIC rotating
residue
residue
Figure 9.21
The SLC implemented with CORDIC algorithm
6 CORDIC rotating
required for normalisation of the residue. The cells may work either in vectoring mode or in rotating mode: the following operations are performed in vectoring mode. Given the vector [x(0), y(0)] the rotation sequence is computed such that: (9.14) where ns is the required number of rotations. In rotating mode, given the sequence /x(7), the following operations are applied: y(ns)
= V(O) - QfX(O)
(9.15)
The cells (numbers 1 and 3 in the Figure) working in vectoring mode operate directly on the auxiliary data, computing the /x-rotations required to annihilate the second coordinate of the complex data xi = Qtir,Jti/), which changes in x'i = (x'lr,0). The rotating cells (numbers 2 and 4 in the Figure) operate on the main channel data applying the /x-rotations computed by the adjacent vectoring cells; the remaining computational cell (number 5 in the Figure) works as the previous ones and is required to normalise the cancellation residue.
9.14
Appendix E: an example of existing processors for STAP
This appendix summarises in a tabular form some of the most relevant processors for STAP as they are perceived by the authors in the open technical literature. The table is organised in three columns that report, respectively, the name of the processor and of the organisation, the features and some relevant references. The table entries are in accordance to the publication date of the references.
Name
Features
References
Rome Air Development Center RADC (USA)
Systolic array implemented with digital (chips from ESL Company) and optical technologies. QU factorisation based on Givens method; systolic weight computer and a digital weight applier; ESL systolic chip is a custom VLSI chip with 32-bits floating points. Acousto-optic adaptive processor: use of Bragg cells, a liquid spatial light modulator and an
LIS, S. etal.: 'Digital and optical systolic architectures for airborne adaptive radars'. Agard conference proceedings 381, Multifunction radar for airborne applications, pp. 18-1, 18-13, 1986
Name
Features
References
optical detector; GaAlAs semiconductor diode laser provides the illumination for the time integration correlator. Hazeltine, funded by RADC (USA)
Systolic array brass-board of 1.25 billion floating-point operations/s; solution of multiple simultaneous equations with twelve unknowns; weight update every 50 |xs. In 1988 brass-board was integrated into the flexible adaptive spatial signal processor test bed of RADC.
LACKEY, R. J., BAURLE, H. R, and BARILE, J.: 'Application specific supercomputer', SPIE, 977, Real Time Signal Processing XI, 1988, pp. 187-195
MUSE, MIT-Lincoln Laboratory (USA)
Matrix update systolic experiment (MUSE) for 64 degrees of freedom. 96 CORDIC processors update the 64 weights - to apply, say, to 64 receiving channels - on the basis of 300 new observations in only 6.7 ms; equivalent to 2.8 Ginstructions/s. MUSE has been realised on a single large wafer of 5 in by 5 in by using restructurable VLSI. Each CORDIC cell has 54 000 CMOS transistors. 50 dB of signal-to-interference-plus-noise power ratio is achievable. Mapping strategies and estimation of achieved computational throughput of QR decomposition by using: Meiko surface computer with 128 processing elements (PEs), nCube2 with 64 PEs, connection machine CM-200 with 8192 PEs, and MasPar MP-I with 1024 PEs. Application to an adaptive array of antennas and STAR
RADER, C. M.: 'Wafer scale integration of a large scale systolic array for adaptive nulling', Line. Lab. J., 1991, 4,(1), pp. 3-29 RADER, C. M.: 'VLSI systolic arrays for adaptive nulling', IEEE Signal Process. Mag., July 1996, pp. 29-49
Benchmarks on general purpose parallel computers (It)
D ' A C I E R N O , A.,
CECCARELLI, M., FARINA, A., PETROSINO, A., and TIMMONERI, L.: 'Mapping QR decomposition on parallel computers: a study case for radar applications', IEICE Trans. Commun., October 1994, E77-B, (10), pp. 1264-1271
References
Name
Features
Test bed (MIT-Lincoln Laboratory)
One of the first world-class systems MARTINEZ, D. R. and McPHEE, J. V.: ever developed to demonstrate an end-to-end real-time STAP 'Real-time test bed for processing capability designed STAP'. IEEE 1994 around commercial off-the-shelf Adaptive antenna integrated hardware components. systems symposium, The peak throughput amounts to Long Island, 7-8 22 Gops/s. The system is divided November 1994, into front end digital system with pp. 135-141 capability of 19 Gop/s and a back MARTINEZ, D. R., end programmable processor with MOELLER, T. J., and capability of 2.5 Gflops/s. This back TEITELBAUM, K.: end is based on DSP TI TMS320C30 'Application of microprocessor. The processor reconfigurable architecture is sufficiently flexible in computing to a high programming to implement different performance front-end classes of STAP algorithms. The radar signal processor', mapping of algorithm is on identical J. VLSI Signal Process., PEs operating under the same May/June 2001, 28, set of program instructions (1-2) but on different data input streams. Application of reconfigurable computing to high throughput front-end radar.
CORDIC test-board (It, Ne)
A board with four VLSI CORDIC chips used to implement the QR decomposition for adaptive array processing; the estimation of achieved computational throughput is also obtained.
KAPTEIJN, P., TIMMONERI, L., DEPRETTERE, E., and FARINA, A.: 'Implementation of the recursive QR algorithm on a 2*2 CORDIC test-board: a study case for radar applications'. 25th European Microwave conference, 4-7 September 1995, Bologna, Italy
Name
Features
References
Maui High Performance Computer Center (MHPCC), MIT-Lincoln Laboratory (USA)
A network configuration with the 400-way IBM supercomputer SP2 hardware and software, visualisation hardware, mass storage system, file servers, parallel tools, compilers, preprocessors, debuggers, parallel operating environment etc. Crest environments I and II for STAP related to data of Mountaintop program.
1996 Adaptive Sensor Array Processing (ASAP) workshop, Maui High Performance Computer Center (MHPCC) Training Session, 12 March 1996
Mountaintop is UHF radar with 18 channels (16 of which are active) operating with the concept of inverse displaced phase centre antenna (IDPCA). An auxiliary transmitting aperture is added to a stationary radar. The auxiliary array is linear with the number of elements equal to the number of pulses in a coherent processing interval; the element spacing is chosen to create the desired aircraft motion. Mercury Computer Systems, Inc. (USA)
Description of strategies to distribute three-dimensional data set for STAP over available computing elements in a parallel computer system.
SKALABRIN, M. F. and EINSTEIN, T. H.: 'STAP processing on a multi-computer: distribution of 3-D data sets and processor allocation for optimum interprocessor communication'. Proceedings of Adaptive sensor array processing (ASAP) workshop, 13-15 March 1996, pp. 429-447
Name
Features
References
STAP on GP parallel computer, Honeywell Inc. (USA)
Coarse grained data flow mapping for STAP; up to 234 nodes on Rome Lab. 321 node machine; sequential C code for four benchmarks for Doppler filtering, adaptive processing etc.
SAMSON, R., GRIMM, D., MORRILL, K., and ANDRESEN, T.: 'STAP performance on a Paragon ™Touchstone system'. IEEE National Radar Conference, Natrad 96, Ann Arbor, MI, 13-16 May 1996, pp. 315-320
Massive parallel processor (USA)
Experience in porting the MTI-Lincoln Laboratory STAP benchmark programs onto the IBM-SP2, Cray T3D and Intel Paragon. Benchmark performance results along with scalability analysis on machine and problem size.
HWANG, K. and XU, Z.: 'Scalable parallel computers for real-time signal processing', IEEE Signal Process. Mag., July 1996, pp. 50-66
MeshSP, MIT-Lincoln Laboratory (USA)
McMAHON, J. O.: Investigates the application of commercially available massively 'Space-time adaptive parallel processors for STAR These processing on the mesh processors are sufficiently flexible to synchronous processor', accommodate different STAP MIT Line. Lab. J., 1996, architectures and algorithms and are 9, (2), pp. 131-152 scalable over a wide parameter space to support the requirements of different radar systems. The mesh synchronous processor is an SIMD architecture with an array of processors connected via a two-dimensional or three-dimensional nearest-neighbour mesh. It incorporates the single monolithic processor element (PE) of the Analog Devices ADSP-21060 SHARC. Each PE permits 120 Mflops/s peak performance and 512 kB of internal memory,
Name
Features
References
six processor communication ports, each capable of 40 MB/s peak throughput and two I/O ports, each capable of 5 MB/s peak throughput. A MeshSP processing board (7 inc by 13 inc) dissipates 100 W, contains 64 SHARCs and is capable of 7.7 Gflops/s peak performance. The higher order post Doppler (HOPD), an embodiment of STAP algorithm, has been mapped onto the MeshSP. In the HOPD the method for computing the QR decomposition of the data matrix has been the Householder reflection algorithm. It was found that the real-time processing for a study case of 48 channels, 128 Doppler bins and 1250 range gates required 16 boards for a total of 123 GFlops. ONEST: on line experimental space time, FGAN-FFM (Ge)
Implements algorithms of moving target extraction in SAR by means of STAR Use of heterogeneous DSP network, VME bus boards; Sharp processors for range and azimuth compression FFT; bunch of 24 -i- 32 TMS320C40 for filtering of slow moving targets; 16 TMS320C40 for moving target detection, position finding and concurrent SAR image generation.
JANSEN, W. and KIRCHNER, C : 'ONEST: concept of a real time SAR/MTI processor'. EUSAR96, Konigswinter, Germany, 1996, pp. 349-352
High Performance Computer (HPC), Rome Lab. (USA)
Honeywell ruggedised Touchstone used in four flight experiments, 2 5 + 4 processing nodes; each node has three i860 processors; 300 Mflops per node, 7.5 Gflops overall; weights update with QRD, sustained overall throughput 3.15 Gflops, efficiency 48 per cent. Data are passed from IF digital down conversion to the 29 nodes via a
LINDERMAN, M. H. and LINDERMAN, R. W.: 'Real time STAP demonstration on an embedded high performance computer'. IEEE National Radar Conference, Natrad 97, 13-15 May 1997,
Name
Features
References
high performance parallel interconnect (HiPPI) channel having 100 Mbytes/s. Hardware housed in two racks (19 in).
Syracuse, NY, pp. 54-59
Real-time multi-channel airborne radar measurements (RT-MCARM) with onboard Intel Paragon computer with 25 compute nodes running Sunmos operating systems. Each node has three i860 processors accessing common memory of 64 Mbytes as shared resource; two HiPPI; two service nodes. Linear speed up was obtained for up to 236 compute nodes. The adaptive beamforming is done applying the QRD. Real-time multichannel airborne radar measurements (RT-MCARM), Rome Lab. (USA)
In May-June 1996 Rome Laboratory conducted experiments of real-time STAP on board the BAC-III. Two processing chains have been tested contemporaneously: a) conventional analogue beamforming without STAP on a Mercury computer, b) 16 simultaneous beams provide digital data to the ruggedised 28 nodes of Paragon using SUNMOS as operating system and a Pentium PC to control radar, processing chain and display. Four flights included urban and rural clutter, land-sea interface, target was a Sabreliner, a moving target simulator and various targets of opportunity plus a CW jammer operating during random periods. Paragon functions: digital beamforming (six receiving beams within a wide transmitter beam),
CHOUDARY, A., LIAO, W-., WEINER, D. et al.: 'Design implementation and evaluation of parallel pipelined STAP on parallel computers', IEEE Trans. Aerosp. Electron. SySt., April 2000, 36, (2), pp. 528-548
LITTLE, M. V. and BERRY, W. P.: 'Real time multichannel airborne radar measurements'. IEEE national Radar conference, Natrad 97, 13-15 May 1997, Syracuse, NY, pp. 138-142
Name
Lockheed Martin (USA)
Features
References
pulse compression, two alternative STAP algorithms, CFAR detection, data recording. The flights demonstrated the feasibility of using high performance computer to conduct STAP of radar data in real time: a unique capability within DoD USA. High performance scalable computer MANSUR, H. H.: 4 (HPSC) employs 716 Analog STAP architecture Devices Share processing elements implementations using (PEs); two identical chassis and only high performance two board types are needed. scalable computer Sustained processing throughput of (HPCS)MEEE national the order of 32 Gflops. The Share Radar conference, PEs are grouped in four forming a Natrad97, 13-15 May processing node. Myrinet switching 1997, Syracuse, NY, technology (129Mbytes/s) provides pp. 325-330 the connections between the nodes. The recursive modified Gram Schmidt QR algorithm is implemented. The HPCS is currently (1997) under development.
References 1 FARINA, A.: 'Antenna based signal processing techniques for radar systems' (Artech House, 1992) 2 WARD, J.: 'Space-time adaptive processing for airborne radar'. MIT-Lincoln Laboratory, TR 1015, 13 December 1994 3 KLEMM, R.: 'Principles of space-time adaptive processing' (IEE, UK, 2002) 4 FARINA, A. and TIMMONERI, L.: 'Space-time processing for AEW radar'. Proceedings of international Radar conference, Radar 92, Brighton, UK, 12-13 October 1992, pp. 312-315 5 TIMMONERI, L., PROUDLER, I. K., FARINA, A., and McWHIRTER, J. C : 'QRD-based MVDR algorithm for multipulse antenna array signal processing', IEEProc. Radar, Sonar Navig., April 1994,141, (2), pp. 93-102 6 FARINA, A., BARBAROSSA, S., CECCARELLI, M., PETROSINO, A., TIMMONERI, L., and VINELLI, F.: 'Application of the extreme eigenvalue
7
8
9
10
11
12
13
14
15
16
17
18 19
analysis to signal and image processing for radar'. Invited paper, Colloque International sur Ie Radar, Paris, 3-6 May 1994, pp. 207-213 FARINA, A. and BARBAROSSA, S.: 'Space-time-frequency processing of synthetic aperture radar signals', IEEE Trans. Aerosp. Electron. Syst, 1994, 30, (2), pp. 341-358 D'ACIERNO, A., CECCARELLI, M., FARINA, A., PETROSINO, A., and TIMMONERI, L.: 'Mapping QR decomposition on parallel computers: a study case for radar applications', IEICE Trans. Commun., October 1994, E77-B, (10), pp.1264-1271 FARINA, A., GRAZIANO, R., LEE, F., and TIMMONERI, L.: 'Adaptive spacetime processing with systolic algorithm: experimental results using recorded live data'. Proceedings of the international conference on Radar, Radar 95, Washington DC, 8-11 May 1995, pp. 595-602 FARINA, A. and TIMMONERI, L.: 'Antenna based signal processing techniques and space-time processing'. Tutorial, international conference, Radar 95, Washington DC, USA, 8-11 May 1995 KAPTEIJIN, P., DEPRETTERE, E., TIMMONERI, L., and FARINA, A.: 'Implementation of the recursive QR algorithm on a 2 x 2 CORDIC test-board: a case study for radar application'. Proceeding of the 25th European Microwave conference, Bologna, Italy, 1995, pp. 490-^95 FARINA, A., SAVERIONE, A., and TIMMONERI, L.: The MVDR vectorial lattice applied to space-time processing for AEW radar with large instantaneous bandwidth', IEEProc, Radar Sonar Navig., February 1996,143, (1), pp. 4 1 ^ 6 FARINA, A. and TIMMONERI, L.: 'Parallel algorithms and processing architectures for space-time adaptive processing', CIE international conference on Radar, ICR96, Beijing, P. R. of China, invited paper for the workshop on STAP, 1996, pp. 771-774 BOLLINI, P., CHISCI, L., FARINA, A., GIANNELLI, M., TIMMONERI, L., and ZAPPA, G.: 'QR versus IQR algorithms for adaptive signal processing: performance evaluation for radar applications', IEE Proc, Radar Sonar Navig., October 1996,143, (5), pp. 328-340 LOMBARDO, P. and FARINA, A.: 'Dual antenna baseline optimization for SAR detection of moving targets'. Proceedings of ICSP96, Beijing, RR. of China, 1996, pp. 431-433 FARINA, A. and LOMBARDO, P.: 'Space-time adaptive signal processing'. Tutorial, IEE international conference on Radar, Radar 97, Edinburgh, UK, 13 October 1997 FARINA, A. and TIMMONERI, L.: 'Real time STAP techniques'. Proceedings of IEE symposium on Space time adaptive processing, London, 6th April 1998, pp. 3/1-3/7 FARINA, A. and TIMMONERI, L.: 'Real time STAP techniques', Electron. Commun. Eng. J., Special Issue on STAP, February 1999,11, (1), pp. 13-22 FARINA, A. and TIMMONERI, L.: 'Systolic schemes for joint SLB, SLC and adaptive phased-array'. Proceedings of the IEEE international conference Radar 2000, Washington DC, 7-12 May 2000, pp. 602-607
20 PASCAZIO, V., SCHIRINZI, G., and FARINA, A.: 'Moving target detection by along track interferometry'. IGARSS 2001, Sidney, Australia, July 2001 21 BIERENS, L.: DoubleBW Systems B.V., May 2002, Private communication www.doublebw.com/brochures.htm 22 VOLDER, J. E.: 'The CORDIC trigonometric computing technique', IRE Trans. Electron. Computers, September 1959, pp. 330-334
Part IV
Clutter inhomogeneities
Chapter 10
STAP in heterogeneous clutter environments William L. Melvin
10.1
Introduction
Aerospace radar systems must detect a variety of target types in the presence of severe, dynamic clutter and jamming signals. Signal diversity - the exploitation of azimuthal, elevation, Doppler, range and polarisation measurement spaces - is a necessary component of advanced detection architectures. Space-time adaptive processing1 (STAP) improves the detection of slow moving and/or low radar cross section (RCS) targets competing with mainlobe and sidelobe ground clutter returns [I]. Additionally, in light of the equivalence between the maximum signal-to-interference-plus-noise ratio (SINR) filter and the minimum variance beamformer, we recognise STAP as a member of the class of superresolution algorithms [2]. For this reason, STAP is a key element of radar systems whose electrically small apertures, and hence relatively large beamwidths, would otherwise seriously affect clutter-limited detection performance. Adaptive filters adjust their response in accord with estimates of interference characteristics. A critical distinction exists between optimal and adaptive filters. Specifically, the optimum filter design requires clairvoyant knowledge of interference statistics (e.g. known covariance matrix), while the adaptive implementation relies on necessarily imperfect estimates of unknown interference parameters. A training stage generates estimates of these unknown parameters. Hence, STAP is a datadomain implementation of the optimum filter. The optimum filter defines the upper bound on STAP's detection performance potential. In the multivariate Gaussian case, maximising SINR equivalently maximises the probability of detection (PD) for a fixed probability of false alarm (PFA) [I]1 In the case of ground clutter suppression, we consider spatial and slow time (Doppler) degrees of freedom, thereby taking advantage of the clutter's angle-Doppler coupling. On the other hand, cancellation of jammer multipath requires spatial and fast time (range) degrees of freedom
Consider an N-channel array receiving M pulses. Given the space-time snapshot for the &th range, Xk e cMNxl, the optimal weight vector in the maximum SINR sense takes the well known form, Wk = /xQ^ 1 S 8 -^ where \i is an arbitrary scalar; Qk = £{Xk///0x£yH } e cNMxNM; Xk/H0 is the zero-mean, interference-only (nullhypothesis, Ho) space-time snapshot; and, ss_t € CNMxl is the target space-time signal vector [3]. In practice, both Qk and ss_t are unknown, and so the adaptive processor substitutes the estimate Qk for Qk and the surrogate steering vector vs_t for ss_t; the adaptive weight vector is then Wk = AQk vs-t> where it is common to set l//x = y v ^ Q ^ V s - t in an attempt to normalise the output noise to unit power. Using secondary (auxiliary, training) data taken from other range cells within the CPI, the processor computes Qk. Ideally, secondary data exhibit statistical behaviour identical to the null-hypothesis condition characterising the primary (test) range cell k. The bandwidth and PRF limit the maximum number of unambiguous range cells in the coherent dwell to Qtot = Ru/^R total vectors, where R11 is the unambiguous range extent and AR is range resolution.2 Just as the optimal processor bounds the performance of the STAP, finite sample support of <2tot — 1 independent and identically distributed (HD) secondary data vectors limits the capability of the adaptive processor. Moreover, suppose the receiver bandwidth B is 5 MHz, indicating AR = 30 m. If M = 32 and Af = 11 (typical values), then sample support of twice the processor's degrees of freedom (DoF) is 2NM = 704; in this case, training data come from a range interval of 21.1 km! Based on the variable nature of ground clutter, it is highly unlikely that data exhibit statistical similarity over this range swath. Even minimal STAP training intervals - arising, for example, from the application of reduced-dimension/reduced-rank methods [ 3 , 4 ] - require relatively large numbers of samples and may suffer from discrete outliers. Additionally, limitations on computing capability may restrict training to block selection, further increasing the likelihood of mismatch between primary and secondary data features. STAP dynamically adjusts its two-dimensional frequency response, H(fs, / D ) = w^v s _ t (/y, / D ) , where fs and / D are spatial and Doppler frequencies, to suppress ground clutter returns coupled in angle and Doppler. When the clutter angle-Doppler properties vary over range, filter mismatch occurs and the output SINR deviates from a value one predicts under the HD assumption. Two factors lead to range-varying angle-Doppler behaviour: changing cultural features, such as clutter reflectivity or shadowing, and non-ideal sensor geometry. In the latter instance, non-linear or canted arrays, combined with deviation from the perfectly side-looking (zero crab) case, result in non-proportionality between clutter Doppler and spatial frequencies [3,5,6]. This type of variation is deterministic when platform velocity vector and array manifold are known, and so the STAP radar system designer has some control over this non-stationary effect. In contrast, changing cultural behaviour is environmentally driven, leading to covariance matrix estimation errors (viz., E[Q]x] ^ Qk) For example, eclipsing, pulse compression, sensitivity time control and antenna pattern gain variation decrease the number of usable range cells
and consequent STAP performance loss. We refer to this latter effect as clutter heterogeneity. References 7-15 highlight the impact of heterogeneous clutter effects on adaptive radar performance. Nitzberg considers the impact of amplitude heterogeneous clutter on improvement factor (IF) when using adaptive Doppler processing, consequently finding small losses as long as the processor accurately estimates clutter centre Doppler frequency and spectral width [7]. Armstrong et al. extend Nitzberg's work by including simultaneous amplitude and spectral mismatch, thereafter reporting significant IF degradation of the adaptive Doppler processor and proposing some ameliorating techniques, including non-adaptive prefiltering [8]. Futernik and Haimovich consider STAP performance in the presence of edge transition and Weibull range-dependent clutter effects [9]; they compare the performance of sample matrix inversion, the generalised likelihood ratio test and the eigencanceller [16], reporting superior performance for the latter processing method. In References 10 and 11, Melvin et al. hypothesise a taxonomy of space-time clutter heterogeneity and compute STAP performance loss for several cases, including range-angle varying amplitude fluctuations, range-angle varying spectral mismatch, clutter-to-noise ratio (CNR) dependent variations over range leading to amplitude and spectral mismatch, edge effects, and target-like signals in the secondary data (TSD). Kalson considers a parameterised generalised likelihood ratio test (GLRT), where adjustment of the parameter minimises the impact of mismatched signals - such as sidelobe discretes and TSD on detection performance [12]; the TSD problem is described further in Reference 13. Cai and Wang describe the impact of signal contamination and clutter edge heterogeneity on the performance of the generalised likelihood ratio test (GLRT) STAP detection algorithm in Reference 14. McDonald and Blum present exact expressions describing STAP performance in the presence of steering vector and clutter mismatch in Reference 15, further performing sensitivity analysis on detector characteristics in light of this mismatch.
10.1.1 Adaptivity with finite sample support Several different approaches are available for computing the adaptive weight vector. Among the competing methods, the maximum likelihood estimate (MLE) is the most popular [17] and is given by:3
(10.1)
where Xs e CNMx@ is the collection of secondary data vectors. Substituting Q^ 1 for Q^ 1 leads to the sample matrix inverse (SMI) formulation. Reed, Mallett and 3
This formulation assumes the secondary data are zero mean, Gaussian and HD
Brennan further demonstrate in Reference 17 that: P(wk;vs_t = ss_t) =
' ^ T " ^
(10-2)
^YiVA I Wk
where w k and Wk correspond to STAP and optimal weight vectors, is bound between zero and unity and follows a beta distribution, in which case: (10.3) The famous Reed-Mallett-Brennan (RMB) rule follows from equation (10.3): to achieve an average SINR loss of 3 dB between optimal and adaptive implementations, set Q = 2NM — 3. The SMI approach is preferable to other weight vector calculation schemes since convergence of the weight vector to the optimal condition depends solely on the number of HD secondary data vectors used in equation (10.1). In Reference 18, Boroson derives the variance of equation (10.2) under the conditions of unknown signal direction or corruption of the training data by the desired signal. When the clutter and interference is of low numerical rank, alternate processing approaches can yield 3 dB average SINR loss with sample support relaxing to 2/c, where K = rank(Qk) [19]. Clutter is often of low numerical rank due to redundant spatio-temporal sampling of the environment [3,16]. Each narrowband noise jamming source contributes M independent components to the space-time rank due to the complete decorrelation of the jamming source in slow time [3]. An estimate of the interference rank for an environment composed of ground clutter and / noise jammers is then: (10.4) where |"-~| denotes rounding to the next highest integer. The first term on the right-hand side of equation (10.4) is known as Brennan's rule [3]. Due to the generally low numerical rank of clutter, alternate STAP techniques with reduced degrees of freedom provide acceptable performance while minimising sample support requirements. Additionally, weight computation strategies involving diagonal loading or aperture smoothing tend to reduce requisite sample support with potentially small penalties in performance. Section 10.9.2 describes these issues in more detail.
10.1.2 STAP performance metrics The radar detection problem involves binary hypothesis testing to determine either target absence (Ho) or target presence (H\). The two hypotheses are given as: # o : x k = c k + j k + nk Hl:xk = s + xk/H0
^'D)
c k , j k and n k represent clutter, interference and noise snapshots, while s is the target signal vector. In the non-fluctuating target case, s = ys s _t(/s,/b), where y
is a complex scalar with unknown peak amplitude and uniformly distributed phase (n.b., since it is understood that the space-time steering vector ss_t is a function of angle and Doppler, we often neglect the corresponding argument as a matter of notational convenience). By exploiting signal diversity, STAP greatly improves detection performance over a variety of competing methods: it enables endoclutter detection performance by overcoming diffraction-limited characteristics of the spacetime receive aperture, suppresses sidelobe clutter masking the detection of low radar cross section targets, and supports simultaneous ground clutter and noise jammer cancellation. This section highlights key STAP metrics. Consider an arbitrary weight vector, Wk, applied to the space-time snapshot and thus yielding the output: (10.6) An optimum detection statistic for equation (10.6), under the Gaussian assumption X k/Ho ~ CN(Q, Qk), follows from the likelihood ratio test and appears as [1,3]:
(10.7) The performance of equation (10.7) is given by:
(10.8)
where PFA is the probability of false alarm, Po is the probability of detection, fir is a normalised detection threshold, /o(-) is the modified zero-order Bessel function of the first kind and a equals the square-root of the peak output signal-to-interferenceplus-noise ratio (SINR).4 In light of equation (10.6), we find: (10.9)
4
Peak (signal amplitude) SINR is often used in detection analyses. Most radar and communication engineers prefer to use power, and so the corresponding SINR equals one-half the peak value. By using the term SINR in this chapter, we imply the use of power
CJ-2 — £[|y| 2 /2] is the single channel, single pulse signal power. Equation (10.8) is a monotonic function of a, and hence a2. Thus, maximising SINR likewise maximises PD for a fixed value of PFA- For this reason, SINR is a critical STAP metric. When comparing algorithms or determining the impact of training on performance, it is convenient to define SINR loss factors [3]. Two commonly used SINR loss factors are LSi\ (/?, / b ) and Ls^(fs, Jb) I each loss term is bound between zero and unity. LSfi(fS9fv) compares clutter/interference-limited performance to the noiselimited condition for arbitrary weight vector WR: (10.10) Q s = cr2ss-ts^_t is the signal correlation matrix and a2 represents the single channel, single pulse noise variance. Upon substituting Wk = wopt, where wopt = MQ1^* s s-t is the optimal weight vector, equation (10.10) specifies the upper bound on performance in the maximum SINR sense. Since computing w opt requires the known covariance matrix, LSi\ (fs, / D ) is sometimes called the clairvoyant SINR loss. On the other hand, Ls^(fs, / D ) determines the loss between an implementation requiring estimated statistics and the clairvoyant case (e.g. adaptive versus optimum): (10.11) In the absence of steering vector mismatch, we find Ls^(fs, Jb) = p(w k ;v s - t = s s _ t ). Radar system characteristics, the severity of the interference environment, and the particular signal processing approach all strongly influence LSi\ (fs, /b) . Accurate parameter estimation relies on both fidelity and quantity of secondary data; typical radar environments provide limited, heterogeneous secondary data, leading to smaller values ofLSi2(fs, / D ) . Since: (10.12) with SNR(fs) the angle-dependent signal-to-noise ratio, those effects reducing Ls,2(fs, / D ) will likewise reduce PD in accordance with equation (10.8). For nonadaptive signal processing methods, such as displaced phase centre antenna (DPCA) processing or digital beamforming cascaded with a weighted Doppler processor,
Lsa(fS9jb) = Wf59Jp. Those target velocities closest to the dominant clutter component, and exhibiting SINRloss above some acceptable value, viz. Ls^\ (/?, JD)-LS^(JS, JD) > £, determine the minimum discernible velocity (MDV). For example, suppose we calculate SNR to be 13 dB, thereby yielding PD = 0.87 for PfA — IE — 6 according to equation (10.8). If our minimum detection requirement is PD — 0.5 for this same false alarm rate, then SINR must be greater than or equal to 11.25 dB. This indicates a tolerable combined SINR loss of 1.75 dB, or e = 0.668.
The notion of SINR loss equally applies to fluctuating target models. For example, in the Swerling I case: (10.13) where o^ is the sum of (Gaussian) clutter, noise and interference variances and a^vg denotes the average SINR computed from equation (10.12) after replacing SNR with the value computed using the average target radar cross section. Improvement factor (IF) is given as: (10.14) with O^ representing the total clutter single pulse power received by a single element. In the noise-limited case, equation (10.14) defaults to the space-time integration gain (nominally, NM). IF closely relates to the preceding SINR loss definitions.
10.1.3
Covariance matrix errors
The adaptive filter employs training data to estimate the unknown covariance matrix, as equation (10.1) suggests. In an HD training environment, the finite sample support leads to differences between estimated and actual covariance matrices, Qk and Qk. These differences consequently result in mismatch between optimum and adaptive filter responses. Reed, Mallett and Brennan characterised the impact of covariance errors by deriving the distribution for the SINR loss term given in equation (10.2) under homogeneous, yet finite, training support conditions. Low sample support conditions tend to perturb the noise floor estimate, leading to a substantial degree of loss. A variety of STAP methods improve upon the convergence rate identified by the RMB rule; Section 10.9.2 suggests some minimal sample support methods. In contrast, heterogeneous clutter environments can lead to significant deviation between Qk and Qk. The impact of covariance matrix error depends not only on the error itself, but also on the angle-Doppler location of the target with respect to the mismatch. Suppose: Ak ^ Q k 1 - Q k 1
(10.15)
which exists when both inverses exist. The output SINR of the adaptive processor is (10.16) Letting vs_t = ss_t, the SINR loss due to the covariance mismatch alone is:
(10.17)
Under the matched condition, Qk -> Qk, and so Ak = 0 and Ls -> 1. Otherwise, equation (10.17) suggests an SINR loss dependent on both the magnitude of the co variance error - given by || Ak || ^, for instance - and the projection of the target steering vector onto the columns of the error matrix, s^ t Ak- For example, a large clutter discrete substantially displaced from the target look-direction may affect performance much less than a smaller discrete at an angle-Doppler location in proximity to the target. By specifying the specific nature of Ak, equation (10.17) can be used to characterise the impact of certain forms of clutter heterogeneity on STAP performance. We accomplish this goal implicitly in subsequent sections.
10.2
Classes of space-time clutter heterogeneity
A general model for the clutter space-time snapshot is: N0
N
P
Ck = ^^r,oc/k.m^n(cLt(k\m,n)
O st(./b/*;#w,/i))
m=\ n=\
( a s ( ^ , n ) 0 s s ( / f c n ) )
(10.18)
where we approximate a continuum of statistically independent clutter scatterers along an unambiguous isorange contour k by a summation accounting for Na rangefolded cells, each composed of Np discrete, independent clutter patches. Additionally, ^c/k-m n describes the average power for the mnth. clutter patch and fcth range, at(k; m, n) is a vector describing the normalised pulse-to-pulse voltages, St(fD/k;m,n) is the length M clutter patch temporal steering vector, as(k;m,n) accounts for spatial decorrelation, ss(fs/k;m,n) is the length TV clutter patch spatial steering vector, and O and ® denote Schur and Kronecker products. Also, At = E[cttct^] and As = E[asa^] are the corresponding temporal and spatial correlation matrices. In the HD case, the terms comprising equation (10.18) exhibit no dependence on k. The clutter-plus-noise space-time snapshot follows from equation (10.18): Xk = Ck -b nk, where nk ~ CAf(O, (J^INM), &n *s t n e n °ise variance, Xk ~ CN(O, Qc/k + O^INM), andQc/k = £[c k c^]. The clutter voltage decorrelates from pulse-to-pulse due to intrinsic clutter motion (ICM), system jitter, antenna flexing and rotation, and radar cross section scintillation. ICM is a function of the type of scatterer, e.g. moving vegetation or ocean waves lead to Doppler fluctuations, whereas 'hard' scatterers, such as buildings, yield narrower Doppler spectra. Commonly used ICM models include the Gaussian [20,8,10J and exponential [21] correlation functions. The frequency spectrum of the Gaussian model is: (10.19)
where |gol2 is an arbitrary power gain, Ao is the centre wavelength, / is frequency and av is the RMS velocity spread in metres per second [20]; Reference 21 describes a contrasting model. The correlation coefficient is the normalised inverse Fourier transform of equation (10.19); it is Gaussian too, taking the form: (10.20) /o = COO/2TZ is the centre frequency. The argument in equation (10.20) corresponds to the i — yth pulse pair. The pulse-to-pulse correlation matrix is then: A t = Toeplitz(p/(0), P1(I)9 P1(I)9...,
Pl(M
- I))
(10.21)
By summing the variances, equation (10.20) accommodates multiple, independent sources of temporal decorrelation, i.e. o\ = Y^m a\im' The channel-to-channel voltage decorrelates due to dispersive effects. Assuming a bandlimited receive waveform - appearing flat over the bandwidth &>o — TCB < a* < coo + xB, where co is the frequency in radians - yields the corresponding spatial correlation coefficient of the form: ps(m,n) = smc(7tB(rm - Xn))
(10.22)
where xp is the time delay between the pth channel and a suitable reference point due to direction of arrival (DoA). Bandwidth, the DoA embodied in the time delay, and the size of the array (influencing the maximum time difference between samples) affect ps (m, n). The channel-to-channel correlation matrix is then:
(10.23)
which is Toeplitz if the spatial sampling is uniform. (Non-dispersive errors yield non-Toeplitz structure; the model of equation (10.18) can incorporate such errors independent of equation (10.23).) As subsequently shown in Section 10.8.1, the model given by equation (10.18) is a good approximation to actual measured data. To summarise, the key components of the clutter space-time model include: the amplitude scaling term <7c/k;m,n used to set the clutter patch power; the particular spatial and temporal response of the patch, fs/k;m,n and fo/kim^nl the RMS spectral spread of the patch, ov/k;m,n\ the spatial decorrelation term of equation (10.22) characterising the process as(k;m,n). Range variation among these components leads to heterogeneous clutter conditions. Table 10.1 provides a taxonomy of predominant heterogeneous clutter effects and also lists the corresponding cause and effect relationship. This Table includes target-like signals - a significant nuisance effect, especially for ground moving target indication (GMTI) radar - as a related effect. Additionally, the clutter angleDoppler loci can vary over range due to non-linear array geometries, non-side-looking
Table 10.1
Taxonomy of clutter heterogeneity and related effects
Heterogeneity type
Causes
Impact on adaptive radar
Amplitude (range variation in
shadowing and obscuration, range-angle dependent change in clutter reflectivity, strong stationary discretes, sea spikes, urban centres, land-sea interfaces etc.
null depth depends on eigenvalue ratio - MLE 'averaging' leads to underestimated eigenvalue magnitude and, consequently, uncancelled clutter and increased false alarm rate null width set to mean spread too narrow for some range cells, too wide for others thereby leading to either increased clutter residue or signal cancellation; degrades MDV same impact as spectral mismatch
a
c/k;m,n)
Spectral (range variation in Gv/k;m,n)
CNR-dependent spectral mismatch (range variation in <*c/k\mji
leads t0
apparent spectral width variation) Moving scatterers
Some other effects (e.g. variation in the pair, (fs/k;m,rn fD/k\m,n))
intrinsic clutter motion due to soft scatterers (trees, windblown fields etc.), ocean waves, weather effects
modulation of principal components and other low power signal terms rise above the noise floor with increases in CNR ground traffic, weather, insects and birds, air vehicles chaff, hot clutter, multibounce/multipath, impact of platform geometry (e.g. non-side-looking or bistatic) on angle-Doppler behaviour over range
mainlobe nulling, false sidelobe target declarations, distorted beampatterns, exhausts DoF combination of above effects
arrays, or bistatic sensor configuration [3,5,6]. Section 10.7 briefly describes the cause of range variation in the angle-Doppler pair (fs/k;m,n, fD/t,m,n) for monostatic radar. Clutter heterogeneity can occur among the training data, within the cell under test, or simultaneously among both training and test cells. At issue here is the difference between average and instantaneous performance. The covariance estimate resulting from equation (10.1) captures the average nature of the training data set. For this reason, the estimated covariance matrix can appear significantly mismatched with respect to the interference in any given test cell. As well as designing for worse case circumstances, one should also consider performance achievable in any given test cell as an upper bound.
10.2.1
General simulation characteristics
In the ensuing sections we further consider the forms of clutter heterogeneity listed in Table 10.1. Nominal simulation results correspond to a 'typical' airborne radar scenario with the following salient parameters: L-band transmit frequency, 1.2 m by 1.2 m side-looking array (8 elements in azimuth by 8 elements in elevation), eight azimuthally oriented spatial channels (N = 8), 25 dB Taylor weighting on transmit, cosine-squared element pattern (180° null beamwidth), no weighting on receive, PRF set slightly higher than the DPCA condition, sixteen pulse dwell (M = 16), platform velocity of lOOm/s and height of 3000 m, and clutter reflectivity varied to yield a CNR of value specified within the text. It is also worthwhile at this stage to note the following: we use both asymptotic and finite sample support analysis in succeeding sections of this chapter. In the asymptotic case we let Qk -> £[QkL thereby allowing us to examine the expected impact of heterogeneity on detection performance. The finite sample support condition, on the other hand, employs a training interval to compute a covariance estimate, thus enabling STAP performance characterisation in the most natural adaptive implementation. We identify the corresponding analysis as belonging to either class asymptotic or finite sample support - as the various examples appear within the text, and include the number of training samples for the finite sample case. The simulation results provide a controlled environment since we are able to specify the known, ideal covariance matrix Qk.
10.3
Amplitude heterogeneity
Amplitude heterogeneity is perhaps the most obvious source of mismatch occurring in adaptive radar signal environments. In this section we examine three different views of amplitude variation affecting practical STAP implementation: (a) strong discrete clutter returns in an otherwise homogeneous texture; (b) range-angle variation of clutter reflectivity; (c) abrupt changes in the clutter environment due to transition or edge regions.
10.3.1
Clutter discretes
Strong clutter discretes can corrupt the cell under test (CUT), the training data, or both. For simplicity, assume a space-time processor operating in a homogeneous, distributed clutter background with either a strong discrete in the CUT or a strong discrete in the training data. In the former case, residual clutter degrades SINR and also increases PFA • The latter case results in overnulling and the potential for signal cancellation. Suppose a clutter discrete resides in the CUT. The corresponding null-hypothesis covariance matrix takes the form: Qk = Qc+n/k + crJsdS^
(10.24)
where Qc+n/k is the covariance matrix of the homogeneous, distributed clutter-plusnoise component5 and crj is the power of the discrete clutter return with space-time signal vector Sd located on the clutter ridge. Given a homogeneous training set, we let Wk -» Wk = £[wk] = MQc^ n /k v s-t t 0 evaluate asymptotic behaviour. The corresponding asymptotic SINR loss is:
(10.25) Letting vs_t = ss_t, equation (10.25) expands to: 6
(10.26) \/2
1/2
Defining p = Qc_|_n/kss-t and u = Qc+n/kSd> equation (10.26) can be written: (10.27) where cos 2 (p,u) = Ip^up/HpH^llull^ and sin 2 (p,u) = 1 — cos 2 (p,u). From equation (10.27) we make the following expected observations: 0 < Ls < l;whenaj = 0, we find Ls = 1; when Sd = vs_t (u = p) and a^ J=. 0, then Ls = 1; as aj —• oo, Ls -> 0; while in general, as a j ^9L5 | as long as Sd ^= v s _ t . We numerically verified the equivalence between equations (10.27) and (10.25). Discretes in the CUT will adversely affect the false alarm rate. Suppose the design threshold, vj, is set for the expected output noise power: (10.28) whereas the actual output noise power in the CUT is: (10.29)
5 In this case, Q c + n /k = Qc+n V k since we assume the distributed clutter environment is homogeneous ^ The SINR loss due to target mismatch is typically small in comparison with the loss associated with clutter heterogeneity. Since clutter heterogeneity is our focus, we'll assume perfect match between hypothesised and actual space-time steering vectors
The expected and actual false alarm rates, PfA,e and PFA,CI, are then: (10.30) from which we find (10.31) PFA,e serves as the design false alarm rate in this instance. Equation (10.31) indicates the following: (a) if Pa = Pe, then PFA,a = PFAy, (b) if Pe > Pa, then PFA,a < PFA*',
and, more importantly for the case of a discrete in the CUT, (c) if Pe < Pa, then PFA,a > PFA,e- Also, as a result of the exponential term, a small mismatch between the design and actual power leads to large changes in false alarm rate. For the case with the discrete in the training interval, we let w^ -* ^k — £[Wk] = ^(Qc+n/k+a^SdS^)" 1 v s _ t , where a j incorporates the 'averaging' process, to evaluate asymptotic performance degradation. The corresponding SINR loss is:
(10.32) Equation (10.32) can be rewritten:
(10.33)
Qc+n/ksd- Using the prior definitions for p and u, equation (10.33) becomes: (10.34) We observe the following from equation (10.33): 0 < Ls < 1; Ls = 1 when o2d = 0; Ls = 1 when Sd = vs_t (u = p) and
Rather than increase the false alarm rate, a discrete residing in the training data generally leads to an upward bias of the threshold. Consequently, target masking is a concern in this situation. Ideally, we want PFA,O, — PFA,e- Equation (10.30) then indicates a threshold bias of ^/Pe/ Pa from the optimal setting, viz. vj = V Pe I ^a VT, optimal •
SINR loss, dB
Consider the typical airborne radar configuration described in Section 10.2.1. The clutter reflectivity is set to yield approximately 64 dB integrated CNR. The nominal slant range of interest is 32 km. We generate a clutter discrete of specified power <7j and angular position and compute the impact on STAP performance for the two cases described above: clutter discrete in the cell under test and clutter discrete in the training data (post averaging). Figure 10.1 shows the consequent asymptotic loss in the former case for varying values of aj over azimuth angle measured from the antenna boresight. As expected, loss depends on both power and position of the discrete, as equation (10.27) suggests; the loss is seen to be substantial in some regions. In contrast, Figure 10.2 shows the asymptotic loss in the latter case, where the discrete resides in the training data segment. The loss in this latter instance tends to be quite small, but the reader should keep in mind the somewhat ideal nature of the otherwise homogeneous, distributed clutter environment. Subsequent sections describe other forms of heterogeneity that, in combination, further exacerbate STAP performance. Finally, Figure 10.3 suggests the very sensitive dependence of the probability of false alarm, P ^ , on the output noise residue; from this Figure it is seen that only 1-2 dB of residue leads to a tremendous increase in PFA-
target location: antenna boresight (270°) 200 Hz Doppler (endoclutter)
discrete's azimuthal location, deg
Figure 10.1
SINR loss due to discrete in CUT
SINR loss, dB
target location: antenna boresight (270°) 200Hz Doppler (endoclutter)
discrete's azimuthal location, deg
Figure 10.2
SINR loss due to discrete in training data
probability of false alarm
actual P F A design P F A
actual output noise power, dB
Figure 10.3
Impact of discrete on false alarm rate
10.3.2 Range-angle varying clutter RCS In Reference 7, Nitzberg analyses adaptive Doppler filter performance degradation due to clutter power variation in range by considering the range-dependent, temporal covariance matrix: Qk = PcJcQx+ o*lM
(10.35)
where pc,k is the clutter power at the £th range with Weibull or gamma probability density, Q x is a normalised clutter temporal covariance matrix and a% is the uncorrelated noise power. The Fourier transform of Q x describes the normalised clutter power spectrum. Expression (10.35) is a composite model: each temporal snapshot is multivariate complex Gaussian, but non-Gaussian fluctuation in clutter radar cross section leads to power fluctuation over range. The corresponding asymptotic MLE is: (10.36) Generally, the mean clutter power will not precisely match the instantaneous clutter power at any given range, i.e. pc ^ /?c?£, and so estimated and actual covariance matrices are non-convergent. The consequent mismatch between adaptive and optimal filters leads to detection performance degradation. As described in Reference 10, the model of equation (10.35) applies under the following circumstances: (a) the spectral width and location of correlated clutter returns in the frequency domain appear fixed over range; (b) variation in clutter reflectivity is constant with azimuth, thereby implying uniform scaling of all clutter subspaces. In practice, we expect spectral width to vary with clutter type and environmental conditions, and clutter power to exhibit dependency over both range and angle. Shadowing, obscuration and varying clutter types lead to apparent changes in clutter RCS with range and angle. Additionally, non-side-looking sensor configurations induce angle-Doppler contour dependency with range irrespective of any cultural variations (see Section 10.7 and References 3, 5 and 6). Returning to the space-time case, equation (10.18) accommodates range-angle variation of clutter power and spectral width through terms (Jt,m,n and at(k;m,n). Assume narrowband conditions and no other effects giving rise to spatial decorrelation so that ocs (k; m9n) = 1. Further, note that clutter reflectivity, yc, is constant over range and angle in the homogeneous case. The single channel, single pulse clutter-to-noise ratio for the mnth patch at range k is then approximated as:
(10.37) where Pt is peak transmit power, Gt{-) is transmit antenna gain, ((pm,n,0k;m) is the azimuth-elevation pair, r is slant range, i/fg/k is the grazing angle, Acc is the clutter
patch area, A. is wavelength, gr(-) is the receive channel antenna gain, Nin is the input noise power, Fn is the receiver noise figure and Lr/ represents radiofrequency (RF) system losses [20]. The constant gamma model, ao = yc sin^g/fc), evident in equation (10.37), was proposed by Barton based on extensive analysis of measured data [22]; this model is commonly used in radar system analysis. Sensitivity time control is used to equalise range-dependent space loss [20]. Antenna gain characteristics and the impact of grazing angle on clutter backscatter determine the range-angle variability of patch clutter power under the homogeneous assumption. To accommodate heterogeneous clutter reflectivity, let yc/k-m,n represent the reflectivity of the mnth clutter patch at range k. Heterogeneous clutter power follows from equation (10.37) as: 2
(Gt((/)m,n,0t,m)gr((f)m,n,0k;m)
K G °c/k;m,n c/k;m,n = = K\\ (
~4 ~A
sin(^rg/ik) \
— J) Yc/k;m,n Yc/k;m,n
(10.38)
where K\ is a constant. The fluctuations embodied in yk;m,n result in range-dependent amplitude variation across the clutter ridge. Suppose the range-angle variation in clutter reflectivity - and hence, clutter power - follows the gamma probability distribution: ,v p(ycc)) = p(y
kVY
11
(~Yc\ ^ - a-\ I (-Yc\ — vr exp —z— —z— ;
~ E[yc]22 £[/c] = aa = ;
ar £~ vvar[y l>c]c] 6= = B
,m-iON (10.39)
K J T(S)F \ P ) var[yc ]' H E[yc] where a and ft are shape parameters. The selection of the gamma distribution represents a likely choice for overland surveillance [7,8], Notice that as var[}/c] approaches zero while the mean is held constant, the environment tends to the homogeneous case. Figure 10.4 shows finite sample support SINR loss based on 150 Monte Carlo trials for varying levels of clutter amplitude heterogeneity for our nominal airborne radar simulation example. Figure 10.5 depicts the CNR sample mean, standard deviation (STD) and maximum values specified on a single channel, single pulse basis. The basic analysis procedure, described in Reference 10, involves the following:
(a)
compute 2MN + 1 space-time realisations for the nominal airborne radar scenario, where the clutter reflectivity for each clutter patch is randomly chosen to abide by the distribution of equation (10.39) with fixed mean and specified variance (b) compute and save the clairvoyant covariance matrix for the first space-time realisation (c) compute the covariance matrix estimate using the MLE and the remaining 256 realisations (RMB loss in an HD environment is —2.96 dB in this instance) (d) compute the adaptive weight vector for the transmit angle over all Doppler (e) calculate SINR loss (f) repeat in Monte Carlo fashion, averaging the various trials. Results given in Figure 10.4 are consistent with those described by Nitzberg in Reference 7, and seem to suggest the robustness of STAP to range-angle varying clutter reflectivity. Generally, the adaptive filter's null depth inadequately suppresses
SINR loss, dB
(l)mean/std=1000 (2)mean/std=100 (3)mean/std=10 (4)mean/std = 2/3 (5)mean/std=l/2 (6)mean/std=l/3 (7)mean/std=l/10 (8)mean/std=l/12 RMB Rule (-2.96 dB)
Doppler filter
Figure 10.4
SINR loss due to range-angle varying clutter reflectivity following a gamma distribution ([10J, © 2000 IEEE)
large clutter fluctuations and so residual clutter leads to performance degradation. Figure 10.4 indicates that the predominant loss occurs about the centre of the clutter spectrum, with only slight impact on MDV in this instance. However, the reader should keep in mind that the Figure does not capably capture instantaneous performance (i.e. the Monte Carlo procedure smoothes the results) or the impact on false alarm rate.
10.3.3
Clutter edges
Shadowing and edges lead to abrupt changes in the clutter power profile over range. Terrain obscuration leads to shadowing, while littoral or rural-urban interfaces are examples of edges. Both amplitude and spectral characteristics can vary across clutter edges or transition regions. For the moment we consider amplitude effects. The power variation describing a linear transition between distinct clutter regions A and B takes the form:
(10.40)
CNR sample mean CNR sample std.
case number
CNR sample maximum
case number
Figure 10.5
Mean, standard deviation, and maximum single channel, single pulse CNR for example in Figure 10.4 ([1O], © 2000 IEEE)
with AAB — (PC/B — Pc/A)/(?B — VA) representing the change in power over the transition region. The asymptotic clutter covariance matrix is: (10.41) rm is the rath discrete range sample and Q cx is a normalised clutter covariance matrix, where tr(Qcx) = 1. If an abrupt edge characterises the clutter environment, equation (10.41) takes the form:
(10.42)
Both formulations (10.41) and (10.42) assume uniform scaling of all clutter subspaces. In the latter instance, QrcA and QrcB characterise the clutter covariance matrices of clutter snapshots taken from either region A (r^) or region B (r#), respectively.
Consider the following cases: (i)
Scenario A: primary region tr(QrcA) = 65 dB,
a 2 = 0.0001 m 2 /s 2
secondary region tr(QrcB) = 53.5 dB,
a 2 = 0.01 m 2 /s 2 .
(ii) Scenario B: primary region tr(QrcA) = 76.5 dB,
o2v = 0m 2 /s 2
secondary region tr(QrcB) = 43.5 dB,
a2v = 0.25 m 2 /s 2 .
Scenario C: Same as Scenario A, but with primary and secondary region characteristics reversed, (iv) Scenario D: Same as Scenario B, but with primary and secondary region characteristics reversed.
SINR loss, dB
SINR loss, dB
(iii)
Scenario A Scenario C Scenario D
Scenario B
Figure 10.6 Asymptotic SINR loss for clutter edge scenarios ([W]t O 2000 IEEE)
Figure 10.6 shows the asymptotic loss due to the four edge scenarios listed above for the worst case Doppler, with the computation assuming that the test cell resides in the primary region. (Li = QA, L = Q). The abscissa indicates the per cent of training data taken from the primary region. Based on the characteristics listed above, we anticipate small losses for Scenarios A and C: the degree of mismatch between the two scenarios is relatively small. Undernulled clutter is the culprit leading to the larger losses seen for Scenario B. In summary, the modest degree of mismatch among the four scenarios, and the otherwise idealised nature of the clutter environment and platform configuration, tends to limit the severity of the observed losses.
10.4
Spectral heterogeneity
SINR loss, dB
Spectral heterogeneity results from the varying range-angle responses of different clutter classes to environmental conditions [7,8,10]. For instance, the intrinsic clutter spread of a windblown field differs from that of an urban region. Adaptive radar design for systems operating in littoral zones, over sea clutter, in the presence of weather effects, or under otherwise diverse conditions, should be particularly mindful of range-varying spectral properties. Since the adaptive processor tends to a response characterising the average behaviour of clutter in the training region, spectral
mean/std=1000 mean/std = 10 mean/std = 2 mean/std=l/2 mean/std= 1/5 RMB rule (-2.96 dB)
Doppler filter
Figure 10.7
SINR loss for varying degrees of spectral heterogeneity ([10], O 2000 IEEE)
heterogeneity leads to mismatch between adaptive and optimal filter notch widths. The inappropriately set adaptive notch width either increases clutter residue or partially cancels the target signal. In this section we examine the impact of spectral heterogeneity on STAP performance using both finite sample support Monte Carlo analysis and asymptotic evaluation. Taking an approach similar to that of Section 10.3.2, suppose the RJVIS clutter spread exhibits range-angle dependence given by the gamma distribution of equation (10.39) with av replacing y. We set E[av] = 0.1 m/s and alter the variance to cover cases of increasing spectral heterogeneity. The single channel, single pulse CNR is held constant at 25 dB (46 dB integrated CNR). Figure 10.7 shows the SINR loss based on 150 Monte Carlo trials. As in the case of range-angle varying clutter reflectivity considered in Section 10.3.2, we generated 257 space-time realisations per trial, using the latter 256 vectors to estimate the covariance matrix. As seen from the Figure, losses are small for the more homogeneous cases where the gamma distribution shape parameters abide by mean/STD >2. In accord with our expectations, the loss is negligible in the bin encompassing main beam clutter (Doppler filter 1), and the greatest losses occur in adjacent bins (Doppler filters 2 and 16). The roughly 3 dB loss in the Doppler filters adjacent to main beam clutter for the most heterogeneous case clearly indicates the deleterious nature of spectral heterogeneity on the radar MDV. In general, we expect the impact of spectral heterogeneity to further depend on CNR (by typical accounts, 46 dB CNR is modest). We examine this issue in a slightly different context in the next section.
eigenvalue number
Figure 10.8
Eigenspectrafor varying levels ofspectral spread ([1OJ, © 2 000 IEEE)
SINR loss, dB
Next Page
CNR1 = 58 dB
Doppler filter number
Figure 10.9 Asymptotic SINR loss for ([10], ©2000 IEEE)
varying levels of spectral spread
Next, suppose we train the adaptive filter in a region whose dominant spectral features differ from the test cell region (e.g. littoral zone or rural-urban interface). Figure 10.8 depicts the clutter-plus-noise eigenspectra for six distinct training regions. We assume the primary data exhibit an RMS clutter velocity spread of 0.8 m/s. Next, we suppose the training region encompasses one of the six regions identified in Figure 10.8. Figure 10.9 depicts the corresponding asymptotic loss over sixteen Doppler filters for the varying levels of spectral heterogeneity. The integrated CNR is set to 58 dB, a 12 dB increase over the preceding example. No loss occurs when training in the region with 0.8 m/s RMS spread, since this represents the matched condition. For values of RMS velocity spread of less than 0.8 m/s, the clutter notch width is inadequate, thereby leading to increased clutter residue and losses up to 3 dB in addition to those losses already associated with finite training. In contrast, the penalty for training in a region with greater spectral spread than the primary data is an apparently lesser degree of degradation. However, the result is dependent on the temporal resolution; we anticipate an increasingly severe, observable loss upon resolving the clutter spectrum via a longer dwell time. Additionally, degradation is CNR-dependent.
10.5
CNR-induced spectral mismatch
CNR-induced spectral mismatch refers to the range variation of clutter spectral width due to fluctuating CNR. ICM, clutter scintillation, dispersion, timing jitter,
SINR loss, dB
Previous Page
CNR1 = 58 dB
Doppler filter number
Figure 10.9 Asymptotic SINR loss for ([10], ©2000 IEEE)
varying levels of spectral spread
Next, suppose we train the adaptive filter in a region whose dominant spectral features differ from the test cell region (e.g. littoral zone or rural-urban interface). Figure 10.8 depicts the clutter-plus-noise eigenspectra for six distinct training regions. We assume the primary data exhibit an RMS clutter velocity spread of 0.8 m/s. Next, we suppose the training region encompasses one of the six regions identified in Figure 10.8. Figure 10.9 depicts the corresponding asymptotic loss over sixteen Doppler filters for the varying levels of spectral heterogeneity. The integrated CNR is set to 58 dB, a 12 dB increase over the preceding example. No loss occurs when training in the region with 0.8 m/s RMS spread, since this represents the matched condition. For values of RMS velocity spread of less than 0.8 m/s, the clutter notch width is inadequate, thereby leading to increased clutter residue and losses up to 3 dB in addition to those losses already associated with finite training. In contrast, the penalty for training in a region with greater spectral spread than the primary data is an apparently lesser degree of degradation. However, the result is dependent on the temporal resolution; we anticipate an increasingly severe, observable loss upon resolving the clutter spectrum via a longer dwell time. Additionally, degradation is CNR-dependent.
10.5
CNR-induced spectral mismatch
CNR-induced spectral mismatch refers to the range variation of clutter spectral width due to fluctuating CNR. ICM, clutter scintillation, dispersion, timing jitter,
clutter-only, yc = -20 dB clutter-plus-noise, yc = -20dB clutter-only, yc = -l0 dB clutter-plus-noise, yc = -10 dB
noise floor
ranked eigenvalue number
Figure 10.10
Illustration of eigenspectra variation for clutter spectral spread of 1.5 m/s and variable CNR
antenna motion and system instabilities, for instance, lead to subspace leakage [23]. Expressions (10.21) and (10.23) effectively model the temporal and spatial spectral spreading mechanisms; their effective application takes the form of a covariance matrix taper (CMT) [23,24]. As CNR increases, modulated components rise above the noise floor, thereby altering the width of the clutter spectrum. The adaptive processor must alter its response to mitigate these new coloured noise subspaces. Let us reconsider our 'typical' airborne radar example with the clutter spectral spread nominally set to 1.5 m/s and variable CNR controlled by the clutter reflectivity, yc. Figure 10.10 shows the corresponding clutter-only and clutter-plus-noise eigenspectra. The CNR-dependent nature of the dimension of the clutter subspace is evident in the Figure, viz. the apparent clutter rank increases with CNR. An increase in the rank suggests the presence of additional signal components - the modulated components resulting from signal decorrelation - and a consequent increase in clutter Doppler spectral spread. Guerci and Bergin describe this effect in detail in Reference 23. System effects, scintillation, multipath and other practical, non-ideal effects influence the nature of the spectral spread. Thus, all other effects held constant, if the CNR varies over range simply due to varying clutter RCS, a heterogeneous clutter condition arises distinct from simple amplitude mismatch. The corresponding spectral mismatch leads to either increased clutter residue or cancellation of potentially
SINR loss, dB
Doppler filter 1 Doppler filter 2 Doppler filter 3 Doppler filter 4 Doppler filter 5
asymptotic power, dB
Figure 10.11 Asymptotic SINR loss for CNR-induced spectral mismatch detectable target signals. As an example, Figure 10.11 depicts the asymptotic SINR loss for different Doppler filters as a function of CNR. The primary data has CNR set to 52 dB. Both training and primary data incorporate 0.45 m/s RMS spectral spread into the normalised covariance structure. The Figure suggests that training in a region with lower CNR markedly degrades MDV as a result of the joint amplitude and spectral mismatch; losses are substantial for only modest deviation in CNR. In contrast, training in a region with CNR greater than the primary data leads to a lesser degree of loss in this specific example. However, as the losses in Doppler filter 2 - the bin adjacent to main beam clutter - suggest, MDV is still significantly affected. Implementations employing overnulling strategies for adaptive training will suffer for this reason. An increase in the nominal RMS spectral spread (0.45 m/s is modest) will lead to an increase in the observable SINR degradation. To summarise, inappropriate filter notch width is the physical mechanism leading to performance degradation in the presence of CNR-induced spectral spread. Additionally, we note this scenario differs from that described in Section 10.3.2 as follows: in Section 10.3.2, the clutter RCS varied over angle and range in accordance with the gamma distribution in a manner representative of spiky clutter conditions; in contrast, the CNR-induced class of clutter heterogeneity indicates a persistent change in CNR over range that also leads to a consequent mismatch in spectral width, as described herein. Section 10.8 presents a measured data example corroborating CNR-dependent spectral spread and providing further clarification of this issue (see Figure R5 in the colour signature).
10.6
Targets in the secondary data
In this section we consider the impact of targets in the secondary data (TSD). From a practical perspective, the likelihood of target-like signals corrupting the secondary data set in a GMTI scenario is high. TSD affects STAP performance in several ways: from a filtering perspective, TSD leads to whitening of the desired signal component; TSD distorts the adapted pattern; TSD leads to inefficient use of the adaptive processor's DoFs; depending on the weight training strategy, TSD can also modulate the output power of the STAP, thereby biasing the constant false alarm rate (CFAR) threshold applied in subsequent processing; and, TSD impacts the variance of the target's angle of arrival (AoA) estimate. We now examine these points in further detail. To begin, we consider the following simple modification of equation (10.1):
(10.43)
where STSD is the space-time signal vector for a corruptive, target-like signal with power a\SD and Qk is the covariance matrix estimate of an assumed homogeneous clutter-plus-noise component. The corresponding weight vector is then:
(10.44)
Two main cases of interest arise: (a) perfect match to the target response, SJSD = v s _t; (b) mismatch between target and TSD responses, SJSD = vs_t + 5, where 8 represents the steering vector mismatch. In case (a) it is readily seen that W^/TSD is proportional to Qj^ vs_t, and so no penalty in SINR results from the TSD. Next, considering case (b), we express the weight vector in equation (10.44) as:
(10.45)
with p = Q^" ' s s _ t and z = Q^ ' S representing whitened target response and mismatched steering vectors, respectively, and where we otherwise assume v s _ t = s s _ t (i.e. no array manifold errors or mismatch to target angle-Doppler response). Several observations are in order: as expected, r\ - • 0 as a\SD -^ 0; the magnitude of T] depends on both TSD power, Oj
To evaluate the asymptotic SES[R loss due to TSD, let Qk -» Qk in equations (10.43) and (10.45). We may then express the additional loss with respect to noise-limited and optimal cases as [13]:
(10.46) Consequently, SINR(f59 JD) = SNR(J5) x LSy\ (/., Jh) x L5jSD(fs, Jh)9 which does not include further losses due to finite sample support. Expression (10.46) identifies the key parameters affecting performance: the TSD power and location of the corruptive target-like signal (measured by the projection of the generalised vectors, Q~ 1 / 2 s s _ t and Q^ 1/2<S), as well as the normalised SINR term ||p||^ = s^Qj^Sg-t. Observe that 0 < LsjsD(fs, Jh) < 1 a n d LsjsD(fs, Jh) = 1 when 6 = 0. The term T]Q^ 8 in equation (10.45) - or its asymptotic value - is responsible for SINR degradation via target whitening; this effect is most apparent when viewing the filter response. We now consider the aforementioned typical airborne radar interference scenario comprising ground clutter and receiver noise with 60 dB integrated CNR. Recall, eight elements make up the array radar in the azimuth dimension, with eight elements in elevation beamformed into a single spatial channel. A 25 dB Taylor weighting is applied on transmit, with no spatial weighting on receive. The receive half-beamwidth is roughly six degrees in azimuth. The PRF is set slightly higher than the displaced phase centre antenna (DPCA) requirement [3] and the array is side-looking with respect to the platform velocity vector. Figure P.I (see colour signature) plots LSJSD(JS* JD) for a single Doppler filter (bin 2, adjacent to main beam clutter) with the TSD component stepped over angle and power. In this case, we compared the calculation in equation (10.46) against: (10.47) with SINRopt(Js, Jh) denoting the optimal SINR and Wk/TSD = Qk/TSDvst> finding both expressions to be numerically equivalent. The embedded power of the TSD measures the corruptive power included in the covariance estimate and is essentially the TSD-to-noise ratio (noise floor at 0 dB) after the averaging process leading to equation (10.43). Moving scatterers spread over the range extent, such as vehicles on a highway, contribute to TSD. Additionally, tractor-trailers and multiple vehicles in a resolution cell lead to a strong echo. As expected, we observe zero loss when the target and TSD have the same direction of arrival and Doppler. However, slight misalignment between target and TSD directions leads to substantial losses within the main beam due to signal whitening. The null near seven degrees corresponds with a null in the quiescent pattern. Figure 10.12 depicts the spatial filter responses in Doppler bin 2 for cases of moderate and strong main beam TSD; for comparison, we also show the optimal
normalised gain, dB
target and TSD separation: 1°
optimal strong TSD (22 dB) moderate TSD (15 dB)
direction of arrival, deg
Figure 10.12
Filter response distortion due to target-like signal corrupting covariance matrix estimate ([13], © 2001 IEEE)
filter response. The Figure clarities the issue of signal cancellation due to TSD: observe the decreased gain in the target direction of 0° due to the migrating adaptive null when comparing with the optimal case. As will be shown in Section 10.8.1, we observe this same effect in actual measured airborne radar data taken in regions with ample roadways in the radar field of view. Upon detecting a target, a bearing estimate is then necessary to initiate target tracking. A method seamlessly integrating with STAP processing is desirable. Relevant approaches based on maximum likelihood estimation are discussed in References 25 and 26. In developing the bearing estimator, first consider the alternative hypothesis space-time snapshot: (10.48) where y is a complex constant, g = [fs / b ] , and ntot is the interference-plusnoise (total noise) vector. The basic problem is to estimate g with g. If ntot ~ CN(O, Q k ), then: (10.49)
Given this likelihood function, we first require an estimate for the complex constant. Equation (10.49) is maximal when: n(K,g) = (*k - KSs-ICg))^Qk1CXk - yss-t(g))
(10.50)
is minimal. Differentiating equation (10.50) with respect to y and setting the result to zero yields: (10.51) Next, substituting equation (10.51) into (10.50) leads to: (10.52) Differentiating equation (10.52) with respect to g and setting the result to zero yields the MLE for g, given as g. Consequently, the estimator is: (10.53) In practice, we replace Qk in equation (10.53) with QkFigure 10.13 shows the DoA cost surface of equation (10.53) for a 2OdB SNR target with a true AoA of—2 degrees. The target signal competes with 60 dB integrated CNR and receiver noise for the typical eight-channel, side-looking airborne array. For comparison, we also include the MLE in the presence of TSD by incorporating a target-like component with crjSD = 20 dB and a true AoA of 3 degrees. By searching for the peak of each of the two cost surfaces, we find unbiased estimates of the target AoA. However, flattening of the MLE cost surface in the presence of TSD translates into a substantial increase in the estimate's variance. Constant false alarm rate (CFAR) performance is highly desirable in radar detectors; the false alarm rate is a critical design factor influenced by computational and algorithmic processing capabilities. In some cases, the normalisation applied to the STAP weight vector influences CFAR performance. For example, Robey et al. describe the adaptive matched filter (AMF) as a CFAR detection statistic [27]. The AMF takes the form: (10.54) where §i is the threshold and vs_t = ss_t (no steering vector mismatch). The AMF follows from the STAP weight vector when \x = \/Jv^_tQ^ v s _t, and normalises the output noise power to unity; this characteristic is highly desirable, since in theory it enables a single threshold setting for all angle, Doppler and range to achieve specified false alarm behaviour. The output noise for range cell k under the AMF setting,
no TSD TSD
true target AoA: -2° TSD AoA: 3°
direction of arrival, deg
Figure 10.13
Target angle of arrival estimate in the presence of TSD
WAMF/k, is:
(10.55)
Equation (10.55) suggests an output noise level roughly set to unity, and for this reason the AMF is sometimes referred to as the unit-power constraint. Suppose we now incorporate the TSD model of equation (10.43) into equation (10.54). Using the matrix inversion lemma, assuming vs_t = ss_t, and coaligned TSD and target signal (8 = 0), we find: (10.56) Thus, while TSD does not degrade SINR in this instance, as previously shown, it does effectively bias the threshold upwards by 1+ ^j5Z>ss^tQk Ss ~t' thereby leading to target masking. This same effect is seen in traditional scalar CFAR algorithms. Since mismatch and clutter residue are always concerns, it is common to follow the AMF with a traditional scalar CFAR with adaptive threshold; depending on the STAP training strategy (e.g. sliding window), the AMF normalisation in the presence
of TSD suppresses the target response prior to the scalar CFAR, thereby effectively biasing the target response below the adaptive threshold. Interestingly, some non-CFAR-type normalisations do not suffer from the preceding effect. The unit norm constraint on the weight vector, w^Wk = 1 ? leads to v s-tQk vs-t> a n d the well known minimum variance distortionless response (MVDR) constraint, w^s s _ t = 1, leads to: AMVDR = l / ( v ^ t Q k l y s-t)- In the case where TSD and desired target steering vectors are coaligned, one can confirm the impervious nature of the decision statistic to corruptive TSD by employing equation (10.43) with STSD = v s _t. These latter normalisations are less desirable in theory than the AMF since the detection processor requires a different threshold setting for each angle-Doppler resolution cell, at least in a range homogeneous environment. More to the point, in a heterogeneous environment, these latter normalisations tend to modulate the output noise in the range dimension. Now we consider the impact of TSD on STAP performance when using finite sample support. To wit, we employ a Poisson distribution to seed targets in various range-angle sectors of the radar field of regard. Otherwise, we assume a homogeneous clutter environment so that we may specifically isolate the effects of TSD. Melvin and Guerci presented related analysis in Reference 13. A more complete treatment using site-specific clutter and road layout is given by Bergin et al. in Reference 28. Suppose dense traffic regions occur in specific range-azimuth sectors, with certain densities Acars and Atrucks defining the expected number of vehicles per square kilometer. A Poisson distribution is a natural selection for seeding targets within the space-time data cube. Figure 10.14 shows a single trial seeding for the parameters of Table 10.2. We generate a space-time signal for each vehicle location shown in the Figure; additionally, we assume each vehicle response follows a Swerling I target model, with a mean radar cross section of 10 dBsm for cars/light trucks and 22 dBsm for tractor trailer trucks. Next, we generate 279 radar clutter-plus-noise data realisations from a starting range of 32 km towards the end of the unambiguous range interval at 75 km. We use the same typical airborne radar parameters as in the preceding asymptotic analysis, with a waveform bandwidth set to 1 MHz (150m resolution). Using all 279 data vectors in equation (10.1) yields E[LS^] ~ —3dB in an otherwise homogeneous clutter environment. Beginning with the homogeneous clutter-plus-noise data cube of dimension TV = 8 by M = 16 by Q = 279, we then independently seed the Poisson-distributed TSD for varying trials using the parameters in Table 10.2. The TSD is uniformly distributed over the specified range-angle sector. We set the TSD Doppler response to correspond with ground moving targets - cars and truck on roadways in the radar field of view - nominally spread over velocities in the range of ±30 m/s. The Doppler is fixed by region (see Table 10.2), as one might expect for a roadway with a specific orientation, but uniformly spread over the corresponding Doppler filter width. Figure 10.15 compares SINR loss for the HD and homogeneous clutter-plus-TSD cases shown over a ±30 m/s target velocity interval. In this Figure we show the complete loss, LSt\ • L8^, characterising finite sample support, TSD, radar system
slant range, km
cars trucks
azimuth, deg
Figure 10.14
Target seeding scenario, one trial ([13], © 2001 IEEE) Table 10.2 Poisson target seeding scenario Region
1
2
3
4
A-carsCkm"2) ^trucks (km"2) Start range (km) Stop range (km) Azimuth (degrees)
0.5 0.1 32 39 - 5 to - 3
1 0.09 48 56 6 to 7
0.2 0.1 64 66 6 to 10
0.01 0.02 43 60 12 to 15
and homogeneous clutter effects. The upper bound on performance is also shown in Figure 10.15 and labelled as 'known covariance'. We obtained the TSD curve by averaging 200 different Poisson trials. As expected from our earlier discussion, TSD in a finite sample support scenario leads to significant performance loss when using the proposed Poisson model. Figure P.2 (see colour signature) shows SINR loss over ±50 m/s for each individual Poisson trial. The impact of TSD on the adaptive radar's MDV is evident in this plot, which we may further contrast against SES[R loss in the homogeneous (no TSD) case in Figure P.3 (see colour signature). TSD degrades the MDV in this instance by roughly a factor of three.
SINR loss, dB
known covariance estimated: with TSD (200 trials) estimated: HD secondary data
velocity, m/s
Figure 10.15
SINR loss comparison using finite training data and 200 Poisson trials to seed TSD ([13], © 2001 IEEE)
In Section 10.8.1 we show measured multichannel airborne radar measurements (MCARM) data illustrating that TSD is a practical concern when implementing STAP and corroborating the effects discussed in this section of the chapter using synthetic data. Section 10.9.4 suggests some strategies to mitigate TSD effects.
10.7
Joint angle-Doppler mismatch and clutter heterogeneity
STAP maximises SINR by filtering ground clutter in the angle-Doppler domain. Radar geometry determines the filter null location in this higher-dimensional space. When the null location varies over range, the adaptive filter produces an incorrect frequency response. Indeed, the filter tends to an average response with the potential for very poor instantaneous performance. We briefly highlight angle-Doppler properties for the monostatic radar case; relevant discussion applicable to bistatic geometry is given in References 3, 5 and 6. Consider a right-handed coordinate system with the x-axis pointing north, the y-axis pointing west and the z-axis pointing upwards. A unit vector pointing from the platform to a stationary point on the ground is: k(0,0) = cos 0 sin 0x + cos 0 cos 0y + sin Oi
(10.57)
where 0 is azimuth measured positive in the clockwise direction from the y-axis and 0 is elevation measured negative in the downward direction from the horizon. The direction vector to the mth subarray is: (10.58) while the platform velocity vector is: (10.59) The spatial phase at the mth channel is: (10.60) A denotes wavelength. Also, the normalised Doppler frequency of a stationary point is: (10.61) where T is the pulse repetition interval. If the channel spacing is d and we consider a side-looking array configuration (i.e. (Is9Hi = (m — l)d\) we can define the spatial frequency as: (10.62) where 0cone represents cone angle [3]. Normalised Doppler in the side-looking case can be written: (10.63) In other words, the angle-Doppler properties of a continuum of stationary points (ground clutter) fall on a line with the slope determined by platform velocity, pulse repetition interval and sensor spacing, and there is no dependence of the angle-Doppler contours with range (elevation angle). Hence, platform geometry does not induce non-stationary behaviour. The forward-looking geometry is a contrasting case. With the platform in level flight in the x-direction: (10.64) The angle-Doppler contours are ellipses that vary over range. As a rule-of-thumb, when the slant range divided by the platform height is less than five, range dependence is significant [3]; otherwise, the beam traces tend to align with the isodops at farther range, and so the angle-Doppler contour locations tend to stabilise. This example shows that geometry induces non-stationary angle-Doppler behaviour.
Range-varying angle-Doppler loci further exacerbates culturally induced heterogeneous clutter effects. For example, residue from clutter discretes further increases in the presence of adaptive filter null migration associated with the non-stationary clutter mechanism described in this section.
10.8
Site-specific examples of clutter heterogeneity
10.8.1 Measured multichannel airborne radar data This section briefly examines measured data taken from the multichannel airborne radar measurements (MCARM) programme [29]. Figure 10.16 shows the MCARM platform with a port-mounted, multichannel radar system housed within the radome. Table 10.3 provides some characteristics of the MCARM system. We consider data taken from Flight 5, Acquisition 575. Our objectives in considering this data are two fold: (a) we wish to corroborate the model of equation (10.18); (b) we identify a few instances of heterogeneous clutter behaviour described in the preceding sections.
Figure 10.16 Table 10.3
• • • • • • • • • •
MCARM radar system Some MCARM
characteristics
L-band transmit frequency 15 kW peak transmit power variable PRF (0.5 kHz, 2 kHz, 7 kHz) LFM or gated-RF 0.8 microsecond range resolution 0.8 MHz receiver bandwidth 7.5 degree Tx beam or blob (3 x) pattern for broad coverage 1.25 MHz IF centre frequency 5 MHz IF sampling rate (4 x oversample for digital I/Q) test manifold for channel balancing
• • •
• • • •
range-measured steering vectors 32 transmit subarrays (16 over 16 planar configuration) 24 receivers (sum, delta, and 11 over 11 additional channels oriented in a planar configuration) 128 radiating elements total (32 subarrays x 4 elements per subarray) 1 acquisition = 1 CPI nominally 128 pulses per CPI at the 2 kHz PRF collection region: DelMarVa Peninsula, USA
estimated SINR loss, dB
simulated
Doppler, Hz
Figure 10.17
Estimated SINR lossfor MCARM 575 data, different training intervals
As a means of corroborating the model of equation (10.18), Figure P.4 (see colour signature) compares the minimum variance distortionless response (MVDR) spectra for the actual data against simulation. The plots generally exhibit close correspondence. Clutter ridge slope and shape show good match; the additional azimuthal power spread in the actual data is a likely result of near-field scattering, an effect not included in the simulation model. Figure P.4 (see colour signature) confirms the suitability of equation (10.18) as a basic space-time model for ground clutter. Using test manifold data, we estimate the MCARM receiver noise floor. Next, we select training data in block regions from near range to far and estimate the clutter-plusnoise covariance matrix. Figure 10.17 shows estimated SINR loss for training data taken over different regions, identified in the legend, as well as the simulated curve using the model of equation (10.18). Good correspondence exists between actual and simulated clutter null locations. However, the impact of amplitude heterogeneity and TSD affects the shape of the measured data SINR loss curves. For example, scalloping in the loss curves is most likely due to TSD. To verify this conjecture, Figure 10.18 shows estimated SINR loss when training over a contiguous block, and after removing roadways intersecting the main beam (MB) and first sidelobe (SL) regions; we also apply some diagonal loading (denoted as DL) to stabilise the noise floor after removing training samples. We use map data (see Figure 10.20) and ownship navigation data to identify roadways in the radar field of view. As can be seen, the scalloping disappears after excising certain training data potentially containing target-like signals,
estimated SINR loss, dB
excise Hwy 15 MB excise Hwy 15 SL excise Hwy 15 SL, 0 dB DL simulated
Doppler, Hz
Figure 10.18
Estimated SINR loss after removing training data overlaying roadways
thereby yielding much closer correspondence between actual and simulated curves. Figure 10.19 shows the adaptive filter patterns within a single Doppler bin for the same training intervals (labelled by range bin number) as shown in Figure 10.17. The quiescent beam points to 1 degree off boresight. Migration of the null due to TSD is apparent in the upper two plots where the corresponding training data comes from regions with dense roadway networks. Varying clutter reflectivity affects the depth of the clutter null shown in Figure 10.17. Figure P.5 (see colour signature) shows range-Doppler and power versus range information for the subject acquisition. The power versus range curve clearly shows three distinct regions corresponding, from near to far range, to the predominantly farmland terrain of the DelMarVa Peninsula, USA, Delaware River and terrain in New Jersey, USA, opposite the Delaware River. A map of the collection region, along with the locations of several adjacent acquisitions, is shown in Figure 10.20; the aircraft flew a southerly route, with the array normal pointing almost due east. The three distinct terrain regions, along with many roadways in the radar field of view, are evident from viewing the map. Additionally, the CNR-dependent nature of the main beam clutter spread is seen in the range-Doppler map. From this single measured data acquisition we find examples of TSD, rangevarying clutter edges (three distinct regions) and CNR-dependent spectral spread, thereby corroborating several heterogeneous clutter models described in prior sections. Additionally, from Figure P.4 (see colour signature), we find that equation (10.18) adequately characterises the basic features of ground clutter.
Figure 10.19
10.8.2
azimuth angle, deg
azimuth angle, deg
azimuth angle, deg
azimuth angle, deg
Comparison of adaptive patterns for MCARM 575 data, different training regions ([13], © 2001 IEEE)
Site-specific
simulation
Realistic clutter environments, as the measured data from the prior section suggests, are site-specific. A variety of databases describing terrain cultural features are available to researchers investigating STAP performance in site-specific, heterogeneous clutter environments [28,30-34]. In addition, the signal processor can use these databases to enhance STAP implementation (e.g. by improving training data selection, predicting clutter edges, or prefiltering the data) [28,30,32]. For example, the United States Geological Survey (USGS) database includes land use and land cover (LULC) data and National Land Characterisation Data (NLCD) classifying predominant clutter types by geographic coordinate; digital line graph (DLG) data characterising discrete features, such as railway tracks, power transmission lines, etc.; digital elevation model (DEM) data providing terrain height information [30,32,33]. The US Census Bureau provides a variety of mapping products, including the Topologically Integrated Geographic Encoding and Referencing (TIGER/Line) road overlay data [28,30,34]. Combining cultural databases, such as those described in the preceding paragraph, with the clutter model of equation (10.18) enables site-specific clutter simulation. Different reflectivities are assigned to different clutter classes based on a query of the database [28,30]. Incorporating ground moving vehicles involves placing road
Figure 10.20
Mapping data of acquisition region showing roadways and Delaware River [34]
power, dB
MCARM acquisition 575 site-specific model
slant range, km
Figure 10.21 Measured and site-specific synthetic power versus range curves for MCARM 575 scenario ([30], © 2003 IEEE)
segments - given, for instance, by the TIGER/Line database - onto the earth's surface [28,30] and seeding targets using a particular probability distribution, such as the Poisson distribution employed in Section 10.6 or Reference 13. Database information can potentially provide very accurate prediction of ground clutter characteristics. Figure P.6 (see colour signature) shows a site-specific clutter RCS map of the data collection region for the MCARM Flight 5 data analysed in the prior section. The shape of the reflectivity map corresponds with our expectations based on the US Census Bureau map in Figure 10.20. Using the RCS map of Figure P.6 in the model of equation (10.18) leads to a remarkable match between the actual and site-specific, synthetic MCARM power versus range curves, as shown in Figure 10.21. Finally, Figures P.7 and P.8 (see colour signature) contrast the site-specific and homogeneous (bald earth) synthetic range-Doppler maps for the MCARM scenario. (Note: the ordinate has units of velocity, obtained simply by scaling Doppler frequency by half the wavelength.) Figure P.7 compares favourably against the measured MCARM range-Doppler map shown in Section 10.8.1.
10.9
STAP techniques in heterogeneous environments
In this section we highlight some STAP techniques applicable in heterogeneous clutter environments. This listing of methods is by no means exhaustive. A thorough examination and comparison of the various methods is beyond the scope of this chapter. We provide corresponding references for readers interested in delving further.
10.9.1 Data-dependent training techniques Data-dependent training techniques are valuable in heterogeneous clutter environments. The non-homogeneity detector (NHD) [35,36], power-selected training (PST) [37-39] and map-based training selection [28,30] are three examples of data-dependent training schemes. The non-homogeneity detector (NHD) assumes that gross changes in the underlying data structure lead to degraded performance [35,36]; the processor enhances adaptive capability by detecting and excising secondary data realisations significantly deviating from the surrounding realisations. The processor's goal is to select the most homogeneous set of secondary data based on a measure of covariance structure. The generalised inner product (GIP) is one viable metric and is given by: (10.65) Next, observe that the GIP can be written: (10.66) where Xk represents the whitened data vector, Ek is the matrix of eigenvectors resulting in the unitary similarity transform and Lk is a diagonal matrix containing the eigenvalues ofQk. Further notice that E[XkX1^] = Q^ QkQk , thereby implying
that when Qk = Qk(10.67) where am are the Karhunen-Loeve coefficients and e m are the eigenvectors of Qk. In this sense, the GIP measures the similarity in covariance structure between Qk and QkNotice that an ideal measure of difference in covariance structure is dk = /g(Qk 1Qk)- When Qk = Qk, we find the sum of the elements of dk equals NM. For a single realisation, we might consider Qj^ 1XkX^, but XkX^ is singular, thus leading us to equation (10.65). Figure 10.22 shows the modified sample matrix inverse (MSMI) detection statistic, NMSMIi over range for sliding window and NHD training approaches applied to the MCARM Flight 5, Acquisition 575 data considered in Section 10.8.1. (The MSMI and AMF detection statistics are identical, and so both terms are often used interchangeably.) The NHD is used to select a homogeneous training set. The curves correspond to the tenth Doppler bin, and a synthetic target is present at a range of 16.5 miles. The synthetic target is easily detected when employing the NHD method
N
MSMI> dB
2 x DoF sliding window
slant range, miles
2 x DoF most homogeneous NMSMI* dB
injected target
slant range, miles
Figure 10.22
Comparison of sliding window (top) and NHD (bottom) training schemes applied to MCARMFlight 5, Acquisition 575, Doppler bin 10 (1 mile = 1.61 km) ([11], © 2000 IEEE)
to select a homogeneous training set, but masked by surrounding interference in the sliding window case. To implement the NHD, the GIP is applied as follows. First, the data are Doppler processed and then the processor passes a minimum size sliding window through the secondary data to compute a covariance matrix estimate. Next, the processor sorts and ranks the GIP, identifying the most homogeneous samples to compute the adaptive weight vector. Figure 10.23 depicts the GIP over range for the tenth Doppler bin. Using mapping data, the peaks of the GIP - corresponding to heterogeneous samples - are found to generally align with roadways in the radar field of view, thereby suggesting TSD as the culprit leading to the performance loss of the sliding window approach shown in Figure 10.22. Power-selected training (PST) uses an adaptive procedure to select secondary data for covariance matrix estimation [37,38]. The basis for PST is the presumption that clutter heterogeneity is a result of power variations over range, and that these power variations result in inadequate clutter null depth. The adaptive processor does not fully cancel clutter as a result of insufficient null depth, leading to an increase in
Rt.9 SL, Edge Del. River Intracoastal Waterway Back Edge of Del. River Acq.l,575 Broadside Doppler 10, 4xDoF Sliding Window RU5SL, Rts. 13,71 MB Rts. 9.13 SL Rt. 9 MB
SL = sidelobe, MB = mainbeam
range, miles
Figure 10.23
Generalised inner product versus range, MCARM Flight 5, Acquisition 575, Doppler bin 10 (1 mile = 1.61km) ([11], © 2000 IEEE)
the false alarm rate. PST involves choosing stronger clutter samples, thereby yielding deeper clutter nulls. Specifically, the optimal weight vector may be written: (10.68) where am = e^s s _t, e m is the mth eigenvector with eigenvalue Am, and"knoiseis the noise floor eigenvalue. The beamformer voltage pattern gain is:
(10.69) where Qv(g) is the quiescent response, Emv(g) is the eigenbeam response and Amv(g) is the projection of the eigenbeam onto the quiescent pattern. Notice that Amv (g) scales the eigenbeam equal to the quiescent response at g = [(/) O]. Therefore, the level of nulling below the quiescent pattern at g is: (10.70) or twice the eigenvalue spread. This suggests training on those snapshots exhibiting the strongest power, thereby increasing Xm and consequently driving the adaptive null to a greater depth. The procedure for implementing post-Doppler PST (see Table I in Reference 38) is as follows: (a) (b) (c)
a jammer-nulled beam is pointed in the direction of clutter for each Doppler bin the processor sorts the beamformed outputs by power (in each Doppler bin for the Doppler-factored STAP approach) the processor selects the data snapshots for covariance matrix estimation in one of two suggested ways, either the single training set or the multiple training sets approach. Using the single training set method, the processor chooses the strongest outputs over all range for covariance matrix estimation. In the multiple training sets approach, the processor selects those range samples whose estimated power exceeds the power in the primary cell.
Variations on the PST method are given in Reference 38. One alternate implementation, referred to as 'power-selected deemphasis (PSD)', involves including some proportion of the primary test cell and surrounding guard cells in the covariance matrix estimation procedure to potentially improve the representation of test cell interference. In this case, the covariance matrix estimate takes the form: (10.71) where & = 0 excludes the primary data from the MLE and # = 1 fully includes the primary data. Projection deemphasis (PDE), also proposed in Reference 38, is one
approach for avoiding the selection of #. The idea behind PDE is to remove target energy from the primary data. One potential choice for the projection operator, for a post-Doppler STAP approach, is: (10.72) where vc is the clutter spatial steering vector. PST reduces the false alarm rate in heterogeneous clutter environments. Principal concerns with this training approach include signal cancellation - particularly when close correlation exists between clutter and target, such as in GMTI applications and degraded MDV resulting from CNR-dependent spectral heterogeneity. An example of map-dependent training selection was given in Figure 10.18. Mapping information, such as the TIGER/Line data [34], along with ownship data, can be used to identify training regions likely to contain ground moving targets. The processor then excises suspect range cells to mitigate target whitening and other TSD-related degradation.
10.9.2 Minimal sample support STAP Distributed clutter may exhibit substantial similarity over localised regions. For example, if a series of clutter transitions (e.g. rural to urban to littoral) exists, it stands to reason that training independently on each clutter type - versus a more global training strategy - will enhance performance. Reduced-dimension (RD) and reduced-rank (RR) STAP methods minimise requisite training data, hence enabling localised training selection. Operating directly in the space-time domain, STAP serves as a data-domain implementation of the optimum filter; both optimum and adaptive filters involve weighting and combining complex baseband voltage outputs of Af spatial channels over M temporal samples, as equation (10.6) suggests. Additional space-time DoFs increase the requisite training range interval and computational burden. Decreasing either the spatial or temporal aperture is one approach to reducing DoF, but the corresponding information loss typically yields unacceptable detection performance since it is more difficult to separate the target signal from competing ground clutter returns. A preferred approach to DoF reduction involves linear filtering and/or subspace selection. Reduced-dimension (RD) STAP methods apply data-independent transformations to project space-time data to a lower-dimensional subspace of size P <^ NM [3]. In this case, the computational burden decreases from 0(N3M3) to O(P3), and nominal sample support tends to IP. Given the linear transformation matrix TRD € CNMxP, the reduced-dimension data vector is: (10.73) The optimal linear filter maximising SINR in the lower-dimensional subspace follows from prior results and has weight vector: (10.74)
Estimating the unknown PxP null-hypothesis covariance matrix proceeds in the same fashion as the full DoF space-time case: the processor uses the MLE of equation (10.1) applied to the transformed data. Although the clairvoyant performance of the filter given by equation (10.74) is inferior to the full DoF approach, the adaptive performance using finite training data can potentially yield better results. Reduced-rank (RR) STAP involves selecting a lower-dimensional subspace to represent clutter and jamming signals; each selected basis vector retains the original length NM. Subspace calculation commonly involves a computationally expensive singular value decomposition (SVD), but RR-STAP improves statistical convergence over the full DoF case from 2NM to 2/c, where K = rank (Qk) is ideally much less than NM [4,19]. In practice, rank determination is challenging. Unlike the RD-STAP case, the performance of the RR-STAP can identically match the full DoF processor when the interference subspace of the null-hypothesis covariance matrix is known a priori. Diagonal loading involves scaling an identity matrix in proportion to the nominal noise floor level and adding the result to the covariance estimate [40]. Diagonal loading tends to compress the noise eigenvalues, thereby stabilising the adaptive filter sidelobes and emphasising the covariance matrix principal components. Furthermore, diagonal loading enhances the covariance matrix condition number. The processor can reduce the requisite amount of training data without concern over ill conditioning. Forward-backward (F/B) averaging [41] and space-time aperture smoothing [42] further enable operation with reduced sample support. Both approaches exploit symmetry in the space-time snapshot to enhance statistical convergence. In the latter case, degradation of space-time aperture (resolution) is the corresponding cost for improved convergence. The processor can blend F/B averaging and the extent of smoothing to alter the range extent of the training region. F/B averaging and smoothing techniques are common in spectral estimation. The supposition that localised training enhances STAP performance is inherent when applying minimal sample support techniques to combat heterogeneity. Localised training is particularly valuable in those regions where different classes of distributed clutter dominate. Less clear, however, is the value of localised training when operating in the presence of clutter discretes and TSD. Furthermore, the variation of distributed clutter is unlikely to appear homogeneous over range and angle within the confines of a local range segment. One approach to circumvent the deleterious impact of heterogenous clutter is to completely forsake training in the range dimension. Sarkar et al. present a variant of space-time aperture smoothing in Reference 43 involving a least-squares formulation with training (parameter estimation) completely confined to the snapshot under test. Farina, Lombardo and Pirri describe a filter-bank approach in References 44 and 45 involving the design of a deterministic space-time filter bank, with a cell-averaging constant false alarm rate processor following each filter. A non-linear detector selects the filter exhibiting minimal estimated output clutter power, thereafter comparing the corresponding output signal in the range dimension to a CFAR threshold. The deterministic space-time clutter filter design does not require training, although the CFAR stage requires a small training interval in comparison with typical STAP approaches.
Farina et al. successfully compare the performance of this non-linear, non-adaptive processing scheme against fully adaptive STAP methods using measured radar data in Reference 44.
10.9.3
Clutter discretes
In Reference 46, Richmond analyses the performance of an adaptive sidelobe blanker configuration to reduce the false alarm rate of the adaptive matched filter in the presence of undernulled sidelobe clutter and discretes. The adaptive sidelobe blanker implementation involves two stages: the adaptive matched filter, resulting from the magnitude-squared output of the STAP with normalisation jl = \/J v^_ t Q^ v s _t; followed by the adaptive coherence estimator (ACE). The ACE determines whether a main beam or sidelobe origination best characterises the likely target response, operating in a manner similar to a traditional blanker. The power-selected training approach of References 37 and 38, and described in some detail in Section 10.9.1, attempts to minimise the impact of clutter discretes by training on the most powerful samples, thereby driving the adaptive null to greater depth. This approach will likely find greater application in airborne moving target indication. As Klemm suggests in Reference 3, orthogonal projection methods - such as the principle components inverse approach of Reference 47 - can mitigate clutter bleed through associated with amplitude heterogeneity and clutter discretes by offering (theoretically) infinite adaptive null depth.
10.9.4
Targets in training data
By minimising main beam distortion, diagonal loading can potentially mitigate TSD effects [13]. Additionally, some researchers have suggested foregoing adaptivity to avoid challenges associated with TSD; for example, the displaced phase centre antenna processing approach is a non-adaptive solution for suppressing ground clutter returns [3]. Figures 10.24 and 10.25 compare SINR loss for the Poisson target seeding example described in Section 10.6 when applying diagonal loading and DPCA methods. In Figure 10.24, the DPCA condition (see Reference 3) is violated, and so DPCA performance is quite poor. The 1OdB diagonally loaded STAP yields a 3—4 dB SINR improvement. Figure 10.25 shows performance under the DPCA condition; we find that the diagonally loaded STAP performance approaches the DPCA result, and that traditional STAP exhibits substantial losses due to TSD. Incorrect PRF selection, non-ideal platform attitude and various system errors exacerbate practical DPCA implementation; adaptive processing is commonly used in lieu of DPCA to accommodate these practical issues. Constrained adaptive processing is another approach to combating TSD effects. For example, derivative constraints mitigate main beam distortion associated with TSD [48]. The NHD [33,34] and map-based training approaches discussed in Section 10.9.1 are also potentially useful approaches when operating in environments contaminated by TSD. Finally, as another example, the adaptive multistage
SINR loss, dB
STAP with TSD (PRF - 2 kHz) DPCA (PRF = 2 kHz) diagonal loading (PRF = 2 kHz)
velocity, m/s
Figure 10.24
Comparison of conventional STAP, DPCA and STAP with 1OdB diagonal loading (DL) in a TSD environment ([13], © 2001 IEEE)
median cascaded canceller - proposed by Picciolo, Gerlach and Goldstein - is also an approach for coping with moving discretes in the training interval [49]. The method incorporates a median sample selection function to mitigate the impact of outliers on detection performance.
10.9.5
Covariance matrix tapers
Covariance matrix tapers (CMTs), described in References 23 and 24, provide robustness to spectral mismatch and modulation of the dominant subspace due to varying CNR conditions. The CMT concept involves applying a complex taper, t, to the space-time data to tailor the adaptive notch width. Specifically, let: xk = t O x k
(10.75)
The covariance matrix of x k is then: (10.76) Generally, ACMT will possess full rank. As previously mentioned, the application of the correlation matrices At and As described in Section 10.2 is tantamount to a CMT operation. With the particular heterogeneous clutter environment in mind, the STAP designer chooses the correlation matrix ACMT, or the corresponding taper t, to create a desirable adaptive filter response.
SINR loss, dB
STAP with TSD (PRF =1.3 kHz) DPCA (PRF =1.3 kHz) diagonal loading (PRF = 1.3 kHz)
DPCA condition
velocity, m/s
Figure 10.25
DPCA mitigates TSD losses - since it is non-adaptive ~ but requires rigid control over PRF, velocity vector and system errors ([13], © 2001 IEEE)
The CMT operation subsumes diagonal loading: ACMT tf£)L№zg(Qk))~ 1 INMJ where o2DL is the diagonal loading level. 10.9.6
=
1+
Knowledge-aided space-time processing
Oftentimes, a preponderance of a priori information is available to partially characterise aspects of the clutter environment. References 30 to 32 suggest a variety of knowledge sources, including cultural databases, inertial navigation unit (INU) and GPS data, and information from other sensors. The adaptive processor can incorporate this additional information to accomplish the following: • • • •
enhance training data selection [30-32,50,51] preadaptively null discretes [30,50,51] minimise the number of unknowns requiring estimation [52,53] condition the space-time data through pre-filtering [30,52,53].
The non-linear, non-adaptive filtering method proposed by Farina et al. in References 44 and 45, for instance, employs INU/GPS to estimate the clutter ridge slope; by default, this approach is knowledge-aided. Hiemstra proposes a coloured loading scheme in Reference 53; coloured loading replaces adaptive degrees of freedom with deterministic filtering objectives. Interestingly, the coloured loading approach provides a mechanism for incorporating a priori knowledge in a convenient manner.
Since clutter environments are generally site specific, knowledge-assistance will likely prove invaluable in further enhancing STAP capability for airborne and spaceborne radar systems.
10.10
Summary
STAP-based radar systems must operate in heterogeneous clutter environments. This chapter has introduced the reader to aspects of space-time clutter heterogeneity, further describing the impact of clutter heterogeneity on STAP performance and briefly highlighting some techniques to enhance STAP implementation when operating in heterogeneous clutter environments. We introduced five basic classes of clutter heterogeneity and related effects: amplitude heterogeneity, spectral mismatch, CNR-induced spectral mismatch, moving (Doppler-shifted) discretes, and related effects such as mismatched angle-Doppler loci. Using a basic model to describe space-time clutter characteristics, we investigated the potential impact of each class of heterogeneity on detection performance. Using measured airborne radar data, we also corroborated a fundamental space-time clutter model and several heterogeneous clutter models. It is important to note that performance curves provided in this chapter are specific to the particular simulation scenario and are generally intended to isolate singular effects, such as the impact of either amplitude or spectral mismatch. Practical environments are much more complicated than the simplified models described in this chapter might suggest. Specifically, real-world environments are site specific and involve a simultaneously occurring conglomeration of various heterogeneous clutter effects. Since actual clutter scenarios generally appear heterogeneous, an understanding of clutter heterogeneity is essential to the development of mitigation techniques. Certainly, much work remains.
10.11
Acknowledgments
The author gratefully acknowledges his collaboration on the development of STAP implementations with Dr. Joseph R. Guerci of DARPA's Special Projects Office, and thanks Dr. Guerci for his encouragement and support. The author further acknowledges the support of his Georgia Tech colleagues; special thanks go to Dr. Daniel Leatherwood, Dr. Greg Showman and Dr. Daren Zywicki, members of the technical staff at the Georgia Tech Research Institute. Additionally, the author gratefully acknowledges Dr. Zywicki for generating the site-specific clutter simulation results discussed in Section 10.8.2, and Dr. Showman for proofreading the manuscript and providing valuable suggestions for improvement. The author further thanks his colleagues at the Air Force Research Laboratory for introducing the author to STAP, for many stimulating interactions over the years, and gratefully acknowledges the support of Dr. Mark Davis, Mr. Bill Baldygo, Mr. Mike Callahan, Dr. Mike Wicks and Mr. Jerry Genello.
References 1 BRENNAN, L. E. and REED, I. S.: 'Theory of adaptive radar', IEEE Trans. Aerosp. Electron. Syst9 March 1973, 9, (2), pp. 237-252 2 HAYKIN, S.: 'Adaptive filter theory' (Prentice-Hall, Upper Saddle River, NJ, 1996, 3rd edn.) 3 KLEMM, R.: 'Principles of space-time adaptive processing' (IEE Publishing, UK, 2002, 2nd edn.) 4 GUERCI, J. R., GOLDSTEIN, J. S., and REED, I. S.: 'Optimal and adaptive reduced-rank STAP', IEEE Trans. Aerosp. Electron. Syst, April 2000, 36, (2), pp. 647-661 5 MELVIN, W. L., CALLAHAN, M. J., and WICKS, M. C : 'Adaptive clutter cancellation in bistatic radar'. Proceedings of 34th Asilomar conference, Pacific Grove, CA, 29-31 October 2000, pp. 1125-1130 6 KLEMM, R.: 'Comparison between monostatic and bistatic antenna configurations for STAP', IEEE Trans. Aerosp. Electron. Syst, April 2000, 36, (2), pp. 596-608 7 NITZBERG, R.: 'An effect of range-heterogeneous clutter on adaptive Doppler filters', IEEE Trans. Aerosp. Electron. Syst9 May 1990, 26, (3), pp. 475-^80 8 ARMSTRONG, B. C , GRIFFITHS, H. D., BAKER, CJ., and WHITE, R. G.: 'Performance of adaptive optimal Doppler processors in heterogeneous clutter', IEEProc, Radar Sonar Navig., August 1995,142, (4), pp. 179-190 9 FUTERNIK, A. and HAIMOVICH, A. M.: 'Performance of adaptive radar in range-heterogeneous clutter', J. Franklin Inst., 1998, 335B, pp. 71-87 10 MELVTN, W. L.: 'Space-time adaptive radar performance in heterogeneous clutter', IEEE Trans. Aerosp. Electron. Syst, April 2000, 36, (2), pp. 621-633 11 MELVIN, W. L., GUERCI, J. R., CALLAHAN, M. J., and WICKS, M. C : 'Design of adaptive detection algorithms for surveillance radar'. Proceedings of IEEE 2000 international Radar conference, Alexandria, VA, 7-12 May 2000, pp. 608-613 12 KALSON, S.: 'An adaptive array detector with mismatched signal rejection', IEEE Trans. Aerosp. Electron. Syst, January 1992, 28, (1), pp. 195-207 13 MELVIN, W. L. and GUERCI, J. R.: 'Adaptive detection in dense target environments'. Proceedings of 2001 IEEE Radar conference, Atlanta, GA, 1-3 May 2001, pp. 187-192 14 CAI, L. and WANG, H.: 'Further results on adaptive filtering with embedded CFAR', IEEE Trans. Aerosp. Electron. Syst, October 1994, 30, (4), pp. 1009-1020 15 McDONALD, K. P. and BLUM, R. S.: 'Exact performance of STAP algorithms with mismatched steering and clutter statistics', IEEE Trans. Signal Process., October 2000, 48, (10), pp. 2750-2763 16 HAIMOVICH, A.: 'Theeigencanceler: adaptive radar by eigenanalysis methods', IEEE Trans. Aerosp. Electron. Syst, April 1996, 32, (2), pp. 532-542
17 REED, I. S., MALLETT, J. D., and BRENNAN, L. E.: 'Rapid convergence rate in adaptive arrays', IEEE Trans. Aerosp. Electron. Syst, November 1974, 10, (6), pp. 853-863 18 BOROSON, D. M.: 'Sample size considerations for adaptive arrays', IEEE Trans. Aerosp. Electron. Syst, July 1980, AES-16, (4), pp. 446-451 19 GIERULL, C. H.: 'Statistical analysis of the eigenvector projection method for adaptive spatial filtering of interference', IEE Proc, Radar Sonar Navig., April 1997,144, (2), pp. 57-63 20 SKOLNIK, M. L: 'Introduction to radar systems' (McGraw Hill, New York, NY, 1980, 2nd edn.) 21 BILLINGSLEY, J. B., FARINA, A., GINI, R, GRECO, M. V., and VERRAZZANI, L.: 'Statistical analysis of measured radar ground clutter data', IEEE Trans. Aerosp. Electron. Syst., April 1999, 35, (2), pp. 579-593 22 BARTON, D.: 'Land clutter models for radar design and analysis', Proc. IEEE, February 1985, 73, (2), pp. 198-204 23 GUERCI, J. R. and BERGIN, J. S.: 'Principal components, covariance matrix tapers, and the subspace leakage problem', IEEE Trans. Aerosp. Electron. Syst, January 2002, 38, (1), pp. 152-162 24 GUERCI, J. R.: 'Theory and application of covariance matrix tapers for robust adaptive beamforming', IEEE Trans. Signal Process., April 1999, 47, (4), pp. 977-985 25 DAVIS, R. C , BRENNAN, L. E., and REED, I. S.: 'Angle estimation with adaptive arrays in external noise fields', IEEE Trans. Aerosp. Electron. Syst., March 1976, AES-12, (2), pp. 179-186 26 MANOLAKIS, D. G., INGLE, V. K., and KOGON, S. M.: 'Statistical and adaptive signal processing: spectral estimation, signal modeling, adaptive filtering and array processing' (McGraw Hill, Boston, MA, 2000) 27 ROBEY, F. C , FUHRMAN, D. R., KELLY, E. J., and NITZBERG, R.: 'A CFAR adaptive matched filter detector', IEEE Trans. Aerosp. Electron. Syst., January 1992, 28, (1), pp. 208-216 28 BERGIN, J., TECHAU, P., MELVIN, W. L., and GUERCI, J. R.: 'GMTI STAP in target-rich environments: site-specific analysis'. Proceedings of 2002 IEEE Radar conference, Long Beach, CA, 22-25 April 2002, ISBN 0-7803-7358-8 29 FENNER, D. K. and HOOVER, W. F.: 'Test results of a space-time adaptive processing system for airborne early warning radar'. Proceedings of 1996 IEEE national Radar conference, Ann Arbor, MI, 13-16 May 1996, pp. 88-93 30 ZYWICKI, D. J., MELVIN, W. L., SHOWMAN, G. A., and GUERCI, J. R.: 'STAP performance in site-specific clutter environments'. Proceedings of 2003 IEEE Aerospace conference, Big Sky, Montana, 8-15 March 2003, ISBN 0-7803-7651-X, paper no. 1145 31 MAHER, J., CALLAHAN, M., and LYNCH, D.: 'Effects of clutter modeling in evaluating STAP processing for space-based radars'. Proceedings of 2000 IEEE international Radar conference, Alexandria, VA, 7-12 May 2000, pp. 565-570
32 WEINER, D. D., CAPRARO, G. T., CAPRARO, C. T., BERDAN, G. B., and WICKS, M. C : 'An approach for utilizing known terrain and land feature data in estimation of the clutter covariance matrix'. Proceedings of 1998 IEEE national Radar conference, Dallas, TX, 12-13 May 1998, pp. 381-386 33 National Land Characterization Project website, http://landcover.usgs.gov/ classes.html 34 TIGER/Line website, http://tiger.census.gov/cgi-bin/mapbrowse-tbl 35 MELVIN, W. L. and WICKS, M. C : 'Improving practical space-time adaptive radar'. Proceedings of 1997 IEEE national Radar conference, Syracuse, New York, May 13-15, 1997, pp. 48-53 36 CHEN, P., MELVIN, W. L., and WICKS, M. C : 'Screening among multivariate normal data', J. Multivariate Anal, March 1999, 69, pp. 10-29 37 RABIDEAU, D. J. and STEINHARDT, A. 0.: 'Improving the performance of adaptive arrays in non-stationary environments through data-adaptive training'. Proceedings of 30th Asilomar conference, Pacific Grove, CA, 3-6 November 1996, pp. 75-79 38 RABIDEAU, D. J. and STEINHARDT, A. 0.: 'Improved adaptive clutter cancellation through data-adaptive training', IEEE Trans. Aerosp. Electron. SySt., July 1999, 35, (3), pp. 879-891 39 MARSHALL, D.: 'Evaluation of STAP training strategies with mountaintop data'. MIT Lincoln Lab, TR MTP-5, Bedford, MA, February 1996 40 CARLSON, B. D.: 'Covariance matrix estimation errors and diagonal loading in adaptive arrays', IEEE Trans. Aerosp. Electron. Syst., July 1988, 24, (4), pp. 397-401 41 PILLAI, S. U., KIM, Y. L., and GUERCI, J. R.: 'Generalized forward/backward subaperture smoothing techniques for sample starved STAP', IEEE Trans. Signal Process., December 2000, 48, (12), pp. 3569-3574 42 FANTE, R. L., BARILE, E. C , and GUELLA, T. P.: 'Clutter covariance smoothing by subaperture averaging', IEEE Trans. Aerosp. Electron. SySt., July 1994, 30, (3), pp. 941-945 43 SARKAR, T. K., WANG, H., PARK, S., et al.: 'A deterministic least-squares approach to space-time adaptive processing (STAP)', IEEE Trans. Antennas Propag., January 2001, 49, (1), pp. 91-103 44 FARINA, A., LOMBARDO, P., and PIRRI, M.: 'Nonlinear nonadaptive spacetime processing for airborne early warning radar', IEEProc, Radar Sonar Navig., February 1998,145, (1), pp. 9-18 45 FARINA, A., LOMBARDO, P., and PIRRI, M.: 'Nonlinear STAP processing', Electron. Commun. Eng. J., February 1999, pp. 41-48 46 RICHMOND, C. D.: 'Statistical performance analysis of the adaptive sidelobe blanker detection algorithm'. Proceedings of 31st Annual Asilomar conference, Signal, systems & computers, Pacific Grove, CA, 2-5 November 1997, pp. 872-876 47 TUFTS, D. W., KIRSTEINS, L, and KUMARESAN, R.: 'Data-adaptive detection of a weak signal', IEEE Trans. Aerosp. Electron. Syst., March 1983, AES-19, (2), pp. 313-316
48 JOHNSON, D. H. and DUDGEON, D. E.: 'Array signal processing: concepts and techniques' (Prentice-Hall, Englewood Cliffs, NJ, 1993) 49 PICCIOLO, M. L., GERLACH, K., and GOLDSTEIN, J. S.: 4An adaptive multistage median cascaded canceller'. Proceedings of 2002 IEEE Radar conference, Long Beach, CA, 22-25 April 2002, ISBN 0-7803-7358-8 50 ANTONIK, P., SCHUMAN, H. K., MELVIN, W. L., and WICKS, M. C : 'Implementation of knowledge-based control for space-time adaptive processing'. Proceedings of 1997 IEE international Radar conference, Edinburgh, Scotland, 14-16 October 1997, pp. 478-482 51 MELVIN, W., WICKS, M., ANTONIK, P., SALAMA, Y, LI, P., and SCHUMAN, H.: 'Knowledge-based space-time adaptive processing for AEW radar', IEEE Aerosp. Electron. Syst. Mag., April 1998, pp. 3 7 ^ 2 52 TECHAU, P. M. and BERGIN, J. S.: personal communication, March 2001 53 HIEMSTRA, J. D.: 'Colored diagonal loading'. Proceedings of 2002 IEEE Radar conference, Long Beach, CA, 22-25 April 2002, ISBN 0-7803-7358-8
Chapter 11
Adaptive weight training for post-Doppler STAP algorithms in non-homogeneous clutter1 Stephen M. Kogon
11.1
Introduction
The mission of a surveillance radar is the detection of moving targets, either airborne or ground-based, known as moving target indication (MTI). Traditionally, MTI radars have been implemented with a pulsed waveform using antenna reflectors or phased arrays with a limited number of spatial channels. Future systems, however, will most likely be implemented with antenna arrays with a large number of spatial channels to greatly enhance detection performance, especially in the presence of interference, through adaptive processing. The spatial channels formed from the antenna array, either beams or subarrays, provide angle measurements, and pulses are used to extract velocity information on potential targets. For each pulse, the radar returns are digitally sampled in time where each sample corresponds to a range or distance from the radar. The entire collection of spatial, pulse and range samples makes up a three-dimensional datacube. The key challenge to an MTI system is the suppression of interference, either due to other transmitted signals, such as jamming, or from the ground reflections of the transmitted radar signal, known as clutter. The fundamental problem that ground clutter presents is that, despite the fact that it is not physically moving, it has a nonzero Doppler frequency due to the platform motion of the MTI radar. As a result, the detection of moving targets that share the same Doppler with ground clutter is severely impaired without adaptive clutter mitigation. The physical characteristic that can be exploited, however, is the fact that ground clutter with the same Doppler as that
1
This work was supported by the DARPA under Air Force Contract # F19628-00-C-0002. Opinions, interpretations, conclusions and recommendations are those of the author and are not necessarily endorsed by the United States Government
of a moving target has a different angle of incidence. Thus, by adaptively combining elements and pulses, clutter can be cancelled by placing a null in angle and Doppler. Two-dimensional adaptive processing of spatial channels and pulses is commonly referred to as space-time adaptive processing (STAP) [1—3]. STAP computes an adaptive weight vector for every beam (angle) and Doppler frequency bin for which a target detection hypothesis is to be formed. Since the exact clutter characteristics, i.e. array response and power level, are not known a priori, the STAP weights must be estimated from the data and the processing is thus data adaptive. Since the adaptation is performed in angle and Doppler, training is done using range samples assuming that angle/Doppler characteristics of clutter remain unchanged over the training samples.2 Although, training methods are often specific to a particular application, generally STAP weights are computed for an entire range interval or for all ranges. The motivation is not only to minimise computations but to also make the ensuing angle estimation problem tractable.3 Primary performance considerations for selecting a training strategy are clutter mitigation and detection of slow moving targets, also referred to as minimum detectable velocity (MDV). Since performance of a training method is highly dependent on actual clutter characteristics, it is crucial to apply any proposed method to experimental data to assess performance. STAP simulations, although intuitive, are less reliable since clutter characteristics are often very difficult to simulate accurately. One difficulty associated with training STAP weights is the non-homogeneous nature of clutter power4 [7]. Many times sharp peaks are present in the level of clutter returns that are a great deal stronger than the average clutter power, often referred to as clutter discretes. The result of training over all clutter is that these clutter discretes may be under nulled by STAP since the weights are computed for the average clutter level in the training cells. The resulting STAP weights are diluted by the averaging process and the null on clutter is not deep enough to effectively mitigate the clutter discretes. A training method aimed at combating the non-homogeneity of clutter and specifically for the mitigation of false alarms due to clutter discretes is power-selected training [8, 9]. Here, the strongest clutter returns are used for training to set the depth of the null based on these strongest returns instead of on the average clutter level. Although other weaker clutter is over-nulled, the consequence in terms of detection performance is usually negligible.5 Another problem can arise when basing the inclusion of the snapshots in STAP training on power level alone. If targets are present that may be either strong or at a Doppler frequency with weak clutter levels, these targets may
2 Certain radar scenarios can produce non-stationary angle/Doppler characteristics of clutter for which compensation methods are available that can approximately remove this non-stationarity [4-6] ^ When STAP is employed, angle estimation algorithms should account for the non-deterministic adaptive response of the STAP weights, particularly for slow moving targets near or in mainbeam clutter Non-homogeneous clutter refers to the power of the clutter returns. It does not indicate a nonstationarity of clutter angle/Doppler characteristics in range as occurs for cases when the platform motion vector does not align with the array axis, e.g. forward-looking radars Slow moving targets that compete with mainbeam clutter may suffer more loss due to over-nulling clutter which can slightly degrade minimum detectable velocity
be included in a training strategy if power is the only selection criterion. The effects of including targets in the STAP training set are potentially significant target self-nulling and an overall degradation in clutter cancellation performance. Both of these effects result in compromised detection capability. In this chapter, we introduce a new technique which attempts to solve the problem of targets included in the STAP adaptive weight training for power-selected training. Since any moving target produces its own non-zero Doppler contribution that adds to the radar-platform-induced Doppler, a target with the same Doppler as clutter must come from a different angle. Since for a given Doppler frequency, clutter comes from a common angle for all range cells over which STAP weights are trained, any significant deviation in angle indicates that the cell contains significant non-clutter returns. To avoid the deleterious effects of including targets in STAP training, a snapshot with significant target contributions should be excluded from the adaptive weight training. A method of excluding targets is based on angle or phase. One means of estimating the angle of a return is to measure the phase difference between antenna phase centres across the array. Since angle directly corresponds to phase on the array, this measurement is essentially an interferometric angle measurement whose quality improves with the strength of the incoming signal, i.e. those snapshots selected based on power. If the measured phase differs significantly from the expected phase (angle) of clutter for the given Doppler bin, it should not be included in STAP training. Using such an additional phase criteria, strong target signals are excised from the STAP adaptive weight training. The proposed technique is successfully demonstrated on experimental data collected with an airborne radar sensor.
11.2
Training of STAP algorithms
Space-time adaptive processing (STAP) has become a major focus of research activity in radar applications over the past decade [1, 3]. For airborne radars, the detection of moving targets, either airborne or ground-based, is impeded by the returns from ground clutter. Using an antenna array and a series of pulses, an airborne radar has the ability to spatially discriminate with the sensor array and distinguish based on Doppler frequency using the set of pulses. For a moving target with a given Doppler, the radar must be able to separate this target from clutter with the same Doppler frequency. The velocity of targets is determined by measuring both the spatial location (angle) and Doppler frequency of detected targets. STAP is the two-dimensional adaptive processing in angle and Doppler employed for the purposes of clutter mitigation to enable the detection of moving targets. For the STAP processing of radar signals, we want to emphasise signals that arrive at the radar from a certain angle and Doppler frequency, while rejecting all other significant energy. For this chapter, we assume a linear array of uniformly spaced elements. We begin by defining the space-time steering vector for a signal arriving from an angle 4> with a Doppler frequency / v(0,/) = b(/)®a(0)
(11.1)
where b ( / ) = [1 eJ2nf/f™ . . • GJ27u(L-l)f/fPRf
(1L2)
a(0) = [1 e i 2 ^ s i n 0 A ) _
(11.3)
% Qj2n(M-l)(dsm(f)/k)^T
are the temporal and spatial steering vectors, respectively. The radar wavelength is X9 the physical spacing between elements is d, L is the number of pulses in a coherent processing interval (CPI) and M is the number of spatial channels, /PR is the rate at which the radar system transmits and receives pulses, known as the pulse repetition frequency (PRF). Likewise, the time between pulses is the pulse repetition interval (PRI). Note that in many applications the antenna array must have a large aperture to achieve the necessary angular resolution. In such cases, only a limited number of spatial channels are digitised for adaptive processing. A common method of reducing the number of spatial channels is to form subarrays of the full aperture by combining the A/2-spaced elements from a section of the array prior to forming the digital channels. Radar returns from spatial channels and pulses are often represented by a spacetime data vector x(n) for a given range sample n. A space-time snapshot containing a target signal is given by: x(n) = a t v(0t, ft) + x i+n (n)
(11.4)
where Q^, 0t> and /t are the target amplitude, angle, and Doppler frequency, respectively. Xi+n is the interference-plus-noise signal and n is the snapshot index. Here, we only consider interference consisting of ground clutter returns from the transmitted radar signal. The optimum space-time processor is given by References 1 and 3 (p. 118): (11.5) where Q i + n = E{xi +n (n)xi +n (n) // } is the interference-plus-noise covariance matrix. The space-time steering vector v((/>o, /o) determines the angle and Doppler frequency of interest. Note that the Doppler frequency of a clutter patch is given by: /c = ^ s i n 0 c
(11.6)
A
where t>p is the radar platform velocity and >c is cone angle of the clutter patch with respect to both the velocity axis of the radar platform and the array assuming a side-looking radar with the array axis aligned with the platform velocity vector [1,3, Chapter 3]. The performance of the optimum space-time processor from equation (11.5) is measured via the output signal-to-interference-plus-noise ratio (SINR). As the name implies, this is simply the output signal power divided by the interference-plus-noise
power: (11.7) where a^ is the signal power at the element level. Many times, we want to compare the SINR to the maximum SINR that could possibly be achieved. This upper limit is determined by the ideal matched filter for the interference-free case, i.e. thermal noise only. Normalising the SINR by the SNR of the ideal (interference-free) matched filter SNR0 yields: (11.8) which is known as SINR loss. Many times SINR loss is computed across angles and/or Doppler frequencies. An SINR loss of unity (0 dB) indicates perfect interference cancellation or no loss due to the presence of clutter. A processor is evaluated by how closely it comes to achieving this goal. Suboptimum partially adaptive processors are judged on their performance with respect to the optimum processor. Another common STAP metric is to determine the minimum velocity with an acceptable SINR loss, e.g. LSINR = —5 dB, often referred to as minimum detectable velocity (MDV). Since the optimum STAP weights are not known, they must be estimated from data samples or snapshots. Thus, STAP weights are computed using a sample covariance matrix given by: (11.9) /V—1
where Xtr = [x, r (l)x, r (2). -.xtr(K)]
(11.10)
is the data matrix of STAP training data samples. The training samples xtr(k) are selected from the overall set of radar returns x(n) for n = 1 to N. When Rx is substituted for Q i + n into the STAP weight equation from equation (11.5), we then have STAP weights given by: (11.11) This technique is commonly referred to as sample matrix inversion (SMI) [12]. The assumption made with this substitution of the sample covariance matrix is that the training samples in equation (11.10) are clutter-plus-noise snapshots and do not contain target signals and have clutter with the same angle/Doppler characteristics. Two major considerations go into the choice of data samples for STAP training: how many samples are necessary to get a good estimate of the covariance matrix and which data samples share the same statistics with the data sample to which the STAP weights are to be applied. Many times when the statistics of the clutter vary rapidly, these
two requirements are at odds with one another. We want to use as many samples as possible in order to obtain a good estimate of the covariance matrix that minimises the estimation loss [12]. On the other hand, the use of snapshots over a limited range near the data sample to be processed is desirable from the point of view that these samples most accurately reflect the local interference-plus-noise statistics.
11.3
Post-Doppler STAP algorithms
In practice, STAP usually must be implemented using a reduced degrees of freedom architecture that uses a deterministic preprocessor prior to adaptive processing to project into a reduced dimension subspace. The resulting algorithms are known as partially adaptive STAP [1,3, Chapter 5]. In the spatial dimension, the full array data can be reduced to either a set of subarray channels or a set of non-adaptive beams spanning the angular sector of interest. In the time domain, Doppler filtering is performed on the pulses prior to STAR This resulting class of algorithms is known as postDoppler STAR Since clutter comes from potentially all angles, performing Doppler filtering can reduce the STAP problem to cancelling clutter from the same Doppler frequency arriving from a different angle than the target, as shown in Figure 11.1. The viability of such a post-Doppler STAP approach is predicated on the assumption that the Doppler sidelobe levels are low enough to reduce clutter from other Doppler frequencies to below the thermal noise floor. Note that although interfering clutter may only be from a certain angle for a given Doppler frequency, limitations on Doppler resolution result in a clutter spread in angle over the Doppler frequency bin
Doppler
angle/Doppler resolution cell of STAP weights
STAP nulls
angle
Figure 11.1 Post-Doppler STAP approach with Doppler sidelobe suppression of mainbeam clutter
repeat for every Doppler pulses
spatial channels
DFT
degrees of freedom
Figure 11.2 PRI-staggered post-Doppler STAP algorithm block diagram and still necessitate spatial and temporal DoFs to cancel for two-dimensional adaptive processing, i.e. STAR Post-Doppler STAP algorithms are typically implemented with a DFT filter bank for the Doppler filtering component. Among these algorithms, the PRI-staggered post-Doppler STAP algorithm has been shown to be very effective, achieving near optimal performance [11]. A block diagram of the PRI-staggered STAP algorithm is shown in Figure 11.2. This algorithm uses two highly overlapped windows over pulses to produce staggered or delayed bins at the same Doppler frequency. The algorithm can be viewed as a frequency-domain (Doppler) adaptive filter using a time tap at a delay equal to an integer multiple of the PRI. The implementation requires two independent DFTs over the two highly overlapped windows over pulses. The delay between the two pulse windows, i.e. the pulse spacing, is known as the stagger and is ideally tuned to the PRF as in displaced phase centre array processing. This delay between bins is a temporal degree of freedom that STAP uses to suppress clutter. This algorithm was used for the experimental STAP results obtained in Section 11.5 of this chapter.
11.4 Phase and power-selected training for STAP The goal of any STAP training method is to determine the space-time snapshots x(A:) to use in the STAP data training matrix in equation (11.10). Many times, the assumption is made that ground clutter is homogeneous. In practice, clutter power levels can vary a great deal over range. The result of these non-homogeneities is that although clutter averages out to a certain power level, the resulting STAP weights computed with data samples over a large range extent will only have a clutter null deep enough to cancel clutter with the average power level. When the clutter contains large values, known as clutter discretes, the result will be under-nulled clutter which
in turn produces false alarm detections in the STAP output. Power-selected training is a data-adaptive STAP training method that attempts to combat this problem [8,9]. By using the strongest returns for each Doppler bin for training, STAP weights are based on the strongest clutter and, therefore, set the null depth according to the strongest clutter. Range cells with weaker clutter returns, in turn, have clutter interference over-nulled, although with little impact on SINR loss. One major problem with power-selected training arises when actual target signals are strong. Strong targets can arise from targets being at close ranges with respect to the radar or from large radar cross sections. Including targets in the STAP training can result in significant target self-nulling due to mismatches of the target space-time response and the assumed STAP steering vector [13]. In addition, since STAP is typically using reduced degrees of freedom, clutter cancellation performance suffers as a result of devoting a degree of freedom to the target in the STAP training data. Using power-selected training only compounds this problem since strong targets are almost guaranteed to be included in the STAP training. This problem is magnified when these targets are in Doppler bins with low clutter levels. Therefore, a desirable STAP training methodology would be to include strong clutter cells for maximum clutter nulling while recognising that any cells that contain targets need to be excluded from the training. This philosophy is what is behind phase and power-selected STAP training. For a given Doppler cell, clutter is expected to come from a certain angle given by: sin0c =
^_/
(11.12)
where / is the Doppler frequency, k is the wavelength and t>p is the radar platform velocity. Since angle translates to phase across the array, the expected relative phase of clutter is: Ax XJf0 = In—sin0c
(11.13)
where Ax is the physical distance between spatial phase centres. In contrast to ground clutter, moving targets have their own velocity which induces a Doppler frequency shift in addition to that Doppler produced by the motion of the radar platform. The angle of a moving target at the Doppler frequency / is therefore: sin>t= X' f~2vt 2vp
(11.14)
where vt is the radial velocity of the target with respect to the radar. Therefore, an effective means of discriminating between clutter and targets is to examine the phase between elements or phase centres to determine if the return has an angle of incidence consistent with the expected angle of clutter. If not, then this cell should be excluded from the STAP training.
For each potential STAP training cell within each Doppler bin, the phase can be estimated between two phase centres as: \Ir = arctan
(11.15)
where x\ (n) and x2(n) are data samples for the two phase centres on the array in the STAP Doppler bin for time or range sample H. Note that this STAP training method based on both phase and power is fairly robust to target amplitude. Since the accuracy of such an interferometric angle measurement is tied directly to signal-to-noise ratio, strong targets yield a significantly different phase than clutter. Range cells with only a weak target, on the other hand, are rejected for training on the basis of power via the power selection criterion. In addition, this method is computationally inexpensive since it only requires measuring a phase difference across the array for potential STAP training cells. Extensions of this method to other techniques that use angle to identify valid clutter training cells, especially for arrays with a large number of phase centres, is possible. The final STAP weight vector for a Doppler bin is then shared over a large range extent or even by all range cells. Note that target excision from STAP training data based on this phase criterion falls into a larger class of non-homogeneity detectors [7]. The combination of the power and phase selection criteria, however, has a different goal than these other non-homogeneity detectors [7]. These methods, which include the generalised inner product and an initial SMI test statistic metric, remove snapshots from STAP training that differ from the sample co variance matrix formed from the remainder of potential training cells. A relative distance between a potential training cell and the sample co variance is measured using the specified metric. In all of these metrics, however, no distinction is made based on the type of non-homogeneity, i.e. power or phase. Moving targets are removed from the training set based on phase-induced non-homogeneity from the target Doppler. Likewise, a snapshot containing the return from very strong clutter will fail these up-front tests based on the power non-homogeneity. The removal of strong clutter returns from STAP training is in contrast to the goal of power-selected training which seeks to include them. As a result, comparison of these STAP training methods with the phase and power-selection method outlined here is not appropriate since the methods are designed with different objectives.
11.5 Experimental results In this section, the proposed phase and power-based STAP training technique is demonstrated on experimental data collected with an airborne sensor. The aircraft had an altitude of approximately 5500 m and a velocity of 130 m/s. The radar was at X-band ( / = 10 GHz) with a PRF of/ P R = 1338Hz. The CPI length was 131 pulses or approximately 100 ms. The array consisted of three spatial channels formed with uniformly-spaced subarrays or phase centres each with a 0.6 m aperture. The subarrays have an overlap of 50 per cent. The experiment consisted of three ground moving vehicles driving on a road network with the range to the radar of approximately
20 km. The vehicles were a jeep, a pick-up truck and a long tractor-trailer. In addition to the road network, the experiment site also contained several large buildings that provided plenty of opportunity for large clutter discretes. Before looking at actual STAP results, we demonstrate the procedure for using the phase and power selection criterion using experimental data.
11.5.1 Example of phase/power selection We demonstrate the use of power and phase selection criteria for STAP training data with the experimental collection. Figures 11.3 and 11.4 show power and relative phase versus range for Doppler bin 42 (/dopp = —230 Hz) where a 128-point DFT was used for the Doppler filtering stage. The relative phase is measured between the first and third phase centres (subarrays). For this Doppler bin, the expected phase of clutter is approximately 150 degrees. Instead of relying on a predicted phase, however, we can use the median to estimate the clutter phase for this Doppler bin. Since clutter is present at all range cells, targets will produce phase outliers and the median is a robust estimator of mean in the presence of these outliers. An estimated phase from the data is actually more robust than relying on a predicted phase due to uncertainties that might arise in the data, e.g. unknown array orientation such as with aircraft crab. Next, a threshold is set around this median phase. Any samples with a phase outside of the acceptable range of deviations from the median phase are rejected or excluded from STAP training. These deviations in phase are due to either the presence of targets or clutter non-homogeneities. Also, many range gates may have a random phase due to noise when clutter power is weak. Using a tolerance of ±5 degrees, a total of 423
target # 2
power, dB
target # 1
range gate
Figure 11.3 Power versus range gate (Doppler =
-230Hz)
target # 2
phase, deg
target # 1
range gate
Figure 11.4 Relative phase versus range gate (Doppler = -23OHz) out of the 640 available range gates satisfy the phase criterion for inclusion in STAP training for this Doppler bin. Two target returns are actually present in this Doppler bin as illustrated in both Figures 11.3 and 11.4. Since these targets are effectively removed from the training based on phase, they cannot impact STAP performance. Once the phase criterion has been satisfied, power-selected training can be employed to combat strong clutter discretes, using the strongest cells that also satisfied the phase criterion to train STAP weights.
11.5.2 STAP results The STAP algorithm used was the PRI-staggered post-Doppler algorithm applied to the three-channel or phase centre array. A total of 131 pulses per CPI was used for the Doppler filter bank, with a — 65 dB Taylor taper and two staggers of 0 and 3 pulses (128 pulses per stagger). This staggering corresponds to two 128 pulse windows on the 131 pulse CPI similar to Figure 11.2 except that the stagger or delay between the two windows is now three pulses or roughly 2.2 ms. This delay was chosen to correspond to the time delay needed for one of the phase centres to fly into the other with the given PRF and platform velocity, commonly referred to as the displaced phase centre antenna (DPCA) condition. For PRI-staggered STAP, this stagger optimises STAP performance [H]. The array was approximately aligned with the velocity vector of the aircraft, i.e. no aircraft crab. With three phase centres and two staggers, the total degrees of freedom for this post-Doppler STAP algorithm is then 6. A total of 640 range cells was available for STAP weight training.
range gate
output power-to-noise, dB
Using the PRI-staggered post-Doppler STAP algorithm, we examine the performance of various STAP training methods. The performance of the proposed power and phase-based STAP training is compared with simple fixed target-free training and power-selected training. For all three methods, the number of training cells is fixed at 30 out of the total 640 which is five times the six STAP degrees of freedom (approximately a - 1 dB sample matrix inversion loss [12]). The STAP output is in the form of a range-Doppler power map for the beam steered to 0 = 0° (broadside) and is plotted in decibels. Doppler frequency has been converted to a target radial velocity hypothesis (at beam centre) in km/hr. The STAP normalisation is unit gain in the look direction (distortionless response), known as minimum variance distortionless response (MVDR), so that the zero velocity Doppler bin contains clutter at the expected clutter power level. Note that the data has been normalised up front so that the noise power level is set to 0 dB. Target-free training refers to using the last 30 range gates of each Doppler bin to compute STAP weights. The STAP training set is target free since it is known a priori that no targets are in these range gates. In practice, such an assumption cannot be made but these results are included to illustrate performance of STAP training without any considerations of clutter power levels. The outputs for target-free training (last 30 range gates), power-selected training and combined power and phase-selected training are shown in Figures 11.5, 11.6 and 11.7, respectively. Examining the STAP output from the target-free training, we see a large number of clutter discretes making it through STAP, especially at the Doppler bins corresponding to velocities between —30 and —20 km/hr. These range and Doppler cells correspond to the locations of buildings in the experiment producing large clutter discretes. Among these false alarms is also the actual target at —22 km/hr in range gate 280. Other false alarms are
velocity, km/hr
Figure 11.5 Range-Doppler STAP output for training on last 30 target-free range gates (broadside beam)
output power-to-noise, dB
range gate
velocity, km/hr
range gate
output power-to-noise, dB
Figure 11.6 Range-Doppler STAP output with power-only-based training selection (broadside beam)
velocity, km/hr
Figure 11.7 Range-Doppler STAP output with phase and power-based training selection (broadside beam) produced at approximately range gate 270 and velocity 18 km/hr and throughout the 6-7 km/hr Doppler bins. The remaining two targets are at range gate 300 and Doppler bin - 1 0 km/hr and range gates 205-225 and Doppler bins - 5 to - 1 0 km/hr. This last target is the tractor-trailer (extended in range) that is in a turn during the CPI. The different parts of this extended target produce different radial velocities and therefore spread over multiple Doppler bins in STAR Clearly, this high number of false alarms
is undesirable with more false alarms than actual detections. A high number of false alarms creates problems for subsequent tracking of target detections. For the power-selected training, the false alarms are cleaned up almost completely. However, since strong targets have been included in the STAP training, these targets have experienced a significant amount of self-nulling and clutter cancellation performance in these Doppler frequency bins/velocity hypotheses has been severely degraded. In fact, the target at —22 km/hr in range gate 280 has been almost completely removed. Also, for each of the Doppler bins that included targets in the STAP training, performance is degraded across all range due to the target self-nulling. The entire bin is corrupted from the high white noise gain of the STAP weights [14].6 By contrast, using both phase and power for STAP training selection criteria has successfully exposed all three targets by excluding them from the STAP training. Performance against false alarms due to clutter discretes is similar to power-selected training. Again, the three targets are at approximately —22 km/hr and range gate 280, — 10 km/hr and range gate 300, and between —5 and —10 km/hr and between range gates 205 and 225. Combining phase-based target excision from STAP training with power-selected training ensures adequate nulling on strong clutter returns helping STAP to realise the improved suppression of clutter discretes from power-selected training while avoiding the performance degradations associated with including the targets in the STAP training.
11.5.3 Experimental versus theoretical STAP performance Lastly, we compare the performance of STAP achieved on experimental data with the proposed phase and power-based STAP training to that predicted from theoretical STAP for a clutter-to-noise ratio of 25 dB (estimated from the data). Both the theoretical (dashed line) and experimental (solid line) SES[R losses, shown in Figure 11.8, reflect that the loss incurred by the Doppler taper (—65 dB Taylor) is approximately —3 dB. SINR loss is estimated from the experimental data using 200 target-free range gates. The result shows good agreement between experimental and theoretical SINR loss. A slight widening of the SINR loss notch at 0 km/hr is seen in the experimental data. Possible causes are array and channel errors as well as slight over nulling at low velocities. Note that internal clutter motion due to wind-blown clutter is not considered to be a likely culprit since the experiment site lacked any significant vegetation.
11.6
Summary
A new method for the training of STAP adaptive weights has been introduced and demonstrated on experimental data collected with an airborne sensor. The method 6
Note that, in general, white noise gain control methods are not usually considered for STAP since low MDV usually implies mainbeam nulling. White noise gain limits would hinder performance against low velocity targets
HP 'ssoi ^NLIS
Theoretical Experimental radial velocity, km/hr
Figure 11.8 Comparison of experimental (solid line) and theoretical (dashed line) SINR loss versus target velocity hypotheses uses a phase-based criterion, as well as power, to choose samples for STAP training and is considered as an improvement to power-selected training for post-Doppler STAP algorithms. The use of phase effectively removes targets whose inclusion in the training results in target self-nulling and a degradation in clutter cancellation performance. The new technique was compared with power-selected training on experimental data collected with an airborne radar using the PRI-staggered STAP algorithm. The performance improvement achieved over the power-selected training is significant as the target signals were properly excluded from the training set, yet the method still achieves effective mitigation of large clutter discretes. In addition, SINR loss estimated from the STAP applied to the experimental data was compared with a theoretically predicted SINR loss, showing good agreement over all Doppler frequencies.
References 1 WARD, J.: 'Space-time adaptive processing for airborne radar'. MIT Lincoln Laboratory TR 1015, ESC-TR-94-109, 1994 2 WARD, J.: 'Space-time adaptive processing for airborne radar'. Proceedings of IEEE international conference on A coustics, speech, and signal processing, 1995, pp. 2809-2812 3 KLEMM, R.: 'Principles of space-time adaptive processing' (IEE, London, England, 2002, 2nd edn.)
4 BORSARI, G. K.: 'Mitigating effects on STAP processing caused by an inclined array'. Proceedings of IEEE Radar conference, 1998, pp. 135-140 5 KREYENKAMP, O. and KLEMM, R.: 'Doppler compensation in forwardlooking STAP radar', IEE Proc, Radar Sonar Navig., October 2001, 148, (5), pp.253-258 6 HIMED, B., ZHANG, Y., and HAJJARI, A.: 'STAP with angle-Doppler compensation for bistatic airborne radar'. Proceedings of IEEE Radar conference, Long Beach, CA, 2002, pp. 311, 317 7 MELVIN, W. L. and WICKS, M. C : 'Improving practical space-time adaptive radar'. Proceedings of IEEE Radar conference, 1997, pp. 48-53 8 BORSARI, G. K. and STEINHARDT, A. O.: 'Cost-efficient training strategies for space-time adaptive processing algorithms'. Proceedings of Asilomar conference on Signals, systems, and computers, Pacific Grove, CA, 1995, pp. 650-654 9 RABIDEAU, D. J. and STEINHARDT, A. O.: 'Improved adaptive clutter cancellation through data-adaptive training', IEEE Trans. Aerosp. Electron. SySt., 1999, 35, (3), pp. 879-891 10 BRENNAN, L. E., MALLETT, J. D., and REED, I. S.: 'Adaptive arrays in airborne MTI radar', IEEE Trans. Antennas Propag., September 1976, AP-24, pp. 607-615 11 WARD, J. and STEINHARDT, A. O.: 'Multiwindow post-Doppler space-time adaptive processing'. Proceedings of 7th IEEE workshop on Statistical signal and array processing, Quebec City, 1994, pp. 461^-64 12 REED, I. S., MALLETT, J. D., and BRENNAN, L. E.: 'Rapid convergence in adaptive arrays', IEEE Trans. Aerosp. Electron. Syst., 1974, AES-IO, (6), pp. 853-863 13 COX, H.: 'Resolving power and sensitivity to mismatch for optimum array processors', J. Acoust. Soc. Am., 1973, 54, pp. 771-785 14 COX, H., ZESKIND, R. M., and OWEN, M. M.: 'Robust adaptive beamforming', IEEE Trans. Acoust. Speech Signal Process., 1987, 35, (10), pp. 1365-1376
Chapter 12
Application of deterministic techniques to STAP Jeffrey T. Carlo, Tapan K. Sarkar, Michael C. Wicks and Magdalena Salazar-Palma
12.1
Introduction
This chapter presents a deterministic least-squares adaptive signal processing technique for nulling interferers and estimating signals of interest (SoI) in the presence of clutter and transient electromagnetic interference. Unlike stochastic methods which rely upon auxiliary training data to estimate the statistics of the interference, in order to place nulls, this approach operates on a snapshot-by-snapshot basis to determine adaptive weights. Due to the reduced training data set (one range ring only - the range ring under test) and because of the particular least-squares techniques employed, this approach can readily be implemented in real time in fleld-deployable sensor signal processing systems. From a scientific standpoint, a deterministic method provides an estimate which comes closer to the Cramer-Rao bound than do the results provided by a stochastic methodology [I]. Another advantage of this method over conventional stochastic methods is that it operates effectively in highly non-stationary environments and is effective against fluctuating interferers as well1. In fact, this may be its greatest strength. The deterministic least-squares adaptive signal processing technique introduced here has application in many fields, including: communications, radar, sonar, medical imaging etc. The application addressed in this chapter is airborne radar, of which a description can be found in the literature [2,3]. In airborne radar surveillance systems, the detection of air and ground targets is complicated by many factors, as illustrated in Figure 12.1 [3]. In this Figure, the target signal competes with sidelobe clutter at the same Doppler frequency, in addition to the mainlobe clutter at the same angle, and jammers at potentially many different discrete angles, but spread over a wide band of frequencies. Changes in the velocity of the
Interferers fluctuating yet coherent with the radar signal
Figure 12.1 An airborne radar signal environment [3] airborne radar platform also impact the system performance. The Doppler spread of clutter is a function of platform velocity and the cosine of the angle of arrival (AoA) of the clutter energy with respect to the velocity vector of the platform. Therefore, an increase in the velocity of the platform reduces the clutter-free spectrum of the system and will degrade the minimum discernible velocity (MDV), which complicates the detection of slower moving ground targets. Compounding this are near-field airframe effects, in addition to size, weight and power constraints imposed by the platform, which limit the power aperture product of the radar (radar range equation effects). In addition to uncertainties introduced by the platform, the radar system must also contend with a severely non-stationary environment (consisting of jammers, multipath, clutter and discrete blinking interferences) while attempting to detect targets such as a low observable (LO) aircraft or objects employing concealment, camouflage and deception (CCD). One simple, albeit expensive, way to design a system that will detect these targets of interest is to have a very large aperture with very low sidelobes. This would provide a narrow search beam on transmit/receive, and low sidelobes to help mitigate interference. Beyond size constraints, one problem with this solution is that a low-cost, lightweight, air qualified array would be challenging to design and build and would be very expensive. Another solution, and the one discussed here, is to use adaptive space-time processing to suppress the interferences and enable the system to detect potentially weak target returns. In this analysis, we assume that the system consists of a linear array of A^ equally spaced antenna elements as shown in Figure 12.2 (with a digital receiver behind each element). Methods for handling directive antenna elements in a conformal array, including the effects of mutual coupling and near-field effects, are available elsewhere [1] and will not be discussed in this presentation. Here, d is the spacing between the antenna elements and O is the angle of incidence of the various signals impinging on the array with respect to the normal (the aperture is collinear with the platform velocity vector). Even though we are dealing with isotropic point radiators in this
Mh element
Figure 12.2 A uniform linear antenna array
pulses
snapshot
range bins
Figure 12.3 A radar data cube scenario, use of directive antenna elements can also be employed with a deterministic least-squares methodology, and is discussed elsewhere [1,4]. This configuration is comparable to that of a planar array, where elements in a column are combined using an analogue beamformer and, in turn, the output of the analogue beamformer is down converted and digitised. This analysis can easily be extended to the case where there is a digital receiver behind every element in the planar array. We also assume that the system processes a number of pulses (M) within a coherent processing interval (CPI). Each pulse repetition interval (PRI) consists of the transmission of a pulsed waveform of finite bandwidth and the reception of the reflected energy by the radar aperture and receiver(s), with a bandwidth matched to that of the radar pulse. In the receive chain, measured signals are down converted, matched filtered and stored, either digitally (IF sampling) or in analogue (baseband digitisation). In this manner, complex samples (/ and Q) are generated at R range bins for M pulses and at N elements (channels). To facilitate working with the array output the baseband samples can be arranged into a three-dimensional matrix commonly referred to as a data cube, as illustrated by Figure 12.3. The three axes of the data cube correspond to the spatial channel, pulse and range dimensions. At a particular range bin, n, the sheet or slice of the data cube is referred to as a space-time snapshot or simply the snapshot, as seen in Figure 12.3. We assume that the signals entering the array are narrowband, with a planar phase front, and consist of an SoI along with interference plus noise. The noise
(thermal noise) originates in the receiver and is understood to be independent across elements and pulses. The interference is external to the receiver and consists of clutter (reflection of the transmitted electromagnetic energy from terrain and sea, the atmosphere of the earth, including weather phenomena, etc.), jammers, signal multipath (due to the SoI, clutter, and/or jamming) and mutual coupling effects. We assume that for each jammer, the energy impinging on the array is confined to a particular direction of arrival (DoA) and is spread in frequency. The jammers may be blinking. The received complex envelope of the SoI at the nth element corresponding to the rath pulse is given by: a s e x p M 2 7 r n - s i n ( 0 ) + 27rm— M
(12.1)
where /d is the Doppler frequency of the signal, / r is the pulse repetition frequency (PRF), ofs is the complex amplitude and A is the wavelength at the centre frequency of the radar. Having stored the complex space-time-range samples, the adaptive signal processing algorithms can now be initiated. Traditionally, a range-dependent statistical approach to adaptive array processing has been taken [2,3,5-9]. Earlier approaches adjusted the spatial weights iteratively in an attempt to either maximise the signalto-noise ratio (SNR) [5] or minimise a mean square error [6]. Brennan and Reed showed [7] that in the case of Gaussian interference the space-time filter weights that maximise the probability of detection for a fixed false alarm rate, are given by: w
= k'№~1 -v*
(12.2)
where A: is a complex constant, 3fc is the covariance matrix of the interference, v is a steering vector and * represents the complex conjugate. This joint-domain (elementpulse) optimum processor requires knowledge of the statistics of the interference. Since these statistics are not known a priori in a fielded radar system, especially one in which the interference is rapidly changing, an adaptive approach is needed. One adaptive method, the sample matrix inversion method (SMI) [5], involves estimating the interference covariance matrix from measured radar data. This is achieved using the element-pulse returns from adjacent range rings which are assumed to be independent and identically distributed (HD) to the interference in the cell under test. This method of estimation uses the sample covariance matrix [8], which is written as: 1
n
»= -Ex^
( 12 - 3 >
where r\ is the number of range samples and x r is the received vector for the rth range ring. Here it is assumed that the training data does not contain any signal energy. It has been estimated [5] that approximately r] = 2£ HD samples are required to accurately estimate the covariance matrix, where £ is the order of the covariance matrix. It has been pointed out [3] that the number of required samples can be as high as three to five times £, depending on the environment. In an airborne surveillance radar, the clutter environment can be highly non-stationary and the aperture quite
large (in order to resolve long-range targets in angle). As such, the requirements on the training data sample size and HD conditions can severely limit the applications of such a methodology to radar. For this reason, researchers have investigated alternative approaches, which require less training data. One class of alternative processor to joint domain SMI is the factored or cascaded signal processing approach. In these algorithms, the raw data is Doppler processed first, and then adaptive beamforming is applied, or alternatively beamforming is followed by adaptive Doppler processing. These approaches perform one-dimensional processing sequentially (with adaptive processing in only the second dimension). This results in nulls being placed across the other (second) dimension. A partial taxonomy of various space-time processing methods is identified in the literature [3]. There is debate over which factored approach is superior. For the three different cases evaluated by Wang and Cai [9] the factored approach performed significantly poorer, or at best, equally to the joint domain approach, but under certain restrictive conditions. This poor performance by the factored approach, which requires a significant amount of secondary data, motivated Wang and Cai to develop an algorithm which operates in the joint transform (angle-Doppler) domain across a smaller dimensional space. This algorithm, the joint domain localised (JDL) processor, was thefirstpractical algorithm to show significant improvement over the factored approach. This JDL processor still requires training data, although significantly less than for other approaches. Another shortcoming of the factored approach includes poor performance in non-stationary environments. While many researchers were investigating suboptimum space-time adaptive approaches which relies upon estimating the underlying statistics using auxiliary training data, other researchers investigated deterministic ways to suppress interference. Luthra [10] proposed an algorithm which could null K-I coherent jammers, using data from N = 2 K — 1 elements (spatial information) while maintaining a fixed gain along the desired look direction. This approach was improved upon by Sarkar and Sangruji [11] using a new and simpler data matrix, and solved for the adaptive weights using the conjugate gradient method (CGM). Further improvements were made by Schneible and Park [12,13], where they implemented multiple constraints to preserve the SoI which might arrive slightly off the look direction (steering vector). Then Sarkar and coauthors [14-16] presented two more variations to this approach, the backward processor (BK) and the forward-backward (FB) processor. These algorithms were extended to the space-time domain, where Park and Sarkar [13,21] presented a generalised eigenvalue processor and Sarkar et al. [14-16] implemented the direct data domain least-squares (D3LS) processor.
12.2
Direct data domain least-squares (D3LS) approach, one dimension
The one-dimensional D3LS forward algorithm, developed in Reference 11, uses element space data and a look direction constraint to null out interference and estimate
the amplitude of the SoI. This algorithm is capable of nulling both coherent and noncoherent jammers/interferers. The number of nulls that can be placed is (N — l)/2, where N is the number of the receive antenna elements. A brief review of this method is provided below. Given the Af-element uniform linear array of Figure 12.2, assume that a desired sinusoidal signal arrives at an angle Os degrees from broadside (measured from the array normal) with amplitude of as. In addition, K — 1 far-field coherent jammer signals enter the array with angles of arrival #; and amplitude(s) <2/, i — 1,2,..., K-I, and clutter and thermal noise signals are also present. In this algorithm, array weights are computed based upon the samples taken across the array for a given pulse and range ring. Let Xn represent the baseband received signal (SOI, jammers, clutter and noise) arriving at the nth antenna element and is given by: K-I
{
d
\
Xn = a, eJWnd/k) sin(es) + J^ at exp f j2n — Sm(O1) J + cn
(12.4)
where Cn is the contribution due to clutter and thermal noise. The strength of the SoI, a s , is the desired unknown parameter which will be estimated using the snapshot of data, x, collected at time t. The vector of complex voltages measured at time t at the N elements of the array can then be represented as: x T = [Xl
X2
...
xN]
(12.5)
where T denotes the transpose of a matrix. Let X be a matrix, which contains data from the N elements of the vector x defined in equation (12.5). Let us define another matrix S whose elements comprise the complex voltages received at the antenna elements due to unity amplitude coming from the direction #s. The actual complex amplitude of the desired signal is a, which is to be determined. Then we form the matrix pencil using these two matrices, which are defined by: X-a-S
(12.6)
where
(12.7)
(12.8)
and the components of S, are defined by: (12.9)
Equation (12.6) represents the contribution of all the undesired signals, due to clutter, thermal noise, coherent multipaths and other interferences (i.e. all the undesired components except the signal). In the following method K = (N + l)/2 degrees of freedom {when Af is assumed to be odd} can be used to null the interference. One could form the undesired noise power from equation (12.6) and estimate a value of a by using a set of weights w, which will minimise the noise power. This results in [14-16]: (X-Qf-S)-W = [O]
(12.10)
Alternately, one can view the left-hand side of equation (12.10) as the total noise signal at the output of the adaptive processor due to interferences and thermal noise: N out - Q - W = { X - o f - S j - w
(12.11)
Hence, the total undesired power is given by: Undesired = WW{X - a • S]" • {X - « • S}w
(12.12)
where the superscript H denotes the conjugate transpose of a matrix. The objective is to set the undesired power to a minimum by selecting w for a fixed value of the signal strength a. This yields the generalised eigenvalue equation given by equation (12.10). Therefore: X - w = Qf-S-w
(12.13)
where of is given by the generalised eigenvalue and the weights w are given by the generalised eigenvector. Even though equation (12.8) represents a K x K matrix, the matrix S is only of rank one. Hence, equation (12.13) has only one eigenvalue and that generalised eigenvalue is the solution for the SoI. For real-time applications, it may be computationally burdensome to solve the reduced rank generalised eigenvalue problem efficiently, particularly if the number of weights and the dimension K are large. For this reason we convert the solution of a non-linear eigenvalue problem in equation (12.13) to a linear matrix equation which can be solved more efficiently. We observe that the first and the second elements of the matrix Q in equation (12.11) are given by: Qx9X
= Xi - o f -sx
(12.14«)
Ig 12 = X 2 -Qf-S 2
(12.146)
where xi and X2 are the voltages received at antenna elements 1 and 2 due to the signal, interferences and thermal noise, and si and S2 are the desired signals at these same elements due to an incident signal of unity strength. Defining: (12.15)
where Os is the angle of arrival corresponding to the desired signal. Then the following difference equation: Qi?i— Z" 1 • Q12 = undesired signals (= interference + noise)
(12.16)
removes the contribution due to the SoI, since: (12.17a) (12.176) Therefore one can form a reduced rank matrix [V](K-i)xK generated from Q. This matrix can then be set equal to a null vector to minimise the interference and noise:
(12.18) This matrix consists of N — K cancellation rows, for N = 2 • M — 1. In order to make the matrix full rank, we fix the gain of the subarray by forming a weighted sum of the voltages Ylf=\ w*x/ along the direction of arrival of the SoI. The gain of the subarray is C in the direction of Os. This provides an additional equation resulting in:
(12.19) or, equivalently: F w= y
(12.20)
Once the weights are solved using equation (12.20), the signal component as may be estimated from: (12.21) The proof of equations (12.20) and (12.21) is provided in Reference 11. As noted in Reference 14, equation (12.20) can be solved very efficiently by applying the fast Fourier transform (FFT) and the conjugate gradient method (CGM), which may be implemented to operate in real time utilising, for example, a DSP32C signal processing chip [17,18].
To solve equation (12.20), the conjugate gradient method starts with an initial guess wo for the solution and generates [18]: P 0 = -b-i • ¥H • r 0 = -b-i¥H
• {F • W0 - y}
(12.22a)
At the nth iteration the conjugate gradient method develops:
(1222b) (12.22c)
(I222d) (\222e) (12.22/) The norm is defined by: HF-PK II2 = p n " . F " . F . p n
(12.23)
The above iterative procedure continues until a prespecified error criterion is satisfied. In our case, the error defining the stopping criterion for the iteration is given by: (12.24) where £ denotes the effective number of bits of data associated with the measured voltages. Hence, the iteration stops when the normalised residuals are of the same order as the error in the data. The computational bottleneck in the application of the conjugate gradient method involves the various matrix vector products. That is where the FFT plays an important role as the equations involved exhibit a Hankel structure and therefore use of the FFT reduces the computational complexity by an order of magnitude without sacrificing accuracy [18]. The matrix vector products are done between a vector and the system matrix F which has a Hankel structure. The Hankel nature of a matrix always arises when one is performing a convolution. The conventional matrix-vector product usually requires an order of magnitude greater computation time than when performing the matrix-vector product using the FFT when the system matrix has a Hankel structure. The details are available in Reference 1. One advantage obtained using the conjugate gradient method arises out of the fact that the iterative procedure will converge even if the matrix F is singular. As such, it is suitable for real-time applications. Second, the number of iterations taken by the conjugate gradient method to converge to the solution is dictated by the number of independent eigenvalues of the matrix F. This often translates into the number of dominant signal components in the data. Thus, the conjugate gradient method has the advantage of a direct method as it is guaranteed to converge after a finite number of steps, as well as that of an iterative method, since the round off and truncation error in the computation are limited only to the last stage of computation (iteration).
Next we illustrate how to increase the degrees of freedom (DoF) associated with equation (12.20). It is well known in the spectral estimation literature that a sampled sequence can be estimated by observing it either in the forward direction (FW) or in the reverse direction [19,20]. This latter approach we term as the backward (BK) model as opposed to the forward model outlined above. If we now conjugate the data and form the reverse sequence, then we get an equation similar to equation (12.20) for the solution of weights w^ as:
(12.25)
where * denotes the complex conjugate, and C is the corresponding look direction constraint. Equivalently: B • wb = y b
(12.26)
The signal strength a can again be determined by: (12.27) Note that in both equations (12.20) and (12.26), dealing with the matrices F and B, we have K = (N + l)/2, where Af is the total number of antenna elements. We can increase the number of weights significantly by combining these approaches to produce the forward-backward model. In this way we exactly double the amount of data by considering the data not only in the forward direction but also conjugating it and reversing the increment direction of the independent variable as well. However, the number of weights increases only by a half. This approach can be applied so long as the series to be estimated can be approximated by an exponential function with a purely imaginary argument. This is always true in the adaptive array case. There is an additional benefit as well. For both the forward and the backward method, the maximum number of weights we can consider is (N + l)/2. Hence, even though all the antenna elements are being utilised, the number of DoF available is essentially half the number of antenna elements. For the forward-backward method (FB), the number of degrees of freedom can be significantly increased without increasing the number of antenna elements. For this case, the number of degrees of freedom [1] A'7 can reach (N + 0.5)/1.5. It is important to note, that for any method, be it deterministic or stochastic, this is the largest number of coherent interferers one can cancel using any method. For the stochastic method one needs a block of data to form the covariance
matrix where the same goal is achieved using a single snapshot. Thus one can have approximately 50 per cent more DoF for the FB case than for either of the two FW or BK cases. The equation in the combined forward-backward model is obtained using equation (12.20) and (12.26) to be:
(12.28)
or, equivalently: FB w ft = y ft
(12.29)
Once the increased degrees of freedom are used to compute the weights, the complex amplitude for the signal of interest is determined from equation (12.21) or (12.27), where in the summation K is replaced by the new degrees of freedom K''. Also, as before, the matrix FB now has a block Hankel structure. Thus we have illustrated how the direct data domain least-squares approach can be implemented in real-time using a single snapshot of data. For a stochastic method, when multiple steering vectors are available, one does not have to invert the covariance matrix as a simple multiplication is all that is necessary to get the desired weights from equation (12.2). In this case, one can carry out a similar procedure but the computations are involved, as the steering vector is part of the system matrix. Therefore, one can partition the system matrix into blocks and invert the dominant block of the system matrix which is still block Hankel and can do similar cost saving computations.
12.3
D3LS approach with main beam constraints
The D3LS approach defined in the last section is very effective in nulling interferers and estimating the SoI when there are (N + l)/2 interferers or less and the SoI arrives at exactly 6S, the look direction. In a radar system the main beam of the antenna is pointed in various discrete directions in search of targets. The target may not be at that exact angle. Also, due to atmospheric refraction, diffraction or vibration in the array, the assumed direction of arrival for the SoI may be in error. When the SoI
arrives slightly off the look direction, 0S, then the performance of the processor will be significantly degraded. One way to improve this is to add multiple constraints within the main beam [12], as a direct data domain method is a superresolution method. The manner in which this is done differs between the generalised eigenvalue processor and the least-squares process, but the effect is the same. In both cases, the additional constraints prevent the algorithm from treating the SoI as an interferer when the SoI arrives off the look direction, but within the a priori constraints of the adapted receive beampattern. Multiple constraints can be added to the generalised eigenvalue processor by modifying the S matrix [13]. The single constraint for this processor is given by equation (12.9), where the look direction is# s . Let#i = 0S, then additional constraints, #2? #3 etc., can be included by altering Sn such that: (12.30)
where L is the number of constraints. It is assumed that the weighting amplitude of each constraint is unity. Using equation (12.30) we can now form the generalised eigenvalue equation with multiple constraints as: (X-QfS 7 J-W = [O]
(12.31)
where the KxK matrix X is constructed using equation (12.7) and the KxK matrix S7 is constructed using equations (12.8) and (12.30). Multiple constraints can also be implemented in the D3LS approach [12,15]. Recall that the single look direction constraint is implemented in the first row of matrix F in equation (12.19). Additional constraints can be incorporated by adding multiple constraint rows to F and replacing the corresponding zeros in the excitation vector y equation (12.19) with non-zero entries. The format of these constraint rows is given by:
[l
Z1
Z\
Z3
...
Zf~l]
(12.32)
where I = 1 , . . . , L is the index number of the constraint. The format of the cancellation rows of the matrix F remains the same as in equation (12.19). For each constraint row added, a cancellation row must be removed so that the matrix F remains square. As defined earlier, the number of cancellation equations is given by N — K, where N is the number of elements and K is the number of weights. By adding L constraint equations the relationship between L, K and N becomes: K = L + N-K
=* N = 2K-L
(12.33)
Given a system with K weights and L constraints, the number of jammers that can be nulled is (K — L)/2. The price the processor pays for the additional constraints is that fewer jammers may be nulled for a given number of antenna elements N.
12.4
A D3LS approach with main beam constraints for space-time adaptive processing
The adaptive algorithms that have been presented up to this point have used data from a single space snapshot, which consists of samples from the antenna elements across the array at a particular instant in time. In this section we present four different algorithms that operate across pulses and elements, increasing the degrees of freedom over those of the element domain techniques. The first processor that will be described implements a generalised eigenvalue equation [1,16], and the last three processors implement a least-squares solution to a linear matrix equation [1,16]. Using the linear array defined in Figure 12.2, we now take advantage of the temporal diversity provided by the coherent pulses within a CPI, as well as the spatial diversity provided by the elements of the array. Using this data from a given range bin r as illustrated in Figure 12.3, the adaptive weights can be obtained deterministically, on a snapshot-by-snapshot basis, and implemented in the space-time architecture shown in Figure 12.4. Assuming a narrowband signal, the complex envelope of the received SoI with unity amplitude, for the mth pulse at the nth antenna element can be described by: (12.34)
antenna elements
receiver A/D
coherent pulses
receiver A/D
Figure 12.4 A two-dimensional space-time filter
for m = 1,2, . . . , M and n = 1,2, . . . , N. In this equation
(12.35)
where as is the complex amplitude of the SoI entering the array and S(m,n) is the received space-time snapshot due to only the SoI but of unity amplitude only. For each pulse-element cell (given range bin r) the difference equation: X(m9n) -asS(m9n)
(12.36)
removes the SoI from the nth antenna element and mth pulse data, leaving noise plus interference. Based on equation (12.36) a two-dimensional matrix pencil equation can be created whose solution will result in a weight vector that nulls interferers and extracts the SoI. The elements of this matrix pencil can be constructed by sliding a window (box) over the space-time snapshot as shown in Figure 12.5. By creating a vector using the elements in the window, each window position generates a row in the S and X matrices resulting in:
(12.37)
(12.38)
The window size along the element dimension is Na, and along the pulse dimension, Nt. Selection of Afa determines the number of spatial degrees of freedom, and Nt determines the temporal degrees of freedom. 7Va and Nt must satisfy the following
pulses
window #
elements
Figure 12.5 A space-time data snapshot equations for the single constraint case:
Wa < ^p-
(12.39)
N1 < Z±±
(12.40)
The product of the spatial and temporal degrees of freedom, Q, is: Q = NaxNt
(12.41)
Given the physical constraints (size, weight, power etc.) of the platform, most airborne radar systems contain more temporal degrees of freedom than spatial.
12.4.1 Space-time D^LS eigenvalue processor Using equations (12.37) and (12.38), the two-dimensional generalised eigenvalue system can then be constructed as: {X - as • S}w = [0]
(12.42)
Once again, the generalised eigenvalue provides the amplitude of the SoI, while the generalised eigenvector provides the weights necessary to suppress the interferers. Additional look-direction constraints can be added to this processor by modifying the signal matrix, S, in a manner similar to that for equation (12.30), except now constraints can be offset from the look direction in angle or Doppler, or both. The
constraint rows can be written as: (12.43) for n = 1, . . . , N and m = 1, . . . , M. Here Oi and ft determine the space-time look direction of the €th constraint. The generalised eigenvalue equation with multiple constraints can then be defined as: {X - a • S"} • w = [0]
(12.44)
where the Q x Q matrix X is constructed from equation (12.38) and the matrix S" is constructed by replacing Sm?n in equation (12.37) with S^ n in equation (12.43).
12.4.2 Space-time Z)3 LS forward processor The formulation of the D3LS space-time algorithm [17] can be obtained through an extension of the one-dimensional case. As before, we start by developing the forward case, and then present the backward and the forward-backward algorithms. For all the cases, the matrix equation to be solved is defined by: F w= y
(12.45)
where F is the system matrix and w is the vector of space-time weights which will null the interferers. The system matrix, F, contains the angle-Doppler information of the SoI, as well as the cancellation rows which contain angle-Doppler information describing the interference. This interference information is obtained through a difference equation similar to equation (12.16), where the contribution of the SoI is removed leaving only the undesired signals. For the two-dimensional case these difference equations are performed with elements offset in space, in time and jointly in space and time. Let us define the information off-set of the SoI in space and time, respectively, as: (12.46) (12.47) Again, the SoI has an angle of arrival of Os and a Doppler frequency of /d. The three types of difference equation for the FW, BK and the FB methods are then given by: (12.48) (12.49) (12.50) The cancellation rows of the matrix F can now be formed using equations (12.48) to (12.50) and using the windowing procedure depicted in Figure 12.5. In this case the dots in Figure 12.5 represent the induced voltages, X(m, n) in equation (12.35), for a
example: computing a cancellation row for Na = 3 and Np = 4 ^window 1
^ window 2 ^ 1
»
converting to a row vector
Figure 12.6
Creating a cancellation row in matrix [T]
given element-pulse location. Just as was done for the generalised eigenvalue algorithm, a space-time window is passed over the data. For each location of the window function, three rows in matrix F are formed by implementing equations (12.48) to (12.50), which removes the SoI. The rows are created by performing an element-byelement subtraction using the values contained within the offset windows and then arranging the resulting data into a row vector as illustrated in Figure 12.6. The window is then displaced one space to the right and three more rows are generated, and this procedure is continued along the space and time axis. After this window has reached the second column from the far right and three rows are generated, the window is reduced (by a row) and shifted to the left-hand side of the data array. The generation of rows continues, and is repeated until (Q-I) cancellation rows have been formed, where Q = Na x Nt. As in the one-dimensional case, the first row of the system matrix is used to set the gain of the system along the look direction. For the two-dimensional case the look direction is specified by the angle of arrival, 6S, and the Doppler frequency, /d, of the SoI. The elements of this row can be obtained by placing window #1 in Figure 12.5 over the matrix S of equation (12.37). Using the same method as above (Figure 12.6) the elements of the row constraint can then be properly arranged. This results in the following row vector:
(12.51)
In a similar manner, the elements of the weight vector, w, in equation (12.45) are arranged in the same fashion as the elements in the row constraints, except that the vector is a column vector. Setting the product of F and w equal to a column vector y then completes the matrix equation. The first element of y consists of the complex gain constant C, and the remaining Q-I elements are set to zero in order to complete the cancellation equations. The resulting matrix equation is:
(12.52)
In solving this equation one obtains the weight vector w which places space-time nulls along the direction of the interferers while maintaining a fixed gain in the direction of the SoI. The amplitude of the SoI can now be estimated from: (12.53) The above analysis is done for the single constraint case. As in the one-dimensional case, the SoI for the two-dimensional case could arrive at the antenna slightly offset in angle, in Doppler, or both. In order to prevent the processor from nulling the SoI, multiple constraints can be implemented. The added constraints would reduce the number of degrees of freedom, but given the antenna beam width and the Doppler filter width, the constraints could help to maintain system gain over this finite look direction. In a manner similar to that of the single constraint case, additional control can be implemented using L row constraints where the look direction of the €th row is determined by Og and ft. For the €th constraint we have: (12.54) (12.55) and the £th row of F, denoted as F (€,:), becomes:
(12.56)
12.4.3 Space-time D^LS backward processor A second direct data domain least-squares space-time processor can be implemented by conjugating the element-pulse data and processing this data in reverse. The basis for this algorithm, the backward two-dimensional processor, is given in References 16, 19 and 20. The form of this linear matrix equation is similar to that of the forward
algorithm and is given by: B-wb=yb
(12.57)
where B is Q x Q, and w and y are both Q x 1. The constraint rows in B are implemented in the same manner as the constraint rows in F. The difference between B and F is in the cancellation equations. For the backward method, these equations are formed by first conjugating the space-time snapshot illustrated in Figure 12.5. Then using a windowing procedure similar to that for the forward method three cancellation rows are generated for each position of the window, except that now the window starts in the lower right-hand corner of the space-time snapshot of Figure 12.5. This window is then moved to the right and up the snapshot. The three difference equations used to cancel the SoI are given by: (12.58) (12.59) (12.60) where X* is the complex conjugate of X. Using equations (12.58) to (12.60) the SoI is removed from the windowed data. This data is then converted from a matrix to a vector, as shown in Figure 12.6. Then the elements of the vector are reversed in order, to form the rows of B. A vector similar to y used in the forward method is also used in the backward method. For the single constraint case the only non-zero entry in yb will be the first element, which again is set to an arbitrary constant C The weight vector w b can now be solved by substituting B and yb into equation (12.57) and applying the conjugant gradient method. The estimate of the amplitude of the SoI is then obtained from: (12.61) For systems where the AoA and Doppler frequency of the SoI are not exactly known, but approximately known (e.g. within the mainbeam of the antenna and within a given Doppler filter), then multiple constraints can be implemented to preserve the SoI. The procedure for doing this is identical to that for the forward method. For each additional constraint a new equation replaces a cancellation equation in B and the corresponding amplitude is placed in y b . The constraint equations are generated using equation (12.56).
12.4.4 Space-time D 3 LS forward-backward processor The forward and the backward methods described in the previous sections both provided independent solutions for the problem through equation (12.52) and (12.57), using Q = Na x Nt degrees of freedom. The third D3LS processor, the forwardbackward processor, provides another independent solution to the space-time adaptive problem. By applying the data in both forward and backward manners,
one can increase the degrees of freedom by as much as 50 per cent over that of the forward or backward methods alone. Once again, one can use the samples from N antenna elements and M time pulses to formulate the following matrix equation: FB-Wfb^yfb
(12.62)
The system matrix FB consists of at least one constraint row and numerous cancellation rows. The constraint rows preserve the SoI during the adaptive process by defining multiple 0s — /d look direction constraints in which to maintain the SoI, as defined in equation (12.56). The remaining rows in FB consist of the cancellation equations that are formed in the forward and backward directions. The forward direction cancellation equations are defined by equations (12.48) to (12.50), and those for the reverse direction are given by equations (12.58) to (12.60). The important thing to remember is that for a given number of antenna elements TV and pulses M the selection of Na and N't for the FB method can be significantly larger for the forward-backward algorithm than Na and Nt used in either the forward or backward algorithm. Following the steps similar to the forward and backward methods, a non-zero element is placed in the vector yft, for each constraint row. Using the conjugant gradient method, the matrix FB and vector yfb, the weight vector Wft, in equation (12.62) can be determined. Having obtained Wfb, an estimate of the amplitude of the SoI can be computed using equatoin (12.53), with Na and Nt replaced by Na and N[9 respectively, and w by Wft>.
12.5
Determining the degrees of freedom
The spatial and temporal degrees of freedom (DoF) for the three direct data domain least-squares algorithms are constrained by the number of antenna elements and pulses, TV and M, respectively. Recall that for each given box position in Figure 12.5, three cancellation equations can be generated. Also, the number of possible box positions can be determined from the number of shifts in each dimension. Let Sa be the number of shifts in the spatial dimension and St the number of shifts in the temporal dimension, such that: N-Na
= Sa
(12.63)
M-Nx
= St
(12.64)
This results in a total of G possible box positions where: G = Sa x Sx
(12.65)
For both the forward and the backward methods the maximum number of cancellation equations possible is given by Nc: Nc = 3xG
(12.66)
Meanwhile, the number of elements for each cancellation row, and hence the number of elements in the weight vector, is Q which is given by: Q = NaxNt
(12.67)
In order to set up a system of equations with at least one solution, and possibly many from which a minimum norm solution can be chosen, Na and Nx must be selected such that: Na x Nx < Nc - L
(12.68)
where L is the number of system constraints, L > 1. Therefore, the number of spatial and temporal DoF for the forward or backward methods should be chosen such that: Na x Nx < 3 x (N - ATa) x (M - Nx) - L
(12.69)
The choice of N& and Nx for the forward-backward method is slightly different. For a system with N antenna elements and M coherent pulses the forward-backward method provides up to a 50 per cent larger product of degrees of freedom (PDoF) than either the forward or the backward method alone. In this case the choice of the spatial DoF, A^, and the temporal DoF, N[, can be chosen in the following manner. The number of elements in the weight vector is similar to that of equation (12.67), and is given by: Q' = N'a x N[
(12.70)
Likewise, the number of shifts in each dimension, Sa in the spatial and S[ in the temporal, is given by: Sa = K-N'a S[ = M-
(12.71) N[
(12.72)
Therefore the total number of possible box positions G' is given by: G' = Sa x St'
(12.73)
Since the data will be applied in both the forward and the reverse directions, the maximum number of cancellation equations possible for the forward-backward method is N'c, and is given by: N'c = 2x3xGf
= 6x(N-N'a)x(M-
N't)
(12.74)
Once again, in order to set up a system of equations with at least one solution, and possibly many solutions from which a minimum norm solution can be chosen, we must select N'a and N{ so that: K *N[
(12.75)
where L is the number of system constraints, L > 1. Therefore, the number of spatial and temporal DoF for the forward-backward method should be chosen such that: Nax
N{ <6x
(N - Na) x (M - N{) - L
(12.76)
12.6
An airborne radar example
12.6.1 Simulation setup The following example illustrates the utility of the D3LS space-time processing algorithms that have been presented in the previous sections for a real world scenario. We do not consider the stochastic algorithms in these cases because as it has been shown for the measured MCARM data sets [1,16] the performance of the direct data domain methods has superceded those of a stochastic-based methodology at a fraction of the computational cost. This scenario consists of a ground moving target indicator (GMTI) radar on board a medium sized unmanned air vehicle (UAV), such as the Predator shown in Figure 12.7 [22]. From a target detection point of view this type of platform is well suited for the GMTI mission due to its low velocity. The slow platform speed reduces clutter spectral spread, enhancing the system's ability to separate slow moving targets from the background clutter. On the other hand, the size and lift capability of the vehicle limits the radar system size and weight, providing a challenge to radar designers to not only improve performance versus weight of the hardware, but also that of the signal processing algorithms being implemented. Using the FAS website on the world wide web [22], the performance and dimensions of the airborne platform were obtained and from this an estimate was made of an aperture and radar system that might fit on such a UAV. Table 12.1 summarises the radar, platform, signal sources and geometric parameters for our sidelooking radar simulation. To simplify the implementation, the PRF was selected to keep clutter returns unambiguous. The clutter radar cross section, G0, in Table 12.1 was obtained from Figure 13.8 in Reference 23 for a side-looking radar, and applies to cultivated terrain at grazing angles at approximately 10 degrees for a horizontally polarised X-band radar. The size of the clutter area Ac can be estimated using: Ac = R • 0 3 d B • AR - secant (grz)
Figure 12.7 An example of the Predator UAV
(12.77)
Table 12.1
Parameters related to the simulation of a side-looking radar
Radar Frequency Wavelength Bandwidth Comp. pulse width Orientation Pulse rep. freq. Antenna dimensions Rx beam width, #3 ^B Doppler filter Range resolution
9.5GHz -0.0316 m 10 MHz 100 |xs side-looking 4 kHz 4' x Y ^ 0 . 1 rad ^ 6 deg ^444 Hz, 222 Hz 15m
Platform Altitude Radar velocity Target velocity Target RCS
17.4kft^5.3km 60 knts ^ 3 1 m/s 20 mph ^8.9 m/s 10 m 2
Signals SoI: SNR AoA Interferers discretes: Doppler
12.5 dB 65 deg
AoA Jammer: AoA Clutter: extent Nature of clutter: point scatterers
[400, - 8 0 0 , 1700, 1400, 325, -1650, 950, - 1 2 0 0 , - 1 2 5 , 145O]Hz [85, 120, 40, 35, 65, 120, 100, 65, 55, 125] deg 100 deg main beam separated by 0.06 deg in angle from each other
Geometry Range Grazing angle Clutter area (J0 Clutter RCS
30 km 10 deg 37 967 m 2 -26.2 dB 17.3 dBsm
where #3 $Q is the three dB beamwidth, AR the range resolution and grz the grazing angle. The radar cross section of the clutter, a c , can then be approximated by: Cf0 = Cr0- A
0
(12.78)
Using the parameters in Table 12.1, the computed clutter radar cross section (RCS) is 17.3 dB sm. The assumed target RCS is 10 m 2 , resulting in a per pulse signal-to-clutter ratio (SCR) of approximately —7 dB.
The received signals modelled in this simulation consist of the SoI, main beam clutter, multiple interferers, jammer and thermal noise. The clutter is modelled as point scatterers placed approximately every 0.06 degrees apart. The amplitude of the clutter points are modelled with a normal distribution about a mean that results in a signal-to-clutter ratio, SCR, of — 7.3 dB. Ten strong point scatterers are modelled in this simulation, based on the AoA and Doppler parameters in Table 12.1. Thermal noise generated in each receive channel is independent from channel to channel, and consists of a random phase and amplitude. This phase is uniformly distributed between 0 and lit. The resulting SNR is approximately 12.5 dB per receive channel per pulse. The jammer is modelled as a broadband noise signal that arrives from 100° in azimuth and covers all Doppler frequencies. Summing the power of the interfering sources received by the first channel and comparing it with the power of the SoI, the average input signal-to-interference-plus-noise ratio (SINR) is estimated to be — 32 dB. Since the algorithms developed in this chapter require a single space-time snapshot, only data from the range cell at 30 km, where the target resides, are generated. Given the size of the platform, and the cost and weight per channel, the number of receive channels is limited to ten. The cost of temporal DoF is not as expensive as spatial DoF. Initially, 16 coherent pulses are integrated and the processing performance is computed using the parameters in Table 12.1. Then the total number of DoF is enhanced by increasing the number of integrated coherent pulses, and the change in performance is noted. For the first case, N = 10 antennas and M = 16 pulses have been used in estimating the performance of the three D3LS algorithms, and is evaluated based upon the adaptive weight pattern and output SINR. The adaptive weight pattern is computed by taking the magnitude square of the discrete Fourier transform (DFT) of the resulting weight vector. In a similar manner, the output response pattern is computed using the weighted input data. For case I, only one constraint equation will be used to maintain gain on the SoI, and the steering vector (constraint) is aligned with the SoI vector. As will be seen from the results of case I, the forward and backward methods have very similar performance. Therefore, for the second case, N = 10 and M = 32, only the forward and forward-backward methods will be used. The three subcases that will be evaluated for case II are: (i) target aligned with the steering vector using a single constraint; (ii) target and steering vector misaligned using a single constraint; (iii) target and steering vector misaligned using multiple constraints.
12.6.2
Case I: single constraint space-time example
For the first case, Af = 10 and M = 16, the forward and backward methods utilise 7 spatial and 9 temporal DoF resulting in a product of the degrees of freedom (PDoF) of 63, and the forward-backward method employs 8 spatial and 9 temporal DoF, for a total of 72 PDoF. In this example we will look at the performance of each of the processors for the case where the SoI is aligned with the spatial and temporal steering vector, using just one constraint. The power spectrum of the input signals is shown in Figure 12.8. Here, the green circles indicate the locations of discretes, the green bar of circles is the main beam clutter and the blue circle locates the SoI. The
angle, deg
frequency, Hz
Figure 12.8
The power spectrum of the input signal2
adaptive weight and output patterns for the forward, backward and forward-backward methods are shown in Figures 12.9, 12.10, and 12.11, respectively. As seen in these three Figures, each of the processors performs well in generating nulls to mitigate discretes and the main beam clutter. The processors also place a deep null along the location of the jammer, at an AoA of 100°. While nulling these interferers, the system gain is maintained along the look direction, as can be seen by the strong response of the weight pattern centred on the blue circle. As mentioned above, the FW and BK methods employ 63 PDoF and the FB method utilises 72 PDoF. The improvement due to the increased PDoF is evident when comparing the weight patterns of Figures 12.9 and 12.10 with that of Figure 12.11. The forward-backward processor generates a deeper null, more in line with the clutter ridge, than does either the forward or backward processor. The performance of these processors can also be compared using the output signal-to-interference-plus-noise ratio (SINRoUt). We define the output signal-to-interference-plus-noise ratio SINRoUt as: (12.79) where A8 is the amplitude of the SoI and as is an estimate of the amplitude of the SoI. Using equation (12.79) for case I, the output SINR, averaged over 50 runs, is 40.5 dB 2
This figure is available in colour at www.ieee.org/mvieee/chl 2_figures
angle, deg angle, deg
frequency, Hz
frequency, Hz
Figure 12.9
Output of the forward method2 a adaptive weight pattern b output response pattern
angle, deg angle, deg
frequency, Hz
frequency, Hz
Figure 12.10
Output of the backward method2 a adaptive weight pattern b output response pattern
angle, deg angle, deg
frequency, Hz
frequency, Hz
Figure 12.11
Output from the forward-backward method2 a adaptive weight pattern b output response pattern
forthe forward method, 40.7 dB forthe backward method and 43.0 dB forthe forwardbackward method. As mentioned earlier in this chapter, and shown here by the output SINR and adaptive weight patterns (Figures 12.9 and 12.10), the performances of the FW and the BK methods are very similar. Therefore, in the next section, we will focus on the comparison between the FW and FB methods, with the understanding that the BK method yields results comparable with that of the FW method.
12.6.3
Case II: multiple constraint space-time example
In this case the temporal DoF is increased to M = 32 and the performances of the forward and forward-backward processors are evaluated for the misaligned look-direction condition using various (numbers of) constraints. Each additional look-direction/Doppler constraint replaces a cancellation row. Therefore, maintaining adequate interference nulling while increasing the number of constraints requires an increase in the PDoF. In the first subcase the look direction is misaligned with the SoI and only one constraint is utilised. The SoI is located at an AoA of 63° with a Doppler frequency of 1278 Hz, and the system constraint is set at an AoA of 65° and a Doppler of 1378Hz. The adaptive weight pattern is shown in Figure 12.12 for the forward processor, and in Figure 12.13 for the forward-backward processor. In both Figures, the cyan circle indicates the location of the constraint, the dark blue circle indicates the location of the SoI and the dark blue rings identify the location where additional constraints will be implemented in the following subcases. With only one constraint, both processors treat the SoI as an interferer and attempt to null it, even though the SoI is arriving in the main beam of the radar (both spatially and temporally). The resulting output SINR is —33.7 dB for the forward processor and —39.3 dB for the forwardbackward (FB) processor. Once again, the additional degrees of freedom enable the FB processor to do a slightly better job of nulling the interference, which in this case includes the SoI. However, the performance of either method is less than desirable. The next subcase involves the misalignment of the SoI with the look-direction constraint, along with four additional constraints. Results for this five-constraint case for the FW method are shown in Figure 12.14. Once again the constraint locations are identified by the cyan coloured circles, and the dark blue circle marks the location of the SoI. The four additional constraints help keep the forward processor from placing a null on the SoI, through the main beam. Yet, with only five constraints, one at the centre and the others at four corners of a rectangle in the main beampattern, the adapted main beam weight pattern is protected, although slightly eroded along one edge. The resulting output SINR is —1.8 dB, 32 dB better than the single constraint case. This situation can be improved further by applying additional constraints. The results from the misaligned case using nine constraints in the FW method are shown in Figure 12.15. The four additional constraints help to maintain a more uniform response across the main beam, which keeps the processor from degrading the weight pattern along the edges. The output SINR for this case is 1.2 dB, a 3 dB improvement over the five-constraint case, and a 35 dB improvement over the single constraint case. Similar trends were obtained for the FB processor when going from one constraint to five constraints to nine constraints.
angle, deg angle, deg
frequency, Hz
frequency, Hz
Figure 12.12
Results for the forward method, M = 32, steering vector misaligned, constraints2 = 1 a adaptive weight pattern b adaptive weight pattern, zoomed in
angle, deg angle, deg
frequency, Hz
frequency, Hz
Figure 12.13
Results for the forward-backward method, M = 32, steering vector misaligned, constraints1 = 1 a adaptive weight pattern b adaptive weight pattern, zoomed in
angle, deg angle, deg
frequency, Hz
frequency, Hz
Figure 12.14
Results for the forward method, M = 32, steering vector misaligned, number of constraints2 = 5 a adaptive weight pattern b adaptive weight pattern, zoomed in
angle, deg angle, deg
frequency, Hz
frequency, Hz
Figure 12.15
Results for the forward method, M = 32, steering vector misaligned number of constraints1 — 9 a adaptive weight pattern b adaptive weight pattern, zoomed in
12.7
Conclusions
In this chapter we have presented four deterministic direct data domain least-squares (D3LS) techniques for nulling interferers and extracting the signal of interest (SoI). The D 3 LS approaches generate adaptive weights on a snapshot-by-snapshot basis, eliminating the need for auxiliary training data sets. These techniques are capable of handling both coherent and non-coherent interferers, in a stationary or non-stationary environment. The four processors are the eigenvalue processor, and the three leastsquares methods: forward, backward and forward-backward procedures which may be implemented in real time on a signal processing chip. After describing the radar environment used to demonstrate these techniques, we developed the one-dimensional (space domain) implementation for each of the methods. We showed how to generate the system matrices using spatial samples and then solve for the adaptive weights, and estimate the amplitude of the desired signal. In order to maintain system gain of the SoI when the desired signal arrives slightly off the look direction, a method of setting multiple main beam constraints was presented. Next, we extended these techniques to the two-dimensional case (space-time domain) and identified how to select the proper weight dimensions to ensure a solution to the problem. Finally, an example related to a side-looking airborne radar is described. This example illustrates the challenges faced by an airborne radar and how, given the limited size and weight dimensions of a realistic system, the direct data domain leastsquares algorithms can assist in nulling interferers on a snapshot-by-snapshot basis. The greatest advantages of these procedures are that they yield four independent solutions using four different approaches. Hence, if the results between the methods are similar it provides greater confidence in the solution. The performance of the forward, backward and forward-backward methods has been evaluated using the output SINR and adaptive weight pattern for each of the methods. This has been done for both the SoI look-direction aligned and misaligned cases, as well as using various look-direction constraints (1,5 and 9) for both Doppler and angle of arrival. We showed that the D 3 LS algorithms perform extremely well for the aligned case. For the misaligned case it has been shown that unless multiple constraints are applied, the processors will treat the SoI as interference and attempt to null it. Increasing the number of constraints improved the processor's ability to maintain the system gain along the main beam, thereby improving the output SINR.
12.8 Ac a B as as (ib
List of variables clutter area (m2) vector containing amplitudes of coherent interferers system matrix of backward method complex amplitude of the SoI estimate of the as estimate of the SoI from the backward method
b C c d f § /r F FB /d G grz K K' L X M TV Na Nfa TVL TV0Ut Nt Nl T] p Amdesired Q fft ?k R r AR t tn O 0S S S' S" s O0 v
scale factor used in CGM arbitrary constant matrix of clutter, interference and thermal noise samples spacing between elements/channels order of the covariance matrix effective number of bits in data pulse repetition frequency system matrix for the forward method system matrix for the forward-backward method Doppler frequency of SoI number of data subarray positions grazing angle number of received signals, DoF related to one-dimensional FW and BK methods DoF for the one-dimensional FB method number of constraints wavelength (m) number of coherent pulses number of antenna elements number of spatial DoF - FW and BK methods number of spatial DoF - FB method maximum number of cancellation rows output interference plus noise number of temporal DoF - FW and BK methods number of temporal DoF - FB method number of range samples direction vectors used in C G M power of output interference and noise matrix of received samples with the SoI removed covariance matrix estimate of the covariance matrix number of range bins residual vector in C G M range resolution (m) time scale factor used in C G M angle of arrival of the various signals A o A of SoI matrix of samples of the SoI with unity amplitude matrix S with multiple spatial constraints matrix S with multiple space-time constraints vector containing snapshot of SoI clutter R C S ( m 2 ) steering vector
w Wb Wfb X x xr y yb yfb Z Z\ Z2 Z/i Z/2
weight vector - F W method weight vector - B K method weight vector - FB method matrix of snapshot samples containing all received signals snapshot of the voltages containing all components of the received signals snapshot of the received signals from rth range ring forcing function - F W method forcing function - B K method forcing function - FB method element to element phase shift due to SoI - one-dimensional method SoI spatial factor SoI temporal factor spatial factor due to /th constraint temporal factor due to /th constraint
References 1 SARKAR, T. K., WICKS, M. C , SALAZAR-PALMA, M., and BONNEAU, R.: 'Smart antennas' (John Wiley & Sons, New York, NY, 2003) 2 KLEMM, R.: 'Adaptive clutter suppression for airborne phased array radar', IEE Proc. F, Commun. Radar Signal Process., 1983,130, pp. 125-131 3 WARD, J.: 'Space time adaptive processing of airborne radar'. Lincoln Laboratory, Lexington, MA, Tech Report 1015, December 1994 4 ADVE, R. S. and SARKAR, T. K.: 'Compensation for the effects of mutual coupling on direct data domain adaptive algorithm', IEEE Trans. Antennas Propag., January 2000, 48, (1), pp. 86-94 5 APPLEBAUM, S. P.: 'Adaptive arrays'. Syracuse University Research Corp., dept SPL-769, June 1964 6 WIDROW, B., et al.: 'Adaptive antenna systems', Proc. IEEE, December 1967, 55, pp. 2143-2159 7 BRENNAN, L. E. and REED, L. S.: 'Theory of Adaptive Radar', IEEE Trans. Aerosp. Electron. Syst, March 1973, 9, pp. 237-252 8 REED, L. S., MALLETT, J. D., and BRENNAN, L. E.: 'Rapid convergence rate in adaptive arrays', IEEE Trans. Aerosp. Electron. Syst., November 1974, 10, pp. 853-863 9 WANG, H. and CAI, L.: 'On adaptive spatial-temporal processing for airborne surveillance radar systems', IEEE Trans. Aerosp. Electron. Syst., July 1994, 30, pp. 660-669 10 LUTHRA, A.: 'A solution to the adaptive nulling problem with a look-direction constraint in the presence of coherent jammers', IEEE Trans. Antennas Propag., May 1986, 34, pp. 702-710 11 SARKAR, T. K. and SANGRUJI, N.: 'An adaptive nulling system for a narrowband signal with a look direction constraint utilizing the conjugate gradient method', IEEE Trans. Antennas Propag, July 1989, 37, pp. 940-944
12 SCHNEIBLE, R.: 'A least squares approach for radar array adaptive nulling'. Doctoral dissertation, Syracuse University, May 1996 13 PARK, S.: 'Estimation of space-time parameters in non-homogeneous environments'. Doctoral dissertation, Syracuse University, August 2000 14 SARKAR, T. K., KOH, J., and ADVE, R., et al.: 'A pragmatic approach to adaptive antennas', IEEE Antennas Propag. Mag., April 2000, 42, (2), pp. 39-55 15 SARKAR, T. K., PARK, S., KOH, J., and SCHNEIBLE, R. A.: 4A deterministic least squares approach to adaptive antennas', Digit. Signal Process., 1996, 6, pp. 185-194 16 SARKAR, T. K., WANG, H., and PARK, S., et al.: 'A deterministic least squares approach to space time adaptive processing (STAP)', IEEE Trans. Antennas Propag., January 2001, 49, pp. 91-103 17 BROWN, R. and SARKAR, T. K.: 'Real time deconvolution utilizing the fast Fourier transform and the conjugate gradient method'. Presented at the 5th ASSP workshop on Spectral estimation and modelling, Rochester, NY, 1980 18 SARKAR, T. K.: 'Application of the conjugate gradient method to electromagnetics and signal analysis (Elsevier, Mass., 1991) 19 HAYKIN, S.: 'Adaptive filter theory', (Prentice-Hall, Englewood Cliffs, NJ, 1996, 3rd edn.) 20 JOHNSON, D. H. and DUDGEON, D. E.: 'Array signal processing' (PrenticeHall, Englewood Cliffs, NJ, 1993) 21 PARK, S. and SARKAR, T. K.: 'Prevention of signal cancellation in an adaptive nulling problem', Digit. Signal Process., 1998, 8, pp. 95-102 22 FAS Intelligence Programs website, www.fas.org/irp/program/collect/predator. html, created by John Pike, maintained by Steven Aftergood, updated Nov 6,2002 23 SKOLNIK, M. L: 'Introduction to radar systems' (McGraw-Hill Publishing, New York, NY, 1980)
Chapter 13
Robust techniques in space-time adaptive processing Keith E McDonald1 and Rick S. Blum2
13.1
Introduction
The function of most radar systems is to supply data about the presence and position of targets of interest [I]. A typical radar system emits an electromagnetic pulse and analyses the reflections of the transmitted energy. Ideally, the radar system will detect returns from range cells that contain targets and will reject returns from range cells that contain only background energy. A range cell represents a small block of physical space that is a certain distance (range) from the receiver. The radar checks for a target in each range cell of interest, typically referred to as a test cell. Range cells other than the test cell are called reference cells. The target signal competes with background energy that is potentially composed of a combination of thermal noise, clutter and jamming. Thermal noise is caused by the thermal agitation of electrons and is present in all real receivers. In this chapter, we consider radar arrays in which each antenna element has its own receiver. Therefore, the thermal noise at each antenna element is considered to be independent of the thermal noise at the other elements. Radar clutter is generically defined as the echoes from scatterers that are not deemed to be of tactical significance [115]. For airborne surveillance radar, the ground is the major source of clutter. Ground clutter is distributed in both angle and range. It is also spread in Doppler frequency due to the motion of the array platform. In airborne radars, ground clutter would ideally lie along a line in angle-Doppler
1
Keith McDonald's portion of this Chapter was funded by the MITRE Sponsored Research Task 51CCG-930-W3 (Sensor Technologies) Rick Blum's portion of this Chapter was partially supported by the Air Force Research Laboratory under Agreement No. F49620-03-1-0214
space. This spread occurs because the Doppler frequency of a reflection from a small patch of ground is proportional to the projection of the airplane velocity along the unit direction vector between the airplane and the small patch of ground [115]. Due to system and algorithm design imperfections, the line is typically widened into a structure called the clutter ridge. Jamming refers to the use of a signal to fool the radar or to intentionally limit its successful operation. Jamming has traditionally been associated with military scenarios, but the increase in public air traffic and the threat of terrorist activity has made jamming an important issue for commercial applications as well.
13.1.1 Initial development of space-time adaptive processing (STAP) algorithms In the detection process, a target return from the test cell competes with the thermal noise, jamming and clutter returns. Therefore, large amounts of energy from these components can significantly lower the probability of detection and/or raise the probability of false alarm. In order to mitigate the detrimental effects of this energy on the receiver, space-time adaptive processing (STAP) algorithms can be employed. The design and analysis of STAP algorithms for target detection have received significant attention from the radar and signal processing communities. STAP algorithms utilise the multiple receiving elements ('space') of a radar antenna array and multiple pulses ('time') to provide filtering of this energy in both the spatial and temporal domains [115]. 'Adaptive' refers to the fact that the filter weights are computed using information from the particular radar environment. The negative effects of noise (unless explicitly stated, hereafter noise is defined as clutter plus jamming plus thermal receiver noise) in the test cell are minimised through the calculation and subsequent application of a two-dimensional adaptive filter in the signal processing chain. Figure 13.1 provides visualisation of a typical STAP technique improving radar performance. Figure 13.1a depicts the signal energy received by a conventional radar that uses Doppler frequency selectivity. The dashed lines represent the filtering process. The clutter lies along a line, often called a ridge, with a larger amplitude produced by the mainlobe of the antenna pattern. As shown in Figure 13.1a, although the clutter from the mainlobe of the antenna response is cancelled via one-dimensional filtering, the sidelobe clutter remains. By allowing an extension of conventional radar processing to two dimensions, a STAP algorithm can produce a null along the entire clutter ridge, as shown in Figure 13.1b, if the clutter ridge location is exactly known. This two-dimensional filtering will improve the system's performance. Figure 13.1c shows a detection environment in which the radar reception is negatively impacted by a jammer. One common type ofjamming resembles thermal noise temporally, but looks like a point target in the spatial domain. In Figure 13.1 c, an attempt to filter out the jamming results in the filtering of the target response as well. Although still allowing the reception of the target signal, STAP provides an additional filtering dimension, as shown in Figure 13. Id, so that a properly placed null can limit the effects of jamming. This type of processing increases the probability of target detection in a clutter plus jamming environment.
mainlobe of antenna response
clutter
sidelobes of antenna response signal
clutter
signal
target spatial frequency
spatial frequency
jamming
signal
target
signal target
spatial frequency
Figure 13.1
spatial frequency
Conventional filtering versus STAP (adaptedfrom [204])
Much of the initial interest in adaptive processing is generated by the work presented in References 2 and 3. The automatic target detection processing proposed in References 2 and 3 attempts to minimise the power of the unwanted interference by maximising the signal-to-noise ratio (SNR) of the received process. The work in References 4, 5, 14 and 16 builds on the efforts of References 2 and 3 to design an automatic detection scheme, often called a test, with a constant false alarm rate (CFAR) characteristic known as the adaptive matched filter (AMF) test [4,5] or as the modified sample matrix inversion (SMI) algorithm [14,16]. Following References 2 and 3, the next most influential STAP research is likely Reference 6, where an alternative approach to that of References 2 and 3 is proposed to determine a CFAR processing scheme for the scenario in which the covariance matrix of the test cell is unknown. The approach in Reference 6 assumes that the statistics of the test cell data are the same as those of the data from the surrounding range cells and involves the derivation of a likelihood ratio test using maximum likelihood estimates. This particular methodology is termed the generalised likelihood ratio test (GLRT). Additional work concerning the GLRT test is presented in Reference 7 as well as in a number of recent papers including References 14 and 15. It is interesting to note that the matched filter discussed in References 2 and 3 appears as a portion of the GLRT test statistic, and that for large amounts of training data these two approaches are identical [15]. Recent work on adaptive algorithm design includes the introduction and development of the adaptive coherence estimator (ACE) test [188,178,212,213]. Expressions for the probability of false alarm and detection of the test are subsequently determined in Reference 213. The ACE test is especially attractive since it can be justified using the rigorous theory of invariant tests [223,219]. It is worth noting that the theory of invariant tests has been used to study STAP algorithms in the past [226,227] and, in fact, Reference 223 builds on the work in References 226 and 227 by adding an additional invariance to scale. Further, the relationship between the ACE and the
GLRT is carefully described in Reference 215. To summarise, the ACE test has been shown to be a generalised likelihood detector and has a derivation similar to that of Kelly's GLRT. However, the hypothesis testing procedure assumes that the scaling of the covariance matrix of the training data, relative to that of the test cell covariance matrix, is unknown [214,215]. Of course, this approximation will not necessarily agree with the true situation. There can be significantly more or less differences between the training and test cell data. A similar assumption, invariance to scale, leads to the invariance results in Reference 215, and thus, similar concerns can be raised. However, in general, the invariance properties and the relationship to the GLRT provide analytical justification that the ACE algorithm will work well for cases with slightly imperfect estimates, cases with limited training data and some cases with non-stationary training data. This support of the robustness of the ACE algorithm stimulates future investigations. Further, in a channel with time-varying gain or non-Gaussian statistics, the ACE algorithm has been shown to provide better performance than algorithms that do not possess the invariance properties [218]. The performance analysis of a set of adaptive matched subspace detectors, of which the ACE test is one, is discussed in Reference 216. A review of matched and adaptive detectors is supplemented by the work in Reference 217, and their relationship to other algorithms such as the GLRT and AMF is presented in Reference 218. Finally, the work presented in Reference 9 considers a large class of adaptive detection algorithms, of which the GLRT and the AMF are special cases. Furthermore, the ACE is a limiting case. Further work concerning the theory of invariance and its application to the hypothesis testing problem for adaptive arrays is given in References 226, 227,231 and 232. The adaptive array problem involves the search for a detection statistic that is complicated by the presence of the noise covariance matrix, a high-dimensional nuisance parameter. The authors discuss that the search for the detection statistic should be conducted over the set of detectors that satisfies the invariance criterion, and thus, one must only search for the detector with an optimality property over this set of detectors. A test that is invariant to the nuisance parameter (the unknown covariance matrix), since it is irrelevant to the decision, would thus be beneficial. Invariance tests do not distinguish between scenarios differing in their nuisance parameters. Two different cases are considered: one in which the noise covariance matrix is totally unknown and another in which a priori information about its structure exists. This invariance approach reduces the problem size since all invariance test statistics can be expressed in terms of a function (possibly vector-valued but having much fewer dimensions than the data) of the data called the maximal invariant. The distribution of the maximal invariant is parameterised by another low-dimensional function on the parameter space, and thus most of the nuisance parameters are removed. The use of the theory of invariance has also been implemented to validate the spatial stationarity assumption of the data in References 228 to 230. Invariance principles are used to ensure that the test has constant significance when the model is true by eliminating the dependence on the covariance matrix. The test decision depends only on the covariance matrix structure rather than on the actual covariance matrix.
13.1.2 Hypothesis testing problem To understand the target detection problem, it is essential to describe the method used to gather data at the receiver. Consider a uniform linear array composed of Af identical elements. The radar transmits a coherent burst of M pulses per coherent processing interval (CPI) at a constant pulse repetition frequency (PRF) /PR = l/T.A CPI is the time over which the waveforms are collected, and T is the pulse repetition interval (PRI). Thus, a CPI lasts MT time units. For each pulse transmitted, L samples are collected. Each sample corresponds to a return from a specific range from the receiver. The sampling scheme is shown in Figure 13.2a. With N receiver channels and M pulses, the number of received data samples per CPI is LMN. It is helpful to visualise the data set for a CPI as the data cube depicted in Figure 13.2b. The test cell data corresponds to one particular range cell and is illustrated by the shaded area of the data cube. Denote the observation corresponding to the nth receiving element and the mth pulse at the €th range sample (the range being examined) as 8n,m,n — 1 , . . . , N and m = 1,...,M. Let xm ^ e m e NxI vector of antenna element outputs for pulse m, so that / m = (<$i,m,..., &N,m)T? where T is the transpose operator. xm ^s called a spatial snapshot and corresponds to the data from
pulse 1
pulse 2
pulse 3
pulse M antenna 1
range sample 1 range sample 2 range sample L pulse 1
pulse 2
pulse 3
pulse M antenna 2
pulse 1
pulse 2
pulse 3
pulse M antenna TV
antenna element
MN samples for cell under test data .
Figure 13.2 a Data to be processed (adapted from [42]) b CPI data cube
the range cell under consideration and the rath pulse. The portion of the data cube that represents the Uh range cell is the MN x 1 vector Jt, where x is defined by stacking the spatial snapshot vectors as x = (/i, • . . , XM) T Now consider the use of Jt, called the space-time snapshot, to determine the presence or absence of a target in a particular range cell. The objective of the hypothesis testing problem is to determine which of the following is true for the range cell under consideration: x =n HQ: target absent x = KS + n H\: target present where n encompasses all of the noise contained in the test cell. The signal which can be added to the noise is o , where s is a unit length vector that is completely known (referred to as the signal vector) and /c is an unknown complex constant.
13.2
Real-world detection environments
In most of the adaptive radar detection algorithms, the filter weights are computed via a calculation that attempts to approximate a maximum likelihood estimate of the noise covariance matrix. However, the covariance matrix of the noise in the test cell is usually characterised using samples from the neighbouring reference cells. The set of vectors composed of the test cell and all of the reference cells is typically assumed to be an independent and identically distributed (HD) set of vectors. The noise in both the test and reference cells is usually assumed to have been generated from the same Gaussian distribution. The estimated covariance matrix generated from the reference cells is then used in the signal detection procedure. If the estimated covariance matrix accurately represents the covariance matrix of the test cell, the adaptive algorithm will usually perform well. Unfortunately, the non-ideal conditions of real-world detection environments can preclude the performance described in much of the theoretical STAP literature. Mismatches between the clutter statistics of the test cell and those used to design the adaptive processing scheme present one such difficulty that can result from these circumstances. This type of non-homogeneity occurs when the reference data statistically departs from the test cell data in the structure of its covariance matrix, thus violating the HD assumption critical to statistical estimation of the test cell covariance matrix [33]. Mismatches of this type have been observed in measured airborne radar data and can be intuitively justified. In airborne radar, the ground clutter in the reference cells is produced by reflections from portions of the ground different from those that produce the ground clutter in the test cell. Due to variations in terrain, weather and other factors (see discussion in the next section), the statistics of the ground clutter returns in the test cell can differ significantly from those of the reference cells. As an example, the United States Air Force Rome Laboratory has collected data under the multichannel airborne radar measurement (MCARM) programme in collaboration with Westinghouse Electric Corporation. The MCARM data has been shown to be non-homogeneous in range [33,35-38,42,43,45-49].
jammer
#-steering vector
jammer null
target s-signal (target) vector
clutter null
Figure 13.3
Steering vector mismatch (adapted from [115])
An additional difficulty with real-world covariance matrix estimation is a violation of the assumption that the reference cells are free of targets. In much of the literature, the target signal is assumed to be confined to the test cell so that the reference cells are free of signal components. Under practical circumstances, the assumption of targetfree reference data can be invalid [11,8,172-176,45^8]. For example, consider several targets with similar velocities that are spatially distributed in range. This configuration yields similar signals in both the test and reference cells. The use of a high range resolution (HRR) radar can produce scenarios such that even a single target can occupy tens of range cells3 [H]. When the reference cells contain signals, traditionally known as signal contamination, the estimate of the co variance matrix of the noise in the test cell can become inaccurate. Therefore, the performance of STAP algorithms can degrade significantly [126,45-48]. Steering vector mismatch is another type of mismatch that affects performance. It occurs when the radar antenna array forms a beam that is not pointed in the exact direction desired. The misalignment can also be caused by aircraft crabbing as well as antenna calibration errors. Certain algorithms can be less sensitive to steering vector mismatch than others [195-199,10,9]. Depending on the algorithm used, steering vector mismatch can have a negative impact on the radar signal processing. Steering vector mismatch is depicted in Figure 13.3.
3 The authors acknowledge that the regular resolution case is currently of more practical interest. Presently, the HRR radar mode is not used for target detection in any operational platforms known to the authors. However, with increases in observational resources and computational speed, it is feasible that in the future such a capability will exist
Jamming and the subsequent multipath signals that result from jamming (also known in the literature as hot clutter or terrain scattered interference (TSI)) are a major consideration, especially in hostile detection environments. Typically, the mitigation of a direct path jammer requires the application of only a one-dimensional spatial filter as depicted in Figure 13.Id. However, the jammer signal can reflect off of different terrain surfaces. Consequently, time-delayed versions of the signal with different angles of arrival are processed at the receiver. This type of interference requires a two-dimensional filtering procedure to reduce negative effects. Unfortunately, the filtering coefficients are significantly different from those required to cancel the ground clutter (also known as cold clutter in the TSI problem formulation) [206-208,141-143,77]. Even under ideal conditions of covariance matrix and steering vector match, the application of two-dimensional filters (or in the case of TSI, potentially two stages of two-dimensional filters or even three-dimensional filters) often presents a daunting challenge to the real-time computing community. The computational complexity necessary to conduct real-time adaptive processing quickly becomes challenging as the dimensionality of the filtering problem increases. Therefore, many reduced processing techniques (sometimes called partially-adaptive) have been introduced recently and evaluated under ideal detection conditions [201,58-61,211]. Under the conditions of covariance matrix and steering vector mismatch, many fully adaptive and partially adaptive STAP tests are not optimal and can suffer losses in performance. If not factored into the design of the algorithm, TSI presents additional challenges. Computational demands are often difficult to meet for the desired real-time target detection problem. In this chapter, the real-world difficulties that result in suboptimal STAP performance are analysed. Then, the evaluation of STAP performance under these non-ideal conditions is discussed and algorithm robustness is studied. We review the development of novel techniques that attempt to either circumvent or minimise the deleterious effects of non-homogeneous clutter, signal contamination, steering vector mismatch and TSI. Some of these algorithms provide an additional benefit of reduced computational complexity. We conclude that, for the case of detecting a target embedded in non-homogeneous clutter in the presence of antenna array errors and hostile jamming, the proper design of the adaptive signal processing algorithm, accounting for the unique detection environment and utilising intelligent exploitation of a variety of resources, can lead to significant performance enhancement over traditional methods. We recognise that due to the vast amount of literature on this topic, a fully comprehensive review and analysis is beyond the scope of a single chapter. Therefore, we summarise the efforts of major contributors with the hopes of introducing the reader to the significant topics of research as well as providing references for further investigation.
13.3
Non-homogeneity - causes and impact on performance
As discussed briefly in the introduction, the performance of most STAP algorithms is linked to the accuracy of the estimated covariance matrix of the noise in the test cell.
Clutter non-homogeneity leads to mismatch in structure, in terms of both amplitude and spectral characteristics, between the estimated and actual covariance matrices of the test cell data [34]. The non-homogeneous component causes a shift in the test statistic from the homogeneous case. Therefore, the filtering implemented by the adaptive processing suppresses the homogeneous component of the clutter, while passing the non-homogeneous components which are not nulled by the filtering [35,36]. It is precisely the non-homogeneous elements that impact the effort to discern between target and noise. Many different factors affect the covariance matrix estimation procedure and cause violation of the HD assumption needed for true maximum likelihood estimation of the test cell covariance matrix [34-36,38]. Therefore, Reference 38 divides the non-homogeneous causes into several categories. One category, amplitude nonhomogeneity, results from spatially varying clutter reflectivity, shadowing and clutter edges. Spatial variation of the clutter is caused by factors that include strong discretes such as point reflectors. Shadowing generally occurs due to terrain obscuration. Interference edges can yield a mix of reference cells, some with the same covariance matrix as the test cell, while others have a very different covariance matrix. Examples of clutter edges include littoral or rural-urban interfaces, or the boundary area of farmland and wooded hills [38]. As expected, performance degrades as the number of reference cells with a covariance matrix different from the test cell increases [12]. Depending on the test cell covariance matrix and the dominant reference cell type, the edge can produce an excessive false alarm rate, severe signal masking and inadequately nulled clutter. Various studies on the performance of STAP algorithms in the presence of clutter edges have been conducted. The work in Reference 14 discusses the performance of the AMF and GLRT. The AMF and GLRT are represented by the following test statistics: (13.2)
(13.3) where x is the test cell space-time snapshot, s is the steering vector, K is the number of reference cells used to calculate the covariance matrix estimate, t\ and T2 and are predefined thresholds and Qe is the estimated covariance matrix: (13.4) where the JC/ vectors are the reference cell space-time snapshots. Due to the additional term in the denominator that compensates the processor, the GLRT has been shown to be more robust in clutter edge environments than the AMF [14]. The amount of degradation is related to the clutter spectrum spread and the distance between the signal and clutter in the Doppler domain [15].
Spectral non-homogeneity is another category of non-homogeneity outlined in Reference 38. Spectral non-homogeneity stems from a host of effects jointly characterised as variable intrinsic clutter motion (ICM), which is caused by such factors as sea state or foliage moving in the wind. Variable velocity of the clutter causes a widening of the clutter ridge due to temporal decorrelation of clutter echoes, thus broadening the clutter spectrum [50]. The pulse-to-pulse decorrelation of the clutter alters the STAP covariance matrix [148]. Due to its similar impact, here we also discuss the effects of crabbing. Aircraft crabbing, which is airplane movement orthogonal to the antenna array, causes the clutter to spread off the diagonal line onto a curve, thus resulting in suboptimal cancellation [203,204]. Typically, the noise covariance matrix is represented by some large eigenvalues corresponding to the clutter/jamming and the remaining smaller eigenvalues corresponding to the thermal noise. The effects of ICM and crabbing cause an increase in the number of larger eigenvalues (the rank of the clutter/jamming covariance matrix). This might, for example, hinder the performance of reduced-rank STAP methods by causing them to inaccurately estimate rank. ICM and crabbing can also yield an adaptive filter response which produces uncancelled clutter or unintended signal cancellation. Under either scenario, the clutter ridge expansion causes degraded minimum detectable velocity (MDV) of targets. A distorted beampattern and high azimuth-Doppler sidelobes also result. Methods to raise the noise level to eliminate pattern distortion have been investigated [148]. The effects of ICM caused by wind-induced motion of trees and vegetation are studied in References 191, 192 and 194. This work conducts a statistical analysis on measured clutter data to consider appropriate modelling and then discusses detector performance. Errors caused by range walk, due to the platform motion during the interpulse sampling, can cause temporal decorrelation of space-time clutter echoes similar to that caused by ICM. Techniques exist to correct for difficulties caused by range walk [56]. Near-field scattering smears the clutter return off of the diagonal ridge into the entire azimuth-Doppler plane. Near-field obstacles include airplane structures, such as the wing. Consequently, the clutter cancellation in the presence of near-field obstacles will require more degrees of space-time freedom than are necessary in the absence of such obstacles. Increasing the number of elements to give more spatial degrees of freedom helps to alleviate this difficulty, since more spatial nulling can be achieved [202]. Increasing the number of temporal degrees of freedom is less effective in this case. A mathematical analysis of the effects of ICM, crabbing, near field obstacles and steering vector mismatch on the eigenvalues of the noise covariance matrix is given in Reference 204. Another type of non-homogeneity is clutter-to-thermal-noise ratio (CNR)dependent spectral mismatch. The non-homogeneity is caused by the CNR influence on spectral spreading mechanisms (timing jitter, scintillation, array errors and other decorrelation effects). The result is either an adaptive filter pattern that does not adequately encompass the primary data spectral width causing uncancelled clutter, or an overtly wide filter notch causing overnulling and a reduction in MDV The rank of the clutter covariance matrix tends to expand with greater CNR [38].
A violation of the homogeneous data assumption is caused by moving scatterers or multiple interfering targets, such as vehicles, aircraft, weather or chaff resident in either the primary or the secondary data. These scatterers cause angle-Doppler responses that appear spectrally distinct from the ground clutter power spectral density. The effects of moving interferers have been seen in several different multichannel airborne radar data collections. For example, it is difficult to avoid vehicles in the reference data set while surveying near a network of roads and highways. These non-stationary discretes can cause signal cancellation, distorted beam patterns and exhaustion of the degrees of freedom necessary to successfully perform clutter cancellation [38]. Non-stationarity may be due to radar geometry causing a range variation in clutter characteristics, and therefore has a certain predictability. Any configuration, with the exception of a side-looking array, has range-dependent Doppler. The effects of nonstationarity can be mitigated by reduced-rank methods, a range-dependent taper to adjust for the geometry or a covariance matrix taper. The work in Reference 38 illustrates the losses caused by each type of non-homogeneity.
13.3.1 Signal contamination Investigation of non-homogeneity due to signal contamination has seen significant growth in recent years, and warrants further attention. It has been observed that when the reference cells contain signals, the estimate of the noise in the test cell becomes inaccurate. If the false target has a stronger power than the primary target, the classical covariance matrix-based STAP approaches can attempt to cancel the primary target in favour of the false target [158]. Targets in the reference data cause the performance of many STAP algorithms to degrade significantly [126,199,172-176,158]. Therefore, robustness of traditional STAP algorithms in signal-contaminated environments is of particular interest, as are new methodologies used to mitigate the problem. The strength and direction of the contaminating signal play a major role in determining its effect on performance. As the signal strength increases, performance degrades substantially [12]. In Reference 126, the authors illustrate the deleterious effects of signal contamination on two detection schemes. Their work shows that performance can be significantly degraded when a strong signal contaminates only one reference cell. The impact of the contamination also depends critically on the angle between the desired and contaminating signals. Contamination close to the clutter frequency has much less of an effect on performance because it falls into the clutter notch. If the target and contaminating signals are orthogonal, the contamination has no influence. However, as the contaminating Doppler frequency approaches that of the desired signal (the angle and velocity difference between the desired and contaminated signal becomes smaller), the algorithmic performance decays further. These moving discretes can degrade the signal processing significantly, provided that the direction of arrival is within, or near, the 3 dB beamwidth of the mainbeam of the array [34,47,48]. Of course, as the number of reference cells increases, the effects of
distributed target STAP
traditional STAP test cell
antenna element
data cube
PRI
data cube antenna element
test cell
guard band cells reference cells
PRI
guard band cells reference cells
Figure 13.4 a traditional target detection b distributed target detection a contaminating signal can become averaged out. However, this technique increases the probability of yet another contaminating signal entering the estimate [126,14,15]. In an attempt to reduce the chance of signal contamination, some processing schemes omit the range cells adjacent to the test cell (known as guard bands) from the covariance matrix estimate (see Figure 13.4). However, the possibility of an inaccurate covariance matrix estimate increases as reference cells farther from the test cell are used in the estimation procedure. Additionally, implementing guard bands can result in the loss of potentially valuable signal information. In general, there are two approaches to combat the signal contamination problem. One attempt is to remove the signal contamination and present the processor with a more homogeneous data set, which is the topic of the next section. An alternative approach is to recognise that some of the signal contamination can be used to improve detection performance. This increased performance can occur if the goal of the system is to track a group of targets such as a convoy, or if the radar resolution is such that a single target is spread across several different range cells. We denote occurrences of signal contamination in range cells adjacent to, or within a few range cells of, the test cell as distributed target scenarios. Signal contamination by distributed targets occurs while attempting to detect targets in individual range cells as shown in Figure 13.4a. The alternative approach is to attempt to detect the presence of the target within an input data block (a set of test cells). This alternative technique is depicted in Figure 13.4b. The approach produces an effective integration gain of target returns that enhances the target set's detectability [127]. Hence, a presumed source of performance degradation (signal contamination) can actually be a source of performance enhancement. The penalty for grouping test cells into blocks is a loss in radar range resolution. STAP algorithms designed specifically for distributed target detection resembling the AMF and the GLRT, provided in the seminal work of Reference 8 and revisited in References 172 and 173, have attracted interest. The original STAP algorithms for
distributed target detection make the assumption of homogeneous range cell data in both algorithm design and performance analysis. Research on this topic is conducted in Reference 128, in which a detector called the modified generalised likelihood ratio test (MGLRT) is derived. Examples are provided that illustrate that the MGLRT can perform better than the GLRT. Unfortunately, the MGLRT does not have the desired CFAR characteristic exhibited by the GLRT. To address this concern, further work on the MGLRT is conducted in Reference 131, in which knowledge of the thermal noise is assumed to be known a priori. The approach results in a form that exhibits bounded CFAR characteristics and shows acceptable performance for range-distributed, Doppler-shifted targets. As previously discussed, the homogeneous data assumption can cause a system to perform poorly under real-world conditions. Adaptive detection for range distributed targets in partially homogeneous environments, in which the noise covariance matrices of the test and reference cells differ by a scale factor, is discussed in References 174 to 176. The derivation of the probability of detection and false alarm equations in References 172 to 176 involves conditional probability distribution functions (pdfs). A limiting aspect of these approaches stems from the fact that the conditioning is removed through a numerical expectation via a Monte Carlo technique rather than through analytical means. In many real-world scenarios, mismatches between the statistics of the test cells and those used to design the adaptive processing scheme are more complex than simple scalar differences. Complex mismatches have been observed in measured airborne radar data. Consequently, Reference 199 presents closed-form probability of false alarm and detection equations for an adaptive detection algorithm based on the AMF, for distributed target detection. The work in Reference 199 considers algorithm robustness under the real-world conditions of steering vector and covariance matrix mismatch and does not require the use of Monte Carlo techniques. Detection of distributed targets in noise environments that have non-Gaussian statistics is investigated in References 177 and 134. Such tools as those provided in References 6, 172 to 176 and 199 can prove valuable in discussing the robustness of algorithms in signal-contaminated, ideal/non-ideal detection conditions.
13.3.2 Non-homogeneity detection Although clutter non-homogeneity causes suboptimum performance by a STAP processor, this loss may be mitigated through intelligent approaches to covariance matrix estimation [38]. As discussed previously, individual targets in the reference data can significantly degrade performance. Two effects of the presence of targets in the STAP training set are a potentially significant target self-nulling and an overall degradation in clutter cancellation performance [114]. Outlier data vectors whose signal vector is highly correlated with the desired steering vector have the most impact [133]. The removal of these cells can provide substantial overall detection improvement [47,48,158]. One such removal approach is called non-homogeneity detection. In general, a non-homogeneity detector attempts to excise the reference cells that are least homogeneous from the group of cells to be used in the estimate
of the test cell noise covariance matrix. Frequently, the non-homogeneity detection schemes assume that the homogeneous clutter comprises the majority of cells to be tested. Computing the covariance matrix of the test cell from the identified 'most homogeneous' secondary data range cells ensures acceptable performance for the majority of the test cells [35,38]. The literature illustrates that proper identification and excision of non-homogeneous data lead to improved covariance estimation and a corresponding improvement in STAP detection. Some efforts have attempted to utilise a power-based technique to assess non-homogeneity through a calculation of the inner product of the return from each cell. It is assumed that the reference cells with the largest power are signal contaminated, and that their removal will leave the processor with a remaining set of homogeneous reference cells. Rather than removing those cells with large amounts of power, other approaches choose to estimate the test cell noise covariance matrix with the reference cells having the largest power. The efforts in References 91 and 94 illustrate such a technique called power-selected training (PST). The methodology produces a deeper filter notch than if the covariance matrix was formed by utilising the entire set of reference cells, thus ensuring that all of the clutter ridge is nulled. The work in Reference 114 extends PST to a more complete non-homogeneity detector in which both phase and power-selection criteria are utilised. The method first uses a phase-based criteria to remove strong moving targets from the training data. For a given Doppler, clutter comes from a common angle for all range cells over which STAP weights are formed. Thus, any significant deviation in angle indicates that the cell contains non-clutter returns. If the measured phase differs significantly from the expected phase of the clutter for a given Doppler bin, the cell is excluded. Subsequently, the reference data are put through PST to train on the clutter discretes. By using the strongest returns for each Doppler bin for training, STAP weights are based on the strongest clutter. Increased false alarms may result from target signals located in the sidelobes of the antenna beampattern. These extra target signals can be due to multiple detections of a single target, non-stationary jamming, etc. A technique has been presented for editing detections resulting from sidelobe targets and excising these detections [90]. The outputs of the normal detector are passed to an editor that examines the beamformer's outputs, the adaptive weights and the list of raw detections. This information is used to classify each raw detection as either from the radar mainlobe or sidelobe. Only the mainlobe detections are passed on to subsequent signal processing stages. Another type of non-homogeneity detection is discussed in Reference 89. In this work, the power of ground clutter at each range-Doppler cell is measured by forming a non-adaptive beam at each Doppler pointed at the azimuth at which ground clutter appears. The amount of nulling accomplished by the adaptive system is estimated, and then the measured power of the ground clutter and an estimate of nulling performance is used to adjust a threshold. This methodology attempts to account for any undernulled ground clutter discretes predicted to appear in the adaptive range-Doppler map. Other recent work on robust algorithms that mitigate the effects of signal contamination includes Reference 133. This work illustrates how the maximum likelihood
estimate of the number of discretes in the training data can combat the negative effects of non-homogeneity. Algorithms that censor outliers are derived using approximations to maximum likelihood estimates. A more robust AMF algorithm is devised by using these censoring algorithms to improve the covariance matrix estimate. A technique that uses the generalised inner product (GIP) as a method for nonhomogeneity detection has been introduced and analysed by the research community. The GIP can be utilised as a measure of the variability of the components available to estimate the covariance matrix of the noise in the test cell [33]. The GIP nonhomogeneity detector has been further validated through work which approaches the non-homogeneity problem via an eigenanalysis. This effort analyses edge effects due to rural-urban interfaces, stationary discretes and moving scatterers in the reference data and further justifies the GIP as a simple test of eigenstructure [43]. The work in Reference 158 introduces a variation of the GIP non-homogeneity detector to improve target detection in cases of non-homogeneous background with multiple targets in close proximity. This scheme involves the application of a weighting, termed an equalisation. For the pool of reference cells used in the noise covariance estimation procedure, the cells closer in range to the test cell are weighted with a higher probability of being homogeneous than those farther away, which are assumed to be less representative of the noise in the test cell. A statistical analysis of the GIP, in which its pdf is determined, is presented in Reference 159. This statistical effort introduces a GIP-based test for non-homogeneity detection that utilises a goodness-of-fit test on an empirically formed pdf. The work is extended to cases with very limited reference data in Reference 160, which implements the derived pdfs in the calculation of two techniques that use the nonhomogeneity detector for covariance estimation. Finally, the scheme is used to derive a non-homogeneity detector for non-Gaussian noise in which the clutter is modelled as a spherically invariant random process (SIRP) [161]. The statistics of the detector are determined with the aforementioned pdf employed. Overall, the literature has shown that for both simulated and measured data, the proper use of the GIP as a nonhomogeneity detector can yield a dramatic increase in signal processing robustness in non-homogeneous environments. Unfortunately, all aspects of the radar detection procedure are not factored into the GIP methodology. In Reference 71, the GIP is compared with the AMF detection statistic as a non-homogeneity detector. The authors suggest that the GIP nonjudiciously limits reference cell data supply since the measure is independent of the radar's steering vector. Therefore, the use of the GIP can remove reference cells that have no significant negative impact on the processing, such as those with contaminating signals far away in angle from the target signal. The study also shows that, under non-homogeneous conditions and non-ideal array environments (steering vector mismatch), the non-homogeneity detector might actually worsen the situation. Many non-homogeneity detectors do not specify how to detect targets in test cells which are deemed to be non-homogeneous. In Reference 72, a two-stage hybrid STAP algorithm is introduced to remove this limitation. The method first implements a deterministic algorithm to filter the uncorrelated interference. This type of interference includes discretes in the test cell coming from directions and velocities other
than the location that the radar is querying for a target. This deterministic technique only employs data from the test cell and is therefore known as a direct data domain algorithm. After this stage, which uses only one degree of freedom, a STAP algorithm is implemented. Classical STAP schemes would not place a null at the location of discretes in the test cell, since the test cell, and thus the discrete, is not incorporated in the estimation of the noise covariance matrix. This two-stage method nulls discretes other than the target in the test cell. The technique could be used in conjunction with one of the previously described non-homogeneity detection schemes to increase overall performance [74,76]. Further analysis on direct data domain techniques is presented in Reference 32. Non-homogeneity detection remains a very active topic in the adaptive algorithm community.
13.3.3 Knowledge-based signal processing Knowledge-based signal processing is a very general approach that attempts to use signal processing in an intelligent fashion by considering additional information and adapting the processing based on this supplementary knowledge. Non-homogeneity detection is one problem to which knowledge-based signal processing has been successfully applied. Experts in target detection have noticed that easily accessible databases can be employed in conjunction with adaptive algorithms to mitigate the difficulties caused by non-homogeneous reference cells. Sources include mapping data, communication link trackers and other information sources that can be utilised along with adaptive processing [44-48]. For example, using map data, the range cells that correspond to geographic locations with roads can be removed. This preventative action excises reference cells that can potentially contain moving targets and consequently avoids signal contamination of the covariance matrix estimate. Additionally, it is possible to prenull known jammers, so that the degrees of freedom used in the STAP processing are utilised only to mitigate the effects of clutter. The effort in Reference 70 suggests that reference cells can be removed from the covariance matrix estimation procedure if they do not have common codes that correspond to databases, such as United States Geological Survey Land Use and Land Cover codes that describe the earth based on a classification system. The area of knowledge-based signal processing has received special attention due to the initiation of the United States Defense Advanced Research Projects Associations's Knowledge Aided Sensor Signal Processing and Expert Reasoning Program (KASSPER).
13.3.4 Analysis of degraded performance due to non-homogeneity Since the non-ideal, real-world detection environment presents difficulties not taken into consideration in the derivation of classical STAP algorithms, the capability to accurately determine the robustness of these techniques to non-ideal conditions is extremely valuable. There has been a significant effort to quantify the robustness of a variety of algorithms, both via Monte Carlo analysis as well as through closed-form equations that predict performance. Towards this goal, a variety of clutter modelling techniques and software have been developed to model non-homogeneous clutter.
The efforts supplement examples of the effects of non-homogeneity and allow further understanding of the specific contributors in a controlled environment. Examples in the literature include Reference 34, which develops a composite model of clutter non-homogeniety that allows for amplitude and spectral fluctuation in angle-Doppler subspaces. Spectrally distinct forms of space-time clutter non-homogeniety due to moving scatterers are provided as well. More recently, the United States Air Force at Rome Laboratory has developed high fidelity clutter modelling software, entitled RL-STAP, that has received positive attention [120,121]. Concerning traditional algorithmic performance, the robustness of the AMF algorithm in non-homogeneous clutter environments has been discussed in Reference 15, but only under the assumption of the availability of an infinite number of reference cells. This essentially drives the variability of the estimate to zero, which reduces the problem to a known covariance matrix case. Performance changes due to non-homogeneous environments have also been studied using simulations in such References as 43 and 36, and have been analytically described in References 101 to 103. Other efforts include Reference 106, which is relevant since most of the algorithms discussed are front-end adaptive beamformers. Work has been done to study the effects of scaling mismatch between the true and estimated noise covariance matrix on algorithm performance [212]. Also, the performance of a general class of detection algorithms in non-homogeneous clutter environments with mismatched steering and signal vectors is provided in References 195 and 196. Upperbounds of performance in mismatched scenarios are given in Reference 198. The studies in References 101,195 and 196 are based on a similar assumption, called the generalised eigenrelation (GER) in Reference 101. Validation of the GER assumption is provided in References 105 and 109. A discussion of the effects of this assumption on derived results is given in Reference 196, and an interpretation of this constraint from an adaptive nulling perspective is given in Reference 105. This constraint is relaxed in References 197, 198 and 200, and results in closed-form equations for performance of a general class of STAP algorithms under steering vector and covariance matrix mismatch. To lessen the negative impact of non-homogeneity, some efforts have attempted to combine more than one classical STAP algorithm into a hybrid approach. For instance, References 101 to 103 discuss the adaptive sidelobe blanker (ASB) algorithm which stems from sequentially following the AMF test with the ACE test. The ASB performance is compared with other traditional algorithms in homogeneous environments in Reference 107. An in-depth analysis of the ASB in non-homogeneous environments, in relation to other traditional detectors like the ACE, AMF and GLRT, is presented in Reference 108. The work in Reference 103 presents two methods for attempting to choose the thresholds required for the ASB to operate successfully in non-homogeneous clutter. In comparison with other traditional algorithms, the ASB is shown to provide superior performance to the AMF test alone due to its ability to mitigate false alarms caused by clutter non-homogeneities. Although computationally more efficient, its performance is on par with the GLRT. The ACE algorithm seems to suffer more from lower mainlobe target sensitivity than does the hybrid algorithm.
13.4
Antenna array errors
Most adaptive algorithm designs do not account for antenna array errors and such oversight can degrade signal processing performance. Since channels differ generally in both elevation and azimuth patterns, even at a fixed frequency, the argument can be made that the array calibration difficulty has been underestimated [22,23]. This difference in element beampatterns is a cause of steering vector mismatch. Mismatch occurs when the antenna array forms a beam that is not pointed in the exact direction desired. Under the conditions of steering mismatch, most adaptive algorithms are not optimal and can suffer significant losses in performance. The effects of steering mismatch on the GLRT and AMF tests are studied in References 7 and 5, respectively. A study of the effects of steering mismatch on a larger class of STAP algorithms appears in References 9 and 104. In Reference 104, the author illustrates that the AMF is less sensitive to steering vector mismatch than the ASB or the GLRT. However, the AMF is more sensitive to targets in the sidelobes (resulting in a higher false alarm rate). In Reference 113, array misalignment is shown to reduce the radar's ability to detect slow moving targets. In the design of many STAP processing schemes, the main assumption is a linear array of equally spaced isotropic point sensors. Also, each element of the array is assumed to be exactly like every other element. In reality, the physical size of the sensor causes a reradiation of incident fields leading to mutual coupling between the array elements. In addition, the channels are mismatched as a result of manufacturing errors. The works of References 74 and 75 discuss the negative effects of steering vector mismatch as well as mutual coupling. The significance of these errors is discussed in Reference 79. The influence of channel effects on the estimated covariance matrix such as amplification and orthogonality errors in the in-phase and quadrature channels, delay errors and offset errors is discussed in Reference 233. Taking these real-world factors into consideration in the design of the signal processing can result in performance improvement [73]. For example, certain algorithms use a discrete Fourier transform (DFT) to transform spatial data to the angle domain. However, when channel mismatch and mutual coupling affect the spatial steering vectors, a DFT may not be the most appropriate transform. Much of the literature has ignored the difficulties that can occur due to aircraft motion causing a change in the pitch angles of the platform. Analysis in Reference 80 indicates that the pitch angle of the airplane can have a significant effect on algorithm performance. Many antenna arrays are mounted to the fuselage of the aircraft. The pitch angle of the aircraft causes the array axis to incline relative to the velocity vector. The angle of the array relative to the velocity vector causes a broadening of the clutter ridge leading to a decrease in the MDV of the radar as well as an increase in the false alarm rate due to the broader, shallower nulls of the resulting filter. The crab angle that results from aircraft movement yields similar problems, but does not seem to be as substantial a concern. The study in Reference 55 suggests the utilisation of Doppler compensation to negate the effects of wind drift leading to aircraft crabbing. Reference 80 describes the use of a Doppler warping technique as the compensation for the effects of array inclination. Doppler warping is based on
shifting the mainlobes of the Doppler filters as a function of range so that the shifting follows the movement of the clutter ridge over range caused by the pitch angle. Since the calculation needs completing only once every CPI, it is not computationally expensive. Another alternative to counter the effects of platform or interferer motion without constantly having to obtain and invert new covariance estimates is given in Reference 110, which describes a transform that spatially widens the nulls created by traditional adaptive processing algorithms. It is hoped that in the future, with improved antenna design and calibration techniques, the effects of steering vector mismatch and pitch angle will be reduced.
13.5
Deviation from Gaussian assumption
One of the fundamental design assumptions of many STAP algorithms is that the statistics of the clutter returns can be derived from a Gaussian distribution. Under certain conditions, the assumption may be invalid. For instance, Reference 125 argues that the increased resolution of modern radars has caused a deviation from the Gaussian assumption of the clutter. Arguments are presented for modelling the clutter as a Rayleigh mixture, and signal processors are derived according to Rayleigh mixture pdfs. Non-Gaussian statistics have been reported for scattered power from the ocean and terrain and have been confirmed by experimental data recorded from a high resolution airborne radar operating at low grazing angles [154], Therefore, a large effort has been placed on algorithm design for the detection of targets in non-Gaussian noise. The performance of the GLRT and the AMF algorithms, which are both designed under the Gaussian assumption, have been shown to provide nearly identical performance with Gaussian interference [13]. In this case, the more computationally complex GLRT implementation is often not required. However, as the interference becomes more non-Gaussian and heavy tailed (as in the cases of either Weibull or log-normal distributions), the GLRT shows more robustness than the AMF [13]. Discussion of robust estimators using a priori information about the covariance matrix structure is described in Reference 63. The robustness of algorithms to the deviation from the Gaussian assumption is confronted in References 99 and 100, in which adaptive detection and beamforming structures are analysed when the clutter is modelled by a complex multivariate elliptically contoured distribution. Classical studies on robust detection in non-Gaussian noise and the statistical theory of minimax robustness are also relevant [220]. Contributions to the design of receivers for operation in clutter that is modelled by a complex spherically invariant random vector of the circular class include References 153 and 154. The derived receiver is shown to be equivalent to a parametric adaptive matched filter as opposed to a data-dependent threshold detector. The noise covariance matrix is approximated by a parametric multichannel model in which the parameters are estimated from the secondary data. This approach is shown to exhibit better performance than the AMF in non-Gaussian noise. However, the test involves implicit knowledge of the clutter statistics. In References 155 and 156, the effort is
extended to derive a test that requires no a priori knowledge of the noise statistics. The methods presented in this work illustrate robustness with respect to variations in clutter texture power for compound Gaussian clutter and provide a potential approach for cases in which small amounts of reference cells are available to estimate the noise covariance matrix. This trait is important in non-homogeneous environments in which homogeneity only exists for several reference cells. An adaptive matched filter and covariance matrix estimation procedure for target detection in spherically invariant noise are developed in Reference 178. In References 162 to 165, the authors derive a new generalised likelihood ratio test for detecting a signal in unknown, strong, non-Gaussian, low-rank interference plus white Gaussian noise. The algorithm shows robustness to both steering vector mismatch and rank overestimate. Sea clutter is sometimes well modelled using a K distribution. In Reference 180, an algorithm is presented, based on second and higher-order cumulants, which is able to operate in an unknown interference environment by employing both non-parametric and parametric estimation techniques. Detection in clutter that can be modelled as an SIRP (in this case, correlated K distributed clutter) is presented in References 181 and 183, in which Kelly's GLRT is outperformed by a detection algorithm that exhibits CFAR behaviour with respect to the amplitude pdf parameters and to the correlation structure of the disturbance. Additional work on detection in compound Gaussian noise is accomplished in Reference 182, in which a new adaptive receiver has been proposed as a candidate for robust detection. This detector is CFAR with respect to texture statistics and is robust with respect to variations in clutter temporal correlation as well as to mismatch between the design and operating conditions. Further work on robust detection in compound Gaussian clutter plus thermal noise is conducted in References 192 and 193. A performance analysis in terms of the classical covariance matrix estimator as well as that proposed for nonGaussian environments (which is a normalised version of the previously described estimator) is provided in Reference 186. This type of covariance estimator is used in an adaptive linear quadratic (ALQ) detector which is designed to work against a background of correlated non-Gaussian clutter. In Reference 187, the ALQ and GLRT performance are illustrated when confronted with the problem of detection against (non-Gaussian) real sea clutter data. To combat the difficulties caused by the computational complexity of STAP algorithms designed for non-Gaussian environments, less computationally intense, suboptimum detection schemes based on a polynomial approximation of a data-dependent threshold have been suggested in References 184 and 185. It has been argued that sometimes the Gaussian assumption is only partially violated. For instance, radar returns from the ground can be described by a Gaussian distributed process, and the echoes from the sea may follow a K distributed random process. Therefore, in References 189 and 190, a detector is derived for this type of composite disturbance and is compared with those derived based on only Gaussian or K distributed clutter. This work formats the AMF adaptive array processing into the sidelobe canceller form and into the Gram-Schmidt cascaded canceller form similar to References 123 and 124. However, the sensitive least-squares method is replaced
Next Page
with a pseudo-median canceller that is much less sensitive to outliers in the data. Also, with no outliers and with an asymptotically large set of reference cells, this robust processor approaches the AMF. Additional work considers this processor cascaded with a reduced-rank scheme in Reference 130. In this effort, the optimal rank reduction capability from the multistage Wiener filter (described in the next section) is retained, and robustness to targets/outliers and non-stationary data are attributed to the pseudo-median canceller. Violation of both the Gaussian and homogeneous data assumptions is considered in Reference 129. The authors introduce a pseudo-median canceller as a robust adaptive array method which significantly reduces the deleterious effects of non-Gaussian, real-world interference with signal contamination on typical array performance metrics. This algorithm also offers protection against signal cancellation.
13.6
Jamming and terrain scattered interference
Jamming represents a significant source of degradation in the target detection procedure. Radar jamming can be caused by either a hostile enemy or unintentionally by a friendly emitter. There are several different modes of operation and sophistication for a jammer. Wideband jammers can be broadband noise-like signals with sufficient spectral width to ensure overlap with the frequency bands that the radar employs. Other jammers, known as intermittent or self-screening, are responsive in nature and attempt to adjust to changing radar frequency bands and signal characteristics [201]. Although jammers can significantly differ in their temporal behaviour, their spatial propagation (and hence the correlation of their signals between channels) is dependent only upon their angular location. Therefore, an exploitation of these spatial correlations, such as the spatial filter shown in Figure 13.Id, can result in significant performance improvement. To properly sample the jammer signal without clutter, the radar can utilise the 'quiet times' between CPIs or use an auxiliary channel. Other works advocate using the over-the-horizon mode of operation to obtain jammer samples without clutter [95]. Some authors attempt to exploit the jamming energy in the sidelobes to suppress a jammer that may be in the mainlobe [62]. Along with the signal received via the direct path between the sensing platform and the jammer, a more deleterious component often exists. The signal of the jammer can bounce off terrain, buildings and other structures before reaching the receiving radar. This reflected signal results in the reception of time-delayed jammer signals, known in the literature as multipath interference, hot clutter or terrain scattered interference (TSI). In regions where the earth is smooth, the multipath signal appears at the same azimuth as the direct path jamming. Both components can usually be cancelled via a single spatial filter, as in Figure 13. Id, or via the design of an antenna beampattern with a low sidelobe response. Unfortunately, in more variable terrain, the jammer signal is scattered off a large portion of ground, including the section covered by the mainbeam of the antenna pattern. The TSI entering the radar through the mainbeam can be considerably larger than that entering through the sidelobes. Thus, the TSI creates a jamming environment in which the interference is distributed spatially in
Previous Page
with a pseudo-median canceller that is much less sensitive to outliers in the data. Also, with no outliers and with an asymptotically large set of reference cells, this robust processor approaches the AMF. Additional work considers this processor cascaded with a reduced-rank scheme in Reference 130. In this effort, the optimal rank reduction capability from the multistage Wiener filter (described in the next section) is retained, and robustness to targets/outliers and non-stationary data are attributed to the pseudo-median canceller. Violation of both the Gaussian and homogeneous data assumptions is considered in Reference 129. The authors introduce a pseudo-median canceller as a robust adaptive array method which significantly reduces the deleterious effects of non-Gaussian, real-world interference with signal contamination on typical array performance metrics. This algorithm also offers protection against signal cancellation.
13.6
Jamming and terrain scattered interference
Jamming represents a significant source of degradation in the target detection procedure. Radar jamming can be caused by either a hostile enemy or unintentionally by a friendly emitter. There are several different modes of operation and sophistication for a jammer. Wideband jammers can be broadband noise-like signals with sufficient spectral width to ensure overlap with the frequency bands that the radar employs. Other jammers, known as intermittent or self-screening, are responsive in nature and attempt to adjust to changing radar frequency bands and signal characteristics [201]. Although jammers can significantly differ in their temporal behaviour, their spatial propagation (and hence the correlation of their signals between channels) is dependent only upon their angular location. Therefore, an exploitation of these spatial correlations, such as the spatial filter shown in Figure 13.Id, can result in significant performance improvement. To properly sample the jammer signal without clutter, the radar can utilise the 'quiet times' between CPIs or use an auxiliary channel. Other works advocate using the over-the-horizon mode of operation to obtain jammer samples without clutter [95]. Some authors attempt to exploit the jamming energy in the sidelobes to suppress a jammer that may be in the mainlobe [62]. Along with the signal received via the direct path between the sensing platform and the jammer, a more deleterious component often exists. The signal of the jammer can bounce off terrain, buildings and other structures before reaching the receiving radar. This reflected signal results in the reception of time-delayed jammer signals, known in the literature as multipath interference, hot clutter or terrain scattered interference (TSI). In regions where the earth is smooth, the multipath signal appears at the same azimuth as the direct path jamming. Both components can usually be cancelled via a single spatial filter, as in Figure 13. Id, or via the design of an antenna beampattern with a low sidelobe response. Unfortunately, in more variable terrain, the jammer signal is scattered off a large portion of ground, including the section covered by the mainbeam of the antenna pattern. The TSI entering the radar through the mainbeam can be considerably larger than that entering through the sidelobes. Thus, the TSI creates a jamming environment in which the interference is distributed spatially in
azimuth throughout the mainbeam [208,61,77]. Placing a spatial null at a single azimuth in the mainbeam would not be effective, nor would placing nulls everywhere in the mainbeam succeed, since the target signal would also be cancelled. In the TSI problem formulation, the reflection of the radar transmission from the ground is specifically referred to as cold clutter, and the axis of the data cube given in Figure 13.4 denoting PRI and range are referred to as slow time and fast time, respectively. In this chapter, we use the term clutter to refer to cold clutter. TSI typically decorrelates across large time intervals (such as pulses) due to jammer motion. Thus processing across slow time, as done through the utilisation of range cells for the clutter-only scenario, is usually ineffective. However, the TSI will have a large temporal spread, containing significant correlation in fast time. Therefore, filters to ameliorate the problems of TSI must be designed to utilise the fast time correlation and have weights that are unfortunately quite different from those designed to combat the clutter [92,93]. The robustness of STAP algorithms in the presence of both direct path jamming and TSI is important in both military and commercial applications. This robustness is discussed in the following subsections. 13.6.1
Constraining detection
schemes
Designing and implementing constraints to combat the effects of jamming in the mainbeam, while maintaining acceptable performance levels, have recently received significant attention. Unconstrained adaptive algorithms can suppress the interfering signal by placing beampattern nulls at the jammer locations when interfering sources, such as TSI, are located in the mainbeam. However, the resultant beampattern can be distorted so that the desired target signal is cancelled along with the jamming. A potential solution involves the introduction of constraints to specifically prevent cancellation of the desired mainlobe signal while realising mitigation of the jamming [122-124]. The utilisation of multiple linear constraints is also investigated in References 150 and 151, in which adaptation is applied to pre-Doppler data without modification of the desired signal Doppler spectrum. The work in Reference 152 extends the linear constraints analysis to include both STAP in slow and fast time to mitigate both clutter and TSI. The work outlined in References 84 to 86 is combined and extended in References 87 and 88. The efforts confront the problem of TSI mitigation via a method of stochastic constraints and initially consider methods that require the use of range cells with only TSI and no clutter components [87]. However, attempts to remove this limitation are discussed in Reference 88. These publications address applications to the field of high-frequency over-the-horizon radar. The specific approach attempts to achieve TSI suppression while maintaining distortionless clutter post processing, a characteristic that is important when designing two-stage processors.
13.6.2
Two-stage processors
The technique of first using an adaptive spatial preprocessor for wideband noise jammer suppression followed by a STAP processor to promote clutter mitigation is
explored in works such as Reference 28. A frequency sideband, next to but distinct from the radar signal bandwidth, is used to obtain jammer-only training data for the presuppression processor. In some scenarios in which the reference data is limited, this approach has been shown to outperform the method of simultaneously nulling clutter and direct path jamming. When designing two-stage processing techniques, the impact of the first processor on the structure of the data should be taken into consideration for the design of the second stage [141,142,62]. The fact that two-stage algorithms mitigating TSI can modulate the clutter, and thereby degrade the performance of subsequent conventional STAP clutter cancellation techniques, is described in References 93, 92 and 95. In particular, References 92 and 96 present processors that attempt to cancel this modulated interference. The architecture modifies conventional post-Doppler STAP so that it has the freedom to place nulls in locations that will cancel modulated clutter as well as unmodulated clutter. To accomplish this task, the weights applied in the TSI canceller are analysed to determine how the clutter will be modulated. As previously mentioned, Reference 87 designs a two-stage processor utilising stochastic constraints, but reference cells with only hot clutter are required for the method's successful implementation. The work of Reference 88 attempts to remove this requirement through the use of a moving target indicator (MTI) filter to reject cold clutter for a few PRIs. The fast time samples within these PRIs are then used to determine the hot clutter filter for the first stage of the processing. The cold clutter filter is subsequently applied in the second stage. The success of this method depends on the ability of the MTI filter to remove the cold clutter. Other efforts on successfully cancelling jamming and clutter in two-stage processing schemes include References 209 and 210. A method of using antimodulation prefilters is described in Reference 97. This approach involves exploiting the structure of the weight modulation for TSI to construct special prefilters that largely eliminate modulated clutter. The basic approach cancels clutter of interest prior to TSI mitigation, so that when the remaining clutter is modulated, it lies below the thermal noise floor at its destination frequency. Also, the prefilter does not remove clutter from directions absent of modulation that will be cancelled in the STAP stage. Much of this work is summarised in Reference 98, which covers the effects of TSI modulation and clutter, methods to limit false alarms caused by TSI modulation and a variety of techniques to mitigate the effects of TSI and clutter in either two stages or simultaneously. A eountermeasure that is more sophisticated than wideband jamming is coherent repeater jamming, which mirrors the radar signal of interest. The jammer produces a coherent pulse train of the same carrier frequency, modulation type, pulse width and PRF as the radar system to be countered. It has a randomisation of the scan-to-scan range delay of the jammer pulse and an adjustable, linear initial phase progression over the pulse train. The coherent repeater thus produces target-like jamming signals with random range and Doppler. These countermeasures have been shown to degrade performance of algorithms developed under the assumption of homogeneity of the reference cells [21]. Since the jammer components are usually expected to be significantly stronger than targets, the aforementioned non-homogeneity detectors
represent a potential solution to this problem. Other techniques to combat coherent repeater jamming include spatial or temporal smoothing as well as filters designed to utilise constraints to decrease sensitivity to the presence of mainlobe coherent repeater jamming.
13.6.3
Three-dimensional STAP
In recognition of the limitations of two-stage approaches to mitigate the effects of jamming and TSI, some authors have realised that jamming, TSI and clutter environments can be dealt with concurrently in three-dimensional STAP schemes. These schemes are computationally complex and thus costly [201]. For example, References 206 and 207 consider a two-element adaptive canceller which includes both tapped delay lines and bandwidth partitioning. The ability of this system to cancel a direct jammer signal and its diffuse multipath is discussed under the assumption that the jammer signal is much stronger than any desired signal or system noise. Additional temporal degrees of freedom are achieved by using bandwidth partitioning (with a separate adaptive loop in each subband), an adaptive finite impulse response (FIR) filter, or a hybrid system that requires both bandwidth partitioning and adaptive FIR filters. An excellent development of three-dimensional STAP filtering is presented in Reference 208. This effort advocates the use of a processor with adaptive spatial degrees of freedom, as well as two different sets of adaptive temporal degrees of freedom. The two different sets of temporal taps required for each antenna element consist of one set spaced by the PRI, so as to cancel ground clutter, and another spaced by about one-half of the reciprocal of the radar bandwidth, so as to cancel the TSI. In References 141 and 142, the authors provide a three-dimensional STAP methodology for determining performance bounds of three-dimensional STAP when site-specific digital terrain data is used (clutter scattering coefficients are known a priori). In this way, future proposed TSI and clutter mitigation strategies can be compared with the bound presented to ascertain their absolute efficacy rather than only the relative performance. The real-time computing problem presented by two-dimensional STAP is only exacerbated by the addition of a third dimension. Therefore, Reference 143 explores the idea of reduced-rank three-dimensional STAP to reduce computational complexity, and provides a comparison of a multistage Wiener filter to a principal component method of rank reduction. Such methodologies are discussed in the next section. The technique presented in References 58 to 60 presents a compromise between two-dimensional and three-dimensional STAR The authors state that the majority of the clutter, jammer and TSI cancellation can be accomplished via a clutter filter. Further TSI suppression is achieved with a filter calculated by utilising the portion of the data cube corresponding to fast time and only one or a few elements of space and slow time. This technique is called extended beam-augmented STAR A further discussion of full-rank and reduced-rank methods for mitigating TSI is presented in Reference 61.
13.7
Reduction in computational complexity
STAP provides many interesting challenges to the real-time computing community. The computational rigours of computing the filter weights required for two-dimensional fully adaptive processing is demanding, and three-dimensional STAP computations are daunting at best. Thus there has been significant effort by the research community to investigate robust, suboptimal approaches that offer the benefit of a reduction in computational complexity. Several of these methodologies are discussed below.
13.7.1 Reduced-rank methods and covariance matrix tapers Reduced-rank, suboptimal adaptive detection schemes can provide robustness in nonideal detection environments. Reduced-rank methods have the benefit of reduced computational complexity as compared with fully adaptive algorithms. Reduced-rank processing exploits the low-rank nature of the clutter covariance matrix to reduce the required number of reference cells [38]. Since less data is needed, these algorithms can result in better performance than fully adaptive STAP in non-homogeneous environments. In addition, reduced-rank algorithms can be used in conjunction with other techniques such as non-homogeneity detectors to further enhance performance [37]. A general taxonomy of reduced-rank processors is provided in References 116 and 117. Reduced-rank approaches are also discussed in Reference 56. It is interesting to note that although reduced-rank methods have less desirable performance than full-rank methodologies in the case of a known clutter covariance matrix, they can outperform full-rank adaptive processing under non-ideal conditions. This limited performance of fully adaptive methods is due to errors resulting from the estimation process as well as the presence of thermal noise. These methodologies suppress estimation errors at the cost of a bias in the SNR. The net effect is a significant improvement for cases when the interference is truly low-rank [68]. Principal component inverse (PCI) is a modification of the AMF test that can provide a sufficient degree of adaptation with less training data than is required by fully adaptive processing. This technique uses a priori knowledge that, during the observation interval, the noise vectors are composed of a strong component and a background component (in which the strong component is well represented by a set of basis vectors that do not change over the adaptation interval). This principal components method chooses the largest eigenvectors corresponding to the largest estimated eigenvalues to generate an operator that reduces the rank of the data prior to adaptive processing, resulting in a low-rank representation of the full-rank space. PCI estimates the rank from data over the adaptation interval and assumes that the covariance has low rank [221,222]. It has been claimed that the reduced-rank methodology that maximises a cross-spectral metric (CSM) yields performance that is superior to that of PCI [57,64-67,147]. The CSM approach uses the estimated cross-spectral metric to
choose the eigenvectors and is formulated for a prescribed rank. Other modifications of the CSM are included in Reference 144. Rebuttal arguments are presented in References 221 and 222 to illustrate scenarios where PCI performs better than CSM, specifically in low-rank interference in which only a small number of reference cells are available. A third reduced-rank technique, called the eigencanceller, also uses the eigenstructure of the estimated test cell noise covariance matrix. The eigencanceller is an approach in which the weight vector is constrained to lie in the subspace composed of the small eigenvalues of the covariance matrix (corresponding mostly to the thermal noise), which is the subspace orthogonal to the dominant eigenvectors. The improved performance stems from using only the dominant eigenvectors in the formulation of the weight vector, whereas traditional methods use the smaller eigenvectors as well. The eigencanceller provides clutter and jammer cancellation, low variation and distortion of the beampattern, and is less computationally complex than fully adaptive algorithms. The spatial-only problem of mitigating jammer interference is considered in Reference 168. This spatial-only work is extended in Reference 169, in which two different space-time eigencanceller approaches are provided. In References 170 and 171, the eigencanceller is shown to be more robust than some traditional algorithms to estimation errors due to channel-to-channel mismatches (antenna responses are usually assumed to be identical, but due to manufacturing and calibration errors, always have some difference), pointing errors, limited number of reference cells or signal contamination. Another reduced-rank approach, discussed in References 118,119 and 69, results from a multistage representation of the Wiener filter. The Wiener filter is represented as a nested chain of Wiener filters. The reduced-rank approach differs from the fully adaptive approach by limiting the number of filters in the nested chain. One of the benefits of this approach is that it does not have the computational rigours involved in the eigendecomposition of the covariance matrix. In References 119 and 69, the authors demonstrate that under certain conditions the reduced-rank multistage Wiener filter may outperform such eigendecomposition approaches as the CSM (in minimum mean squared error and computational complexity). Some extended analysis of reduced-rank techniques is given in References 166 and 167. These studies consider the variation in performance due to estimation errors of the order of the filter, ICM, aircraft crabbing, the presence of jammers and the size of training data. Apparently, the multistage Wiener filter is less sensitive to the number of stages than is the eigencanceller, and the CSM is sensitive to problems with estimation of the rank of the clutter but is less sensitive to the effects of ICM. One disadvantage of the multistage Wiener filter model is that its order must be obtained a priori in an iterative fashion. Additional analysis of reduced-rank techniques is provided in Reference 144. We have noted that the non-ideal conditions of real-world detection environments can make an accurate calculation of the rank of the clutter covariance matrix difficult. Some of the resulting performance loss can be regained through the implementation of a covariance matrix taper (CMT). An important class of adapted pattern modification techniques is realised by the application of a conformal matrix taper to the original
covariance matrix. The work in Reference 145 reviews the concept of covariance matrix tapers and illustrates that previously introduced techniques to increase jammer notch width are special cases of covariance matrix tapers. CMTs provide diagonal loading benefits to minimise the effects of pattern distortion due to insufficient sample support and weight mismatch due to non-stationary interference. These CMTs can lower sidelobe levels and appropriately widen the notches meant to clear clutter ridges and jammers, thus making them more robust to errors in algorithm assumptions and limited sample support. In Reference 149, a CMT is combined with the principal components method to yield interference mitigation that restores the minimal sample support property, and yet does not suffer from undernulled interference. Such hybrid techniques can add robustness to alleviate the problems from which the principal components methods suffer in cases with severe eigenvalue spreading. This technique is studied further in Reference 146 in which the dominant eigenvectors are initially estimated by utilising a commensurately small sample support. Next, the remaining subdominant interference subspace is synthesised by applying a CMT. The dominant eigenvalues and eigenvectors are obtained in exactly the same manner as in the principal components method. However, to appropriately account for subspace leakage, a CMT is applied to synthesise the subdominant coloured noise space arising from the aforementioned modulation mechanisms. The white noise floor is introduced to properly set null depth and ensure non-singularity. Models of ICM and general knowledge of the environment and system can be used to generate the CMT. Conservative CMTs result in overnulling the clutter.
13.7.2
Techniques implementing limited reference cells
As previously described, the assumption that reference cells are homogeneous may only apply to a grouping of a few reference cells rather than the amount necessary to conduct fully adaptive STAP processing. Therefore, a variety of techniques has been developed to use a smaller number of reference cells, referred to as limited sample support, than that needed by fully adaptive STAR These processing schemes are suboptimal under ideal conditions, but exhibit a robustness in non-homogeneous environments not shared by traditional STAP architectures. One methodology designed for less samples than fully adaptive STAP is called spatial smoothing [205], or forward-backward averaging [111, 112]. This technique provides extra snapshots for covariance matrix estimation by forming subarrays of the spatial arrays. If the resulting loss in resolution produced by using a reduced number of elements at a time is acceptable, the covariance matrix can be smoothed. Therefore, subaperture averaging provides sufficient samples for estimating the covariance matrix in cases in which this calculation would not otherwise be possible and proves useful in sample-starved or non-homogeneous cases. The technique improves SNR and sidelobe levels and has been shown to be robust in the presence of antenna manifold errors. Other efforts, such as References 138 and 139, have explored the utility of smoothing to trade spatio-temporal aperture for increased effective sample support. These investigations show that we may obtain additional
sample support via subspatial and subtemporal aperture smoothing. The reduction in aperture can lead to a widening of the clutter ridge and hence an increase in MDV, but this can be an acceptable exchange for the increased number of samples. The use of diagonal loading to reduce the effects of limited reference cells has also received attention. As previously stated, the covariance matrix typically has several large eigenvalues corresponding to the clutter/jamming, with the remaining eigenvalues corresponding to the thermal noise. With very limited reference cell data, there is inadequate estimation of the thermal noise, and a large thermal noise eigenvalue spread occurs. As more training data is added, the estimation improves and the thermal noise eigenvalues converge to the correct values. If the eigenvalue spread is minimised, the adapted beam shape approaches the ideal pattern. It has been suggested that by increasing the effective value of the thermal noise eigenvalues by creating an artificial thermal noise level, or loading, limited training data performance can be improved. The large interference eigenvalues are minimally affected, but the eigenvalues well below the interference level are significantly increased. This technique can improve the adaptive sidelobe levels and enhance the shape of the mainbeam response when limited training data is available. A reduced null depth and hence an increase in residual interference is the consequence [83,47]. Further study illustrates that diagonal loading helps performance when signal contamination is the only source of non-homogeneity [48]. The work in Reference 140 attempts to determine the ideal scaling for the diagonal loading. It finds this scaling for the case when the number of reference cells used in the adaptive processing is larger than the number of eigenvalues corresponding to the clutter, and argues that this scaling should be set equal to zero when this number is less than the number of eigenvalues corresponding to the clutter. Another method for reducing the possibility of non-homogeneity is the implementation of a multifrequency system. This technique reduces the range from which reference cells need to be taken. It is hoped that the reference cells that are closer to the test cell in range will have a higher probability of homogeneity. In References 16 and 17, the authors suggest the use of frequency diversity. Multifrequency algorithms based on the AMF and the GLRT are developed. These schemes are shown to outperform traditional STAP algorithms in non-homogeneous environments in which a limited number of reference cells are available. Of course, such systems are inherently more complex and costly. An extension of this work is adopted by Reference 179, which proposes a multiband detector for targets in non-Gaussian noise modelled as compound Gaussian clutter. The approach presented in References 136 and 137 is also of interest for reducing the effects of non-homogeneity. Here, a multichannel adaptive whitening filter is obtained from an adaptive least-squares predictive transform signal modelling procedure. This whitening filter attempts to prewhiten the otherwise angle-Doppler correlated multichannel interference signal prior to detection. This type of processing has been shown to approach optimal processing for some cases with much smaller sample support and thus less computational burden. Another suboptimal approach to reduce the required sample support is described in Reference 78.
The subarraying approach in conjunction with a space-time least-squares FIR filter processor can significantly reduce complexity. A comparison of several of these techniques with optimal processing and reduction in computational requirements is presented in Reference 56. Finally, the approach in Reference 224 attempts to model the covariance matrix with only a few parameters as opposed to estimating the entire noise covariance matrix. If there are less parameters to estimate, the paper proposes that the algorithm requires less reference cells than that of a fully adaptive approach.
13.7.3 Low complexity approaches to STAP Real-time computing is one of the many challenges of target detection in complex environments. Reduced-rank methodologies provide less computational load than do fully adaptive techniques. Other approaches to reducing computational load are also of great interest. For example, References 18 to 20, 26 and 27 argue that applying STAP to the full data cube can be computationally wasteful and can result in poor performance when the data set used to estimate the noise covariance matrix is small. The authors suggest that the adaptivity should be applied only to those regions of the processing domain in which some detection improvement is 'necessary, possible, and probable'. The suggested technique is to perform a Fourier transform of the data. Based on some a priori knowledge of the types of interference or on the power distribution across the Doppler frequency bins, the Doppler domain is subsequently divided into flat regions and sharply changing regions. Only a few angular bins close to the transmitted energy are analysed, and all Doppler bins are considered because the target Doppler frequency shift is unknown to the processor. For the flat or nearly flat spectrum regions, the conventional windowed DFT or FIR filter bank has achieved nearly optimum performance. The sharply changing regions denote areas that could result in improved detection performance via adaptive algorithm application. This technique is called the joint-domain localised generalised likelihood ratio (JDL-GLR) processor. A much cited reference concerning suboptimal STAP techniques with reduced complexity is Reference 211. This paper outlines the so-called factored approach, in which Doppler filtering is initially performed on each array element, followed by spatial filtering in each separate Doppler channel. This separation, or factoring, of the temporal and spatial filtering functions leads to considerable computational savings at the expense of some performance loss. The technique advocates a data transform into the space-frequency domain, in which the transformed signals are less correlated. It is this reduction in correlation which promotes an approximation of the covariance matrix which can result in techniques to simplify the normally demanding inversion of the sample covariance matrix. Along with a restriction of the allowed target Doppler, these algorithms can result in good performance with a dramatic reduction in computational load [211]. Work on the factored approach with robustness to limited reference cell data is provided in Reference 225. The work in Reference 157 presents the parametric adaptive matched filter methodology. This method is based on approximating the interference spectrum
with a multichannel autoregressive model of low order. The fact that a low-order autoregressive model provides an accurate representation of simulated and measured interference for a wide variety of system and scenario conditions leads to reduced computational requirements. The modelling fidelity is attained using a small fraction of the amount of reference cells required for fully adaptive STAR It also seems to be tolerant of the presence of targets in the secondary data for both small and large secondary data set sizes. However, this method does not have the desirable CFAR property. Another method for reducing computational load [81] considers how the samples for the covariance matrix estimate are gathered and exactly how that matrix is calculated. Typically, reference cells surround the test cell, with a few guard bands between the test and reference cells. The methods given in Reference 81 illustrate how different training strategies can limit the computational burden. Such methods include using the same weights for a variety of primary data cells. Another method is called the sliding hole, in which a covariance matrix is estimated once using all of the range gates in a range segment. The target range gate and guard bands are removed (the hole). Then the hole slides through the training region. A method for achieving a reduction in computational load is also given in Reference 135, in which the expanding block structure of the covariance matrix is exploited to update the covariance matrix inverse as each pulse is received. The block augmented matrix inversion algorithm results in significant computational gain, and the residual information in the incoming pulse is used to generate an adaptive stopping criterion that addresses the sample data support problem. Several adaptive algorithm techniques have been developed which use only several spatial channels and thus produce a reduction in computational complexity [116,117]. Alternatively, References 41 and 42 discuss a low complexity STAP scheme called adaptive displaced phase centred antenna (DPCA). It is shown that this technique performs well under certain conditions of covariance matrix mismatch and can potentially outperform fully adaptive STAR The technique is based on a pulse cancelling structure, which allows the scheme to cancel clutter with high correlation across several pulses even if this correlation is not present in the training data. This structure can be interpreted as changing the steering vector in the approach [2,3]. One weakness of this scheme is that there are stringent requirements for its application [47]. The problem with non-adaptive DPCA is that it is sensitive to antenna errors and requires the platform velocity to be known well enough to adjust the interpulse period [204]. The work in References 24, 25 and 31 considers a type of STAP denoted as sum/difference STAR An antenna array can be divided into two sections, each with its own beampattern. The sum beam is formed by adding the result of these two subarrays, and the difference beam is formed by subtracting the two returns. Therefore, sum/difference STAP utilises only the sum and difference channels of the radar. It requires a much smaller sample support than do other approaches and seems to perform well in non-homogeneous clutter. Additionally, this technique uses fewer channels than other techniques so that system calibration, which can be costly, is less difficult. The sum and difference channels are processed both spatially and
temporally to minimise clutter effects while preserving the predesigned gain on the desired target signal. Additionally, this technique can be implemented in conjunction with the presuppression of wideband noise jammers using spatially adaptive presuppression [29,30]. The authors argue that the clutter suppression in this twostage processor is almost as successful as full STAP, even though the beampattern is changed before STAP is applied. In Reference 132, a technique is developed which shows faster convergence (and thus less need for sample support) than traditional methods. A maximum likelihood solution under the assumption that the input interference is Gaussian is derived for a structured covariance matrix that has the form of the identity matrix plus an unknown, positive, semidefinite Hermitian matrix. This work is subsequently leveraged to develop a fast converging detector for distributed targets. Other methods that have reduced computational load (and potentially less sample support) and therefore may be more applicable in current computing conditions are provided in References 50 to 54, 82, 39, 40 and 128.
13.8
Conclusions
Many STAP algorithms have been designed under the assumption of ideal conditions in the operational detection environment. Unfortunately, factors such as clutter nonhomogeneity, signal contamination, antenna array errors, jamming and TSI are often present in the real world. To combat the deleterious effects that such components have on radar systems, a variety of techniques have been developed to either circumvent or ameliorate the impact of such conditions. Some of these algorithms introduce a level of complexity greater than that required by the previously proposed schemes, and other techniques offer the benefit of a reduction in computing resources. It is crucial for the commercial user as well as the warfighter to understand the robustness of these algorithms in such challenging environments. Here we have presented a brief review of several traditional STAP schemes as well as an explanation of the complex detection environments in which many radar signal processors must operate. A review of the deterioration of the performance of algorithms, as well as the benefits and drawbacks of newly proposed adaptive processing approaches, has been provided. We discussed that an engineering trade off exists between the design of the algorithm and its computational difficulties in real-world implementation. In the future, we look forward to the potential marriage of several of the presented signal processing techniques to provide overall system robustness and real-time target detection in real-world, non-ideal detection environments. References 1 CURTIS SCHLEHER, D.: 4MTI and pulsed Doppler radar' (Artech House, Inc., Norwood, MA, 1991) 2 BRENNAN, L. E. and REED, I. S.: Theory of adaptive radar', IEEE Trans. Aerosp. Electron. Syst9 March 1973, 9, (2), pp. 237-252
3 REED, I. S., MALLETT, J. D., and BRENNAN, L. E.: 'Rapid convergence rate in adaptive arrays', IEEE Trans. Aerosp. Electron. SySt., November 1974, 10, (6), pp. 853-863 4 CHEN, W. S. and REED, I. S.: 'A new CFAR detection test for radar', Digit. Signal Process., October 1991, 4, pp. 198-214 5 ROBEY, F. C , FUHRMANN, D. R., KELLY, E. J., and NITZBERG, R.: 4 A CFAR adaptive matched filter detector', IEEE Trans. Aerosp. Electron. Syst., January 1992, 28, (1), pp. 208-216 6 KELLY, E. J.: 'An adaptive detection algorithm', IEEE Trans. Aerosp. Electron. Syst., March 1986, 22, (1), pp. 115-127 7 KELLY, E. J.: 'Performance of an adaptive detection algorithm; rejection of unwanted signals', IEEE Trans. Aerosp. Electron. Syst, March 1989, 25, (2), pp.122-133 8 KELLY, E. J. and FORSYTHE, K. M.: 'Adaptive detection and parameter estimation for multidimensional signal models'. Technical report 848 - Lincoln Laboratory, Massachusetts Institute of Technology, Lexington, MA, April 1989 9 KALSON, S. Z.: 'An adaptive array detector with mismatched signal rejection', IEEE Trans. Aerosp. Electron. Syst, January 1992, 28, (1), pp. 195-207 10 PULSONE, N. B. and RADER, C. M.: 'Adaptive beamformer orthogonal rejection test', IEEE Trans. Signal Process., March 2001, 49, (3), pp. 521-529 11 LIAO, X., BAO, Z., and XING, M.: 'On the aspect sensitivity of high resolution range profiles and its reduction methods'. Record of the IEEE 2000 international Radar conference, Alexandria, VA, May 7-12, 2000, pp. 310-315 12 WANG, H. and CAI, L.:'Robust CFAR detection in nonhomogeneous correlated interference'. Proceedings of the IEE international conference on Acoustics, speech, and signal processing, San Francisco, CA, March 23-26, 1992, 5, pp. 373-376 13 CAI, L. and WANG, H.: 'Performance comparisons of modified SMI and GLR algorithms', IEEE Trans. Aerosp. Electron. Syst, May 1991, 27, (3), pp. 487^91 14 WANG, H. and CAI, L.: 'New results on integrated adaptive filtering and CFAR processing'. Proceedings of the national Telesystems conference, Washington, DC, May 19-20, 1992, 7, pp. 9-15 15 CAI, L. and WANG, H.: 'Further results on adaptive filtering with embedded CFAR', IEEE Trans. Aerosp. Electron. Syst, October 1994, 30, (4), pp. 1009-1020 16 WANG, H. and CAI, L.: 'On adaptive multiband signal detection with the SMI algorithm', IEEE Trans. Aerosp. Electron. Syst, September 1990, 26, (5), pp. 768-773 17 WANG, H. and CAI, L.: 'On adaptive multiband signal detection with GLR algorithms', IEEE Trans. Aerosp. Electron. Syst, March 1991, 27, (2), pp. 225-233 18 WANG, H. and CAI, L.: 'On adaptive implementation of optimum MTI in severely nonhomogeneous environments'. Record of the IEEE Radar Conference, Arlington, VA, May 7-10, 1990, pp. 351-355
19 WANG, H. and CAI, L.: 4A localized adaptive MTD processor', IEEE Trans. Aerosp. Electron. Syst., May 1991, 27, pp. 532-539 20 WANG, H. and CAI, L.: 'On adaptive spatial-temporal processing for airborne surveillance radar systems', IEEE Trans. Aerosp. Electron. Syst., July 1994, 30, (3), pp. 660-670 21 WANG, H., ZHANG, Y, and ZHANG, Q.: 4A view of current status of space-time processing algorithm research'. Record of the IEEE international Radar conference, Alexandria, VA, May 8-11, 1995, pp. 635-640 22 WANG, H., ZHANG, Y., and ZHANG, Q.: 'Lessons learned from recent STAP experiments'. Proceedings of the CIE international conference of Radar, Beijing, China, October 8-10, 1996, pp. 761-765 23 WANG, H., ZHANG, Y, ZHANG, Q., BROWN, R. D., and WICKS, M. C : 'An improved and affordable space-time adaptive processing approach'. Proceedings of the CIE international conference of Radar, Beijing, China, October 8-10, 1996, pp. 72-77 24 BROWN, R. D., WICKS, M. C , ZHANG, Y, ZHANG, Q., and WANG, H.: 'A space-time adaptive processing approach for improved performance and affordability'. Proceedings of the IEEE national Radar conference, Ann Arbor, MI, May 13-16, 1996, pp. 321-326 25 ZHANG, Y and WANG, H.: 'Further study on space-time adaptive processing with sum and difference beams'. Digest of antennas and propagation society international symposium, Montreal, Quebec, July 13-18, 1997, 4, pp. 2422-2425 26 WANG, H., WICKS, M., and ZHANG, Y: 'Signal processing techniques for target detection performance improvements in existing airborne early warning radars'. Proceedings of the IEEE national Aerospace and electronics conference (NAECON), Dayton, OH, May 23-27, 1994,1, pp. 189-196 27 WANG,H.,WICKS,M.,andZHANG,Y: 'A new Doppler processing technique for detection performance improvement in existing airborne radars'. Record of the IEEE international Radar conference, Alexandria, VA, May 8-11, 1995, pp. 72-76 28 RIVKIN, R, ZHANG, Y, and WANG, H.: 'Spatial adaptive pre-suppression of wideband jammers in conjunction with STAP: a sideband approach'. Proceedings of the CIE international conference of Radar, Beijing, China, October 8-10, 1996, pp. 439^43 29 MAHER, J., ZHANG, Y, and WANG, H.: 'A performance evaluation of EA-STAP approach to airborne surveillance radars in the presence of both clutter and jammers'. Proceedings of Radar 97, Edinburgh, United Kingdom, October 14-16, 1997, pp. 305-309 30 ZHANG, Y, MAHER, J., CHANG, H., and WANG, H.: 'Adaptive pre-suppression of jammers for STAP-based airborne surveillance radars'. Proceedings of the IEEE Radar conference (RADARCON), Dallas, TX, May 11-14, 1998, pp. 32-37 31 BROWN, R. D., SCHNEIBLE, R. A., WICKS, M. C , WANG, H., and ZHANG, Y: 'STAP for clutter suppression with sum and difference beams', IEEE Trans. Aerosp. Electron. Syst., April 2000, 36, (2), pp. 634-646
32 SARKAR, T. K., WANG, H., and PARK, S., et al.: 'A deterministic least-squares approach to space-time adaptive processing (STAP)', IEEE Trans. Antennas Propag., January 2001, 49, (1), pp. 91-103 33 MELVIN, W. L., WICKS, M. C , and BROWN, R. D.: 'Assessment of multichannel airborne radar measurements for analysis and design of space-time processing architectures and algorithms'. Proceedings of the IEEE national Radar conference, Ann Arbor, MI, May 13-16, 1996, pp. 130-135 34 MELVIN, W. L.: 'Space-time adaptive radar performance in heterogeneous clutter', IEEE Trans. Aerosp. Electron. Syst, April 2000, 36, (2), pp. 621-633 35 MELVIN, W. L. and WICKS, M. C : 'Improving practical space-time adaptive radar'. Proceedings of the IEEE national Radar conference, Syracuse, NY, May 13-15, 1997, pp. 48-53 36 WICKS, M. C , MELVIN, W. L., and CHEN, P.: 'An efficient architecture for nonhomogeneity detection in space-time adaptive processing airborne early warning radar'. Proceedings of Radar 97, Edinburgh, United Kingdom, October 14-16, 1997, pp. 295-299 37 HIMED, B. and MELVIN, W. L.: 'Analyzing space-time adaptive processors using measured data'. Record of the thirty-first Asilomar conference on Signals, systems and computers, Pacific Grove, CA, November 2-5, 1997, 1, pp. 930-935 38 MELVIN, W. L., GUERCI, J. R., CALLAHAN, M. J., and WICKS, M. C : 'Design of adaptive detection algorithms for surveillance radar'. Record of the IEEE international Radar conference, Alexandria, VA, May 7-12, 2000, pp. 608-613 39 BOJANCZYK, A. W., MELVIN, W. L., and HOLDER, E. J.: 'Subspace approximation for adaptive multichannel radar filtering'. Record of the thirty-second Asilomar conference on Signals, systems and computers, Pacific Grove, CA, November 1-4, 1998, 2, pp. 1542-1546 40 BOJANCZYK, A. W. and MELVIN, W. L.: 'Simplifying the computational aspects of STAP'. Proceedings of the IEEE Radar conference (RADARCON), Dallas, TX, May 11-14, 1998, pp. 123-128 41 BLUM, R. S., MELVIN, W. L., and WICKS, M. C : 'An analysis of adaptive DPCA'. Proceedings of the IEEE national Radar conference, Ann Arbor, MI, May 13-16, 1996, pp. 303-308 42 GU, Z., BLUM, R. S., MELVIN, W. L., and WICKS, M. C : 'Comparison of STAP algorithms for airborne radar'. Proceedings of the IEEE national Radar conference, Syracuse, NY, May 13-15, 1997, pp. 60-65 43 MELVIN, W. L.: 'Eigenbased modeling of nonhomogeneous airborne radar environments'. Proceedings of the IEEE Radar conference, Dallas, TX, May 11-14, 1998, pp. 171-176 44 ANTONIK, P., SCHUMAN, H. K., MELVIN, W. L., and WICKS, M. C : 'Implementation of knowledge-based control for space-time adaptive processing'. Proceedings of Radar 97, Edinburgh, United Kingdom, October 14-16, 1997, pp. 478-482
45 ANTONIK, R, SCHUMAN, H., LI, R, MELVIN, W., and WICKS, M.: 'Knowledge-based space-time adaptive processing'. Proceedings of the IEEE national Radar conference, Syracuse, NY, May 13-15, 1997, pp. 372-377 46 MELVIN, W., WICKS, M., ANTONIK, P., SALAMA, Y., LI, P., and SCHUMAN, H.: 'Knowledge-based space-time adaptive processing for airborne early warning radar', IEEE Aerosp. Electron. Syst. Mag., April 1998, 13, (4), pp. 37-12 47 MELVIN, W. L. and GUERCI, J. R.: 'Adaptive detection in dense target environments'. Proceedings of the IEEE Radar conference, Atlanta, GA, May 1-3, 2001, pp. 187-192 48 BERGIN, J. S., TECHAU, P. M., MELVIN, W. L., and GUERCI, J. R.: 'GMTI STAP in target-rich environments: site-specific analysis'. Proceedings of the IEEE Radar conference, Long Beach, CA, April 22-25, 2002, pp. 391-396 49 BABU, B. N. S., TORRES, J. A., and MELVIN, W. L.: 'Processing and evaluation of multichannel airborne radar measurements (MCARM) measured data'. Proceedings of the IEEE international symposium on Phased array systems and technology, Boston, MA, October 15-18, 1996, pp. 395-399 50 KLEMM, R.: 'Introduction to space-time adaptive processing', Electron. Commun. Eng. J., February 1999,11, (1), pp. 5-12 51 KLEMM, R.: 'Introduction to space-time adaptive processing'. Proceedings of the IEE colloquium on Space-time adaptive processing, London, United Kingdom, April 6, 1998,1, pp. 1-11 52 KLEMM, R.: 'Real-time adaptive airborne MTI, Part I: space-time processing'. Proceedings of the CIE international conference of Radar, Beijing, China, October 8-10, 1996, pp. 755-760 53 KLEMM, R.: 'Real-time adaptive airborne MTI, Part II: space-frequency processing'. Proceedings of the CIE international conference of Radar, Beijing, China, October 8-10, 1996, pp. 430-433 54 KLEMM, R.: 'Cramer-Rao analysis of reduced order STAP processors'. Record of the IEEE 2000 international Radar conference, Alexandria, VA, May 7-12, 2000, pp. 584-589 55 KREYENKAMP, O. and KLEMM, R.: 'Doppler compensation in forwardlooking STAP radar', IEE Proc, Radar Sonar Navig., October 2001, 148, (5), pp.253-258 56 KLEMM, R.: 'Principles of space-time adaptive processing' (IEE, London, UK, 2002, 2nd edn.), pp. 58 and 142 57 GOLDSTEIN, J. S., WILLIAMS, D. B., and HOLDER, E. J.: 'A frequency domain realization of an optimal partially adaptive sensor array'. Conference record of the Military communications conference (MILCOM), San Diego, CA, November 5-8, 1995, 2, pp. 607-611 58 SELIKTAR, Y, WILLIAMS, D. B., and HOLDER, E. J.: 'Beam-augmented space-time adaptive processing'. Proceedings of the IEEE international conference on Acoustics, speech, and signal processing, Phoenix, AZ, March 15-19, 1999, 5, pp. 2849-2852
59 SELIKTAR, Y, WILLIAMS, D. B., and HOLDER, E. J.: 'Beam-augmented STAP for joint clutter and jammer multipath mitigation', IEE Proc, Radar Sonar Navig., October 2000,147, (5), pp. 225-232 60 SELIKTAR, Y, WILLIAMS, D. B., and HOLDER, E. J.: 'Adaptive monopulse processing of monostatic clutter and coherent interference in the presence of mainbeam jamming'. Record of the thirty-second Asilomar conference on Signals, systems and computers, Pacific Grove, CA, November 1-4, 1998, 2, pp.1517-1521 61 KOGAN, S. M., WILLIAMS, D. B., and HOLDER, E. J.: 'Reduced-rank terrain scattered interference mitigation'. Proceedings of the IEEE international symposium on Phased array systems and technology, Boston, MA, October 15-18, 1996, pp. 400-405 62 KOGAN, S. M., WILLIAMS, D. B., and HOLDER, E. J.: 'Exploiting coherent multipath for mainbeam jammer suppression', IEE Proc, Radar Sonar Navig., October 1998,145, (5), pp. 303-308 63 WILLIAMS, D. B. and JOHNSON, D. H.: 'Robust estimation of structured covariance matrices', IEEE Trans. Signal Process., September 1993, 41, (9), pp. 2891-2906 64 GOLDSTEIN, J. S., KOGON, S. M., REED, I. S., WILLIAMS, D. B., and HOLDER, E. J.: 'Partially adaptive radar signal processing: the crossspectral approach'. Record of the twenty-ninth Asilomar conference on Signals, systems and computers, Pacific Grove, CA, October 30-November 2, 1995, 2, pp.1383-1387 65 GOLDSTEIN, J. S. and REED, I. S.: 'Reduced-rank adaptive filtering', IEEE Trans. Signal Process., February 1997, 45, pp. 492^96 66 GOLDSTEIN, J. S. and REED, I. S.: 'Subspace selection for partially adaptive sensor array processing', IEEE Trans. Aerosp. Electron. Syst., April 1997, 33, pp. 539-544 67 GOLDSTEIN, J. S. and REED, I. S.: 'Theory of partially adaptive radar', IEEE Trans. Aerosp. Electron. Syst., October 1997, 33, pp. 1309-1325 68 PECKHAM, C. D., HAIMOVICH, A. M., AYOUB, T. R, GOLDSTEIN, J. S., and REED, I. S.: 'Reduced-rank STAP performance analysis', IEEE Trans. Aerosp. Electron. Syst., April 2000, 36, pp. 664-676 69 GOLDSTEIN, J. S., REED, I. S., and SCHARF, L. L.: 'A multistage representation of the Wiener filter based on orthogonal projections', IEEE Trans. Inf. Theory, November 1998, 33, pp. 2943-2959 70 WEINER, D. D., CAPRARO, G. T., and WICKS, M. C : 'An approach for utilizing known terrain and land feature data in estimation of the clutter covariance matrix'. Proceedings of the IEEE Radar conference (RADARCON), Dallas, TX, May 11-14, 1998, pp. 381-386 71 ADVE, R. S., HALE, T. B., and WICKS, M. C : 'Transform domain localized processing using measured steering vectors and non-homogeneity detection'. Record of the 1999 IEEE Radar conference, Waltham, MA, April 20-22, 1999, pp. 285-290
72 ADVE, R. S., HALE, T. B., and WICKS, M. C : 4A two stage hybrid space-time adaptive processing algorithm'. Record of the 1999 IEEE Radar conference, Waltham, MA, April 20-22, 1999, pp. 279-284 73 ADVE, R. S. and WICKS, M. C : 'Joint domain localized processing using measured spatial steering vectors'. Proceedings of the IEEE Radar conference (RADARCON), Dallas, TX, May 11-14, 1998, pp. 165-170 74 ADVE, R. S., WICKS, M. C , HALE, T. B., and ANTONIK, P.: 'Groundmoving target indication using knowledge based space time adaptive processing'. Record of the IEEE international Radar conference, Alexandria, VA, May 7-12, 2000, pp. 735-740 75 ADVE, R. S., HALE, T. B., and WICKS, M. C : 'Practical joint domain adaptive processing in homogeneous and nonhomogeneous environments. Part 1: homogeneous environments', IEE Proc, Radar Sonar Navig., April 2000,147, (2), pp. 57-65 76 ADVE, R. S., HALE, T. B., and WICKS, M. C : 'Practical joint domain adaptive processing in homogeneous and nonhomogeneous environments. Part 2: nonhomogeneous environments', IEE Proc, Radar Sonar Navig., April 2000, 147, (2), pp. 66-74 77 WICKS, M., PIWINSKI, D., and LI, P.: 'Space-time adaptive processing in modern electronic warfare environments'. Record of the IEEE international Radar conference, Alexandria, VA, May 8-11, 1995, pp. 609-613 78 PADOS, D. A., TSAO, T., MICHELS, J. H., and WICKS, M. C : 'Joint domain space-time adaptive processing with small training data sets'. Proceedings of the IEEE Radar conference (RADARCON), Dallas, TX, May 11-14, 1998, pp. 99-104 79 SARKAR, T. K., ADVE, R. S., and WICKS, M. C : 'Effects of mutual coupling and channel mismatch on space-time adaptive processing algorithms'. Proceedings of the IEEE international conference on Phased array systems and technology, Dana Point, CA, May 21-25, 2000, pp. 545-548 80 BORSARI, G. K.: 'Mitigating effects on STAP processing caused by an inclined array'. Proceedings of the IEEE Radar conference (RADARCON), Dallas, TX, May 11-14, 1998, pp. 135-140 81 BORSARI, G. K. and STEINHARDT, A. O.: 'Cost-efficient training strategies for space-time adaptive processing algorithms'. Proceedings of the 1995 twentyninth Asilomar conference on Signals, systems and computers, Pacific Grove, CA, October 30-November 1, 1995,1, pp. 650-654 82 GOLDSTEIN, J. S., INGRAM, M. A., HOLDER, E. J., and SMITH, R. N.: 'Adaptive subspace selection using subband decompositions for sensor array processing'. Proceedings of the IEEE international conference on Acoustics, speech, and signal processing (ICASSP), Adelaide, SA, April 19-22, 1994, 4, pp. 281-284 83 CARLSON, B. D.: 'Covariance matrix estimation errors and diagonal loading in adaptive arrays', IEEE Trans. Aerosp. Electron. Syst., July 1998, 24, (4), pp. 297-401
84 ANDERSON, S. J., ABRAMOVICH, Y. L, and FABRIZIO, G. A.: 'Stochastic constraints in nonstationary hot clutter cancellation'. Proceedings of the IEEE international conference on Acoustics, speech and signal processing, Munich, Germany, April 21-24, 1997, 5, pp. 3753-3756 85 ABRAMOVICH, Y. L, GOROKHOV, A. Y, and SPENCER, N. K.: 'Convergence analysis of stochastically-constrained spatial and spatio-temporal adaptive processing for hot-clutter mitigation'. Proceedings of Information, decision and control, Adelaide, SA, February 8-10, 1999, pp. 61-64 86 ABRAMOVICH, Y. I., SPENCER, N. K., and GOROKHOV, A. Y: 'Sample support analysis of stochastically constrained STAP with loaded sample matrix inversion'. Record of the IEEE international Radar conference, Alexandria, VA, May 7-12, 2000, pp. 804-808 87 ABRAMOVICH, Y L, SPENCER, N. K., ANDERSON, S. J., and GOROKHOV, A. Y : ' Stochastic-constraints method in nonstationary hot-clutter cancellation - Part I: fundamentals and supervised training applications', IEEE Trans. Aerosp. Electron. Syst., October 1998, 34, (4), pp. 1271-1292 88 ABRAMOVICH, Y I., SPENCER, N. K., and ANDERSON, S. J.: 'Stochasticconstraints method in nonstationary hot-clutter cancellation - Part II: unsupervised training applications', IEEE Trans. Aerosp. Electron. Syst., January 2000, 36,(1), pp. 132-150 89 KREITHEN, D. E. and STEINHARDT, A. 0.: 'Target detection in postSTAP undernulled clutter'. Record of the twenty-ninth Asilomar conference on Signals, systems and computers, Pacific Grove, CA, October 30-November 2, 1995, pp. 1203-1207 90 RABIDEAU, D. J.: 'Multidimensional sidelobe target editing with applications of terrain scattered jamming cancellations'. Proceedings of the ninth IEEE Statistical signal and array processing workshop, Portland, OR, September 14-16, 1998, pp. 252-255 91 RABIDEAU, D. J. and STEINHARDT, A. 0.: 'Improving the performance of adaptive arrays in nonstationary environments through data-adaptive training'. Record of the thirtieth Asilomar conference on Signals, systems and computers, Pacific Grove, CA, November 3-6, 1996,1, pp. 75-79 92 RABIDEAU, D. J.: 'Detached-bin space-time adaptive cancellation of terrain scattered jamming and clutter'. Proceedings of the IEEE 8th DSP workshop, Bryce Canyon, Utah, August 9-12, 1988,1, paper number 135 93 RABIDEAU, D. J.: 'Modulation of signals in rapidly-updated adaptive filters: theory, mitigation, and applications'. Record of the thirty-first Asilomar conference on Signals, systems and computers, Pacific Grove, CA, November 2-5, 1997, 2, pp. 1665-1669 94 RABIDEAU, D. J. and STEINHARDT, A. 0.: 'Improved adaptive clutter cancellation through data-adaptive training', IEEE Trans. Aerosp. Electron. Syst, July 1999, 35, (3), pp. 879-891 95 GABEL, R. A., KOGON, S. M., and RABIDEAU, D. J.: 'Algorithms for mitigating terrain-scattered interference', Electron. Commun. Eng. J., February 1999,11,(1), pp. 49-56
96 RABIDEAU, D. J.: 'Maximum likelihood training of adaptive beamformers in distributed interference'. Record of the thirty-fifth Asilomar conference on Signals, systems and computers, Pacific Grove, CA, November 4-7, 2001, 2, pp.1379-1384 97 RABIDEAU, D. J.: 'Multistage cancellation of terrain scattered jamming and conventional clutter'. Proceedings of the IEEE international conference on Acoustics, speech, and signal processing, Seattle, WA, May 12-15, 1998, 4, pp. 2001-2004 98 RABIDEAU, D. J.: 'Clutter and jammer multipath cancellation in airborne adaptive radar', IEEE Trans. Aerosp. Electron. Syst., April 2000,36, (2), pp. 565-583 99 RICHMOND, C. D.: 'Adaptive array processing in non-Gaussian environments'. Proceedings of the eighth IEEE Signal processing workshop on Statistical signal and array processing, Corfu, Greece, June 24-26, 1996, pp. 562-565 100 RICHMOND, C. D.: 'Exact pdfs for sample covariance based array processors with elliptically contoured data'. Proceedings of the IEEE international conference on Acoustics, speech, and signal processing (ICASSP), Atlanta, GA, May 7-10, 1996, 5, pp. 2849-2851 101 RICHMOND, C. D.: Analysis of an adaptive detection algorithm for nonhomogeneous environments'. Proceedings of the IEEE international conference on Acoustics, speech, and signal processing (ICASSP), Seattle, WA, May 12-15, 1998, 4, pp. 2005-2008 102 RICHMOND, C. D.: 'Statistical performance analysis of the adaptive sidelobe blanker detection algorithm'. Record of the thirty-first Asilomar conference on Signals, systems and computers, Pacific Grove, CA, November 2-5, 1997, 1, pp. 872-876 103 KREITHEN, D. E., PEARSON, C. R, and RICHMOND, C. D.: Adaptive sidelobe blanker: a novel method of performance evaluation and threshold setting in the presence of inhomogeneous clutter'. Record of the thirty-second Asilomar conference on Signals, systems and computers, Pacific Grove, CA, November 1-4, 1998,15 pp. 528-532 104 RICHMOND, C. D.: 'The theoretical performance of a class of space-time adaptive detection and training strategies for airborne radar'. Record of the thirty-second Asilomar conference on Signals, systems and computers, Pacific Grove, CA, November 1-4, 1998, 2, pp. 1327-1331 105 RICHMOND, C. D.: 'Statistics of adaptive nulling and modeling inhomogeneities in adaptive processing'. Proceedings of the ninth IEEE Signal processing workshop on Statistical signal and array processing, Portland, OR, September 14-16, 1998, pp. 65-68 106 RICHMOND, C. D.: 'Response of sample covariance based MVDR beamformer to imperfect look and inhomogeneities', IEEE Signal Process. Lett., December 1998,12, pp. 325-327 107 RICHMOND, C. D.: 'Performance of the adaptive sidelobe blanker detection algorithm in homogeneous environments', IEEE Trans. Signal Process., May 2000, 48, pp. 1235-1247
108 RICHMOND, C. D.: 'Performance of a class of adaptive detection algorithms in nonhomogeneous environments', IEEE Trans. Signal Process., May 2000, 48, pp. 1248-1262 109 RICHMOND, C D . : 4 Statistics of adaptive nulling and use of generalized eigenrelation (GER) for modeling inhomogeneities in adaptive processing', IEEE Trans. Signal Process., May 2000, 48, pp. 1263-1273 110 ZATMAN, M.: 'Production of adaptive array troughs by dispersion synthesis', Electron. Lett., December 7, 1995, 31, (25), pp. 2141-2142 111 ZATMAN, M.: 'Forwards-backwards averaging for adaptive beamforming and STAP'. Proceedings of the IEEE international conference on Acoustics, speech, and signal processing, Atlanta, GA, May 7-10, 1996, 5, pp. 2630-2633 112 ZATMAN, M.: 'Forward-backward averaging in the presence of array manifold errors', IEEE Trans. Antennas Propag., November 1998, 46, (11), pp. 1700-1704 113 ZATMAN, M. and BARANOSKI, E.: 'Time delay steering architectures for space-time adaptive processing'. Digest of the Antennas and Propagation Society international symposium, Montreal, Quebec, July 13-18, 1997, 4, pp. 2426-2429 114 KOGAN, S. M. and ZATMAN, M. A.: 'STAP adaptive weight training using phase and power selection criteria'. Record of the thirty-fifth Asilomar conference on Signals, systems and computers, Pacific Grove, CA, November 4-7, 2001,1, pp. 98-102 115 WARD, J.: 'Space-time adaptive processing for airborne radar'. Technical report 1015 - Lincoln Laboratory, Massachusetts Institute of Technology, Lexington, MA, December 1994 116 WARD, J.: 'Space-time adaptive processing for airborne radar'. Proceedings of the international conference on Acoustics, speech, and signal processing, Detroit, MI, May 9-12, 1995, 5, pp. 2809-2812 117 WARD, J.:'Space-time adaptive processing for airborne radar'. Proceedings of the IEE colloquium on Space-time adaptive processing, London, UK, April 6, 1998, 2, pp. 1-6 118 GOLDSTEIN, J. S., REED, I. S., ZULCH, P. A., and MELVIN, W. L.: 'A multistage STAP CFAR detection technique'. Proceedings of the IEEE Radar conference (RADARCON), Dallas, TX, May 11-14, 1998, pp. 111-116 119 GOLDSTEIN, J. S., REED, I. S., and ZULCH, P. A.: 'Multistage partially adaptive STAP CFAR detection algorithm', IEEE Trans. Aerosp. Electron. Syst., April 1999, 35, pp. 645-661 120 PUGH, M. L. and ZULCH, P. A.: 'RL-STAP algorithm development tool for analysis of advanced signal processing techniques'. Record of the twenty-ninth Asilomar conference on Signals, systems and computers, Pacific Grove, CA, October 30-November 1, 1995, 2, pp. 1178-1182 121 GENELLO, G. J., BALDYGO, W. J., and CALLAHAN, M. J.: 'Modeling and simulation for sensor craft multi-mission radar'. Proceedings of the IEEE Aerospace conference, Big Sky, MT, March 10-17, 2001, 2, pp. 741-748
122 GERLACH, K.: 'Implementation and convergence considerations of a linearly constrained adaptive array', IEEE Trans. Aerosp. Electron. Syst., March 1990, 26, pp. 263-272 123 GERLACH, K. and KRETSCHIMER, F. E.: 'Convergence properties of Gram-Schmidt and SMI adaptive algorithms', IEEE Trans. Aerosp. Electron. Syst, January 1990, 26, pp. 44-56 124 GERLACH, K. and KRETSCHMER, F. E.: 'Convergence properties of Gram-Schmidt and SMI adaptive algorithms: Part II', IEEE Trans. Aerosp. Electron. Syst., January 1991, 27, pp. 83-91 125 SANGSTON, K. J. and GERLACH, K. R.: 'Coherent detection of radar targets in a non-Gaussian background', IEEE Trans. Aerosp. Electron. Syst., April 1994, 30, pp. 330-340 126 GERLACH, K.: 'The effects of signal contamination on two adaptive detectors', IEEE Trans. Aerosp. Electron. Syst., January 1995, 31, pp.297-309 127 GERLACH, K.: 'Adaptive detection of range distributed targets', IEEE Trans. Signal Process., July 1999, 47, pp. 1844-1851 128 STEINER, M. and GERLACH, K.: 'Fast-converging maximum-likelihood interference cancellation'. Proceedings of the IEEE Radar conference, Dallas, TX, May 11-14, 1998, pp. 117-122 129 PICCIOLO, M. L. and GERLACH, K.: 'Fast converging robust adaptive arrays'. Proceedings of the 2001 IEEE Radar conference, Atlanta, GA, May 1-5, 2001, pp. 216-221 130 PICCIOLO, M. L., GERLACH, K., and GOLDSTEIN, J. S.: 'An adaptive multistage median cascaded canceller'. Proceedings of the IEEE Radar conference, Long Beach, CA, April 22-25, 2002, pp. 218-325 131 GERLACH, K. and STEINER, M. J.: 'Fast converging adaptive detection of Doppler-shifted, range-distributed targets', IEEE Trans. Signal Process., September 2000, 48, pp. 2686-2690 132 STEINER, M. and GERLACH, K.: 'Fast converging adaptive processor of a structured covariance matrix', IEEE Trans. Aerosp. Electron. Syst., October 2002, 36, pp. 1115-1126 133 GERLACH, K.: 'Outlier resistant adaptive matched filtering', IEEE Trans. Aerosp. Electron. Syst., July 2002, 38, pp. 885-901 134 GERLACH, K.: 'Spatially distributed target detection in non-Gaussian clutter', IEEE Trans. Aerosp. Electron. Syst., July 1999, 35, pp. 926-934 135 PILLAI, S. U., LEE, W. C , and GUERCI, J.: 'Multichannel space-time adaptive processing'. Record of the twenty-ninth Asilomar conference on Signals, systems and computers, Pacific Grove, CA, October 30-November 1, 1995, 2, pp. 1183-1186 136 GUERCI, J. R., PILLAI, S. U., and KIM, Y. L.: 'Efficient space-time adaptive processing for airborne MTI-mode radar'. Proceedings of the IEEE international conference on Acoustics, speech and signal processing, Atlanta, GA, May 7-10, 1996, 5, pp. 2781-2784
137 GUERCI, J. R. and FERIA, E. H.: 'Application of a least squares predictive transform modeling methodology to space-time adaptive array processing', IEEE Trans. Signal Process., July 1996, 44, pp. 1825-1833 138 PILLAI, S. U., GUERCI, J. R., andKIM, Y. L.: 'Generalized forward/backward subaperture smoothing techniques for sample starved STAP'. Proceedings of the IEEE international conference on Acoustics, speech, and signal processing, Seattle, WA, May 12-15, 4, pp. 2501-2504 139 PILLAI, S. U., KIM, Y. L., and GUERCI, J. R.: 'Generalized forward/backward subaperture smoothing techniques for sample starved STAP', IEEE Trans. Signal Process., December 2000, 48, (12), pp. 3569-3574 140 KIM, Y. L., PILLAI, S. U., and GUERCI, J. R.: 'Optimal loading factor for minimal sample support space-time adaptive radar'. Proceedings of the 1998 international conference on Acoustics, speech, and signal processing, Seattle, WA, May 12-15, 1998, 4, pp. 2505-2508 141 TECHAU, P. M., GUERCI, J. R., SLOCUMB, T. H., and GRIFFITHS, L. J.: 'Performance bounds for interference mitigation in radar systems'. Record of the IEEE Radar conference, Waltham, MA, April 20-22, 1999, pp. 12-17 142 TECHAU, P. M., GUERCI, J. R., SLOCUMB, T. H., and GRIFFITHS, L. J.: 'Performance bounds for hot and cold clutter mitigation', IEEE Trans. Aerosp. Electron. Syst, October 1999, 35, pp. 1253-1265 143 GUERCI, J. R., GOLDSTEIN, J. S., ZULCH, P. A., and REED, I. S.: 'Optimal reduced-rank 3D STAP for joint hot and cold clutter mitigation'. Record of the IEEE Radar conference, Waltham, MA, April 20-22, 1999, pp. 119-124 144 GUERCI, J. R., GOLDSTEIN, J. S., and REED, I. S.: 'Optimal and adaptive reduced-rank STAP', IEEE Trans. Aerosp. Electron. Syst., April 2000, 36, pp. 647-663 145 GUERCI, J. R.: 'Theory and application of covariance matrix tapers for robust adaptive beamforming', IEEE Trans. Signal Process., April 1999, 47, pp. 977-985 146 GUERCI, J. R. and BERGIN, J. S.: 'Rapid adaptation in subspace leakage environments via covariance matrix tapering'. Record of the thirty-fourth Asilomar conference on Signals, systems and computers, Pacific Grove, CA, October 29-November 1, 2000,1, pp. 283-286 147 GOLDSTEIN, J. S., GUERCI, J. R., and REED, I. S.: 'Advanced concepts in STAP'. Record of the IEEE international Radar conference, Alexandria, VA, May 7-12, 2000, pp. 699-704 148 TECHAU, P. M., BERGIN, J. S., and GUERCI, J. R.: 'Effects of internal clutter motion on STAP in a heterogeneous environment'. Proceedings of the IEEE Radar conference, Atlanta, GA, May 1-3, 2001, pp. 204-209 149 GUERCI, J. R. and BERGIN, J. S.: 'Principal components, covariance matrix tapers, and the subspace leakage problem', IEEE Trans. Aerosp. Electron. Syst, January 2002, 38, pp. 152-162 150 GRIFFITHS, L. J.: 'Linear constraints in hot clutter cancellation'. Proceedings of the IEEE international conference on Acoustics, speech, and signal processing (ICASSP), Atlanta, GA, May 7-10, 1996, 2, pp. 1181-1184
151 GRIFFITHS, L. J.: 'Multiple-pulse STAP adaptation prior to radar Doppler processing'. Record of the thirtieth Asilomar conference on Signals, systems and computers, Pacific Grove, CA, November 3-6, 1996,1, pp. 394-398 152 GRIFFITHS, L. J.: 'Linear constraints in pre-Doppler STAP processing'. Proceedings of the IEEE international conference on Acoustics, speech, and signal processing, Munich, Germany, April 21-24, 1997, 5, pp. 3481-3484 153 RANGASWAMY, M. and MICHELS, J. H.: 'A parametric multichannel detection algorithm for correlated non-Gaussian random processes'. Proceedings of the IEEE national Radar conference, Syracuse, NY, May 13-15, 1997, pp. 349-354 154 RANGASWAMY, M. and MICHELS, J. H.: 'Adaptive signal processing in non-Gaussian noise backgrounds'. Proceedings of the ninth Signal processing workshop on Statistical signal and array processing, Portland, OR, September 14-16, 1998, pp. 53-56 155 MICHELS, J. H., RANGASWAMY, M., and HIMED, B.: 'Performance of STAP tests in compound-Gaussian clutter'. Proceedings of the IEEE Sensor array and multichannel signal processing workshop, Cambridge, MA, March 16-17, 2000, pp. 250-255 156 MICHELS, J. H., HIMED, B., and RANGASWAMY, M.: 'Evaluation of the normalized parametric adaptive matched filter STAP test in airborne radar clutter'. Record of the IEEE international Radar conference, Alexandria, VA, May 7-12, 2000, pp. 169-11A 157 ROMAN, J. R., RANGASWAMY, M., DAVIS, D. W., ZHANG, Q., HIMED, B., and MICHELS, J. H.: 'Parametric adaptive matched filter for airborne radar applications', IEEE Trans. Aerosp. Electron. Syst, April 2000, 36, pp. 677-692 158 HIMED, B., SALAMA, Y, and MICHELS, J. H.: 'Improved detection of close proximity targets using two-step NHD'. Record of the IEEE international Radar conference, Alexandria, VA, May 7-12, 2000, pp. 781-786 159 RANGASWAMY, M., HIMED, B., and MICHELS, J. H.: 'Statistical analysis of the nonhomogeneity detector'. Proceedings of the thirty-fourth Asilomar conference on Signals, systems and computers, Pacific Grove, CA, October 29-November 1, 2000, 2, pp. 1117-1121 160 RANGASWAMY,M., HIMED, B., andMICHELS, J. H.: 'Performanceanalysis of the nonhomogeneity detector for STAP applications'. Proceedings of the IEEE Radar conference, Atlanta, GA, May 1-5, 2001, pp. 193-197 161 RANGASWAMY, M., MICHELS, J. H., and HIMED, B.: 'Statistical analysis of the nonhomogeneity detector for non-Gaussian interference backgrounds'. Proceedings of the IEEE Radar conference, Long Beach, CA, April 22-25, 2002, pp. 304-310 162 KIRSTEINS, I. P. and RANGASWAMY, M.: 'Approximate CFAR signal detection in strong low rank non-Gaussian interference'. Proceedings of the IEEE conference and exhibition OCEANS, Providence, RI, September 11-14, 2000, 2, pp. 1305-1308 163 KIRSTEINS, I. P. and RANGASWAMY, M.: 'Approximate CFAR signal detection in strong low rank non-Gaussian interference'. Proceedings of the tenth
164
165
166
167
168 169 170
171
172
173
174
175
176
IEEE Statistical signal and array processing workshop, Pocono Manor, PA, August 14-16, 2000, pp. 286-290 KIRSTEINS, I. P. and RANGASWAMY, M.: 'Approximate CFAR signal detection in strong low rank non-Gaussian interference'. Proceedings of the IEEE international conference on Acoustics, speech, and signal processing, Salt Lake City, UT, May 7-11, 2001, 5, pp. 2849-2852 KIRSTEINS, I. P. and RANGASWAMY, M.: 'Approximate CFAR signal detection in strong low rank non-Gaussian interference'. Proceedings of the IEEE Aerospace conference, Big Sky, MT, March 10-17, 2001, 4, pp. 1783-1790 HIMED, B. and MICHELS, J. H.: 'Performance analysis of the multi-stage Wiener filter'. Record of the IEEE international Radar conference, Alexandria, VA, May 7-12, 2000, pp. 729-734 TSAO, T., HIMED, B., and MICHELS, J. H.: 'Effects of interference rank estimation on the detection performance of rank reduced STAP algorithms'. Proceedings of the IEEE Radar conference, Dallas, TX, May 11-14, 1998, pp.147-152 HAIMOVICH, A. M. and BAR-NESS, Y: 'An eigenanalysis interference canceler', IEEE Trans. Signal Process., January 1991, 39, pp. 76-84 HAIMOVICH, A. M.: 'The eigencanceler: adaptive radar by eigenanalysis methods', IEEE Trans. Aerosp. Electron. Syst., April 1996, 32, pp. 532-542 HAIMOVICH, A. M., PUGH, M. L., and BERIN, M. O.: 'Training and signal cancellation in adaptive radar'. Proceedings of the IEEE national Radar conference, Ann Arbor, MI, May 13-16, 1996, pp. 124-129 HAIMOVICH, A. M. and BERIN, M.: 'Eigenanalysis-based space-time adaptive radar: performance analysis', IEEE Trans. Aerosp. Electron. Syst., October 1997, 33, pp. 1170-1179 CONTE, E., DE MAIO, A., and RICCI, G.: 'An adaptive matched filter detector for distributed targets in homogeneous environment'. Proceedings of the 2000 IEEE international conference on Acoustics, speech, and signal processing, Istanbul, Turkey, June 5-9, 2000,1, pp. 301-304 CONTE, E., DE MAIO, A., and RICCI, G.: 'Space-time adaptive detection of distributed targets in homogeneous environments'. Proceedings of the Adaptive sensor array processing workshop, Lexington, MA, March 14-15, 2000 CONTE, E., DE MAIO, A., and RICCI, G.: 'Space-time adaptive radar detection of distributed targets'. Record of the IEEE 2000 international Radar conference, Alexandria, VA, May 7-12, 2000, pp. 614-619 CONTE, E., DE MAIO, A., and RICCI, G.: 'GLRT-based adaptive detection algorithms for range-spread targets', IEEE Trans. Signal Process., July 2001, 49, (7), pp. 1336-1348 CONTE, E., DE MAIO, A., and RICCI, G.: 'Adaptive detection of distributed targets in partially-homogeneous environment'. Proceedings of the 2000 IEEE Sensor array and multichannel signal processing workshop, Cambridge, MA, March 16-17, 2000, pp. 129-133
177 CONTE5 E. and DE MAIO, A.: 'A CFAR detection of distributed targets in non-Gaussian disturbance', IEEE Trans. Aerosp. Electron. SySt., April 2002, 38, pp. 612-621 178 CONTE, E., LOPS, M. and RICCI, G.: 'Adaptive matched filter detection in spherically invariant noise', IEEE Signal Process. Lett., August 1996, 3, pp. 248-250 179 LOMBARDO, P., PASTINA, D., and BUCCIARELLI, T.: 6A multiband GLRTLQ algorithm for the coherent radar detection against compound-Gaussian clutter'. Proceedings of Radar 97, Edinburgh, UK, October 14-16, 1997, pp.576-580 180 GINI, R: 'A cumulant-based adaptive technique for coherent radar detection in a mixture of K-distributed clutter and Gaussian disturbance', IEEE Trans. Signal Process., June 1997, 45, pp. 1507-1519 181 GINI, R, GRECO, M. V., SANGSTON, K. J., and FARINA, A.: 'Coherent adaptive radar detection in non-Gaussian clutter'. Record of the thirty-first Asilomar conference on Signals, systems and computers, Pacific Grove, CA, November 2-5, 1997,1, pp. 225-259 182 CONTE, E., LOPS, M., and RICCI, G.: 'Adaptive detection schemes in compound-Gaussian clutter', IEEE Trans. Aerosp. Electron. SySt., October 1998, 34, pp. 1058-1069 183 GINI, F. and GRECO, M. V.: ' Suboptimum approach to adaptive coherent radar detection in compound-Gaussian clutter', IEEE Trans. Aerosp. Electron. Syst., July 1999, 35, pp. 1095-1104 184 GINI, R, GRECO, M. V., and FARINA, A.: 'Clairvoyant and adaptive signal detection in non-Gaussian clutter: a data dependent threshold interpretation', IEEE Trans. Signal Process., June 1999, 47, pp. 1522-1531 185 SANGSTON, K. J., GINI, R, GRECO, M. V., and FARINA, A.: 'Structures for radar detection in compound Gaussian clutter', IEEE Trans. Aerosp. Electron. Syst, April 1999, 35, pp. 445-458 186 GINI, F. and MICHELS, J. H.: 'Performance analysis of two covariance matrix estimators in compound-Gaussian clutter', IEE Proc, Radar Sonar Navig., June 1999,146, pp. 133-140 187 GINI, R, GRECO, M. V, GIANI, M., and VERRAZZANI, L.: 'Performance analysis of two adaptive radar detectors against non-Gaussian real sea clutter data', IEEE Trans. Aerosp. Electron. Syst, October 2000, 36, pp.1429-1439 188 CONTE, E., LOPS, M., and RICCI, G.: 'Asymptotically optimum radar detection in compound-Gaussian clutter', IEEE Trans. Aerosp. Electron. Syst, April 1995, 31, (2), pp. 617-625 189 GINI, R, GRECO, M. V., FARINA, A., and LOMBARDO, R: 'Optimum and mismatched detection against A'-distributed plus Gaussian clutter', IEEE Trans. Aerosp. Electron. Syst, July 1998, 34, pp. 860-876 190 GINI, R, GRECO, M. V., FARINA, A., and LOMBARDO, P.: 'Note on optimum and mismatched detection against ^-distributed plus Gaussian clutter', IEEE Trans. Aerosp. Electron. Syst, January 2001, 37, pp. 296-297
191 GRECO, M., GINI, R, FARINA, A., and BILLINGSLEY, J. B.: 'Validation of windblown radar ground clutter spectral shape', IEEE Trans. Aerosp. Electron. Syst9 April 2001, 37, pp. 538-548 192 LOMBARDO, R, GRECO, M., GINI, R, FARINA, A., and BILLINGSLEY, J. B.: 'Impact of clutter spectra on radar performance prediction', IEEE Trans. Aerosp. Electron. Syst., July 2001, 37, pp. 1022-1038 193 GRECO, M., GINI, R, and DIANI, M.: 'Robust CFAR detection of random signals in compound-Gaussian clutter plus thermal noise', IEE Proc, Radar Sonar Navig., August 2001,148, pp. 227-232 194 GRECO, M., GINI, R, FARINA, A., and BILLINGSLEY, J. B.: 'Analysis of clutter cancellation in the presence of measured L-band radar ground clutter data'. Record of the IEEE international Radar conference, Alexandria, VA, May 7-12, 2000, pp. 422-427 195 McDONALD, K. R and BLUM, R. S.: 'Analytical analysis of STAP algorithms for cases with mismatched steering and clutter statistics'. Record of the IEEE Radar conference, Waltham, MA, April 20-22, 1999, pp. 267-272 196 BLUM, R. S. and McDONALD, K. R: 'Analysis of STAP algorithms for cases with mismatched steering and clutter statistics', IEEE Trans. Signal Process., February 2000, 48, pp. 301-310 197 McDONALD, K. P. and BLUM, R. S.: 'Performance characterization of STAP algorithms with mismatched steering and clutter statistics'. Record of the thirtyfourth Asilomar conference on Signals, systems and computers, Pacific Grove, CA, October 29-November 1, 2000,1, pp. 646-650 198 McDONALD, K. P. and BLUM, R. S.: 'Exact performance of STAP algorithms with mismatched steering and clutter statistics', IEEE Trans. Signal Process., October 2000, 48, (10), pp. 2750-2763 199 McDONALD, K. P. and BLUM, R. S.: 'Performance characterization of spacetime adaptive processing algorithms for distributed target detection in non-ideal environments'. Proceedings of the IEEE Radar conference, Long Beach, CA, April 22-25, 2002, pp. 298-303 200 McDONALD, K. P.: 'Models and analysis of antenna array signal processing systems'. Ph.D. dissertation, Lehigh University, Bethlehem, PA, 2000 201 BRIEMIE, E.: 'Aspects of adaptive beamforming with an AESA radar with subarray architecture'. Proceedings of the IEE colloquium on Electronic beam steering, London, UK, October 28, 1998, 4, pp. 1-9 202 BABU, B. N. S., TORRES, J. A., and GUELLA, T. P.: 'Impact of nearfield scattering on multichannel airborne radar measurements (MCARM)'. Proceedings of the IEEE national Radar conference, Syracuse, NY, May 13-15, 1997, pp. 227-231 203 BARILE, E. C , FANTE, R. L., GUELLA, T. P., and TORRES, J. A.: 'Performance of space-time adaptive airborne radar'. Record of the IEEE national Radar conference, Lynnfield, MA, April 20-22, 1993, pp. 173-175 204 BARILE, E. C , FANTE, R. L., and TORRES, J. A.: 'Some limitations on the effectiveness of airborne adaptive radar', IEEE Trans. Aerosp. Electron. Syst., October 1992, 28, pp. 1015-1032
205 FANTE, R. L., BARILE, E. C , andGUELLA, R. P.: 'Cluttercovariance smoothing by subaperture averaging', IEEE Trans. Aerosp. Electron. Syst, July 1994, 30, pp. 941-945 206 FANTE, R. L.: 'Cancellation of specular and diffuse jammer multipath using a hybrid adaptive array'. Proceedings of the IEEE national Radar conference, Los Angeles, CA, March 12-13, 1991, pp. 54-57 207 FANTE, R. L.: 'Cancellation of specular and diffuse jammer multipath using a hybrid adaptive array', IEEE Trans. Aerosp. Electron. SySt., September 1991, 27, pp. 823-837 208 FANTE, R. L. and TORRES, J. A.: 'Cancellation of diffuse jammer multipath by an airborne adaptive radar', IEEE Trans. Aerosp. Electron. Syst, April 1995, 31, pp. 805-820 209 DAVIS, R. M., FANTE, R. L., CROSBY, W. J., and BALLA, R. J.: 'A maximum likelihood beamspace processor for improved search and track'. Record of the IEEE international Radar conference, Alexandria, VA, May 7-12, 2000, pp. 590-595 210 DAVIS, R. M., FANTE, R. L., CROSBY, W. J., andBALLA, R. J.: 'Amaximum likelihood beamspace processor for improved search and track', IEEE Trans. Antennas Propag, July 2001, 49, pp. 1043-1053 211 DiPIETRO, R. C.: 'Extended factored space-time processing for airborne radar systems'. Record of the twenty-sixth Asilomar conference on Signals, systems and computers, Pacific Grove, CA, October 26-28, 1992, pp. 425-^30 212 SCHARF, L. L. and McWHORTER, L. T.: 'Adaptive matched subspace detectors and adaptive coherence estimators'. Proceedings of the 1996 thirtieth Asilomar conference on Signals, systems and computers, Pacific Grove, CA, November 3-6, 1996, l,pp. 1114-1117 213 McWHORTER, L. T., SCHARF, L. L., and GRIFFITHS, L. J.: 'Adaptive coherence estimation for radar signal processing'. Record of the thirtieth Asilomar conference on Signals, systems and computers, Pacific Grove, CA, November 3-6, 1996,1, pp. 536-540 214 KRAUT, S. and SCHARF, L. L.: 'The CFAR adaptive subspace detector is a scale-invariant GLRT'. Proceedings of the ninth IEEE Signal processing workshop on Statistical signal and array processing, Portland, OR, September 14-16, 1998, pp. 57-60 215 KRAUT, S. and SCHARF, L. L.: 'The CFAR adaptive subspace detector is a scale-invariant GLRT', IEEE Trans. Signal Process., September 1999, 47, pp.2538-2541 216 KRAUT, S., McWHORTER, L. T., and SCHARF, L. L.: 'A canonical representation for distributions of adaptive matched subspace detectors'. Record of the thirty-first Asilomar conference on Signals, systems and computers, Pacific Grove, CA, November 2-5, 1997, 2, pp. 1331-1335 217 SCHARF, L. L., KRAUT, S., and McCLOUD, M. L.: 'A review of matched and adaptive subspace detectors'. Proceedings of the IEEE Adaptive systems for signal processing, communications, and control symposium (AS-SPCC), Lake Louise, Alta., Canada, October 1-4, 2000, pp. 82-86
218 KRAUT, S., SCHARF, L. L., and McWHORTER, L. T.: 'Adaptive subspace detectors', IEEE Trans. Signal Process., January 2001, 49, pp. 1-16 219 SCHARF, L. L.: 'Statistical signal processing: detection, estimation, and time series analysis' (Addison-Wesley Publishing Company, Massachusetts, 1990) 220 KASSAM, S. A. and POOR, H. V.: 'Robust techniques for signal processing: a survey', Proc. IEEE, March 1985, 73, (3), pp. 433-481 221 BREBURGER, B. E. and TUFTS, D. W.: 'Rapidly adaptive signal detection using the principal components inverse (PCI) method'. Record of the thirty-first Asilomar conference on Signals, systems and computers, Pacific Grove, CA, November 2-5, 1997,1, pp. 765-769 222 FREBURGER, B. E. and TUFTS, D. W.: 'Case study of principal component inverse and cross spectral metric for low rank interference adaptation'. Proceedings of the IEEE international conference on Acoustics, speech, and signal processing, Seattle, WA, May 12-15, 1998, 4, pp. 1977-1980 223 KRAUT, S. and KROLIK, J.: 'Applications of maximal invariance to the ACE detection problem'. Proceedings of the thirty-fourth Asilomar conference on Signals, systems and computers, Pacific Grove, CA, October 29-November 1, 2000, pp. 417-420 224 LIN, X. and BLUM, R. S.: 'Robust STAP algorithms using prior knowledge for airborne radar applications', Signal Process., 1999, 79, pp. 273-287 225 WARD, J. and STEINHARDT, A. O.: 'Multiwindow post-Doppler spacetime adaptive processing'. Proceedings of the IEEE seventh Signal processing workshop on Statistical signal and array processing, June 1994, pp. 26-29 226 BOSE, S. and STEINHARDT, A. O.: 'A maximal invariant framework for adaptive detection with arrays'. Proceedings of the IEEE international conference on Acoustics, speech, and signal processing (ICASSP), San Francisco, CA, March 23-26, 1992, 5, pp. 357-360 227 BOSE, S. and STEINHARDT, A. O.: 'A maximal invariant framework for adaptive detection with structured and unstructured covariance matrices', IEEE Trans. Signal Process., September 1995, 43, (9), pp. 2164-2175 228 BOSE, S. and STEINHARDT, A. O.: 'Invariant tests for spatial stationarity using covariance structure'. Proceedings of the IEEE international conference on Acoustics, speech, and signal processing, Minneapolis, MN, April 27-30, 1993, 4, pp. 41-44 229 BOSE, S. and STEINHARDT, A. O.: 'Invariant tests for spatial stationarity using covariance structure'. Proceedings of the IEEE seventh Signal processing workshop on Statistical signal and array processing, June 26-29, 1994, pp. 101-104 230 BOSE, S. and STEINHARDT, A. O.: 'Invariant tests for spatial stationarity using covariance structure', IEEE Trans. Signal Process., June 1996, 44, pp. 1523-1533 231 BOSE, S. and STEINHARDT, A. O.: 'Optimum array detector for a weak signal in unknown noise', IEEE Trans. Aerosp. Electron. SySt., July 1996, 32, (3), pp. 911-922
Next Page
232 BOSE, S. and STEINHARDT, A. 0.: 'Adaptive array detection of uncertain rank one waveforms', IEEE Trans. Signal Process., November 1996, 44, (11), pp. 2801-2809 233 NICKEL, U.: 'On the influence of channel errors on array signal processing methods'. Proceedings of the AEU, 1993, 47, (4), pp. 209-219
embedded power of TSD, dB
Previous Page
direction of arrival from target boresight, deg
trial number
Plate 1 Asymptotic SINR loss due to target-like signal corrupting covariance matrix estimate ([13], © 2001 IEEE)
velocity, m/s
Plate 2
SINR loss for each individual Poisson trial when using finite training data corrupted by TSD
trial number
velocity, m/s
Plate 3
SINR loss using homogeneous clutter-plus-noise training data (no TSD), 200 different Poisson trials
Doppler, Hz Doppler, Hz
angle, degrees
angle, degrees
Plate 4
Comparison ofactual (top) and simulated (bottom) MCARMMVDR spectra
Doppler, Hz
slant range, km
Plate 5
Range-Doppler map and sum-channel power versus range curve for MCARM575 data
Plate 6
DelMarVa site-specific radar cross section map including grazing angle dependence ([30], O 2003 IEEE)
velocity, m/s
site specific model
slant range, km
Plate 7 Site-specific synthetic range-Doppler velocity map ([30], © 2003 IEEE)
velocity, m/s
bald earth model
slant range, km
Simulated range-Doppler velocity map using bald earth model ([30], © 2003 IEEE)
depth, m
Plate 8
range, km
Conventional range-depth ambiguity surface with vertical array for a source at 9.06 km and depth of 76 m. The colour scale is the cross correlation coefficient
depth, m
Plate 9
range, km
Plate 10
Conventional range-depth ambiguity surface of the single-element preDoppler MFPfor source at 76 m towed at 2.5 m/s from 9.18 km. The colour scale is the cross correlation coefficient
relative speed, m/s
range, km
Plate 11 Conventional range-speed ambiguity surface of the single-element preDoppler MFPfor source at 76 m towed at 2.5 m/s from 9.18 km. The colour scale is the cross correlation coefficient
own-ship noise
bottom bounce
Hap 'innunzB
Sap 'irjnuiize time, s
time, s
target
responses at 10° azimuth
SP 'I3A3J
Sap 'mnunzB
own-ship bottom bounce target
time, s
Plate 12
time, s
Single-Line BTRs of each signal component are shown in the top two and the bottom-left panels. The beam-time responses at 10° azimuth are shown in the bottom-right panel. The colour scales are in dB
bottom bounce
azimuth, deg
azimuth, deg
own-ship noise
Doppler, Hz
Doppler, Hz target
responses at 10° azimuth
level, dB
azimuth, deg
own-ship bottom bounce target
Doppler, Hz
Doppler, Hz
Plate 13
Single-Line Doppler-/Azimuth responses with a 256-sec integration time of each signal component are shown in the top two and the bottom-left panels. The Doppler responses at 10° azimuth are shown in the bottom-right panel. The colour scales are in dB adaptive plane-wave (10°)
time, min
time, min
conventional plane wave (10°)
Doppler, Hz
level, dB
time, min
Doppler, Hz
adaptive PW(I line) adaptive MFP(I line) adaptive MFP (4_line_vert) Doppler, Hz
Plate 14
time, min
Single-line spectrogramsfor the conventional plane wave and the adaptive plane-wave beamforming at 10° azimuth are shown in the top two panels. The single-line adaptive MFP track-cell spectrogram is shown in the bottom-left panel. The bottom-right panel shows the time histories of the peak responses over Doppler for single-line adaptive plane wave beamforming, single-line adaptive MFP, and 4-vertical-line adaptive MFP The colour scales are in dB
single line, adaptive MFP
time, min
time, min
single line, conventional MFP
range, km
range, km 4_line_vertical, adaptive MFP
time, min
time, min
4_line_sequential, adaptive MFP
range, km
range, km
Plate 15 Array-size dependence of MFP range tracking at target depth and target speed. Target was from 10 km moving in to 6.5 km after 25 min. The top two panels show the track-cell-gram for the single-line conventional MFP and the single-line adaptive MFP The bottom-left panel shows the 4-sequential-line adaptive MFP track-cell-gram. The bottom-right panel shows the 4-vertical-line adaptive MFP track-cell-gram. The colour scales are in dB depth = 60 m
time, min
time, min
depth= 10 m
range, km
range, km
depth= 180 m
time, min
time, min
depth = 90 m
range, km
range, km
Plate 16 Depth discrimination of 4-vertical-line adaptive MFP range tracking at target speed. The top two panels show MFP track-cell-grams at depths of 10 m and 60 m. The bottom two panels show MFP track-cell-grams at depths of 90 m and 180 m. Target was at 90 m in depth and was from 10 km moving in to 6.5 km after 25 min. The colour scales are in dB
speed = 1 m/s
time, min
time, min
speed = 3 m/s
range, km
range, km
speed=—3 m/s
time, min
time, min
speed = - 1 m/s
range, km
Plate 17
range, km
Speed discrimination of 4-vertical-line adaptive MFP range tracking at target depth. The top two panels show MFP track-cell-grams at speeds of 3 m/s and 1 m/s moving toward the array. The bottom two panels show MFP track-cell-grams at speeds ofl m/s and 3 m/s moving away from the array. Target was moving toward the array at 3 m/s and was from 10 km moving in to 6.5 km after 25 min. The colour scales are in dB jammer
cost/!,
Plate 18
Space-frequency characteristic of reverberation and illustration of signal processing. BF = beamformer, DFT= discrete Fourier transform, B ~ ^/Tpuhe- The green curves in plot A and B correspond to the output of the ideal processing with no jammer present. The blue curves in plot C and D refer to the output of the realistic processing with no jammer present. The read lines in all plots correspond to a situation with a jammer
pixel
test data
pixel transformed data (radon domain)
filtered data (radon domain)
0 (degrees)
6 (degrees)
pixel
filtered data
pixel
Plate 19
Illustration of echogram image enhancement. First row: test data image, second row/left side: transformed test data image (Radon domain), second row/right side: filtered transformed test data image (Radon domain), third row: filtered test data image
frequency, Hz
frequency, Hz
distance, m
distance, m
Dependence of Doppler spectrum on range; left side: conventional processing, right side: fully adaptive STAP
Plate 21
Dependence of Doppler spectrum on direction; left side: conventional processing, right side: fully adaptive STAP
frequency, Hz
Plate 20
frequency, Hz
spectrum stave 20, data piece 48, ping 15, 512 FFT
distance, m
Plate 22
Spectrum of a single sensor signal of VLFACTAS
frequency, Hz
frequency, Hz
distance, m
Left side: conventional analysis of sea data; right side: post-Doppler adaptive beamforming of MFACTAS data. In both figures a target can be seen at a distance of about 100Om and a frequency of about 5340Hz
frequency, Hz
frequency, Hz
Plate 23
distance, m
Plate 24
distance, m
distance, m
Left side: conventional analysis of sea data; right side: post-Doppler adaptive beamforming of VLFA CTAS data. The strong signal at afrequency of about 990Hz and a distance of about 12 km is due to a transponder at the target position. This transponder signal is suppressed on the righthand side because the window length M is adapted to the pulse length, see Section 19.2.2
range, kyd
beam number
Original echogram image with target at about 21.5 kyd and beam 8 to beam 15
range, kyd
Plate 25
beam number
Plate 26
Filtered echogram image of Figure 19.18
range, kyd
range, kyd
test statistic
ping number
Left side: echogram of one beam with about 20 sounding periods; right side: test statistic of the proposed detection scheme
range, kyd
Plate 27
beam number
Plate 28
Test statistic of all beams
velocity, m/s
depth, m
location, m
Plate 29
Synthetic data example, diffraction events have not been modelled a near-offset section extracted from the prestack data b common midpoint gather for midpoint location 2000 m
location, km
time, s
time, s
location, km
Plate 30
location, km
time, s
time, s
location, km
Synthetic data example a coherence section b emergence angle section c section of the radius of curvature 7?NIP — ^NIP d section of curvature K^ associated with stacked section shown in Figure 21.6
velocity, m/s
depth, m depth, m
velocity, m/s
location, m
depth, m
velocity, m/s
location, m
location, m
Plate 31
Inversion of the CRS wavefield attributes a reconstructed smooth model with reconstructed normal rays b reconstructed smooth model with reconstructed reflector elements c reconstructed reflector elements superimposed onto the true blocky model
Section B
Miscellaneous space-time processing applications
Part V
Ground target tracking with STAP radar
Chapter 14
Ground target tracking with STAP radar: the sensor Richard Klemm
14.1
Introduction
In this chapter we deal with the problem of tracking moving objects on or close to the ground by means of an airborne or space-based radar. A typical airborne radar geometry is assumed in the numerical examples. However, the results and insights apply as well to spaceborne geometries. Specifically, we focus on the sensor aspects, i.e. we discuss what kind of information a suitable radar can contribute to the tracking function. The material contained in this chapter has been presented at a RTO SET symposium [H]. Ground targets have to compete with strong ground clutter returns. Therefore, clutter rejection is an indispensable feature of the tracking radar. If the radar is carried by a moving platform (e.g. airplane, satellite) the clutter Doppler spectrum is spread out proportional to the radar velocity. Therefore, low Doppler targets may fall into the clutter band which makes them difficult to detect by conventional (temporal only) MTI techniques. As is well known from the existing literature (see for instance the list of references given in Reference 6) space-time adaptive processing (STAP) techniques can solve the problem of detecting slow moving targets within the clutter Doppler band. STAP processors implicitly compensate for the radar platform motion. The principle of STAP including a comparison with conventional (temporal or spatial only) techniques is illustrated in Reference 6, p. 111. For the subsequent calculations, if not denoted otherwise, a notional radar characterised by the parameters listed in Table 14.1 has been assumed. 14.2
Properties of the STAP radar sensor
In this section we briefly summarise some properties of STAP radar as far as required for the subsequent analysis.
Table 14.1 Radar parameters Platform velocity Range Number of elements Wavelength Element spacing Number of processed echoes ^^Nyquist Clutter-to-noise ratio, single element, single pulse Signal-to-noise ratio, single element, single pulse ASEP, spatial dimension ASEP, temporal dimension
Up = 240 m/s 10 km N = 24 X = 0.03 m X/2 (Nyquist) 24 32000Hz 20 dB — 10 dB K = 5 L= 5
14.2.1 Processing techniques 14.2.1.1 The optimum STAP processor [6, Chapter 4] Let us assume an array of Af receive elements and a transmit antenna which transmits M coherent pulses. The (N x 1) vector xm includes the echo signals received by the individual receive elements due to the mth transmitted pulse. Let us introduce vector notation for the following quantities:
(14.1) where c, s, n and j denote the space-time vectors of clutter, target signal, noise and jamming, x is the vector of actual received data which may contain clutter, noise, jamming and a target signal. The optimum processor is given by: w opt = / Q - 1 S = JzD + - 1 D- 1 S
(14.2)
where
(14.3)
is the space-time eovariance matrix of clutter, noise and jamming and s is the space-time signal replica (or matched filter). Since Q is positive definite it can be factorised into two matrices, D* and D. The term D*" 1 in equation (14.2) whitens clutter and jamming in space and time. The term D" 1 equalises the matched filter s for the distortion of a received target signal in x. The efficiency of any processor can be judged by calculating the improvement factor (IF) which becomes for the optimum processor: (14.4) In Figures 14.1 and 14.2 the optimum improvement factor has been depicted for a sidelooking (sensors aligned with flight path) and a forward-looking (sensor arranged in across-flight direction) linear array versus the look angle cp and the normalised Doppler frequency F (clutter + noise, no jamming). As can be seen, there is a clutter trench along the diagonal for the side-looking (SL) and a semicircular trench for the forwardlooking (FL) case. In a certain look direction one obtains a cross section through such a plot which results in an IF as shown by the upper curve of Figure 14.3. Here 0 dB means the theoretical optimum IF, i.e. the maximum achievable IF determined by the receiver noise limitation. It is obvious that the optimum curve reaches the theoretical limit everywhere, except for a narrow clutter notch. Slow targets show up close to the
Figure 14.1 Improvement factor (side-looking array)
versus normalised Doppler and azimuth
Figure 14.2
Improvementfactor versus normalised Doppler and azimuth (forwardlooking array)
clutter notch. Therefore, the width of the clutter notch determines the detectability of a slow target and is, therefore, referred to as minimum detectable velocity (MDV). Since the covariance matrix Q includes the spatial and temporal characteristics of the interference, all processing schemes derived from equation (14.12) are adaptive. For large numbers of sensors TV and radar pulses M the optimum processor cannot be realised for several reasons (computational load, arithmetic accuracy and lack in training data support). However, it may serve as a reference for judging the interference suppression performance of suboptimum architectures with real-time processing capability. 14.2.1.2 The potential of STAP In Figure 14.3 we find a comparison between optimum STAP processing, conventional (beamforming plus temporal only) MTI processing, and just beamforming plus Doppler filtering (no interference suppression). The three curves demonstrate clearly the potential of STAR At the clutter Doppler frequency associated with the look direction (here 45°) we find the clutter notch which is very narrow for the optimum processor, but much broader for one-dimensional processors. There is also considerable sidelobe ripple in the passbands of the one-dimensional processors.
optimum processing beamformer adaptive temporal filter beamformer plus Doppler filter
Figure 14.3
The potential of space-time adaptive processing
14.2.1.3 Subspace STAP processing A wide class of suboptimum processing architectures can be derived from the optimum processor by applying a certain linear transform T to the received data. Such transforms may be space-time by nature [6, Chapters 5, 9] or spatial only [6, Chapter 6]. Then we obtain for the various vector quantities: c T = T*c,
DT = T*n,
sT = T*s,
j T - T*j,
x T - T*x,
Q x = T*QT (14.5)
and the improvement factor becomes in the transformed domain:
(14.6)
The transform matrix T determines the architecture of the individual suboptimum processor. The ASEP (auxiliary sensor/echo processor [6, pp. 278-282]) uses a symmetric sidelobe canceller architecture in both the spatial and temporal dimension. The
transform is given by:
(14.7)
where Wf\ • • • WfM are weights of a Doppler filter at frequency / . The spatial submatrices have the form:
(14.8)
where b\ • • • bs are beamformer weights. The clutter suppression performance of a processor based on the transform of equations (14.7) and (14.8) W1 = Q^1ST
(14.9)
approximates the optimum processor of equation (14.2) very closely, and the computational burden is strongly reduced. The results presented below will be based on this processor.
14.2.2 A rray properties Linear equispaced arrays are particularly useful for the design of suboptimum processing architectures because they can be subdivided into subarrays with uniform shape so that the clutter spectra received by all the subarrays are identical except for a spatial phase due to the subarray displacement. Subtracting, for instance, two identical spectra from each other results in an effective clutter suppression. For more details see Reference 6, Chapter 6, and Reference 8.
14.2.2.1 Side-looking linear array Side-looking arrays are being used for surveillance radar applications (e.g. Joint-STARS, [3]; SOSTAR [5]) and synthetic aperture radar (SAR). A side-looking linear array has a particular aptitude for STAP: the clutter Doppler frequency is range independent. This has a number of advantages pertaining to the realisation of an adaptive GMTI system: 1 STAP filtering is range independent 2 range-ambiguous clutter returns have all the same Doppler frequency and, therefore, produce only one clutter notch 3 there are enough HD data samples to train the adaptive STAP algorithm. Of course, the clutter Doppler is independent of range only as long as the array is aligned with the flight path. If the radar carrier is crabbing to some extent, for instance due to cross-wind drift, some range dependence will occur, and the aforementioned advantages are no longer valid. For more details see References 6 and 10. 14.2.2.2 Forward-looking linear array A typical application of forward-looking arrays is the nose radar of a fighter aircraft (e.g. AMSAR [I]). All the advantageous properties of the side-looking array mentioned above do not apply for the forward-looking array. 14.2.2.3 Omnidirectional arrays The use of omnidirectional arrays has been suggested in Chapter 5 [8] of this volume for use with airborne surveillance radar. Omnidirectional arrays are based on omnidirectional dipole elements as have been used in the crow's nest antenna [18]. 14.2.2.4 Sensor directivity patterns In the following calculations we assume that the array elements have directivity patterns of the form: D((p) = 0.5(1 + cos(2((? -
(14.10)
D(O) = 0.5(1 + cos(2(<9 - #L)))
(14.11)
where the angles #?L and #L denote the look direction of the array element (across-flight for side-looking, in-flight for forward-looking arrays).
14.2.3 Summary of the data output provided by the STAP radar During the tracking process the radar provides a number of data to the tracker: (i) Detection probability The detection probability is associated with the signalto-noise-plus-interference ratio at the output of the radar. (ii) Estimates of the angular target direction and velocity We assume that the radar produces bias-free estimates of these quantities.
(iii)
(iv)
Variances of the target parameters It is assumed that the parameter estimates given by the radar are optimum. Therefore, the Cramer-Rao bounds are used as variances, Variance of the range estimate It is assumed that the variance of the range interval is defined by the response of the conventional matched filter. The standard deviation of the range estimate then becomes:
where Br is the relative system bandwidth.
14.3
The scenario
We will now consider a radar target scenario as depicted in Figure 14.4. The radar platform moves at the height hp in the x-direction at speed vp. The target is supposed to move on a straight line at zero height in the direction cpc (course angle). It is further assumed that the radar revisits the target in uniform intervals.
14.3.1 SNIR and Pd of a moving target Let us assume that the course angle is 0, i.e. the target and the radar are moving on parallel straight paths, see Figure 14.5. Furthermore, we assume a linear side-looking array. Figure 14.6 shows the target (thick) and the radial clutter velocities at the target position. At the beginning both velocities are positive which reflects the dominating
target positions O=O)
radar positions
Figure 14.4
The tracking scenario (perspective view)
target positions
radar positions
Figure 14.5
Radar and target on parallel flight paths (top view)
revisits
Figure 14.6
Radial target (thick) and clutter velocities versus revisit time (target velocity lOm/s)
revisits
Figure 14.7
Radial target (thick) and clutter velocities versus revisit time (target velocity 100 m/s)
impact of the approaching radar carrier. The radial target velocity is slightly lower than the clutter velocity, because the target moves in the same direction at t>t = 10 m/s. After about 75 revisits we notice that the velocity curves are crossing each other. This is the point where the radar overtakes the target. At this moment the radial target velocity is zero. In Figure 14.7 a faster target (vt = 100 m/s) has been assumed. Accordingly, the point, where the radar overtakes the target and the radial velocity is zero, occurs at a later time. By comparing Figures 14.6 and 14.7 we find that for the slow target the velocity curve runs very closely to the clutter velocity curve whereas the faster target produces a velocity curve that is much better separated from the clutter curve. We find for the slow target a sliding intersection whereas the higher target velocity in Figure 14.7 results in a very distinct intersection. Let us now consider the associated signal-to-noise-plus-interference ratio (SNIR) curves shown in Figure 14.8. The upper row shows the slow target (a for white noise and b for clutter plus noise) and the lower row shows the SNIR for the 100 m/s target. The left column shows the SNIR for the clutter-free case, the right for clutter plus noise. There are two reasons for the maximum at the point where the aircraft overtakes the target (Figures 14.8a and c). First, this is the point of closest approach, therefore, due to the radar range equation the signal power becomes maximum. Second, the
Figure 14.8
SNIR versus revisit time, with and without clutter a no clutter, vt = lOm/s b CNR = 2OdB, vt = lOm/s c no clutter, vt = lOOm/s d CNR = 2OdB, vt = lOOm/s
SNIR curve is modulated by the directivity patterns of the antenna elements, both on transmit and receive, whose maximum point is at 90° (across-flight). When the radial target velocity is zero the processor cannot distinguish between target and clutter. In this case, the target is suppressed which results in a clutter notch in the SNIR curve. For the faster target the clutter notch occurs at a later time. 14.3.1.1 Impact of SNR Figures 14.9 and 14.10 illustrate the effect of signal-to-noise ratio (SNR) on the SNIR and the associated probability of detection. The higher the SNR, the narrower the clutter notch becomes. In Figures 14.11 and 14.12 the Cramer-Rao bound for the estimates of angle and radial velocity has been plotted. It can be seen clearly that the standard deviation of the angle and velocity estimates decreases with increasing SNR. This means also that the sensor directivity patterns and the range law according to the radar equation lead to increased standard deviations of both estimates at the beginning and the end of the tracking scene (left and right in Figures 14.11 and 14.12). In the position of the clutter notch the CRB curves show a local maximum.
SNIR, dB
revisits
Figure 14.9
SNIR versus revisit time: impact of the signal-to-noise ratio. SNR/dB = o -10; * O; x 10
revisits
Figure 14.10
Pd versus revisit time. SNR/dB = o -10; * 0; x 10
standard deviation, azimuth, deg
revisits
CRB of azimuth estimate versus revisit time. SNR/dB = o —10; * 0; x 10
standard deviation, velocity, m/s
Figure 14.11
revisits
Figure 14.12
CRB of velocity estimate versus revisit time. SNRJdB = o —10; * 0; x 70
14.3.1.2 Target stop If the ground target stops it behaves like ground clutter and, therefore, is suppressed by the STAP processor. A numerical example is shown in Figures 14.13 and 14.14. The target stops between the revisits 95 and 105. The corresponding clutter notch can be seen clearly. Because of the rectangular shape of this notch the associated probability of detection is nearly independent of the SNR.
14.3.2 System aspects
SNIR, dB
14.3.2.1 System dimensions The width of the clutter notch is a measure for the detectability of slow targets. It depends on the system dimensions (N — number of array elements, M = number of coherent echoes). Larger spatial and temporal apertures result in higher resolution capability and, hence, in narrower clutter notches. This well known fact can be observed in Figure 14.15. On the other hand, the width of the clutter notch depends as well as on the geometry of the tracking scene. For instance, the larger the distance between radar and target, the broader the clutter notch becomes when observed during the revisits. In Figure 14.16 one can notice that (except for the different levels of SNIR) the clutter notches of all four curves have about the same width which indicates that for the given
revisits
Figure 14.13
SNIR versus revisit time. SNR/dB = o -10; * 0; x 10
revisits
Pd versus revisit time. SNR/dB = o —10; * O; x 10
SNIR, dB
Figure 14.14
azimuth, deg
Figure 14.15
Influence of the system dimensions, SNIR versus azimuth (SNR = OdB). N = M = o 6; * 12; x 24; + 48
SNIR, dB
revisits
Figure 14.16
Influence of the system dimensions, SNIR versus revisit time (SNR = OdB). N = M = o 6; * 12; x 24; + 48
scenario the clutter notch is determined by the varying target radar geometry rather than by the system dimensions Af and M. 14.3.2.2
Comparison of STAP with one-dimensional clutter suppression techniques In this section we discuss the advantages of space-time processing over conventional MTI techniques (beamformer plus adaptive temporal clutter filter; beamformer plus Doppler filter bank, i.e. no clutter filtering). The upper curve in Figure 14.17 shows the SNIR versus revisit time as achieved by the ASEP processor defined by equations (14.7) and (14.8). Again, the effects of the radar range equation and the sensor directivity pattern as well as the clutter notches due to tangential motion and target stop can be noticed. The two lower curves show the SNIR versus revisit time of a beamformer/adaptive temporal clutter filter (middle) and a beamformer cascaded with a Doppler filter bank (lower curve). Dramatic losses can be recognised which are reflected in serious losses in detection probability (Figure 14.18). The superiority of space-time clutter rejection over one-dimensional processing techniques is obvious. 14.3.2.3 Forward-looking array Forward-looking radar plays an important role in fighter aircraft. Compared with a side-looking array, the aperture is now turned by 90° and the directivity patterns of the array elements are directed in a forward direction.
ASEP, ,Y= M= 24, K=5,L=5 beamformer plus adaptive temporal filter beamformer plus Doppler filter
revisits
Figure 14.17
SNlR for suboptimum processing
In the scenario shown in Figure 14.5 a remarkable coincidence can be observed. At the time the radar passes the target the radial velocity is zero so that a clutter notch occurs. At the same time the target meets the null of the array element pattern which points in the flight direction. After the radar passed the target is in the back of the antenna and, therefore, invisible. Figure 14.19 shows a numerical example where the cut-off of the SNIR curves can be seen clearly. Let us consider an alternative scenario as shown in Figure 14.20. The aircraft is moving the same as before, however, the target is now crossing its way. At the time the target meets the x-axis the flight direction is perpendicular to the radar look direction so that a clutter notch can be expected. As can be seen in Figure 14.21 the SNIR increases as the radar approaches the target (left to right). This increase is caused by the range law only. There is almost no influence of the directivity pattern of the array elements because the target is in the main look direction of the radar. For a very slow target (a) the clutter notch is smeared out so that no real notch can be recognised. The faster the target is the more distinct the clutter notch becomes. At the end of the scene the radar carrier flies over the target so that subsequent echoes will arrive in the back of the antenna, the SNIR breaks off. The detection probability curves in Figure 14.22 reveal that a slow target with SNR = 0 dB will be detected relatively late because it spends all its time in the
Pd versus revisit time for suboptimum processing a ASEP processing b Beamformer plus adaptive temporal filter c Beamformer plus Doppler filter
SNIR, dB
Figure 14.18
revisits
Figure 14.19
SNIR versus revisit time. SNR/dB = o -10; * 0; x 10
radar positions y = 0,z = hp
target positions
y = D,z=0
Figure 14.20
Radar and target on crossing flight paths (top view)
Figure 14.21
SNIR for the scenario in Figure 14.20 (SNR = 0dB) a Vt = lOm/s b ut = 20m/s c Vt = 30m/s d ut = 120m/s
Figure 14.22
Pdfor the scenario in Figure 14.20 (SNR = OdB) a Vt = lOm/s b ut=20m/s c ut — 30m/s d V1 = 120 m/s
broad clutter notch. Faster targets appear outside the clutter notch and can be detected earlier. The clutter notch is reflected in the detection probability curves.
14.4
Degrading effects
In this section the influence of some radar parameters on the tracking performance of a STAP radar is analysed.
14.4.1 Bandwidth effects In all previous examples it was assumed that the radar is narrowband which means that the system bandwidth has no influence on the spatial resolution capability of the array and, hence, on the width of the clutter notch. Let us consider again a slow target (10 m/s). The velocity curve (thick line in Figure 14.6) runs very closely to the clutter notch (thin line in Figure 14.6). If the clutter is broadened due to the system bandwidth (see grey line in Figure 14.23) the target line will be partly or even totally submerged in the clutter notch. This effect is illustrated in Figure 14.24. The six curves have been calculated for different system
revisits
Figure 14.23
Radial target (black) and clutter (grey) velocities versus revisit time (target velocity lOm/s)
revisits
Figure 14.24
Impact of the system bandwidth (SNIR versus revisits, SL); relative bandwidth [%]; o 10, * 3, x 1,+0.3,DOJ, > 0.03
Figure 14.25
Impact of the system bandwidth (PD versus revisits, SL); relative bandwidth [%] a 1 b 0.3 c 0.1 d 0.03
bandwidths. As can be seen, the SNIR achieved by the STAP processor is lower the larger the bandwidth. It can further be noticed that the difference between the two upper curves (Br = 0.1% and 0.03%) is much smaller than the difference between the lower curves. This indicates that the clutter notch has become so narrow that the slow target shows up in the passband. Figure 14.25 shows the associated detection probabilities for the upper four curves.
14.4.2 Doppler ambiguities The choice of the PRF determines the amount of ambiguity. By choosing a low PRF range ambiguities can be avoided, however, by taking more Doppler ambiguities into account. As a consequence one encounters ambiguous clutter notches (blind velocities). If the PRF is smaller than half the clutter Doppler bandwidth, ambiguous clutter notches will show up inside the clutter Doppler band. Figure 14.26 shows SNIR curves for four different target velocities. If the target velocity is smaller than the unambiguous Doppler range no ambiguous clutter notches occur (a and b). For target velocities larger than the unambiguous Doppler range ambiguous clutter notches show up (c and d).
Figure 14.26
Impact of Doppler ambiguities (SNIR versus revisits, SL, PRF 2000Hz, N = 96); vtgt [m/s] a 5 b 20 c 60 d 120
This can be further illustrated by Figure 14.27. The velocities of a target and the primary clutter velocities are plotted as thick lines. The thin lines denote the ambiguous clutter notches. Notice that there are three intersections between the target line and the ambiguous clutter lines which correspond to the positions of the clutter notches in Figure 14.26c. Target lines for very slow targets (Figures 14.26a,b) would run between the ambiguous clutter lines without crossing them. In Figure 14.28 we consider the same scenario, however, for a radar with a forward-looking array antenna. As we know from Figure 14.19, the primary clutter notch coincides with the null in the directivity pattern of the array element. Therefore, the primary clutter notch should show up at the point where the curves break off. Nevertheless, if the target velocity is high enough, some ambiguous clutter notches can be seen in Figures 14.28c,d.
14.43
Range ambiguities
In the medium PRF mode both Doppler and range ambiguities occur. The clutter returns are then the superposition of several arrivals coming from different directions. For a side-looking array clutter returns are range independent, which means that all
revisits
Figure 14.27
Target and ambiguous clutter velocities (SL, PRF 2000Hz, vt = 60 m/s, N = 96)
Figure 14.28
Impact of Doppler ambiguities (SNIR versus revisits, FL, PRF 2000Hz); vt [m/s] a 5, b 20, c 60, d 120
arrivals contribute to the same clutter notch. For any other array configuration, e.g. a forward-looking array, the ambiguous arrivals exhibit different Doppler frequencies which results in additional clutter notches. Usually one finds two main clutter notches, the primary one and another one caused by all the ambiguous arrivals. Figure 14.29 shows the same scene as Figure 14.28, however with range ambiguities included. It can be noticed that both plots show roughly the same results. Obviously there is no additional degradation of the SNIR curves due to range ambiguities when the radar is used in a track mode. Let us try to explain this behaviour. The track can start only if a detection has been made, that means, a target has shown up in the passband of the STAP filter, i.e. outside of a clutter notch. From one revisit to the next both the target and the radar move a certain distance. Therefore, the radar target geometry changes slightly in angle, range and Doppler. The trajectory of the clutter notch can be considered to be a hyperplane in the three-dimensional (angle, range and Doppler) clutter cube. For the chosen scenarios the target course through the three-dimensional clutter volume was such that no crossing of a clutter notch happened. It is not clear at this time whether this behaviour occurs in all possible radar target scenes, or if there are certain scenarios where the target falls into an ambiguous clutter notch.
Figure 14.29
Doppler and range ambiguities (SNIR versus revisits, FL, PRF 2000Hz); vt [m/s] a 5 b 20 c 60 d 120
14.4.4 STAP radar under jamming conditions In this section we discuss the effect of jamming in addition to clutter. We assume standoff jammers radiating white noise on the radar; the scenario under consideration is shown in Figure 14.30. Up to four stationary jammers are located on a straight line parallel to the flight paths of radar and target. Let us first consider the effect of jamming only. In Figure 14.31 we find SNIR curves for different jammer configurations achieved by the ASEP processor used in the previous examples. Recall from Table 14.1 that a processor with five spatial and five temporal channels has been assumed. We notice jammer notches (1, 2 or 4 jammers) showing up clearly. The case of a very slow target (ut = lOm/s) is under consideration. The performance degrades considerably, however, if additional clutter is involved (Figure 14.32). If the jammer direction is well separated from the clutter notch (a) one finds two distinct notches, where the one close to revisit 50 is due to clutter. If the jammer direction comes closer to the direction associated with the clutter notch then both notches are amalgamated to one bigger clutter-plus-jammer notch (b). In Figure 14.31c we have the case of two jammers separated from the clutter notch. In the case of four jammers the processor runs out of degrees of freedom. Recall that a STAP processor with five antenna channels has only four spatial degrees of freedom (for broadband jammers only the spatial degrees of freedom are relevant). Therefore, it fails to cancel four jammers plus clutter so that the SNIR breaks down and the target will no longer be detected (see the corresponding P& curves in Figure 14.33).
stationary jammers
target positions
radar positions
Figure 14.30
Radar, target and jamming scenario (top view)
Figure 14.31 Jamming only (vt = Wm/s) a jammer #1, b jammer Wl, c jammers #1 & #3, d 4 jammers
Figure 14.32
Clutter and jamming, SNIR versus revisits (vt = lOm/s) a jammer #1, b jammer #2, c jammers #1 & #3, d 4 jammers
Figure 14.33
Clutter and jamming, Pd versus revisits (vt = Wm/s) a jammer #1 b jammer #2 c jammers #1 & #3 d 4 jammers
The SNIR curves in Figure 14.34 are based on a faster target (ut — 100 m/s). The jammer configurations are the same as in Figure 14.32. As can be seen there are no jammer notches visible. The reason is as follows: a jammer notch occurs only if the target and jammer directions coincide, i.e. if jammer, target and radar are roughly (within the resolution capability of the array) located on a straight line. In this case the jammer hits the radar main beam. If the target is very slow there is a high probability that this geometry occurs. For faster target motion it becomes much less likely to encounter this configuration. Therefore, in Figure 14.34 the jammers radiate only in the sidelobes of the array antenna. Notice that the results in Figure 14.34 are still based on STAP processing. Without adaptive processing the SNIR would be considerably lower so that targets would hardly be detected. Notice again the degradation in SNIR in Figure 14.34d due to a lack of spatial degrees of freedom of the STAP radar.
14.5
Issues in convoy tracking
Modern down-look radars have a GMTI mode for detecting moving targets and an imaging mode for target recognition. The imaging mode usually makes use of
Figure 14.34
Clutter and jamming, SNIR versus revisits (vt = lOOm/s) a jammer #1 b jammer #2 c jammers #1 & #3 d 4 jammers
the SAR principle to achieve high geometric resolution (state of the art: less than 10 x 10 cm [2]). With this kind of resolution the application is confined to selected targets or small scenes of interest. The purpose of the GMTI mode is to detect moving targets over a wide area and, therefore, it has to operate at a much lower resolution than the imaging mode. The range resolution can be determined by choosing the system bandwidth, and the azimuth resolution is given by the antenna beamwidth. For fine target location superresolution techniques such as monopulse may be applied. Such techniques have to incorporate interference suppression which leads to the concept of adaptive monopulse [14]. There is to our knowledge no information on convoy tracking available. In the following we develop a few preliminary ideas on this topic which might serve as a guideline for future in-depth research.
14.5.1
Convoy detection by range-only information
In the following considerations on convoy tracking we assume an HRR mode (high range resolution) which means that the members of a convoy can be resolved in range
Figure 14.35 Apparent target length during tracking; Course angle a 0° b 45° c 90° d 135° but not in azimuth. This requires that the range resolution cell is chosen to be equal to or smaller than the target size. The task is to decide on whether a moving convoy is contained in the observed scene or not, and to estimate the velocity vector of the target motion. Such a decision may initialise the imaging mode for finer analysis of the observed scene, for instance, for classification of the individual members of a convoy. Measurement of the apparent length of observed targets by HRR radar has been successfully proven [15]. Including the apparent target length in the tracking algorithm may result in a more precise estimate of the convoy length. In Figure 14.35 some histories of the target length are plotted. It is assumed that the convoy forms a straight line. The angle between the horizontal and the convoy is referred to as the course angle cpc, see Figure 14.4. The four sub figures show the apparent length measured by range-only measurements (full length = 100) during the tracking process for four different course angles. A maximum indicates that the convoy axis is aligned with the radar look direction whereas it becomes zero if the convoy is perpendicular to the look direction.
14.5.2
Convoy detection by azimuth variance analysis
Let us assume again a number of vehicles arranged in a straight line (convoy axis). As stated before, no length measurement can be made from range profiles if the convoy
axis is perpendicular to the radar look direction. However, in this case, the whole convoy shows up in a single range cell (assuming that the range resolution is chosen accordingly). The radar observes, therefore, a target extended in azimuth. The idea is to exploit this extension to detect a convoy by interpretation of the radar response. A localisation technique such as monopulse, which is based on a point target model, will produce a higher variance of the azimuth estimate for an extended target than for a point target. It can be expected, therefore, that the variance of the azimuth estimate runs like the length curves in Figure 14.35, however, upside down. It should be noted that in this procedure the mismatch between the radar sensor (monopulse angle estimator matched to a point target) and a target with geometrical extension is exploited. Therefore, the Cramer-Rao bound cannot be used as an indicator because it is based on perfect match between radar and target. Let us now consider two typical scenarios. 14.5.2.1 Convoy in march formation If the convoy is in march formation (see Figure 14.36) the maximum of the azimuth variance will occur at the time the convoy moves perpendicular to the radar look direction. At this point, the apparent azimuthal width observed by the radar is maximum. In a clutter-free zone this might give a significant increase of the azimuth variance which can be interpreted as 'convoy present'. If the scene is, however, buried in ground clutter this maximum occurs exactly in the position of the clutter notch because the radial target velocity is zero.
convoy positions
radar positions
Figure 14.36
Radar-target scenario (march formation)
It is well known from earlier publications (References 16 and 17, and in some detail in Reference 6, Chapter 14) that the variance has a maximum at the location of the clutter notch. It follows that for a convoy in march formation, according to Figure 14.36, the maximum of the azimuth variance due to the extension of a target coincides with the maximum due to the clutter notch. The increase in variance due to target extension will probably be masked by the increase due to clutter. 14.5.2.2 Convoy in echelon formation If the motion of the convoy is perpendicular to the convoy axis (see, for example, Figure 14.37), the radial target velocity is maximum when the convoy axis is perpendicular to the radar look direction. Therefore, for this scenario an increase in azimuthal variance due to the geometrical extension of a target can be expected to be separable from the clutter effects. Some preliminary calculations have shown that the variance of the adaptive monopulse due to an extended target will be significant only if the radar can resolve the vehicles in the convoy. 14.5.2.3 Matching the response of extended targets The well known conventional beamformer is matched to a plane wave radiated from a point source in the far field. Instead, one might design a generalised beamformer which is matched to an extended target. Such a beamformer has been used, for instance, to improve the location accuracy of acoustic arrays in shallow water [9]. Similar techniques have been proposed to make an adaptive array robust against
convoy positions
radar positions
Figure 14.37
Radar-target scenario (echelon formation)
wideband interference [4]. In both applications the targets appear to be spread in angle. Application of such a technique would require that the array beamwidth is narrower than the extended target. This requirement is somewhat contradictory to our starting point which was the identification of a convoy with a radar with high range but low azimuth resolution. It remains an open issue.
14.6
Summary
In this chapter the perspectives of STAP radar for application to ground target tracking have been pointed out. The role of the radial target velocity was put forward. If the radial velocity becomes zero, as in the case of tangential motion or a stop, the STAP processor cannot distinguish between the target and the ground clutter and, therefore, suppresses the target. Several degrading effects have been discussed, including the influence of a large bandwidth, jamming and radar ambiguities. A few preliminary thoughts on tracking convoys of vehicles have been added which might be useful to inspire further research in this field. In the next chapter (Chapter 15), W. Koch will analyse the impact of the presented results on ground target tracking.
References 1 ALBAREL, G., TANNER, J. S., and UHLMANN, M.: 'The trinational AMSAR programme: CAR active antenna architecture'. Radar'97, 14-16 October 1997, Edinburgh, Scotland, pp. 344-347 2 BRENNER, A. R. and ENDER, J. H. G.: 'First experimental results achieved with the new very wideband SAR system PAMIR'. Proceedings of EUSAR 2002, 4-6 June 2002, Cologne, Germany, pp. 81-86 3 COVAULT, C : 'Joint-Stars patrols Bosnia', Aviation Week & Space Technology, February 1996, pp. 44-49 4 GERSHMAN, A. B., SEREBRYAKV, G. V., and BOEHME, J. F.: 'Constrained Hung-Turner adaptive beam-forming algorithm with additional robustness to wideband and moving jammers', IEEE Trans. Antennas Propag., March 1996, 44, (3), pp. 361-367 5 HOOGEBOOM, P., et al.: 4SOSTAR-X, a high performance radar demonstrator for airborne ground surveillance'. Proceedings of EUSAR 2000,23-25 May 2000, Munich, Germany, pp. 825-827 (VDE Publishers) 6 KLEMM, R.: 'Principles of space-time adaptive processing' (IEE Publishing, London, UK, 2002, 2nd edn.) 7 KLEMM, R.: 'Effect of ambiguities on GMTI radar'. Proceedings of IEE Radar 2002, Edinburgh, Scotland, 2002, pp. 148-152 8 KLEMM, R.: 'STAP with omnidirectional antenna arrays', in KLEMM, R. (Ed.): 'Applications of space-time adaptive processing', this volume
9 KLEMM, R.: 'Low-error bearing estimation in shallow water', IEEE Trans. Aerosp. Electron. Syst., July 1982, 18, (4), pp. 352-357 10 KLEMM, R.: 'Effect of aircraft crabbing on sidelooking STAP radar'. EUSAR 2002, Cologne, Germany, 4-6 June 2002 11 KLEMM, R.: 'Ground target tracking with STAP radar: the sensor'. RTO SET symposium fall 2003, Budapest, Hungary, 15-17 October 2003 12 KOCH, W. and KLEMM, R.: 'Ground target tracking with STAP radar', IEE Proc, Radar Sonar Navig, June 2001, 148, (3), pp. 173-185 13 KOCH, W.: 'Effect of Doppler ambiguities on GMTI tracking'. Proceedings of IEE Radar 2002, Edinburgh, Scotland, 2002, pp. 153-157 14 NICKEL, U.: 'Performance of corrected adaptive monopulse estimation', IEE Proc, Radar Sonar Navig., February 1999,146, (1), pp. 17-24 15 SCHILLER, J.: private communication 16 WARD, J.: 'Cramer-Rao bounds for target angle and Doppler estimation with space-time adaptive processing radar'. Proceedings of 29th ASILOMAR conference on Signals, systems and computers, 30 October-2 November 1995, pp.1198-1203 17 WARD, J.: 'Maximum likelihood angle and velocity estimation with space-time adaptive processing radar'. Proceedings of ASILOMAR 96, 4-6 November 1996 18 WILDEN, H. and ENDER, J.: 'The crow's nest antenna - experimental results'. IEEE international Radar conference, Arlington, VA, 1990, pp. 280-285
Chapter 15
Ground target tracking with STAP radar: selected tracking aspects Wolfgang Koch
15.1
Introduction
Ground surveillance aims at near real-time production of a dynamic ground picture. This task comprises track extraction and track maintenance of single ground moving vehicles and convoys, mobile weapon systems or military equipment, as well as lowflying targets such as helicopters. As ground target tracking is a challenging problem, all available information sources must be exploited, i.e. the sensor data themselves as well as background knowledge about the sensor performance and the underlying scenario. For long-range, wide-area, all-weather and all-day ground surveillance operating at high data update rates, GMTI radar proves to be the sensor system of choice (GMTI: ground moving target indication). By using airborne sensor platforms in stand-off ground surveillance applications the effect of topographical screening is alleviated, thus extending the sensors' field of view. Chapter 14 is devoted to characteristic problems of signal processing related to GMTI tracking with STAP radar, whereas here we discuss selected tracking aspects, which arise from using more appropriate sensor models and exploiting background information. The following topics are of particular interest: (i) Doppler blindness Even after platform motion compensation by using STAP techniques [9], ground moving vehicles can well be masked by the clutter notch of the sensor. This physical phenomenon directly results from the low-Doppler characteristics of ground moving vehicles and causes interfering fading effects that seriously affect track accuracy and track continuity. Unless appropriately handled, Doppler blindness can cause serious problems, which seem to be even more difficult in the presence of Doppler ambiguities.
(ii) Road map information Even military targets usually move on road networks, whose topographical coordinates are known in many cases. Digitised topographical road maps, such as those provided by geographical information systems (GIS), should therefore enter into the target tracking and sensor data fusion process. The problem of battlefield surveillance is not considered here (i.e. transition to off-road targets). (iii) Sensor data fusion Since a single GMTI sensor on a moving airborne platform can record the situation of interest merely over short periods of time, sensor data fusion proves to be of particular importance. The data processing and fusion algorithms used for ground surveillance are closely related to the statistical, logical and combinatorial methods applied to air surveillance. A GMTI radar sensor produces measurements of kinematical target parameters and possibly false returns caused by residual clutter, for example. In addition, measurement errors and sensor parameters are provided. The resulting tracks represent the kinematical states of the targets along the corresponding accuracies and as such are prerequisites for producing a ground picture. We here discuss track maintenance for well separated vehicles for the case of low residual clutter. The effect of road map information and data fusion is addressed. Track initiation/extraction and track termination [7, 8] are not serious problems in the situations considered here. References 11 and 12 are standard references explicitly devoted to the problem of GMTI tracking with single and multiple sensors. For particular aspects see References 1 and 21. The impact of sensor modelling on GMTI tracking is discussed in References 19, 15 and 17. For a more general introduction into modern target tracking algorithms see References 3, 2 and 14.
15.1.1 Discussion of an idealised scenario Assuming a flat earth, Figure 15.1 shows an idealised scenario with two airborne GMTI sensors observing a ground vehicle moving with constant speed (15m/s = 54 km/h) parallel to the x-axis for most of the time. This situation is typical of standoff or gap-filling ground surveillance missions. In the second half of the observation period over Atmax = 25min the target stops for 7min. Then it speeds up again reaching its initial velocity. Finally, the target leaves the field of view of sensor 2. In Table 15.1 selected sensor and platform parameters are summarised. hp, vp denote the constant height and speed of the sensor platforms over ground; Ar, Acp are the range and azimuth regions covered by each sensor during observation. Here and in the following the azimuth angle
height, km
ATC = 6 s (cumulative) platform 1, Ar 1 = 15 s
platform 2, target, v = 54 km/h
Figure 15.1
Simplified scenario
Table 15.1 Selected sensor and platform parameters Sensor hp [km]
vp [m/s]
Ar [km]
1 2
200 40
[232,292] [-128,-67] [22,54] [77,172]
10 1
Acp [deg]
AT [s] MDV [m/s] 15 10
2 2
phenomena can cause problems: 1
Sensor-to-target geometries can occur where targets to be tracked are masked by the clutter notch of the sensor. This results in a series of missing detections until the geometry is changing again. 2 As stopping targets are indistinguishable from ground clutter, the early detection of a stopping event itself as well as tracking of 'stop & go' targets can be important to military applications. The impact of these effects on the detection probability is shown in Figure 15.2 for the scenario previously introduced. For both sensors we observe deep notches (dashed line: platform 1; dotted line: platform 2). In the centre of these notches the radial velocities of the target and the surrounding ground patch are very close to each other, thus making target discrimination by Doppler processing (STAP [9]) impossible. This is particularly true if the target stops. The dashed and solid lines in Figure 15.3 denote the radial velocities of ground patches around the target and target returns, respectively. The area shaded in grey reflects the width of the clutter notches of the sensors, which is determined by the individual minimum detectable velocities. For each sensor, both curves are closely adjacent to each other indicating that the target is moving at a much lower speed than
detection probability
platform 1
cumulative
platform 2
tracking time, min
Detection probability
range-rate, m/s
Figure 15.2
platform 2
solid: target, dashed: clutter shadow: clutter notch platform 1 tracking time, min
Figure 15.3
Range-rate (ground, target)
the sensor platforms. We notice sliding intersections between the curves. These are responsible for the relatively long duration of Doppler blind phases. 15.1.1.2 GMTI model For dealing with this phenomenon we propose a refined model for describing the sensor characteristics and discuss its benefits for improving the performance of ground target tracking and sensor data fusion. The model is adapted to STAP techniques in that the detection probability assumed in the tracking process is described as a function of the GMTI-specific clutter notch. Although the current location of the notch is determined by the kinematical state of the target and the current sensor-to-target geometry, its width is given by a characteristic sensor parameter (MDV).
Using this more detailed information on the sensor, performance can be incorporated into the tracking process. This, in particular, permits a more appropriate treatment of missing detections. In other words, information on the potential reasons that might have caused the missing detections enters into the tracking filter. We have observed that, by this measure, the number of lost tracks can significantly be reduced while the track continuity is improved, finally leading to a more reliable ground picture. 15.1.1.3 Sensor fusion Assuming an idealised processing architecture (centralised data fusion), the mean cumulative revisit interval ATC results from the individual revisit intervals A 7}, / = I,... ,ns,ofns sensors according to: (15.1) For the previous example (ns = 2, AT\ = 15 s, A72 = 10 s), we obtain ATC = 6 s. The mean cumulative detection probability PCD referring to ATC is given by: (15.2) with PlD denoting the detection probabilities of the individual sensors, which depend on the corresponding sensor-to-target geometries. Evidently, the corresponding revisit intervals AT/ also enter into this formula, which describes the mean improvement of the overall detection performance to be expected by sensor data fusion. The larger the individual revisit interval A 7} of sensor i, the smaller is the effect of sensor i on the collective performance, even if the corresponding individual detection probability PlD is large. ATC and PCD are averaged quantities, by which the expected performance improvement can be predicted in an overall sense. Figure 15.2 also shows the mean cumulative detection probability PCD for the above example (solid line). The impact of the clutter notches is more or less compensated. Due to the fact that PCD is related to the mean cumulative revisit interval ATC = 6 s, being shorter than those of the individual sensors (ATi = 10s, A 72 = 15 s), the cumulative detection probability is smaller than the detection probability of the sensor dominating at that time.
15.1.2 Summary of observations In the following sections of this chapter selected tracking aspects typical of GMTI tracking and sensor data fusion are addressed. They arise from more appropriate sensor models and the use of simple background information. For their treatment basic approaches and suboptimal realisations are discussed. The potential improvements with respect to GMTI tracking performance are illustrated by a simulated example. We in particular made the following observations.
15.1.2.1 Gain by GMTI modelling An appropriate sensor model is important to GMTI tracking: 1
2
3
4
5
6
The phenomenon of Doppler blindness, which depends on the clutter notch of the sensor, the current sensor-to-target geometry, the kinematical target state and the particular signal processor used, can significantly degrade the tracking performance. Information on the GMTI-specific clutter notch should explicitly enter into GMTI tracking. As a consequence, the probability density functions of the target states are normal mixtures with possibly negative weighting factors (Bayesian approach). The model is characterised by an additional sensor parameter (MDV, minimum detectable velocity). For ground moving convoys, Doppler blindness often superposes resolution conflicts. A separate sensor resolution model can thus be omitted. By an adequate exploitation of all available a priori knowledge of the sensor performance, including the instantaneous position/width of the clutter notch, missing detections can provide valuable information on the current kinematical target state. In comparison to standard trackers that do not make use of a GMTI-specific sensor model, the resulting tracking errors and correlation gates are significantly smaller. This simplifies the discrimination of false or unwanted targets (residual clutter). If applied to operational GMTI tracking systems, the refined sensor model will not only imply reduced track loss probabilities. We also expect an improved track continuity being a prerequisite to classification/identification in military applications.
15.1.2.2 Gain by road maps The GMTI tracking framework is open to incorporate background information such as road maps: 1
An important element is a simplified Gaussian model of the potential error sources, which is characterised by possibly varying error covariances (mapping, discretisation, coordinates, sensor misalignment and positioning, terrain registration). 2 In a Bayesian approach the probability density functions of road targets are also given by normal mixture densities, whose components refer to the individual road segments. Crossroads imply an inherent multiple hypothesis structure on the densities. 3 In the case of curvilinear or winding roads, the state predictions can greatly be improved; in the filtering step the cross-range errors, being large in the case of stand-off radar, are significantly reduced depending on the instantaneous sensorto-target geometry.
4
More accurate azimuth estimates support the discrimination of unwanted or false returns and the pruning of irrelevant track hypotheses. In addition, the performance of track extraction and automatic target recognition techniques can be improved. 5 Even in the case of Doppler blindness the exploitation of both the GMTI-specific sensor model and road map information in general lead to more acceptable tracks. In particular, the recognition of stopping targets can be assisted (stop and go traffic).
15.1.2.3 Gain by sensor fusion Assuming an idealised sensor fusion architecture without limitations from misalignment or bandwidth we observed: 1
The effect of Doppler blindness, already alleviated by the GMTI sensor model, can be reduced even more. In addition, the fusion of the event 'a target under track is no longer detected' can be useful for the early recognition of stopping targets. 2 The achievable gain by sensor data fusion is not only a consequence of increased data update rates but also caused by the different geometrical conditions under which the ground moving targets are either observed or screened by the GMTI clutter notches. 3 Due to the movement of the sensor platform, the targets merely remain for relatively short periods of time in the sensor's field of view. Evidently, the total coverage of a ground situation of interest can significantly be increased by sensor data fusion.
15.2
Tracking preliminaries
The choice of a suitable coordinate system for describing the underlying sensor-totarget geometry, the sensor platform trajectory and the available a priori information on the dynamic behaviour of ground moving targets are prerequisites to target tracking.
15.2.1
Coordinate systems
For the sake of simplicity we consider three coordinate systems in which the underlying physical phenomena become transparent: 1
cartesian ground coordinates, where the description of the target and platform kinematics is of a particularly simple form 2 the moving cartesian antenna coordinate system, whose x-axis is oriented along the array antenna of the GMTI radar mounted on the airborne sensor platform 3 the sensor coordinate system, in which measurements of the kinematic target parameters are described (target range, azimuth and range-rate).
Furthermore, a flat earth is assumed. In the applications the use of coordinates different from these may be more convenient [11, 12]. 15.2.1.1 Antenna coordinates In ground coordinates the kinematic state xp(t) of a platform moving at a constant height hp and a constant velocity rp = vp(cos(pp,sin(7) T ,rJ) T with rp{t) = r^(0) H- xpt and i>(0) = (xp,yp,hp)T. Let the orientation of the antenna array relative to the platform velocity rp be defined by the direction vector ea = (cos <pa, sin(pa,0)T. The cases cpa = 0, n/2 describe sideand forward-looking array antennas, respectively (SLAR/FLAR). Due to aircraft crab, however, even in the case of a side-looking antennas cpa ^ 0 may occur (crab angle). The kinematic target states x^ and x£ at time tk in ground and antenna coordinates, respectively, can be transformed into each other by means of the transformation equations x^ = tg+-.a[x%; tk] and x£ = ta^g[x8k; tk], which are defined by: (15.3) (15.4)
15.2.1.2 Sensor coordinates Let xak = O t T , r £ T ) T with rak = (xk,yk,-hp)T and rak = ( i * , B , 0 ) T denote the kinematic target state in antenna coordinates and let x^ = (rk,Vk)T be the corresponding quantity in sensor coordinates. The non-linear transforms between both coordinate systems are given by:
(15.5)
(15.6) The transforms between ground and sensor coordinates, t 5 «_ g [x^;^], tg+slzkitk], result from the concatenation of these transforms. The calculation of the related Jacobians, T 5 ^ g [x^] = dts^g/dxf andT g ^[xfc] = dtg^-s/dxk, is straightforward.
15.2.2
Target dynamics model
Let the kinematic state vector of a ground moving target at time tk be given by its current position r | = (xl-\>xk-2>xk-3^T alK* velocity r^: (15.7) Acceleration components are omitted. In the previous scenario (Figure 15.1) we, in particular, have: xf.3 = xgt6 = 0. 15.2.2.1 Cartesian coordinates In the ground coordinate system the target dynamics are modelled by a linear system equation with additive white Gaussian noise [3,2]. With a scalar plant noise variance ^k\k-\ S i v e n b v £*|*-i = vt(l - Q-2^k-tk-\)/ot^\/2 (p a r a m e ters vt and 0t discussed below), three-dimensional diagonal matrices J = diag[l, 1,0], O = diag[0,0,0] and a noise component v# ~ N(O, J), let us consider the following realisation [5, 6]: (15.8) with matrices Ff ^ 1 and Gf^ -1 given by:
(15.9)
According to this dynamics model, the velocity i> is described by an ergodic Markov process with E[r|] = 0. The corresponding autocorrelation function is given by E[r
15.2.2.2 Sensor coordinates With the non-linear transforms t g ^ , ts^~g between ground and sensor coordinates, a first-order Taylor expansion around x | _ ^ - 1 and x^_\\k-\ (the state estimates at tk-1 in ground and sensor coordinates, respectively, using all associated measurements up to and including tk-\) yields a linearised system equation in the moving sensor coordinate system: *k = Fk\k-\Xk-\ + G*|*-iv* + ukik-\
(15.10)
where:
Gk\k-\
=Ts^g[x8klk^l]Ggklk_l
Uk\k-1 = h\k-\[*k-\\k-\]
*l\k-\
— ^k\k-\*k-\\k-\i
=K\k-\h-s[*k-\\k-\\tk-x]
h\k-\l*k-\\k-\\
=ts^gW{\k-itg*-s[*k-\\k-\\tk-\\\tk\
In this system equation the input vector Uk\k-\ basically reflects the (known) platform position and motion.
15.3
GMTI sensor model
In the subsequent considerations Zk = {zJ^Lo denotes a set of nk sensor reports (frame of observations), which are detected at time tk by scanning a certain area of interest. The time instant tk is often referred to as revisit time. In this notation let z£ be the event that at time tk the sensor possibly produced no valid target detection.
15.3.1
GMTI characteristics
Due to the physical and technical reasons previously discussed, the detection of ground moving targets by airborne radar is limited by strong ground clutter returns. This can be much alleviated by STAP techniques [9]. The characteristics of STAP processing, however, directly influence the GMTI tracking performance. 15.3.1.1 Main-lobe clutter Let us consider a ground patch located at the position rf in ground coordinates (i.e. rf = o), which is illuminated by the mainlobe of the moving STAP radar sensor. In antenna coordinates let r and (p denote slant range and azimuth angle of the ground patch related to the cartesian position vector xac = Vr 2 — h2p (cos cp, sin cp, 0) T . According to equation (15.3) in the moving antenna coordinate system the state vector of the ground patch is given by x^ = ( r ^ T , r ^ T ) T with a non-zero velocity vector xac — — ^ ( c o s (pa, sin (pa, 0) T . The radial velocity of the ground patch therefore directly results from equation (15.6): rc(r,(p;vp,hp,(pa)
= -vp cos((p - ^ ) V l - (hp/r)2
(15.11)
Besides the location of the ground patch described by r and a. For physical and technical reasons, signal processing for airborne GMTI radar [9] is unable to separate a ground moving target from the surrounding mainlobe clutter
return if the target's radial velocity is equal to r c , i.e. if the kinematic state vector xk = (rk,(pk,h,^Pk)T of the target obeys the relation nc(xk) = 0 where the function nc is given by: nc(xk) =nc(rk,(pk,rk;vp,hp,(pa) = h -rc(rk,
(15.12) (15.13)
In other words, the equation nc(\k) = 0 defines the location of the GMTIspecific clutter notch of the sensor in the state space of a ground target and as such reflects a fundamental physical/technical fact without implying any further modelling assumptions. 15.3.1.2 Qualitative discussion An adequate modelling of this phenomenon is important to ground moving target tracking. Besides being simple enough to be mathematically tractable, the GMTI sensor model must reflect the following qualitative conclusions: 1
The detection probability of the sensor depends on the kinematic target state and the sensor-to-target geometry: PD = PD(XJO2 PD&k) is small in a certain region |nc(Xj0l < MDV around the clutter notch characterised by the sensor parameter MDV. 3 Far from the clutter notch, the detection probability is a function of the SNR (target signal-to-noise ratio), which itself depends on a number of sensor parameters such as antenna and processing gain and target parameters such as target cross-section and range. 4 There exists a comparatively narrow transient region between these two domains in the state space of the ground target. 15.3.1.3 Quantitative modelling In a generic description of the detection performance of GMTI sensors it seems plausible to write PD = PD(^) as a product with one factor reflecting the directivity pattern and propagation effects due to the radar equation [4], pD = PD(Tk,
(15-14)
In this expression the sensor parameter MDV has a clear and intuitive meaning: in the region defined by |n c (x^)| < MDV we have PD < \PD- The parameter MDV is thus a quantitative measure of the minimum radial velocity with respect to the sensor platform which a ground moving target must at least have to be detected by the sensor (minimum detectable velocity). The actual size of MDV depends on the particular signal processor used. For Swerling I targets PD is given by: /?£>(r,
impact of sensor parameters such as antenna and processing gain, the factors (o/cro) and (r/ro)~ 4 denote the dependency on the radar cross section o and the range r of the target with reference to a standard target characterised by cro and r§. Let the directivity pattern of the single radiating array element be described by D(cp) = sin2 (
15.3.2
Convoy resolution
Since in military ground traffic vehicles often move in convoys, at first view resolution phenomena seem to be typical of long-range ground surveillance. Due to the asymmetric effect of range and angle resolution, however, Doppler blindness in many cases superimposes resolution effects: as soon as convoy targets cease to be resolvable, they are at the same time buried in the clutter notch and thus escape detection. Vice versa, resolvable convoy targets are rarely Doppler screened. A separate modelling of the sensor resolution might therefore be omitted. As an example we assume two targets moving in a row along a straight road at 30 km/h, as typical of military applications. Their mutual distance is 50 m. The sensor-to-target geometry is as depicted in Figure 15.1. Let the sensor resolution be given by: ar = 10 m (range), a^ — 0.1° (azimuth), ar — 0.5 m/s (range-rate). Figure 15.4 shows the detection probabilities of both sensors (solid lines). The width of the notch is larger than in Figure 15.2 due to the smaller convoy speed. The dotted lines denote the resolution probabilities Pr of the sensors modelled according to [18]: p — I _ e -log2(Ar/a r ) 2 e-log2(A
/ ^ ^x
detection/resolution probability
Ar, A(p, Ar are the distances between the targets in sensor coordinates. If Pr is dominated by the angular resolution (i.e. Ar and Ar are small), Doppler blindness occurs. Outside of the notch the high range/range-rate resolution guarantees resolved returns. For an approach to track ground moving convoys see Reference 16.
platform 1
solid: detection dotted: resolution platform 2
tracking time, min
Figure 15 A
Detection and resolution probability
15.3.3 Doppler ambiguities For Ft Doppler ambiguous radar, the signal processing chain [9] is unable to separate a ground moving vehicle from the surrounding mainlobe clutter return if the difference between the radial velocity r of the vehicle and rc is equal to a multiple of the first blind speed r/,, i.e. if the relation: nfJ(r,
= 0,
j = 0,1,2,...
(15.16)
holds, where ncJ = nc J (x) is a function of the state vector x = (r, (p, r,
15.3.4 Measurements Provided that a detection actually occurred, measurements of the kinematic target parameters are modelled by z& = Hx^ + w&, H = (I,o), w^ ~ Af(O5R) with an error covariance matrix R = diagO;2, o^, af] and I = diag[l, 1,1], o = (0,0,0) T . In other words: range, azimuth and range-rate measurements are independent of each other and bias free with normally distributed measurement errors. Doppler ambiguous measurements are given by zk J = z^ ± j(fi, 0, r^) T , j — 1,2 Due to monopulse processing [23], the standard deviations cfy, a-r depend on the signal-to-noise plus
interference ratio: a^ = S<^,r/(SNIR/SNR0)1/2, where SNIR can be modelled by SNIR(r,
15.4
GMTI data processing
In a Bayesian view, a tracking algorithm is an iterative updating scheme for conditional probability densities p(xk\Zk) that statistically describe the kinematic state vector X^ of a target at discrete revisit times tk given both the accumulated sensor data Zk = {Zi}k=l up to time tk and all available background information. Each update consists of a prediction step that exploits the target dynamics model (and road map information if available). Prediction is followed by a subsequent filtering step, where the newly received sensor data are processed by making use of the underlying sensor model. Retrodiction (or smoothing) is a backwards-directed iteration for calculating the conditional probabilities p(xi\Zk)9 I < k, that describe the past target states x/ given all sensor data up to and including the present time ^ . This process is illustrated by the following scheme: p(xk-i\Zk~l)
dynamiCS
p(xk\Zk~l)
>
(prediction)
(15.20)
road maps ,
,/-yt—K
p{xk\Zk
sensor model
[
)
p(xi\Zk)
_tN
> new data Z^
p(*i+i\Zk),
.
/i/-^i\
(filtering)
(15.21)
(retrodiction)
(15.22)
/ r
p(*k\ZK) l < k
.u
.
model
By assuming suitable cost criteria (e.g. MMSE [2]), the probability density functions p(xi\Zk) provide state estimates Xp with related covariance matrices P/^ (/ > k: prediction, / = k: filtering, / < k: retrodiction). The subscripts denote that the quantities are related to t\ based on all measurements up to and including tk. As in any practical application, ambiguity due to the uncertain origin of the sensor data must be taken into account, the densities in general prove to be finite mixtures [3, 14, 24], i.e. weighted sums of individual probability densities.
15.4.1 Prediction Even under benign conditions, however, the formal application of the update recursions of equations (15.20) and (15.21) lead to a representation of p(Xk\Zk) by a Gaussian mixture with an ever increasing number of mixture components as the following discussion shows. Therefore, suitable mixture reduction techniques must be applied for keeping the tracking algorithm tractable. In the situations considered here it seems to be sufficient to consider densities p(xk\Zk), which are represented by a normal mixture consisting of two components.
Let the density p(xk-\ \2k
l
) be known at tk-\: (15.23)
with Af (xk; xk]kj P*,*) = d e t ( 2 7 r P ^ r 1 / 2 exp[-±(x* - x^) T P~{(x* - x ^ ) ] (multivariate normal density), where the weighting factors plk obey YIi P\ — 1 • Due to the dynamics model (equation (15.8)), the predicted density is given by [14, Section 4.2]: (15.24) with: (15.25) (15.26)
15.4.2 Data processing According to Bayes' formula, the processing of the new sensor data Zk received at time tk makes use of the predicted density p{xk\Zk~^) (previously calculated) and the measurement likelihood function p(Zk, nk\xk). 15.4.2.1 An approximate likelihood function The likelihood function p(Zk,nk\xk) statistically describes what a single frame of reports Zk = {zjJ}"^0 at time tk can say about the target state xk. Its particular structure is determined by the current sensor data frame and the sensor model describing the sensor's properties. The likelihood function can be written as a sum over all interpretation hypotheses regarding the origin of the data. For well separated targets in a possibly cluttered environment two classes of data interpretations can be identified: 1 the object being considered was not detected, the received sensor data are false 2 the target was detected, z£ is the target measurement, all other returns are false. For the sake of simplicity let us first consider Doppler unambiguous measurements. According to the previous modelling assumptions we obtain up to a factor, which is independent of x^ and the measurements [14, Section 3.5], the following expression for the likelihood function: (15.27) with PF denoting the spatial false return density. In this expression the function nc {xk) in the definition of the detection probability Po(^k) (equation (15.14)) is linearised
around the predicted state estimate \k\k-\ = (rk\k-\,
- We (15.28)
with a scalar quantity Zk and a 1 x 4 Jacobian - H ^ , which are given by: (15.29) (15.30)
(15.31) The notation chosen indicates that the effect of the GMTI-specific clutter notch on the likelihood function can formally be described by a fictitious measurement Zk and a corresponding fictitious measurement matrix H^. The real measurements are denoted by zjj*. By these considerations the detection probability PD(^) can thus be approximated by using a Gaussian, which linearly depends on the target's kinematic state vector x^:
(15.32) with a fictitious standard deviation of the measurement error related to Zk essentially given by the minimum detectable velocity (MDV), being a characteristic sensor parameter of the STAP radar. In order to simplify the notation we introduce quantities z^, H^7, RnJ defined by:
(15.33) We formally introduce R 00 = 00. According to the previous approximation the likelihood function p(Zk,rik\\k) can therefore be written as a weighted sum of Gaussians: (15.34)
with weighting factors given by (n = 1 , . . . , rc&):
15.4.3 Filtering process According to Bayes' rule the processing of the new sensor data Z& received at revisit time tk is based on the predicted density p(xk\Zk~l) and the likelihood function p(Zk,nk\xic)- With equations (15.24) and (15.34) we obtain up to a normalising constant: p(xk\Zk)
(X p(Zk,nk\xk)
p{xk\Zk~x)
(15.37)
(15.38) The product of Gaussians in equation (15.38) can be rewritten according to the following product formula:
(15.39) where (15.40) with compatible vectors and matrices x, y, z, H, P, R. This formula is a consequence of the observation that the left-hand side of equation (15.39) can be interpreted as a joint density /?(z, x) = p(z\x)p(x). The right-hand side follows by computing p(x\z) and p(z). We thus obtain p(xk\Zk) as a Gaussian mixture: (15.41) with mixture parameters p^l\ x ^ , and P ^ given by the Kalman-type following update equations: (15.42)
(15.43) (15.44) (15.45) (15.46)
15.4.4 Realisation aspects The probability density p(%k\Zk) (equation (15.41)) is a mixture with 4(n& + 1) components. If it is propagated according to equations (15.20) and (15.21), an exponential growth of the number of mixture components seems to be inevitable. 15.4.4.1 GMTI characteristics Following the spirit of the techniques used in standard PDA or IMM methods [2, Section 3.4] (PDA: probabilistic data association, IMM: interacting multiple models), we propose a suboptimal approach for keeping the number of mixture components under control, which might easily be refined if required. Let us first consider the following second-order approximation similar to the proceeding in hybrid estimation [20]:
(15.47) l
where the quantities p™, p k, xjj*', and P ^ are given by: (15.48)
15.4.4.2 Data association The mixture components that define the individual densities pt(xk) are related to the data interpretations due to the uncertainty regarding the origin of the reports. They
may be handled by well established mixture reduction techniques [3, section 16.2]. Provided the clutter density is low, a second-order approximation of each Pi(Xk) seems to be reasonable and results in a Gaussian (standard PDA [2]). For the sake of simplicity this approach has been used in the numerical results presented below. Hence the p(\k\Zk) is finally approximated by a normal mixture with two components at each time t^. The generalisation to more refined approximation techniques is straightforward and leads to MHT (multiple hypothesis tracking).
15.4.5 Discussion Figures 15.5 to 15.10 provide a qualitative insight into the effect of the refined sensor model on target tracking and sensor data fusion. Although a high adaptivity is evident near the clutter notch, far from the notch no difference to standard filters is observed, as to be expected.
Figure 15.5
No detection a Standard filter b GMTI filter
Figure 15.6
Within the notch a Standard filter b GMTI filter
Figure 15.7
Detection near the notch a Standard filter b GMTI filter
15.4.5.1 Effect of GMTI modelling Figures 15.5a,b display the probability density functions resulting from processing the event that a missing detection occurred near the notch. To show the most interesting features, the densities are projected onto the azimuth/range-rate plane. The PDF of the standard tracker (Figure 15.5a) is identical with the corresponding predicted density, but the refined sensor model leads to a bimodal structure (Figure 15.5b). The broader peak refers to the possible event that the missing detection has simply statistical reasons as in the case of standard filtering, and the sharper peak behind it reflects the hypothesis that the target was not detected because it is masked by the clutter notch. The situation that the target is buried in the clutter notch for several revisits is represented in Figures 15.6a,b. Evidently the PDF of the standard filter totally faded away permitting no reasonable state estimation (Figure 15.6a). The refined filter, however, preserved a definite shape (Figure 15.6b). This can be explained as follows: instead of actual sensor data the very information that several successively missing detections occurred was processed. This event provides a hint to the filter that the kinematic target state probably obeys a relation determined by the clutter notch. Apparently, this piece of evidence proves to be as valuable as a measurement of one of the components of the target state. Figure 15.7a,b refers to the event that a detection occurred near the clutter notch. Although the standard filter produced a simple Gaussian, the refined filter shows a more complex structure. In fact, the PDF is a two-component mixture whose weighting factors differ in their sign (but sum up to one). The resulting shape permits an intuitive interpretation: the sensor model inherently takes into account that the target state Xfc does not lead to a small value of nc(\k) (Section 15.3.1); otherwise the target would not have been detected at all. For this reason, the sharp cut in the PDF simply indicates the location of the clutter notch.
15.4.5.2 Gain by sensor data fusion Figures 15.8 to 15.10 show the probability densities of the target position in cartesian ground coordinates after filtering. The prolated structure of the PDFs mirrors the predominant impact of cross-range errors. Their shape is rotated with respect to each other due to the different sensor-to-target geometries. This effect can be much more pronounced in other situations. We indicated the true target position. Figures 15.8a to 15.10a refer to a regular tracking situation (after 10 min, see Figures 15.1 and 15.2). Doppler blindness occurred for sensor 2 during the previous revisits. The probability densities shown in Figures 15.8b to 15.10b have been calculated at a time when the target has stopped for 3 min (Figure 15.2). Evidently in Figures 15.8b and 15.9b the dissipation of the density functions is confined to a particular direction according to the GMTI sensor model. Figure 15.10a,b shows the probability densities obtained by sensor data fusion. In both cases we observe a significant fusion gain, which is a consequence of the different orientation of the density functions and leads to improved state estimates.
Figure 15.8
Sensor 1 a Tracking b Target stop
Figure 15.9
Sensor 2 a Tracking b Target stop
Figure 15.10
Sensor fusion a Tracking b Target stop
The result for the stopping targets is particularly remarkable. Although no sensor data are available from both sensors, the very fusion of the sensor output 'target under track is no longer detected' implies an improved target localisation. This is a consequence of the different sensor-to-target geometries.
15.4.6 Retrodiction The update formulae for the retrodicted densities p(\i\Zk) (/ < k, equation (15.20)) are derived in close analogy to the IMM retrodiction techniques proposed in References 13 and 14. Let the retrodicted density at time ti+\ be given by: (15.49) Formally, the requested conditional density p(xi\Zk) can be written as a marginal density: (15.50) (15.51) An application of Bayes' formula and exploiting equation (15.10) yields: (15.52)
(15.53)
(15.54)
By using the product formula (equation (15.39)) we obtain:
(15.55) with weighting factors C^(X/+1) depending on the target state x/ + i at time tf. (15.56)
(15.57) (15.58) (15.59) With x/ + \\k = Xl/=o P\+1 \k x/+1 life' b e m S m e best estimate of the target state x/+1 given the data up to, and including, time tk, we consider the approximation: CL(Xz+I) & C^(Xz+IiJt). With this approximation the Gaussian mixtures at the right-hand sides of equations (15.49) and (15.55) are inserted into equation (15.51). By a second use of the product formula (equation (15.39)) we finally obtain Rauch-Tung-Striebel-type update equations: (15.60) (15.61) (15.62) for the retrodicted densities: (15.63) Exponential growth of the number of mixture components is avoided by a secondorder approximation as discussed in Section 15.4.3. We thus obtain: (15.64)
with mixture parameters /?L, x L PL given by: 1
(15.65)
(15.66)
(15.67)
15.4.7 Effect ofDoppler ambiguities In the case ofDoppler ambiguities, the measurement likelihood function p(Zk, rik\\k) has the following form:
(15.68) up to a factor being independent of x^ and the measurements with Doppler ambiguous measurements given by zk J = zlk ± J(O9 0, r/,) T . In close analogy to the previous discussion we approximately represent the detection probability PQ (equation (15.18)) by linearising the non-linear function nc J (Xk) at time tk around the predicted state vector \k\k-\ and obtain: (15.69) with: (15.70) (15.71) As a direct consequence of these considerations the measurement likelihood p(Zk, nk\*k) can be written as a function proportional to a Gaussian mixture in the kinematic state variable x#. By exploiting this Gaussian sum representation of the likelihood the probability density p(\k\Zk) (equation (15.41)) is a Gaussian mixture itself with many components, most of them, however, with small weighting factors. If the density is propagated according to equations (15.20) and (15.21), however an exponential growth of the number of mixture components seems to be inevitable. We therefore apply the PDA- or IMM-type mixture reduction techniques proposed in Reference 3 and additionally consider a second-order approximation with respect to the Doppler multiples appearing in the measurement likelihood function. By this approximations it becomes possible to represent the density p(\k\Zk) by a
two-component mixture in analogy to the proceeding in Reference 19, which is characterised by the parameters /?/, xJL^, PJ^, i' = 1,2. Despite the admittedly rather crude nature of these approximations, nevertheless the essential characteristics of the GMTI sensor seem to be incorporated into the tracking algorithm, at least in a qualitatively correct way. The application of more refined approximation techniques is possible, but this does not seem to enhance the results greatly. 15.4.7.1 Selected examples Let us consider a side-looking STAP radar with a configuration such as discussed in Reference 10. In order to focus on GMTI-specific aspects, the spatial residual clutter density pr is assumed to be small. False returns therefore play practically no role in our simulations. The following simulation parameters were chosen: revisit interval: 10 s; MDV: 3m/s; A.: 0.03 m; PRF: 2,4,8,16 kHz; measurement error: 20 m (range), 0.2° (azimuth), 0.5 m/s (range-rate); r$\ 65 km; SNRo: 25 dB; pf. 10~6; a/ao = 1. Figures 15.11a, 15.12a and 15.13a show an idealised scenario with a sensor platform moving at a speed of 200 m/s along with a ground moving vehicle (12 m/s) and a low-flying target (120 m/s), respectively. The platform height is 3 km, the total tracking time 25min. Figures 15.11b, 15.12b and 15.13b display the corresponding detection probabilities, and Figures 15.11c, 15.12c and 15.13c show the radial velocity r of the targets along with the radial velocity rc of the corresponding mainlobe clutter return. In the detection curves the directivity pattern of the side-looking radar becomes clearly visible. 15.4.7.2 Observations In general we observed that the processing of the Doppler ambiguous sensor returns causes no problems. Even with the PDA-type approximations used, after a short initiation time the tracker correctly identifies the relevant Doppler measurement. The processing of Doppler information on the target during the phase of track maintenance is advantageous in that the correlation gates involved are much reduced in size. By this the discrimination of false or unwanted targets as well as convoy tracking can be alleviated. From the situation shown in Figure 15.11 we can learn that the radial velocity of low-Doppler targets and the radial velocity of the surrounding mainlobe clutter return show sliding intersections. Therefore vehicles once masked by the clutter notch will escape detection for a longer series of revisits until the sensor-to-target geometry is changing again, as previously discussed. Information on the current location of the clutter notch, however, is incorporated in the measurement likelihood function previously proposed. More intuitively speaking, this knowledge acts as an additional fictitious measurement by which this GMTI-specific fading phenomenon can be alleviated. Obviously the multiplicity of clutter notches is in most practical cases irrelevant to low-Doppler targets. In the case of low flyers moving at a higher speed than ground moving vehicles, r varies over a larger range and the sensor-to-target geometry changes more rapidly.
A7= 10 s (revisit interval)
ground moving vehicle: v= 12 m/s
range rate, m/s
detection probability
height, km
A/max = 25 min (total tracking time)
tracking time, min
Figure 15.11
tracking time, min
Ground moving vehicle: PRF = 2 kHz a Scenario b Detection probability c Range-rate (clutter/target)
Therefore, multiple clutter notches related to higher blind speeds do occur in the tracking scenario. In most cases, however, the higher target speed is also responsible for the fact that the intersections between the target's radial velocity r and rc ± jr^, j = 0 , 1 , . . . are no longer of a sliding nature. Fading phenomena as in the case of moving vehicles are thus rarely observed (Figure 15.12). The benefit of exploiting the more refined GMTI detection model is therefore not significant in the same way as before. The bridge-over provided by standard Kalman filtering is nearly of the same quality. Even for low flyers, however, very broad clutter notches due to sliding intersection may occur for particular sensor-to-target geometries (Figure 15.13c). For these fading phenomena typically clutter notches related to blind speeds are responsible. In Figure 15.13b no Doppler blindness occurs for PRF = 16 kHz, and it is visible at the still fairly high PRF = 8 kHz. As in the case of ground moving vehicles the related fading phenomena can be alleviated by exploiting information on the location of the corresponding notch, which is contained in the measurement likelihood function. The multiple clutter notches appearing at even lower PRFs are much narrower and hardly disturb the tracking process.
A7 = 10 s (revisit interval)
height, km
Atmax = 25 min (total tracking time)
range rate, m/s
detection probability
low-flying target: v= 12m/s
solid: targeif dotted; flutter shadow* flutter jr6tch
tracking time, min
tracking time, min
Figure 15.12
Low-flying target 1: PRF = 4 kHz a Scenario b Detection probability c Range-rate (clutter/target)
Ar= 10 s (revisit interval)
height, km
A/max = 25 min (total tracking time)
Figure 15.13
low-flying target: v = 12 m/s
Low-flying target 2: PRF = 16,8,2
a Scenario
kHz
range rate, m/s
detected probability
tracking time, min
range rate, m/s
detected probability
tracking time, min
tracking time, min
range rate, m/s
detected probability
tracking time, min
tracking time, min
Figure 15.13
15.5
6lid;target, dotted: clutter
tracking time, min
Continued b Detection probability c Range-rate (clutter/target)
Road map information
In many practical cases even military ground vehicles move on roads, the topographical coordinates of which are available up to a certain error (digitised road map information). In this context it seems reasonable to describe the kinematic state vector
\[ of road targets at time tk by its position on the road k (i.e. the arc length of the curve) and its scalar speed /*: x£ = (Ik, h)T• The model for describing the dynamic behaviour of road targets is therefore a two-dimensional version of equation (15.8). By making use of the related transition density p(xrk\\rk_^) the predicted density in road coordinates is given by p(xrk\ Zk~l) = / d x j ^ p(\rk\\rk_x) p(\rk_x\Zk~x). 15.5.1
Modelling of roads
A given road through a real road network is mathematically described by a continuous three-dimensional curve 1Z* in cartesian ground coordinates. For the sake of simplicity the effect of crossroads is not considered here. See References 25 and 26 for a more detailed discussion. Let IZ* be parameterised by the corresponding arc length /. The exploitation of digitised road maps provides the database for a piecewise linear approximation of the road curve IV : I i-> Tl* (I) by a polygonal curve IZ. Let us furthermore assume that the curve IZ is characterised by nr node vectors: s m =n*(lm),
m = l,...,nr
(15.72)
From the these quantities nr — 1 normalised tangential vectors: tm = (Sm+i - s m ) / | | s m + i - s m | | ,
m = l,...,rcr- 1
(15.73)
can be derived (see Figure 15.14). The euclidian distance ||s m +i — s m || between two adjacent node vectors, however, is usually not identical with the distance km = lm+\—lm actually covered by a vehicle when it moves from sm to s m+ i along the road. Besides the vectors sm the scalar quantities Xm > |s m+ i — s m | should therefore enter into the road model to make it more realistic. The differences Gd = | km — \ \ sm+1 — sm 111 can evidently serve as a quantitative measure of the discretisation errors we have to deal with. Using the characteristic functions defined by:
(15.74) and: (15.75)
road segments: startng point length direction width (error)
Figure 15.14
Representation of a road
we obtain a mathematically simple description of the polygon curve TZ, by which the road IZ* is approximated: (15.76) with: (15.77)
15.5.2 Densities on roads The Bayesian formalism previously discussed can directly be applied to road targets, if it is possible to find a transformation operator Tg<-r by which the predicted density p(xrk\Zk~l) in road coordinates can be transformed into ground coordinates: (15.78) When available in ground coordinates, the linearised versions of the transforms ts+-g and tg+-s (Section 15.2.1) can be used to represent the densities in sensor coordinates, where the filtering step is performed. To this end, we write the density p(xf\Zk~l) as a sum over the nr + 1 road segments considered: (15.79)
(15.80)
(15.81) In equation (15.81) the probability: (15.82) (15.83) (15.84) denotes the probability that the target moves on the segment m given the accumulated sensor data Zk~x. The matrix H r is defined by Hrx£ = /^. Later on, it will be intuitively interpreted as a fictitious measurement matrix. Since the density p(xrk\Zk~x) = Yl)=O P{-\ N (xrk; x ^ _ 1 ? ^k-IJ i s a G a u s s i a n mixture due to the GMTI sensor model, the probabilities /?^_ r can explicitly be expressed by
error functions: (15.85) with: (15.86)
(15.87) For the remaining term in equation (15.81) standard probability reasoning yields: (15.88) (15.89) (15.90)
15.5.2.1 Simple roads Let us first consider the simple limiting case of a straight road defined by: 1Z(I) = s + It. Under Gaussian assumptions the transform from road to ground coordinates is defined by the normal transition density: /?(xj^+1|x£+1) = Af (x8k+]; tg«_r[xj*+l], o^) with the affine transform t g< _ r [x r ] = (J J)x r -f (s~^lt) and am denoting the standard deviation of the mapping error. The transformation of the density p(xrk\Zk~x) into the ground coordinate system is therefore described by p(xgk\Zk~x) = f dxrkp(xk Ixp p(xrk I Zk~1). The integration can explicitly be carried out and preserves the Gaussian character of the density functions (normal mixtures). The corresponding inverse is simply provided by a projection of the density p(xgk\Zk~l) on the road. With these transformations the previous considerations directly apply. 15.5.2.2 Polygonal roads The transition density p(xf \xrk, m) for the road segment m (equation (15.90)) is characterised by road map and discretisation errors (crm, cr^), which may vary from segment to segment. Under Gaussian assumptions regarding the possible error sources, with the affme transforms tJL_r[xr] = ( V f ° )x r + (**n-imtm ) for each individual road segment m, and the error standard deviation a;? = G^n + a1^ we obtain normal transition densities: (15.91)
With these preliminaries, an application of Bayes' rule to the remaining density in the integrand of equation (15.90) yields: (15.92) with probabilities p(m\xrk) given by: p(m\xrk) = Xm(Hrxrk)
(15.93)
Up to now the derivation was exact. Due to the normalisation constant, however, the characteristic functions violate the Gaussian character of the probability densities. To circumvent this problem we propose the following normal approximation: (15.94) with z™ and X2m given by: (15.95) (15.96) The quantities zf and )?m can be interpreted as the mean and variance of a uniform density given by x m (/). From equation (15.93) and the product formula (Section 15.4.3) we obtain: (15.97) with Kalman-type update equations for x ^ _ j and P ^ _ j , where z™,"kmare analogous to a measurement and a related measurement error variance: (15.98) (15.99) with 'innovation' covariance matrices S™; and 'Kalman gain' matrices W^ 7 given by: (15.100) (15.101) The notation chosen indicates that the effect of road map information on the probability density functions can formally be described by a fictitious measurement, a corresponding measurement matrix and a fictitious measurement error. Now the integration in equation (15.90) can be carried out explicitly as in the previously discussed limiting case. The transformation from road to ground coordinates is thus known. In analogy to the limiting case of straight roads, the inverse transform is simply provided by individually projecting the densities p(\f \m, Zk) on the road (i.e. after the filtering
step has been performed). Before the subsequent prediction is performed, it seems to be reasonable to apply a second-order approximation to the mixture densities: (15.102)
(15.103)
15.6
Quantitative discussion
The idealised sensor-to-target scenario discussed in the introduction (Figure 15.1) displays features characteristic of military ground surveillance applications with airborne GMTI radar. Based on this example, we quantitatively illustrate the potential gain by exploiting: (i) GMTI sensor modelling, (ii) road map information (simple straight road) and (iii) sensor data fusion. The covariance matrices of the estimates resulting from filtering (equation (15.21)) provide tracking-inherent measures of performance. For the sake of simplicity we confine the discussion to the semi-axes of the position error ellipses in ground coordinates. The mean squared error with reference to the true trajectory (known in the simulation) is a more direct measure of the tracking performance.
15.6.1 Simulation parameters The GMTI sensor reports are randomly generated according to the detection probabilities of the individual sensors as defined in equation (15.14) and displayed in Figure 15.2. The errors of the range, azimuth and range-rate measurements are assumed to be bias free and normally distributed. The corresponding standard deviations Gp, Or of the azimuth and range-rate measurement errors depend on the underlying signal-to-noise-plus-interference ratio, SNIR = SNIR(^, ^ , r^), after clutter filtering according to the discussion in section 15.3.3. The standard deviation ar of the range errors is assumed to be constant. In order to focus on GMTI-speciflc aspects, the spatial residual clutter density is assumed to be small. False returns therefore play practically no role in our simulations. In the following list the chosen simulation parameters are summarised: range error: or — 20 m azimuth error: E^ = 0.2 deg range-rate error: Hf = 0.5 m/s false alarm probability: p/ = 10~6 SNIR0 = 2OdB ro1'2 = 200 km, 50 km dynamics model: vt = 18 m/s 0t = 200 s road map error:
15.6.2 Numerical results Figures 15.15 to 15.17 show the measures of performance plotted over time. We consider the sensors on both platforms individually as well as the result of centralised fusion. For each case the way in which the GMTI model and the fusion with road map information affects the track quality is examined. The temporal behaviour of the performance measures reflects the four different phases of the scenario: 1 2 3 4
high PD far from the notches missing detections due to unfavourite sensor-to-target geometries missing detections due to the target stop decreasing PD when the target is leaving the field of view (sensor 2).
position error semi-axes, km
position error semi-axes, km
15.6.2.1 Gain by GMTI sensor model Unless appropriately handled by a GMTI model, the sensor inherent clutter notch can seriously affect the overall performance of a ground target tracking system. From the
time, min
time, min
position error semi-axes, km
Sensor a No GMTI model no road b GMTI model, no road
position error semi-axes, km
Figure 15.15
time, min
Figure 15.16
Sensor a No GMTI model, no road b GMTI model, no road
time, min
position error semi-axes, km
position error semi-axes, km
time, min
Figure 15.17
time, min
Fusion a No GMTI model, no road b GMTI model, no road
Figures we derive the following conclusions: 1
By successively missing plots due to Doppler blindness, the mean tracking error and filtering covariances strongly increase. 2 In consequence the expectation gates also increase, making the discrimination of false or unwanted returns more difficult. 3 Stopping vehicles escape detection by GMTI radar. An early recognition of this event may be of particular military interest. From these observations it is evident that track loss is the probable consequence in ground situations similar to our example. Even if track reinitiation is successful after some delay, the reestablishment of the track continuity may remain as a difficult and time consuming task. The simulation results show, however, that the refined sensor model can improve the tracking performance: 1
The minor semi-axes of the position error ellipses are strongly, the corresponding major semi-axes are slightly reduced. This is due to the fact that the GMTI model has a larger effect on the range/range-rate estimates than on the azimuth estimates. 2 For this reason, the bridging over periods characterised by Doppler blindness and the recognition of stopping targets are much alleviated, while the discrimination of false returns can be improved by significantly smaller range/range-rate gates.
15.6.2.2 Gain by road map information Further improvements are obtained by exploiting information from digitised road maps: 1
Even without GMTI modelling in the tracking filter, the effect of Doppler blindness on the track accuracy can be alleviated. In the scenario considered, the
road map information is equivalent to an additional range measurement (minor semi-axis). 2 Depending on the sensor/target geometry actually being in effect, the azimuth estimates can be much improved. By this the major semi-axes of the error ellipses are affected. In particular, the impact of the target stop on the track accuracy is reduced. 3 If both information on the current location of the GMTI clutter notch and topographical road maps are exploited, the early recognition of the event 'a target being under track has stopped' can be assisted. 15.6.2.3 Gain by sensor data fusion Even if the GMTI sensor model and road maps are not exploited in the algorithms, sensor data fusion significantly improves the track accuracy. Additional improvements result if this information is taken into account: 1
The effect of Doppler blindness on track accuracy can be reduced even more. We observe a gain by sensor fusion (combined with GMTI modelling) even in the case of stopping targets (i.e. if both sensors do not produce target measurements). 2 The gain by sensor data fusion is not merely due to the increased data rate, but a consequence of the sensor/target geometries considered. Intuitively speaking, the fusion algorithms combine estimation error ellipses rotated with respect to each other. 3 Since the sensor platforms typically move much faster than the observed targets, ground vehicles remain in the field of view a relatively short time (sensor platform 2 in Figure 15.1). Evidently, by sensor fusion, the total coverage can greatly be increased.
position error semi-axes, km
position error semi-axes, km
A direct comparison between Figures 15.15a, 15.16a and 15.17b intuitively illustrates the gain achievable by the proposed methods.
time, min
t igure 15.18
Sensor a No GMTI model, road b GMTI model, road
time, min
position error semi-axes, km
position error semi-axes, km
time, min
time, min
Sensor a No GMTI model, road b GMTI model, no road
position error semi-axes, km
position error semi-axes, km
Figure 15.19
time, min
Figure 15.20
15.7
time, min
Fusion a No GMTI model, road b GMTI model, road
List of variables
otr(<Xr, <x
sensor resolution in range (range-rate, azimuth) mean cumulative revisit interval revisit interval of sensor i = 1 , . . . , ns linearised evolution matrix in sensor (ground) coordinates P^ ant n ° i s e covariance matrix in sensor coordinates
Gf ik_ j Gf k_ j hp H H^
plant noise covariance matrix in ground coordinates height of sensor platform over ground measurement matrix ficticious measurement matrix related to the clutter notch at time tk transmitted radar wave length
X
nc PD p'k' PCD PlD Pr P^ (Pa (pk (Pp 7t^J rk ru Yc Yk (r£, r | )
difference between the r^ and rc detection probability far from the clutter notch weighting factors of the mixture components in p(\k\Zh) mean cumulative detection probability detection probability of sensor / = 1 , . . . , ns resolution probability covariance matrix related to x ^ orientation of the antenna array relative to rp target azimuth angle at time tk (relative to r ^ ) orientation of rp relative to the x-axis (ground coordinates) weighting factors for describing the likelihood function target range (slant) at time tk target radial velocity at time tk radial velocity of a ground patch in sensor coordinates position vector of the target at time tk in sensor (antenna, ground) coordinates r k (**£, r | ) velocity vector of the target at time tk in sensor (antenna, ground) coordinates rp, Yp position and velocity vector of the sensor platform R measurement error covariance matrix a target radar cross section om road map error standard deviation <7r,crr, G(p measurement error standard deviation in range, range-rate and azimuth £ju£_j scalar plant noise variance (parameter of the target dynamics model) tk revisit time ta^g[x|; tk] x^ transformed into ground coordinates tg<-a[xjj?; tk] x^ transformed into antenna coordinates ty^txjj!; tk] x£ transformed into sensor coordinates 0t manoeuvre correlation time (parameter of the target dynamics model) Uk zero-mean white measurement noise with covariance R Vp speed of sensor platform over ground vt limiting speed (parameter of the target dynamics model) V£ zero-mean, white, unit-covariance plant noise Xk (x£, x | , xrk) kinematical target state at time tk in sensor (antenna, ground, road) coordinates xi\k expectation of x/ (prediction: I > k, filtering: / = k, retrodiction: (/ < k) xp(t) kinematical state of the sensor platform at time t Zk ficticious measurement related to the clutter notch at time tk Zk sensor report at time tk (consists of measurements rk,(pk, h) z^ event that at time tk possibly no valid detection occurs
Zfc Z*
set of sensor reports zjf, n = 1 , . . . , ns, at time ^ to be processed temporal sequence of the sensor reports to be processed up to Zk
References 1 ARULAMPALAM, M. S., GORDON, N., and ORTON, M.: 'A variable structure multiple model particle filter for GMTI tracking'. Proceedings of the fifth international conference on Information fusion FUSION'02, Annapolis, USA, July 2002, pp. 927-934 2 BAR-SHALOM, Y., LI, X.-R., and KIRUBARAJAN, T.: 'Estimation with applications to tracking and navigation' (Wiley & Sons, 2001) 3 BLACKMAN, S. and POPULI, R.: 'Design and analysis of modern tracking systems' (Artech House, 1999) 4 BOGLER, P. H. L.: 'Radar principles with applications to tracking systems' (John Wiley & Sons, 1990) 5 KEUK, G. VAN and BLACKMAN, S. S.: 'On phased array radar tracking and parameter control', IEEE Trans. Aerosp. Electron. Syst., 1993, AES-29, 186 6 KEUK, G. VAN: 'Multiple hypothesis tracking with electronically scanned radar', IEEE Trans. Aerosp. Electron. SySt9 1995, AES-31, 916 7 KEUK, G. VAN: 'Sequential track extraction', IEEE Trans. Aerosp. Electron. Syst., AES-34, 1998, 1135 8 KEUK, G. VAN: 'MHT extraction and track maintenance of a target formation', 2002, IEEE Trans. Aerosp. Electron. Syst., AES-38, 19 9 KLEMM, R.: 'Principles of space-time adaptive processing' (IEE Publishing: London, UK, 2002, 2nd edn.) 10 KLEMM, R.: 'Effect of Doppler ambiguities on GMTI radar'. IEE international Radar conference RADAR 2002, Edinburgh, UK, October 2002 11 KIRUBARAJAN, T., BAR-SHALOM, Y., PATTIPATI, K. R., and KADAR, L: 'Ground target tracking with variable structure IMM estimator', IEEE Trans. Aerosp. Electron. Syst., 2000, AES-36, 26 12 KIRUBARAJAN, T., BAR-SHALOM, Y., PATTIPATI, K. R., and KADAR, L: 'Large scale ground target tracking with single and multiple MTI sensors' in BAR-SHALOM, Y. and BLAIR, W. D. (Eds.): Multitarget-multisensor tracking applications and advances III (Artech House, Boston, MA, 2000) chapter 6 13 KOCH, W.: 'Fixed-interval retrodiction approach to bayesian imm-mht for maneuvering multiple targets', IEEE Trans. Aerosp. Electron. Syst., 2000, AES-36, 1 14 KOCH, W.: 'Target tracking' in STERGIOPOULOS, S. (Ed.): Advanced signal processing handbook: theory and applications for radar, sonar, and medical imaging systems' (CRC Press, 2000) chapter 8 15 KOCH, W.: 'GMTI-tracking and information fusion for ground surveillance', Signal and data processing of small targets, San Diego, USA, August 2001, SPIE 4473, pp. 381-393
16 KOCH, W.: 'Expectation maximization for GMTI convoy tracking'. Proceedings of SPIE Signal and data processing of small targets 2002, April 2002, 4728, pp. 450-460 17 KOCH, W.: 'Effect of Doppler ambiguities on GMTI tracking'. Proceedings of IEE international Radar conference RADAR'02, Edinburgh, UK, October 2002, pp. 153-157 18 KOCH, W. and KEUK, G. VAN: 'Multiple hypothesis track maintenance with possibly unresolved measurements', IEEE Trans. Aerosp. Electron. SySt., 1997 AES-33, (3) 19 KOCH, W. and KLEMM, R.: 'Ground target tracking with STAP radar', IEE Proc, Radar Sonar Navig. Syst., Special issue: modelling and simulation of radar systems, 2001,148, (3), invited paper 20 LI, X. R.: 'Hybrid estimation techniques' in LEONDES, C. T. (Ed.): Control and dynamic systems (Academic Press, San Diego, CA, 1996) 21 MALLICK, M., MASKELL, S., KIRUBARAJAN, T., and GORDON, N.: 'Littoral tracking using particle filter'. Proceedings of the fifth international conference on Information fusion FUSION'02, Annapolis, USA, July 2002, pp. 935-942 22 MORI, S., CHONG, C-Y., TSE, E., and WISHNER, R. P.: 'Tracking and classifying multiple targets without a priori identification', IEEE Trans. Autom. Control, 1986, AC-31, 401 23 NICKEL, U.: 'Monopulse estimation with subarray-adaptive arrays and arbitrary sum and difference beams', IEE Proc, Radar Sonar Navig., 1996, 143, (4), pp. 232-238 24 TITTERINGTON, D. M., SMITH, A. F. M., and MAKOV, U.E.: 'Statistical analysis of finite mixture distributions' (John Wiley & Sons, 1985) 25 ULMKE, M. and KOCH, W.: 'On road-map assisted GMTI tracking'. Proceedings of DGON German Radar symposium GRS'02, Bonn, Germany, September 2002, pp. 89-93 26 ULMKE, M.: 'Improved GMTI-tracking using road-maps and topographical information'. Proceedings of SPIE Signal and data processing of small targets 2003, July 2003, 5204
Part VI
Space-fast time techniques
Chapter 16
Superresolution and jammer suppression with broadband arrays for multifunction radar Ulrich Nickel
16.1
Introduction
Broadband superresolution and interference suppression have been topics of intensive research since the early 1980s. At that time the pioneering work of Schmidt appeared [1] and a broad literature on adaptive arrays was already available [2]. It is impossible to summarise all the work that has been published since then in a few pages. Therefore, we focus here on special aspects of the problem. The intention of this chapter is to give a fairly compact overview of some of the key ideas on broadband superresolution and interference suppression which are applicable for the special case of a multifunction radar. There is always a request for performance enhancements leading to arrays with larger bandwidth. In particular for airborne radar, there is a great interest in providing a search and track radar with an additional target recognition feature by high range resolution or with an additional imaging mode based on SAR (synthetic aperture radar) processing. The typical feature of a multifunction radar is that it has a high gain antenna, which means an array antenna which consists of thousands of elements. The array elements are summed up in analogue technology into subarrays which are further processed digitally. Our main interest is to point out in which way we can extend the usual narrowband processing with such a complicated array to broadband applications. We will therefore consider in particular simple cost-effective methods. Sophisticated methods that can only be applied to special antennas (e.g. uniform linear equispaced arrays, typically dedicated SAR antennas) are not reviewed. The subarray formulation is quite useful because formally it can be easily extended to beam-space methods, which is just the case of fully overlapping subarrays covering the whole aperture. The results are presented for a generic planar array. However, the formulation is general enough such that it
can be extended to volume arrays. In this sense we have tried to give a fairly general presentation of the material. A special constraint of an array with digital subarrays is that some beam steering is always applied at the elements, either by phase shifting at the elements and/ or by additional time delays at the subarrays. Conceptually this is no severe restriction against a non-steered array, because for the broadside look direction all steering quantities are zero. Another restriction in this chapter is that we do not consider broadband methods that use information about the desired signal. Clearly, this is important information which allows the application of certain narrowband techniques. For example, in stretch processing a chirp signal is transmitted and demodulation with this chirp leads to a narrowband sliding frequency window. In stepped frequency methods, which may be considered as the discrete version of stretch processing, successive narrowband signals covering a broad bandwidth are transmitted and are correctly frequency processed on receive. The estimation of the required quantities from finite samples (covariance matrix, eigenvalues etc.) is already a broad field of research in the narrowband case. To obtain patterns with a specific main beam shape and sidelobe level additional constraints are applied to the adaptive weights. Broadband processing involves further considerations for estimation from finite samples (length and shape of the time windows, Fourier transformation length, frequency weightings). These issues are beyond the scope and the admissible length of this chapter. Therefore, we compare all methods on the basis of the true statistics, i.e. we are working only with the asymptotic covariance matrices. In the adaptive beamforming/space-time filtering context this is sometimes called the clairvoyant filter. The use of asymptotic covariance matrices shows the differences in performance of the underlying models regardless of finite sample effects. However, to make the considerations realistic we always allow typical channel errors such as delays, filter ripple, I- and Q-demodulation errors. This is achieved by the asymptotic error model developed in Section 16.2.3 which is extended for the complicated array with subarrays. In the next section we introduce our notation and discuss extensively the concept of subarrays and beamforming, the associated patterns and the error models. In Section 16.3, the different solutions for broadband superresolution are presented. For passive sensors this is the same as adaptive beamforming (ABF), because there is no criterion to discriminate between desired target and interference. For active sensors like radar, we can measure the interference alone when no signal is transmitted, and the interference-only covariance matrix can be used for jammer cancellation. This type of jammer cancellation is the topic of Section 16.4. However, we restrain this section to commenting on the main principles and on the differences in applying the methods of Section 16.3.
16.2
Broadband array signal model and beamforming
In this section we develop a model for the array element and beam outputs and for the asymptotic quantities like covariance matrices that are used in the algorithms.
We start with a general formulation for three-dimensional volume arrays. Explicit formulations are given later only for planar arrays. Detailed and theoretically sufficient descriptions of the array output data have been given in several textbooks. We follow here the development given by Bohme [4]. For details on the homogeneous wave field model, signal representations and general beamforming a study of the chapters of this book is recommended.
16.2.1 Received signal and notation 16.2.1.1 Plane wave at single frequency Suppose we have an array with N elements at the positions (JC/, V/, Zi)9 i = 1, •. •, N. The direction of a plane wave impinging on the array can be described in different parameterisations (angles). We use a parameterisation-free description given by the unit direction vector u (||u|| = 1) in the cartesian (x, y, z)-coordinate system. A plane v/ave (elementary wave) with amplitude Ar from direction Uo = (wo,uo, WQ)T at frequency / at the position r/ = (JC/, yt,Zi)T can then be written as: Si =
AreJ^fteJ2nfrfu0/c
where c is the velocity of light. For planar (and also linear) arrays with elements at positions (JC/, v/), i = 1 , . . . , N9 a signal with deviation from the centre frequency, / = Qf/o, is indistinguishable from a signal at angle auo at centre frequency, because / • (JC;«O + v/i>o) = /o • (xiccuo + v/Q?t;o). Another interpretation is that the array positions seem to be stretched if a higher frequency signal is observed with receivers at the centre frequency. This may introduce bias and grating effects. Clearly, such errors are zero for boresight direction and increase with the angle of incidence uoMost of the techniques considered in Section 16.3 are concerned with the reduction of this bias. This angle shift does not appear for arrays with true three-dimensional element distribution (volume array or crow's nest antenna [5]), because the third component of Uo, w, is not independent of w, v9 as ||uo|| = 1. There is no way of aliasing auo with a /o. It is a fundamental property of three-dimensional volume arrays that squint, bias and grating effects due to frequency do not appear. The three-dimensional volume array has an inherent bandpass filtering property. Accordingly, the techniques of Section 16.3 will serve for three-dimensional arrays only to enhance the signal-to-noise ratio, not to reduce a bias due to frequency shift. For the detection of such an elementary wave we have to sum up all array elements phase coherently in a beamforming operation ysum = Xl/=i tfst with bt = e-i2nafri K°/c, according to matched filter theory. Of course, the true signal frequency is in general unknown and usually the centre frequency or different frequencies of a frequency filter bank are used. Beamforming with the centre frequency phases produces for a planar array the aforementioned angle shift by the factor a or a relative angle error [Cy = a — 1.
The relation between bandwidth and angle error for planar arrays can be characterised by the bandwidth factor [3] defined as: K- Ti —
bandwidth (%) boresight beamwidth (deg)
The bandwidth factor is used to calculate the admissible bandwidth from the maximum scan angle and the tolerable angle error in beamwidth at this scan angle, if narrowband beamforming at centre frequency is performed. The magnitude of KB typically ranges from 1 to 2, e.g. we obtain for a maximum scan angle of 60° and for a maximum acceptable squint angle of 1A of the beamwidth a value of KB = 1 [3]. 16.2.1.2 Broadband band-limited signals If a broadband signal is received we have a superposition of all frequencies in the band. Such a signal can be described by a homogeneous plane wavefield, see Reference 4, which is a special stochastic process in time and space. It can be written in the following notation (Cramer representation):
(16.1) where /o is the centre frequency and B is the receiving bandwidth. Zs(f) is a complex stochastic process with independent increments, i.e. E{Zs(f)} — 0 and
E{dZs(f)dZ*(g)} = Ps(f)&(f-g)dfdg
and E{dZs{f)dZs(g)} = 0. Ps(f) denotes
the power spectral density of the signal, the asterisk denotes complex conjugation. As the process s is assumed to be real, we have dZs{—f) = dZs(f)* and the signal can also be written as: (16.2) The complex baseband outputs (the I and Q components) are ideally generated by the following demodulation procedure:
I(t, r, u) = LP{s(t, r, u) cos 2nf0t} Q(t9 r, u) = LP{-s(t, r, u) sin27if0t}
(16.3)
where LP{.} denotes the ideal lowpass filtering operation. As described in Reference 6, the complex baseband signal s(t, r, u) = /(*, r, u) + jQ(t, r, u) can then be written as: (16.4)
Such a stochastic signal is not typical of the desired radar echo. The transmit signal is a special deterministically modulated signal (linear/non-linear frequency modulated), but such a signal is included in this model by means of a suitable deterministic measure dZs. The model of equation (16.4) is very useful in describing clutter and interference signals. 16.2.1.3
Narrowband and broadband beamforming
The beamforming procedure is a linear filtering operation which extracts maximum signal energy for all possible signal spectra. From equation (16.4) it is seen that this is achieved by coherently summing up the delayed signals [4]. If the wavefield is sampled by an array at positions r/ = (x;,y;,z;), we have the sum beam output power:
(16.5)
withr/ = x\ u/c and hi = ej2jT^°ri u/c. This is the classical delay and sum beamformer. For narrowband processing the delay is assumed to be small, Bxi <^C /or, or #//o ^C 1, such that the phase shift alone by means of the steering vector b = (fri)i=i,...,# is sufficient:
(16.6)
The entries of the covariance matrix Q 5 are given by:
(16.7)
or in matrix notation: (16.8)
with b = (eJ2^f^)i^_N,
C5 = (fB/B2/2 P5(§ +
fo)e№ri-T*T*
and O denotes the element-wise (Schur-Hadamard) matrix product. P 5 is the signal spectral power density. For rectangular power density we have C 5 = Ps(smc(nB(ri — Tk)))i,k=\,...,N- In the narrowband case f//o ^ 0 and Cs ^ P 5 I, such that Q 5 = PsbbH, and P 5 is the signal power. The element receiver noise is assumed to be independent in each channel and independent of the direction, i.e. the term rTu/c in equation (16.2) vanishes and the
noise alone covariance is: (16.9)
with noise power spectral densities Pnj(f), i = I9.. .,N. The total received signal is composed as a sum of M plane waves plus receiver noise such that with equation (16.4) one has:
(16.10)
with Hi = f_L2eJ2jT^dZn-j(^ + /o) and with obvious definition of the vectors. According to equations (16.8) and (16.9) one obtains the array output covariance matrix R = YIh=I Qs;k + Qn- Narrowband and broadband beamforming is performed with these quantities. The delay operation for broadband beamforming may be too costly if it is applied at each element of a large array. Therefore a hybrid solution is often preferred, where subarrays are summed up with phase shifting at the elements (the delays are small within the subarrays) and where the subarray outputs are summed up with delays (time-delay subarrays). For the application of digital multichannel (array processing) techniques A/D conversion of subarray outputs is needed (digital subarrays). These digital subarrays may be larger than the time-delay subarrays, depending on receiver cost, bandwidth and error considerations [8]. However, the preferred solution is to have a sufficiently large number of time-delay subarrays which coincide with the digital subarrays. This is the solution we assume in the sequel.
16.2.2 Digital beamforming with subarray outputs Although we have formulated the signal model in the preceding section for arbitrary arrays, we will consider in the sequel only planar arrays for notational convenience. 16.2.2.1 Subarray transformation, normalisation and antenna patterns The complex array element outputs x are summed up to subarray outputs. This operation can be described by a subarray forming matrix T, i.e. x = THx. Suppose we have K subarrays, then T is of size NxK. Vectors and matrices at the subarray outputs are denoted by the tilde. At the array elements an amplitude weighting w (tapering, real vector of length N) is often applied to achieve a sum beam with low sidelobes. Also, the switching of the look direction of the array is applied at the elements by phase shifting. Amplitude weighting and phase shifting ej27rfo(XiU°+yiVo)/c is included in the elements of the matrix T. The subarray outputs are converted analogue to digital with a possibly high sampling rate to compensate for delays. The beams are then formed
ADC
Figure 16.1
Subarrays and beamforming
digitally with the subarray outputs by applying a final weighting m; (7 = 1 , . . . , K) as y = mHx. The whole structure is shown in Figure 16.1. For a multifunction radar we need multiple beams for sum, azimuth and elevation difference, guard channel etc., which are all formed with the same subarray outputs. If the full taper weights are applied at the elements the sum beam is calculated from the subarray outputs by simply summing up the channels, i.e. by m = ( 1 , 1 , . . . , l ) r . Let us first assume that we have non-overlapping subarrays, i.e. each element belongs to only one subarray. This results in a subarray forming matrix T with orthogonal columns (the Mi entry of the zth column is non-zero only if the &th element belongs to the ith subarray) and T^T is diagonal. A common portion of the amplitude taper within the subarray may be applied digitally after subarraying. This means that we factor the subarray forming matrix into T = ToD with a diagonal matrix D, applying only To for subarray forming, but using a subarray weighting vector m = D(I • • • \)T. Amplitude weighting is thus distributed over stages (1) and (2) in Figure 16.1. This distribution of the tapering allows different scalings, e.g. for a lower dynamic range of the element attenuation and a better exploitation of the A/D converter dynamic range. As explained in Reference 9 it is also important to scale the signals such that the receiver noise before adaptation is uncorrelated and equal. This preserves the low sidelobe pattern after adaptive beamforming. This means a normalisation such that T^T 0 = I, because for ^{nn^} = a2l we then have £{T;f nn^To) = (T2I. For notational convenience, we will always assume this noise power normalisation. Now let us assume that we have overlapping subarrays. In the extreme case all elements are used for all subarrays. This is then called the beam-space array, and N element outputs are transformed into the same number of beam outputs. In general these beams are not orthogonal and the uncorrelated receiver noise covariance matrix is E{THnnHT] = a 2 T ^ T and is not diagonal. This will lead an adaptive processor to find weights which decorrelate the receiver noise as much as possible and this distorts the desired beampattern. Prewhitening the subarray outputs with (T^T)" 1 / 2 will be necessary to counter this effect. The antenna pattern is defined as the response to a plane wave b(u) and can be written in this notation as: /(u) = I m ^ T ' V u ) ! 2
(16.11)
with bi(u) = eJ27Tf°(XiUJryiV^c'. More precisely, this is the antenna directivity pattern because we ignore the effect of the element patterns. Note that the look direction of the array uo = (wo, t>o) is incorporated in the matrix T. Superresolution methods based on the subarray outputs x = THx have to use as their signal model the response of a plane wave at the subarrays: 6(U) = T77I)(U)
(16.12)
The components of b(u) describe the subarray antenna patterns. This vector as a function of u is called the array manifold. The subarray configuration can be thought of as an array consisting of elements with phase centres at the centres of the subarrays and with element patterns according to the subarray patterns. The centres of the subarrays have to take into account the applied (partial) taper weighting at the elements. As described in Reference 9, the subarray centres are calculated for an applied real valued taper weighting Wk (Jc= I9..., N) a s : (16.13) where Ut denotes the set of indices of elements in the /th subarray. This choice of the subarray centres preserves the same gain and phase in the look direction as the true array manifold. The array of subarray centres is called the superarray. Assuming omnidirectional superarray antenna elements with gains gt = YlkeUt wk (which is equal to the /th subarray gain), we can define a superarray antenna pattern as: (16.14)
16.2.2.2 Generic subarray configuration To show the effects of different manifold models we need a realistic array configuration. For demonstration purposes we use an array with 902 elements on a triangular grid with 32 subarrays as shown in Figure 16.2. This subarray configuration has been optimised using the technique of Reference 9 for narrowband digital beamforming with low sidelobe sum and difference patterns and for adaptive jammer suppression. This array has also been used in References 15-17 and 55. It is representative without having too many elements. The element taper for low sidelobes is chosen as a 40 dB Taylor weighting. As this subarray configuration has been optimised for narrowband beamforming it is questionable if it is also really good for broadband applications. However, the typical features with irregular subarray sizes are representative. The properties of this array are demonstrated by the different patterns defined before. Figure 16.3 shows an azimuth cut of the sum beampattern according to equation (16.11) and the superarray pattern as defined in equation (16.14) with a sinusoid source. The patterns for a source at offset frequency of ±5 percent is indicated by light lines. This squint is considerable for the chosen scan angle wo = —0.8, which corresponds to —53°.
Figure 16.2
Generic array with 902 elements and 32 irregular sub arrays
For regular subarrays without any element tapering the superarray pattern has the typical grating lobes due to the large distances of the subarray centres. The nonexistence of grating lobes in Figure 16.3b demonstrates the careful design of the pseudorandom subarrays. The subarray patterns as defined in equation (16.12) with noise power normalisation T 77 T = I are shown in Figure 16.4. The subarrays with a large extension in the x-direction have a fairly small beamwidth in azimuth. 16.2.2.3 Broadband antenna pattern We can extend the definition of the antenna pattern to the broadband case such that the difference in narrowband BF (phase shifting only) and broadband BF (phase shifting plus subarray time delays) is visible. We define the broadband antenna pattern as the response of the array to a broadband far-field source moving around the array: / ( u ) = ^(Im77T77S(U)I2J = A77T77Q5(U)Tm
(16.15)
with Q 5 as in equation (16.8). For narrowband beamforming (phase shifting only) in a direction (wo, vo) we apply in the matrix T the weights bi (wo, i>o) = ^i i^xyj («o> ^ o)) = w.ej27Tfo(xiuo+yivo)/c a n d t h e v e c t o r b
_ (ej27Tf°Zi)i=i,...,N
in the signal covariance
matrix Q5 has delays X{ = rxyj(u, v) = (x(U -f ytv)/c. For beamforming by timedelayed subarrays we compensate the delays between the subarray centres for the
Figure 16.3 Antenna patterns of generic array with sinusoid source, squinted patterns are produced by source at offset frequencies of ±5%, scan angle u0 = -0.8, V0 = O a 40 dB sum beampattern b superarray pattern direction (wo, fo) such that the steering vector applied in the matrix T is bi(uo, V0) =
bi(Txyj(uo,vo) - rp,k(uo,vo)) = ^ ^ ^ ( ^ ^ - w ^ ^ o ) ) , / e
Ukm
The broadband patterns according to equation (16.15) with narrowband and delayed subarray beamforming are shown in Figure 16.5 for the generic array with 40 dB Taylor taper and with ten percent relative bandwidth. For reference the
Figure 16.4 Antenna patterns of the 32 subarrays, scan angle wo = —0.8, Vo = O antenna pattern with a sinusoid signal at centre frequency is also shown (legend 'PhS/narrowB'). For the broadside look direction the delay subarray pattern is identical to the narrowband pattern and the sidelobe level below —40 dB is retained. For off-boresight scan angles the main beam with narrowband BF broadens due to the superposition of the different frequency squints, while the main beam shape of the delay subarray pattern is preserved. However, the sidelobe level increases considerably. The reason for these higher sidelobes is basically grating effects [8], which appear here at a reduced level because of the irregular subarrays. The preservation of the main beam shape will be an important feature for superresolution methods, where a good model for the array manifold is needed.
16.2.3 Influence of channel imperfections We will study the effect of errors by analysing the array output covariance matrix. The influence of I and Q demodulation errors and bandpass filter errors has been described in References 6 and 7. We use here the notation of Reference 6. We start by describing the errors of an array with single elements and then extend the results to subarrays. These formulas will be used later to build subband covariance matrices or space-time covariance matrices. Therefore, we will extend the notation to include a common delay and the signal band as a subband of the receiving bandwidth. 16.2.3.1 I and Q demodulation errors An amplification error R(H) for the I (Q) channel, orthogonality error (p(^) and offset error F(G), and delay error 8 in analogue IQ demodulation, can be described
PhS/broad-B delay sub/broad-B PhS/narrow-B
sin, theta
PhS/broad-B delay sub/broad-B PhS/narrow-B
sin, theta
Figure 16.5 Antenna patterns for broadband source and narrowband BF and subarray delay BF, sinusoid pattern added for reference a look direction 0° b look direction -53° (u = -0.8) by extending equation (16.3) to: /(O = R L?{s(t - 8, r, u) cos(2nf0t -
- S9T9U) sm(2nf0t - x/r)} + G
(16.16)
This leads to a complex baseband signal description [6]:
(16.17) with p = ReW + He^9 q = Re~w - He'M and g = F + jG. In addition to the interchannel delay error we may consider a common delay ^o to characterise space-time correlation. The covariance matrix Q5(So) = E{(s(t9r(9u)s* (t + 60, r&, u))i^=i,...,iv} can then be written as described in Reference 6 by: (16.18) with the following quantities. To model subband covariance matrices of the receiving bandwidth as used in Section 16.3.2.2 we assume a rectangular signal power spectral density with centre frequency fs and bandwidth Bs such that [fs — Bs, fs + Bs] C [fo-B,fo + B], Then we obtain b = (ej2nfsTi)i=\,...,N with X1 = ifii/c, a n d + HteM), qt = e~illx^(R1C^ - Hte-^), i = I9..., N9 Pi = e^f^iRteM and: (16.19) with the offset frequency A / = fs — /0. For the receiver noise covariance matrix we obtain: (16.20) The complete array output covariance matrix results in: (16.21) With these formulas we can build subband and space-time covariance matrices. 16.2.3.2 Bandpass filter errors For general bandpass filter functions the integrals cannot be solved analytically. However, for the special technically important case of a sinusoidal ripple over the bandwidth, they can be solved. Let the baseband filter functions have the form
elsewhere
(16.22)
si < 1 is the ripple amplitude, /i/ counts the number of ripples and <5; is a linear phase slope or a delay. With this we have the signal model: (16.23) Again we assume a subbandwidth [fs — Bs, fs + B8] C [/o — B, /o + B]9 which we may incorporate into the definition of dZs. As calculated in Reference 6, this model leads to a signal covariance matrix: (16.24)
(16.25) and with:
with dik =
For the receiver noise covariance we obtain:
(16.26) 16.2.3.3 Errors in subarrayed arrays and eigenanalysis The errors described in the preceding section are formulated for array element outputs. For an array with subarrays we assume that these errors occur at the subarray outputs. This is realistic from the hardware realisation viewpoint, because the dominant errors are often introduced by the filtering and IQ demodulation process. However, the
Figure 16.6
Transformation of errors from subarray level to element level
stochastic signal model of equations (16.17) and (16.23) and the resulting covariance matrices are defined at element level. As we can interchange summation and integration in the calculation of the covariance matrices with errors (equations (16.18) and (16.24)), we simulate computationally all effects at element level. This is done by transforming the errors at subarray level to element level by assuming identical error quantities within each subarray. The errors are distributed to element level by the subarray selection matrix To, which contains ones at the (*,A;)th entry only if the /th element belongs to the Mi subarray, or, (To);*: = 1 for T/& ^=. 0 and zero elsewhere. This transformation is illustrated in Figure 16.6. The subarray delay error vector 8 e CK is equivalently represented by an element delay error 8 = To^ G C^. For weightings and quantities involving the signal power (ripple and IQ errors) we have to transform subarray vector error quantities h as h = T 0 (T^To) - 1 Ii e CN, which implies T^h = h, and this ensures that all powers add up to the desired value. This transformation from subarray to element level is a powerful tool for simulating errors in a subarray or beam-space context. A first analysis of error effects, bandwidth effects and beamforming type can be done in very general terms by studying the eigenstructure of the subarray output covariance matrix. The following examples are calculated in this way with the generic subarray configuration of Section 16.2.2.2 with a partial 4OdB Taylor weighting such that at the subarray outputs equal receiver noise power is present (noise power normalisation). We consider a standard scenario with the errors and the location of the sources as given in Table 16.1. This scenario shows a fairly strong dispersion due to bandwidth. Figure 16.7 shows the eigenvalues for narrowband beamforming (phase shifting only) with and without errors and also the case of zero bandwidth for reference. The bandwidth and the random channel errors lead to an approximate doubling of the dominant eigenvalues. This is the typical dominant eigenvalue leakage effect, which makes the source power, which is concentrated in the narrowband error-free case in the three dominant eigenvalues, leak into the smaller eigenvalues. Also, the noise eigenvalues become unequal. The amount of leakage depends on the signal power, the bandwidth and the amount of error. It is important to note that this is typically a strong signal effect. The leakage eigenvalues are a certain level below the dominant eigenvalues and are merged into the noise eigenvalues for weak sources.
Table 16.1
Scenario 1
Three sources:
directions - 5 3 ° , - 2 4 ° , 0° (u = - 0 . 8 , -0.4, 0.0)
Relative bandwidth: IQ errors:
10% I and Q errors complex-normal distributed such that amplitude std is 1 dB cosine on rectangular characteristic, cosine amplitude variation 1 dB std, cosine period 20% variation
Filter ripple errors:
element S N R = 12, 12, 12 B
additional 0.2% delay errors (BT = 0.002) additional 0.2% delay errors (BT = 0.002)
Array look direction: (phase shifters and/or subarray delays)
no B no errors elQ eripple
number of eigenvalues
Figure 16.7
Eigenvalues ofcovariance matrix at subarray level, narrowband beamforming by phase shifting at elements, scenario 1, insert figure shows eigenvalues from estimated covariance matrix from 64 snapshots for error-free case (corresponding to 2nd bar in figure)
Next Page
no errors elQ eripple
number of eigenvalues
Figure 16.8
Eigenvalues of the covariance matrix at subarray level with and without errors, beamforming with time-delay subarrays, scenario 1
The common belief that, with high signal-to-noise ratio, performance converges to the ideal error-free case is not true for this type of error. The eigenvalue leakage will in particular complicate the determination of the number of dominant eigenvalues, which is important for the application of subspace methods for jammer suppression in Section 16.4 or superresolution methods in Section 16.3. For covariance matrices estimated from finite samples the leakage effect will be even stronger. This is indicated by the insert figure, which shows the error-free eigenvalues with bandwidth for 64 samples, which is the value that would give a 3 dB loss according to Brennan's rule [39]. Figure 16.8 shows the improvement that can be gained by subarray delay beamforming. By focussing on the most dispersed source at u = —0.8 we obtain a reduction of the eigenvalue leakage. The fifth eigenvalue is now 11 dB above the noise level as compared with the 19dB before. The overestimation of the dimension of the dominant subspace due to errors will be less. Note that in both cases the different types of error have little influence on the leakage effect (for the same amplitude std we have in the given examples). The eigenvalue leakage is in these examples mainly determined by the bandwidth.
16.3
Superresolution with broadband arrays
All processing in a multifunction radar starts with sum beamforming and signal detection, because this provides the first big reduction of the data rate. Superresolution is
Previous Page
no errors elQ eripple
number of eigenvalues
Figure 16.8
Eigenvalues of the covariance matrix at subarray level with and without errors, beamforming with time-delay subarrays, scenario 1
The common belief that, with high signal-to-noise ratio, performance converges to the ideal error-free case is not true for this type of error. The eigenvalue leakage will in particular complicate the determination of the number of dominant eigenvalues, which is important for the application of subspace methods for jammer suppression in Section 16.4 or superresolution methods in Section 16.3. For covariance matrices estimated from finite samples the leakage effect will be even stronger. This is indicated by the insert figure, which shows the error-free eigenvalues with bandwidth for 64 samples, which is the value that would give a 3 dB loss according to Brennan's rule [39]. Figure 16.8 shows the improvement that can be gained by subarray delay beamforming. By focussing on the most dispersed source at u = —0.8 we obtain a reduction of the eigenvalue leakage. The fifth eigenvalue is now 11 dB above the noise level as compared with the 19dB before. The overestimation of the dimension of the dominant subspace due to errors will be less. Note that in both cases the different types of error have little influence on the leakage effect (for the same amplitude std we have in the given examples). The eigenvalue leakage is in these examples mainly determined by the bandwidth.
16.3
Superresolution with broadband arrays
All processing in a multifunction radar starts with sum beamforming and signal detection, because this provides the first big reduction of the data rate. Superresolution is
applied as an additional processing step for special radar tasks to improve the results. The reason for this hierarchy lies in the sensitivity of superresolution methods to errors and the possibility of producing artefacts in complicated scenarios. There are three main applications for superresolution methods: a
Angle estimation of continuous wave (cw) emitting sources. In this case the resolution requirements are most severe due to the lack of other parameters for source discrimination. b Simultaneous angle and frequency resolution. Typically this is done in sonar. The angle and frequency history is displayed in a waterfall plot. c Resolution only of the desired signals (echoes) using the knowledge of the (emitted) signal spectrum. There are a couple of broadband techniques which allow a reduction to narrowband processing. For example, for a chirp transmit signal we can demodulate with the same chirp (deramping, stretch processing), or one can coherently combine several narrowband signals (stepped frequency processing). In this section we want to present general techniques for mitigating bandwidth effects. Therefore we limit the scope to applications a and b. The number of superresolution methods that have been published in the last decades are considerable. Some fundamental papers are References 1, 4, 10-14 and 29. The goal here is to give an overview and not a discussion of all the special problems of these methods. In addition, many of these methods are closely related. The generic array we consider has digital outputs from irregular subarrays. This means that methods that require a special structure, like the uniform linear array (ULA) or Toeplitz structure of the co variance matrix, or certain shift invariances, like the ESPRIT method, will not be considered. With these remarks we are only interested in generally applicable methods which are described in summary in Reference 15, like Capon's method, MUSIC [ 1 ], the lean matrix inversion (LMI) method, the deterministic or stochastic maximum likelihood (ML) method [4], the weighted noise subspace fitting (WNSF) [14], or covariance matching estimation technique (COMET). In order to visualise the differences between the various broadband techniques we use the well known MUSIC method. This method provides a simple and meaningful graphical display of the source distribution (in contrast to parametric methods, like deterministic/ stochastic ML, COMET) and has the advantage of little dependence on additional weighting factors (e.g. as in WNSF). MUSIC is considered as a kind of standard reference method for superresolution. However, we are aware that the sensitivity to errors and bandwidth effects is different for the various methods [15]. The MUSIC method calculates an estimate of the angular power spectral density as: SMUSIC(U)
= b^WbW/b^WP^bW
where P^ = I — U5 U^ is a projection on the complement of the signal subspace spanned by the columns of the orthonormal (unitary) matrix \JS e CKxM, K is the number of subarrays and M the (estimated) dimension of the signal subspace (dimSS). The usual way to estimate a basis of the signal subspace is to estimate the covariance matrix from the subarray outputs R = - ^ - ]T^t=i x(tk)xH(tk) and to
determine dimSS from the number of dominant eigenvalues with some criterion like AIC, MDL, or threshold tests, possibly robustified, see References 4 and 16 for various methods. As we have indicated in Section 16.2.3.3 this may be a critical procedure for the performance. Clearly, if we have a complete orthonormal system (U?, Vn), the projection P^ = I — VSV^ is identical to the projection Yn = VnVn^, called projection onto the noise subspace. The function b(u) describes the array manifold. For our generic array the length of this vector varies with u and it is therefore important to normalise the array manifold vector to length one. The array manifold incorporates a priori information about the signal model with respect to angle and frequency, beamforming and calibration. The generic array with subarray outputs already contains a partial beamforming operation (phase shifting for narrowband BF or phase shifting and subarray time delays) which must be accounted for by choosing b(u) = b(u — Uo) with b as in equation (16.12).
16.3.1 Spatial-only processing of broadband data This section covers methods that are simple extensions of standard narrowband processing and which are, from a hardware point of view, very easy to implement. The time samples are only used for averaging in forming the covariance matrix estimate. The covariance matrix is then processed and compared with a plane wave model. The methods presented in this section differ in the applied array manifold model (subarray patterns) for both types of subarray data, narrowband and subarray time delayed. The estimation (measurement) of all subarray patterns may be a problem for large arrays with different subarrays. In particular in the subarray sidelobe region the patterns may be insufficiently predictable. It has therefore been suggested in Reference 17 using a directionally uniform manifold model which takes into account only the subarray centres and gains (direct uniform model, DUM). This is basically the manifold model used for the superarray pattern with b(u) = b(u — uo) and b as in equation (16.14). Such a model is easier to calibrate than the full array manifold. The drawback of the simple model is that superresolution is only achieved in the vicinity of the look direction of the array (local or spotlight superresolution). However, under realistic conditions superresolution procedures may give unreliable results in the subarray sidelobe region because of measurement and calibration errors. Under these circumstances the DUM model helps us to ignore unwanted sidelobe sources. We will show that such a simple model can also be useful for superresolution with broadband data. The use of time-delayed subarrays for broadband bearing estimation has been introduced in Reference 18. Time-delayed subarrays have a focussing effect for the look direction. On the other hand, targets that are far away from the look direction suffer more angular dispersion. However, the dispersion is less than for narrowband BF, compare the source at u = —0.8 in Figure 16.9a and the source at u = 0 in Figure 16.9b. If we are interested in superresolution only in the vicinity of the look direction this method can be well combined with the DUM model. The performance of these methods is shown in the following Figures.
truAM/eripple/M= 6 DUM/eripple/M=6
Figure 16.9
MUSIC for scenario 1 with true and DUM manifold model, narrow and broadband BF a narrowband BF b element phase shifting and subarray time-delay BF truAM/eripple/M= 5 DUM/eripple/M=5
Figure 16.10
truAM/eripple/M= 5 DUM/eripple/M=5
truAM/eripple/M= 4 DUM/eriDole/M=4
MUSIC for scenario 1, reduced dimensions of signal subspace (dimSS) narrowband BF a dimSS = 5 b dimSS = 4
Figure 16.9 shows the performance with MUSIC for the true array manifold and the DUM model for scenario 1 including filter ripple errors. The dimension of the signal subspace (dimSS) was chosen according to the eigenvalue spectrum shown in Figures 16.7 and 16.8. As a rule of thumb we accounted only for eigenvalues 1OdB above noise. This gives dimSS = 6 and 5 for the narrowband and delayed subarray BF case, respectively. The grey shading indicates the 3 dB beamwidth of the full array. In Reference 16 it has been found that by ignoring some leakage eigenvalues the angular dispersion due to bandwidth and errors can be reduced. Figure 16.10 shows the patterns for the narrowband BF case with reduced dimension of the signal subspace. For moderately off-boresight targets some reduction in angular dispersion is obtained, see the target at u = —0.4 for dimSS = 5, cf. dimSS = 6 in Figure 16.9a.
Table 16.2 Scenario 2 Sources:
directions -48.6°, - 4 6 ° , 30° (u = -0.75, -0.72, 0.5)
Relative bandwidth: Filter ripple errors:
10% cosine on rectangular additional 0.2% delay errors (Br = 0.002) characteristic, cosine amplitude variation 1 dB std, cosine period 20% variation -47.3° (M0 = -0.735)
Array look direction: (phase shifters and/or subarray delays)
powers S N R = 1, 12, 12dB
On the other hand there is the danger of suppressing targets if we are interested in the whole field of view. So far we have considered scenarios without any challenging resolution problems, which are in fact the motivation for using superresolution methods. Scenario 2 described in Table 16.2 includes a severe resolution problem: two closely spaced sources separated at about half the 3 dB beamwidth of the array (BW = 0.056 or 3.2° at boresight). In addition the two sources are very different in power. The detection of a weak source close to a strong source is an important practical task. For the errors we consider only one representative type, passband ripple errors, as there are only small differences between the error types. The eigenvalues for this scenario are shown in Figure 16.11. We can identify six dominant eigenvalues for the narrowband data and five for the broadband time-delay subarray data. The MUSIC patterns for this scenario corresponding to Figures 16.9 and 16.10 are shown. Clearly, the dispersion due to the bandwidth prohibits resolution of the closely spaced sources. A reduction of the dimension of the signal subspace can partially reduce the dispersive bandwidth effects in the narrowband case, see Figure 16.13. The resolution with the time-delayed subarrays is good. There are other means of making superresolution methods robust against bandwidth effects, besides the control of the dimension of the signal subspace. Methods to modify the array manifold have been proposed by Agrawal and Prasad [20]. They derive an appropriate array manifold model from the theoretical covariance matrix with bandwidth. This leads them to a manifold model with a direction-dependent amplitude taper u>/ given by the sine function appearing in equation (16.8) for rectangular spectrum, W( = sinc(7r/?(r; — ri)), if array element number 1 is chosen as a reference channel, for example. This means that the rectangular bandpass filter characteristic is assumed to be known. With this manifold model they derive two new superresolution methods: a
The multidimensional method. This method finds the unknown directions u/ by minimising tr{PgR}, where Pg is a projection orthogonal to the columns of the
NB/eripple BB/eripple
number of eigenvalues
Figure 16.11 Eigenvalues of covariance matrix of scenario 2, narrowband and broadband BF with filter ripple errors matrix B, each column b(u/) of B describes the tapered manifold model for the /th source and tr is the trace function. This is in fact the deterministic ML method. b Single-dimensional method. This method minimises YIk=^ |k(u)v*l2> where Vjt is an orthonormal set of vectors spanning the noise subspace and b is the tapered array manifold. Obviously, this is the same as the MUSIC method with direction-dependent amplitude taper. As is known from antenna theory, such a taper increases the beamwidth and thus makes the beam less sensitive to extended sources, but it reduces the resolution. The performance of the MUSIC method with and without this direction-dependent tapered array manifold model (the single-dimensional method) is shown in Figure 16.14 for scenarios 1 and 2. Comparing the two curves one can see the obvious improvement. Although this method does not give the resolution one would get with time-delayed subarrays (as in Figure 16.12b) it gives a better display of the two closely spaced unequal sources than without tapering and better than with reduced dimSS as shown in Figure 16.13a. If this model is applied to time-delay subarray outputs one has to combine the direction-dependent taper with the array look direction, i.e. one has to use a weighting Wi = sinc(7tB(Txy;i(u,
v) - rp-j(uo, VQ) - T x ^i(u, v) + rp;\(wo, VO))) to account for
the effective delays. In this case an improvement far off the look direction is obtained where dispersion due to the time-delay steering occurs. The application of the DUM model with this tapering and with time-delay subarrays does not make sense.
truAM/eripple/M=6 DUM/eripple/M=6
Figure 16.12
MUSIC for scenario 2 a narrowband BF, dimSS = 6 b subarray time-delay BF, dimSS = 5
truAM/eripple/M= 4 DUM/erinnle/M=4
Figure 16.13
truAM/pripple/M= 5 DUM/ehpple/M=5
truAM/eripple/M= 3 DUM/eripple/M=3
MUSIC for scenario 2 with reduced dimensions ofdimSS a narrowband case with dimSS = 4 b delayed subarray case with dimSS = 3
This section can be summarised as follows: 1
The bandwidth transforms point targets into extended targets, which cannot be distinguished from a cluster of closely spaced targets by the superresolution method. If high resolution is of interest bandwidth compensation methods are needed. 2 The delayed subarray outputs transform point targets back into point targets, however, only in the array look direction. Targets at boresight, which remain point targets for narrowband subarrays, are in fact broadened in angle. 3 The dimension of the signal subspace is a means for controlling bandwidth (and error) effects. The problem is to reduce the dimension so moderately such that no target information is lost. Robust criteria to determine the effective subspace rank as discussed in Reference 19 are important. From the theoretical eigenvalue
truAM/eripple/M= 6 truAM+tap/eripple/M= 6
Figure 16.14
truAM/eripple/M= 6 truAM+tap/eripple/M= 6
MUSIC with and without tapering for bandwidth, narrowband BF and true array manifold a scenario 1, dimSS = 6 b scenario 2, dimSS = 6
distributions obtained from equations (16.18) and (16.24) the expected leakage levels can be determined and with this knowledge thresholds and loading factors as suggested in Reference 19 can be set accordingly. However, only limited reduction of the bandwidth effects is achievable by the choice of dimSS. 4 The simple DUM model is useful, in particular for delayed subarrays. It is especially suited for concentrating the improvements which may be gained with reduced subspace dimension into the region of interest, i.e. the look direction. This method is attractive because of the low calibration effort. 5 A direction-dependent tapering according to the bandwidth (the Agrawal-Prasad method) improves the performance for narrowband BF significantly. This method can be combined with the DUM model and is also attractive because of the low calibration effort. We have not shown figures with the DUM model because qualitatively they look very similar to those in the previous plots. For broadband BF this method is useful to reduce the artificial dispersion introduced by the steering time delays outside the look direction.
16.3.2 Space and time processing methods In this section we assume that the information of the time series of data snapshots x(t) is exploited coherently. Most methods Fourier transform the data x(^) (by independent periodogram estimates) into narrow subbands x(/ n ), n = 1 , . . . , Nf9 where Nf is the number of subbands. From the subband data the cross spectral density matrix (SDM) or the frequency domain covariance matrices R(Z72) are estimated. Of course, for small size of temporal data there is considerable cross-talk between the frequency channels. For each subband we can apply MUSIC or any other superresolution method independently. The problem is how to combine the results for different frequencies. The STAP method of Section 16.3.2.3 uses the time data directly.
16.3.2.1 Incoherent addition of subband estimates The simplest solution is to average all MUSIC patterns incoherently over the subbands. This has been proposed by many authors, e.g. Wax, Shan and Kailath [10]. Bandwidth effects are then only present according to the subbandwidth and (with exact covariance matrix) the MUSIC patterns are asymptotically exactly equal to those with the subbandwidth only. A particular problem arises with steered subarrays. The phase shifter settings are not correct for the subbands. This introduces biases in the subband MUSIC patterns and leads to a broadening of the averaged pattern (see also the similar problem with CSST in Section 16.3.2.2). The main practical problems arise from estimating the SDM from finite samples. We will not give further details on that. Simple averaging of all patterns may not result in good resolution properties. In particular, the variance is increased by the incoherent summation and this degrades the resolution performance. Using different types of averaging has therefore been suggested [10], e.g. the geometric mean instead of the arithmetic mean. However, this is a trade off between statistical stability and resolution. Techniques for coherent integration of the SDM instead of incoherent pattern averaging are therefore preferred. 16.3.2.2 Coherent subspace transformation (CSST) The idea of coherent subspace transformation is to shift the subband covariance matrices R(/ w ) (n = 1 , . . . , Nf) into the centre subband by suitable transformation matrices T(/ n ) by T(fn)R(fn)T(fn)H. This idea was first presented by Wang and Kaveh [21]. The problem is the choice of optimum transformation matrices, as these matrices require knowledge of the unknown target directions. The suggested procedure is to first perform a rough source location estimate by conventional beamforming. From this the focussing directions are estimated, perhaps extended by clusters of directions where clusters of sources are assumed. Following this a coherently averaged covariance matrix R = X]«=i T ( / w ) R ( / w ) T ( / n ) ^ is calculated for which the narrowband MUSIC pattern is calculated as usual. Various methods for calculating the transformation matrices have been suggested. In general, the transformation matrices are found by fitting the transformed array manifold to the centre frequency: (16.27) for a suitable matrix norm. The columns of the matrix B(/ w , 6) contain the array manifold model vectors for M different directions U/, 0 = (ui,.. . , U M ) , and different frequencies fn. A problem arises in that the transformation matrices make the uncorrelated receiver noise coloured which necessitates a prewhitening step, i.e. an estimation of the noise-only covariance matrix R^ and transformation of the full covariance matrix by R^ R Rn . Depending on the type of transformation this can degrade the signal-to-noise ratio (SNR) considerably. It has been shown in Reference 22 that with unitary transformation matrices no SNR degradation is achieved and that the unitary solution of equation (16.27), i.e. a solution with the additional constraint
truAM/3sB eripple/M= 8 truAM/3sB eripple/M= 5
truAM/eripple IM= 8 truAM/eripple /M= 5
number of eigenvalues
Figure 16.15
T(fn)HT(fn)
CSST for scenario 1 (narrowband BF and filter ripple errors), optimum focussing directions a MUSIC, dimSSj= 8 and 5 b eigenvalues of R
= I is given by:
T(Zn) = V(Zn)U(Zn)"
(16.28)
where U(Zn), V(Zn) are the left and right singular vectors of B(Zn, G)B^ (Zo, Q)Some typical features of this method for our generic array and scenarios are given in the following figures. The frequency band (ten per cent bandwidth) is divided into three subbands. The focussing directions are indicated by dots on top of the plots. The following can be observed: 1
If the focussing directions coincide with the true source directions excellent MUSIC patterns are produced (nearly identical to the narrowband case). 2 Although the patterns are very similar to the narrowband MUSIC patterns, the signal subspace has a different and unusual behaviour. There are only three peaks visible in Figure 16.15 but the signal subspace has a much higher dimension (=8). A reduction of this dimension below 6 leads to a signal loss. This imposes some requirements on the procedure for estimating dimSS as the separation of signal and noise subspace will be much less distinct if the subband covariance matrices are estimated independently from finite samples. The reason for this eigenvalue spread is the special array with the subarrays steered into a look direction. The phase shifter setting is not appropriate for the subbands and introduces biases in the MUSIC patterns calculated from the subband covariance matrices. All this is compensated for by the subspace transformation (focussing) matrices and thus also in the patterns, but not in the eigenvalues. The case for data without phase shifting (look direction is boresight) is shown in Figure 16.17. Here dimSS is 5, which corresponds to the eigenvalue spread due to the reduced bandwidth. This eigenvalue spread effect due to such refocussing methods has to be taken into account in the determination of dimSS if subarrayed or, more general, beam-space
truAM/3sb enpple/M= 8 truAM/3sb eripple/M= 8 truAM/3sb eripple/Af=7
truAM/eripple/M= 8 truAM/eripple/M= 8 truAM/eripple/M= 7
number of eigenvalues
Figure 16.16
CSST for scenario 2 (narrowband BF and filter ripple errors) for different focussing directions a MUSIC, focussing directions indicated by dots on top of figure, dimSS = 8, 8, 7 b eigenvalues of R
truAM/3sb eripple/M= 5
truAM/eripple/M=5
number of eigenvalues
Figure 16.17
CSST with narrowband BF without phase shifting (look direction is boresight) a MUSIC patteriMlimSS = 5 b eigenvalues of R
data are used. This implies also that the accuracy of the array manifold model in the subarray sidelobe region has a special importance because the focussing matrices perform a pattern compensation. 3 The performance of CSST methods depends critically on the choice of the focussing directions. Figure 16.16 shows that resolution of the two closely spaced sources is only achieved if the focussing directions are exactly equal to the true directions. Also, one can see that there is no improvement by an iterative application of the method as claimed in Reference 22: for the line (with circles) one
might consider a second peak corresponding to the second weak source, but this is exactly in the focussing direction. The big influence of the focussing direction is also displayed in the very different eigenvalue distributions. 4 CSST methods can easily be combined with the simple DUM model for the array manifold. No examples are given here, but the performance with respect to the true array manifold is the same as in Section 16.3.1. The DUM model is convenient because it allows us to ignore all sidelobe modelling effects. Array interpolation techniques Many researchers have considered other choices of transformation matrices. The main objective is to get rid of the need for a priori knowledge of the source directions. This can be achieved by array interpolation. As noted in Section 16.2.1.1, a signal at frequency / can be interpreted as a signal at centre frequency /o received by a stretched array. This idea has been used in Reference 23 to interpolate the received data by classical techniques (FIR filter and decimation) on a fine spatial grid. The covariances for each subband are then calculated from the grid data corresponding to the centre frequency of the subband. Of course, this can also be interpreted as a special transformation matrix T(/ n ). However, knowledge of the directions as required for equation (16.27) is not necessary here. This calculation of the interpolation filter is only efficient for linear arrays. In Reference 24 the array manifold is interpolated by means of some matrix H obtained from a least-squares fit ||b(/o,u) — H^b(/ W ,u)|| 2 . The relation to equation (16.27) is obvious. It is clear that the matrix H is a very bad interpolation if it is applied for all directions u. As noted in Reference 24 clusters of directions will be formed which finally lead to the original CSST approach and hence will give similar performance. In Reference 25 the array manifold has been expanded into an infinite series of Bessel functions. If this series is truncated for sufficiently small Bessel terms a representation b ( / n , u) = Grtw(w) with W((u) = e~Jm is obtained. This is a convenient factorisation in angle and frequency terms. The applied transformation matrices for the subband covariance matrices are then chosen as T(/ w ) = Go(G^G^)" 1 G^ and for the MUSIC method the array manifold (G^G 0 )" 1 G^b(Z 0 , u) is used after prewhitening. This interpolation has to be extended for two-dimensional arrays. All these interpolation techniques work better for large numbers of sample points in the u-(v-) domain, but in this case they are also numerically more intensive, if no special structures can be exploited. An interesting algorithm, which is however only applicable to regular arrays, has been derived in Reference 26. In this approach the CSST is calculated by equation (16.27) without a priori knowledge of the DOAs by exploiting the Toeplitz and persymmetric structure of the asymptotic covariance matrix for linear equispaced arrays. This is achieved by iterated differencing of type D(/ n ) = R(/ w ) — ER* (/«)E, where the asterisk indicates complex conjugate and E is the exchange (or contraidentity) matrix having ones only on the cross diagonal. It has been shown that by higher-order differencing the matrix converges to a matrix which has the same signal subspace as R ( / n ) . By singular value decomposition we can find a basis
V(fn) of this subspace without knowledge of the signal directions and the desired unitary transformation is then calculated as T(fn) = XJ(fo)\J(fn)H, similar to equation (16.28). A CSS transformation in beam-space called Bi-CSSM is formulated in Reference 27. This approach leads to transformations of the form minx(/n) ||Tobo(O) —T(/W)a(/W, 0) ||2 with a reference steering vector bo. This gives additional flexibility in the design of the transformation. The array interpolation techniques are very similar to the CSST technique and we give no further results on this. WAVES For MUSIC and many other superresolution methods we do not explicitly need the coherently CSST-averaged covariance matrix. Only one representative signal subspace is required. The investigation of such an optimum signal subspace leads to a new statistic called WAVES (weighted average of signal subspaces) introduced by Di Claudio and Parisi [28]. The idea of WAVES is to determine a joint subspace from all subspaces of the subband covariance matrices. The proposed algorithm is as follows: (i)
(ii) (iii)
(iv) (v)
Determine the signal subspaces Vs(fn) of dimension Mn for each subband covariance matrix R(/«), e.g. by eigendecomposition of the covariance matrix and a test criterion like MDL etc. [15]. Compute CSS transformations T(/ w ) and subspace weighting matrices P(/ w ) by some available method, e.g. the CSST by Hung and Kaveh of 16.3.2.2. Compose a new pseudo-data matrix:
From the pseudo-data matrix Z estimate the signal subspace Vs and its dimension M5 by SVD. Calculate the MUSIC pattern as usual with Vs.
For the weighting matrix in step (ii) it is known from weighted subspace fitting (WSF [14]) that the optimum subspace component weighting is obtained by
For the focussing matrices we may choose the method of Hung and Kaveh, equation (16.28). In simulations a performance improvement with this method could be found with respect to the sensitivity against the focussing directions. Without channel errors WAVES produces the same resolution and the same sensitivity to the focussing directions, i.e. under identical conditions, e.g. as in Figures 16.15 and 16.16, we obtain practically the same MUSIC patterns, although the singular value distributions of the pseudo-data matrices are different. Figure 16.18 demonstrates the sensitivity against the selected focussing directions for scenario 2 with filter ripple and delay errors.
truAM/ 3sb eripple/M=8 truAM/ 3sb eripple/M= 8
Figure 16.18
truAM/ 3sb eripple/M-5 truAM/ 3sb eripple/M=5
Influence of focussing directions grid, look direction —47.3° (UQ = —0.735), narrowband BF a MUSIC with CSST b WAVES
truAM/ eripple/M= 5 truAM/ eripple/M= 5
rank(SS)=ll 11 number of singular values (SV)
Figure 16.19
Singular values of pseudo-data matrix from Figure 16.18b for the different focussing directions
To resolve the two closely spaced sources we have chosen a grid of three and four directions indicated by the dots on top of the plots. Obviously, the subspace choice by WAVES gives a better display of the source distribution. The third source is not displayed because no focussing direction is allocated to this source. The singular values of the pseudo-data matrices that were used for the two sets of focussing directions in Figure 16.18b are shown in Figure 16.19. From this we
truAM/3sb eripple/M=5
Figure 16.20
WAVES with focussing directions equal to the maxima of Figure 16.18b, narrowband BF
conclude that the dominant signal subspace dimension is dimSS = 5 in both cases. The different focussing directions have practically no effect on the dominant singular values. The rank of the pseudo-data matrices is 11 in both cases. The dimSS of the subband covariance matrices were all chosen equal to 5 (the same as the final dimSS). If the positions of the two maxima of the WAVES pattern in Figure 16.18b are chosen as new focussing directions the bias is not reduced. This is seen in Figure 16.20. The pattern is practically identical to the ones in Figure 16.18b. This means that an iterative application of the focussing procedure does not approach the true source direction, the same as was observed with CSST. If we want superresolution in the sidelobe region of the subarrays, the look direction of the array is of some importance, if higher resolution is required. We illustrate this by the scenario of Figure 16.18 without steering the array (no phase shifting, look direction equal to boresight). The resolution is then degraded, as shown in Figure 16.21. No difference between WAVES and CSST MUSIC is seen. The choice of the dimension of the signal subspace is critical in this case. The fifth eigenvalue of the CSST covariance matrix is only 5 dB above the noise level (the fourth eigenvalue is 18 dB above noise), but the corresponding eigenvector is essential for high resolution as can be seen from the third curve. The same is true for WAVES, although the difference between the fourth and fifth eigenvalue is less, only 8 dB. This illustrates again the importance of correct selection of the dimension of the signal subspace. The low resolution of WAVES is not due to the weighting of the subspaces in the pseudo-data. Without subspace weighting the resolution of WAVES is even worse. The main problem with the coherent subspace transformations seems to be the extreme sensitivity to the selection of the focussing directions. For high-resolution scenarios this sensitivity is so strong that no convergence to the true directions with iterative application of the method is achieved. More recent methods like WAVES perform only little better in this respect and do not solve this problem.
truAM/Jsb enpple/M = 4 truAM/3sb eripple/M=5 truAM/3sb eriDple/M=5
truAM/3sb eripple/M=5 truAM/3sb eripple/M=5
Figure 16.21 Influence offocussing directions grid for look direction (0,0), narrowband BF a MUSIC with CSST, dimSS = 5 and 4 b WAVES, dimSS = 5
Figure 16.22
Support of two-dimensional power spectral density
16.3.2.3 Full space—time processing These methods try to account for the fact that broadband sources have a very special linear interdependence of angle and frequency. If we consider, for example, a linear array, then the two-dimensional spectral density in u and / has in fact a support of measure zero as sketched in Figure 16.22. The problem for superresolution then is to model the frequency dependence of the signal subspace. ARMA model fitting The first attempts to account for the frequency dependence were made by fitting a vector ARMA model to the array output data [29, 30]. The data, the covariance matrix and the array manifold are defined in a rational vector space, the signals and the receiver noise are represented by finite dimensional linear systems. From a system theoretic point of view this is a very general concept. The practical realisation of
this technique requires a fairly large number of sample points for determining the polynomial coefficients. The method is therefore often numerically quite expensive. STAP spectral estimation (BASS-ALE estimator) The principle here is the classical STAP principle: stack time and space data into one vector, calculate the space-time covariance matrix, extract the dominant subspace from this matrix, with this calculate the MUSIC spectrum using the corresponding space-time array manifold model. This is the essence of BASS-ALE (broadband signal subspace spatial-spectral estimation) introduced by Buckley and Griffiths [31]. Of course, a lot of practical problems arise: the size of time and space samples, the dimension of the space-time signal subspace, estimation of the space-time covariance matrix, and focussing procedures. The essential parts of the BASS-ALE spatialspectrum estimation are [31]: (i)
(ii)
(iii) (iv) (V)
Compute a space-time covariance matrix R^ from L fain, snapshots x(t),x(t-\r ) , . . . ,x(t + (L — l)r). The sample interval r must be compatible with the bandwidth (smaller than Nyquist). This means fast time STAR The space-time covariance matrix then has block-Toeplitz structure with blocks R((k — l)r) and is of dimension NL x NL. In addition CSST focussing for subbands as in equation (16.27) may be performed. The broadband steering vector is b s r ( / , u ) = c(/)
We want to demonstrate the benefits of using the bare time-plus-space covariance matrix, without the special problems of finite sample size, covariance estimation techniques and focussing. We can use the signal model of equation (16.18) or (16.24) to calculate covariance matrices R(r) = E{x(t)xH(t + r)}. This gives the asymptotic space-time covariance matrix with errors, with different kinds of array steering for the subarray outputs of a fairly complicated antenna. With this technique we have calculated the BASS-ALE spectra for scenarios 1 and 2 for one and two delay taps. These are shown in Figures 16.23 and 16.24. The space-time covariance matrix then has dimension 96 x 96 and 128 x 128. The tap delay is set Bx — 1. The dimension of the signal subspace is significantly larger than in Figures 16.7 and 16.11. With the '1OdB above noise criterion' we find for scenarios 1 and 2 dimSS = 15 (2 taps) and 20 (3 taps). These values are used for the corresponding MUSIC spectra. One can see that the extension to space and time is not
truAM/ 2t eripple/ btau= 1/ M= 13 truAM/ 3t eripple/ btau = 1/ M= 17
truAM/ 2t eripple/ btau= 1/M= 13 truAM/ 3t eripple/ btau= 1/M= 17
number of eigenvalues
Figure 16.23
BASS-ALE estimation for scenario 1, narrowband BF a BASS-ALE spectrum for 2 and 3 taps b corresponding eigenvalues
completely effective against broadband effects, for scenario 1 similar to spatial-only processing with tapering (Agrawal method, Figure 16.14a). This is due to the fact that the time basis (three or four time samples) is too short to resolve the bandwidth effects. The poor performance is not related to the tap delay Bz = 1. For Br = 0.5 we have
truAM/ 2t eripple/ btau= 1/ M= 13 truAM/ 3t eripple/ btau = 1/ M= 17 truAM/2t/btau=l/M=13
truAM/ 2t eripple/ btau = 1/ M= 13 truAM/ 3t eripple/ btau = 1/ M= 17 truAM/2t/btau=l/M=13
number of eigenvalues
Figure 16.24
BASS-ALE estimation for scenario 2, narrowband BF a BASS-ALE with 2 and 3 taps with errors and with 2 taps without errors b corresponding eigenvalues
truAM/ 2t eripple/Btau= 1/ M= 13
Figure 16.25
truAM/2t eripple/ Btau = l/A/=11
BASS-ALE estimation with time-delay subarray BF a scenario 1, dimSS = 1 3 b scenario 2, dimSS = 11
nearly the same performance. The computational complexity increases significantly with BASS-ALE due to the larger dimensions. For space-time covariance matrices the channel errors have the same impact as in spatial-only processing. In Figure 16.24 we have in addition plotted the MUSIC pattern and eigenvalues for the error-free case (bandwidth effects only). Clearly, resolution can be improved with BASS-ALE by additional CSST focussing, but this is also true for the spatial-only processing methods. If we have an array with beam steering by subarray time delays, we have a built-in focussing procedure. The BASS-ALE estimates from such an array have excellent properties as shown in Figure 16.25, but this can also be achieved by spatialonly processing, compare with Figure 16.12. In summary, with respect to the high computational expense due to its high dimensions, the BASS-ALE method does not seem to be too attractive against bandwidth effects. However, the big improvement is its independence against subspace or covariance matrix focussing directions. Modelling of frequency dependent signal subspaces The idea of this method is to model directly the frequency dependence of the signal subspace as sketched in Figure 16.22. This has been presented by Grenier [32]. The evolution of the noise subspace vectors Vn(f) is approximated by a linear combination of frequency-dependent basis functions gq(f), Un(f) = Ylf=onnqgq(f)There are many possibilities for choosing the basis functions. In the simulations in Reference 32 simple polynomials gq(f) = ( / — fo)q were chosen and orthogonalised over [/o — B/2,fo -f B/I]. This shows that this method is closely related to ARMA modelling. It was explained in Reference 32 that the choice of the basis functions is not very critical. The choice of the optimum basis function is an open problem. The complete algorithm was formulated in Reference 32 in a time-domain and a frequency domain version. We list here the
frequency domain version: (i) (ii)
(iii) (iv)
Expand the data x ( / ) e C ^ in the frequency domain xex(f) = g ( / ) x(f) e £N(Q+\) ^01. b y filtering in the time domain), Estimate the expanded N(Q + 1) x N(Q + 1) covariance matrix R^x = E{xex(f)x^x(f)} (by averaging of subband vectors in the frequency domain, in the time domain by averaging over the filtered vectors), Determine the noise eigenvectors Vn e C{Q+l)N (n = Ms + 1 , . . . , N(Q + I)) for Ms dominant eigenvectors of Kex. Partition Vn in Q + 1 blocks of length N: U f = [ U ^ , . . . , U ^ ] and define
Un(Z) = Ej=O u ^ft(/>-
(v)
Calculate the expanded MUSIC spectrum as ( E n |U^(Z)b(Z, W)I2)"1 •
Step (i) shows that this method is closely related to STAP methods: in STAP special expanded vectors of time-delayed data of data xex (with gq(f) = e^2n^q are chosen. In this sense frequency-dependent modelling is more general. Again we have simulated this method with our standard scenarios without any finite sample effects but with errors included. This was achieved by using the frequency domain version with the subband covariance matrices used in Section 16.3.2.2 for CSST (exactly the same matrices were used). The subband matrices R(ZJO (k = 1,.,.,Nf, the number of subband frequencies) were expanded by polynomials up to first order, i.e. we used vectors (go(fk))k=\,...,N/ = (!>•••? 1) and (g\ (fk))k=\,...,Nf = (Zi ~ Zo, •. •, Zv/ - Zo) which were normalised to length one. The expanded covariance matrix was then composed as: Nf
Kex = Y, (g(Z*)gr(ZO) ® R(Zt) k=\
with g(fk) = (go(fk), gi (fk))T - Steps (iv) and (v) of the above algorithm were then performed with the subband frequencies fk. The crude expansion with polynomials of zero and first order was also considered in Reference 32 and was found to work satisfactorily. The first observation is that this method cannot be applied to a steered array. Phase shifting at the elements, which is done at the centre frequency of the whole band, introduces biases at the subband centre frequencies which are not modelled by this method and which cannot be recovered. The same is true for time-delay steered subarrays. This is certainly a drawback of this method and it would be worthwhile thinking about generalising this method to steered arrays. Without phase shifting (look direction set to boresight) the method works well even for the considered irregular superarray. This is shown in the following plots. The dimension of the expanded covariance matrix is 64 ([number of subarrays] x [expansion order -f- I]). Only the 32 largest eigenvalues are shown. The result for scenario 1 is seen in Figure 16.26. This is similar to the result with BASS-ALE, Figure 16.23, and comparable to CSST, Figure 16.17. Also, with this method we do
truAM/ 3sb eripple/ M=6
number of eigenvalues
Figure 16.26
Frequency-dependent model expansion for scenario 1, three frequency subbands, expansion order 1, narrowband BF a MUSIC spectra of the three subbands, dimSS = 6 centre frequency marked with crosses b eigenvalues of expanded covariance matrix truAM/3sb eripple/M=7
number of eigenvalues
Figure 16.27
Frequency-dependent model expansion for scenario 2, three frequency subbands, expansion order 1, narrowband BF a MUSIC spectra of the three subbands, dimSS = 7, centre frequency marked with crosses b eigenvalues of expanded covariance matrix
not have the critical dependence on the focussing directions, but we have in addition a certain frequency resolution given by the patterns for the three different subbands. The high-resolution properties are seen with scenario 2 in Figure 16.27. The weak source in the vicinity of the strong source is not resolved as it is with BASSALE, in contrast to CSST with perfect focussing directions, Figure 16.16. This is the price we have to pay for not using any a priori source direction information for the model expansion method. In reality we will never know the exact focussing directions and this degrades the performance of the CSST methods as well, as shown
in Section 16.3.2.2. Note the difference of the spectra while scanning through the subbands. From these subband spectra a resolution of the two sources may be obtained with suitable processing. For this method the fine structure of the eigenvalues is necessary for high resolution. Figure 16.27b might suggest using only four or five dominant eigenvalues (according to the 10 dB rule used before). However, with this choice of dimSS we would completely lose the information about the weak source in scenario 2, and only one peak would be displayed. The fact that model expansion does not work for a steered array implies that an array manifold approximation like DUM does not work. Clearly, for a method that extrapolates the array manifold we cannot use a simplified model. In summary, model expansion provides some advantages with respect to bandwidth dispersion and disadvantages with respect to resolution against BASS-ALE, if we use covariance matrices of equal dimensions as was done here.
16.3.3
Conclusions on broadband superresolution
There are useful methods for making an array with narrowband beamforming robust to bandwidth. Direction-dependent weighting (the Agrawal method) according to the bandwidth is simple and effective. This method can be combined with the simple DUM array manifold model if little calibration effort is desired. All this gives a simple local (spotlight) superresolution property. If very high resolution is desired and an array with narrowband beamforming is used, we have to apply space and time processing methods. The problem is how well we know the array manifold. If it is perfectly known, the model expansion method seems to be very useful. Once good estimates of the source directions have been found, these could be used as focussing directions for a CSST method. WAVES is the most recent and best of these methods. If the array manifold is known (measured) with certain errors, the errors will be largest in the subarray sidelobe region and it may be useful not to consider high resolution in this region. Model expansion is then not applicable because it cannot be used for steered arrays. CSST (with narrowband beam steering) is then a suitable method. To overcome the strong dependence on the focussing directions different superresolution methods may be applied in parallel (superresolution filter banks [33]) to obtain some convergence to the true directions when this method is applied iteratively. BASS-ALE gives good resolution without requiring focussing directions, however, at the price of a large computational load. BASS-ALE does not completely eliminate the bandwidth dispersion effects. All space and time processing methods are a little more sensitive against channel errors than the spatial-only processing methods. This is because the dispersion due to the full bandwidth in spatial-only processing dominates the errors while for the smaller subbands the relative errors (per subbandwidth) have a greater impact. If we have a broadband array with subarray delay beam steering we have fairly good focussing in the array look direction. This gives with all methods high resolution within a certain sector of view even with the spatial-only methods, but it introduces dispersion outside of this field of view. The combination with the simple array manifold model DUM is then very useful.
Next Page
Table 16.3 Key features of broadband superresolution methods Sensitivity to channel errors
Computational cost and matrix size
A priori information/ sensitivity against it
low
low
low (K x K)
none
moderate
some
low
low+ (K x K)
Incoherent moderate averaging over subbands CSST good
some
low
subband processing
bandpass filter characteristic/ weak none
Method
Reduction of Resolution bandwidth dispersion
Standard narrowband processing Agrawal tapering
none
WAVES
good
BASS-ALE
good
Model expansion
moderate to good
Nf(K x K) high to low, medium depending on focussing directions medium high to low, depending on focussing directions medium high medium
medium
subband processing
Nf(K x K) subband processing
source directions/ strong source directions/strong
Nf (K x K) STAP processing with NT delays (NT K x NxK) subband processing, expansion order NQ (NQK
number of taps/weak dimSS, no beam steering allowed
X NQK)
To point out the differences between the various methods at a glance we have listed the properties by keywords in Table 16.3.
16.4 Jammer suppression with broadband arrays Adaptive antennas for suppressing interference have a long history dating back into the 1960s. The first pathfinders were Applebaum [34], Widrow [35], Griffiths [37], Lacoss [36] and others. Since then innumerable papers on this subject have appeared. Much of this material has appeared in textbooks [38—40]. We will summarise only the principles of adaptive interference suppression for the multifunction radar context in the next section. As in the preceding sections we will not go into the problems associated with limited sample size, like weight jitter, pattern stabilisation etc. This means that a lot of algorithms dealing with rapid convergence, adaptation in the data or covariance domain, projection methods for small sample size etc. will not be considered. In Section 16.4.2 we will then comment on the usefulness of spatial-only adaptation for broadband ABF and Section 16.4.3 presents
Previous Page
Table 16.3 Key features of broadband superresolution methods Sensitivity to channel errors
Computational cost and matrix size
A priori information/ sensitivity against it
low
low
low (K x K)
none
moderate
some
low
low+ (K x K)
Incoherent moderate averaging over subbands CSST good
some
low
subband processing
bandpass filter characteristic/ weak none
Method
Reduction of Resolution bandwidth dispersion
Standard narrowband processing Agrawal tapering
none
WAVES
good
BASS-ALE
good
Model expansion
moderate to good
Nf(K x K) high to low, medium depending on focussing directions medium high to low, depending on focussing directions medium high medium
medium
subband processing
Nf(K x K) subband processing
source directions/ strong source directions/strong
Nf (K x K) STAP processing with NT delays (NT K x NxK) subband processing, expansion order NQ (NQK
number of taps/weak dimSS, no beam steering allowed
X NQK)
To point out the differences between the various methods at a glance we have listed the properties by keywords in Table 16.3.
16.4 Jammer suppression with broadband arrays Adaptive antennas for suppressing interference have a long history dating back into the 1960s. The first pathfinders were Applebaum [34], Widrow [35], Griffiths [37], Lacoss [36] and others. Since then innumerable papers on this subject have appeared. Much of this material has appeared in textbooks [38—40]. We will summarise only the principles of adaptive interference suppression for the multifunction radar context in the next section. As in the preceding sections we will not go into the problems associated with limited sample size, like weight jitter, pattern stabilisation etc. This means that a lot of algorithms dealing with rapid convergence, adaptation in the data or covariance domain, projection methods for small sample size etc. will not be considered. In Section 16.4.2 we will then comment on the usefulness of spatial-only adaptation for broadband ABF and Section 16.4.3 presents
results for space-time jammer suppression (fast time STAP) in comparison with spatial-only ABF. As previously mentioned, we will not consider broadband techniques based on a known signal waveform, e.g. stretch processing. An important publication on broadband jammer nulling using stretch processing is Reference 59.
16.4.1
General principles of adaptive interference suppression
We consider adaptive beamforming (ABF) with an active system and assume that the interference covariance matrix alone can be measured. Ideally, this is achieved by measuring this matrix when no signal is transmitted. We will not consider the question of how a signal-free interference covariance matrix can practically be measured in a real radar, because this question involves system aspects like pulse compression gain, radar modes, dead zones etc. The main point is that this approach is fairly different from passive systems where there is little information for distinguishing between desired and unwanted signals and where superresolution patterns (from methods like Capon's or MUSIC) are considered as adaptive beamforming. The difference lies in the significance of the array manifold which, as we have seen in the preceding section, plays a major role in broadband techniques and in the resolution properties. If the interference-only covariance matrix is used, the adapted pattern looks very similar to the quiescent sum beampattern in Figure 16.3 and the adapted pattern has similar robustness properties, at least for few sidelobe interference sources. Thus most of the array manifold problems discussed before do not apply for this kind of ABF. 16.4.1.1
Optimisation problem and optimum narrowband beamforming with subarrays The first step in radar processing is detection. For high detection probability we need a high signal-to-interference-plus-noise ratio (SNIR) and therefore the principal task of ABF is to restore the SNIR. If we had access to the elements this would mean choosing a weight vector w e CN such that: SNlR = ——77-V (16.29) v J ^{Iw^nl 2 } l H is maximised. The solution is w = fiQ~ s with Q = E{nn } and fi a free normalisation constant. If there is only receiver noise present, Q = I, then we obtain the conventional sum pattern. For few sidelobe jammers the antenna pattern will look very similar to the conventional pattern with the broad mainbeam and with robustness against errors. This is in contrast to superresolution methods. In reality the interference cannot always be measured alone. If the signal is included in the covariance matrix Q it will be suppressed as an interference unless it is exactly in the subspace spanned by s. Much work has been done to investigate and mitigate the effects of the signal included in the adaptation data. In addition clutter, which is the reflection from unwanted objects (ground, rain etc.), may be present. For ground-based radar it is not desirable to have clutter in the covariance matrix Q, because it may consume unnecessarily spatial degrees of freedom and the clutter
can in this case be effectively suppressed by filtering in the time domain after beamforming. For airborne radar the clutter has a complicated dependence on Doppler and patterns with low sidelobes are desired, e.g. a 40 dB Taylor sidelobe level as in Figure 16.3. Low sidelobe patterns are not optimum if only receiver noise is present. Multiple constraint optimisation All these pattern requirements can be incorporated by replacing the SNIR maximisation by a constrained minimisation of the interference power, minimise: ^{Iw^nl 2 } = w^Qw
(16.30)
subject to the r constraints: w^C = k
(or w H c/ = ki9 i = 1 , . . . , r)
The solution of this optimisation is w = Q~ l C ( C ^ Q " l C) ~ l k. For a single constraint w H s = 1 this is equivalent to the SNIR maximising solution for /x = (S^Q - 1 S)" 1 . However, other additional constraints lead in general to non-SNIR optimum solutions. One should also note that additional constraints consume degrees of freedom, i.e. the maximum number of sources that can be suppressed is reduced by additional constraints. An important practical problem is to formulate the appropriate constraints for the desired performance features. A large number of papers has appeared around the problem of defining suitable constraints and the solution of the optimisation problem. In Reference 41a general strategy is given on how to formulate the constraints on the desired pattern for discrete directions and extended sectors of direction. Additional quadratic constraints to avoid signal cancellation, if the signal is also present in the adaptation phase, are considered in Reference 42. An additional quadratic constraint can also be interpreted as an artificial jamming situation, or, as a pattern constraint. A pattern constraint for spatially uniform low sidelobes is also equivalent to the diagonal loading technique, which consists simply of adding a small portion of the identity matrix to the estimated covariance matrix. This is shown in Reference 43. Diagonal loading is a very powerful technique for pattern stabilisation for small sample size [44]. Instead of a uniform constraint on the sidelobes, one may require enhanced or reduced sidelobe levels in selected areas. This leads to off-diagonal loading techniques, e.g. Reference 45, which means adding a small portion of a full constraint matrix to the estimated covariance matrix. To preserve a certain pattern shape for successive adaptations over time, stochastic (or data-dependent) constraints have been proposed and successfully applied in over-the-horizon radar (OTHR) by Abramovich et al. Reference 46. This is a powerful tool to obtain a minimally distorted time sequence of adapted beam outputs for Doppler processing. Projection and subspace solutions The matrix inversion appearing in the solutions of equations (16.29) and (16.30) may be replaced by a projection on the complement of the jammer subspace. This is equivalent to considering a scenario with the jammer-to-noise ratio (JNR) going to
infinity, because we have from equation (16.8):
and with the matrix inversion lemma we obtain:
The columns of B span the jammer subspace and Pg B = 0. The use of this projection leads to a complete cancellation of the jammer, and the use of the inverse covariance matrix suppresses the jammer only below the receiver noise level. Projections are therefore useful to achieve a jammer-power-independent suppression. This may be of interest for blinking jammers, if the jammer is partly switched off during the adaptation time. In reality, the j ammer subspace is estimated by the eigenvectors corresponding to the dominant eigenvalues of the estimated covariance matrix (or dominant singular vectors of the data matrix) and the complete cancellation of the jammer is only achieved asymptotically. A lot of fast subspace estimation algorithms have been proposed, e.g. [49-52]. If we have eigenvalue leakage due to errors, bandwidth and finite sample size, then the dimension of the signal subspace must be so large that all leakage eigenvalues are covered, because all jamming power has to be suppressed. This is contrary to the superresolution case, where we tried to select a small subspace dimension to ignore error and bandwidth effects. The main practical reason for using projections is the stabilisation of the adapted patterns for small sample size [47, 48, 53]. If we use the matrix inversion, the small eigenvalues and corresponding eigenvectors with highest estimation noise determine the weight fluctuation. With the projection the weight fluctuation is determined by the stable dominant eigenvectors. However, the residual jamming power of ABF with projections is very sensitive to the choice of the dimension of the jammer subspace. To achieve less sensitivity against the choice of dimSS a weighted projection has been recommended, which is a matrix I — KWK^, where the columns of K span the jammer subspace and W is a suitable weighting matrix [54]. This approach has been termed the lean matrix inversion (LMI). ABF configurations with subarrays For high gain antennas with thousands of elements it is not feasible for cost reasons to process all array element outputs, and from a signal processing viewpoint this is also not necessary. In principle there are two ways of reducing the degrees of freedom (DoF): subarraying and partial digital beamforming (adaptation before beamforming by direct subarray weighting, DSW) and full beamforming plus auxiliary elements (generalised sidelobe canceller configuration, GSLC; or post-beamforming adaptation). Figure 16.28 shows the two concepts. For modern arrays modularity of the antenna structure is a typical requirement. Therefore, a certain subarray structure is also used for the GSLC, as indicated in the Figure. An important point in comparing computational and hardware complexity for a multifunction radar is that we need not
ADC
analogue channel splitting
auxiliary transformation M
adaptive weighting Wy
ADC
adaptive weighting
Figure 16.28
The two configurations for adaptive beamforming a partial digital beamforming by direct subarray output weighting b analogue beamforming and digital adaptation by GSLC (only one difference beam indicated)
only an adapted sum beam for detection, but also adapted difference beams for angle estimation and tracking. Sometimes a guard channel for sidelobe blanking is required, which may be considered as another beam generated from the subarray outputs. With a subarrayed system all beams (in principle arbitrarily many) can be formed digitally, but the GSLC system uses analogue sum and difference beamforming networks. The auxiliary channels for the GSLC need not be single elements but may be generated from the whole array by subarrays as indicated in Figure 16.28b.
Adapted weights with DSW The covariance matrix at the subarray outputs is Q = T^QT € CKxK, where Q G CNxN is the element output covariance matrix. Let the unadapted sum beam at the subarrays be formed by a final weighting mi (i = 1 , . . . , K) as y = mHx. Difference beamforming is done similarly by weightings dx, dy. The adaptive DSW sum beam output is then given by:
ya = wf x with vi^ = Q~ 1 m Normally, the subarray outputs are simply summed up, i.e. m = ( l , l , . . . , l ) r . As pointed out in Reference 9, there are reasons for choosing the reference vector m differently, to fulfil additional properties. The aperture taper applied at the elements for low sidelobe patterns can be divided into the stages (1), (2), (3) as indicated in Figure 16.28a. By this we can achieve properties such as, e.g. minimum amplification spread at the elements, or that the subarrays have equal gain at the A/D converters to reduce limiting or error effects, or noise power normalisation at the subarray outputs before adaptation (i.e. T^T = I). The noise power normalisation is a key tool to preserve low sidelobes after adaptation [9]. If the receiver-noise-only covariance matrix differs from the identity, the adaptation process will try to whiten this matrix and this distorts the desired low sidelobe pattern. The weights for difference beamforming are only applied at stage (3) in the DSW concept. Adapted weights with GSLC The estimated jamming signal is subtracted from the main beam y by the weighted sum of s auxiliary channels xaf\,... ,xa/. ysic — y ~ w ^ c x «- ^ e t t n e s u m ^ e a m and auxiliary channels be generated from the same subarray outputs, y = m^x and xa = M^x (some subarrays may consist only of single elements). The K x s auxiliary forming matrix M can be any matrix, from a simple selection matrix for single subarrays to a matrix that forms full beams with all subarray outputs. The optimum GSLC weight is well known as: wsk =
E{xax^rlE{xayH}
or Wsic = (M^QM) 1 M ^ Q m If the auxiliary channels block the reference signal, i.e. if M ^ m = 0, and if the auxiliaries preserve all degrees of freedom, i.e. if M is of rank K — I, then one can show that in fact y — w^,x a = ^Q- 1 In, i.e. the GSLC is exactly the same as DSW up to the normalisation constant ft. As mentioned before, m need not be equal to the desired signal, i.e. the reference blocking condition is not necessarily a signalblocking condition, which is often desired to avoid signal cancellation. For monopulse
estimation additional difference beams will be needed. Correction techniques for monopulse with adapted beams can also be applied for the GSLC [55]. The main difference between GSLC and DSW lies in the dynamic range of the adaptive channels because of the different point of A/D conversion. This leads to differences in the suppression of strong jammers. Limiting the A/D converters has to be avoided, because any non-linearity degrades the adaptive suppression. For the GSLC strong sidelobe jammers are attenuated before adaptation by the sidelobe level; limiting effects will occur only for very strong jammers. For DSW the jammer will in general be located within the subarray mainlobe. Conversely, for mainbeam jammers the GSLC will sooner come to limiting in the mainbeam than DSW. Of course, both systems do not completely fail if ADC limiting occurs. In this case, the gain of the subarrays or the main beam would be reduced by an automatic gain control device, which results in some SNR degradation, i.e. range reduction. The GSLC is not suited to reduce the degrees of freedom (DoF). The required number of DoF depends on the number of jammers and the system errors. Any reduction of the DoF below the necessary number will result in performance loss. As a rule of thumb it has been found [6] that a number of DoF of two to three times the number of jammers is necessary to compensate for channel errors. This is primarily a requirement for strong jammers, in particular for mainbeam jammers. IfABF with difference beams for monopulse estimation is also performed, additional degrees of freedom may be necessary to achieve small angle errors after correction [55]. Another difference between both concepts is with respect to channel errors. The analogue beamforming networks of the GSLC tend to be more broadband. For DSW all bandpass filtering and A/D conversion errors have an impact on beamforming, in particular with respect to the null depth and the sidelobe level. DSW with channel errors can perform significantly worse than the GSLC. Calibration procedures are the key solution to this problem. Simple phase and amplitude calibration is not the problem. The channel inequalities have to be equalised over the receiving (signal) bandwidth. 16.4.1.2 Extension to broadband adaptive beamforming If we have a broadband interference source, the application of the narrowband techniques described before will produce broad nulls corresponding to the bandwidth and the deviation of the angle from boresight. This means that the antenna seems to see a large number of closely spaced jammers which are suppressed using a corresponding large number of degrees of freedom. If a large number of spatial degrees of freedom (subarrays) is available broadband jammers can be suppressed with narrowband ABF as far as sidelobe jammers are concerned. Also, for the number of subarrays this may not be very effective [8]. But most important, for main beam jammers spatial-only ABF will produce extended nulls at the skirt of the main beam and this degrades the gain and the sidelobe level considerably. Adaptivity in space and time reduces these problems significantly. The ideas presented in the preceding section can be directly generalised to the delay and sum beamformer. In this case we would have a tapped delay line behind each
element with an adaptive weight for each tap, i.e. for the delayed L data snapshots x(f),x(f -f T), . . . ,x(f + (L — 1)T) we calculate a space-time covariance matrix R^ of dimension NL x NL, which asymptotically is a Hermitean block-Toeplitz matrix with blocks R((k — l)r). All adaptive algorithms mentioned before can be applied with this covariance matrix and the steering vector bsrif, u) = c ( / ) ® b ( / , w), where ® denotes the Kronecker product, b is the spatial steering vector defined as in Section 16.2.1.3 and c ( / ) = (ej2nf{k~l)T)k=\,...,L- This is the general space-time adaptive processing, STAR For the realistic arrays we consider here, delays for beam steering will only be realised at the subarray level. Four configurations are possible: (i) Narrowband beamforming with adaptivity in space only, i.e. full narrowband processing. This means that we have all bandwidth effects as shown in Figure 16.5 in the main beam and the general sidelobe behaviour. In the jammer directions we have broad nulls. (ii) Narrowband beamforming with space-time adaptivity. This means that we have all bandwidth effects as shown in Figure 16.5 (main beam and sidelobe behaviour), but narrow nulls in the jammer directions. (iii) Subarray delay beamforming with spatial adaptivity only. This means that we have no squint and a fairly constant main beam shape for all scan angles, but the nulls for the jammers will broaden depending on the deviation of the jammer direction from the actual scan angle. This is suitable for the demanding case of jammers close to the main beam because for these jammers only a few degrees of freedom will be consumed. (iv) Subarray delay beamforming with space-time adaptivity. This configuration combines the desired main beam shape with narrow adaptive nulls.
16.4.2 Spatial-only adaptation In this section we consider a processing scheme exactly as in Figure 16.28. However, the subarray outputs for DSW and GSLC may have time delays for beam steering (broadband BF, configuration (iii) of Section 16.4.1.2) or not (narrowband BF, configuration (i) of Section 16.4.1.2). Due to the spatial-only adaptive channels the adaptive processor will create broad nulls in the jammer directions. The difference between narrowband BF and broadband BF is only that the extension of the nulls increases with the separation from boresight or from the look direction, respectively. Spatialonly adaptation with time-delay beamforming is therefore still quite effective against mainbeam jammers. The techniques mentioned in Section 16.3.1 are only concerned with modified array manifold models, i.e. affect the quiescent sum beam shape. These models may be applied for ABF, but they have no impact on the null width. Also, the different manifold models have little effect on the beam pattern because all beam steering is done here at element/subarray delay level. There is no digital beam scanning with the subarray outputs, as in the superresolution case.
16.4.2.1 Data tapering Sometimes a certain tapering of the adaptation data is useful against broadband jammers. The jammer may sweep through the radar receiving band such that during adaptation the full band is not covered. In this case one would like to broaden the adaptive null width to suppress the jammer over the full band. A broadening of the null width is also of interest against moving jammers. The null width can be controlled by a tapering of the estimated covariance matrix by Q O W, where O denotes the Schur-Hadamard product. In Reference 56 the weighting matrix is chosen as Wtk = sin m A/sin A, with A = (7r/c)/o(x/ — Xk)8 for suitable values of m, 8. In Reference 57 a simpler and more efficient weighting is suggested: W = I + 8(\xT H- yy r ) + /*(x (2) x (2)r + y ( 2 ) y ( 2 ) r ) + • • • with Xi, v/, denoting the array element/subarray centre coordinates and x ^ = x 0 x = (X?)I=IV..JV and with suitable weightings 8, \i. The x/y-coordinates control the broadening in azimuth/elevation. Hence an independent broadening in azimuth and elevation can be selected. The quadratic or higher-order terms are in general not necessary. For the case of digital subarray outputs the technique has to be applied analogously using the subarray centres. Subband adaptation The techniques of Sections 16.3.2.1 and 16.3.2.2 start with a Fourier transform of the data into narrow subbands. We can perform spatial-only adaptation with the subband data (post-FFT ABF). The subband data may be considered as narrowband and all properties for that case apply. The problem is, how the adapted subband beams are further processed. In principle one could average the subband beams incoherently or form a CSS averaged covariance matrix with which adapted beam outputs are produced. However, in an active system we want to integrate coherently a desired signal. The adaptation distorts the subband beams and thus the desired signal is perturbed in an unknown manner depending on the jamming scenario. It is not clear if CSS is the appropriate integration. Additional constraints (e.g. stochastic constraints mentioned above [46]) to preserve the signal characteristics may be necessary, but this depends on the type of application. Post-FFT adaptive beamforming is proximate for systems which already use an FFT in their processing chain, like in SAR systems with limited swath width. For classical search and track radar an FFT at each subarray is unusual and would require more complex data storage.
16.4.3 Space and time adaptation In this section we assume that we have a tapped delay line with an adaptive FIR filter at each subarray output. The beamforming operation before the subarray outputs may be narrowband, configuration (ii), or with delayed subarrays, configuration (iv). The intertap delay must be such that nulls can be maintained for all jammer locations over the entire band B. This means that we are talking here about space-time adaptive
processing with fast time samples. Empirically it has been found that an intertap delay of r = e/B with 0.2 < e < 0.8 is a good choice [58]. The required number of taps L depends on the maximum jamming delay to be encountered. The maximum delay in endfire direction is A/c, if A denotes the array aperture diameter. Let the aperture diameter be measured in units of the wavelength of the centre frequency, A = AoAo, then we have Le/B = Ao//o, or L = AoBr/s, where Br is the relative bandwidth. With 60/Ao ^ beamwidth [degrees] we can express the required number of taps by the bandwidth factor defined in Section 16.2.1.1 L = KBO.6/S. We have simulated asymptotic space-time covariance matrices with errors as in Sections 16.2.3.2 and 16.2.3.3 for the generic array. We used a scenario very similar to scenario 1. However, the look and signal direction was set between two jammers and we chose a realistic high jamming power. We consider only the DSW case, because the GSLC with optimum blocking matrix is identical and we do not want to investigate the effect of different blocking matrices. In the following plots we show the adapted patterns for weights adapted against broadband jammers but for a narrowband test signal at centre frequency, i.e. the mainbeam shape is as in the narrowband case. The element SNR is 0 dB which gives an array output SNR of 28.3 dB as shown in Table 16.4 (we have a 1.7 dB loss due to the 4OdB Taylor weighting). Let us first look at the situation for narrowband beamforming. Figure 16.29 shows the adapted sum patterns for spatial-only adaptation. With the 32 subarrays we can suppress the three broadband jammers and this is achieved by spatially extended nulls. Also, with the many degrees of freedom available, there is little difference if there are bandpass ripple errors present. The loss in signal-to-noise-plus-interference ratio (SNIR) after adaptation is only 0.8 dB (relative to the quiescent case). Note that with errors the null depth is limited to —50 to —60 dB depending on the angular location of the jammer. The sensitivity with respect to errors is totally different if we use space-time processing. We show examples for two and three taps resulting in a space-time covariance matrix of size 64 x 64 or 96 x 96. We have chosen the intertap delay r Table 16.4 ABF scenario 3 Jammers: Signal: Relative bandwidth: Filter ripple errors:
Array look direction: (phase shifters and/or subarray delays)
directions-53°,-30°, 0° (u = - 0 . 8 , - 0 . 5 , 0.0) direction - 3 7 ° , (M0 = -0.6)
element JNR = 34, 34, 34 dB element SNR 0 dB, array output SNR 28.3 dB
cosine on rectangular additional 0.2% delay errors (Bx = 0.002) characteristic, cosine amplitude variation 1 dB std, cosine period 20% variation
no errors/SNIRloss = 0.8 eripple/SNIRloss = 0.8
Figure 16.29
Sum pattern for spatial-only adaptation with and without errors, NB beamforming
as Bz = 0.5, which is a medium value as mentioned above. Figure 16.30 shows that the main effect with errors is an increase in the sidelobe level and a lot of leakage eigenvalues. With respect to the SNIR loss there is little difference with and without errors (—1.4 versus — 1.5dB) because we have excess degrees of freedom in this example and we are working with the clairvoyant covariance matrix. Note that the SNIR loss is given normalised to the corresponding receiver noise-only SNR, i.e. the STAP SNIR loss is with respect to the STAP beamformer gain while in Figure 16.29 the loss of —0.8 dB is with respect to the spatial-only beamformer. With three taps resulting in a space-time covariance matrix of size 96 x 96 we do not get further improvement in the error-free case. This is seen in Figure 16.31. From the rough calculation we gave at the beginning of this section one would need for this scenario and this array a number of taps L = 3.5 (the bandwidth factor for this array with Br = 10% is 2.9). The effect of the errors becomes more severe with more taps. Comparing the error simulation with and without delay errors we found that it is the delay errors which STAP is particularly sensitive for, even the present small amount of 0.2 per cent. This is comprehensible as these errors distort the STAP beamforming gain. The situation is different for subarray delay beamforming. We have assumed that the time delays for beam steering are exact (no quantisation) and a STAP intertap delay Bz = 0.5. As can be seen from Figure 16.32 the two jammers close to the look direction appear as narrowband and narrow nulls are formed in the pattern. However, as in Figure 16.5, the subarray delay sum pattern has high sidelobes at
2t no errors/btau = 0.5 SNIRloss= 1.4 2t eripple/btau=0.5 SNIRloss= 1.5
2t no errors 2t eripple
number of eigenvalues
Figure 16.30
STAP with two time delays, NB beamforming a adapted sum patterns with and without errors b eigenvalues with and without errors
about -3OdB. However, in this case the high sidelobes are a consequence of the broadband adaptation against the jammers, the desired signal is here narrowband and the quiescent pattern would be a perfect 40 dB Taylor pattern. As in the narrowband BF case there is little influence of the channel errors for spatial-only processing because of the excess number of degrees of freedom. In the STAP case with two taps we see a significant influence of the errors. In particular the delay errors interfere with the delays in the subarrays for beamforming. In summary we can state that in the ABF case where we are concerned with strong jammers, channel errors seem to be more of a problem, because the leakage
lt/Btau = 0.5 SNIRloss = 2.9 2t/Btau = 0.5 SNIRloss = 1.4 3t/Btau = 0.5 SNIRloss = 2.2
lt/Btau = 0.5SNIRloss = 2.9 2t/Btau=0.5 SNIRloss =1.5 3t/Btau = 0.5 SNIRloss= 1.9
Figure 16.31
STAP with one, two and three taps, NB beamforming a no errors b filter ripple errors
no errors SNIRloss = 2.0 eripple SNIRloss = 2.0
2t no errors/btau = 0.5 SNIRloss = 2.2 2teripple/btau = 0.5 SNIRloss = 0.6
Figure 16.32 ABF with sub array delay beamforming a spatial only b STAP with two taps
eigenvalues are related to the jamming power. In particular, if adaptation in space and time is allowed the sensitivity to errors increases compared with spatial-only adaptation. This is a point that needs to be investigated if the spatial-only versus spacetime adaptation concept is considered. STAP may need higher accuracy requirements for the channel receivers or more adaptive taps resulting in higher cost. For ABF and STAP with narrowband beamforming the desired low sidelobe level is fairly well preserved after adaptation. This is due to the noise normalisation of the subarrays before adaptation. This mechanism does not seem to work for subarray delay beamforming. The problem of subarray delay adaptive and non-adaptive beamforming with low sidelobes needs further study.
16.5
Final remarks
We have tried to give a compact insight into the problems of extending superresolution and adaptive jammer suppression to a larger bandwidth. We have focussed the presentation on a well defined limited application: a standard multifunction radar with a large phased array antenna with digital subarrays. Although in many publications superresolution and ABF are considered as similar problems, we have shown that there are significant differences. For superresolution the main problem is knowledge of the array manifold. With the inherent errors of a multichannel system this quantity has to be calculated from initial antenna measurements and from subsequent calibrations during the antenna lifetime. Array manifold calibration and approximation is a key issue of superresolution. Many coherent techniques for superresolution which try to approximate or restore the broadband array manifold, like CSST and model expansion, have no counterpart for ABF and STAP, because the time dependence of the desired signal is involved and such techniques were not considered here. Time-delay beamforming has proved to be very effective for broadband superresolution. By selecting the look direction we can determine the region with best resolution. It seems that superresolution is less sensitive against dispersive channel errors. Errors produce some leakage of the high-resolution pattern peaks into the sidelobes and also of the dominant eigenvalues into the smaller ones, but these effects are moderate. For jammer suppression we have to cancel strong sources. Numerically this is done by subtracting large numbers of equal magnitude, a procedure which is known not to be very stable. Consequently, errors have a visible influence on the residual jamming power and on the adapted pattern sidelobes. The adapted beampatterns with time-delayed subarrays have no good sidelobe level. This topic needs further investigation. Also the optimum number of spatial versus adaptive taps in time needs to be investigated for a complicated antenna of the type considered here. For all superresolution methods we need to know the number of sources. This can be a critical procedure as realistic eigenvalue spectra may have no significant features. Robust criteria for determining the dimension of the subspace from finite data samples are needed. If we use subspace methods for ABF/STAP, all leakage eigenvalues have to be accounted for, because they all represent jamming power, in contrast to superresolution. Subspace ABF methods have not been considered here, because they are mainly motivated to counter finite sample effects.
References 1 SCHMIDT, R. 0.: 'A signal subspace approach to multiple emitter location and spectral estimation'. PhD dissertation, Dept. of El. Eng., Stanford Univ. 1981 2 IEE Proc. Fy Commun. Radar Signal Process. February 1983, 130, (1), Special issue on adaptive arrays 3 SKOLNIK, M. L: 'Radar handbook' (McGraw-Hill, 1990, 2nd edn.)
4 BOHME, J. E: 'Array processing', in HAYKIN, S. (Ed.): 'Advances in spectrum analysis and array processing, vol. IF (Prentice Hall, 1991) 5 WILDEN, H. and ENDER, J.: 'The crow's nest antenna - experimental results'. Proceedings of IEEE international Radar conference, Arlington, 1990, IEEE cat no. CH2882-9/90/0000-0280, pp. 253-258 6 NICKEL, U.: 'On the influence of channel errors on array signal processing methods', Int. J. Electron. Comm. (AEU)9 1993, 47, (4), pp. 209-219 7 FARINA, A. and SANZULLO, R.: 'Performance limitations in adaptive spatial filtering', Signal Process., 2001, 81, pp. 2155-2170 8 MITCHELL, M., HOWARD, R., and TARRAN, C : 'Adaptive digital beamforming (ADBF) architecture for wideband phased array radars'. Radar sensor technology IV, SPIE 3704, 1999 (Proc. SPIE, Orlando 1999), pp. 36-47 9 NICKEL, U.: 'Subarray configurations for digital beamforming with low sidelobes, adaptive interference suppression, and superresolution'. FFM-report no. 451, FGAN 1995 10 WAX, M., SHAN, T.-J., and KAILATH, T.: 'Spatio-temporal spectral analysis by eigenstructure methods', IEEE Trans. Acoust. Speech Signal Process., August 1984, 32, (4), pp. 817-827 11 STOICA, R, MOSES, R., FRIEDLANDER, B., and SODERSTROM, T.: 'Maximum likelihood estimation of the parameters of multiple sinusoids from noisy measurements', IEEE Trans. Acoust. Speech Signal Process., March 1989, 37, (3), pp. 378-392 12 LE CADRE, J. P.: 'Parametric methods for spatial signal processing in the presence of unknown colored noise fields', IEEE Trans. Acoust. Speech Signal Process., July 1989, 37, (7), pp. 965-983 13 CADZOW, J. A.: 'Multiple sourcelocation-the signal subspaceapproach',IEEE Trans. Acoust. Speech Signal Process., July 1990, 38, (7), pp. 1110-1125 14 VIBERG, M., OTTERSTEN, B., and KAILATH, T.: 'Detection and estimation in sensor arrays using weighted subspace fitting', IEEE Trans. Signal Process., November 1991, 39, (11), pp. 2436-2449 15 NICKEL, U.: 'Aspects of implementing superresolution methods into phased array radar', Int. J. Electron. Comm. (AEU), 1999, 53, (6), pp. 315-323 16 NICKEL, U.: 'On the application of subspace methods for small sample size', Int. J. Electron. Comm. (AEU), 1997, 51, (6), pp. 279-289 17 NICKEL, U.: 'Spotlight MUSIC: super-resolution with subarrays with low calibration effort', IEE Proc, Radar Sonar Navig, August 2002, 149, (4), pp. 166-173 18 KROLIK, J. and EIZENMAN, M.: 'Minimum variance spectral estimation for broadband source location using steered covariance matrices'. Proceedings of IEEE international conference ICASSP'88, New York, 1988, (IEEE cat no. CH2561-9/88), pp. 2841-2844 19 NICKEL, U.: 'Determination of the dimension of the signal subspace for small sample size'. Proceedings of IASTED international conference on Signal processing and comm., 1998, Gran Canaria, Spain (IASTED/Acta Press, 1998) pp. 119-122
20 AGRAWAL, M. and PRASAD, S.: 'Broadband DOA estimation using "spatialonly" modeling of array data', IEEE Trans. Signal Process., March 2000,48, (3), pp. 663-670 21 WANG, H. and KAVEH, M.:' Coherent signal-subspace processing for the detection and estimation of angles of arrival of multiple wideband sources', IEEE Trans. Acoust. Speech Signal Process., August 1985, 33, (4), pp. 823-831 22 HUNG, H. and KAVEH, M.: 'Focussing matrices for coherent signal-subspace processing', IEEE Trans. Acoust. Speech Signal Process., August 1988, 36, (8), pp.1272-1281 23 KROLIK, J. and SWINGLER, D.: 'Focused wide-band array processing by spatial resampling', IEEE Trans. Acoust. Speech Signal Process., February 1990, 38, (2), pp. 356-360 24 FRIEDLANDER, B. and WEISS, A. J.: 'Direction finding for wide-band signals using an interpolated array', IEEE Trans. Signal Process., April 1993, 41, (4), pp. 1618-1634 25 DORON, M. A., DORON, E., and WEISS, A. J.: 'Coherent wide-band processing for arbitrary array geometry', IEEE Trans. Signal Process., January 1993,41, (1), pp. 414-417 26 KUMAR, K. A., RAJAGOPAL, R., and RAO, P. R.: 'Wide-band DOA estimation in the presence of correlated noise', Signal Process., 1996, 52, pp. 23-34 27 LEE, T.-S.: 'Efficient wide-band source localization using beamforming invariance technique', IEEE Trans. Signal Process., June 1994,42, (6), pp. 1376-1387. 28 DI CLAUDIO, E. D. and PARISI, R.: 'WAVES: weighted average of signal subspaces for robust wide-band direction finding', IEEE Trans. Signal Process., October 2001, 49, (10), pp. 2179-2191. 29 SU, G. and MORF, M.: 'The signal subspace approach for multiple wide-band emitter location', IEEE Trans. ASSP, December 1983, 31, (6), pp. 1502-1522 30 LOUBATON, P.: 'A two-dimensional spectral estimation approach for source detection in array processing'. IEEE Proceedings of 4th international ASSP workshop on Spectral estimation and modeling, Mississippi, USA, 1988, pp.339-343 31 BUCKLEY, K. M. and GRIFFITHS, L. J.: 'Broad-band signal-subspace spatialspectrum (BASS-ALE) estimation', IEEE Trans. Acoust. Speech Signal Process., July 1988, 36, (7), pp. 953-964 32 GRENIER, Y.: 'Wideband source location through frequency-dependent modelling', IEEE Trans. Signal Process., May 1994, 42, (5), pp. 1087-1096 33 GERSHMAN, A. B.: 'Pseudo-randomly generated estimator banks: a new tool for improving the threshold performance of direction finding', IEEE Trans. Signal Process., May 1998, 46, (5), pp. 1351-1364 34 APPLEBAUM, S. P.: 'Adaptive arrays'. Syracuse Univ. research report SPL-796, June 1964 35 WIDROW, B., MANTEY, P. E., GRIFFITHS, L. J., and GOODE, B. B.: 'Adaptive antenna systems', Proc. IEEE, December 1967, 55, (12), pp. 2143-2159 36 LACOSS, R. T.: 'Adaptive combining of wideband array data for optimal reception', IEEE Trans. Geosci. Electron., May 1968, 6, (5), pp. 28-86
37 GRIFFITHS, L. J.: 'A simple adaptive algorithm for real-time processing in antenna arrays', Proc. IEEE, October 1969, 57, (10), pp. 1696-1704 38 HUDSON, J. E.: 'Adaptive array principles' (Peter Peregrinus/ IEE, Stevenage, 1981) 39 MONZINGO, R. A. and MILLER, T. W.: 'Introduction to adaptive arrays' (John Wiley & Sons, New York, 1980) 40 NICOLAU, E. and ZAHARIA, D.: 'Adaptive arrays' (Elsevier Sc. Pub., Amsterdam, 1989) 41 TSENG, C-Y. and GRIFFITHS, L. J.: 'A unified approach to the design of linear constraints in minimum variance adaptive beamformers', IEEE Trans. Antennas Propag, December 1992, 40, (12), pp. 1533-1542 42 QIAN, F. and VAN VEEN, B. D.: 'Quadratically constrained adaptive beamforming for coherent signals and interference', IEEE Trans. Signal Process., August 1995, 43, (8), pp. 1890-1900 43 CARLSON, B. D.: 'Equivalence of adaptive array diagonal loading and omnidirectional jamming', IEEE Trans. Antennas Propag., May 1995, 43, (5), pp.540-541 44 CARLSON, B. D.: 'Covariance matrix estimation errors and diagonal loading in adaptive arrays', IEEE Trans. Aerosp. Electron. SySt., July 1998, 24, (4), pp. 397-401 45 WARNER, E. S. and PROUDLER, I. K.: 'Mitigation of weight jitter for broadband arrays', IEEProc., Radar Sonar Navig., April 2000,14, (2), pp. 49-56 46 ABRAMOVICH, Y. L, SPENCER, N. K., ANDERSON, S. J., and GOROKHOV, A. Y: 'Stochastic-constraints method in non-stationary hot-clutter cancellation Part I: fundamentals and supervised training applications', IEEE Trans. Aerosp. Electron. Syst, October 1998, 34, (4), pp. 1271-1292 (Part II: unsupervised training applications, ibid, January 2000, 36, (1), pp. 132-149) 47 HAIMOVICH, A. M. and BAR-NESS, Y: 'An eigenanalysis interference canceller', IEEE Trans. Signal Process., January 1991, 39, (1), pp. 76-84 48 SUBBARAM, H. and ABEND, K.: 'Interference suppression via orthogonal projections: a performance analysis', IEEE Trans. Antennas Propag, September 1993, 41, (9), pp. 1187-1194 49 YANG, B.: 'Projection approximation subspace tracking', IEEE Trans. Signal Process., January 1995, 43, (1), pp. 95-107 50 DEGROAT, R. D., DOWLING, E. M., YE, H., and LINEBARGER, D. A.: 'Spherical subspace tracking for efficient, high performance adaptive signal processing applications', Signal Process., 1996, 50, pp. 101-121 51 RABIDEAU, D. J.: 'Fast rank adaptive subspace tracking and applications', IEEE Trans. Signal Process., September 1996, 44, (9), pp. 2229-2244 52 GIERULL, C. H.: 'A fast subspace estimation method for adaptive beamforming based on covariance matrix transformation', Int. J. Electron. Commun. (AEU), 1997, 51, (4), pp. 196-205 53 GIERULL, C. H.: 'Statistical analysis of the eigenvector projection method for adaptive spatial filtering of interference', IEE Proc., Radar Sonar Navig., April 1997,144, (2), pp. 57-63
54 GIERULL, C. H.: 'Fast and effective method for low-rank interference suppression in presence of channel errors', Electron. Lett, March 1998, 34, (6), pp.518-520 55 NICKEL, U.: 'Monopulse estimation with subarray adaptive arrays with arbitrary sum and difference beams', IEE Proc, Radar Sonar Navig., August. 1996,143, (4), pp. 232-238 56 MAILLOUX, R. J.: 'Covariance matrix augmentation to produce adaptive array pattern troughs', Electron. Lett., May 1995, 31, (10), pp. 771-772 57 GERSHMAN, A. B., NICKEL, U., and BOHME, J. R: 'Adaptive beamforming algorithms with robustness against j ammer motion', IEEE Trans. Signal Process., July 1997, 45, (7), pp. 1878-1885 58 FANTE, R. L., DAVIS, R. M., and GUELLA, T. R: 'Wideband cancellation of multiple mainbeam jammers', IEEE Trans. Antennas Propag., October 1996,44, (10), pp. 1402-1413 59 TORRES, J. A., DAVIS, R. M., KRAMER, J. D. R., and FANTE, R. L.: 'Efficient wideband jammer nulling when using stretch processing', IEEE Trans. Aerosp. Electron. SySt9 October 2000, 36, (4), pp. 1167-1178
Part VII
Over-the-horizon radar applications
Chapter 17
Stochastically constrained spatial and spatio-temporal adaptive processing for non-stationary hot clutter cancellation Yuri L Abramovich, Stuart J. Anderson, Alexei Y Gorokhov and Nicholas K. Spencer
17.1
Overview
We consider the use of spatio-temporal adaptive array processing in over-thehorizon radar applications in order to remove non-stationary multipath interference ('hot clutter'). Since the spatio-temporal properties of hot clutter cannot be assumed constant over the coherent processing interval, conventional adaptive techniques fail to provide effective hot clutter mitigation without simultaneously degrading the properties of the backscattered sea/terrain radar signals ('cold clutter'). The approach presented here incorporates multiple stochastic (i.e. data-dependent) constraints to achieve effective hot clutter suppression, while maintaining distortionless output cold clutter post-processing stationarity. We discuss the use of stochastically constrained spatial and spatio-temporal adaptive processing for hot clutter mitigation in scenarios that both do and do not allow access to a group of range cells that are free of cold clutter (supervised and unsupervised training, respectively). Theoretical and simulation results are complemented by surface-wave over-the-horizon data processing, collected during experimental trials in northern Australia. The final section discusses convergence properties and convergence rate issues for stochastically constrained adaptive algorithms based on loaded sample matrix inversion routines.
17.2
SC STAP fundamentals and supervised training applications
altitude, km
plasma frequency, MHz
This chapter is concerned with the adaptive processing of data from high frequency (HF) over-the-horizon radar (OTHR) [1,2]. HF skywave radars exploit the ionised region of the upper atmosphere (the ionosphere) to reflect upwardly transmitted radiowaves back to the distant earth's surface. By this mechanism, it is possible to illuminate regions at ranges of several thousand kilometres, although at the penalty of subjecting the signals to the distorting effects of the spatially and temporally varying ionospheric plasma medium. For rays directed at a sufficiently oblique angle to the vertical, total internal reflection occurs, Figure 17.1. Illustrated in Figure 17.2 are some of the many dynamic processes that prevail in the ionosphere, and which imprint their various modulation
range, km
Figure 17.1 Examples of HF radiowave ionospheric propagation paths high-angle ray travelling ionospheric disturbances
field-aligned irregularities
F-layer
meteors
equatorial
auroral scatter E-layer
round-the-world signals
Figure 17.2 Physical processes in the ionosphere that distort HF radiowaves
signatures onto the radar signals. Further, it is apparent that the earth-ionosphere cavity constitutes a waveguide, in which all terrestrial radio transmissions reverberate until they are eventually absorbed or escape into outer space. As a consequence of this complex dynamic physical environment, signals received by a skywave radar differ markedly from simple time-delayed, Doppler-shifted replicas of the transmitted signal. They are [3,4]: • • •
• •
amplitude- and phase-modulated by the plasma medium during ionospheric propagation, over a wide variety of spatial and temporal scales; shifted by a priori unknown amounts in range and azimuth by the large-scale structure of the reflecting ionosphere; characterised by enormous clutter-to-signal ratios, as the illuminated zone (the radar footprint) is typically some HOdB greater than the effective area (radar cross section) of the targets of interest; contaminated with transient echoes from ionised trails produced by the continuous flux of small meteors entering the upper atmosphere; embedded in high levels of additive interference, including impulsive noise generated by lightning, industrial noise and shortwave radio transmissions from around the world. An HF broadcast station in the radar footprint may radiate 250-500 kW, compared with perhaps 10~7 W total radar power incident on a target at the same range.
Figures 17.3 and 17.4 show, respectively, the transmitting and receiving facilities of a typical HF skywave radar - the Jindalee radar near Alice Springs in central Australia. This design, employing uniform linear antenna arrays for both transmit and receive, in a quasimonostatic geometry, is the most common configuration. Several alternative
Figure 17.3 Jindalee skywave radar transmitting facility, Harts Range, central Australia
altitude, m
Figure 17.4 Jindalee skywave radar receiving facility, Mt Everard, central Australia
range, km optical
Figure 17.5
Notional indication of SWR versus microwave radar coverage
designs have been proposed and implemented elsewhere; their relative advantages are widely understood but seldom heeded. HF surface-wave radars (SWRs), on the contrary, do not rely upon the ionosphere, but exploit a waveguide type of propagation over the highly conductive (saline) ocean surface (Figure 17.5). This physical phenomenon, known as the Norton surface wave,
one-way transmission loss, dB
groundwave transmission loss (smooth sea) dielectric constant = 80 conductivity = 4 mhos/m
frequency
range, km
Figure 17.6
Theoretical surface-wave frequencies
transmission losses at various HF
ground wave propagation pol =V, normalised at 40 km
gp 'm§u9i;s ppii distance, km refractivity=1.02 rel permittivity = 80.00
Figure 17.7
scale height = 7.30 conductivity = 5.00
Experimentally observed surface-wave transmission losses
supports HF propagation well beyond the radio horizon, with propagation losses rapidly increasing with the frequency (Figures 17.6 and 17.7). Since surface-wave signals backscattered by the ocean surface and surface/air targets do not experience ionospheric channel distortions, the coherent processing interval of time (CPI) is
Figure 17.8
ComRad surface-wave radar receiving facility, northern Australia
able to significantly exceed that for skywave (ionospheric) radars. Unfortunately, it is practically impossible to select an operating frequency that supports sufficiently distant surface-wave propagation and, at the same time, does not involve ionospheric propagation or backscattering. Thus surface-wave radars are usually exposed to a significant level of ionospherically propagated highly dynamic (non-stationary) cochannel interference, especially at night. Figure 17.8 shows the receiving facilities of the ComRad HF SWR facility, which was located in northern Australia from 1997 to 2001. Clearly, the demands on signal processing in OTHR radar systems are challenging in the extreme. Classical techniques such as multi-dimensional FFT-based signal analysis that have dominated in the past have proven adequate when conditions are benign, but their efficacy degrades rapidly when propagation conditions deteriorate or when interference/jamming levels are high. The need to adapt to the dynamic signal and interference environment is the motivation behind the work presented in this chapter. In general, HF OTHR systems operate similarly to airborne radars by collecting data over a CPI that consist of M transmitted pulses or sweeps (in the case of continuous wave (CW) radars), emitted at a pulse (waveform) repetition frequency of fPR pulses per second. The receiving system consists of N elements, subarrays for example, with each element linked to an individual digital receiver. Receiver outputs are sampled at the Nyquist rate of fyy samples per second, resulting in T samples per pulse repetition interval (PRI). The total set of data collected in this manner during a single CPI therefore consists of Af x M x T samples. Increments in the Nyquist rate (range bins) within a particular PRI are called fast time samples, and those across PRIs are termed slow time samples.
For the experimental airborne radar system developed under the Mountain Top program [5], for example, we have an N — 14 element array, with M = 16 repetition periods and T — 1500 range bins. For typical skywave OTHR, on the other hand, we have N = 16 to 372, M = 128 to 256 and T ~ 50 to 60 [6, p. 267]; for surface-wave (SW) radars, the CPI may approach 60 or 120s [7], so that even with fNy = 10 Hz, we have M= 1000. For both HF OTHR and airborne radar, an interference signal produced by a single source is seen as a multiplicity of interference signals at the receiving antenna array, each mode propagating from source to receiver along a different path. For airborne applications, the strongest such modes are typically the ones travelling directly to the receiver and those arising from specular reflection. Nevertheless, a multiplicity of additional ground reflection paths, called hot clutter [5], may also exist depending upon the nature of the terrain between the transmitter and receiver [8,9]. Under the Mountain Top program, the time difference between the direct arrival and the longest hot path component for a far-field source generally does not exceed 20 to 30 |xs (i.e. 20 to 30 range bins), and a quite comprehensive model for this type of hot clutter has been presented in Reference 9. It is important to note the critical distinction between hot clutter and ordinary cold clutter: the latter is a reflection of the radiated radar signal, and the former refers to the (diffuse) multipath-scattered interferer signals. The signal strength, direction of arrival (DoA) and relative Doppler shift for each individual propagation path (or mode in HF terminology) depends upon the reflection characteristics of the ground (or ionospheric layer in the skywave case). In the model of diffuse multipath-scattered interferers [9], all scatterers along a line of constant path delay contribute a spatial component with a single synthetic wavefront that is slowly changing with differential Doppler frequency. Thus, within any given PRI, the diffuse scattering may again be represented as a superposition of a limited number of modes, separated in delay and characterised by a synthetic wavefront differing for each mode. The relative motion of source and receiver is the main reason for these synthetic wavefronts evolving in time. In the field of HF OTHR, the various ionospheric layers and inhomogeneities involved in the reflection of interference signals are responsible for a similar hot clutter phenomenon [10]. According to Reference 10: Ionospherically propagated signals may consist of several modes, non-stationary in bearing and highly correlated with each other. Furthermore, due to the signal being refracted from an inhomogeneous region, each mode can be considered to consist of a specular ray surrounded by a cone of diffracted rays. The resulting wavefront for each mode may therefore be far from planar.
If the propagation paths involve reflections from highly perturbed and non-stationary ionospheric regions (such as the equatorial hot plasma area or the polar regions), then non-stationarity of the spatio-temporal covariance matrix over a typical CPI is inevitable. Indeed, spatial non-stationarity of ionospherically propagated interference signals has been observed in HF OTHR, even over the relatively short CPIs typical for aircraft detection [H]. Consideration of this phenomenon is found to be essential for ship detection via skywave propagation, and even more so for HF surface-wave radars
where non-stationarity over much longer CPIs leads to a dramatic degradation in the performance of most existing adaptive interference cancellation techniques [H]. Note that spatial non-stationarity is associated with DoA varying over the CPI of time only for a simplistic model that incorporates a single plane wave. Although the mean DoA can indeed vary dramatically due to the passage of travelling ionospheric disturbances (TIDs), spatial non-stationarity involves more complicated transformations of the space-time properties of the multimode and diffractive rays. Of course, the so-called general fully adaptive spatio-temporal processing (STAP) described in [9], for example, can theoretically solve the problem of joint hot and cold clutter suppression [12]. However, in practice this is rarely possible because the dimension of such a fully adaptive system would be NML, where L is the number of fast time samples involved (number of taps). In Reference 9, the number of taps is chosen so that the range of delays expected (20 to 30 range bins in the Mountain Top experiment) is encompassed by the total delay (L — l)r, where r is the tap delay. For airborne moving target indicator (MTI) applications with M = 2 or 3 [13], it is possible that such a brute force method could be used effectively. For M= 16 (as in the Mountain Top data), and moreover for any HF application with M > 128, a fully adaptive scheme is totally impractical simply due to the lack of, say, 2NML training samples. Thus, from a practical viewpoint, we should consider a scheme whereby each finger beam is associated with an TVL-variate fast time STAP to reject the (diffuse) multipath interference (hot clutter). For HF OTHR applications, the output signals of each beam should be processed by the standard slow time inter-PRI coherent processing (Doppler spectrum analysis), provided that the cold clutter slow time properties at the scalar finger beam output are not perturbed by the previous fast time STAR For airborne applications, one could imagine a scheme with N\ M-variate slow time STAP to handle the cold clutter, where N\ is the number of spatial channels (fmger beams), and with the hot clutter already cancelled using corresponding fast time STAP preprocessing. Obviously for sufficiently high non-stationarity of the hot clutter signal, such a scheme can again be valid only if the slow time properties of the cold clutter are retained in some way, since the uncontrolled pattern fluctuations over the CPI introduced by conventional fast time STAP modulate and consequently decorrelate the cold clutter signal. For purely spatial adaptive processing (SAP), this latter phenomenon has been established both theoretically and experimentally in HF OTHR [14,11,15,16]. Note that if the cold clutter signal was created by a limited number of point scatterers, it would be possible to freeze the receiving antenna pattern in the direction of each point scatterer using standard linear deterministic constraints [5] that are normally used in order to protect the antenna pattern in the expected signal direction. Clearly the spatial distribution of the cold clutter is generally quite broad, so that it is collected by most of the antenna beampattern rather than in just a few directions. Therefore the above method is inappropriate, since we are not able to freeze the entire pattern or even a significant part of it without a dramatic degradation in hot clutter rejectability.
Another quite straightforward approach in avoiding antenna fluctuations is to return to time-invariant (over the CPI) fast time STAR The technique of averaging the non-stationary covariance matrix over the CPI has been introduced and tested for HF OTHR applications [14,11], where it was demonstrated that this approach is appropriate only for extremely short CPIs. Reference 11 defines the typical stationarity interval for ionospherically propagated interferers within the dynamic range of contemporary digital receivers to be 100 to 150 |xs, which includes only a few PRIs. Interferer averaging over this interval usually leads to the acceptable degradation of 1 to 3 dB in interference rejectability compared with quasi-instantaneous covariance matrix estimation which uses fast time training samples in the immediate neighbourhood of the analysed samples within the same PRI, where the delay between the training samples and the operational ranges can be ignored. For airborne applications, a similar approach is introduced in Reference 9 where it is shown by simulation that the expected degradation caused by averaging over two adjacent PRIs is less than 3 dB for UHF radar, and less than 8 dB for S-band radar operating at the same PRI. Therefore for airborne radars (M = 16) or for HF OTHR (M = 128), this approach is also completely inappropriate, although the property of local stationarity over the short interval of a few consecutive repetition periods will be heavily exploited in what follows. Naturally, this also means that the traditional approach for treating stationary interferers, whereby the interferer parameters are estimated in passive radar mode before and/or after the active CPI, is also inappropriate. To summarise, no existing technique is able to provide over sufficiently long CPIs a highly effective hot clutter only mitigation without compromising the cold clutter processing. The main objective of this section is to introduce an approach whereby the non-stationary hot clutter rejection is performed by fast time STAP updated from PRI to PRI, and the slow time correlation properties of the cold clutter scalar output are not affected by the STAP temporal fluctuations. An SAP technique involving stochastic constraints was introduced to solve this problem [14,11,15,16]. This technique was experimentally verified for HF OTHR, with results reported in References 11 and 16. Here we introduce the stochasticconstraints approach for spatio-temporal adaptive rejection of hot clutter that is essentially non-stationary over the CPI [1,2].
17.2.1 SC STAP algorithm: analytic solution If hot clutter is absent, or is stationary over the CPI, spatial or spatio-temporal adaptive processing for hot clutter (interference) mitigation is not coupled with cold clutter processing. Indeed, without hot clutter, conventional processing that comprises nonadaptive beamforming followed by a non-adaptive weighted FFT is reasonably close to the optimum cold clutter processing, since the slow time Doppler properties of the cold clutter are quite homogeneous over the broad area illuminated by the transmit antenna. In this regard, HF OTHR are not much different from most ground-based radars operating in surface clutter conditions. The particular shape of the sea clutter Doppler spectrum that is observed over relatively long CPIs, typical for surface
target detection mode, makes the weighted FFT practically optimal for most Doppler frequencies. Theoretical solutions for electromagnetic scattering from ocean waves quite accurately model observed sea clutter spectra [17,18] (see Figures 17.9 and 17.10). For
Doppler spectra for linear (HV) polarisation basis gp 'Xjisuap JSMod
frequency= 14.80 MHz Phi inc. = 0 degs. Phi scat. = 160 degs. Theta inc. = 75 degs. Theta scat. = 80 degs. DWS = (*, 30, 16,8,4,4,4)
power density, dB
wind speed (knots)
Doppler, Hz
Figure 17.9
Doppler, Hz
Theoretical Doppler spectra of sea clutter showing dependence on polarisation and wind speed
power density, dB
sea clutter spectrum
Doppler, Hz residual spectrum
Figure 17.10
Comparison of observed and theoretic (smooth curve) sea clutter Doppler spectra
Doppler bin
Figure 17.11 A typical sea clutter Doppler spectrum for skywave radar
Doppler bin
Figure 17.12 A typical sea clutter Doppler spectrum for surface-wave radar, with target (arrowed) skywave OTHR, the fine (high-order) details of the sea clutter Doppler spectrum that are observed by surface-wave radar are blurred by the ionospheric channel, as one might expect. Figure 17.11 presents a typical spectrum for skywave radar, while Figure 17.12 shows a typical spectrum for SWR. The most important distinction here from airborne radar cold clutter is the absence of strong dependence between angle and Doppler, and hence no space or space-time adaptive processing is required in addition to the conventional sea clutter Doppler processing. In the case of stationary hot clutter (over the CPI), an adaptive spatial or spatiotemporal filter that is constant over the entire CPI is applied, and so the sea clutter (cold clutter) signal can again be processed at the output of SAP or fast time STAP by the same conventional weighted FFT. Obviously, time-invariant STAP retains the stationarity of the cold clutter.
In this subsection, we search for conditions under which some slow time variant of the fast time STAP that removes non-stationary hot clutter can also retain the stationarity of the sea (cold) clutter, so that the output cold clutter signal can once again be processed using a weighted FFT. More specifically, we demonstrate that under a certain broadly defined cold clutter model, there exists a time-varying fast time STAP filter wja, different for each PRI k and each range bin t, that maintains the same properties of the output cold clutter signal as some reference time-invariant STAP filter n>o. As we have already mentioned, the theoretical solution derived in this subsection is impractical, and therefore the reference filter n>o is not specified at this stage. Let the 7V-variate complex column vector xja be the antenna array snapshot corresponding to the &th repetition period and the tth range bin, i.e. x^ € CNxl where k represents slow time and t represents fast time. In general, we may define the snapshot x^ to be the mixture: Xkt=Skt+ckt+gkt+nkt
f o r * = 1, . . . , A f ;
t=l,...9T
(17.1)
where Sfo is the desired signal backscattered by a point target, cja is the radar signal backscattered by terrain or the sea surface (cold clutter), gkt is the total interfering signal, comprising direct path, specular and diffuse multipath scattering (hot clutter) and nkt is additive white noise of power P% with the correlation property: E{nktn%t,)=hk>Ztt>P2nlN
(17.2)
Typically, the target signal;% originating from some direction Oo takes the form: (17.3) where a is a complex Gaussian-distributed (scalar) amplitude, \jrt is the signal waveform, (DD is the target-signal Doppler frequency (in radians), >t is a range-dependent phase, and s(#o) is the array-signal manifold (steering) vector. The cold clutter snapshot Ckt is simulated here as a stationary random TV-variate Gaussian process with the correlation property: (17.4) (i.e. the range sidelobes are ignored) where Rck_kt is the cold clutter spatial covariance matrix at the slow time lag (k — kf), and RQ is the standard iV-variate Hermitian cold clutter spatial covariance matrix. The hot clutter signal gkt is assumed to be a convolutive mixture of J external interference signals jj£ , / ? = 1 , . . . , / , where each j ^ is a complex waveform radiated by the pth interferer at time (k/fpR -f t): (17.5)
where Q is the assumed maximum number of propagating paths for any of the J interfering sources, and Hu is defined below. The /-variate vectory^ = [jkt , . . . , j ^ ]T consists of complex waveforms radiated by all J sources at time (k/ fpR + t). As usual, we assume that: (17.6) where Pl is the /?th interfering signal power, i.e. the interferers are assumed to be Sp
mutually independent and temporally white (broadband). For diffuse multipath, Q is usually defined as the number of range bins covering some range interval AR involved in the scattering [5,9]: (17.7) where B is the signal bandwidth and c is the speed of light. For continuously distributed scatterers, it is more accurate to determine the interval between lines of constant path delay by the range grid, defined by the maximum hot clutter suppression for continuously distributed clutter. The latter itself is usually defined by the input hotclutter-to-noise ratio. The larger this ratio is, the smaller the separation should be. This phenomenon has been known for many years [19,20]; an example of the accurate evaluation of the number Q is given in Reference 9 for one particular hot-clutter-tonoise ratio. The NJ-variate matrix Hu introduced above in equation (17.5) represents the instantaneous total impulse response, relating the radiated interference signals jj£ to the received hot clutter snapshots gkt. Naturally, Hu incorporates the time-varying channel characteristics experienced by each propagation path. For diffuse multipath, the time variation of the synthetic wavefront from the scatterers along each line of constant path delay is defined by a differential Doppler shift along that line. More precisely, the (i, j)th element of the matrix Hu is a complex coefficient which is practically constant over the kth PRI, and is a measure of the contribution of the 7th interference source with relative delay I to the final hot clutter output at the zth receiver during the &th PRI. Although the number of interfering sources J is assumed to be strictly less than the number of antenna elements N, the total number of independent sources seen by the antenna array may approach JQ. Thus, even for modest J and Q, their product may exceed the number of antenna elements N. If: JQ > N
(17.8)
then a purely spatial approach will be generally ineffective. On the other hand, it is necessary to emphasise that the product JQ itself does not entirely define the best performance of the adaptive technique. For example, if all scatterers are situated in an extremely thin layer (of the ionosphere) with constant path delay (auroral scattering), then the rank of the covariance matrix would always be unity for a single interfering source. Therefore, in this case, pure SAP would give us the maximum efficiency for
distributed scattering cancellation, regardless of the number of such point scatterers. This phenomenon was described in Reference 20. Equally, pure SAP should deliver effective hot clutter cancellation for a single interferer if: 2«W
(17.9)
The existence of numerous point scatterers along the path of equal delay over a wide angle means that the main beam direction will be unaffected by SAP, even if this main beam intersects this path [20,9]. On the other hand, if there is a single scatterer along the path of equal delay that intersects the main beam, then pure SAP would not deliver effective suppression of hot clutter collected by the main beam. To be more accurate, under the condition Q <^C N9 such hot clutter could be rejected, but the main beam would also be affected. These quite simple considerations are necessary for a clear understanding of the correspondence between SAP and STAP in the hot clutter mitigation problem. Hereafter we will be dealing with the general case JQ > N, and since for airborne applications the direct path is always present, there should be at least two different paths, hence STAP will always be potentially more effective than SAP, even if SAP itself is effective for Q <^N. If the model of equation (17.5) is adopted for some fixed g , then we note that the number of taps L that are necessary for hot clutter suppression is defined directly by the model. Let us introduce the TVL-variate stacked vector gk0 consisting of L successive fast time samples gkt stacked on top of each other. According to equation (17.5), we may write:
(17.10)
or more compactly: (17.11) The number of rows in the stacked block matrix Hu is equal to NL9 and the number of columns is J(Q + L — 1). Thus the stacked noise-free hot clutter spatio-temporal covariance matrix: (17.12)
is always rank-deficient if: NL > J(Q +L-
1)
(17.13)
i.e. (17.14) This condition and the basic presentation of equation (17.10) are well known in the field of multiple-input-multiple-output (MIMO) systems [21]. For Q = I, STAP cannot outperform SAP, as expected. Although for L = 1, we find that the covariance matrix is rank-deficient if N > JQ, i.e. SAP alone can effectively suppress hot clutter. It is interesting to note that the number of taps L-Q usually recommended [5,9] is justified only for J = N/2. The number of taps necessary to cope with the maximum number of independent interferers 7 max = A^ — 1 is: Lmax > /max(Q " D
(17.15)
which again agrees with the results of MIMO studies [21]. Note that when the condition of equation (17.13) is satisfied, the rank of the covariance matrix R% is: rank [^f] < J(Q + L - I )
(17.16)
Thus the condition equation (17.13) actually generalises the well known condition for spatial suppression of J independent interferers: N>J
(17.17)
that obviously follows from equation (17.13) when L = Q = I. It is necessary to emphasise that the condition of equation (17.13) or equation (17.17) guarantees hot clutter rejectability irrespective of the signal-to-noise ratio (SNR) obtained as a result of such rejection. For independent interferers and pure SAP, the main beam would be affected when the direction of arrival of one of the interferers is close to the target signal direction. For STAP a similar unfortunate scenario may occur, when the target signal stacked vector i% (defined similarly to gkt and jkt) can be fairly accurately presented as a linear combination of the columns of Hu. Once again, note that the condition of equation (17.13) is applied to the model of equation (17.5); if diffuse multipath is present, then this model is valid only for correspondingly small hot-clutter-to-noise ratios. In the Mountain Top experiment, for example, the 20 to 30 range bins involved leads to the following minimum number of taps for an N = 14 element antenna array and a single interferer: ^min ;C ~rz
T
(17.18)
i.e. L m i n = 3. The real data processing of Reference 5 agrees with this estimate in an indirect way. Indeed, the transition from 5 taps to L = 15 taps only increases the so-called self-rejectability of the hot clutter samples involved in covariance matrix
averaging. This is understandable since for the 210 x 210 element covariance matrix, there are only about 200 range bins involved in averaging that are collected from the 130 to 180 km range region. On the other hand, the ranges corresponding to distances of 180 to 230 km that are not involved in averaging seem to be equally clean for both 5 and 15 taps. Obviously, in practical applications L should exceed Lm[n because of the additional constraints that are usually imposed. Suppose now that the number of taps L is properly chosen, and so our problem is to find an TVL-variate STAP filter w^t with which to process the JVZ-variate stacked vector Xkt' (17.19)
to form the scalar output zkt = Wkt*kt- We may similarly compute the scalar output sequences that correspond to the target signal,%, the backscattered cold clutter c^t and the hot clutter gkt: ~ ff ~ s
kt = WkfSkt
yut = ^kAt
(17.20)
x
kt = Wktgkt
The first set of constraints for the STAP vector w^t is designed to ensure the undistorted reception of the target signal. There are several approaches using deterministic linear constraints that can protect the desired signal against distortions caused by temporal adaptivity [5]. For example, we may introduce the set of L linear constraints such that:
w£ AL(Oo)= el
(17.21)
where the TVZ x L matrix Ai(Oo) = s(Oo) <S> IL (® represents the Kronecker product) and eL = [1,0, . . . , 0]T is the L-variate unit vector. Such constraints ensure the distortionless reception of the target signal from the expected DoA #o. If one requires the output signal s^t to be more robust in the presence of pointing errors, constraints on the steering vector derivatives might also be imposed, e.g.:
(17.22)
where (17.23)
and * " A2L(Oo)= e\t
(17.24)
In general, we may define an NL x q matrix Aq together with the g-variate vector eq to implement q linear constraints: H ^ A ^ O ) = e\
(17.25)
The cold clutter is assumed to be a stationary process (in a broad sense), and thus it may be approximated by the multivariate autoregressive (AR) model of order K: (17.26) Note that the scalar moving average (MA) model is usually considered to be an alternative to the AR model, however it is now known that even the multivariate MA model can be presented as a multivariate AR model of finite order under surprisingly mild conditions [22] (in the scalar case this order is infinite). In the above equation, Bj are the Af-variate matrices that are the solutions of the multivariate Yule-Walker equations [23]:
(17.27)
where (17.28) We are interested in applications where the cold clutter at least locally (i.e. over some small number of adjacent PRIs) can be described by the above AR model of comparatively low order K
belong to the same range bin t is: (17.30) Note that according to equation (17.28), the scalar innovative noise Wfa\kt *s white: (17.31) where M = diag [i?^, . . . , / ^ ] . Now, for any given constant (reference) STAP filter ivo, the cold clutter scalar output is given by: (17.32) A comparison of equations (17.30) and (17.32) makes it clear that their right-hand sides will be statistically identical, i.e. the correlation properties of these two slow time sequences are identical, if: (17.33) and provided that the white innovative noise wktkkt is of the same power as the white noise H > ^ | ^ , i.e.: (17.34) To be more precise, the difference between the scalar cold clutter slow time sequence ykt = w^Ckt (k = 1, . . . , M) at the output of the constant filter wo and the analogous sequence y^t = W^*-kt a t the output of the PRI-varying STAP filter w^t, tailored to this particular tth range bin, consists only of different scalar values of temporally white innovative noise, since: *hlh
* *"!*,
for k = 1, . . . , M
(17.35)
According to equations (17.31) and (17.34), these two white noise sequences are statistically identical, since they are of the same power. Thus for the /cth order AR cold clutter model of equation (17.26), the system of K linear data-dependent (stochastic) constraints of equation (17.33) plus the single square-type deterministic constraint equation (17.34) ensure that the beamformed cold clutter output sequence obtained with the time-varying STAP filter n>kt is a stationary /cth order scalar random AR process with the same autocorrelation properties as the process obtained for some time-invariant STAP vector woThis property is basic to the whole idea, and actually means that for a multivariate stationary AR process, the condition of stationarity on its filtered output does not necessarily imply that this multivariate filter is time-invariant. Obviously, for the stationarised output sequence ykt (k = 1, . . . , M) we can apply a slow time Doppler processing, the same as for the true stationary process ykt .
With regard to the above deterministic (equations (17.25) and (17.34)) and stochastic (equation (17.33)) constraints, we may now formally define the optimal fast time data-dependent (varying from PRI to PRI) STAP filter w^ as the solution of the following optimisation problem: find (17.36) where (17.37) with /?| is the hot clutter covariance matrix at the &th PRI and INL = IN® IL, subject to the constraints:
(17.38)
The set of deterministic constraints always includes one to specify the gain in the target direction equation (17.21): w%s(00) = *o S(O0) = 1
(17.39)
where s(Oo) = [s(#o), 0, . . . , 0 ] r , so the stochastic constraints may be rewritten in the standard form: (17.40) Now let us define the NL x K data-dependent (stochastic) constraints matrix Ckt as: Ckt = [Bick-u\B2h-2it\
• • • \BKck-K,t]
(17.41)
and then introduce an augmented NL x (ic-\-q) linear constraints matrix Akt such that: (17.42) thus the optimisation problem equation (17.36) may be presented in the standard form: find (17.43) subject to the constraints: (17.44) (17.45)
The standard approach to solving this problem is to write it in the form. Find:
(17.46) where A^ € C ^ + ^ x l are the Lagrangian multipliers and X^ is an additional multiplier associated with the constraint equation (17.45). Therefore the optimal solution is: (17.47) Techniques to compute the multiplier X^t are described, for example, in References 24 and 25. Naturally, the solution strictly exists only if the subspace defined by the constraints is not empty. For example, for a vector ivo oc (R$)~1S(0Q), no other vector can satisfy the conditions equations (17.39) and (17.45). The above techniques enable exploration of the admissible set of solutions, and it can be shown that an initial condition WQ which makes this admissible set non-empty always exists. Of course, the solution equation (17.47) is far from being useful in practical applications, however our current aim is simply to show the existence of a solution that enables non-stationary hot clutter rejection with a stationary scalar cold clutter output signal. In order to create a more realistic solution, we may immediately introduce several modifications. First, according to our assumptions, the spatial distribution of the cold clutter is comparatively broad; this means that the spatial covariance matrix is close to being diagonal, i.e. very well conditioned. Note that a poorly conditioned spatial covariance matrix means that the cold clutter is concentrated in a relatively narrow angular sector, in which case the above-mentioned traditional linear constraints may be applied. For well conditioned M, slow time fluctuations of the STAP vector Wkt would not significantly affect the product w^R^wja, since in most cases when the main beam is not affected, these fluctuations are negligible in the Euclidean sense (i.e. wkt R^Wkt oc WfaWkt). An analysis and discussion of such fluctuations for HF OTHR-type models was presented in Reference 14, where it was shown that the constraint equation (17.45) could be neglected in most cases. Thus we may simplify our optimisation problem equation (17.43) by retaining only the linear constraints equation (17.44). In that case, the following optimal solution always exists: (17.48) Note that even if we admit some slow time power fluctuations in the output innovative white noise component wkt\k0 it would not destroy the structure of the cold clutter whitening filter (for example). Indeed, according to our AR model equation (17.29),
the (K + l)-lag slow time filter: (17.49) is the whitening filter for the cold clutter (in fact, for any WQ). Due to our linear constraints equation (17.33), the filter: (17.50) processes the cold clutter data for the given range bin Ck-j,t U — 1, . . . ,K) in such a way that the output sample is also white noise, even though the power of this white-noise sample may differ from that produced by the reference filter H>O when the quadratic constraint equation (17.45) is dropped. For this reason, processing involving the whitening {K + l)-lag MTI coherent integration filter would have the same whitening filter for the cold clutter output signal for both the filters wjd andn>o. The second major simplification stems from restricting the general multivariate AR model of equation (17.26) to the simpler scalar multivariate model: (17.51) where Bj = bjljy. Such a collapsed solution to the Yule-Walker equations is obtained when in equation (17.27), the cold clutter spatial covariance matrix with yth slow time lag is of the special form: Rcj
=
rjRl
for J = O 9 . . . 9K
(17.52)
where Yj is the yth correlation coefficient of the scalar cold clutter process. Here the bj (j = 1, . . . , K) are given by the /c-variate Yule-Walker equation:
(17.53)
where Pi is the power of the innovative noise in the AR model. The physical meaning of this condition is that the properties of the cold clutter are the same for all antenna sensors, and that only the innovative noise power could be slightly different for different beam steering directions if R^ is not exactly a diagonal unitary matrix. This assumption is justified by the homogeneity of the cold (sea) clutter in both range and azimuth. Note that for airborne or spaceborne radars this assumption is not valid. The most significant simplification achieved by this model is that the stochastic constraints equation (17.33) now do not depend on particular values of bj, i.e.: w%h-j,t = ™oh-j,t
for J = I 9 . . . ,K
(17.54)
Next Page
This means that the specific cold clutter AR model does not need to be identified, and overestimation of the AR model order K simply means that additional STAP degrees of freedom are wasted. The physical meaning of this new set of constraints equation (17.54) is clear: in order to keep the scalar cold clutter output stationary, we need to keep fixed only a few slow time scalar output samples prior to the current one.
17.2.2 SC STAP algorithm: operational routines Apart from the obvious question about the relevance of the model of equation (17.51) to any real life cold clutter, there are more serious concerns about the applicability of the theoretical solution Wkt equation (17.48). How do we obtain the cold-clutter-only sample c^t if we can only measure the mixture of hot and cold clutter? If this sample were available, why not simply subtract it from the input mixture directly? How do we obtain wo? In this subsection, we demonstrate some practical solutions that approximate the ideal one, relying on specific properties of the hot and cold clutter environment, particularly for the HF OTHR context. Although these practical routines may be used in other applications, we stress that in any specific case there will be some best way to adapt the basic principles of the stochastic constraints idea, relying upon the features specific to that problem. 17.2.2.1 Pulse-waveform (PW) HF OTHR An operational approach for purely spatial adaptive processing was introduced in References 14,11,15 and 16; we now generalise this to the STAP case. In most PW HF OTHR systems, scattering from the earth's surface (cold clutter) and from targets occupies only a limited range within each repetition period. For skywave radar, the finite duration of the oblique backscattered signal (OBS) is dictated by the radar-ionosphere-target geometry. For surface-wave radar, natural attenuation leads to suppression of the backscattered signal far from the end of the repetition period. Naturally these scattered (cold clutter) signals are submerged in the hot clutter and are not directly available for treatment by stochastic constraints. Meanwhile, the hot clutter signal usually occupies the entire repetition period (the entire CPI, actually) and in most cases some region within the PRI can be easily identified as containing hot clutter only. Hence this operational routine relies on a priori information on the distribution of the cold-clutter-free ranges within the PRI. As we already mentioned in the introduction to this section, the non-stationarity of the hot clutter forces us to estimate its properties within the actual PRI &, or its immediate vicinity, making the traditional passive radar mode practically useless for hot clutter estimation. We have already discussed the local stationarity property of the hot clutter over the limited number of consecutive PRIs. Assume, for example, the second-order (K = 2) cold clutter AR model of equation (17.51). We form the fast time STAP filters w™ in the order k = 1, . . . , M — /c; the first of which (k = 1) is stochastically unconstrained, and thereafter the filters are stochastically constrained. We construct this initial STAP
Previous Page
This means that the specific cold clutter AR model does not need to be identified, and overestimation of the AR model order K simply means that additional STAP degrees of freedom are wasted. The physical meaning of this new set of constraints equation (17.54) is clear: in order to keep the scalar cold clutter output stationary, we need to keep fixed only a few slow time scalar output samples prior to the current one.
17.2.2 SC STAP algorithm: operational routines Apart from the obvious question about the relevance of the model of equation (17.51) to any real life cold clutter, there are more serious concerns about the applicability of the theoretical solution Wkt equation (17.48). How do we obtain the cold-clutter-only sample c^t if we can only measure the mixture of hot and cold clutter? If this sample were available, why not simply subtract it from the input mixture directly? How do we obtain wo? In this subsection, we demonstrate some practical solutions that approximate the ideal one, relying on specific properties of the hot and cold clutter environment, particularly for the HF OTHR context. Although these practical routines may be used in other applications, we stress that in any specific case there will be some best way to adapt the basic principles of the stochastic constraints idea, relying upon the features specific to that problem. 17.2.2.1 Pulse-waveform (PW) HF OTHR An operational approach for purely spatial adaptive processing was introduced in References 14,11,15 and 16; we now generalise this to the STAP case. In most PW HF OTHR systems, scattering from the earth's surface (cold clutter) and from targets occupies only a limited range within each repetition period. For skywave radar, the finite duration of the oblique backscattered signal (OBS) is dictated by the radar-ionosphere-target geometry. For surface-wave radar, natural attenuation leads to suppression of the backscattered signal far from the end of the repetition period. Naturally these scattered (cold clutter) signals are submerged in the hot clutter and are not directly available for treatment by stochastic constraints. Meanwhile, the hot clutter signal usually occupies the entire repetition period (the entire CPI, actually) and in most cases some region within the PRI can be easily identified as containing hot clutter only. Hence this operational routine relies on a priori information on the distribution of the cold-clutter-free ranges within the PRI. As we already mentioned in the introduction to this section, the non-stationarity of the hot clutter forces us to estimate its properties within the actual PRI &, or its immediate vicinity, making the traditional passive radar mode practically useless for hot clutter estimation. We have already discussed the local stationarity property of the hot clutter over the limited number of consecutive PRIs. Assume, for example, the second-order (K = 2) cold clutter AR model of equation (17.51). We form the fast time STAP filters w™ in the order k = 1, . . . , M — /c; the first of which (k = 1) is stochastically unconstrained, and thereafter the filters are stochastically constrained. We construct this initial STAP
filter w™t using the hot-clutter-only training samples from some common range region within the first three consecutive PRIs: (17.55) where Aq(#o); Cq are the deterministic constraints from equation (17.25); and: (17.56) Due to the local stationarity of the hot clutter, we believe that the hot clutter samples are properly rejected by the STAP filter w™ over all ranges of these three initial PRIs, so that the scalar output for operational ranges consists mainly of cold clutter, noise and possible targets, i.e.: (17.57) Note that iv™ is a function only of A:, not of fast time t. For the next adaptive filter w™ (now range-dependent and stochastically constrained), we apply the sliding-window average to the next three repetition periods: (17.58) to again ensure a proper hot clutter suppression over these three PRIs. Clearly the moving average across (/c -f 1) adjacent repetition periods also ensures that /c covariances / ^ 1 are common to each successive average R™. The system of K stochastic constraints corresponding to equations (17.57) and (17.58) may then be written as: (17.59) The right-hand sides of these constraints consist of the cold clutter samples mainly due to equation (17.57); whereas equation (17.58) ensures that the filter w™ properly processes the cold clutter samples only, since the hot clutter component is to be rejected. Thus the trick is that we do not require the cold-clutter-only samples c^ in order to introduce the constraints equation (17.54); what we do require is the corresponding scalar output, assuming that the hot clutter component is properly rejected. For an arbitrary slow time k, the operational solution: for k = 2, . . . , M - 2 (17.60) is defined by the sliding-window covariance matrix: (17.61)
and the system of stochastic constraints: (17.62) where Z^ = [xkt\xk+\j]> The latter simply means that: (17.63) The generalisation of these three equations for arbitrary /c is obvious. The efficiency losses compared with the ideal solution w^t are introduced by hot clutter covariance matrix averaging and by the finite efficiency of the hot clutter rejection of the input samples x^ used in the constraints. Note that the hot clutter quasistationarity over the (/c + 1) consecutive repetition periods simply means that the signal eigensubspace dimension of the resulting averaged covariance matrix should still be less than the total number of degrees of freedom in the STAP filter (NL). Even when the non-stationarity is significant, proper hot clutter rejection can be achieved with this averaging provided: (K + I)[J(Q + L - I)] < NL
(17.64)
Recall from equation (17.13) that J(Q + L — 1) is the rank (signal eigensubspace dimension) of the intrasweep spatio-temporal hot-clutter-only covariance matrix. Obviously the number of taps L that guarantees this condition is: (17.65) with the additional necessary condition: N>(K+\)J
(17.66)
To summarise, using the stated hot and cold clutter models and operational routines, the above conditions specify the proper choice of purely spatial (N) and total (M,) degrees of freedom which guarantee high hot clutter rejectability under almost arbitrary non-stationarity of the hot clutter. Obviously, we expect that if the AR model is an adequate description of the cold clutter, then we may apply the traditional weighted FFT to the output signal with no degradation imposed by the time-varying STAP filter tffc. Naturally, the above approach may also be applied to CW OTHR provided there is access to some hot-clutter-only range bins within each PRI. 17.2.3
SC STAP algorithm: efficiency analysis by simulation results
In this subsection, we will first specify the models for hot and cold clutter signals that we adopt for numerical simulations. The problem to be addressed is that most existing models for ionospherically propagated signals do not incorporate spatial fluctuations [26]. Second, we shall illustrate the efficiency of the theoretical solution
of equation (17.48), since with simulated data we have access to the hot and cold clutter separately. At this stage, we use the true values of the hot-clutter-plus-noise covanance matrix Rf and the true cold clutter snapshots Cfa. Third, we present the results of an operational routine using the true hot clutter covariance matrix. In this way, we can delineate the losses imposed by the SC STAP operational implementation from those due to covariance sample averaging. Finally, the finite sample covariance matrix estimation is introduced to complete the fully operational routine analysis. 17.2.3.1
Generalised Watterson model for ionospherically propagated hot clutter For / interfering sources and Q different propagation paths for each source, the overall hot clutter snapshot v e c t o r ^ of equation (17.1) may be presented as a superposition of JQ independent hot clutter components gpt : (17.67) Each individual component is modelled by an N-variate spatio-temporal random process [27]: (17.68) Here Spt is an N x Af diagonal matrix whose elements are defined by the array steering vector corresponding to the DoA of the €th mode propagating from the pth source. Similarly, Awpi is the regular component of the ionospheric Doppler shift, api is the RJVIS amplitude, and recall that yPt is the transmitting waveform of the pth external interferes Spatial and temporal fluctuations of the propagation media that are responsible for the non-stationarity of the hot clutter are both represented by the vector v f eCNxl: (17.69) where ep is the innovative temporally white noise of power fipi, with E{ep ep } = QPt e QNXN b e m g m e covariance matrix of spatial fluctuations. Similarly to the cold clutter model of equation (17.51), the scalar coefficients pphere are defined by the Doppler spectrum of fluctuations for the £th propagation path of the pth interfering source. Note that the scalar version of this equation is the well known Watterson [26] model, recommended by CCIR for HF applications. In order to introduce spatial fluctuations in addition to temporal ones, the simplest Markov model is adopted: (17.70)
where £?• is the 7th element of the innovative noise vector ek , Y^ j+\ is a sample of white noise of unit power and tf. is the spatial correlation coefficient for the £th hot clutter mode of the /?th interfering source: (17.71) The value of the coefficient may be calculated by: (17.72) where dj is the jth sensor position in the antenna array and vpi is a damping factor. Under the proposed model, the Af x N spatial covariance matrix for the hot clutter sample gkt may be written as the Hadamard (element-wise) product: (17.73) where Cpl eCNxN
has the elements:
Cfk =exp[-vpe\dj-dk\]
foTJ,k=l,...,N
(17.74)
According to this model, any quasi-instantaneous spatial covariance matrix of a given mode is of first rank, i.e. : (17.75) On the other hand, the spatial sample covariance matrix averaged over the entire coherent processing interval ^2k=\^2t=\SktSkt tends to the well conditioned matrix product on the right-hand side of equation (17.73). Note that the traditional plane wave model used for the construction of the matrix Spi is not suitable for a highly perturbed and diffuse propagating media. The following generalisation of the Karhunen-Loeve expansion is proposed for diffuse scattering along the line of constant path delay I: (17.76) where Fpi e £NxN i s the covariance matrix of the €th constant path delay scattering, averaged over the infinite temporal interval: (17.77) Obviously Fpi depends on the geometry of the equal delay path. Also note that the Gaussian-distributed vectors vk are all statistically independent over the set of different modes I and sources p. The above complete description introduces a general model for non-stationary hot clutter that links the slow time spatial wavefront fluctuations with the well
known temporal slow time (Doppler) fluctuations. Moreover, the final generalisation equation (17.77) also incorporates diffuse multipath scattering. A quite remarkable correspondence between this model and real ionospherically propagated signals has been demonstrated [27] in terms of purely spatial adaptive processing. 17.2.3.2 Cold clutter model and other simulation parameters Standard models can be used to specify the stationary cold clutter within the adopted AR model of equation (17.51). For example, terrain scattering can be modelled by the first-order Markov chain: (17.78) where b\ = —r\ is the interperiod temporal correlation coefficient introduced in equation (17.51). A single stochastic constraint is required for this model. We shall present results of simulations conducted using the second-order AR model of HF sea scattering: (17.79) where fef} = -1.9359, bf = 0.998 and P$ = 0.009675 [14,28]. In a slight variation of equation (17.51), here the innovative white noise %kt is introduced with unit power, weighted by the scalar P§. The spatial properties of the backscattered cold clutter are defined by the transmitting antenna pattern; in our model the elements of the JV-variate random complex vector ^kt are given by the first-order spatial AR process: (17.80) with scalar innovation y^_ j uncorrelated over ranges t, repetition periods K and spatial channels j , i.e. (17.81) Given these models for hot and cold clutter, we now specify the main parameters. For a typical skywave OTHR ship detection mission, we may set fyy — 10 Hz, CPI=25.6s, and r^ = 0.5 [14,11,16,29]. We simulate a uniform linear antenna array (hence ^ — K^ = . . . = ?jy_i) with Af = 16 sensors at half-wavelength spacings, beam steered in the direction #o = 0. In order to demonstrate the superiority of STAP over SAP, we should choose a scenario where pure SAP is ineffective. One such obvious scenario occurs with the lack of purely spatial degrees of freedom, when JQ > N leads to a rather large number of taps for the operational algorithm. Indeed for / = 4, Q = 4 and the second-order AR model of cold clutter (i.e. /c = 3 adjacent repetition periods are involved in covariance matrix averaging), the number of required taps
Table 17.1
Hot clutter simulation parameters Mode 1 Mode 2 Mode 3 Mode 4
Interferer direction of arrival (<91£) (degrees) Hot-clutter-to-noise ratio (HCNR) (dB) Temporal correlation coefficient (pu) Spatial correlation coefficient (C u )
0.5 30 1.00 1.00
20.5 25 0.90 0.91
39.3 20 0.88 0.90
44.9 35 0.91 0.90
according to equation (17.65) is Lmin = 9, which leads to somewhat time-consuming calculations. Instead, we have chosen a single-interferer scenario with one (direct) path affecting the main beam, with four propagating modes: J=I and Q = 4. Here only STAP can effectively mitigate hot clutter, despite the excessive number of purely spatial degrees of freedom. The hot clutter DoAs (0u), spatial ( f u ) and temporal (pu, W = 1) parameters of each propagation mode are listed in Table 17.1. Clearly L > 2 taps are sufficient to effectively cancel this interferer; we choose L = 3. For this setup, the rank of the hot clutter covariance matrix averaged over (K+ 1) = 3 repetition periods is still significantly less than the total number of degrees of freedom: (K + I)J(Q + L - 1) = 18 « 48 = NL
(17.82)
17.2.3.3 SC STAP potential efficiency analysis We begin with an investigation of the potential efficiency of the theoretically ideal SC STAP wkt of equation (17.48) in order to compare it with the existing approaches and (later) simulation results for the operational routines. For the adopted hot clutter model, the true covariance matrix B^ (k = 1, . . . , 256) is a block Toeplitz Hermitian matrix with the structure:
R? = Toep[R?0, Ffx,..., * £ _ , ]
(17.83)
where, according to the model equation (17.67): (17.84) and for q = 1, . . . , L — 1: (17.85) Obviously when the tap delay exceeds the maximum multimode propagation delay we have (17.86)
We shall compare the ideal SC STAP hot clutter rejection with the following traditional processing schemes: •
conventional (non-adaptive) beamformer, with output signal-to-hot clutter ratio (SHCR) given by: (17.87)
•
optimal spatial-only time-dependent beamformer: "SAP = Kno)~ls(°o)
(17.88)
with SHCR given by: qSAP= S11Oo)(R^0)-1 •
S(O0)
(17.89)
standard fast time (stochastically unconstrained) STAP with time-dependent intra-PRI adaptivity: (17.90) (17.91)
•
time-independent STAP with the hot clutter covariance matrix averaged over the entire CPI: (17.92) (17.93) (17.94)
Figure 17.13 illustrates the SHCR simulation results for each of the above processing schemes. Note that when hot clutter is absent, it is only the additive white-noise component (P^ = 1) that limits the SNR, in this case to log10 q = 12 dB. As expected, the performance of the unconstrained STAP qSTAP is extremely effective, since the output signal-to-hot-clutter-plus-noise ratio is around 1OdB across all repetition periods. Purely spatial adaptive processing qSAP is naturally less effective, since the main beam is adversely affected by the direct propagation mode. Integrated STAP qSTAP loses about 30 dB in efficiency compared with the qSTAP benchmark. Meanwhile, the proposed SC STAP output: (17.95) corresponding to the ideal theoretic solution: (17.96)
signal-to-hot-clutter ratio, dB
repetition period
Figure 17.13
Comparison of SHCR performance of the ideal SC STAP processing with several traditional processing schemes
demonstrates remarkable performance: the two stochastic constraints used do not significantly reduce the hot clutter rejectability compared with the unconstrained processing. The total improvement in hot clutter rejectability is in fact about 60 dB compared with the non-adaptive conventional beamformer output qCBF. Next we illustrate the impact of these schemes on the output Doppler spectrum of the cold clutter, with and without a target. Figure 17.14a shows the output of the Blackman-weighted FFT Doppler spectra for a range bin containing only cold clutter, and Figure 17.14b shows identical simulation results, but with the addition of a target signal. We see that, despite the quite simplistic second-order AR model employed, this simulated Doppler spectrum reasonably corresponds to the actual sea clutter Doppler spectrum for skywave OTHR, with two prominent first-order Bragg lines and higher-order components in their vicinity (c.f. Figure 17.11). Both Figures display the Doppler spectra obtained for unconstrained STAP wSTAP, averaged STAP iv-^p and SC STAP wsc. We see that unconstrained STAP is completely inappropriate in this case. Subclutter visibility (the main-peak-to-sidelobe ratio) is completely destroyed (^20 dB). Obviously typical targets cannot be detected at all, despite the perfect hot clutter rejection of this filter. As expected, averaged STAP demonstrates a CBF-type cold clutter spectrum, since such a spatial filter is constant over the entire CPI; unfortunately it gives huge losses in hot clutter rejectability, as we saw in Figure 17.13 (qSTAP)- Meanwhile, SC STAP for both non-target (Figure 17.14a) and target (Figure 17.14b) scenarios retains subclutter visibility (SCV) at about the same level as the averaged STAP processing. Thus the remarkable potential efficiency of the stochastically constrained STAP approach is demonstrated. Next we consider the performance of the corresponding operational routine.
normalised Doppler frequency
normalised Doppler frequency
Figure 17.14
a weighted FFT Doppler spectra for a range bin containing only cold clutter b identical simulation parameters as for a but with a target present
17.2.3.4 Operational approach: true hot clutter covariance matrix For the routine described in Section 17.2.2.1, with cold-clutter-free fast time training samples available within each PRI, we begin with simulations involving the true hot clutter covariance matrix i^ 1 . In this way, we can delineate the dynamic losses introduced by the operational approach (using three adjacent repetition periods and the second-order AR cold clutter model) from the traditional losses caused by the finite sample size in the hot clutter covariance matrix estimation. The latter are analysed in the next subsubsection.
signal-to-hot-clutter ratio, dB
repetition period
Figure 17.15
Several stages in the SHCR analysis of the SC STAP operational routine
In the second-order sea scattering model of equation (17.79), three consecutive PRIs are seen in the sliding window described by equation (17.61). The initial STAP filter is given by equation (17.55), and the operational SC STAP filter is defined by equation (17.60) with stochastic constraints equation (17.62), which involve the range bins containing a mixture of hot and cold clutter and possible target. Figure 17.15 illustrates the losses thus obtained with the curve labelled qz • In comparison, the curve labelled qy shows the result when the cold-clutter-only data is used to construct the stochastic constraints. We see that this moving average does not degrade the hot clutter rejectability more than 3 or 4dB, i.e. the improvement from the CBF processing is still around 55 dB. Note that this result relies on the proper choice of both spatial and temporal degrees of freedom (see equation (17.82)). These simulations show that while the averaging of R^1 over the entire CPI devastates the rejectability, a moving three-PRI average does not cause significant dynamic losses, assuming a proper choice of the SC STAP filter architecture (N and L). In order to demonstrate the ability of the operational constraints equation (17.62) to maintain the output cold clutter properties, Figure 17.16a shows the Doppler spectra obtained for the operational range bin (i.e. including hot and cold clutter), processed by the above operational filter, but still using the true hot clutter covariance matrix F?" at this stage. By comparison, the same range bin has again been processed by the ideal SC STAP filter, where the constraints were computed using the true cold-clutter-only samples (curve labelled wy). In this case, there is no significant difference between the Doppler spectra for the ideal and operational solutions. Therefore the operational routine that uses the mixture of hot and cold clutter samples for the stochastic constraints performs exceptionally well, providing that each hot clutter covariance matrix Rf1 is properly estimated. Figure 17.16b emphasises this conclusion for the simulation including the target.
normalised Doppler frequency
b
Figure 17.16
normalised Doppler frequency
a several stages in the analysis of the weighted FFT Doppler spectra for a range bin containing only cold clutter b identical simulation parameters as for a but with a target present
17.2.3.5 Operational approach: finite sample size considerations The final step towards a truly operational routine is the replacement of the true hot clutter covariance matrix R^n by its sample estimate R^n. We do this by averaging over all range bins that are free of cold clutter contamination, for each PRI. For both HF OTHR and airborne radar, the size of the hot clutter training sample used
to form Rf is a serious issue. In Reference 30, devoted to airborne radar STAP, the interval of training ranges within each PRI is referred to as 'a period where the target detection performance degrades severely and the receiver is essentially 'blind''. A similar problem exists in HF OTHR, where the number of cold-clutter-free ranges is limited. This motivates our investigation into means of reducing the length of the training sequence necessary for hot clutter cancellation. One significant contribution has already been made by equation (17.14), where the minimum number of taps Lm[n is given in terms of N, Q and / . According to the famous result of Mallet, Reed and Brennan [31], in order to obtain an average 3 dB loss in SNR compared with the optimum, the number of independent samples T used for estimation of some NL variate covariance matrix is: T3 > 2NL
(17.97)
Apart from being too large a number in most cases, this estimate does not leave room for the trade off between the number of spatial and temporal degrees of freedom. Less well known are the results of References 32 to 35, that show that by proper diagonal loading: (17.98) the number of snapshots T sufficient for 3 dB losses may be reduced to: T3 > 2 rank [flf]
(17.99)
A detailed analysis of convergence and convergence rate appears in Section 17.4. Recall that the rank of R^ is equal to the signal eigensubspace dimension of Rf. Where Rf is not averaged over adjacent PRIs, this dimension is defined by equations (17.13) and (17.16), i.e.: (17.100) This first means that the blind zone can be dramatically reduced (compared with using T > 2NL). Secondly, this is analytical evidence for the conclusion made in Reference 30. Indeed, according to equations (17.99) and (17.100), in order to decrease the blind zone, the number of spatial degrees of freedom Af should be maximised, followed by the corresponding reduction in temporal degrees of freedom L. For our model with L = 3 and averaging over three consecutive PRIs, the rank of the averaged covariance matrix Rav is 18. (Note that since one (direct) mode is non-fluctuating, its contribution to the rank of the averaged matrix does not depend on the number of averaged PRIs.) Thus, for this scenario, significant savings are achieved by the revised requirement T >2 rank [^f 1 ] = 36, rather than the standard sample volume T > 2NL = 96.
According to equations (17.61) and (17.98), the operational estimate: (17.101) involves 3T training samples in total. Thus averaging over the (slightly) nonstationary hot clutter training samples belonging to three consecutive PRIs reduces the random errors for this estimate. Of course, the maximum reduction is obtained when the actual non-stationarity could be ignored: (17.102) Under such extreme circumstances, the total number of necessary training samples (T = 36) could even be distributed over these three PRIs. When there is significant non-stationarity, however, there will be some optimal trade off in the number of PRIs involved, given that the sample volume T is fixed for each PRI. Indeed by increasing the number of PRIs involved in averaging (K) we increase the signal subspace dimension of the true covariance matrix: (17.103) On the other hand, we somewhat decrease the random error by use of the estimate: (17.104) Clearly, the optimum number K is not identical to the minimum number. Therefore with simulations that involve the true covariance matrices, we expect covariance matrix averaging to introduce some extra losses in hot clutter rejectability compared with the ideal case of using Rf. On the other hand, the fully operational routine using a finite training sample volume per PRI gains in the reduction of stochastic errors due to the averaging process. The curves labelled qosc and wosc in Figures 17.15 and 17.16 demonstrate the efficiency of the fully operational routine, involving a loaded sample covariance matrix, equations (17.98) and (17.101). Comparing the output Doppler spectra obtained for range bins with and without targets, we see that the operational SC STAP routine achieves both highly efficient target detection and SCV protection. The curve in Figure 17.15 indicates the signal-to-hot-clutter ratio for the operational filter across all 256 PRIs. Although the 36 training samples are not strictly independent (due to multimode propagation), additional stochastic losses are negligible. These results show that the operational SC STAP approach can deliver remarkably good performance. Since each range bin is processed by an individual fast time STAP filter (due to the range-dependent constraints A^, despite the fact that the covariance matrix estimate R™ is the same for all ranges), the cost of such performance is a higher computational load. However, for HF OTHR applications at least, where the
number of range bins is not large and the Nyquist rate fyy is measured in dozens of kHz, this load is reasonable. We would also like to discuss the important matter of so-called self-rejectability, that occurs when the adaptive filter is applied to the same training range bins that have been used to construct the sample covariance matrix. For a standard (nonloaded) sample covariance matrix with T = 2 x filter size, it was demonstrated in Reference 36 that we can obtain a 6 dB average improvement, instead of 3 dB losses, if the same sample covariance matrix is used in loss factor calculations instead of the true one. Hence the results of adaptive STAP filtering of the same hot clutter samples that have been used for adaptive filter estimation (see e.g. Reference 5) can be easily misinterpreted, especially when the sample number is not large. In the example of Reference 5, nearly 200 ranges seem to have been involved in traditional averaging of the 14 x 15 = 210 variate covariance matrix.
17.2.4 SC STAP algorithm: efficiency analysis by real data processing The most important remaining question to consider is the adequacy of the scalar low-order AR model that is applied to real cold clutter. This question has been partly discussed in References 11 and 1 with cold clutter data collected by skywave and surface-wave facilities in different countries. Here we present some new real HF OTHR clutter data to illustrate the ability of the proposed SC STAP algorithm to retain the initial SCV. The first set of experimental data was collected in 2000 by the ComRad experimental facility, developed by the Australian Cooperative Research Centre for Sensor Signal and Information Processing (CSSIP) under a contract with Telstra Applied Technologies in collaboration with the Australian Defence Science and Technology Organisation (DSTO). Rather than using the same narrowband deramping receivers from the previous experimental facility Iluka [37], ComRad employed a 4OkHz 32channel digital receiver that enabled digital matched filter (range) processing. Like Iluka, ComRad comprised a transmit site at Stingray Head (65 km south-west of Darwin) and a receive site at Gunn Point (30 km north-east of Darwin) [38]. The receiving system (Figure 17.8) was based on a two-dimensional dual-polarisation antenna array of 96 elements that permitted experiments in various receive array configurations of the 32 input signals. The example below was recorded in the simplest configuration: the 32 doublet-monopole elements comprising the front uniform linear array some 500 m long, with a linear frequency modulated constant waveform (LFMCW) of repetition frequency fNy = 10 Hz and a CPI of 102.1 s, so that the total number of sweeps (repetition periods) is M = 1021. In order to concisely demonstrate the efficacy of SC STAP for both skywave and surface-wave applications, we have used this same data set but ingesting different numbers of sweeps: M = 127,255 (typical of a skywave CPI) and M = 511,1021 (typical of a surface-wave CPI). This data set contains some environmental noise, and has some 30 range cells with no backscattered signal to act as the supervised training region. All the Figures presented here as Doppler spectra correspond to one particular range and azimuth resolution cell. Figure 17.17 shows the output sea clutter Doppler
Doppler bin
Figure 17.17
Output Doppler spectra of CBF (broken line) and unconstrained instantaneous SAP (solid line) for 12.7 s CPI
Doppler bin
Figure 17.18
Output Doppler spectra of CBF (broken line) and SC SAP (solid line) for 2 stochastic constraints and 12.7 s CPI (K = 2, K = 3)
spectrum for conventional beamforming (CBF) (broken line) versus unconstrained instantaneous SAP (solid line) using intra-PRI adaptivity equation (17.88), for M= 127. As expected, unconstrained SAP leads to a dramatic degradation in SCV. Figures 17.18-17.20 show the same CBF output (broken lines) compared with SC SAP (i.e. L = 1) for the order of the cold clutter AR model (equal to the number of stochastic constraints) /c = 2, 3 and 5, respectively. All these Figures relate to the case where the number of PRIs involved in averaging the interference covariance matrix was chosen to be minimal: K = /c -+- 1. We see that for this relatively short CPI of 12.7 s, SC SAP gives a quite significant SNR improvement of about 9 dB, but only when the number of stochastic constraints K is sufficiently high. The choice K = 2 is
Doppler bin
Figure 17.19
Output Doppler spectra of CBF (broken line) and SC SAP (solid line) for 3 stochastic constraints and 12.7 s CPI (K = 3, K = 4)
Doppler bin
Figure 17.20
Output Doppler spectra of CBF (broken line) and SC SAP (solid line) for 5 stochastic constraints and 12.7 s CPI (K = 5, K = 6)
clearly too low to accurately reproduce the conventional output, whereas K = 3 or 5 yields a much better match without a noticeable degradation in SCV. The results of similar processing with M = 255 sweeps are illustrated in Figures 17.21-17.24. Figure 17.21 again shows the extreme degradation in SCV caused by fluctuations in unconstrained instantaneous SAP equation (17.88). Figures 17.2217.24 demonstrate that here five stochastic constraints is ample to obtain a reasonable match between SC SAP and CBF. Comparison with the previous set of results for 12.7 s CPI shows that, since longer CPIs reveal the fine structure of the sea clutter spectrum, they require more stochastic constraints for satisfactory (distortionless) processing. The choice K = 2 could be deemed acceptable for M = 127, however for M = 255, this results in a significant distortion of the energetic components of the Doppler spectrum. Note that external noise reduction is no longer apparent in
Doppler bin
Figure 17.21
Output Doppler spectra of CBF (broken line) and unconstrained instantaneous SAP (solid line) for 25.5 s CPI
Doppler bin
Figure 17.22
Output Doppler spectra of CBF (broken line) and SC SAP (solid line) for 2 stochastic constraints and 25.5 s CPI (K = 2, K = 3)
M = 255 because of the now noticeable broadband (ionospheric) clutter component. For this reason, we concentrate on the quality of sea clutter reproduction for M = 255,512,1021. Sets of similar processing results for the surface-wave-related cases M = 511 and 1021 appear in Figures 17.25-17.27 and Figures 17.28-17.29, respectively. We find that in both cases reasonable sea clutter accuracy is obtained using K = 10. Although this number is not low, it equates to an interference covariance matrix averaging interval of only 1.1s, instead of the 102.1 s required by time-independent SAP where the matrix is averaged over the entire CPI. These real data processing results demonstrate that, despite the complexity of global models that properly describe real HF sea clutter [39,40,18], we may successfully apply simple models locally without major degradation.
Doppler bin
Figure 17.23
Output Doppler spectra of CBF (broken line) and SC SAP (solid line) for 3 stochastic constraints and 25.5 s CPI (K = 3, K = 4)
Doppler bin
Figure 17.24
Output Doppler spectra of CBF (broken line) and SC SAP (solid line) for 5 stochastic constraints and 25.5 s CPI (K = 5, K = 6)
Finally, Figures 17.30-17.33 present similar processing results for a relatively short 32-channel skywave radar CPI, where no clutter-free range cells were recorded, so we used the same interference data set above recorded by the ComRad surfacewave facility. As expected, unconstrained SAP is completely inappropriate, and five or so stochastic constraints satisfactorily retain the quality of the sea clutter spectrum that is required for reliable target detection.
17.2.5 Summary In this section, we have identified the similarity between HF OTH and airborne radars, with respect to the problem of multimode interference (hot clutter) rejection utilising STAR The problem of non-stationary hot clutter cancellation via fast time STAP has
Doppler bin
Figure 17.25
Output Doppler spectra of CBF (broken line) and unconstrained instantaneous SAP (solid line) for 51.1s CPI
Doppler bin
Figure 17.26
Output Doppler spectra of CBF (broken line) and SC SAP (solid line) for 5 stochastic constraints and 51.1s CPI (K = 5, K — 6)
been formulated, mainly relying upon recently investigated properties of HF signals propagating through perturbed ionospheric regions. It has been demonstrated that the standard fast time STAP algorithms are inappropriate for the removal of non-stationary hot clutter when the backscattered signal (cold clutter) properties must be preserved in some way. Fast time (intra-PRI) STAP filters that vary from PRI to PRI in an unconstrained fashion cause a severe degradation in the cold clutter Doppler spectral properties (namely, subclutter visibility). On the other hand, the application of time-independent STAP algorithms, for which the hot clutter spatio-temporal covariance matrix is averaged over a relatively long CPI, severely degrades the hot clutter rejectability. The only previously known approach that can theoretically perform simultaneous hot and cold clutter rejection is the uniform STAR This has a problem size defined by
Doppler bin
Figure 17.27
Output Doppler spectra of CBF (broken line) and SC SAP (solid line) for 10 stochastic constraints and 51.1s CPI (K = 10, K = 11)
Doppler bin
Figure 17.28
Output Doppler spectra of CBF (broken line) and unconstrained instantaneous SAP (solid line) for 102.1 s CPl
NML, where Af is the number of antenna receivers, M is the number of PRJs within each CPI and L is the number of fast time taps. For all HF OTHR applications, where typical values are N = 16 to 32 and M = 128 to 256, this approach is practically useless, simply because of the lack of sufficient training samples needed to adapt the system. With the goal of overcoming these limitations of the standard approach, this study has investigated the theoretical existence of slow time-varying fast time spatiotemporal filters that can provide high hot clutter rejectability and stationarity of the hot clutter scalar output signal. We have demonstrated that for a particular multivariate scalar low-order AR model of the cold clutter, data-dependent (stochastic) constraints can retain the stationarity of the cold clutter scalar output. In its purely theoretic form, this SC STAP solution is quite impractical since these stochastic constraints rely on
Doppler bin
Figure 17.29
Output Doppler spectra of CBF (broken line) and SC SAP (solid line) for 10 stochastic constraints and 102.1s CPI (K = 10, K = 11)
Doppler bin
Figure 17.30
Output Doppler spectra of CBF (broken line) and unconstrained instantaneous SAP (solid line) for skywave data
access to the cold-clutter-only snapshots, which are usually contaminated by hot clutter. Operational routines that implement the main principles of the proposed theoretic SC STAP were then introduced (in the context of HF OTHR applications). Here the a priori known distinctions between the hot and cold clutter distributions in range are exploited. More specifically, the ranges that are always free of cold clutter (in pulse waveform radar systems) are utilised for accurate hot clutter covariance matrix estimation. The efficiency of the SC STAP approach has been demonstrated by simulations, conducted for typical HF OTHR scenarios. A generalised Watterson model of ionospherically propagated HF signals has been presented; this model introduces both temporal (Doppler) and spatial fluctuations. For diffuse scattering, the applicability of this model was discussed in the
Doppler bin
Figure 17.31
Output Doppler spectra of CBF (broken line) and SC SAP (solid line) for skywave data and 2 stochastic constraints (K = 2, K = 3)
Doppler bin
Figure 17.32
Output Doppler spectra of CBF (broken line) and SC SAP (solid line) for skywave data and 5 stochastic constraints (K = 5, K = 6)
case where the scattering involves equal numbers of propagation modes and distinct scattering layers of equal path delay. For a given number of interfering sources and propagation modes, the minimum number of fast time taps required for effective hot clutter rejection has been specified. Also, we have defined the rank of the hotclutter-only covariance matrix in terms of the number of sources, scattering layers, antenna sensors and taps. This determines the minimum sample volume necessary for accurate hot clutter covariance matrix estimation. AR models that encompass the main properties of sea scattering have been used for cold clutter simulations. These simulations have demonstrated the high efficiency of the SC STAP algorithm, both in hot clutter rejection and in cold clutter post processing.
Next Page
Doppler bin
Figure 17.33
Output Doppler spectra of CBF (broken line) and SC SAP (solid line) for skywaye data and 15 stochastic constraints (K = 15, K = 16)
Finally, real skywave and surface-wave OTHR sea-scattered data has been processed in order to show that a simple AR model of the cold clutter can be used locally over a small number of adjacent PRIs in forming stochastic constraints, despite the fact that real cold clutter is globally far from being properly described by this model. Thus, for typical HF OTHR applications the proposed SC STAP method is verified. The performance achieved comes at a considerable computation cost: each range bin should be processed by an individual STAP filter that should be updated each PRI, ideally. Obviously, each finger beam requires an individual set of SC STAP filters. Nevertheless, for HF OTHR with a modest number of range bins and a comparatively low bandwidth, this computational load is reasonable. Naturally, for airborne radar applications the trade offs and peculiarities of operational SC STAP may vary from those described in this section, specifically, non-adaptive cold clutter MTI is hardly possible with slow/fast time STAP in such radars; real airborne radar data is required to be analysed before specifying that type of operational approach, however we believe that appropriate solutions would exist for this and other similar applications.
17.3
SC STAP unsupervised training applications
In this section, we continue our study into the adaptive spatial and spatio-temporal mitigation of non-stationary interference in the field of high frequency (HF) overthe-horizon radar (OTHR). The previous section discussed the fundamental ideas behind the new stochastic constraints approach, which has been proposed to achieve effective hot clutter suppression while maintaining distortionless output cold clutter post processing
Previous Page
Doppler bin
Figure 17.33
Output Doppler spectra of CBF (broken line) and SC SAP (solid line) for skywaye data and 15 stochastic constraints (K = 15, K = 16)
Finally, real skywave and surface-wave OTHR sea-scattered data has been processed in order to show that a simple AR model of the cold clutter can be used locally over a small number of adjacent PRIs in forming stochastic constraints, despite the fact that real cold clutter is globally far from being properly described by this model. Thus, for typical HF OTHR applications the proposed SC STAP method is verified. The performance achieved comes at a considerable computation cost: each range bin should be processed by an individual STAP filter that should be updated each PRI, ideally. Obviously, each finger beam requires an individual set of SC STAP filters. Nevertheless, for HF OTHR with a modest number of range bins and a comparatively low bandwidth, this computational load is reasonable. Naturally, for airborne radar applications the trade offs and peculiarities of operational SC STAP may vary from those described in this section, specifically, non-adaptive cold clutter MTI is hardly possible with slow/fast time STAP in such radars; real airborne radar data is required to be analysed before specifying that type of operational approach, however we believe that appropriate solutions would exist for this and other similar applications.
17.3
SC STAP unsupervised training applications
In this section, we continue our study into the adaptive spatial and spatio-temporal mitigation of non-stationary interference in the field of high frequency (HF) overthe-horizon radar (OTHR). The previous section discussed the fundamental ideas behind the new stochastic constraints approach, which has been proposed to achieve effective hot clutter suppression while maintaining distortionless output cold clutter post processing
stationarity. The introduced operational implementation is based on the availability of range bins specifically free of cold clutter, a situation that is quite typical of pulse waveform (PW) OTHR. Meanwhile, even within the PW OTHR architecture, attempts to increase the radar duty cycle (i.e. the transmitted energy) usually lead to multifrequency operation with a transmission pause for each given frequency, limited to the operational range depth. For surface-wave (SW) OTHR, such a multifrequency mode of operation leaves little room for (sea) clutter-free ranges being available. For frequency modulated continuous waveform (FMCW) OTHR, the accessibility of such range bins, containing interference and noise signals only, is also problematic. Traditionally, FMCW systems operate with a linear frequency modulated (LFM) waveform and have receiving systems that employ a mixing of the received waveform with a time-delayed version of the transmitted waveform. This process is known as deramping and is followed by spectral analysis (weighted Fourier transform) within the comparatively narrow bandwidth of a lowpass filter at the output of the mixer. This bandwidth is usually adjusted to the operational range depth, so that the frequencies corresponding to the skip zone ranges are usually filtered out at this stage. Thus only operational range bins that contain hot clutter (interference), cold clutter and possible targets (unsupervised training samples) are available for any type of processing. Note that the simple solution of increasing the lowpass filter bandwidth has its limit because of the significant increase in range sidelobes, due to the frequency mismatch [41]. Unfortunately, these range sidelobes also smear the multimodal interference structure somewhat. Obviously, the type of operational routines discussed in Section 17.2 based on access to cold-clutter-free ranges are not applicable to these types of OTHR. In this section, we introduce and analyse operational stochastically constrained (SC) SAP and STAP routines that implement the same fundamental principle, but for unsupervised training. The basic idea of the first introduced approach is straightforward: we reject the cold clutter using a moving-target indicator (MTI), calculate the STAP weights from this data that will reject hot clutter but maintain cold clutter stationarity, then apply these weights to the original data, enabling the cancellation of cold clutter using standard Doppler processing. Obviously, hot clutter non-stationarity necessitates the MTI filter order selection to be minimal. For an interfering signal, the significant width of the blind zone in Doppler frequency is not an issue, but for target detection such a low-order MTI filter rejects many low-speed targets along with the cold clutter. For this reason, MTI cold clutter rejection is appropriate for hot clutter extraction, but is inappropriate for target detection. When the MTI filter can be designed in a non-adaptive fashion, the implementation of this basic idea is quite simple, and appears to be already discussed in classified papers [42^4]. Therefore, we concentrate on the efficiency analysis of this approach for typical HF OTHR scenarios. In the more general case when the cold clutter spectral properties are unknown a priori, implementation of the basic idea is less trivial, and the hot-clutter-only rejection filter should be properly retrieved from the associated three-dimensional STAR
17.3.1
Operational routine for unsupervised training
Our goal is now to construct a complementary operational routine that implements an approximated version of the generic solution for the unsupervised training scenario. We may formulate a similar approach, providing we can somehow obtain proper rejection of the hot clutter component in both sides of equation (17.54). Clearly, we must be prepared to sacrifice some performance compared with supervised training, due to the absence of samples containing only hot clutter and noise. 17.3.1.1 Known (local) cold clutter model In Section 17.2, we demonstrated using real OTHR sea clutter data that, despite the quite complicated nature of the sea clutter Doppler spectrum over the entire dwell, stabilisation by stochastic constraints can be quite effective, even if we adopt a simplified local low-order AR model (i.e. accurate only for a limited number of consecutive sweeps). In fact, it has long been known that a simplified one-stage MTI filter, for example, can effectively reject cold clutter even if this clutter is much more complicated than an ordinary Markov chain, when such a filter is truly optimal. According to the general philosophy discussed in Section 17.2, our goal is to apply the shortest possible MTI filter to reject the backscattered cold clutter first, in order to extract the hot clutter samples that can be later used for hot clutter covariance matrix sample averaging, similarly to equations (17.61) and (17.104). Here it is crucial to involve the minimum possible number of repetition periods, due to the non-stationarity of the hot clutter, which increases the signal subspace dimension of the averaged covariance matrix / ^ 1 . On the other hand, if the cold clutter residues are significant, the hot clutter covariance matrix estimate will be significantly compromised. Since the AR model equation (17.51) for SC purposes also needs to be of minimum possible order (/c), it could be used directly to design the MTI filter providing that the parameters bj (j = 1, . . . , K) and Pi are known a priori or estimated. Thus we assume that a (K + l)-variate slow time preprocessing filter: (17.105) would lead to effective cold clutter mitigation within the output snapshot x\v i.e.: (17.106)
(17.107) This gives rise to the following straightforward operational routine.
Algorithm 1 Step 1 Given the observed Af-variate snapshots x/a (k = 1, . . . , M; t = I, ...,T)9 form the stacked TVL-variate samples x^t (k = 1, .. .9M;t = I9 .. .,T — L + 1) using equation (17.19). Step 2 Preprocess the samples xja by the (/c +1)-variate slow time preprocessing filter with impulse response [ 1, b\, . . . , bK] to get the cold-clutter-free training samples x\t using equation (17.105). Step 3 Compute the sample estimate of the hot-clutter-plus-noise averaged covariance matrix: (17.108) Step 4 Construct the initial (k = K + 1) STAP filter: (17.109) Note that this filter is stochastically unconstrained and so is range-invariant Step 5 For the next adaptive filter H ^ 2 1 ( n o w range-dependent and stochastically constrained), apply sliding window averaging to the preprocessed input data JcJJ+2 i(17.110) in order to ensure hot clutter rejection over the repetition periods k = 2, . . . , /c + 2. Compute all subsequent STAP filters (k = K + 29 . . . , M) by: (17.111) where (17.112) and the system of stochastic constraints is: (17.113) where (17.114)
Note that the particular parameters used in Section 17.2.3.2 to simulate HF scattering from the sea K = 2,
bf} = -1.9359,
b(2} = 0.998,
P^ = 0.009675
(17.115)
mean that all three covariance matrices Rj* (j = 1,2,3) are almost equally weighted by \bj\2, and the cold clutter component in xKk+l t is rejected to the level of the weak temporally white noise. In Step 5, note that the K repetition periods k = 2, . . . , K + 1 are common in the construction of R^+1 and R™+2- ^ n e s v s t e m °fK stochastic constraints corresponding to equation (17.63) may then be written as: **+i,r*ft
=
w?+2,t*kt
for * = 2, . . . ,K + 1
(17.116)
It is worth emphasising that the hot clutter covariance matrix estimates are computed using the preprocessed samples x\t, but the stochastic constraints use the original snapshots i ^ . Thus the right-hand sides of these constraints consist of the cold clutter samples mainly due to the properties of the estimate equation (17.110), and condition equation (17.107) ensures that the filter wf+2t properly processes the cold clutter samples, since the hot clutter is to be rejected. Our final comment is a reminder that the matrix in the square brackets in equation (17.109) has dimension q x q, while that of equation (17.111) is (q + K) X (q + ic), since all STAP filters but the first have K additional data-dependent linear constraints. A comparison of this algorithm with its counterpart for supervised training, equations (17.60) to (17.63), reveals that the only significant difference between these two techniques is how we obtain the cold-clutter-free training samples to estimate the hot-clutter-plus-noise covariance matrix. For unsupervised training, we heavily rely on the ability of the *c-order slow time filter rejecting the cold clutter signal practically to the level of white noise. In this regard, we stress that such rejectability validates the use of a low-order AR cold clutter model. On the other hand, suitability of a low-order AR model to real cold clutter over a very limited number of repetition periods (i.e. locally) does not mean that the same model is appropriate for describing cold clutter over the entire CPI (i.e. globally). Therefore, the proposed preprocessing cannot be used as a moving target indicator (MTI) filter as such since, for low-speed targets, this processing is far from being globally optimal and leads to an unacceptably broad range of blind Doppler frequencies. It is thus essential that the proposed preprocessing is applied only to extract hot clutter that is uniformly distributed over the entire Doppler frequency band, while, for proper target detection, we retain the output scalar cold clutter signal as close as possible to its original form. It was demonstrated in Section 17.2, by processing of real skywave and surfacewave OTHR data, that the low-order AR model supports the stochastic constraints approach by preserving the global structure of the output scalar cold clutter signal, which is more complicated than suggested by the model. In Section 17.3.2, we demonstrate that the low-order AR model also provides effective real cold clutter rejection by the associated preprocessing filter. Nevertheless, none of the preprocessing schemes can entirely reject real cold clutter and so some cold clutter residues are always present
within the training sample xKkt. This means that additional losses are expected compared with supervised training, especially if these residues are above the white noise level. Another source of loss stems from a reduction in the training sample size: for supervised training we separately estimate each R^ using the set of hot clutter training samples within each repetition period Jc, while, for unsupervised training, we use only one set of (T — L + 1) ranges to estimate a single matrix that approximates the sum of all (K + 1) matrices Rf.
17.3.1.2 Known order of the (local) cold clutter model In some applications it is reasonable to admit that a preprocessing MTI-type filter can be defined based on some fairly standard a priori assumptions. In the field of SW OTHR, for example, such a local model might be inferred from an analytical description of the sea-echo Doppler spectra [45]. In general, however, our prior information is much more limited. To accommodate this, we specify only the AR model order K prior to data processing, whereas the actual parameters values bj are unknown. Therefore, both the cold clutter temporal properties, as well as the hot clutter spatial (for SAP) or fast time spatio-temporal (for STAP) properties, are to be estimated simultaneously using the unsupervised data x^t- It should be clear that this can be achieved only by incorporating slow time to deal with the cold clutter. The main idea behind the algorithm that we propose is again straightforward. For the properly chosen AR model of order /c, the appropriate NL(K + l)-variate three-dimensional STAP filter w^ should then effectively reject both hot and cold clutter. On the other hand, since the hot clutter signal is uncorrelated over adjacent sweeps, effective mitigation will occur only if each of the (K +1) fast time TVZ-variate two-dimensional STAP subfilters effectively rejects hot clutter. Thus, using the first M, components of the three-dimensional STAP filter, we may reject the hot clutter component, while the cold clutter contribution should remain unperturbed. Clearly we are decomposing the overall three-dimensional STAP filter, extracting the component responsible for hot clutter mitigation only. Let us introduce the NL(K H- l)-variate doubly stacked vector:
(17.117)
In the absence of a target signal, we may write: (17.118)
since the hot and cold clutter signals are mutually independent. Correspondingly:
(17.119)
since the hot clutter is uncorrelated over adjacent repetition periods. For the cold clutter component described by the model equation (17.52), we have: (17.120) where (17.121) rj (j = 0, . . . , K) are the interperiod (scalar) correlation lags, and Rc0 is defined in equations (17.4) and (17.27). In order to justify some of our assertions, let us temporarily simplify our problem. Suppose we ignore the non-stationarity of the hot clutter over the interval of (K + 1) adjacent repetition periods, and let us assume that the cold clutter spatial covariance matrix is diagonal, i.e. RQ = INL- (The angular distribution of cold clutter is always assumed to be sufficiently wide to exclude any possibility that pure SAP can solve the problem, as we discussed in Section 17.2.) Thus the covariance matrix RQ is always well conditioned, and so this second assumption does not change the essence of the problem. Under these two simplifying assumptions, we may present the overall NL(K + I)variate covariance matrix R^ in the form: (17.122) It is known [46, p. 268] that if A^ and uf (s = 1, . . . , m) are, respectively, the eigenvalues and eigenvectors of some m-variate matrix A, and Xf and uf(t= 1, . . . , « ) correspond to somerc-variatematrix B, then the eigenvectors uc- (j = 1, . . . , mn) of the matrix: C = Im®B + A®In
(17.123)
are given by: uj = uf^uf
f o r J = 1, . . . , m ;
f = 1, ...,n
(17.124)
and correspond to the eigenvalues of C equal to (A^ + kf). By this property, the NL(K + l)-variate vector wfk that minimises the overall hot and cold clutter power: (17.125)
is given by: (17.126) where «min{-} denotes the eigenvector corresponding to the minimum eigenvalue of the indicated matrix. We also have that the total output power is P%ut — P% + Pi. These considerations illustrate two important issues. First, when the specified order K is sufficient for effective cold clutter mitigation by the corresponding /c-variate MTI-type filter, augmented NL(K + l)-variate STAP can simultaneously reject hot and cold clutter. The hot clutter rejection can be achieved up to the spatial (or more generally, fast time spatio-temporal) optimum processing limit, while the cold clutter rejection can be achieved up to the temporal optimum processing limit, i.e. the simultaneous rejection is uncompromising in its effect on the other component. Second, each of the (K + 1) blocks of this filter effectively mitigates the hot clutter component, since Rf has a block-diagonal structure. Indeed, each of the iVL-variate components of the vector w'k in equation (17.126) is proportional to the minimum eigenvector
OfSf. Now suppose that the total sample size (T — L + 1) used for (loaded) sample averaging in the calculation of /?£ (see Section 17.2) is sufficiently large to essentially approach the efficiency of the true (deterministic) value for the STAP filter: (17.127) where 5(0o) = [sT (0o)\0\... | O ] r , then we expect the first TVL-variate piece of this augmented solution to provide effective fast time STAP rejection of the hot clutter for the &th repetition period. It was demonstrated in Section 17.2 that the minimal sample volume sufficient for 3 dB average losses compared with the ideal solution equation (17.126) double the signal subspace dimension (i.e. rank) of the noise-free covariance matrix. Since the covariance matrix rank is comparatively small, we may introduce the following operational routine.
Algorithm 2 Step 1 Given the observed N-variate snapshots xja (k = 1, . . . , M\ t = 1, . . . , T), form the stacked M,-variate samples xia (k = 1, . . . , M; t = 1, . . . , T — L + 1) using equation (17.19), and the doubly stacked NL(K + l)-variate samples xt (k = K+ 1, . . . , M ; t = 1, .. . , T - L + 1) using equation (17.117). Step 2 Compute the (loaded) sample covariance matrix estimates: (17.128)
for k = K + 1, . . . , 2/c + 1 and some small constant /3, then compute the averaged matrix: (17.129) Step 3 Let Ag(O0) = [ATq(6o)\0\... \O]T then define the initial (it = K + 1) stochastically unconstrained and range-independent TVL(K + l)-variate filter: (17.130) The first NL elements of this vector form the first fast time STAP filter H ^ 1
where ENL = [INL\O\...
v
i.e.:
\O]T.
Step 4 For the next fast time STAP filter H ^ 2 1 ( n o w range-dependent and stochastically constrained), compute the NL(K + l)-variate filter w™+21 ^y aPP^YmE sliding window averaging: (17.132) in order to ensure hot clutter rejection over the repetition periods k = 2, . . . , / c + 2. 2,
Compute all subsequent operational fast time STAP filters w™ (k = K + ...,M)by:
(17.133) (17.134) where: (17.135)
(17.136) and the system of stochastic constraints is: (17.137) where Z^ is defined by equation (17.114).
In Step 3, although the entire NL (K -f l)-variate filter w™+x1 would provide effective cancellation for both hot and cold clutter, we expect that constructing a fast time STAP vector H ^ 1 1 from its first NL elements will deliver equally effective cancellation of the hot clutter only, over the first (K + 1) repetition periods. This will be true provided that the number of degrees of freedom in the vector exceeds the rank of the averaged hot clutter covariance matrix R™+\, i.e.: NL » (K + l ) [ / ( g + L - I)]
(17.138)
or more precisely, taking into account the number of deterministic and stochastic constraints (see equation (17.64)): NL > (K + I)[Z(G + L - I)] + (q + ic)
(17.139)
In Step 4, for our standard example of K = 2, once again the initial filter w™ rejects hot clutter over the repetition periods k — 1,2,3, and the second filter w™ rejects over k = 2,3,4, thus ensuring that K covariances R* are common for each successive average Rf^. The system of K stochastic constraints that ensures the stationarity of the cold clutter component is: KHENL*kt
= w%Hxkt = ^ENJCK
= #%Hxh
for * = 2, . . . , K + 1 (17.140)
and in general IVg11Xk-JJt = #gHxk-U
for j = 1, . . . ,K
(17.141)
Thus both sides of these constraints will consist mainly of cold clutter, since the hot clutter component is rejected. As usual, the stochastic constraints approach forms the kernel of this algorithm. The main difference with all previous routines is the method of separating hot and cold clutter: here we embed the fast time STAP solution into the more general augmented three-dimensional STAP one to extract the desired solution as the component of the NL(K + l)-variate filter. It should be clear that the augmented STAP is only suitable for extracting the fast time STAP solution, and cannot be used by itself as a final operational routine for target detection. As before, this is because the very small number of slow time lags (K) involved in cold clutter mitigation leads to a wide range of blind Doppler frequencies, where any targets would be rejected along with the cold clutter.
17.3.2
Operational SC STAP algorithm: simulation and real data processing results
Naturally both of the proposed operational algorithms need to be verified. The most important question for Algorithm 1 is the relevance of the low-order AR cold clutter model, and this shall now be addressed by real data processing, since there is no point filtering simulated cold clutter of a known AR model. The other important question
on the convergence rate of the process to the true value as T -> oc for Algorithm 1 has been partly addressed in our previous papers on supervised training [14,28] with the established sample size T — 2(K + I)[J(Q + L — I)] necessary to guarantee average losses of 3 dB compared with the optimal solution. Thus, in order to justify the first operational routine, we have to demonstrate that via a low-order AR model we can construct a preprocessing MTI-type filter that can efficiently reject cold clutter. First, we make use of the same real SW OTHR data as in Section 17.2, where the CPI is 100 seconds. The solid line in Figure 17.34a shows the standard Doppler spectrum for one particular range bin; note that the subclutter visibility (SCV) is approximately 50 dB in this case. Forward and backward averaging has been used to define the 3 x 3 (K — 2) intersweep temporal covariance matrix M3. The preprocessing filter has been defined as:
(17.142)
in order to keep unchanged the white-noise output power (since HH^ 77 H = 1). The dotted line in Figure 17.34a illustrates the Doppler spectrum of the residues after preprocessing. We see that the noise floor obtained by this prefiltering is essentially
Doppler frequency, Hz
relative Doppler frequency
Figure 17.34
Noise-floor comparison in two sets of real surface-wave OTHR data
the same as in the initial spectrum. Moreover, the eigenvalues of the sample matrix R3: Xi = 3.42,
X2 = 0.0833,
X3 = 0.0020
(17.143)
suggest that the cold clutter could be rejected by about 27 dB compared to the input level. This number agrees with the SCV, taking into account the compression gain of M = 1000 repetition periods (~ 30 dB). Note that Figure 17.34a also confirms our expectation of the very wide blind Doppler bandwidth which makes this type of processing inappropriate for target detection. Figure 17.34b illustrates similar processing results of data obtained from DSTO's HF SWR facility located at Port Wakefield, South Australia. This radar employs a 16-element linear receiving array and operates over the frequency band 5-17MHz. The LFMCW waveform sweep rate and bandwidth are selectable over a wide range; typically the waveform repetition frequency is around 4 Hz and the bandwidth 50 kHz. Since the repetition period is almost three times longer than in the previous data, the energetic components of the sea clutter Doppler spectrum (solid line) obviously occupy a significantly wider range of relative Doppler frequencies. Consequently, the third-order preprocessing filter (K = 2) shown by the dotted line is not extremely effective, rejecting the most prominent Bragg lines barely to the noise floor level. The peak-to-noise ratio at the output of this preprocessing filter (followed by a standard weighted FFT) is about 3OdB, and the initial SCV is about 65 dB. Nevertheless, if we increase the order of the AR model to K = 3, then the corresponding fourth-order preprocessing filter (dashed line) rejects all input energetic cold clutter components far below the input white noise level. Thus for typical SW OTHR data, small-order AR models are proven to provide quite effective cold clutter suppression, in turn enabling effective hot-clutter-only sample extraction. Verification of the second operational routine is not so simple, but the following two important questions may be addressed by simulation studies. First, we need to demonstrate that for some standard hot and cold clutter models, hot clutter alone can be rejected by an JVL-variate component of the augmented NL (K + l)-variate optimal solution. This optimal solution is constructed from the exact covariance matrix, and is interpreted as the limit solution, when the number of training samples tends to infinity. Second, we need to demonstrate that the convergence rate with respect to T is sufficiently high, since the number of range bins available in most HF OTHR applications is usually limited. The following simulations are based on the simple scenario of pure SAP with an TV = 16 element antenna array and a single fluctuating interfering source. The generalised Watterson model described in Reference 27 and Section 17.2 has been used to simulate the spatial and temporal (Doppler) fluctuations of the interferes The spatial and temporal correlation coefficients have been chosen to reflect typical spatial fluctuations that make traditional mitigation techniques ineffective; in the notation of Section 17.2, these parameters are t}1 = 0.90 and pu = 0.89, respectively, with 5OdB HCNR and 0u = 59.1 degrees. The second-order AR model with M = 256
TV= 16,M= 256, T=Al, L= 1, p = 0.89, C = 0.9, HCCCR=0 dB, deterministic R
N= 16, M= 256, T=4\, L= 1, p = 0.89, C = 0.9, HCCCR=OdB
long X long C CBFX
repetition period
relative Doppler frequency N= 16,M=256, T=4\,L=\, p = 0.89, C = 0.9, HCCCR = OdB, stochastic R
long G
N= 16,M= 256, 7=41, L=I, p = 0.89, C = 0.9, HCCCR = OdB Op SC STAP CBFC Det SC STAP
short G
repetition period
Figure 17.35
relative Doppler frequency
Simulation results for a hot-clutter-to-cold-clutter ratio (HCCCR) of OdB a standard processing Doppler spectra b and c scalar filter output d Doppler spectra for proposed techniques
sweeps per dwell and the parameters of equation (17.115) have again been used to simulate HF scattering from the sea surface. Figure 17.35a shows the Doppler spectrum for the cold clutter, uncorrupted by any interfering signal, at the output of the conventional beamformer (curve labelled 'CBF C ) . We see that the SCV benchmark is approximately 80 dB, which is an upper bound for most practical situations. Also illustrated is the Doppler spectrum of the cold clutter signal at the output of the standard (unconstrained) optimal SAP/STAP filter: *W
= (^T^o)
(17.144)
(note that here Rgn = R8" and 5(0O) = s(Oo), since L = I) (the look direction is #Q = 0). Clearly the spatial non-stationarity of the interferer leads to a significant degradation in SCV (^—30 dB), due to the fluctuations of the spatial (antenna array) weight vector. For this simulation, the instantaneous (per sweep) hot-clutter-to-coldclutter ratio (HCCCR) per antenna array element was chosen to be 0 dB, so that the subclutter visibility for the conventional beamformer ('CBF X') is also about 50 dB. Therefore, in this particular case, as far as the final SCV is concerned, conventional hot clutter mitigation is as ineffective as no hot clutter mitigation at all.
Figure 17.35b shows the results of our analysis into the limit efficiency of the algorithm described by equations (17.133) to (17.137), calculated by replacing the sample matrix Rf by the true covariance matrix ^ , i.e. the sample volume is infinitely large. The curve labelled 'long X' presents the power of the total output signal for the 'augmented' NL(K + l)-variate STAP filter w™t\ 'long C shows the power of the cold clutter component at the output of this filter; 'long G' corresponds to the hot clutter power for the same filter, and 'short G' illustrates the hot clutter output power from the truncated 16-element SAP filter H^, simply formed from the first 16 elements of the 48-variate vector wfr We see that in the given scenario, the hot clutter component is rejected far below the level of the cold clutter residues at the output of the augmented STAP filter w™. Not surprisingly, the short filter w™ rejects the hot clutter slightly better than does the augmented filter due to the independence of the hot clutter over adjacent repetition periods. Thus the potential effectiveness of the proposed routine is extremely high, since the scalar output of the operational filter w™ consists almost entirely of cold clutter only. The most important remaining question deals with finite sample size. In this regard, Figure 17.35c differs from the previous Figure only by the use of the finite sample size 7 = 41. For this reason, the gap between the hot and cold clutter residues is smaller than in the previous case using deterministic covariance matrix calculations. Nevertheless, this gap is still large enough to guarantee that the hot clutter component at the output of the filter w^ will be negligible. Correspondingly, the Doppler spectrum of the total signal at the output of the operational filter wft (curve labelled 'Op SC STAP' in Figure 17.35d) is practically indistinguishable from the cold-clutter-only component Doppler spectrum at the output of the conventional beamformer ('CBF C ) , and from the spectrum of the signal at the output of the SC STAP filter computed with the true covariance matrix ('Det SC STAP'). Thus the ability of the operational routine to suppress fluctuating interfering signals and to retain the initial SCV (very high in this example) is verified in this case. In order to explore the limitations of the proposed technique within the framework of the adopted model, we conducted similar simulations for input HCCCRs equal to 1OdB, 2OdB and 3OdB (Figures 17.36 to 17.38 respectively). We see that only in the (worst) last case does the hot clutter output power approach the power of the cold clutter residues at the output of the augmented adaptive filter w™. Meanwhile, SCV at the output of the operational filter w™ is degraded up to the level of about 70 dB, compared with the initial value of about 80 dB. It is worth mentioning that even for this worst case (HCCCR = 3OdB), the potential efficiency of hot clutter mitigation is still extremely high. This means that with an appropriate sample volume, one can approach the initial SCV level. This fact underlines the necessity of a proper analytic study into convergence rate issues - this is the subject of the following section in this chapter.
N= 16, M=256, T=4l, L = 1, p = 0.89 C = 0.9, HCCCR= 1OdB
N= 16, M=256, 7 = 4 1 , I = 1, p = 0.89, C = 0.9, HCCCR= 1OdB, deterministic R long X long C CBFX
long G short G
CBFC
repetition period
relative Doppler frequency
N= 16,M= 256, 7 = 4 1 , I = l , p = 0.89, C = 0.9, HCCCR= 1OdB
N= 16, M= 256, F=41, I = 1, p = 0.89, C = 0.9, HCCCR= 1OdB, stochastic R long X long C
Op SC STAP CBFC Det SC STAP
long G short G
relative Doppler frequency
repetition period
Figure 17.36
Same simulation as in Figure 17.55 but for a hot-clutter-to-cold clutter ratio (HCCCR) of 1OdB
TV= 16, M= 256, 7=41, I = 1, p = 0.89, C = 0.9, HCCCR = 2OdB
N= 16, M=256, r = 4 1 , L= 1, p = 0.89, C=0.9, HCCCR=2OdB, deterministic R long X • long C
^STAP
long G short G
CBFC
relative Doppler frequency
repetition period
N= 16, M=256, 7 = 4 1 , I = 1, p = 0.89, C = 0.9, HCCCR = 2OdB, stochastic R
N= 16, M= 256, 7=41, I = I , p = 0.89, C = 0.9, HCCCR=2OdB long X long C
Op SC STAP CBFC Det SC STAP
long G short G
repetition period
Figure 17.37
relative Doppler frequency
Same simulation as in Figure 17.35 but for a hot-clutter-to-cold clutter ratio (HCCCR) of 2OdB
N= 16, M= 256, 7=41, L = 1, p = 0.89, C = 0.9, HCCCR= 3OdB
N= 16, M= 256, 7=41, L= 1, p = 0.89, C = 0.9, HCCCR = 3OdB, deterministic R long X long C
^BFX
^STAP
CBFC
relative Doppler frequency
long G short G
repetition period
N= 16,M=256, T=4\,L=l, p = 0.89, C = 0.9, HCCCR = 3OdB, stochastic R
N= 16, M=256, T=41, L = 1, p = 0.89, C = 0.9, HCCCR = 3OdB long X long C
Op SC STAP CBFC Det SC STAP
long G short G
repetition period
relative Doppler frequency
Figure 17.38
Same simulation as in Figure 17.35 but for a hot-clutter-to-cold clutter ratio (HCCCR) of 3OdB
Figure 17.39
Comparison of range cuts for range bin 12 a dotted line: CBF, solid line: unsupervised SC SAP (SNRI 15 dB) b dotted line: SAP (SNRI 4dB), solid line: supervised SC SAP (SNRI HdB)
Finally, we present some results of ComRad field trials to illustrate the efficiency of the supervised and unsupervised SC methods. Our results illustrate an example of low-power transmitter operation, where the last 30 or so range bins of the 80 available ranges have the sea clutter signal deeply submerged into the environmental noise. Therefore, we were able to use these last 30 ranges for supervised training, and compare the efficiency of supervised and unsupervised training SC techniques
Figure 17.40
Comparison of range cuts for range bin 19 a dotted line: CBF, solid line: unsupervised SC SAP (SNRI 13 dB) b dotted line: constant SAP (SNRI 3 dB), solid line: supervised SC SAP (SNRI 8 dB)
Figure 17.41 Comparison of range cuts for range bin 3 7 a dotted line: CBF, solid line: unsupervised SC SAP (SNRI 10 dB) b dotted line: SAP (SNRI 2 dB), solid line: unsupervised SC SAP (SNRI 8 dB) against the conventional beamformer (CBF) and standard SAP beamformer averaged over the entire dwell. For unsupervised training, we use the last 75 ranges, since the first five ranges are affected by the extremely strong direct-wave propagation. Algorithm 1 has been employed with the 30 range bins involved in cold clutter AR parameter estimation for the MTI filter design, averaging across all antenna array elements. Figures 17.39 to 17.41 present the detailed range profiles for the target range bin 19and37, and the target-free range 12. The total noise power across the frequency bins 32 to 224 has been calculated to compare the signal-to-noise ratio improvement (SNRI) with respect to the CBF, bearing in mind that in the look direction (ideal planar wavefront), all beamformers are normalised to the same gain. Interestingly, this analysis demonstrates that unsupervised training (10-15 dB SNRI) is slightly better than supervised training (8-13 dB SNRI) in this case. This could be explained
by an improved interference averaging with most of the available ranges involved (75 out of 80). Practically, though, this improvement is not indicated by the actual SNRIs for the reference targets. This discrepancy between the expected and actual SNRIs is once again explained by the fact that the MTI residues that contain the target components have been used for interference covariance matrix estimation. Since perfect antenna calibration is not attainable in practice, some degradation in SNRI is inevitable. This is a well known phenomenon, and a straightforward modification that excludes target-suspicious range bins from averaging provides one antidote. Nevertheless, both SC SAP options demonstrate the considerable improvement achievable in practice compared with CBF and averaged SAP in this environmental noise situation.
17.3.3
Summary
We have extended the domain of practical application of the stochastic constraints (SC) method to unsupervised training scenarios, typical of existing FMCW OTHR systems. Two operational routines have been proposed here. The first technique includes slow time preprocessing of the input data by the MTI-type filter that can suppress the cold clutter component far below the interfering signal level. We have demonstrated by real SW OTHR data processing that effective cold clutter rejection can be achieved by involving a very modest number of repetition periods in preprocessing. In fact, the order of the preprocessing filter is equal to the order of the AR cold clutter model, which in turn is equal to the number of stochastic constraints that secure the stationarity of the output cold clutter signal. It is highly significant that low-order AR models have now been proven to be adequate only for the local description of the cold clutter, i.e. over a very small number of consecutive repetition periods. Correspondingly, the preprocessing approach that we have introduced can only be used for the extraction of the hot clutter signal, and is completely inappropriate for target detection due to the unacceptably broad blind Doppler frequency bandwidth. Within this first approach, the AR model, and consequently the preprocessing filter, are assumed to be known a priori (or estimated). The second, more general, approach relies only upon the chosen order of preprocessing filter sufficient for cold clutter rejection, while the second-order moments for both hot and cold clutter are unknown. In this second method, the desired fast time SAP/STAP hot clutter rejection filter is defined as part of the augmented STAP filter that involves (K -f 1) consecutive repetition periods for additional cold clutter mitigation. More precisely, the M,-variate fast time operational STAP filter (or N-variate SAP filter) w™t is defined as the vector consisting of the first NL (or N) elements of the augmented NL(K -f l)-variate STAP filter (or the N(K + l)-variate SAP filter) # g \ Although this augmented filter w™t simultaneously provides both hot and cold clutter rejection, the short version w^ was demonstrated to reject effectively only the hot clutter component, far below the cold clutter signal level. The proposed stochastic
Next Page
constraints method ensures the stationarity of the scalar cold clutter signal at the output of the operational filter w™. Simulation results presented demonstrate that, for a typical scenario, the SC technique provides effective interferer mitigation, retaining the initial high level of subclutter visibility. These simulations involved a quite modest number of range resolution bins, typical of existing FMCW OTHR installations. The boundaries of usefulness of the SC method are explored in terms of subclutter visibility as a function of input hot to cold clutter ratio. Most of the efficiency analysis results presented here are obtained by direct simulations. Nevertheless, we have demonstrated by real SW OTHR data processing that unsupervised training scenarios typical of FMCW OTHR can be successfully treated in an operational mode using stochastic constraints principles. Naturally, the techniques introduced here could be useful for applications other than HF OTHR. The similarities between HF OTHR and airborne radar have already been discussed in Section 17.2.
17.4
SC STAP convergence analysis
17.4.1 Introduction In the two previous sections, we introduced the generic (non-operational) and several practical schemes that implement SC STAR The high efficiency of both hot clutter mitigation and the maintenance of subclutter visibility for cold clutter has been demonstrated by mathematical modelling and actual HF OTHR data processing. Moreover, simulations have demonstrated that there is a negligible difference between the generic solution and its operational variants. The goal of this section is to provide an analytic insight into the efficiency of hot clutter mitigation by SC SAP/STAR There are two main reasons for the stochastic behaviour of the loss factor for SC SAP/STAR The first is the traditional one caused by the finite accuracy of the sample hot clutter covariance matrix estimate. Over the relatively short interval of time, where the non-stationarity of the hot clutter could be ignored, this accuracy naturally depends on the number of hot clutter training samples (snapshots) involved in the covariance matrix estimation. In this regard, an accurate evaluation of the SC SAP/STAP convergence rate (the dependence of the hot clutter rejectability on the number of snapshots) is very important. Thus our first goal is to determine the (conditional) distribution for the signal-to-hot-clutter loss factor due to the finite sample size under the given set of (sample) stochastic constraints. More specifically, stochastic constraints are treated at this stage as fixed (deterministic) ones in order to define the conditional distribution. For SC SAP applications, where the differential time delay for multipath could be ignored, the hot clutter (interference) snapshots may be treated as independent identically distributed (HD) samples to simplify the analysis. In the more general case with significant multipath delay spread, these snapshots are not independent, which is important even for SC SAP analysis. Indeed, for STAP with the M,-variate stacked vector gkt, accurate statistical independence occurs only when such samples
Previous Page
constraints method ensures the stationarity of the scalar cold clutter signal at the output of the operational filter w™. Simulation results presented demonstrate that, for a typical scenario, the SC technique provides effective interferer mitigation, retaining the initial high level of subclutter visibility. These simulations involved a quite modest number of range resolution bins, typical of existing FMCW OTHR installations. The boundaries of usefulness of the SC method are explored in terms of subclutter visibility as a function of input hot to cold clutter ratio. Most of the efficiency analysis results presented here are obtained by direct simulations. Nevertheless, we have demonstrated by real SW OTHR data processing that unsupervised training scenarios typical of FMCW OTHR can be successfully treated in an operational mode using stochastic constraints principles. Naturally, the techniques introduced here could be useful for applications other than HF OTHR. The similarities between HF OTHR and airborne radar have already been discussed in Section 17.2.
17.4
SC STAP convergence analysis
17.4.1 Introduction In the two previous sections, we introduced the generic (non-operational) and several practical schemes that implement SC STAR The high efficiency of both hot clutter mitigation and the maintenance of subclutter visibility for cold clutter has been demonstrated by mathematical modelling and actual HF OTHR data processing. Moreover, simulations have demonstrated that there is a negligible difference between the generic solution and its operational variants. The goal of this section is to provide an analytic insight into the efficiency of hot clutter mitigation by SC SAP/STAR There are two main reasons for the stochastic behaviour of the loss factor for SC SAP/STAR The first is the traditional one caused by the finite accuracy of the sample hot clutter covariance matrix estimate. Over the relatively short interval of time, where the non-stationarity of the hot clutter could be ignored, this accuracy naturally depends on the number of hot clutter training samples (snapshots) involved in the covariance matrix estimation. In this regard, an accurate evaluation of the SC SAP/STAP convergence rate (the dependence of the hot clutter rejectability on the number of snapshots) is very important. Thus our first goal is to determine the (conditional) distribution for the signal-to-hot-clutter loss factor due to the finite sample size under the given set of (sample) stochastic constraints. More specifically, stochastic constraints are treated at this stage as fixed (deterministic) ones in order to define the conditional distribution. For SC SAP applications, where the differential time delay for multipath could be ignored, the hot clutter (interference) snapshots may be treated as independent identically distributed (HD) samples to simplify the analysis. In the more general case with significant multipath delay spread, these snapshots are not independent, which is important even for SC SAP analysis. Indeed, for STAP with the M,-variate stacked vector gkt, accurate statistical independence occurs only when such samples
are separated in range by more than (Q + L — 1) range bins, i.e.: t* - t > Q + L - l
(17.145)
For this reason, in any given sample gkt (t = 1, . . . , T), the number of truly independent samples is even fewer, and this emphasises the need for adaptive techniques that can operate efficiently with minimal sample support. Thus our first goal is to determine the statistical properties of hot clutter rejectability when a finite number of stochastic training samples is used to estimate the hot clutter covariance matrix. The second source of stochastic loss factor behaviour is the random nature of the input data-dependent linear constraints. Since hot clutter snapshots and cold clutter samples are statistically mutually independent, the problem of SC influence on the hot clutter rejectability can be treated separately. This analysis is the second goal of our study. In this section, we shall concentrate on the performance analysis of the generic solution equation (17.48), treating the results as performance bounds for the practical schemes. For the chosen hot clutter covariance matrix estimate /^ 1 , the generic adaptive SC STAP filter is defined as (equation (17.48)): (17.146) The conditional loss factor r]\ due to the random nature of /^*1 is therefore:
< 1
(17.147)
since the deterministic constraints Aq hold the useful target signal power (equation (17.39)): H#5(0b) = *?5(0b)
(17.148)
We see that the loss factor is conditioned by the particular stochastic constraints A** that are fixed for this analysis. Therefore we are especially interested in the second loss factor rj2 that evaluates the losses in hot clutter rejection caused by additional stochastic constraints compared with the deterministic constrained generic solution: (17.149)
This solution is the optimum one for hot clutter mitigation under the deterministic constraints Aq. Hence this loss factor is: (17.150) Since the hot clutter snapshots (gkt + /%) and the cold clutter snapshots cja are statistically independent, the overall loss factor is simply: Ti = mm
(17.151)
and its PDF could be analysed in two steps: (a) derivation of the conditional PDF for rj\, i.e. w(r)\\Akt)> and (b) derivation of the PDF for rj2 over the set of stochastic constraints A^, i.e. w(rj2). Obviously for any arbitrary set of deterministic constraints, the PDF w(rj\ \Aq) gives the final solution to the convergence rate analysis problem.
17.4.2
Conditional loss factor rj\ analysis: LSMI versus SMI for SC SAP
An accurate PDF w(r]\ \A/a) is available for the SC SAP algorithm under the condition of negligible delay spread from a scattering region (equation (17.5)). Under this condition, the set of AT-variate hot clutter snapshots (gkt -f /%) (t = 1, . . . , T) is statistically independent. The SMI algorithm uses the ML (unstructured) estimate of the spatial-only covariance matrix: (17.152) that has a complex Wishart distribution CW(T, TV, /?f). Appendix A (Section 17.4.2.1) derives the following (exact) PDF for the conditional loss factor rj\ in the SMI case:
(17.153) where q + K is the total number of linear constraints in A^. When q -f K — 1, the single constraint is the one that is imposed on a beam peak value, while the entire approach collapses to the standard SMI algorithm, studied by Mallet, Reed and Brennan [31]. We see that the conditional loss factor does not depend on the efficiency of the generic SC SAP algorithm, nor even on the nature of the constraints in A^. The existence of q + K linear constraints simply decreases the actual number of antenna degrees of freedom involved in adaptation. From the convergence rate point of view, the linearly constrained SMI algorithm is therefore similar to the generalised interference cancellation algorithm, with some prebeamforming that directly reduces the number of spatial channels involved in adaptive processing [47, pp. 162-192].
The derivation in Appendix A (Section 17.4.2.1) was first published in Reference 48, and was recently presented in Reference 49. The standard ML covariance matrix estimate /^f1 obviously requires at least N samples, while 3 dB conditional losses are achieved with the number of snapshots: T3>2(N-q + K)
(17.154)
When the interferer-to-noise ratio is sufficiently high, and the number of interfering sources impinging on the beampattern sidelobes are significantly less than the number of adaptive channels in the antenna array (J <£ N)9 then a significantly faster convergence rate than this could be achieved. One of the earliest techniques in this category is the loaded sample matrix inversion (LSMI) algorithm, proposed in Reference 32 for convergence rate enhancement, and analysed in Reference 34. In what follows, we generalise this analysis, initially for the SC SAP algorithm. In the LSMI approach, the loaded (regularised) estimate of the covariance matrix is defined as (equation (17.98)): (17.155) First, we analyse the losses caused by the loading itself, when T —>• oo and the statistical losses can be ignored. Thus we are here interested in the loss factor:
(17.156) where the indices in A^ and F^ are omitted for simplicity. We shall assume that: a
The hot clutter power significantly exceeds the additive white-noise power; more specifically, that: (17.157)
b
where Us is the NxJ matrix of signal subspace eigenvectors, /X2A = diag [Ai, . . . , Xj] is the 7-variate diagonal matrix of signal subspace eigenvalues, Xj ^> 1 and Un is the N x (N — J) matrix of noise subspace eigenvectors. The overall number of linear constraints is less than the dimension of the noise subspace: q + K
c
(17.158)
The spatial correlation between the interferer steering vectors and the cold clutter and array-signal manifold is comparatively low (i.e. sidelobe interfering, typical for SAP applications): (17.159)
Under these quite reasonable assumptions, Appendix B (Section 17.4.2.2) calculates the loss factor to be:
(17.160) Since 1 <£2«/x2
(17.161)
the losses caused by loading are negligible in this case, and hence for both the optimal and loaded filters, the optimal SNR is: (17.162) with ?7o — 1 • Given the same assumptions, the stochastic loss factor due to the finite sample volume T > J is:
(17.163) where (flfp" 1 = 08/ + R/T)~l. The approximate PDF is derived in Appendix C (Section 17.4.2.3), and agrees with that of Cheremisin [34]: (17.164) We see that, contrary to the standard SMI algorithm, additional linear constraints do not affect the convergence rate which is only a function of sample volume T and number of interferers / , and is independent of the antenna array dimension N. The average 3 dB losses are: T3 = 2J 17.4.2.1
(17.165)
Appendix A to Section 17.4.2 JL ^
I
The sample covariance matrix Rk is statistically equivalent to the matrix R£ TRj-, where T ~ CW(T, N, IN)9 hence the loss factor of equation (17.147) is now: (17.166) where we have introduced the N x (q + K) matrix: (17.167) and the N-variate vector: (17.168)
Note from equation (17.167) that A77A = Iq+K, hence we may present the matrix A as a product: A = CT
(17.169)
where C is an jV-variate unitary matrix, and F is a N x (q + /c) matrix comprising an upper (q + K) X (q + A:) identity matrix block: (17.170) Since CHC = CCH = IN and E{CTCH} = TIN, the loss factor is now: (17.171) where Ti = CHTC. Next we decompose the matrices Ti and T^ into the blocks: (17.172) according to the dimensions: (17.173) so that (17.174)
(17.175) But now f12 = -T 1 1 Ti 2 T^ 2 1 andf 2 1 = -T^ 2 1 T 2 IT 1 1 , hence: (17.176)
Since Ti ~ CW(T9 N, I N ) , according to the complex variable generalisation of theorem 2.4.1 in Reference 50, presented in Reference 48, we may state that: ^ _I
• •
the matrices (Ti 2 T 22 2 ) and T 22 are independent %2~CW(T,N-q + K9IN-q+K)
•
(T 1 2 T"/) - CAf(O, IN-q+K O Iq+K).
Let us now denote:
and (17.177) where C\ is a (q + A:)-variate orthonormal matrix. Since CfC\ = C\C^ = Iq+K, the random matrix: (17.178) has the same complex Gaussian distribution as does the matrix Xq+K. With the definition y = Xq+Ke, we obtain: (17.179) Now applying the complex variable generalisation of theorem 2.4.8 in Reference 50, presented in Reference 48, we conclude that the random variable rj = yHf^Y has the generalised complex F-distribution:
(17.180) Substitution finally gives us the desired result:
(17.181) with 0 <: ?7i < 1. Note that q + /c > 1: there is always at least one constraint (in the beampattern in the steering direction). For a single SC q + K = 1, this result reduces to the classical result of Mallet, Reed and Brennan [31]. 17.4.2.2
Appendix B to Section 17.4.2
According to equation (17.157), we may express: (17.182) (17.183)
The term (1 H- P)
x
in the latter could be neglected, thus:
(17.184) (17.185) By assumptions a and b, we may assume that: (17.186) and therefore employ the first-order expansion:
i~
•
i—
(17.187) and ignore second-order terms 0([(1 -f P)/(P + M2)]2) in the denominator of equation ((17.156)) to obtain:
(17.188) where B = A%UnU** Akt, hence: (17.189) We may similarly apply a first-order expansion to the numerator of equation ((17.156)) to obtain:
(17.190)
and therefore:
(17.191)
(17.192)
(17.193) Obviously for /3 = 0, r]o = 1, and for 1 < fi2 <£ /z2, under the stated assumptions, rjo 2^ 1. 17.4.2.3
Appendix C to Section 17.4.2
(17.194) and (17.195) where (17.196) therefore, we may present: (17.197) where X ~ CJV(O, I^ Q Ij), i.e. X is a (N x T) data matrix consisting of HD Gaussian random numbers. With these notations, we may present the loaded sample matrix as:
(17.198)
Here we have assumed that the loading factor is properly selected above the internal noise level, so that: (17.199) Since:
(17.200) we can present the loaded sample matrix inverse as the first-order expansion:
(17.201) At the same time:
(17.202) Since
T/3/JJL2
<£ 1, we may apply first-order expansion again:
(17.203) and by substituting this into equation (17.201), we obtain:
(17.204) Therefore, we can define the following approximation for J\:
(17.205) Using the introduced approximation for (/*/# + jRk), let us define products in the rj\ loss factor expression equation ((17.163)):
(17.206) Now
(17.207)
(17.208) Similarly, we may ignore O(l//z) terms in our expansion for: (17.209) and therefore expression equation (17.147) for the loss factor r\ \ could be rewritten as: (17.210) where we have introduced the (N-
/)-variate vector: (17.211)
We also introduce a uniform transformation: (17.212) so that X\ and X are identically distributed, and: (17.213) Since T > J, i.e. the number of training samples always exceeds the number of interfering sources, we have C22 > 0. Correspondingly: (17.214)
Now we may apply the generalisation of theorem 2.4.1 in References 50 and 48, according to which the matrices C22 and C^2 C21 are mutually independent, with the matrix C22 having the complex Wishart distribution CW(T, J, Ij) and the matrix C2IC22 having J (N — /)-variate HD vectors with the complex Gaussian distribution CAf(O, IN-J). We now introduce the normalised vector: (17.215) i.e. U\ is a (N — J) x (N — J) unitary matrix with the vector h/(hHh) column. Since: UxU? = Uf Ux = IN-j
as the first (17.216)
the matrix C22 C2\U\ has a distribution identical to the matrix C 22 C21 and therefore, also, consists of J HD vectors with distribution CAf(O, IN-J)- Let: P1 E= C221C21U1C - CAT(O,//)
(17.217)
then we have: (17.218) By virtue of the generalisation of theorem 2.4.8 in References 50 and 48, the product: vf C2^1Vi =E h
(17.219)
has the generalised complex F-distribution: GCF(J,T+
1 -J)
= const \h\J~l\l
+ h\~{T+l)
(17.220)
By substitution of variables, we obtain: w(m) - const (1 - ^ i ) 7 " 1 ^ " 7
(17.221)
where (17.222) and (17.223) Finally we obtain: (17.224)
17.4.3 Conditional loss factor rj\ analysis: LSMlfor SC STAP When the loaded SMI technique is used in SC STAP implementation, the M,-variate hot clutter covariance matrix estimate is constructed from hot-clutter-plus-noise training samples (gkt + /%): (17.225) where gkt is the hot clutter snapshot and /% is the external noise snapshot. For a properly chosen loading factor /3, we can assume that: (17.226) and therefore: (17.227) According to equations (17.10) and (17.11): (17.228) We assume that the number of taps is properly chosen: (17.229) hence the rank of the NL x J(Q -\- L — 1) matrix H^ is: (17.230) For simplicity, let us assume the worst case r = J(Q +L —I), then using the singular value decomposition we may write: (17.231) where U\ is the J(Q -f- L — l)-variate unitary matrix of eigenvectors of the product H^1HkZ, A 2 is the diagonal matrix comprising the roots of the eigenvalues of the same product and JJ8 is the NL x J(Q + L — 1) matrix comprising the maximum J(Q + L — 1) eigenvalues of the product HkiHu. Obviously U8 may be treated as signal subspace eigenvectors for the matrix Rf1. We now have:
(17.232)
where G (17.233) Unfortunately, the vectors y are no longer independent; indeed, by equation (17.10):
(17.234)
where
(17.235)
i s a / ( Q + L — l)-variate shift matrix. The TVL-variate white-noise vector j % has a similar property:
(17.236)
This time correlation of the snapshots makes the only important distinction between the SAP and STAP cases. Note that:
and (17.237) which means that we may introduce the transformed data matrix Gi = U^ G, whose column vectors have the same (individual) properties as those of G. (Naturally, the intersample correlation equation (17.236) is affected.) Taking this into account, we may now write equation (17.232) in the form:
(17.238) Again we present A in the form: (17.239)
with jj? ^> 1 and XJ(Q+L-i)
^> 1. A first-order expansion for I RfI J
gives us:
(17.240) The analogy with the pure spatial case (Appendix C, Section 17.4.2.3) is quite clear here, specifically due to equation (17.201). Similarly to the derivation in Appendix C, we find:
(17.241) and
(17.242) while the loss factor is: (17.243) where (17.244) Since the training samples are temporally correlated, we cannot directly apply Wishart distribution properties; nevertheless, since E{G\Gf} = 77/(£>+L-1) similarly to a true Wishart distribution, we propose the following conjecture. Conjecture: the moments of the loss factor r)\ in equation (17.147) for the LSMI algorithm are approximately defined by the distribution: (17.245) Simulation results presented in Section 17.2.3.5 prove this conjecture to a high accuracy. Even if the true moments are slightly different, equations (17.241) and (17.243)
still demonstrate that the minimum number of hot clutter snapshots required for successful hot clutter rejection does not depend on the antenna dimension N, but depends on the product J(Q -f L — 1). For such a critically small sample volume, the linear independence of the training data is sufficient to guarantee effective rejection of strong hot clutter.
17.4 A
Conditional loss factor r\2 analysis: exact PDFfor a single stochastic constraint
When the cold clutter could be described locally by a first-order AR (Markov) model, a single stochastic constraint is sufficient for output cold clutter spectrum stabilisation. In this case, we are able to derive the exact PDF. According to equation (17.42), we have here: (17.246) The loss factor is given by equation (17.150), and hence we have: (17.247) Introducing the notation: (17.248) we may write: (17.249) (17.250)
(17.251) With the notation: (17.252) so that (17.253)
together with (17.254) so that (17.255) (17.256) and finally (17.257) we can express the loss factor as: (17.258) Note that this expression is valid for multiple stochastic constraints. For a single SC, X = g is a M,-variate vector, and so the loss factor is: (17.259) Evidently Prob{^2 < c} for c < 1 is equivalent to: (17.260) According to equation (17.257): (17.261) where V is a A^L-variate matrix; ^ 0 i s a NL-variate vector of Gaussian independent variables: E{gog"} = INL
(17.262)
and R \ is the TVL-variate spatio-temporal covariance matrix of cold clutter snapshots. Since cold clutter samples in adjacent range cells are independent:
^ = diag[4, ...,4]
(17.263)
where RQ is the TV-variate spatial covariance matrix of cold clutter snapshots (see equation (17.52)). Evidently the matrix V is rank-deficient, since: V(i^)55(0o) = WQ RlY = 0
(17.264)
This property is essential for studying the eigendecomposition of the kernel in equation (17.260): (17.265) Since V is rank-deficient, at least one eigenvalue of P(c) is zero. Further, we will now show that only one eigenvalue of D(c) is positive. Let u\ be an eigenvector of D(c) that corresponds to a positive eigenvalue X: B(c)u\ = k u u
k > 0
(17.266)
so that: (17.267) Since 1 - c > 0 and VH(I - UiUf)V > 0, the matrix [ A / + (1 - c)VH x (I - UiUf)V] is PD for any X > 0. Next we have to show that there could not exist another eigenvector «2 that corresponds to the same or a different positive eigenvalue. First, observe that for any X > 0: (17.268) i.e. the eigenvector could not be orthogonal to the vector Vf; this follows from equation (17.267). Second, since wf «2 = 0? we have:
(17.269) Let us introduce the eigendecomposition: (17.270) with A > 0. The product in equation (17.269) is now: (17.271) since (17.272) Now that equation (17.271) contradicts n f «2 = 0, we conclude that there is only a single positive eigenvalue of B(c); hence it may be decomposed as: (17.273)
Since the unitary transformation U(c)g0 does not change the properties of g, the integral distribution of the loss factor, Prob{^2 < c}, is: (17.274) where all elements of g are independent variables with complex Gaussian distribution CAf(0,1). Thus IXj12 (j = 1, . . . ,NL) are independent x2-distributed variables with the characteristic function: (17.275) hence the characteristic functions for the two terms in equation (17.274) are: (17.276) For distinct non-zero eigenvalues Xj(c), we have: (17.277) where the coefficients Cj are obtained using the identity: (17.278) namely (17.279)
For v = (\/iXj(c)), we obtain: (17.280)
(17.281)
Finally: (17.282)
Having derived the two characteristic functions 0\(v) and #2(i>), we can now determine the corresponding PDFs:
(17.283)
According to equation (17.274): (17.284) Now: (17.285) and since
(17.286) we finally obtain: (17.287)
Note the extreme cases: c = 0:
X+(C) = O, Prob{y 2 <0} = 0
(17.288)
c=l:
k+(c) = 1 [A.3(c) =
(17.289)
17.4.5
INL(C)
= O]9 Prob{y2 < 1} = 1
Conditional loss factor r]2 analysis: approximate PDF for multiple stochastic constraints
Let us reintroduce the expression equation (17.258) for the loss factor r\2 using the notation (c.f. equation (17.252)):
/ f = eTq[A^ C^rxAqp
(17.290)
and U = [Ui U 2 ],
U 2 G CNLx(NL~q)
(17.291)
where U\ is defined in equation (17.254), and U2 is a complementary orthogonal matrix such that: (17.292) This converts equation (17.258) into: (17.293) where X is the NL x K matrix: (17.294) According to equations (17.41) and (17.42): (17.295) (17.296) Note that according to our model equation (17.52): (17.297) where RQ is defined in equation (17.52), and Rt is the /c x K temporal covariance matrix of cold clutter snapshots C^: X0 - CAT(O, INL O IK)
(17.298)
and Xo is an NL x k matrix of independent complex Gaussian variables. Substitution of equation (17.298) into equation (17.293) already leads to an important conclusion: under the scalar autoregressive (AR) cold clutter model equation (17.51), the PDF for the conditional loss factor 772 does not depend on the temporal correlation properties of cold clutter. Let us apply a singular value decomposition to the matrix V: (17.299) with (17.300) (17.301) Since uniform transformation U^XQ retains the original properties (equation (17.298)) of the data matrix XQ, i.e. £/3X0 is statistically equivalent to XQ, we can rewrite
equation (17.293) as: (17.302)
where U[ = UJfUuW2 = JTfU2. Unfortunately, further attempts to derive the accurate PDF for the loss factor rji lead to intractable results. A lower bound for t]2 could be therefore obtained, if we ignore A in equation (17.302). Under this simplifying approximation: (17.303) where (17.304) Now using the standard transformation: (17.305)
where Uq is a [q x q] unitary matrix with the first column equal tofq/Jfqfq9 finally get:
we
(17.306) where (17.307) according to the generalisation of theorem 2.4.8 in References 50 and 48. Based on this property, approximate /3 distribution for 772 could be presented as: (17.308) In order to validate this approximation, simulations described below have been conducted for SAP scenario (L = 1). The following calculations demonstrate the accuracy of this derivation for 772. The simulated receiving system consists of a uniform linear array (ULA) with TV = 16 elements. The generalised Watterson model has been adopted to reflect the spatial fluctuations of the ionospherically propagated interference signal. Three independent interferer modes I = 3 have been assumed with the following description
(see equations (17.67) and (17.68)): (17.309) with 0 < t < T. Here Sg = diag[5"i(6^), ... ,SN(OI)], being the diagonal matrix formed by the £th mode steering vector corresponding to its (mean) direction of arrival Gi; in the ULA case:
(17.310) Aw£ is the regular component of the ionospheric Doppler shift for the €th mode, (t — I) is the corresponding group delay in Nyquist rate units, at is the RMS power of the €th mode, n^ is additive white noise of power P% and vf is the N-variate random process, modelled by the two-dimensional (spatio-temporal) Markov chain: (17.311) (17.312) (17.313) where (d/+i — dj) is the interelement distance (in half-wavelength units), and jk,t-e is the transmitted interference waveform, modelled for radar applications by scalar Gaussian band-limited noise. Since for SAP applications, ionospheric Doppler shifts and group delays that exceed the Nyquist rate of the interfering signal are not important, these parameters have not been specified. For simplicity, we take the interferer mode parameters to be equal, with aj/P^ = 30 dB, I = 1,2,3, while DoAs are fixed at 0 = (0.06,0.10, 0.14)TT. For the clutter process c^, two different models have been treated. The first model adopts the first-order Markov chain equation (17.78). According to equation (17.54), this model involves one stochastic constraint to protect subclutter visibility, so the exact analysis for the loss factor 772 may be applied. Our second clutter model (equation (17.79)) deals with the second-order AR model and therefore may be analysed by the approximate expression of equation (17.308). Simulation parameters for the clutter models are as usual chosen to reflect HF terrain scattering b\ = 0.999, and a sea-surface scattering b\ = -1.9359, b{2] = 0.998, P^ = 0.009675. The spatial properties of the clutter are defined by the transmitter antenna pattern equation (17.80) with scalar innovation equation (17.81). To reflect the typical OTHR scenario, we use the following parameters: (17.314)
In accordance with our adopted model equations (17.311) and (17.312), the interference-only covariance matrix is of fixed rank £ = 3 for any given repetition period t since:
(17.315) fori= 1, ...,512. We start with the first-order model of equation (17.78), where we may compare the explicit PDF of equation (17.287) with its approximation of equation (17.308) and its stochastically sampled distribution. We may first fix the interference covariance matrix at some Ro, then compute conditional distributions over the set of stochastic constraints. Curve 7 of Figure 17.42a shows the stochastically sampled probability function Prob{^2 < c; Rk = Ro] over 106 simulated trials for the clutter sample cfa, adopted as the single stochastic constraint. This coincides very well with the explicit PDF (curve 8). Curve 9 shows the approximation for the single constraint Prob{?72 < c) = cN~l and as expected slightly overestimates the actual loss
probability
two constraints
Af= 16, SNR =-3OdB, Ie+ 06 trials
Figure 17.42
Comparison of CDFs a one constraint b two constraints
curves 10 & 11 curves 7 & 8
one constraint
probability
N= 16, SNR = -3OdB, Ie+ 06 trials
factor. Note that the unconditional distribution averaged over the set of all covariance matrices produced by the model of equation (17.315), both for the explicit analytic conditional distribution (curve 10) and the empirical distribution (curve 11), are practically identical and again are overestimated by the approximation of curve 9. Figure 17.42b repeats this experiment for the second-order clutter model, but without explicit expressions. Curve 12 illustrates a particular Ro-conditional empirical distribution and is compared with its approximation (curve 13) described by equation (17.308) for K = 2: Prob{772 < c; Rk = Ro) = c N ~ 2 [ ( N - I ) - ( N -
2)c]
(17.316)
Curve 14 shows the unconditional empirical distribution obtained for the set Rk, generated by the model of equation (17.315). We observe good accuracy for our approximation of equation (17.308). Using the stochastic constraints, we have clearly obtained extremely low degradation in interference rejectability. Results of numerical simulations and computations for situations that reflect the most typical features of the HF radar interference environment have demonstrated the accuracy of these analytical results.
17.5
List of variables
General rules: • • • • • •
boldface lowercase letters denote column vectors uppercase letters denote scalars or matrices uppercase blackboard letters denote matrices of an interim nature • estimated version of D D stacked version of D D doubly stacked version of D
•
D vector or matrix comprising the first TVL rows of D
Wxk space of real j x k matrices &xk space of complex j x k matrices Wxk space of Hermitian j x k matrices In e1Znxn rc-variate identity matrix 8jk Kronecker delta function en e 1Znxl = [1,0, . . . , 0 ] r rc-variate unit vector E {•} expectation operator A, eigenvalue A diagonal matrix of eigenvalues u eigenvector U matrix of eigenvectors fpR pulse repetition frequency (pulses per second) fyy Nyquist rate at which receiver outputs are sampled (samples per second) N number of antenna array sensors
M
number of transmitted pulses or sweeps (number of repetition periods) k = 1 , . . . , M slow time index variable (sweep index variable) T number of samples of interest per pulse repetition interval (PRI) (number of range bins) t = 1,..., T fast time index variable (range bin index variable) L number of fast time taps / number of external interference (hot clutter) signals (index variable p = 1 , . . . , J) Q number of propagation paths for the interferers (index variable t = 1 , . . . , Q) q number of linear constraints imposed on the spatio-temporal (two-dimensional) stochastic constraints (SC) STAP filter K number of adjacent PRIs involved in averaging W order of the generalised Watterson model for ionospherically propagated hot clutter #0 target signal direction of arrival (DoA) s(#o) G CNxl array-signal manifold (steering) vector for the look direction Oo Xkt e CNxl observed antenna array snapshot at the kth sweep and the tth range bin sfa £ CNxl target signal snapshot at the kth sweep and the tth range bin cfa G CNx l cold clutter snapshot at the kth sweep and the tth range bin gfa € CN x 1 hot clutter snapshot at the kth sweep and the tth range bin nfa e CNxl additive white noise snapshot at the &th sweep and the tth range bin P^ power of the additive white noise n jfa
GC
Pl
complex waveform representing the pth interferer at the &th sweep and therthrange bin power of the pth interferer (hot clutter) signal
a p
Hki £ CNxJ jfa € CJxX r]o e IZ Tj \ e IZ r]2 G IZ
instantaneous total impulse response at the £th sweep and the £th interferer propagation path complex waveform vector representing all interference signals at the kth sweep and the tth range bin loss factor in the signal-to-hot clutter-plus-noise ratio due to covariance matrix loading loss factor in the signal-to-hot clutter-plus-noise ratio due to finite sample volume loss factor in the signal-to-hot clutter-plus-noise ratio due to the random nature of the data-dependent (stochastic) constraints (SCs)
kkt e CNLxl Skt £ CNL x 1 ckt € CNLx * gto £ CNLx * wto eCNLxl Hu £ cNLxJ^Q+L~x^ jto £ C J ( ^ + L ~ 1 ) x l Zto £ C
sto e C
yto e C
Xto € C w e CNxl w eCNLxl n>o G CNLxX w^ eCNLxl
w G C^(K+1)*1 £
e CNLXi
Rk, Rf1 e HNxN Rck e JiNxN R^ G 7-iNxN
L-stacked antenna array snapshot at the kth sweep and the rth range bin L -stacked target signal snapshot at the kth sweep and the rth range bin L-stacked cold clutter snapshot at the kth sweep and the tth range bin L-stacked hot clutter snapshot at the kth sweep and the tth range bin L-stacked SC STAP filter at the kth sweep and the tth range bin L-stacked instantaneous total impulse response matrix at the kth sweep and the tth interferer propagation path L-stacked interferer waveform vector at the A:th sweep and the tth range bin output from the L-stacked SC STAP filter applied to the L-stacked array snapshot at the kth sweep and the tth range bin output from the L-stacked SC STAP filter applied to the L-stacked target signal at the kth sweep and the rth range bin output from the L-stacked SC STAP filter applied to the L-stacked cold clutter at the &th sweep and the tth range bin output from the L-stacked SC STAP filter applied to the L-stacked hot clutter at the kth sweep and the rth range bin spatial (one-dimensional) SAP filter spatio-temporal (two-dimensional) SC STAP filter any given constant L-stacked spatio-temporal (two-dimensional) SC STAP filter stochastically unconstrained STAP filter (hot-clutter-plus-noise only) at the kth sweep and the rth range bin slow time fast time (three-dimensional) SC STAP filter f a s t t i m e s c S T A p fi|ter responsible for hot clutter mitigation hot-clutter-plus-noise spatial (one-dimensional) covariance matrix at the kth sweep cold clutter intersweep spatial (one-dimensional) covariance matrix for the slow time shift k cold clutter innovative noise spatial (one-dimensional) covariance matrix of the stationary autoregressive (AR) model
Rk9 ^f1 € 7iNLxNL R™ G 1-iNLxNL
R^ G f{NLxNL
Rck G J~lNLxNL
R^ G 1-{NLXNL
Rk, £ f e
UNL{K+1)XNL{K+X)
%ck G ^L(K+1)XM,(K+1) Rf G TiNL(K+\)xNL(K+\) A q G CNLxq Aq G £NL(K+i)xg
Akt G C^ x 0+4)
Akt £ CNLx(K+q^
Axkt G CNLx<"K+q)
^tt
e
CNL(K+l^x(K+q>}
1 * G CNLx{K+q) K
hot-clutter-plus-noise spatio-temporal (two-dimensional) covariance matrix at the &th sweep moving-window average of hot-clutter-plus-noise spatio-temporal (two-dimensional) covariance matrix at the &th sweep noise-free hot clutter spatio-temporal (two-dimensional) covariance matrix at the kth sweep cold clutter intersweep spatio-temporal (two-dimensional) covariance matrix for the slow time shift k cold clutter innovative noise spatio-temporal (two-dimensional) covariance matrix of the stationary AR model hot-clutter-plus-noise slow time fast time (three-dimensional) covariance matrix at the kth sweep c o l d d u t t e r g l o w t i m e fest t i m e (three-dimensional) covariance matrix for the slow time shift k observed slow time fast time (three-dimensional) covariance matrix at the kth sweep general deterministic linear constraints matrix for designing the SC STAP filter general deterministic linear constraints matrix for designing the slow time fast time SC STAP (three-dimensional) filter linear stochastic constraints matrix for designing the generic SC SAP (one-dimensional) filter at the &th sweep and the tth range bin L-stacked linear stochastic constraints matrix at the &th sweep and the tth range bin for designing the generic SC STAP (two-dimensional) filter linear stochastic constraints matrix at the &th sweep and the zth range bin for designing the operational SC STAP (two-dimensional) filter linear stochastic constraints matrix at the &th sweep and the tth range bin for designing the operational slow time fast time SC STAP (three-dimensional) filter building block of A* order of the cold clutter AR model, also equal to the number of stochastic constraints (SCs)
Ckt eCNLxK Bj G CNxN I;to G CN x 1 Pi Bj G CNLxNL \fa G CNLx l ykt
GC
5(0o) £ CNLx 1 bj G C Yj G C p p l G IZ t;pl G IZ r^ G IZ
stochastic constraints matrix for SC STAP at the &th sweep and the tth range bin cold clutter AR model parameters, equal to the Yule-Walker solutions (j = 1,...,Ac) innovative noise component of the cold clutter AR model at the &th sweep and the tth range bin power of the innovative noise in the (simplified scalar) cold clutter AR model L-blocked cold clutter AR model parameters L-stacked innovative noise in AR model at the kth sweep and the tth range bin cold clutter scalar output for the filter n>o at the kth sweep and the tth range bin L-stacked steering vector for the look direction OQ cold clutter (simplified scalar) AR model parameters correlation coefficients of the (simplified scalar) cold clutter process temporal correlation coefficient in the hot clutter model for the /?th interferer and the £th interferer propagation path spatial correlation coefficient in the hot clutter model for the pth interferer and the £th interferer propagation path spatial correlation coefficient in the cold clutter model
References 1 ABRAMOVICH, Y. L, SPENCER, N. K., ANDERSON, S. J., and GOROKHOV, A. Y: 'Stochastic-constraints method in nonstationary hot-clutter cancellation Part I: fundamentals and supervised training applications', IEEE Trans. Aerosp. Electron. SysL, 1998, 34, (4), pp. 1271-1292 2 ABRAMOVICH, Y L, SPENCER, N. K., and ANDERSON, S. J.: 'Stochasticconstraints method in non-stationary hot-clutter cancellation - Part II: unsupervised training applications', IEEE Trans. Aerosp. Electron. SySt., 2000, 36, (1), pp.132-150 3 ANDERSON, S. J. and LEES, M. L.: 'High-resolution synoptic scale measurements of ionospheric motions with the JINDALEE skywave radar', Radio ScL, 1988, 23, (3), pp. 265-272 4 ANDERSON, S. J. and ABRAMOVICH, Y L: 'A unified approach to detection, classification, and correction of ionospheric distortion in HF skywave radar systems', Radio ScL, 1998, 33, (4), pp. 1055-1067 5 GRIFFITHS, L. J.: 'Linear constraints in hot clutter cancellation'. Proceedings of ICASSP-96, Atlanta, 1996, 2, pp. 1181-1184 6 KOLOSOV, A. A.: 'Over-the-horizon radar' (Artech House, Boston, 1987, English translation by W. F. Barton)
7 BLAKE, T. M.: 'Ship detection and tracking using high frequency surface wave radar'. Proceedings of 7th international conference on HF radio systems and techniques, 1997, pp. 291-295 8 JOUNY, 1.1. and CULPEPPER, E.: 'Modeling and mitigation of terrain scattered interference'. Proceedings of IEEE international symposium on Antennas and propagation, 1995,1, pp. 455-457 9 FANTE, R. and TORRES, J. A.: 'Cancellation of diffuse jammer multipath by an airborne adaptive radar', IEEE Trans. Aerosp. Electron. Syst, 1995, 31, (2), pp. 805-820 10 ZATMAN, M. and STRANGEWAYS, H: 'The effect of the covariance matrix of ionospherically propagated signals on the choice of direction finding algorithm'. Proceedings of conference on HF radio systems and techniques, IEE Conf. Publ. no. 392, 1994, pp. 267-272 11 ABRAMOVICH, Y. L, YEVSTRATOV, F. R, and MIKHAYLYUKOV, V N.: 'Experimental investigation of efficiency of adaptive spatial unpremeditated noise compensation in HF radars for remote sea surface diagnostics', Sov, J. Commun. Technol. Electron., 1993, 38, (10), pp. 112-118, English translation of Radioteknika i Elektronika 12 MARSHALL, D. and GAVEL, R.: 'Simultaneous mitigation of multipath jamming and ground clutter'. Proceedings of Adaptive sensor array processing workshop, MIT Lincoln Laboratory, 1996,1, pp. 193-239 13 ENDER, J. H. G.: 'The airborne experimental multi-channel SAR system AER-IF. Proceedings of EUSAR-96, Konigswinter, 1996, pp. 49-52 14 ABRAMOVICH, Y. L, MIKHAYLYUKOV, V. N., and MALYAVIN, I. P.: 'Stabilisation of the autoregressive characteristics of spatial clutters in the case of nonstationary spatial filtering', Sov. J. Commun. Technol. Electron., 1992, 37, (2), pp. 10-19, English translation of Radioteknika i Elektronika 15 ABRAMOVICH, Y. L, GOROKHOV, A. Y, MIKHAYLYUKOV, V N., and MALYAVIN, I. P.: 'Exterior noise adaptive rejection for OTH radar implementations'. Proceedings of ICASSP-94, Adelaide, 1994, 6, pp. 105-107 16 ABRAMOVICH, Y. I. and GOROKHOV, A. Y: 'Adaptive OTHR signal extraction under nonstationary ionospheric propagation conditions'. Proceedings of RADAR-94, Paris, 1994, pp. 420-425 17 BARRICK, D. E. and SNIDER, J. B.: 'The statistics of HF sea-echo Doppler spectra', IEEE Trans. Antennas Propag., 1977, 25, (1), pp. 19-28 18 ANDERSON, S. J., PRASCHIFKA, J., and FUKS, I. M.: 'Multiple scattering of HF radiowaves propagating across the sea surface', Waves Random Media, 1998, 8, pp. 283-302 19 ABRAMOVICH, Y. I. and ZAYTSEV, S. A.: 'One interpretation of the optimum algorithm for the detection of a signal masked by distributed interfering reflections', Radio Eng. Electron. Phys., 1979, 24, (5), pp. 25-32 20 ABRAMOVICH, Y. I. and KACHUR, V. G.: 'Characteristics of adaptive spatial clutter rejection', Radio Eng. Electron. Phys., 1983, 28, (9), pp. 57-64 21 GOROKHOV, A. Y and LOUBATON, P.: 'Subspace based techniques for blind separation of convolutive mixtures with temporally correlated sources', IEEE Trans. Circuits Syst. Fundam. Theory Appl, 1997, 44, (9), pp. 813-820
22 ABED-MERAIM, K. et al.: 'Prediction error methods for time domain blind identification of multichannel FIR filters'. Proceedings of ICASSP-95, Detroit, 1995, pp. 1968-1971 23 MORF, M., VIEIRA, A., LEE, D., and KAILATH, T.: 'Recursive multichannel maximum entropy spectral estimation', IEEE Trans. Geosci. Electron., 1978,16, pp. 85-94 24 STUTT, C. A. and SPAFFORD, L. J.: 'A best mismatched filter response for radar clutter discrimination', IEEE Trans. Inf. Theory, 1968,14, (2), p. 280 25 ABRAMOVICH, Y. I. and SVERDLIK, M. B.: 'Synthesis of a filter which maximises the SNR under additional quadratic constraints', Radio Eng. Electron. Phys., 1970,15, (11), pp. 1977-1984 26 WATTERSON, C. C , JUROSHEK, J. R., and BEUSEMA, W. D.: 'Experimental confirmation of an HF model', IEEE Trans. Commun. Technol, 1970, 18, pp. 792-803 27 ABRAMOVICH, Y. L, DEMEURE, C , and GOROKHOV, A. Y: 'Experimental verification of a generalized multivariate propagation model for ionospheric HF signals'. Proceedings of EUSIPCO-96, Trieste, 1996, 3, pp. 1853-1856 28 ABRAMOVICH, Y L, GOROKHOV, A. Y, and SPENCER, N. K.: 'Convergence analysis of stochastically-constrained sample matrix inversion algorithms'. Proceedings of ISCAS-96, Atlanta, USA, 1996, 2, pp. 449^52 29 ANDERSON, S. J.: 'Adaptive remote sensing with HF skywave radar', IEEProc. F, Radar Sonar Navig., 1992,139, (2), pp. 182-192 30 KOGON, S. M., WILLIAMS, D. B., and HOLDER, E. J.: 'Beamspace techniques for hot clutter cancellation'. Proceedings of ICASSP-96, Atlanta, 1996, 2, pp. 1177-1180 31 MALLET, J. D., REED, I. S., and BRENNAN, L. E.: 'Rapid convergence rate in adaptive arrays', IEEE Trans. Aerosp. Electron. Syst., 1974,10, (6), pp. 853-863 32 ABRAMOVICH, Y I.: 'A controlled method for adaptive optimization of filters using the criterion of maximum SNR', Radio Eng. Electron. Phys., 1981, 26, (3), pp. 87-95 33 ABRAMOVICH, Y I. and NEVREV, A. I.: 'An analysis of effectiveness of adaptive maximization of the signal-to-noise ratio which utilizes the inversion of the estimated correlation matrix', Radio Eng. Electron. Phys., 1981, 26, (12), pp. 67-74 34 CHEREMISIN, O. P.: 'Efficiency of adaptive algorithm with regularised sample covariance matrix', Radio Eng. Electron. Phys., 1982, 27, (10), pp. 69-77 35 GIERULL, C. H.: 'Performance analysis of fast projections of the Hung-Turner type for adaptive beamforming', Signal Process. (Eurosip special issue on subspace methods for detection and estimation, Part I), 1996, 50, pp. 17-28 36 ABRAMOVICH, Y L, MIKHAYLYUKOV, V N., and MALYAVIN, I. P.: 'Test of interference stationarity in adaptive filtering systems', Sov. J. Commun. Technol. Electron., 1992, 37, (3), pp. 1-10, English translation of Radioteknika i Elektronika
37 ANDERSON, S. J., BATES, B. D., and TYLER, M. A.: 4HF surface-wave radar and its role in littoral warfare', J. Battlefield Technology, 1999, 2, (3), pp. 23-27 38 'Innovative radar for export', Australian Defence Science, 1998, 6, (3), p. 12 39 ANDERSON, S. J. and MAHONEY, A. R.: 'Spectral estimation techniques for HF sea clutter analysis'. Proceedings of ISSPA-90, Gold Coast, Australia, August 1990, pp. 123-128 40 ANDERSON, S. J., MAHONEY, A. R., and TURLEY, M. D. E.: 'Applications of superresolution techniques to HF radar sea echo analysis'. Proceedings of PORSEC-94, Melbourne, Australia, March 1994, pp. 491^-99 41 BARRICK, D. E.: 'FMCW radar signals and digital processing'. Technical report ERL 283-WPL26, NOAA, 1973 42 BRENNAN, L. E.: 'Cancellation of terrain scattered jamming in airborne radars'. Proceedings of ASAP-96, MIT Lincoln Laboratory, 1996 43 SLOCUMB, T. H., GUERCI, J. R., and TECHAU, P. M.: 'Hot and cold clutter mitigation using deterministic and adaptive filters'. Proceedings of 5th DARPA Advanced signal processing hot clutter technical interchange meeting, Rome Laboratory, 1997. Classified 44 RABIDEAU, D. J.: 'Advanced STAP for TSI modulated clutter'. Proceedings of ASAP-98, MIT Lincoln Laboratory, 1998, pp. 263-287 45 BARRICK, D. E.: 'Remote sensing of sea state by radar' in DERR, V. E.(Ed.): 'Remote sensing of the troposphere' (Government Printing Office, Washington DC, 1972) chapter 12 46 HORN, R. A. and JOHNSON, C. R.: 'Topics in matrix analysis' (Cambridge University Press, England, 1991) 47 KLEMM, R.:' Space-time adaptive processing: principles and applications' (IEE, UK, 1998) 48 ABRAMOVICH, Y. L: 'Analysis of a direct adaptive tuning method for interference compensation systems with auxiliary linear constraints', Sov. J. Commun. Technol Electron., 1990, 35, (1), pp. 30-37, English translation of Radioteknika i Elektronika 49 ABRAMOVICH, Y. L: 'Convergence analysis of linearly constrained SMI and LSMI adaptive algorithms'. Proceedings of ASSPCC-2000, Lake Louise, Canada, 2000, pp. 255-259 50 SIOTANI, M., HAYAKAWA, T., and FUJIKOSHI, Y: 'Modern multivariate statistical analysis' (Amer. Sci. Press, Ohio, 1985)
Part VIII
Applications in acoustics and seismics
Chapter 18
Space-time adaptive matched field processing (STAMP) Yung R Lee
18.1
Introduction
Element-space pre-Doppler STAP [2] is two-dimensional fully adaptive processing that coherently combines the signals from the elements of an array and the multiple snapshots of coherent signals to obtain large spatial and temporal signal gain, suppress interference and provide target detection in azimuth and velocity. Computational complexity and the need to estimate the interference from limited snapshots make it difficult for real-time implementation. A few cost effective algorithms having real-time capability are mentioned in Reference 3. The adjacent filter beamspace post-Doppler STAP is a reduced-dimension partially adaptive approach. It performs a Doppler filtering with a temporal Fourier transform and a spatial filtering with the conventional beamforming before adaptive processing. The adaptive processing is done in a selected subspace including a few beams and a few Doppler bins [4]. In the complex multipath underwater environment, signals propagate along a vertical slice of ocean within a critical angle defined by the boundary condition of the local ocean environment. Because of the spread in vertical propagation angles, signals observed by a horizontal array will be spread into many horizontal beams when signals arrive off the broadside direction. This is illustrated in Figure 18.1. In addition, signals that propagate in different vertical angles have different phase speeds. Thus, when a source moves toward or away from the array, it also spreads over many Doppler bins if a long temporal integration time is used. Without combining the spread energy in beam and Doppler space, a processor will encounter severe signal mismatch/degradation. Space-time MFP that uses a propagation code to model the signal spread over beam and Doppler bins and coherently combines them should provide improvement in signal estimation while simultaneously providing range and
Higher mode (path, angle), larger cm larger cm, higher angle (off horizontal), smaller Doppler
FWD
Figure 18.1 Multipath Doppler/angle spread
target jammer (owii-sMp)
azimuth, deg
J80fffiQSC
target
AFT
passive forward-sector processing
a
Figure 18.2
Doppler, Hz
b
frequency, Hz
Comparison ofnarrowband passive sonarforward-sector problem with the nominal airborne radar STAP problem a STAP: detect the dot, null the jammer and the slanted clutter b STAMP: detect/combine/class/localise the dots, null the jammer and the clutter
depth localisation. Lee [1] demonstrated the single-element pre-Doppler space-time MFP with real passive sonar data in a shallow water environment. Klemm [5] introduced the element-space pre-Doppler space-time adaptive MFP and illustrated the principle with some numerical examples for active sonar. In this work, beam-space post-Doppler space-time adaptive MFP is presented and its performance with a multiline tow array is studied through a passive sonar simulation. Figure 18.2 shows the comparison of the narrowband passive sonar forward-sector problem with the nominal airborne radar STAP problem.
18.2
Adaptive matched field processing (MFP)
MFP refers to array processing algorithms that exploit the multipath structure of signals propagating in an ocean waveguide by using an acoustic model to calculate replica vectors in a search space. Measured signal-plus-noise data vectors are matched with the replica vectors and an ambiguity surface is computed. The locations at which peaks are found in the ambiguity surface are estimates of the source location. An excellent overview of MFP can be found in the paper by Baggeroer et al. [6]. In a slowly range varying environment, the adiabatic normal mode method [7] has proved to be effective in modelling sound propagation in an ocean waveguide. In a complex range-dependent environment, the parabolic equation (PE) [8] marching algorithm has proved to be more accurate than the adiabatic normal mode method. Both of these are narrowband algorithms. The PE marching algorithm nominally uses a range step and a depth grid that is proportional to the wavelength. It becomes computationally intensive for increasing centre frequency, range and bandwidth. A recent work using the Gaussian beam bellhop [9] ray trace method has shown promise in speeding up broadband range-dependent propagation modelling. A book by F. B. Jensen et al. [10] provides an excellent overview of these models. In this study a shallow water range-independent environment is assumed and the normal mode method is used to model signal propagation. The complex pressure at receiver location r and depth z due to a narrowband source at range r5, depth zs and frequency w, can be expressed as:
where m = 1 , . . . , M are mode indices, (pm (z) are modal depth eigenfunctions, km are modal horizontal wave numbers and a is the narrowband source amplitude. The depth eigenfunctions and horizontal wave numbers satisfy:
where ko(z) = w/c(z) and c(z) is the sound speed profile that covers the water column and extends into the sediment and ocean basement. The pressure field at an Af-hydrophone receiving array can be modelled in a vector form as:
where (r/,z/), / = 1 , . . . , N are hydrophone locations. For a given search location (rs, Zs), the MFP replica then is calculated as:
Let:
be a data vector, a measured pressure field at the receiving array. The conventional MFP response at search location (rs, zs) is calculated as: SCMFPO^ZS) =
h+(rs,zs)x
where ' + ' denotes a Hermitian transpose. Adaptive processing uses the measured signal-plus-noise data vectors to minimise the sidelobe contributions from those components that do not match with the replica vector for a given search cell. Most adaptive techniques used today had their genesis in the minimum variance distortionless response (MVDR) algorithm. Let R be the covariance matrix of the received signal and noise, and R = (xx + ) where () denotes ensemble average over a number of sequential data vectors. The MVDR minimises the variance at the output of a linear weighting of the hydrophone array subject to the distortionless constraint that signals in the look direction have unity gain. The formulation minimises the variance given by: SMVDR
= W + Rw
with respect to the weighting w, subject to the unity gain constraint: w+h(r,,z,) = l The MVDR weight vector WMVDR can be derived as:
where R * is matrix inversion. The MVDR output is:
Applying the singular-value decomposition method, the covariance matrix R can be decomposed into a set of eigenvectors \k associated with a set of eigenvalues kk:
and the inverse of the covariance matrix is given by:
The MVDR weight vector becomes:
The MVDR output becomes:
Without mismatch between data and model and when the signal is loud enough, the replica vector h(rs,zs) is perfectly matched with signal eigenvector in the look direction. The rest of the eigenvectors are orthogonal to the replica vector and have no effect on the output. But, when mismatch is present, the replica vector is no longer orthogonal to the rest of eigenvectors. The noise vectors associated with the least significant eigenvalues then dominate the inverse processing and degrade the signal estimation. In MFP, there are several forms of mismatch, including mismatch due to environmental uncertainties and fluctuation, and mismatch due to system errors. All forms of mismatch are either deterministic or random. Deterministic mismatch degrades the signal estimation and causes localisation bias, but it can be minimised if more ground truth is provided. Random mismatch degrades the signal estimation but will not bias the localisation. Random mismatch cannot be minimised so that robust algorithms were developed to tolerate it to a certain level. Intuitively, one knows that the larger the signal space, the more signals would be included in the estimation, and the estimation is more vulnerable to random mismatch. The robust adaptive algorithms developed to tolerate random mismatch were all based on some form of rank reduction to the signal space. One can reduce the apparent signal space by either adding noise or putting constraints on targets. Adding noise expands the noise space, which obviously reduces the signal space; putting constraints on targets causes each target to occupy more than one degree of freedom, which also reduces the total number of possible targets in the signal space. The white-noise-gainconstrained (WNC) method referred to by Cox [11], dynamically adjusts the sensor noise level by adding white noise power to the diagonal elements of the covariance matrix subject to an inequality constraint on the sensor noise gain. This technique is called LSMI (loaded sample matrix inverse) in the radar community. Adding noise that expands the noise space and effectively eliminates small eigenvalues that would otherwise dominate the sum in the MVDR output calculation due to mismatch. Lee [12] extended this concept in MFP, which will be discussed in the following section.
18.3
Wideband-narrowband feedback loop white-noise-constrained method (FLWNC)
The MVDR white noise processing gain, defined as the amplitude squared of the weight vector |WMVDRI2> is directly proportional to the signal-to-noise ratio (SNR). The MVDR signal degradation due to mismatch is inversely proportional to the SNR. At low SNR, the white noise processing gain approaches unity (the conventional linear processing noise gain) and the mismatch effect is negligible. At high SNR,
the white noise processing gain is high and the MVDR output is very sensitive to mismatch. For each MFP search location, the WNC method dynamically adjusts the sensor noise level by adding white noise to the diagonal elements of the covariance matrix. Adding white noise to the covariance matrix lowers the apparent SNR so that the processing becomes less sensitive to mismatch. Adding white noise, in the amount £, to the diagonal elements of the covariance matrix is the same as adding e to each eigenvalue without modifying the eigenvectors. The WNC weight vector WWNC that results is:
or
and the WNC output is:
or
The wideband-narrowband FLWNC method is shown in Figure 18.3. Widebandnarrowband means using signal and noise in several frequency bins (wideband) for
covariance matrix decomposition WB/NB processing
Figure 18.3
Wideband-narrowband (WB/NB) feedback constrained (FLWNC) adaptive processing
loop
white-noise-
adaptive weight estimation then applying the estimated weight to each frequency bin (narrowband). Instead of ensemble average data vectors over a sequence of time snapshots, averaging over a finite frequency band forms a wideband covariance matrix. For each MFP search location, the MVDR white noise processing gain |WMVDR|2 and the MVDR output are calculated. If the white noise processing gain falls below a preselected constraining value <52, the FLWNC weight vector is set equal to the MVDR weight. If the white noise processing gain is above the constraining value, an amount of white noise that equals the MVDR output is added in the processing and a new white noise processing gain |WWNCI a n d a new output 51WNC are calculated. The new white noise processing gain then is compared with the constraining value. The process is repeated until the new white noise processing gain falls below the constraining value. The feedback-loop approach ensures that the amount of white noise added in each iteration is adequate so that the iteration procedure converges rapidly without overshooting. The calculated adaptive weight is then used to filter data vectors at each frequency bin. This is called wideband-narrowband processing because the weight is calculated with the covariance matrix that is ensemble averaged over a broader frequency band, and then applied to narrowband snapshots at each frequency bin. After filtering through all frequencies of interest, the maximum response over the frequencies is reported.
18.4
MFP examples
MFP detects and localises signals by exploiting the multipath structure of signals propagating in an ocean waveguide. To effectively sample the multipath structure of signals with a short integration time, a vertical array that covers most of the water column of the ocean is needed. In a shallow water experiment that took place in the north-eastern Gulf of Mexico in November 1995, a vertical array was deployed and a source was towed along a 188 m depth bathymetry contour. The test area was located approximately 90 miles south of Panama City, Florida. Figure 18.4 shows a measured sound speed profile in the test area. For the data presented here, the source broadcast a CW tone at 197Hz towed at 76 m depth toward the array starting at approximately 9 km. The tow speed was approximately 2.5 m/s. Figure 18.5 shows a snapshot of the received and the predicted pressure fields when source was at 9.06 km. The predicted pressure field was calculated with a range-independent normal mode code. A complex multipath interference structure versus depth is seen in Figure 18.5. MFP exploits this multipath interference structure to discriminate signal in range and depth. Figure P.9 (see colour signature) shows the resulting conventional MFP range-depth ambiguity surface. A peak appears at 9.06 km in range and 76 m in depth, that is the estimated source location. MFP can in principle also be done with a horizontal array in or close to endfires if the array is long enough to sample the signal multipath interference well. For arrays that do not have extended aperture such as short towed line arrays, MFP with a synthetic aperture was introduced [1,3]. Figure 18.6 shows the received and the predicted pressure fields on the bottom hydrophone for the same experiment over a period of
depth, m
water column sound speed, m/s
bottom
sediment sound speed, m/s
Measured sound speed profile in a Gulf of Mexico experiment
depth, m
Figure 18.4
level, dB
Figure 18.5
Measured and predicted pressure field for source at 9.06 km
418 s. Within this extended period the source moved towards the receiver a distance of 1.063 km. The pressure field on the bottom hydrophone shows significant multipath interference versus range. The data were sampled at 1.22 Hz so that Figure 18.6 could represent the pressure field observed on a 512-element synthetic horizontal array (HLA) whose aft endfire is towards the source. The synthetic HLA MFP, singleelement pre-Doppler space-time MFP, uses this range-varying multipath structure to detect and localise source. The resulting conventional range-depth ambiguity surface for a synthetic HLA with spacing of 2.076 m (tow speed of 2.5 m/s toward receiver)
level, dB
range, m
Figure 18.6
Measured and predicted pressure field on the bottom hydrophone for source towed at 2.5 m/s from 9.18 km toward the receiver
is shown in Figure RlO (see colour signature); it demonstrates source localisation of the single-element pre-Doppler space-time MFP. In the single-element pre-Doppler space-time MFP, source speed (the synthetic HLA spacing) is also a search parameter. Figure P. 11 (see colour signature) shows the resulting conventional range-speed ambiguity surface for source depth of 76 m. The source range and speed are determined and the processing is sensitive to the search speed.
18.5
Space-time adaptive matched field processing (STAMP)
Figure 18.7 shows the STAMP processing diagram for a multiline tow array. It starts with Doppler processing, Fourier transformation of received hydrophone time series Xki (0 into frequency domain X^/(/):
Xki(f) = FFT{xki(t)} where / = 1 , . . . , L is the line index and k = 1 , . . . , K is the phone index in a line. L is the number of towed lines of the tow array and K is the number of hydrophones in each towed line. A plane wave conventional beamforming response £/(/, 0):
array shape environ.
propagation code to generate replica PkAt)
search
Doppler processin
beam space replica (selected beams and Dopplers) B(Z^r 5 5 Z 5 5 V 5 )
conventional beamforming
Pain
r
*> Z& O1, vs
*Plane-wave ~ STAP linel phone Doppler processing phone K xKI(t)
line L phone
phone K xKL(t)
Figure 18.7
WB/NB adaptive MFP
conventional beamforming
output ambiguity surface T39 zs, 6t, vs
XkAf) beam space vector (selected beams and Dopplers) Wi, Oj) Doppler processing
forming covariance matrix R=
conventional beamforming
Xuif)
Space-time adaptive matched field processing (STAMP)
is then calculated at each frequency bin for each towed line, where d is the hydrophone spacing and c is sound speed. A long beam space vector B(/i, 0\)\ B(/i,0i) =[b\(fuOx)M{fuOx
+ A0),.. .Mif\,O\
+ ^VA0\
/71 (/i + A/,0i),&i(/l + A/, 0i + AO),..., fti(/i +AZ,^i+iVA0), • • • ?
fci(/l + M A / , 6»i +A?A0), ^2(/i,01),b 2 (/],^i + A9),...,b2(/i,^i
+ TVAO),
/J2(Zl + AZ, O1), M Z l + AZ,0i + A 0 ) , . . . , *2(Zi + AZ, 01 + NA8), ... , *2(Zi + M Af,Qx)MUx b2(fi+MAf,el+NA6),
+ M Af,Oi + AO),...,
.. .,
bL(fu0i),bL(fuOi
+ AO),.. .,btifuOi + NAO),
M/i+A/.eo.M/i + A/.fli + Afl),..., bh(f\+Af,ex+NA6), .. .,
bL(fi + MAf9Oi),bL(fi + MAf9Oi + A0),..., &L(/I
+ M A / , 0i + Af A0)]
is formed with beam responses from all towed lines for selected beams from 0\ to 0\ -f NAO and Doppler bins from f\ to / i -f- MAf. M is the number of Doppler bins and N is the number of beams. The covariance matrix R is formed by the outer product of B(/i, 0\) and ensemble averaged over a wide Doppler band: R=(B(Zi^i)B+(Z,,*!))/, For MFP, predicted hydrophone pressure time series pki(t) for a signal at search range r5, depth zs, and relative speed vs at frequency f\ and bearing 0\ are calculated with a propagation code for a known array shape and acoustic environment. They are then passed through Doppler processing and conventional beamforming, and are normalised to form the beam space replica. The adaptive weight vector is calculated with the beam space replica and the wideband (averaged over wide Doppler band) covariance matrix R, then applied on each B(/i 9 6\) to get the adaptive narrowband response. It is noted that STAMP will be the same as conventional STAP when one replaces the propagation code with a plane wave signal model.
18.6
Forward sector processing simulation geometry
Figure 18.8 shows the simulation geometry of forward sector processing. In this simulation, the tow ship noise and its bottom bounce energy are treated as stationary broadband point interference. The target is 10 km in front of the tow ship at 90 m in depth. It broadcasts a narrowband signal and moves towards the tow ship with a relative speed of 6 kts. Three array configurations were considered: single-line, 4-line-sequential, and 4-line-vertical. Each single-line consists of 48 phones with a spacing of 2.25 m. The arrays are at a nominal depth of 90 m. The 4-line-sequential configuration connects four single-lines to form a long horizontal line. The 4-linevertical configuration stacks four single-lines vertically with a vertical spacing of 10 m. The environment used in this simulation is the same as the one described in Section 18.4. Figures P. 12 and P. 13 (see colour signature) show beam/time responses (BTRs) and Doppler/azimuth responses of each signal component, respectively, for the conventional plane wave beamforming of a single-line. The own-ship and the bottom interference arrive at relatively higher angles away from the forward endfire at 0°. The target component will be buried underneath the own-ship interference in the combined BTR, but with 256 s integration time, it begins to separate from own-ship
3kts
towed
own-ship noise
bottom bounce
Figure 18.8
Forward sector simulation geometry, f= 200 Hz, target (narrowband) = 12OdB, own-ship (broadband) = 12OdB, bottom bounce (broadband) = 115 dB, white NL = 12OdB, random phase error =0.1 wavelength, no environmental mismatch
noise in the Doppler/azimuth response. The narrowband target signal is spread in Doppler and azimuth due to multipaths shown in Figure P. 13 that can be coherently combined with STAMP processing to enhance detection and localisation. This is the motivation of STAMP processing. The top two panels in Figure P. 14 (see colour signature) show the plane wave beam spectrograms for a single-line steered at 10° off the forward endfire. The high-angle own-ship noise leaks into this shallow angle and causes the high noise background in the conventional beam spectrogram, but is significantly suppressed by the adaptive processing. The bottom left panel shows the STAMP track-cell-gram that tracks the target location and the bottom right panel shows the maximum response over Doppler. The STAMP uses beams of 0° to 30° and six Doppler bins for a 6kt search. It is noted that STAMP processing provides 2-3 dB more signal gain than does plane wave processing for single-line and provides 8-9 dB more with a 4-line-vertical array. Figure P. 15 (see colour signature) shows the range tracking performance of the STAMP. In the simulation the target starts at 10 km and moves towards the towed ship. With single-line, the conventional MFP does not provide range discrimination of the target. With adaptive MFP, single-line STAMP starts to show the target track that is closing in range. The 4-line configurations help to suppress the range sidelobes, and the 4-line-vertical array provides a better performance than does the 4-line-sequential array. Figure P. 16 (see colour signature) shows depth discrimination for STAMP range tracking with the 4-line-vertical array. The target track is formed only at the target depth of 90 m. The target-related cascaded sidelobes are seen at other depths. Similarly, Figure P. 17 (see colour signature) shows speed discrimination for STAMP range tracking with the 4-line-vertical array. The target track is formed at the target
speed of 3 m/s. Away from the target speed, the track becomes defocused and only target-related cascaded sidelobes are seen at search speeds far away from the target speed.
18.7
Summary
STAMP processing that combines STAP and MFP has been developed. Simulations show that STAMP coherently combines signal multipath spread in azimuth and Doppler and greatly enhances the target detection as well as providing target range and depth classification and localisation. In future studies, the robustness of STAMP against array shape error, frequency mismatch and environmental mismatch as well as how STAMP performs in other tactical scenarios will be addressed. References 1 LEE, Y. P.: 'Synthetic aperture matched-field processing'. JASA 100, 1996, pp.2851 2 WARD, J.: 'Space-time adaptive processing for airborne radar'. Lincoln Laboratory technical report TR-IO15, December 13, 1994 3 KLEMM, R.: 'Principles of space-time adaptive processing' (IEE, UK, 2002) Chapter 7 4 WANG, H. and CAI, L.: 'On adaptive spatial-temporal processing for airborne surveillance radar systems', IEEE Trans. Aerosp. Electron. Syst, July 1994, 30, (3), pp. 660-670 5 KLEMM, R.: 'Interrelation between matched-field processing and airborne MTI radar', IEEE J. Ocean. Eng., July 1993,18, (3), pp. 168-180 6 BAGGEROER, A. B., KUPERMAN, W. A., and MIKHALEVSKY, P. N.: 'An overview of matched field methods in ocean acoustics', IEEE J. Ocean. Eng., October 1993,18, (4), pp. 401-424 7 'The KRAKEN normal mode program'. SACLANT Undersea Research Center memorandum (SM-245)/Naval Research Laboratory mem. rep. 6920, 1991 8 COLLINS, M. D.: 'A split-step pade solution for parabolic equation method', /. Acoust. Soc. Am., April 1993, 93, (4), pp. 1736-1742 9 PORTER, M. B.: 'Gaussian beam tracing for computing ocean acoustic fields', J. Acoust. Soc. Am., October 1987, 82, (4), pp. 1349-1359 10 JENSEN, F. B., KUPERMAN, W. A., PORTER, M. B., and SCHMIDT, K.: 'Computational ocean acoustics' (Springer-Verlag, New York: American Institute of Physics, 2000) 11 COX, H.: 'Robust adaptive beamforming', IEEE Trans. Acoust. Speech Signal Process., October 1987, ASSP 35, pp. 1365-1376 12 LEE, Y, FREESE, H., HANNA, J., and MIKHALEVSKY, P.: 'Robust adaptive matched-field processing', Proceedings of IEEE Oceans'93, October 1993, 3, pp. 387-392
Chapter 19
Space-time signal processing for surface ship towed active sonar Dirk Maiwald, Stephan Benen and Helmut Schmidt-Schierhorn
19.1
Introduction
This chapter deals with reverberation suppression in active sonar systems. Reverberation is a phenomenon which is due to reflections of the transmitted acoustic signals by the ocean bottom, from the sea surface and from within the ocean volume. Reverberation limited environments like littoral waters are a severe problem for active sonar systems because the target echo level may not exceed the reverberation level. Figure 19.1 gives an overview of different sonar systems used today. The hullmounted sonar (HMS) covers the self defence range of about 10 kyd, where the term self defence refers to topics like mine avoidance and torpedo defence. The medium range of an HMS is defined as the normal anti-submarine warfare (ASW) operation range. Typical ranges are about 30 kyd. The low frequency active towed array sonar (ACTAS) has a much larger operation range with values well above 100 kyd, see Figure 19.1. It is designed for long-range detection of small, silent and slowly moving submarines. In littoral waters this operation range is reduced due to reverberation. Active towed array sonar systems are not only operating at low frequencies but also in the mid-frequency range. Mid frequency active towed array sonar systems are used e.g. for short-range detection of small, possibly fast, objects like torpedoes. Active sonar systems are used in a sensitive marine environment. Designers and users of active sonar systems are aware of the potential threat of active sonar systems to the marine environment. They take account of the influence of artificial underwater sound on e.g. cetaceans.1 In this chapter, we present sonar systems which follow 1
Whales, dolphins and porpoises are all classified as cetaceans. They are mammals spending their complete life in water
low frequency activated towed array sonar
medium frequency hull mounted sonar HMS
Figure 19.1 Different sonar systems and their corresponding operation range rigorous mammal protection procedures in order to exclude any mammal injury due to active sonar operation. Further information on this topic can be found e.g. on the internet [2]. The design concept of an active towed array sonar system is described in Figure 19.2. A body is towed by a ship which moves with a velocity Vp. The transmission array is placed in this body. Horizontally, the transmission array supplies an omnidirectional pattern; vertically, it avoids the insonification of the sea surface and the ocean bottom. A receiving line array consisting of N equispaced identical sensors is attached to the body with a tow cable of several hundred metres in length. The array can be a single line array, a triplet array or a twin array depending on the purpose of the sonar system. For further information on system design see, e.g., References 4 and 17. The towing depth of the body and the receiving array are approximately identical and can range from several tens to several hundreds of metres. It is controlled by the tow cable length between body and towing platform and the towing platform speed. One great advantage of active towed array sonar systems compared with the more traditional hull-mounted sonar systems is the potential to adapt the sonar operating depth according to the sound velocity profile of the ocean. Active towed array sonar systems are operated at towing speeds well above 20 kts. For moving active sonar systems, reverberation has a space-frequency characteristic. Due to platform motion, the returns e.g. from the sea surface or the ocean bottom are Doppler shifted by a value which is proportional to the cosine of the angle ft between the platform velocity vector and the vector to the position of the individual scatterer.
cable 2
receiving array
towed body with transmitter water depth design features:
Figure 19.2
Design concept of active towed array sonar system (ATLAS ELEKTRONIK GmbH). The receiving array can be a single line array, a twin array (shown in the picture), or a triplet array
The reverberation bandwidth depends on parameters such as beam direction, beam width, platform velocity, bandwidth of the transmitted pulse and frequency. The Doppler shift for a stationary scatterer is approximately given by: /D^^/CCOS/5
(19.1)
where Vp and C denote the magnitude of the platform velocity and the sound velocity, respectively, fc is the transmit centre frequency. For the narrowband case, this spacefrequency characteristic is illustrated in Figures 19.3 and P. 18 (see colour signature). In Figure P. 18, the space-frequency characteristic including reverberation, a jammer, a target and sensor noise for a single sensor signal is shown in the upper right corner. The green bar indicates the space-frequency characteristic given by equation (19.1). This image describes the wavefields received by a sensor at a constant range interval. A single sensor cannot resolve the directions of the different signals. Then, the output of an ideal processing chain is compared with the output of realistic signal processing equipment. For ideal processing, an ideal beamformer with infinite sidelobe suppression is defined, where the beam width of the ideal beamformer is assumed to be equal to the beam width of a realistic beamformer. The Doppler spectrum (upper left plots A and C in Figure P. 18) and the angular spectrum (lower right plots B and D in Figure P. 18) are illustrated for the target direction and the target frequency, respectively. This means that plots A and C give a spectral signal analysis for a single beam direction and plots B and D indicate the spatial spectrum for a single frequency. The width of the peaks in the frequency domain are approximately given by B ~ 1/Tpuise where rpuise denotes the pulse
Doppler spectrum
scenario
sensor spectrum
no target
mainlobe clutter sidelobe clutter
beamdirection
Figure 19.3
Doppler spectrum for one beam direction and different target speeds. One beam signal is analysed where a perfect match between beam direction and target direction is assumed
length. The DFT is assumed to be adapted to the pulse length, i.e. the frequency resolution is determined by the pulse length. First, consider the case where the jammer is switched off. The green curves in plot A and B show the output of the ideal processing. No detection problem occurs in this case because reverberation and target can be discriminated in the spacefrequency plane. The finite sidelobe suppression of the realistic beamformer results in a spectral pedestal in plot C (blue curve) which extends from fc — (2Vp/c)fc to fc + (2Vp/c)fc and overlays the target and the reverberation spectral peaks. This Doppler spectrum is often denoted coloured reverberation. Because of the finite sidelobe suppression, the spatial spectrum given in plot D (blue curve) shows a pedestal for all directions. Depending on the power level of the target echo it may be covered in the spatial (plot D) and the frequency (plot C) domain by reverberation components. As in radar applications, a jammer shows up in a small angular sector and a wide frequency band. Consider the case that the jammer is switched on in Figure P. 18. For ideal processing it only affects the spatial spectrum, as indicated by the red curve in plot B. The jammer does not show up in plot A because it is suppressed by the ideal beamformer. In realistic processing, the jammer fills up the complete frequency and spatial spectrum as shown by the red curves in plots C and D. For FM signals with a large bandwidth, the Doppler shift due to platform motion will be much smaller than the bandwidth of the transmitted pulses. In Figure P. 18
this would mean that the bar in the upper right corner would have a width corresponding to the bandwidth of the FM signal. In this case, the target echo will almost always be covered by the transmitted spectrum in the space-frequency plane. Space-time filtering will not be possible. The Doppler spectra of Figure 19.3 give a detailed look at spectra for a specified direction. Figure 19.3 is more familiar to the sonar community. In Figure 19.3, three different scenarios are shown on the left-hand side, the corresponding Doppler spectra are presented on the right-hand side. In the first case, no target is present. One sees that the reverberation is centred around fc and that it extends from fc — (2 V>/c)/c to fc + (2Vp/c)fc. For comparison, the spectrum of a single sensor signal is drawn. The mainlobe and sidelobe clutter of the reverberation can be identified where the centre frequency of the mainlobe clutter depends on the beam direction. In this case, the beam points in the aft direction, which implies a negative centre frequency for the mainlobe. The second scenario shows a fast target which can easily be detected because the corresponding target Doppler frequency is lying outside of the coloured reverberation. The third case shows a target with a Doppler frequency inside the coloured reverberation area. This scenario results in a severe detection problem. Figure 19.4 illustrates the significance of this detection problem. A ship is assumed to move with Vp = 20 kts and objects are approaching this platform with Vt = 30kts from all directions. Then, the corresponding Doppler frequency lies inside the reverberation range for the complete aft sector which is indicated by the darker bottom half of the circle. These illustrations show that reverberation is a severe problem in clutter areas for both CW and FM pulse processing. To resolve this situation, we propose taking advantage of the ping history for CW and FM pulses, although this history is used differently in the two cases.
Figure 19.4 Illustration of a reverberation detection problem (narrowband case). The circle corresponds to a situation where the moving target lies inside (outside) the reverberation
In radar applications, space-time adaptive processing (STAP) techniques are used for the detection of slowly moving objects by airborne radar systems [21, 10]. In contrast to sonar applications, sequences of coherent pulses are usually used in radar STAP systems. Then, a Doppler shift can be measured by coherent analysis of several pings. In sonar applications, the target Doppler is usually obtained by analysis of a single sounding period. Information on multiple pings can be used to obtain more stable parameter estimates. Using STAP techniques in sonar systems to overcome reverberation has been suggested [6, 8], but the differences between radar and sonar application prevent a direct transfer of techniques. The different wave velocity in air and water is one issue. Phased array radar systems consist of a large number of sensors and one focus of radar signal processing is to reduce the number of processing channels. The number of beams is usually much smaller than the number of sensors. This is different in sonar applications. The sonar systems discussed in this chapter obtain an equidistant sampling of space by beams and possess an equal or larger number of beams than sensors. Furthermore, sensor noise is almost never dominant in sonar systems. In this chapter we adapt radar STAP processing techniques for improving moving target detection for CW pulses. For FM pulse processing this is not possible. Nevertheless, for FM pulses we can use the positional change of moving targets during several pings to discriminate between moving targets and clutter. STAP applications to sonar problems have found some interest during recent years. The first publications known to the authors are References 8 and 6. In Reference 9, STAP techniques are used in connection with matched field processing to estimate the target position, depth and velocity. In Reference 3, ping-to-ping coherence is investigated for a stationary transmitter and receiver. For this scenario, reverberation suppression by multiping processing could be achieved. In Reference 14, stepped CW processing is suggested to overcome reverberation. An adaptive beamspace algorithm to suppress coloured reverberation is introduced in Reference 15. This chapter also suggests using several pings to estimate the corresponding covariance matrices. This chapter summarises and extends results presented in condensed form in conference papers [11, 12]. The main focus is on practical application of algorithms in sonar systems. The active towed array systems of ATLAS ELEKTRONIK have built-in sophisticated processing techniques to solve reverberation problems in littoral waters. These features have been described in detail in previous publications [4, 17, 22, 24, 23]. In this chapter we concentrate on further improvements by implementation of advanced signal processing algorithms.
19.2
Narrowband multiple ping processing
19.2.1 Data model If the target Doppler frequency falls in the region of the coloured reverberant background, depending on SNR it may not be detected by conventional processing. The
use of adaptive space-time processing methods which jointly process data from the individual sensors in both space and time associated with one sounding period can overcome these problems. The following data model applies to CW pulses. Let xm = x(tm) be the Af-dimensional vector of array element outputs at time tm = mTs, where Ts = I/F5 denotes the sampling interval. Then define xP = (xp,... ,xp+M_\)r as the space-time vector obtained by stacking the array outputs at M successive time instances one below the other. In this chapter the symbols' and * denote matrix transposition and Hermitian operation, respectively. The covariance matrix of this vector is given by C*. Now, let x = s + r -f n be composed of a target component s, a reverberation component r, and an uncorrelated noise component n. The target can be described by a complex amplitude at, a direction pt and Doppler frequency ft. We obtain, see e.g. Reference 21: s = at(ds(Pt, fc + ft) ® df(fc + /,)) = atdsf(pu fc + ft)
(19.2)
In this formula ® denotes the Kronecker product, (I5(Pt, fc + ft) is the TV-dimensional steering vector with elements e-P*(fc+ft)n(d/C) cos pt ( n = o , . . . , Af - 1) where d is the inter-element spacing. d_f(fc + ft) is the M-dimensional Doppler vector with elements Q-J^(fc+ft)mTs ( m = o , . . . , M - 1). Reverberation can be modelled as the superposition of independent returns from clutter patches in different directions with associated different Doppler frequencies. The corresponding reverberation covariance matrix is given by: Cx9T = J2 y o(ft' fc + fk)dsJ(pk,
fc + fk)dstf(Pk, fc + fkf
(19.3)
k
The frequency fk is given by equation (19.1) and vo (Pk •> fc + fk) is the power spectral density at the reference sensor of a return from a clutter patch with direction Pk and associated Doppler frequency fk. Equation (19.3) models the transmission-reception situation of different waves at the sensor array. We assume that a history of L pings is available for data processing. Data from ping / at time instance p is denoted by xj. The pulse repetition interval is given by 7PRI. For FM pulses the described data model would be useful for a coherent ping-toping analysis which is not possible in sonar applications.
19.2.2 Fully adaptive CWprocessing A space-time processor is a linear filter which combines all data from a range gate of interest to produce a scalar output [21], i.e. beamforming and Doppler analysis are done simultaneously. The processor can be represented by an MTV-dimensional weight vector w. The output of the processor is given by: z = w*x. The optimal weight vector u;opt maximises the signal-to-noise-plus-interference ratio (SINR) (e.g. Reference 21). For a given beam direction Po and Doppler frequency /o the weight vector is given by: ^opt = C ; X / ( A ) , fc + /o)
(19.4)
In the following part, this processing is called the fully adaptive CW processing. The matrix Cx is not known in advance and has to be estimated by: Cx_ — (I/K) Ylk=\ ^ ^ •>wnere *t denotes data from range gates (training data) near the range of interest. It is assumed that no target signal component is included in this estimate. This estimate is used to calculate the weight vector in equation (19.4) which is then applied to the range gate under consideration, see Figure 19.5. This is the conventional processor architecture used in radar with the processing dimensions changed. For training, data in the vicinity of the range of interest is chosen. This training data selection strategy is also used for the partially adaptive algorithms (see Section 19.2.3). The number M in Figure 19.5 is adapted to the pulse length. A training strategy has to consider several points. First, the number K of snapshots to estimate has to be sufficiently large to allow for the inversion of the (NM x NM)dimensional matrix Cx and to guarantee the efficient suppression of reverberation. It is pointed out that regarding matrix inversion, the number of snapshots K is too small in our applications to sea data. Furthermore, the inversion of Cx implies a high computational burden. Second, the reverberation scenario is not known in advance and depends on range. Thus, the weight vector has to be determined in a data-adaptive way. We propose to use data from L pings for the calculation of:
(19.5)
antenna elements
range
IIIII11IIIPIII1 data cube
iiiiiii
Figure 19.5
Fully adaptive STAP
apply weight
covariance matrix
output
Of course, the basic assumption for using data of multiple sounding periods is that the environmental conditions have not changed drastically during the corresponding time period. This estimation procedure is indicated in Figure 19.5. The effort of further algorithmic improvements has to be twofold. On the one hand, one has to look for computational efficient, suboptimal methods for filtering. These algorithms do not perform beamforming and Doppler analysis at the same time (see e.g. Reference 21). On the other hand, one can invoke algorithms other than matrix inversion for the calculation of the weight vector. One possibility is the use of eigenspace methods (e.g. References 13 and 20), where only the eigenvectors of C^ belonging to the largest eigenvalues are used. Then, a smaller number of snapshots can be used for the estimation of CV
19.2.3 Partially adaptive processing techniques The fully adaptive processing introduced in the preceding section suffers from the high dimensionality of the covariance matrix and is therefore not practical for applications with measured data. The aim is to find reduced-dimension or partially adaptive processing algorithms. Of course, the conventional procedure, i.e. beamforming followed by Doppler filtering, is one possibility to reduce the computational complexity. In this chapter we investigate apost-Doppler adaptive filtering algorithm or frequency-dependent spatial processing algorithm (e.g. References 21 and 10). These algorithms have been successfully applied to radar data. In References 1 and 6 alternative reduced-dimension structures are introduced addressing adaptive Doppler filtering and adaptive FIR filtering, respectively. In contrast to these publications, our algorithm uses information of several sounding periods. Post-Doppler adaptive processing (Figure 19.6) means that first a windowed Doppler filtering for each sensor signal (DFT filter bank) is performed. The window is necessary to suppress leakage in the frequency domain, compare Figure R18 (see colour signature). Therefore, this DFT filter suppresses mainlobe clutter nonadaptively. Second, for each Doppler bin the corresponding array covariance matrix of dimension N x N is estimated and used for adaptive beamforming. Within each Doppler filter, the adaptive beamforming places spatial nulls at the angles where the clutter or jammer falls within the Doppler passband of the corresponding frequency bin. In this algorithm, an Af x TV dimensional matrix has to be inverted M times instead of inverting an NM x NM dimensional matrix in the fully adaptive processing scheme. The spectral cross terms of the complete space-frequency covariance matrix are neglected. The number M has to be adapted to the pulse length which is used in the corresponding sonar system, see Sections 19.4.2.2 and 19.4.2.3. Typical values range from M = 64 to M = 512. Although this algorithm is denoted as post-Doppler adaptive processing in the STAP community, it is a common procedure in the sonar area. Adaptive algorithms in the frequency domain are used e.g. for tow ship noise suppression.
sensor 1 M time samples
sensor N M time samples
lliii
l||||j
||p||^|||lii;;i|||||
Figure 19.6
19.3
Post-Doppler adaptive processing scheme
FM processing
The FM processing presented in this chapter will be based on the echogram. The echogram is a powerful display for detecting and tracking slow moving targets in dense clutter fields. It is a multiple ping range-bearing plot. The use of a ping history is a well known mechanism to detect low SNR targets because targets will show up in successive pings whereas echoes due to noise will disappear. Figure 19.7 shows the setup of an echogram. Data from successive sounding periods are stacked one next to the other. The beam directions are assumed to be north-stabilised. Display data of the echogram is obtained by conventional FM preprocessing; this preprocessing chain is presented in Figure 19.8. To simplify the presentation in Figure 19.8, it is assumed that all range samples of one sounding period are available before the processing starts. In a real system one expects to obtain processing results well before the end of the sounding period. This implies e.g. a block-oriented data processing and transfer scheme. In Figure 19.8 beam data are input to the preprocessing chain. The demod and filter/subsamp block extract the active processing frequency band from the beam signals. Of course, digital filtering belongs to this processing block. The matched filter (MF) correlates the input signals with the transmitted FM pattern where the Doppler due to own-ship motion during transmission and reception is compensated. For FM signals with bandwidth B, the MF compresses the echo signal to approximately \/B. After an integration and a subsampling step, a normalisation is performed for each beam in range direction. The median of windows left and right of the corresponding range point under consideration is used for this procedure.
range, kyd
bearing (rel.)
ping - 39 ping^_38> ping - 37
ping - 2 ping - 1 ping - 0
Figure 19.7
Basic set-up of echogram display. After own-ship motion compensation, non-moving targets lead to horizontal structures. Moving targets are indicated by slanted lines
demod beam data P beams Q range samples
Figure 19.8
filter subsamp
MF
beam data P beams R range samples
integrate subsamp
norm
normalised beam data
beam data P beams S range samples
Preprocessing chain for echogram data; MF = matched filter
In an echogram, reflections of the transmitted acoustic signal from objects will show up as lines. Own-ship motion will be compensated in order to obtain an image in which only moving objects are recognised as slanted lines. Echoes due to non-moving objects will show up as horizontal image structures in an echogram, Figure 19.7.
19.3.1 Image processing background We enhance echogram structures and design an automatic, echogram-based moving target detector by applying image processing algorithms. The Radon transform is the basic tool used in this chapter (see e.g. References 7 and 19). The Radon transform of an image g(x, y) integrates the image intensities along a line and is defined by: g(x,y)= ff g(x,y)8(P-xcos0-ysm0)dxdy (19.6) JJx, y The parameter p(p > 0) is the shortest distance of the origin of the coordinate system to the line, the parameter 0 (0 < 0 < n) is the angle corresponding to the angular orientation of the line. With this definition it can be verified that a single point in the image domain becomes a harmonic curve in the Radon domain. A line in the image results in a peak in the transform domain [19].
19.3.2 Echogram image enhancement In this section the enhancement of the echogram image is addressed. The non-desired structures in an echogram are horizontal and vertical lines. For an example of this see Figure P.25 (see colour signature). These horizontal and vertical lines confuse the operator and complicate the echogram usage. The aim of the image enhancement is to gain an echogram display that is easy to interpret. The corresponding processing is illustrated in Figure P. 19 (see colour signature). The test image contains horizontal, vertical and slanted lines. The objective of echogram image enhancement is to suppress the horizontal and vertical lines. The different lines can be recognised in the Radon domain. Vertical and horizontal lines correspond to 0 = 0 (180) deg and 0 = 90deg, respectively. These regions are suppressed by a weighting function. After application of the inverse Radon transform the non-desired structures are removed. One can think of a signal-to-noise ratio (SNR) as the line level to the background level. Then, an SNR increase of about 6 dB can be recognised between the input and output image.
19.3.3 Automatic echogram detection The echogram as described so far is an excellent detection and tracking display, i.e. targets with low SNR can be visually detected. In contrast to this advantage, an operator faces the problem of visually scanning and checking the display for moving targets. This task is almost impossible for a large number of beams. Therefore, there is a need for an automatic detection algorithm based on echogram data. In this chapter, an algorithm is realised using the space-time information displayed by an echogram. The automatic detector scans the echogram for slanted lines and gives an alarm to the operator and the post-processing if an image structure due to a moving target has been found. The basic image processing algorithm is shown in Figure 19.9. Data from one beam is processed at a time. The resulting image has a small width (sounding periods) and a large height (range samples). A window is moved along the image. This is called region of interest (RoI) selection. The following steps are done for each
echogram one beam L pings
RoI
Figure 19.9
norm
edge detection
radon transform
max
test statistic
Echogram detection algorithm; RoI = region of interest selection, norm = normalisation
window position, meaning that after the final algorithmic step one value is obtained for each window position. In the FM preprocessing described in Section 19.3 a normalisation is performed in range direction, in Figure 19.9 normalisation is performed mainly in the horizontal image direction (sounding periods). One can think of this step as taking the mean for several rows of the image and then dividing each pixel in this area by this mean. The step is performed using a 5 x 5 digital filter mask, which means that the mean is calculated using a few rows and columns, only. The result of this normalisation is a reduction of the level of horizontal lines. Furthermore, the normalisation allows us to set a fixed threshold in the following processing step, the edge detection. Edge detection is done using the conventional Sobel operator in the horizontal and vertical directions; the result is a binary image. This step enhances the edges in the image. Because of the normalisation, only significant non-horizontal image structures contribute to the binary output image, supplying a better discrimination between lines and background noise in the Radon transform processing block. After the Radon transform has been calculated, certain regions in the output matrix can be neglected in the maximum search, i.e. the max processing block. The regions corresponding to horizontal and vertical lines are not taken into account in this final processing step. The parameters are chosen such that targets within the velocity range of about Vmin = 0.7km/h ^0.38 kts to about Vmax = 18km/h ^9.7 kts are detected.
19.4
Experimental results
19.4.1 Sonar system description The principal system design is presented in Figure 19.2. In this section, data from various towed array sonar systems operating in different frequency bands will be analysed. The design frequencies of the systems are about 1.2 kHz, 2.6 kHz and 5.5 kHz, and are denoted by VLFACTAS (very low frequency active towed array sonar system), ACTAS (active towed array sonar system) and MFACTAS (medium frequency active towed array sonar system), respectively. This identification is chosen for clarity in this chapter. It does not necessarily refer to any product names of ATLAS ELEKTRONIK GmbH, although similar systems are available at the company. The receiving array of the ACTAS system is a twin-line array to allow for left/right suppression. The design frequency heavily determines the design and construction of the towed body and the receiving array. The different sonar systems are used for various purposes as indicated in the introduction. Pulses of FM and CW type are used in the different
systems, and the bandwidth and pulse length vary drastically for the different systems. For the MFACTAS typical CW pulses have a length of about 100 ms, whereas CW pulses of a VLFACTAS have a length of typically 4 s or 8 s. The antennas contain more sensors than are used in the analysis presented in this section, the reduced number of sensors is used to reduce the computational burden. It should be noted that not only the system wetend design (towed body, receiving array) is determined by the design frequency but also the mechanical onboard system. Onboard of the towing platform a handling system for the complete wetend and winches for the array and the tow cables have to be supplied. The dimensions of these mechanical system components depend on the design frequency. For a discussion of the system design of the German low frequency active towed array sonar (LFTAS) see Reference 18.
19.4.2
CWpulse sea data analysis
19.4.2.1 Simulation for MFACTAS In this section we first apply the proposed CW processing to simulated sonar data. Data are simulated for the MFACTAS. The receiving line array consists of 16 elements with a design frequency of 5.25 kHz. The transmit centre frequency is chosen as 5 kHz and reverberation is simulated for given range gates by superposition of a large number of independent scatterers. The scenario is illustrated in Figure 19.10 with the following parameters: pulse length = 0.1 s, M = 64 samples, L = 5 pulses. K = 500 snapshots per pulse are used for the fully adaptive processing. Two targets at about 1700 m with speeds of
Figure 19.10
Scenario of CW simulation
30kts and 70 kts, respectively, are simulated. The fast target outside the coloured reverberation is introduced for comparison. The result of the application of the fully adaptive processing is presented in Figures P.20 and R21 (see colour signature). It can be seen that a target which is covered by coloured reverberation and which cannot be detected by conventional processing is easily recognised after fully adaptive filtering. 19.4.2.2 Sea data for MFACTAS We are using a towed array with 64 sensors and the transmitter-receiver separation is about 500 m. CW signals are used at a transmit frequency of approximately 5.4 kHz and a pulse length of about 0.1 s. The platform velocity is about 5 kts, which implies a reverberation bandwidth of about 34 Hz. The parameters for application of the postDoppler algorithm are: M = 128 samples (DFT length), L = 16 pulses and K = A snapshots per pulse. An artificial moving target with low target strength appears at a distance of about 1000 m and a frequency of about 5340Hz. The target lies outside the coloured reverberation, see Figure R22 (see colour signature). The detection is improved after data filtering. Although one might argue that a similar result could be obtained with sophisticated normalisation, this would definitely not be true for high platform velocities. 19.4.2.3 Sea data for VLFACTAS We are using 32 sensors of the array for this analysis. CW signals are used at a transmit frequency of approximately 1 kHz and a pulse length of about 8 s. The platform velocity is about 14 kts, which implies a reverberation bandwidth of about 9 Hz. In Figure P.23 (see colour signature) the spectrum of a single sensor is shown. An artificial moving target with low target strength appears at a distance of about 19 km and a frequency of about 963 Hz, see Figure P.24 (see colour signature). This is a situation with a low SNR signal because it is covered by coloured reverberation, see Figure R24, left side. The estimation of the covariance matrices is performed in the vicinity of the range of interest as outlined before. It is a range adaptive estimation procedure. In detail, for the corresponding ping of interest an analysis window of length M = 512 points is moved along range. This training data is not only taken from the ping of interest but also from L = 16 other pings. Therefore, 64 snapshots are used for estimation of C*. Because of the range dependence of reverberation the stability of this estimate has to be obtained by using multiple pings. In Figure P.24, the detection is clearly enhanced after data filtering. Using conventional processing techniques, one does not obtain this result.
19.4.3 Echogram sea data analysis (A CTAS) Data analysed in this section was recorded by an active towed array sonar system (ACTAS). The transmit frequency is about 2 kHz which is well below the design frequency of the array. In Figure P.25 (see colour signature) an echogram obtained
by ACTAS is shown. Own-ship motion is compensated. A moving target can be seen at about 12 kyd in beams 8 to 16.
19.4.4 Echogram enhancement The moving target can be recognised but the heavy reverberation structures might hinder an immediate detection by the operator. The filtering algorithm based on the Radon transform enhances the display as can be seen in Figure R26 (see colour signature). One observes a level reduction of horizontal structures between 8 kyd and 20 kyd. The moving target at about 21 kyd and beams 8 to 16 is not influenced by the filtering.
19.4.5 Automatic echogram detection The output of the echogram-based detection algorithm is shown in Figures R27 and R28 (see colour signature). In Figure R27 the target structure at about 21 kyd is clearly detected, but in Figure R28, the target level in neighbouring beams is different. This is due to the fact that the target moves through the beams during the 20 sounding periods shown in the image.
References 1 ALSUP, J. and ZABAL, X.: 'Adaptive doppler processing with SWAC data'. Presented at Adaptive sensor array processing workshop, MIT Lincoln Laboratory, 1999 2 NATO SACLANT Undersea Research Center: http://solmar.saclantc.nato.int 3 GRIMMETT, D. and ZABAL, X.: 'Performance criteria for coherent interping reverberation suppression'. Technical report 2591, Naval Command, Control, and Ocean Surveillance Center, San Diego, USA, 1993 4 HOSTERMANN, H. and SCHMIDT-SCHIERHORN, H.: 'The design of the ACTAS twin-array'. Proceedings of Underwater defense technology, 1999. Also available as internal report BL 4940 T 5 of ATLAS ELEKTRONIK 5 HOSTERMANN, H., SCHMIDT-SCHIERHORN, H., and WARHONOWICZ, T.: 'Doppler-sensitive FM pulses: theory and practical results'. Proceedings of Underwater defense technology pacific, 1998, p. 113. Also available as internal report BL 4940 T 4 of ATLAS ELEKTRONIK 6 JAFFER, A.: 'Constrained partially adaptive space-time processing for clutter suppression'. Proceedings of ASILOMAR '94, Pacific Grove, CA, 1994, pp. 671-676 7 JAIN, A. K.: 'Fundamentals of digital image processing' (Prentice-Hall, Englewood Cliffs, 1989) 8 KLEMM, R.: 'Detection of slow targets by a moving active sonar', in LOURTIE, I. M. G. and MOURA, J. M. F. (Ed.): 'Acoustic signal processing for
9 10 11
12
13
14
15 16
17
18 19
20 21 22
ocean exploration (NATO Advanced Science Institutes Series, Kluwer Academic Publishers, Dordrecht, The Netherlands, 1993) KLEMM, R.:' Interrelations between matched-field processing and airborne MTI radar', IEEEJ. Ocean. Eng., 1993,18, (3), pp. 168-180 KLEMM, R.: 'Space-time adaptive processing' (The Institute of Electrical Engineers, London, 1998) MAIWALD, D., BENEN, S., HOSTERMANN, H., and SCHMIDTSCHIERHORN, H.: 'Space-time adaptive processing for active towed array sonar systems'. Proceedings of Underwater defense technology Europe, Hamburg, Germany, 2001. Also available as internal report BL 4940 T 8 of ATLAS ELEKTRONIK MAIWALD, D., BENEN, S., and SCHMIDT-SCHIERHORN, H.: 'Space-time processing for surface ship towed active sonar'. Proceedings of BAE Signal & data processing conference, Dunchurch Park Conference Centre, UK, 2002, pp. 9-30 MESSERSCHMITT, T. R. and GRAMANN, R. A.: 'Evaluation of the dominant mode rejection beamformer using reduced integration times', IEEE J. Ocean. Eng., April 1997, 22, (2) MIO, K., DOISY, Y., and CHOCHEYRAS, Y: 'Adaptive beamformmg for low frequency sonar on comb waveforms'. Proceedings of Underwater defense technology Europe, Hamburg, Germany, 2001 PARSONS, N.: 'Adaptive reverberation suppression'. Proceedings of Underwater defense technology Europe, La Spezia, Italy, 2002 SCHMIDT-SCHIERHORN, H. and HOSTERMANN, H.: 'The influence of the system bandwidth of ASW sonar systems on detection performance with respect to transmission loss'. Proceedings of Underwater defense technology Europe 97, Hamburg, Germany, 1997, p. 360. Also available as internal report BL 4940 T 1 of ATLAS ELEKTRONIK SCHMIDT-SCHIERHORN, H. and WARHONOWICZ, T.: 'Towed twin line array: design and results of sea trials'. Proceedings of Underwater defense technology Pacific, 1998, p. 391. Also available as internal report BL 4940 T 2 of ATLAS ELECTRONIK SCHOLZ, B.: 'LFTASS-the German solution'. Proceedings of Underwater defense technology Europe, Hamburg, Germany, 2001 TOFT, P.: 'The Radon transform: theory and implementation'. PhD thesis, Department of Mathematical Modelling, Technical University of Denmark, Lyngby, 1996 VAN TREES, H. L.: 'Optimum array processing' (John Wiley & Sons, New York, 2002) WARD, J.:' Space-time adaptive processing for airborne radar'. Technical report 1015, MIT Lincoln Laboratory, Lexington, Massachusetts, 1994 WARHONOWICZ, T., SCHMIDT-SCHIERHORN, H., and HOSTERMANN, H.: 'Port/starboard discrimination performance by a twin line array for an LFAS sonar system'. Proceedings of Underwater defense technology Europe, 1999, p. 398. Also available as internal report BL 4940 T 6 of ATLAS ELEKTRONIK
23 WARHONOWICZ, T., SCHMIDT-SCHIERHORN, H., and HOSTERMANN, H.: 'Detection in high clutter areas with an LFAS sonar system'. Proceedings of Underwater defense technology Pacific, 2000, p. 30. Also available as internal report BL 4940 T 7 of ATLAS ELEKTRONIK 24 WARHONOWICZ, T., SCHMIDT-SCHIERHORN, H., HOSTERMANN, H., and PAPZINER, U.: 'The use of large time-bandwidth products in LFASsystems'. Proceedings of Underwater defense technology, 1998, p. 4. Also available as internal report BL 4940 T 3 of ATLAS ELEKTRONIK
Chapter 20
EM and SAGE algorithms for towed array data Pei-Jung Chung and Johann F. Bohme
20.1
Introduction
The goal of array signal processing is to extract information conveyed by propagating waves. A group of sensors located in distinct positions is deployed to measure the propagating wavefield. Due to phase differences among sensors, we can determine characteristics of propagating waves such as the direction of propagation and propagation velocity of signals. These quantities are of fundamental importance in a great diversity of applications, such as sonar, radar, geophysics and wireless communication. Among all existing estimation methods, the maximum likelihood (ML) approach is known to have the best statistical performance, low SNR threshold and robustness against small sample numbers and coherent source signals. Furthermore, based on the asymptotic properties of Fourier transformed data, the ML approach provides a natural way to combine information from different frequencies [4]. Another advantage of this approach is that we can easily establish a detection procedure based on likelihood ratio after getting ML estimates. The ML algorithm typically requires a multi-dimensional search to find the estimates. The high computational cost associated with this procedure is often seen as a main drawback of the ML method. To overcome this difficulty, efficient algorithms for finding ML estimates will be studied intensively in this chapter. In particular, we will concentrate on the EM (expectation and maximisation) [13] and the SAGE (space alternating generalised EM) algorithms [15]. The EM algorithm is a very general and popular iterative algorithm in statistics for locating modes of likelihood functions. It has the advantages of simple implementation and stability. The basic idea behind EM is that, rather than performing a complicated maximisation, one works with augmented data that simplifies the calculation and performs a series of simple maximisations. Application of the EM algorithm
to the direction finding problem has simplified the multi-dimensional search to a one-dimensional search. But the slow convergence associated with EM might limit its usefulness in practice. The SAGE algorithm, preserving the virtues of the EM algorithm, can improve the convergence rates in many cases. Comparative analysis shows that under certain conditions the SAGE algorithm converges faster than the EM algorithm. Based on the component-wise convergence of the EM and SAGE algorithms, an efficient implementation of EM and SAGE is also developed. In the maximisation step, a search space in the neighbourhood of the maximum point rather than the search space given by the original problem is used. An adaptive procedure is suggested to determine the search space at each iteration. This approach is not only suitable for array processing but also applicable to other problems in which a search procedure is needed. Motivated by the simple structure of the augmented data, two recursive procedures derived from the EM and SAGE algorithms are also investigated. In contrast to iterative methods, the recursive EM and SAGE algorithms give a quick update on estimates when new data enters. Under mild conditions, the estimates generated by recursive EM and SAGE algorithms are proved to be strongly consistent and asymptotically normally distributed. A clear advantage of this approach is that recursive EM and SAGE algorithms lead to a very fast and simple implementation of the ML method. The most complicated computation in each recursion is the inversion of the augmented information matrix (see equations (20.24) and (20.34)). Through data augmentation, this matrix is diagonal and easy to invert. More importantly, there is no search in such recursive procedures. Thus recursive EM and SAGE algorithms require much less time to find the ML estimates. This feature greatly increases the potential of the ML approach in space-time adaptive processing (STAP). The potential of these algorithms is demonstrated by experimental results. The sonar data were measured by a towed array of hydrophone sensors in Bornholm Deep, Baltic Sea. The array outputs are broadband and have low signal-to-noise ratio (SNR). We will compare the estimation performance, convergence rate and computational complexity of the above mentioned approaches.
20.2
Signal model
Consider a line array of N sensors receiving propagating waves generated by M farfield sources. The array output x(t), (t = 1 , . . . , T) is divided into K snapshots of length T'. The observation of each data stretch is short-time Fourier transformed. Asymptotically, for large T\ the Fourier transformed data of the &th snapshot can be expressed as: xk(co) = U(oj)sk(co) + uk(oj)
(20.1)
U(oj) = [d(o)90i),...,
(20.2)
d(<w, OM)]
where H(co) e CNxM,
sk(co) = [Sk(co)9...,
SkM(co)]T e C M x l and uk(co) e CNxX
denote the steering matrix, signal waveform and sensor noise, respectively. The steering vector associated with the rath source is determined by the direction of arrival 0m, the sensor position rn and the wave propagation velocity v: d(cO,0m) = [e-J^Omri/v^^^-jasmenrN/vf
(203)
In our study, we assume: (1) (2)
The number of sources, M, is known. Methods for estimating M can be found in References 20, 28 and 29 The signal waveforms are considered as unknown and deterministic.
From the asymptotic theory of Fourier transformation [5] we know that \k(coj), (k = l9...9K9j = 1 , . . . , J) are independent, complex normal distributed with mean H(coj)sk(coj) and covariance matrix Cu(a>j). We assume C11Oy) to be v(a>j)I where V(COJ) is the unknown sensor noise and I is the identity matrix. The problem is to estimate the DoAs O = [ 0 I , . . . , # M ] from the observation x = {x* (
..,KJ=I,...,J.
Based on the asymptotic distribution of the data in frequency domain, we obtain the wideband log likelihood function:
(20.4) j
As is well known [3], for an unknown and fixed O9 the maxima of log /(x|#) with respect to sk(a>j) and v(co) can be expressed explicitly as: (20.5) (20.6)
where H(COJ, Of is a left generalised inverse of H(<w/, 0) [25], P-1 (COJ, 0) = I - P ( G > / , 0) is the orthogonal complement of the projection matrix 1P(COj9 0) = H(&>y, O)YL(COj9 0) and Cx(COj) = (1/K) J2k=\ *k(coj)xk(coj)H is the sample second moment matrix. Substituting sk(coj) and V(COJ) into log/(x|#), one obtains the concentrated log likelihood: (20.7)
Clearly, equation (20.7) is highly non-linear in 0. Maximisation of L(O) often requires a complicated search in a multi-dimensional space. In the following we
will develop iterative and recursive methods to simplify this search procedure and increase computational efficiency.
20.3
EM and SAGE algorithms
The expectation-maximisation (EM) algorithm [13] is a very general and popular iterative algorithm in statistics for finding ML estimates. Because of its simplicity and stability, the EM algorithm has attracted lots of attention since its first appearance. However, the slow convergence associated with EM may limit its application in practice. Various attempts have been made to speed up EM. One such effort is the space alternating generalised EM (SAGE) algorithm [15]. In contrast to Newton-Raphsontype or conjugate gradient methods [17,19,22] which may not converge properly, SAGE has faster convergence and still preserves the virtues of stable convergence and simple implementation. The EM and SAGE algorithms share one common approach: rather than performing a complicated maximisation of the observed data log likelihood, one augments the observations with imputed values that simplify the maximisations and applies the augmented data to estimate the unknown parameters. Because the input data are unknown, they are estimated from the observed data. This procedure continues to iterate between the E- and M-steps until no changes occur in the parameter estimates. Application of EM to the array processing problem was studied in References 2, 14, 18 and 24 under various assumptions on signals and noise. The SAGE algorithm was addressed under the framework of DoA estimation [6]. In References 8 and 9 SAGE was proved to have a faster convergence than EM if certain conditions are satisfied. As EM and SAGE are only guaranteed to converge to local maxima, a good initial estimate is important for both algorithms to converge to the global maximum. In the following subsections, we will give a brief description of the EM and SAGE algorithms and present their application to source localisation.
20.3.1 EM algorithm The original motivation of EM arises from the incomplete data problem [13]. Rather than using ad hoc input data, EM estimates the missing values by their conditional expectation which is optimal in the minimum mean square sense. The statistical inference is based on the augmented data which includes the observed and predicted values. The advantage of such an approach is that maximisation of the augmented data likelihood leads to a monotone increase in the observed data likelihood. Mathematically the EM algorithm can be described as follows. Let x and y denote the observed and augmented data, respectively. The corresponding density functions are denoted by /(x|#) and /(y|#). The augmented data y is specified so that M (y) = x is a many-to-one mapping. Starting from an initial guess ^°\ each
iteration of the EM algorithm consists of an expectation (E) step and a maximisation (M) step. At the (/ -f l)st iteration, (/ = 0,1,...), the E-step evaluates the conditional expectation of the augmented data log likelihood log /(y|#) given the observed data x and the ith iterate # = # [/] : (20.8) For notational simplicity, y is also used to denote a random vector in expressions like equation (20.8). The M-step determines #^+1^ by maximising the expected augmented data log likelihood: (20.9) Very often the M-step is much easier than implementing the observed data maximum likelihood procedure. Also, a simple proof based on Jensen's inequality [25] shows that the observed data likelihood increases monotonically or never decreases with iterations [13]. As with most optimisation techniques, EM is not guaranteed to always converge to a unique global maximum. In well behaved problems log /(x|#) is unimodal and concave over the entire parameter space, then EM converges to the maximum likelihood estimate from any starting value [13,30]. For the direction finding problem, we construct the augmented data yk((oj) by decomposing xk(coj) virtually into its signal and noise parts:
(20.10)
where yk(coj), (k = l,...,K,j with mean:
= 1,...,/) is independent, normally distributed
(20.11)
and covariance matrix:
(20.12)
The noise spectral parameters vm(coj) = cmv(ooj), cm e (1,0), (m = 1 , . . . , M) are positive and satisfy the constraint Ylm=\ vm{^j) = V(CDJ). For convenience, we only consider the case cm = 1 /M.
The statistical properties of yk(a)j) yield the augmented data log likelihood:
(20.13) From equation (20.13) it is easy to see that through data augmentation the log likelihood log / ( y m |# m ) and the corresponding parameter vector $m = [0m,sm,vm], (m = 1 , . . . , M) are decoupled from each other. Note that vm = [vm (a>\),..., vm (coj)]
and sm = [Sjn(Q)I)9..., Sjn(Q)J)9... 9S*(m),-..9S*i((oj)]. Maximisation of log /(y|#) can be performed over distinct parameter sets &m in parallel. This leads to a very simple implementation of the M-step. Evaluation of the conditional augmented data log likelihood is mainly based on the properties of normal distribution [I]. For details the reader is referred to References 14 and 18. With the estimated augmented data, log / ( y m | # m ) represents the virtual observation when only one source is present. Similarly to the observed data log likelihood log /(x|#) (20.4), log / ( y m l#™) can be also concentrated with respect to signal and noise parameters by substituting their ML estimates at an unknown fixed 0m into equation (20.13). The projection matrix P-1O^/, 0) in equation (20.7) is replaced by the orthogonal complement of P m (COJ, 0m) which is the projection matrix into the space spanned by d(a)j, 0m). In summary, given the initial estimate # t0] , the EM algorithm iterates between the E- and M-steps until #M reaches a stationary point #*. Convergence of the algorithm occurs when the data likelihood or #M does not change much with increasing iterations. To avoid notational ambiguity, we recall that the observation in the frequency domain is denoted by x. The (i + l)st iteration proceeds as follows. E-step: calculate for m =
M-step: update # m for m =
(20.14)
In contrast to the multi-dimensional search involved in optimising L(O), the M-step requires only a one-dimensional search. The augmented likelihood to be maximised in equation (20.14) is a geometric mean of beamformer outputs over different frequencies. The price for a simple implementation is the iterations required by the EM algorithm. Thus the rate of convergence is crucial to efficiency of the algorithm.
20.3.2
SA GE algorithm
The space alternating generalised EM (SAGE) algorithm [15] generalises the idea of data augmentation to simplify computations of the EM algorithm. Instead of estimating all parameters at once, SAGE breaks up the problem into several smaller problems by conditioning sequentially on a subset of the parameters and then applies EM to each reduced problem. Because each of the reduced problems considers the likelihood as a function of a different subset of parameters, it is natural to use a different augmentation scheme for each of the corresponding EM algorithms [15,23]. In some settings, this attempt to achieve a more efficient implementation turns out to be very useful for speeding up the algorithm. To present SAGE in its most general form, we will use the following indexing system developed in Reference 23. We define a cycle consisting of an E-step followed by a M-step which will be followed immediately by a new E-step which is the beginning of the next cycle. An iteration consists of one or more cycles. Unlike the EM algorithm, each iteration of SAGE consists of several cycles. The parameter subset associated with the cth cycle r\c is updated by maximising the conditional expectation of log likelihood log f(zc\rjc) of the augmented data zc. The data augmentation schemes are allowed to vary between cycles. Within one iteration, every element of the parameter vector rj must be updated at least once. Let X]c be the vector containing all parameters of r] except the elements of r\c. Then r) — (r)c, r)c) is a partition of the parameter set at the cth cycle. The estimate at the cth cycle, ith iteration is represented by ( - ) ^ . The output of the last cycle of the
/th iteration is used as the input of the (i + l)st iteration: ,[«•+1,0]
=
,,[.-,c]
(20
15)
Starting from the initial estimate rj®'®\ the (/ + l)st iteration of the SAGE algorithm proceeds as follows. Fore= 1,..., C E-step: compute: (20.16) with respect to rjc: (20.17) (20.18) Similarly to EM, it can be shown that any sequence generated by the above procedure increases (or maintains) log /(x|#) at every cycle [15,23]. A natural choice of the augmented data zc for the current problem is to consider one source at each cycle. Formally we can express this augmentation scheme as: ZC(CO) = d(£O, 0c)Sc(CO) + U(CO)
(20.19)
Compared to the augmentation scheme specified in equation (20.10), zc(co) is more noisy since the whole noise component is fully incorporated in every cycle. The parameter vector associated with the cth cycle is given by i)c — (0c, s c , v) where V=
[v((O\)9...9v((Oj)].
Based on the augmentation scheme of equation (20.19), the E- and M-steps can be derived by a similar technique to that used in the EM algorithm [7]. Let CZc (co) = (I/K) Yl=\ ^(co)z^(co)H. The (i + l)st iteration of the SAGE algorithm consists of the following steps. Initialise rj: (20.20)
M-step: update r\c:
(20.21)
Similar to EM presented previously, the augmented log likelihood function to be maximised in equation (20.21) is a geometric mean of beamformer outputs over frequencies. Clearly, optimisation of equation (20.21) requires only a one-dimensional search. It can be observed that each iteration of EM or SAGE requires almost the same computational complexity. The total computational time is determined by the number of iterations required, i.e. by the convergence rate. The convergence analysis in Reference 9 has shown that the SAGE algorithm converges faster than the EM algorithm if certain conditions are observed and augmented information matrices are satisfied.
20.4
Fast EM and SAGE algorithms
In the previous section we have shown that the EM and SAGE algorithms can greatly simplify the complicated multi-dimensional search for finding ML estimates. However, for low convergence rates, the number of iterations to reach a maximum point of the likelihood function is large and the overall computational time is still long. To solve this problem, an efficient implementation of the EM and SAGE algorithms has been proposed in References 7 and 12. The basic idea is as follows: rather than using the same search space given by the original problem at each iteration, one should concentrate on the neighbourhood of the maximum. Thus one can avoid unnecessary computations spent in the intervals where the iterates are impossible to find. More specifically, let A^] = |0# - O1JT11I, a£ ] = 0# - A^] and b{$ = 0% + AJH , (m = 1 , . . . , M). Based on the componentwise convergence of EM and SAGE, we derive the following relation for large /: <$ - A « < (%™ < 6% + A £ ,
m = 1,..., M
(20.22)
The above inequality implies that ^l+l^ lies in an M-dimensional hyperrectangle [a[/],b[l"]] = [af.bf] x . • • x [a[j},b$] x • • • x [a%bl^\ whose boundaries a[/] = [a^\ . . . , a^] and b [l] = [b^\ . . . , Z?^] are determined by estimates from the last two iterations. If a search procedure is needed in the M-step, it is reasonable to confine the search space of the (/ + l)st iteration, @^+i]? ^0 [ a ^ , b ^ ] which is usually considerably smaller than the original search space defined by 0 = [ a ^ , b ^ ] . Motivated by this observation we suggest a modified version of the EM and SAGE algorithms in which the search space is varied with increasing iterations. At the beginning of the /th iteration, the search space is determined by the following procedure.
(1)
calculate:
(2)
determine
(20.23)
In the first three iterations, the search space 0 [ ^ is equal to the original search space 0 . For 1 < / < 3 it is not possible to determine 0 ^ . For / > 4, the search space is calculated from the previous estimates if componentwise convergence happens. It is controlled by comparing the distances between subsequent estimates | A^ | and I A^~ 1] |. For small or moderate i, the componentwise convergence rate could be still larger than one. The constant c > 1 in equation (20.23) is used to take this effect into account. Note that c should be chosen so that the resulting 0 ^ is at most as large as 0 . The adaptive procedure to determine search spaces can be easily built into the EM and SAGE algorithms discussed previously.
20.5
Recursive EM and SAGE algorithms
The EM and SAGE algorithms investigated previously are iterative methods for locating modes of a likelihood function. If very large data sets are involved, numerical procedures can become very expensive. To overcome this problem, we propose two
alternative procedures derived from the EM and SAGE algorithms in which the data are run through sequentially. Under proper conditions, these procedures lead to strong consistency and asymptotic normality. As pointed out in References 7 and 27, the recursive EM and SAGE can be seen as stochastic approximation procedures with specialised gain matrices. By proper formulation, it was shown that they are recursive approximations to EM and SAGE. The practical advantage of such procedures is that, upon the arrival of a new data, a quick update of the estimate is made, there is no need to wait for a long time to collect the whole data set. This feature is particularly important for STAP, as noted before. Based on recursive EM, algorithms for recursive DoA estimation were proposed in References 10, 16 and 20. Major differences between our algorithms and previous works include the following: 1 Different data augmentation schemes. 2 The spectral parameters are updated by observed data rather than by augmented data to obtain better convergence rates and more stability. 3 The step sizes 6& used in References 20 and 16 are limited to ak~x. In the current work we have a more flexible choice ak~~a, and in most cases, a good choice of €u is essential to obtain satisfactory convergence rates. With proper modifications, the proposed algorithms can also be used in tracking moving sources [H]. Unlike subspace tracking methods [31], the recursive procedures based on the ML approach are not only applicable in the narrowband case but also in the wideband case. 20.5.1
Recursive EM algorithm
Suppose Xi,X2,... are independent observations, each with underlying probability density function (PDF) /(x|#), where # denotes an unknown parameter vector. The corresponding augmented data yi,y2> • •. are characterised by the PDF / ( y | # ) . Let (•)* denote the estimate after k observations. The following procedure is aimed at finding the extremum #* of log/(x|#) which would coincide with the maximum likelihood estimate: 0*+1 = tf* + ak-alEM(#kr]
Y(*k, #*)
(20.24)
where a > 0 is a constant and: Kx*, 0*) = V* log /(x*|#)|* = 0* JEM(#*)
= E[-V#VJ log /(y|#)|x*, &]\t=dk
(20.25) (20.26)
represent the gradient vector and the augmented information matrix calculated at ftk, respectively. V# is a column gradient operator with respect to #. A proper choice of a depends on the following matrix: DEM W = \l - OXEM W - 1 ^ W
(20.27)
where IW
= E[-V*V!" log /(xl,?)]
(20.28)
denotes the Fisher information matrix corresponding to one observation. I is an identity matrix with corresponding dimension. Use a = 1 if DEM (^) is a stable matrix. Otherwise 1/2 < a < L A matrix is called stable if all eigenvalues have negative real parts. If the initial estimate ft0 is close enough to ft*, under mild conditions ftk converges with probability one to ft* [7,10]. For large k, the normalised error ka^2(ftk — ft*) is normally distributed with zero mean and a covariance matrix which can be obtained by solving a linear matrix equation. As pointed out by Titterington [27], recursion (20.24) will not lead to asymptotic efficiency. Asymptotic efficiency implies at the same time the best convergence rate achievable by such procedures. However, compared with an optimal procedure: (20.29) the practical advantage of recursion (20.24) is that X E M ( ^ ) " 1 will usually be much easier to compute than X(ftk)~l. Based on the statistical property of the array outputs and the augmentation scheme specified in subsection 20.3.1, we shall develop the recursive EM algorithm for DoA estimation. The array output of the kth snapshot corresponds to the kth observation. The log likelihood has the same form as equation (20.4) except we consider only the kth summand. According to the recursive procedure (20.24), all unknown parameters should be updated simultaneously. However, to avoid complicated JEM (^)> recursive EM will only be applied to the DoA parameters 9. This approach is practical for wideband signals, since inversion of the augmented information matrix could become quite involved if all signal and noise parameters are considered. The signal and noise parameters are updated by computing their ML estimates once 9k is available. In the following, the gradient vector y(x£, ftk) and the augmented information matrix ^EM(ftk) are calculated with respect to 0. Let ftk = [6k, S*, v*] where 0k = [0k,..., k
k
0kM], sk = [sk(coi)T,...,
sk((Oj)T]
and
k
v = [v (co\),..., v (ci)j)]. Define the first and second derivative of the steering vector d(o>y, 0m) as d'(-) and d (•), respectively. Differentiation of the log likelihood of the kth snapshot with respect to 9m yields the mth element of y(x£, ftk):
(20.30)
The augmented information matrix ZEM(#) is calculated by equation (20.26). As the signal sources are decoupled through data augmentation, 2 E M ( # ^ ) is a diagonal
matrix with the rath diagonal element: ./
(20.31) Once the new estimate 0k+l is available, the signal and noise parameters are obtained by computing the ML estimates: (20.32) (20.33)
With the above results, the &th step of the recursive EM algorithm can be summarised as follows: 1
Calculate gradient vector and augmented information matrix using equations (20.30) and (20.31). 2 Update DoA parameters using equation (20.24). 3 Update signal and noise parameters using equations (20.32) and (20.33).
20.5.2 Recursive SA GE algorithm Similarly to the recursive EM algorithm, the recursive SAGE algorithm is a stochastic approximation procedure with a gain matrix derived from the augmented data. For mathematical simplicity, we only consider the case ft = (ft\,..., ftc) in which the parameter subsets are disjoint. Let zc denote the augmented data of the cth cycle with density function f(zc\ftc). To find the maximising point ft* of log f(x\ft) we suggest the following recursive procedure: #k+i
=
^
k l
+ ak-<xIsAGE(ft
r Y(xk,
#*)
(20.34)
where JsAGE ( ^ ) is a block diagonal matrix with the cth block: 4AGE(^)
= E [" v *cV£ log f(zc\ftc)\xk,
ft]\^k
(20.35)
Use a = 1 if: DSAGEW = - I - a J S A G E W " 1 IW
(20.36)
is a stable matrix and otherwise 1/2 < a < 1. The convergence results of recursive EM hold also for recursive SAGE. Namely, if the initial estimateft®is close enough to ft*9 ftk converges with probability one to ft* under mild conditions [10]. The normalised error ka/2(ftk —ft*)is asymptotically normally distributed with zero mean and a covariance matrix which can be obtained by solving a linear matrix equation. To obtain a recursion for DoA estimation, we need to calculate JsAGE ( ^ ) - Note that as in the case of recursive EM, recursion (20.34) is only applied to update 0.
Thus the resulting gain matrix is also diagonal. Using the data augmentation scheme specified by equations (20.19) and (20.35) we obtain the rath diagonal element as follows: J
(20.37)
Using the gradient vector of equation (20.30) and update formulas for signal and noise parameters, the kth. step of the recursive SAGE algorithm proceeds analogously to the recursive EM algorithm except for replacing J E M ( # ^ ) with JsAGE ( ^ ) in the recursion:
0*+1 = 0k + ak-a[IsAGE(#k)rlY(xk, 0*) One can observe that the gain matrices of (20.31) and (20.37) have similar structures except that the coefficients of the second term at the right-hand side are different. Thus they have almost the same computational complexity. Since the first term of equations (20.31) and (20.37) can be neglected for 0 near the true parameters 0*, it is not surprising that both algorithms will have the same convergence rates if the step size of the recursive EM algorithm is chosen to be M times that of the recursive SAGE algorithm. More details about the convergence properties can be found in Reference 7.
20.6
Experimental results
In this section we present experimental results obtained by processing sonar data. Since this work is mainly concerned with DoA estimation methods, the number of sources is assumed to be constant and known. Better results can be expected if additional detection procedures are used. The data was measured in the Bornholm Deep, east of Bornholm Island (Baltic Sea). A uniformly linear array of 15 hydrophone sensors with interelement spacing 2.56 m was towed by the ship Walter von Ledebur. The experiment was conducted during the cruise between October 3rd and 13th, 1983 by Atlas Elektronik, Bremen, Germany. A moving broadband source was given by the cooperating ship Hans Biirkner. Sound waves from two other ships can be also identified. Previous analysis [20] showed that five sources were most frequently detected by F-tests or information theoretic criteria. Thus the number of sources is assumed to be M — 5. Among the detected signals, two phantom sources at 22° and 40° were caused by the multipath or mode propagation of waves generated by the towing ship in shallow waters and endfire position 6 = 0° [21]. The wave propagation velocity was considered as constant with the value v = 1500m/s. The array outputs were sampled at the sampling frequency fs = 1024Hz after lowpass filtering with cut-off frequency fc = 256Hz. A record of 10 minutes was used for processing, divided into 150 data stretches of 4 s duration. Within each data
stretch, the sensor outputs were considered as stationary. The T = 4096 samples of each data stretch were further divided into K = 16 snapshots, and the T' = 256 samples in each snapshot were Fourier transformed with Thomson's multitapering technique [26]. The Fourier transformed data was characterised by low SNRs. The frequency bins which contained significant energy were selected in the processing. The number of selected frequency bins lies between 13 and 39.
20.6.1 EM and SA GE algorithms EM and SAGE algorithms with and without adapted search spaces were applied to the data set. The initial estimate of each data stretch is obtained by a genetic algorithm. The estimation results obtained by the EM and SAGE algorithms are almost identical to those obtained by the fast EM and fast SAGE algorithms. Figures 20.1 and 20.2 show that EM and SAGE provide similar DoA estimates. The number of iterations to reach a maximum point needed by the EM and SAGE algorithms is depicted in Figure 20.3. The maximum number of iterations is set to be 35. In general, EM requires more iterations than SAGE, but at some places, for example, for the 84th data stretch, the SAGE algorithm requires more iterations than the EM algorithm. This is a strong support for the convergence analysis presented in References 7 and 9, namely, SAGE converges faster than EM if the observed and augmented data information matrices satisfy certain conditions. Under certain circumstances, these conditions may not be satisfied. To measure the computational cost of the fast EM and SAGE algorithms, we define the length of the search space ® [ ' ] by / [/] = YIm=I \bm ~ <*m\ a n d t h e t o t a l length of / = X ^ T ^ - ^iter is the number of iterations when the algorithm is
DoA, degree
Figure 20.1
DoA estimates obtained by the EM algorithm (©IEEE 2002)
time
DoA, degree
DoA estimates obtained by the SAGE algorithm (©IEEE 2002) number of iterations
Figure 20.2
number of iterations
time
time
Figure 20.3
Total number of iterations (©IEEE 2002) a EM algorithm b SAGE algorithm
terminated. The computational cost of the ith iteration is approximately proportional to fi\ thus the total computational cost is proportional to /. Because EM and SAGE converge to local maxima, we use a search space with fixed length [#[0] — 12°, O^ + 12°] at each iteration. The search spaces of the fast EM and SAGE algorithms are
^EM^fastEM /sAGE^fastSAGE
time
Figure 20.4
(©IEEE 2002) a ratio between /EM and /fastEM b ratio between /SAGE and /fastSAGE
determined by this constraint and the adaptive procedure presented in Section 20.4 w i t h e = 12. The computational efficiency achieved by fast EM and SAGE, measured by the ratio between /EM and /fastEM and that between /SAGE and /fastSAGE? respectively, is shown in Figure 20.4. This ratio is between 400 and 2000 which implies the computational time can be reduced by up to a factor of 2000. The computational efficiency is improved significantly by the fast EM and SAGE algorithms.
20.6.2 Recursive EM and SA GE algorithms The recursive EM and SAGE algorithms presented previously were applied to each time stretch with the step size e^M = 50/:~ 09 and 6 ^ S A G E = 10Ar0-9, respectively. The initial estimate for each time stretch is the final estimate of the previous time stretch. The first time stretch has the initial estimate <90 = [22° 41° 84° 95° 153°] which is obtained by the genetic algorithm. The final estimates of each time stretch are displayed in Figures 20.5 and 20.6. Both algorithms have similar results since the step sizes are selected so that efEM = 5€^ SAGE . From the first to the 40th time stretch the third, fourth and fifth sources which correspond to the true ships can be tracked with accuracy comparable to that of the iterative methods discussed previously. As the third and the fourth sources become very close in the interval between the 40th and the 60th time stretch, the third and fourth tracks are about 2.8° apart from each other. This separation is necessary for obtaining reasonable convergence. For initial estimates closer than 2° the two tracks
DoA, degree
Figure 20.5
DoA estimates obtained by the recursive EM algorithm
DoA, degree
Figure 20.6
DoA estimates obtained by the recursive SAGE algorithm
will coincide and can never be separated again. To find a remedy, the parameters of a moving source model have been included in the recursive EM and SAGE algorithms in Reference 11. The estimates between the 60th and 120th time stretch are comparable to those presented in Figures 20.1 and 20.2 except for the smoothing effect caused
by the recursions. The separation between the third and the fourth sources is slightly narrower. It can be observed that the rapid changes in the second track in Figures 20.1 and 20.2 in the interval between 130th and 140th time stretch can also be followed by recursive EM and SAGE. In general, the estimates are quite accurate.
20.7
Conclusions
This work deals with computationally efficient algorithms for estimating array signal parameters. In particular, the well known EM and SAGE algorithms and novel recursive versions have been studied intensively. Using data augmentation, the complicated multi-dimensional search involved in maximising likelihood functions has been simplified considerably. As SAGE has a more flexible augmentation scheme than EM, it often has a faster convergence speed. The fast version of EM and SAGE uses an adaptive procedure in the M-step to reduce parameter search spaces without affecting the estimation performance. The recursive EM and SAGE are stochastic approximation procedures with specialised gain matrices derived from the augmented information matrix. Because of the simple structure of the augmented data, the recursive EM and SAGE algorithms can easily be implemented in a STAP framework. Experimental results from sonar data have shown that the EM and SAGE algorithms provide similar estimates. The SAGE algorithm usually requires fewer iterations to reach the stationary point than does the EM algorithm. However, in some cases the SAGE algorithm has a slower convergence than the EM algorithm. This fact implies that the conditions for a faster convergent SAGE algorithm are not satisfied. Furthermore, computational efficiency was improved significantly by the fast EM and SAGE algorithms. Applying the recursive EM and SAGE algorithms to the same batch of data, comparable estimates were obtained with much reduced computational complexity.
References 1 ANDERSON, T. W.: 'An introduction to multivariate statistical analysis' (Wiley, New York, 1984) 2 BOHME, J. F. and KRAUS, D.: 'Parametric methods for source location estimation'. Proceedings of the 9th IFAC/IFORS symposium on Identification and system parameter estimation, Budapest, 1991, pp. 1379-1384 3 BOHME, J. F.: 'Array processing' in HAYKIN, S. (Ed.): 'Advances in spectrum analysis and array processing' (Prentice Hall, Englewood Cliffs N.J., 1991) pp. 1-63 4 BOHME, J. F.: 'Statistical array signal processing of measured sonar and sesimic data'. Proceedings of SPIE 2563 Advanced signal processing algorithms, San Diego, July 1995, pp. 2-20 5 BRILLINGER, D. R.: 'Time series: data analysis and theory' (Holden-Day, San Francisco, 1981)
6 NAIL CADALLI and ORHAN ARIKAN: 'Wideband maximum likelihood direction finding and signal parameter estimation by using tree-structured EM algorithm', IEEE Trans. Signal Process., January 1999, 47, (1), pp. 201-206 7 PEI JUNG CHUNG: 'Fast algorithms for parameter estimation of sensor array signals'. Dr.-Ing. dissertation, Faculty of Electrical Engineering and Information Sciences, Ruhr-Universitat Bochum, Bochumer Unversitatsverlag Bochum, 2002 8 PEI JUNG CHUNG and BOHME, J. F.: 'Comparative convergence analysis of EM and SAGE algorithms in DOA estimation'. Proceedings of IEEE international conference on Acoustics, speech, and signal processing, Salt Lake City, USA, May 7-11, 2001, pp. 2993-2996 9 PEI JUNG CHUNG and BOHME, J. F.: 'Comparative convergence analysis of EM and SAGE algorithms in DOA estimation', IEEE Trans. Signal Process., December 2001, 49, (12), pp. 2940-2949 10 PEI JUNG CHUNG and BOHME, J. F.: 'Recursive EM and SAGE algorithms'. Proceedings of IEEE workshop on Statistical signal processing, Singapore, August 6-8, 2001, pp. 540-542 11 PEI JUNG CHUNG and BOHME, J. F.: 'DOA estimation of multiple moving sources using recursive EM algorithms'. Proceedings of Sensor array and multichannel signal processing workshop, Washington DC, USA, August 4-6, 2002, pp. 323-326 12 PEI JUNG CHUNG and BOHME, J. F.: 'DOA estimation using fast EM and SAGE algorithms', Signal Process., November 2002, 82, (11), pp. 1753-1762 13 DEMPSTER, A. R, LAIRD, N., and RUBIN, D. B.: 'Maximum likelihood from incomplete data via the EM algorithm', Journal of the Royal Statistical Society, 1977,B39,pp. 1-38 14 FEDER, M. and WEINSTEIN, E.: 'Parameter estimation of superimposed signals using the EM algorithm', IEEE Trans. Acoust. Speech Signal Process., April 1988, 36, (4), pp. 477-489 15 JEFFREY, A. FESSLER and ALFRED O. HERO: 'Space-alternating generalized expectation-maximization algorithm', IEEE Trans. Signal Process., October 1994, 42, (10), pp. 2664-2677 16 LIRON FRENKEL and MEIR FEDER: 'Recursive expectation-maximization (EM) algorithms for time-varying parameters with applications to multiple target tracking', IEEE Trans. Signal Process., February 1999, 47, (2), pp. 306-320 17 JAMSHIDIAN, M. and JENNRICH, R. L: 'Conjugate gradient acceleration of the EM algorithm', J. Am. Stat. Assoc, 1993, 88, (421), pp. 221-228 18 KRAUS, D.: 'Approximative maximum-likelihood-Schatzung und verwandte Verfahren zur Ortung und Signalschatzung mit Sensorgruppen'. Dr.-Ing. dissertation, Faculty of Electrical Engineering, Ruhr-Universitat Bochum, Shaker Verlag, Aachen, 1993 19 THOMAS A. LOUIS: 'Finding the observed information matrix using the EM algorithm', Journal of the Royal Statistical Society B, 1982, 44, (2), pp. 226-233 20 MAIWALD, D.: 'Breitbandverfahren zur Signalentdeckung und -ortung mit Sensorgruppen in Seismik- und Sonaranwendungen'. Dr.-Ing. dissertation,
21
22 23
24
25 26 27 28 29
30 31
Faculty of Electrical Engineering, Ruhr-Universitat Bochum, Shaker Verlag, Aachen, 1995 CHRISTOPH MECKLENBRAUKER: 'Parameterschatzung und Hypothesentests fur akustische Wellenfelder unter Beriicksichtigung der physikalischen Ausbreitungsbedingungen'. Dr.-Ing. dissertation, Faculty of Electrical Engineering and Information Sciences, Ruhr-Universitat Bochum, Shaker Verlag, Aachen, 1998 MEILIJSON, I.:'A fast improvement to the EM algorithm on its own terms', Journal of the Royal Statistical Society Series B, 1989, 5, (1), pp. 127-138 MENG, X. L. and VAN DYK, D.: 'The EM algorithm - an old folk song sung to the fast tune', Journal of the Royal Statistical Society Series B, 1997, 59, pp. 511-567 MICHAEL I. MILLER and DANIEL R. FUHRMANN: 'Maximum-likelihood narrow-band direction finding and the EM algorithm', IEEE Trans. Acous. Speech Signal Process., September 1990, 38, (9), pp. 1560-1577 RAO, C. R.: 'Linear statistical inference and its application' (Wiley, New York, 1973) THOMSON, D. J.: 'Spectrum estimation and harmonic analysis', Proc. IEEE, September 1982, 70, (9), pp. 1055-1096 TITTERINGTON, D. M.: 'Recursive parameter estimation using incomplete data', Journal of the Royal Statistical Society Series B, 1984, 46, (2), pp. 257-267 WAX, M. and KAILATH, T.: 'Detection of signals by information theoretic criteria', IEEE Trans. Acoust. Speech Signal Process., 1985, 33, (2), pp. 387-392 WAX, M. and ZISKIND, L: 'Detection of the number of coherent signals by the MDL principle', IEEE Trans. Acoust. Speech Signal Process., 1989, 37, (8), pp. 1190-1196 WU, C. F. J.: 'On the convergence properties of the EM algorithm', Ann. Stat., 1983,11, pp. 95-103 BIN YANG: 'Projection approximation subspace tracking', IEEE Trans. Signal Process., 1995, 43, pp. 95-107
Chapter 21
The common reflection surface (CRS) stack - a data-driven space-time adaptive seismic reflection imaging procedure Jurgen Mann, Eric Duveneck, Steffen Bergler and Peter Hubral
21.1
Introduction
A frequently used way of obtaining information about the earth's interior is to investigate the propagation of elastic energy in the form of seismic waves. These have, for instance, been of help in gathering knowledge about geological structures from the earth's crust down to the core. In particular, reflection seismic methods are applied when information about targets of only several kilometres (up to about 50 km) in depth is of interest. For that purpose, seismic energy is released from controlled sources, such as explosions or vibrators, reflects at discontinuities of the elastic properties in the subsurface and is finally recorded at a number of receiver positions. Reflection seismic measurements are widely applied in the exploration for hydrocarbon reservoirs, where a detailed image of the subsurface geology (up to 5 km depth) is required. The hydrocarbon industry has therefore stimulated intensive research to improve the methods of acquiring and processing seismic data. Recorded seismic data not only contain energy from reflected waves needed to construct a subsurface image, but also energy from a wealth of other wave types initiated by the source which are considered as noise in the context of seismic reflection imaging. In addition, incoherent, random noise may contaminate the data. Therefore, several processing steps are in general conducted before seismic data can be interpreted and a clear image of the subsurface can be obtained. Some of these methods fall into the category of space-time adaptive processing (STAP) techniques. With the strong increase of computer power since the early 1990s, new, computationally more expensive and sophisticated processing methods have become feasible.
One of these methods in reflection seismics is the space-time adaptive common reflection surface (CRS) stack. It suppresses incoherent noise and enhances coherent reflection events, produces a stack section in the time domain which is considered for a first interpretation and finally provides attributes which are a key to connect time-domain seismic data with a depth image of the subsurface. Mathematically, the CRS stack is based on zero-order ray theory (geometrical optics). In contrast to many other seismic imaging methods, no information (e.g. a rough idea about the subsurface elastic properties) other than the data itself are required for the CRS stack. It is therefore classified as a data-driven seismic time-domain imaging method. In this chapter, we firstly discuss the propagation of seismic waves in the subsurface and the acquisition of seismic reflection data to show what kind of events are present in a seismic record and what kind of acquisition geometries are used. We then briefly summarise commonly used processing steps applied to seismic data. As a particular example for a space-time adaptive processing technique, the CRS stack method, its theoretical basics as well as the implementation, is outlined. Finally, the application of the CRS stack to a synthetic data set together with the use of the CRS attributes for the construction of a model of the seismic velocity distribution in the subsurface are demonstrated. Practical results on real data can, e.g. be found in References 1 to 3. To show the correctness of the obtained results, the transformation of the seismic reflection data into a depth image with the help of the velocity model is shown.
21.2 21.2.1
Seismic reflection imaging The seismic wavefield
In reflection seismics, apart from the immediate vicinity of the seismic source, the earth is assumed to behave like an elastic continuum in the case of small deformations. This means that classical continuum mechanics with a linear stress-strain relation (generalised Hooke's law) provides the mathematical background for almost all seismic methods and applications. Let the energy source be described by a body force f (x, t) acting per unit volume on particles of the earth at positions defined by vectors x at some time t, then the particle displacement vector u(x, t) is governed by the elastodynamic wave equation. In seismic literature, the vector field u(x, t) is commonly referred to as the wavefield. In the most general case of an inhomogeneous, anisotropic medium, the elastodynamic wave equation reads: (21.1) where repeated indices are summed over. Here, p = p(x) is the density of the medium and Cijki = CijkiOO the elastic tensor which describes the elastic material properties of the earth at x. Analytical solutions of equation (21.1) are, in general, not available. Elastodynamic equations in seismics, their derivations, and ways to solve them are discussed in many textbooks, e.g. References 4 and 5.
Due to symmetries of the elastic tensor, only 21 of its 81 components are independent. These are often referred to as the elastic parameters. In simpler anisotropic media, the number of elastic parameters reduces corresponding to the symmetry of the respective medium. In isotropic media, where the elastic properties are independent of the direction of the particle displacement, only two independent elastic parameters, the Lame constants X and ^, remain (see, e.g. Reference 4). In many cases, the earth's crust is well enough described by an isotropic medium. This case will be treated in the following, recognising that there are observations in seismics that can only be explained by the presence of anisotropy in the subsurface. In acoustic media, the second Lame constant /x, representing the shear modulus, is equal to zero. In this case, u(x, 0 can be replaced by the pressure field /?(x, t) = —A, V • u(x, t) leading to the scalar acoustic wave equation. In inhomogeneous media, the complexity of the wavefield increases with the complexity of the variation of material properties. In the simple case of an unbounded, homogeneous medium two independently propagating body wave types, namely compressional (P waves) and shear waves (S waves), exist. In an inhomogeneous medium, P and S waves cannot be strictly separated. Single P and S wave arrivals, however, have been observed on seismic records although the subsurface is inhomogeneous. This is due to the fact that high-frequency elastic body waves are approximately separable into independently propagating P and S waves. One way of studying the propagation of high-frequency waves in inhomogeneous media is the ray method. If the distribution of elastic parameters is sufficiently smooth, i.e. they do not vary greatly over the distance of the order of a wavelength, single body waves can be reasonably well described by this method. This means that, apart from interfaces, i.e. discontinuities in the subsurface elastic parameters, P and S waves are again decoupled. In the high-frequency limit, the kinematics of each body wave, i.e. the traveltime and propagation direction of the wavefront, are characterised by a separate eikonal equation. One of the most common ways of solving the eikonal equation is the method of characteristics where the characteristic curves represent rays. In isotropic media, rays are orthogonal trajectories to the moving wavefront. The amplitudes of each body wave are in the high-frequency limit described by the corresponding transport equation. The eikonal as well as the transport equations for elementary P and S body waves are obtained by inserting: u(x, 0 = V(x)F(t -T(X))
(21.2)
into the elastodynamic wave equation (21.1). F(t — r(x)) is the transient highfrequency analytical signal, i.e. its Fourier spectrum F(a)) effectively vanishes for low frequencies. r(x) is called eikonal (where t = r(x) represents the wavefront at time t) and U(x) is the vectorial complex-valued amplitude. A description of body waves with equation (21.2) implies zero-order ray theory which is sufficiently accurate for most situations in reflection seismics [5]. When a transient wave impinges on an interface, reflection, transmission through the interface and wave type conversion occur. Here, boundary conditions regarding continuity of displacement and traction (i.e. components of the stress tensor) have to
be considered across the interface. In reflection seismics, we are mainly interested in waves that are reflected only once on their way from the source to the receiver (primary reflection events). However, the recorded wave field contains a wealth of other wave types due to multiple reflection, scattering, diffraction and surface waves travelling along the free surface of the earth. These have to be identified or removed, so that reflection events on a seismic record can be interpreted. An outline of ways to do this is given in Section 21.2.3.
21.2.2 Acquisition of reflection seismic data In reflection seismics, the reflected seismic wavefield due to controlled seismic sources, recorded at an array of seismic receivers, is interpreted. Although a number of different acquisition geometries can be realised (e.g. involving receivers in boreholes or at the sea bed), the usual case is that of source and receivers located on the same measurement surface (the earth's surface). Here we will discuss only the case of seismics applied to the exploration for hydrocarbons, i.e. measurements with a target depth of a few kilometres. In land seismic acquisition, the seismic source may consist, e.g. of explosives or of a seismic vibrator bringing a frequency-modulated sweep signal into the subsurface. Common marine sources include so-called airguns, which release a high-pressure air bubble into the water. Seismic receivers in the land seismic case (geophones) measure one or more components of some function of particle motion (depending on the natural frequency of the geophone) at the measurement surface due to the arriving elastic wavefield. Marine seismic receivers (hydrophones), on the other hand, measure variations of water pressure due to the arriving acoustic wavefield. They are usually placed in streamer cables of up to several kilometres (up to 10 km) length. For details on seismic sources and receivers we refer to Reference 6. The recorded wavefield at an array of receivers due to the excitation of one seismic source makes up a common shot seismogram, or shot gather. It consists of a number of seismic traces (in three-dimensional acquisition up to several thousand per shot), containing the discrete time series representing the signal recorded at the different receivers in digital form. An example of such a shot gather acquired on land with receivers placed on both sides of the source on a straight line is displayed in Figure 21.1. Apart from the desired reflections, a number of other wave types are present in the recorded wavefield. In the Figure, the direct wave between source and receivers is denoted by a letter A, a head wave is denoted by B, ground roll (i.e. Rayleigh waves travelling at low velocity along the measurement surface) is denoted by C, and some examples of reflection events are denoted by the letter D. Note that the amplitudes in the seismogram are scaled for display to balance the amplitudes of different events. To obtain sufficient information to produce an image of the subsurface below a continuous profile (two-dimensional case) or an entire three-dimensional subsurface image (three-dimensional case), a sequence of common shot experiments, with sources spaced, if possible, at regular intervals, is required. In reflection seismics, directed beams may not be produced during acquisition. Although the seismic source
time, s
offset, m
Figure 21.1 A shot gather extractedfrom a land seismic data set. In addition to the reflection events (some of them marked by the letter D), various other kinds of events can be observed, e.g. that of the direct wave (A), head wave (B) and ground roll (surface waves) (C)
may have a certain directivity pattern (primarily due to free-surface effects), it can be considered as a point source, when compared with the dominant signal wavelength. Typical dominant signal frequencies in reflection seismics often lie in the range between 20 and 50 Hz. Propagation velocities of P waves in the upper few kilometres of the earth's crust vary roughly between 1500m/s (water velocity) and more than 6000 m/s, resulting in a dominant wavelength near the source of the order of 50-100m. In two-dimensional marine acquisition, one streamer is towed behind a vessel with an airgun firing at regular intervals, but in three-dimensional marine acquisitions, usually several streamers are towed side by side. Acquisition of two-dimensional and three-dimensional land seismic data may involve a number of different, often irregular, measurement geometries. For the design of seismic acquisition geometries, target illumination, as well as spatial aliasing issues need to be considered. These are discussed in detail by Vermeer [7].
The recording of overlapping common shot experiments provides redundant information on subsurface structures: points on reflectors in the subsurface are illuminated by several common shot experiments with displaced source locations. Seismic data recorded in this manner are therefore called multicoverage data or, for reasons that will become obvious below, prestack data. As will be discussed in more detail, this redundancy plays a fundamental role in improving the signal-to-noise ratio (SNR), but also in the determination of a velocity model for the transformation of the seismic data into a structural image in depth. In addition, it allows the interpretation of reflection amplitudes as a function of source-receiver separation, or of reflection angle, to obtain further information on the elastic properties of the subsurface.
common offset depth, km
depth, km
common midpoint
distance, km
time, s
distance, km
haltoffset, km
midpoint, km zero offset depth, km
depth, km
common shot
distance, km
Figure 21.2
distance, km
Three-dimensional data volume in the case of two-dimensional acquisition. Only the traveltimes corresponding to one reflector are displayed here. The volume is built up by a set of shot gathers, one of which is displayed in the lower left. Commonly used subsets of this data volume are common midpoint gathers (upper left) and common offset gathers (upper right). The zero offset section (lower right) cannot be directly acquired, but is usually simulated from the prestack data. (Figure modified after Reference 26)
time, s
offset, m
Figure 21.3
Common midpoint gather extractedfrom the same data set as the shot gather in Figure 21.1, the midpoint location of the former coincides with the shot location of the latter. Note the irregular distribution of seismic traces due to irregularities in the acquisition geometry
For reasons given below, many seismic processing steps are carried out on the data sorted into ensembles of traces having the same midpoint between source and receiver coordinates, i.e. common midpoint (CMP) gathers. If traces within CMP gathers are sorted according to their source/receiver distance (offset), a multi-dimensional data hyper volume results. For two-dimensional seismic acquisition (acquisition on a single profile) this is a three-dimensional time-midpoint-offset data volume, which is schematically displayed in Figure 21.2 for a very simple two-dimensional subsurface structure. In this Figure, traveltimes for the second reflector recorded at a number of different offsets are displayed as a function of midpoint coordinate as grey curves. Examples of subsets of the multicoverage reflection surface of the reflector, corresponding to subsets of the data that are often used in processing, are indicated in the Figure. Although the data have been recorded as common shot gathers (lower left), they are usually analysed and processed in CMP gathers (upper left) and common offset gathers (upper right). An intermediate result of seismic processing is a simulated
zero offset (ZO) section (lower right). As for technical reasons, sources and receivers may not coincide in practice in seismic acquisition, zero offset sections cannot be directly measured. In the case of three-dimensional seismic acquisition, the midpoint and offset coordinates are given by two-component vectors, the seismic data then constitute a five-dimensional hyper volume. Figure 21.3 shows an example of a CMP gather taken from the same dataset as the shot gather in Figure 21.1. Its midpoint location coincides with the source location of that shot gather. A number of reflection events is clearly visible. It is also apparent that there are some irregularities in the recording geometry, resulting in an irregular offset spacing.
21.2.3
Seismic reflection processing
For readers not familiar with seismic reflection processing, we will here briefly summarise some of the basic processing steps applied to recorded seismic data. STAP techniques, if defined as processing or filter operations applied to multi-dimensional arrays containing recorded wavefields, play a significant role in this field. The aim of seismic reflection processing is to extract and enhance primary reflection events of a specified body wave mode (usually compressional waves) in the recorded data and to use these to obtain a structural image of the subsurface. This involves the suppression of random ambient noise, as well as different kinds of coherent noise, which include, in the context of reflection seismics, all other events present in the recorded wavefield. Coherent noise that may be present in the data can include (depending on the recording environment) the direct/refracted wave between source and receiver, dispersive waves travelling along the measurement surface (e.g. Rayleigh waves), the wave travelling from source to receiver directly through the air (air wave), dispersive waves travelling in a shallow water layer (guided waves), waves scattered from shallow diffractors (side-scattered noise), monofrequency noise (e.g. associated with power lines), multiple reflections and reverberations. Incoherent, random ambient noise can originate from a number of different sources. Examples of some of the different kinds of noise (coherent and incoherent) can be seen in the shot seismogram in Figure 21.1. At the same time, kinematic information needs to be extracted from the reflection events for the construction of a model of the distribution of the propagation velocity of seismic waves in the subsurface (velocity model). This model is required for transforming the measured reflection events into a subsurface structural image by a process called depth migration. In further steps, additional information on subsurface material parameters may be derived from reflection amplitude behaviour. The character of recorded seismic data, and, thus, the choice of adequate processing procedures, depends strongly on geological subsurface conditions, measurement surface conditions, acquisition parameters and the recording equipment used. Processing parameters therefore need to be chosen interactively, often
involving a considerable amount of interpretation. No completely automatic processing is possible. The seismic multistep processing sequence described here is by no means exhaustive. It is a condensed compilation of the most important procedures used in seismic reflection processing with an emphasis on steps that involve multidimensional processing and filtering procedures, i.e. STAP techniques. A detailed treatment of various practical aspects of seismic data processing can be found in Reference 8. 21.2.3.1 Preprocessing In a first processing step, the recorded data in the form of shot seismograms are prepared for further processing that emphasises reflections. In this initial step, bandpass filtering is applied to suppress noise that lies outside the expected signal bandwidth. During this processing step, STAP techniques can be used to suppress linear coherent noise, i.e. events with a constant stepout between traces (e.g. guided waves or surface waves), if it interferes with primary reflection events. It can be distinguished from these on grounds of its dip, which is in general much steeper than that of reflection events. An appropriate domain for the removal of coherent linear noise is therefore the frequency-wavenumber (f-k) domain. Differently dipping events map into different regions of the f-k domain and can therefore easily be removed (e.g. Reference 9). A transform that allows time variant dip filtering is the linear Radon transform. Also during preprocessing, initial corrections for traveltime variations due to measurement surface topography and near-surface velocity variations are performed (static corrections). 21.2.3.2 Deconvolution As the source signal shape usually differs significantly from the desired Dirac impulse (depending on the utilised seismic source), deconvolution (e.g. Reference 10) is applied to increase temporal resolution. The underlying assumption is that a recorded seismic trace can be viewed as the convolution of a series of spikes, representing reflectors, with a wavelet, i.e. the source signal (assumed to be stationary). A linear filter operator can be found, which, when convolved with a seismic trace, removes the source signal shape or transforms it into the desired signal shape. In predictive deconvolution, reverberations or short period multiple reflections are removed from seismic traces by the application of a prediction error filter. 21.2.3.3 CMP sorting As previously mentioned, for further processing steps, it is useful to resort the recorded traces into ensembles of traces having a common source/receiver midpoint. Although each resulting CMP gather no longer represents the wavefield recorded in a single experiment, the resorting of the data has a number of advantages. If the lateral subsurface heterogeneity is moderate, reflection events in CMP gathers, displayed as a function of offset, show approximately hyperbolic traveltime behaviour with apex locations at zero offset. The shapes of these reflection events carry information on the subsurface velocity distribution below the considered midpoint location. Under
favourable conditions, they can also be used to distinguish primary from multiple reflection events. In Figure 21.3 an example of a CMP gather is displayed. Several approximately hyperbolic events are clearly visible. 21.2.3.4 Velocity analysis In order to extract kinematic information from the reflection events in CMP gathers, a so-called velocity analysis is performed. Each CMP gather is transformed from the traveltime offset domain into a velocity spectrum [H]. In a velocity spectrum, the coherence, measured along hyperbolic trajectories in the CMP gather, is plotted as a function of the parameters that determine the hyperbola shape (see equation (21.3)): zero offset traveltime to and a parameter called stacking velocity t>stack • Maxima in the velocity spectrum describe hyperbolas which approximately follow actual reflection events in the data. If the assumption of a one-dimensional subsurface structure and velocity distribution seems appropriate (velocities and structure only vary in the vertical direction), i>stack c a n be interpreted as an approximation to the root-meansquare (RMS) velocity I>RMS between the corresponding reflector in the subsurface and the measurement surface. It can be used, together with the traveltime to, to obtain a subsurface velocity model. In cases where the assumption of a one-dimensional velocity distribution is no longer valid, stacking velocities can still be used for an initial velocity model estimate, which is then improved by more sophisticated methods in subsequent processing steps. 21.2.3.5 Multiple attenuation If the subsurface structure is reasonably close to one-dimensional, the moveout of events in CMP gathers (i.e. the steepness of the corresponding hyperbolas), described by the parameter t>stack> c a n be used to distinguish primary reflections from multiple reflected events which are usually considered as coherent noise. As the propagation velocity of seismic waves normally increases with depth, multiple reflections can be identified on the grounds of their low stacking velocity values, as compared with primary reflection events of comparable zero offset traveltime. Based on this criterion, CMP gathers can be transformed into a domain, in which multiple reflections are separated from primary reflection events, and can, thus, be removed. Common multiple-removal methods, which make use of the moveout criterion, are based, e.g. on the application of various variations of the discrete Radon transform [12]. In cases of more complex subsurface structure, identification of multiples based on fstack is no longer possible and more sophisticated space-time adaptive multiple removal methods need to be applied (e.g. Reference 13). 21.2.3.6 Normal moveout (NMO) correction and stack The determined values of ustack as a function of to, which describe the shape of reflection events in CMP gathers, can be used to correct for these shapes, aligning reflectors horizontally. By summing up (stacking) traces along the offset direction, the SNR can be increased, as reflection energy sums up coherently, and at the same time the amount of data is significantly reduced. The resulting seismic section (containing one
summed trace for each CMP gather) can be seen as an approximation to a zero offset section (see Section 21.2.2). In the case of significant departure of reflector dips from the horizontal, an additional process, called dip-moveout (DMO) (e.g. Reference 14) needs to be applied before stacking. 21.2.3.7 Velocity model building and migration In order to obtain a structural image of the subsurface from the seismic data, these need to be transformed into the depth domain by a process called depth migration. This process is applied either to the stacked data, assumed to represent a simulated zero offset section (poststack migration), or to the unstacked data (prestack migration). As mentioned earlier, for the application of depth migration, a seismic subsurface velocity model needs to be known. This model can be thought of as representing the long wavelength component of the true subsurface velocity distribution, affecting wavefield kinematics, while the short wavelength velocity variation gives rise to the recorded seismic reflection events. Information about the velocity model is obtained from the seismic data itself, together with additional geological a priori information. Starting with an initial model (usually obtained from stacking velocities) a velocity model is usually constructed iteratively by the repeated application of prestack migration. This process makes use of the already mentioned data redundancy and the fact that, in a correct velocity model, the obtained depth image should be independent of the source-receiver separation (offset) of the recorded data. Seismic depth migration itself can be described as a model-based multidimensional processing technique with the aim of placing reflectors in the correct subsurface position and focusing diffractions. A number of different migration methods exist, which make different assumptions about the data and the velocity model heterogeneity, and are implemented in different data domains. Most migration methods are based, in one way or another, on the scalar wave equation and follow the principle of wavefield continuation and the application of an imaging condition. In the case of poststack migration, the so-called exploding reflector concept may be used, i.e. the data can approximately be viewed as originating from exploding reflectors in the subsurface and travelling through a medium with half the actual velocity. Using this notion, the wavefront shape of an exploding reflector at time t = O coincides with the shape of the reflector itself. The reflector structure in the subsurface can thus be obtained by continuing the measured wavefield into the subsurface and evaluating it at t = O (the imaging condition). In prestack migration, the source and the receiver side need to be taken into account during wavefield continuation before applying the imaging condition. Migration can be implemented based on an integral solution of the wave equation, usually applied in the time domain. This approach is called Kirchhoff or diffraction summation migration [15,16]. Other migration approaches are based on finite-difference solutions of the wave equation, applied either in the space-time, or the space-frequency domain (e.g. Claerbout [17]). Migration can also be performed in the frequency wavenumber domain [18-20]. For a detailed treatment of different migration algorithms, refer to Reference 17.
21.3
Common reflection surface stack
In this section, the CRS stack will be introduced, which replaces the processing steps of velocity analysis, normal-moveout correction and stack, described above. In addition to an improved SNR in the simulated zero offset section, the CRS stack yields kinematic information that can be used, e.g. for the construction of a velocity model for depth imaging. Two of the basic problems one faces in seismic reflection imaging are: the SNR might be very low and the distribution of propagation velocities in the subsurface, called velocity model, is unknown. The velocity model is, however, required for depth imaging. The redundancy in the prestack data is the key to addressing both problems: each point on a reflector contributes to experiments with different source and receiver geometries. The wavefield recorded in a seismic experiment depends on the reflector properties, as well as on its overburden, i.e. the velocity distribution. If we could identify all amplitudes in the data pertaining to one and the same reflection point, all amplitudes could be summed to attenuate the incoherent noise and the space-time distribution of these amplitudes would carry information about the velocity model above the reflection point. However, the velocity model is usually under-determined by seismic reflection measurements made on the earth's surface, such that additional assumptions on the model properties are required. For example, one might assume a model set up by homogeneous, isotropic layers or blocks, or the smoothest possible isotropic model without discontinuities that is consistent with the kinematics of the prestack data. For a strict identification and summation of the amplitudes belonging to a common reflection point in the subsurface, the velocity model would need to be known to propagate the recorded reflection responses back to their common correct subsurface position where they can be summed up. This is the previously mentioned prestack depth migration (see Section 21.2.3) which theoretically provides the best possible image of the subsurface. However, the required velocity model is initially not known and has to be derived by means of iterative applications of prestack depth migration itself and sophisticated methods to update the velocity model until the migration result is consistent, i.e. the subsurface position of a reflection point in the migrated image is independent of the source and receiver geometry. An alternative imaging approach is to avoid the explicit parameterisation of the velocity model in the beginning. Instead, the summation of the amplitudes pertaining to a given reflector point is performed in the prestack data, i.e. in the space-time domain. To be independent of a velocity model, the reflection events have to be parameterised such that the parameters can be directly determined from the prestack data. In this way, the summation of amplitudes to increase the SNR can be separated from the determination of the velocity model and the transformation into the depth domain. The main problem is to identify and parameterise reflection events in the acquired data as generally as possible but, at the same time, with a reasonable number of free parameters and, if possible, with a sound physical interpretation of these parameters. A restriction to a fixed number of free parameters implies certain assumptions on the complexity and smoothness of the subsurface. Such space-time adaptive data-driven
approaches rely on the existence of coherent reflection events in the prestack data. If coherent reflection events can be observed in the acquired multicoverage data, they can be associated with wavefronts that appear at the acquisition surface. The basic idea of the CRS stack is to use a second-order approximation of the traveltime in terms of the acquisition parameters for subsurface models with lateral variations, i.e. models with arbitrarily curved reflectors. For the sake of simplicity, we assume in the following that the data are acquired on a plane acquisition surface, although topography can be considered in the CRS approach as well. Our aim is to describe the recorded reflection events in terms of an analytic approximation of the kinematic space-time reflection response of an arbitrarily curved reflector segment in depth and to sum all amplitudes stemming from one and the same reflector segment, the CRS. The parameters of this analytic function are determined automatically from the prestack data by means of coherence analysis as explained below. In principle, the most obvious approach would be to use a second-order approximation of the kinematic reflection response of a reflection point rather than a segment. However, the direct determination of the parameters of such a common reflection point (CRP) (hyper) trajectory from the prestack data is inherently ambiguous: without information about neighbouring reflection points, it is impossible to decide whether a trajectory actually refers to a single CRP or mixes contributions from various reflection points. A physical justification to use entire reflector segments is the fact that, due to the finite bandwidth of seismic signals, not only a single reflection point contributes to the reflection response, but also its vicinity within the Fresnel zone at the reflector. Finally, from a pragmatic point of view, the concept of reflector segments allows us to use a significantly larger part of the coherent reflection event in the prestack data for the summation of amplitudes, thus attenuating incoherent noise. In other words, there is a trade off between SNR and the achievable resolution.
21.3.1
Classic data-driven approaches
Data-driven imaging methods have been well established in seismic reflection imaging for decades, although their data-driven aspects are often not fully exploited in the actual application. A prominent example of such applications is the so-called CMP stack that was introduced by Mayne [21]. It is based on the assumption of a one-dimensional model consisting of a set of homogeneous layers with parallel, horizontal boundaries. For such models, the reflection events in the CMP gathers can be expressed analytically in terms of a Taylor series of the squared traveltime: 4/i 2
'CMP = t20 + ^—
+ (Xh4)
(21.3)
V
RMS
The zero-order term is the squared reflection time i^ for an experiment with coincident source and receiver. Due to the reciprocity of traveltimes, i.e. their invariance with respect to the exchange of sources and receivers, only coefficients for even orders of the acquisition parameter (here the signed half-offset h between source and receiver) remain. The dominant second-order coefficient can be expressed in terms of the
root-mean-square velocity URMS (in this case roughly coinciding with ustack) of a reflector's overburden. In practical application, the CMP stack is preceded by a stacking velocity analysis (see Section 21.2.3) to setup a stacking velocity model. With this model, CMP trajectories according to equation (21.3) are available for any point (#0, to)', the amplitudes in the prestack data can be stacked along the CMP trajectories and assigned to the respective ZO samples to simulate a ZO section. This kind of second-order description of reflection events has been very successful in spite of its inherent one-dimensional model assumption. Later, this approach has been generalised to dipping plane reflectors by means of a subsequent process that accounts for the reflector's dip. This approach is known as dip moveout (DMO) correction (see, e.g. References 14 and 22), usually applied with the assumption of a (locally) homogeneous subsurface model. In practice, DMO is applied in a sequence starting with a normal moveout (NMO) correction that removes the offset-dependent influence of the reflector's overburden, followed by the DMO correction that considers the reflector's dip, and a final stack of all (moveout corrected) amplitudes for all source and receiver offsets. This standard processing sequence is called NMO/DMO/stack.
21.3.2
Second-order traveltime approximations
If the source and receiver coordinates on the planar measurement surface are given by two-dimensional vectors X5 and xg9 midpoint and half-offset coordinates can be defined by: x
m
=
~Z ( x g ~f~
x
s)
and h=
2^8-**)
The traveltime surface of a reflection event in the prestack data can be expanded into a Taylor series around any point (xo, ho). Up to second order in terms of x w and h it reads:
(21.4) where derivatives with respect to a vector symbolise the partial derivatives with respect to all components of this vector taken at (xo,ho). Due to the reciprocity of traveltimes, reflection traveltimes are symmetrical around h = O. This fact reduces the number of independent parameters if a Taylor expansion is performed around
point (xo, ho = 0) associated with a coincident source/receiver pair at Xo: t (x m , h) = to + 2p m • (xm - X0) + (xm - x o ) r M m (xm - X0) + h r M/,h (21.5) where the vector p m , and the symmetric matrices M m and M^ contain first and second derivatives of the half traveltime with respect to the midpoint and half-offset coordinates, taken at (xo,ho = 0): p m = jdt/d\m, Mm — ^d21/3x^ and Mz2 = ^d2t/dh2. Accordingly, two first and six second spatial derivatives remain in this particular case which will be considered in the following. The second-order approximation oft2 is then [23]: t2(xm,h)
= [t0 + 2p m • (xm - X0)]2 + 2to[(xm - x o ) r M m (x m - X0) + h r M/,h] (21.6)
21.3.3 Physical interpretation of the coefficients If a locally constant near-surface velocity value vo is assumed in the vicinity of xo, the vector p m specifies the emergence direction (emergence angle a and azimuth (p) of the zero offset ray emerging at xo, hereafter called the central ray: Pm = (VQ1 sin a cos 0, VQ1 s i n a s i n 0 ) r . Restricting the offset to h = 0 in equation (21.6) gives a second-order approximation of the traveltimes along zero offset rays in the vicinity of the central ray, i.e. rays which, upon reflection at a subsurface structure, emerge at receiver locations, which coincide with their initial source locations. These rays can be interpreted as belonging to a common wavefront of a hypothetical wave, which travels into the subsurface along the central ray and is reflected into itself, i.e. the wavefront curvature of the downgoing wavefront at Xo coincides with the wavefront curvature of the reemerging reflected wave. This hypothetical wave is termed the normal wave, as all corresponding rays are normal to the reflector structure in the subsurface. The corresponding wavefront curvature can be obtained by transforming M m into a local ray-centred Cartesian coordinate system at xo: M m = T M N T 7 = 1^" 1 TKNT 7 '. Here, T is the 2 x 2 upper left submatrix of the transformation matrix from the ray-centred to the global Cartesian coordinate system (e.g. Reference 5) and KN is the wavefront curvature matrix of the normal wave, containing three independent elements. The matrix T depends on the emergence angle and azimuth, as well as on the choice of the local coordinate system orientation in the plane normal to the ray. If, on the other hand, the midpoint coordinate is restricted to xm = xo (a CMP gather) in equation (21.6), it describes a second-order approximation of the reflection traveltimes along rays between sources and receivers placed symmetrically around the midpoint xo. As noted earlier, due to reciprocity, traveltimes are invariant if source and receiver coordinates are interchanged, i.e. if h is replaced by —h. Again, a hypothetical wave corresponding to these rays can be defined, which travels into the subsurface along the zero offset ray direction pm at xo and is reflected into itself, i.e. the wavefront
curvature of the emerging wave at Xo coincides with the initial wavefront curvature. It can be shown that, if only second-order traveltimes in the CMP configuration are considered, the subsurface reflection points of the reflected rays corresponding to the different source-receiver offsets can be approximately regarded as coinciding at the normal incidence point (NIP) of the zero offset ray on the reflector (e.g. Reference 24). The described hypothetical wave can, thus, be regarded as focusing at the NIP and is therefore also called the NIP wave. Similarly to the normal wave case, the NIP wavefront curvature can be obtained by transforming M^ into a local raycentred Cartesian coordinate system at xo". M^ — T M N I P T ^ = VQ T K N I P T ^ . The matrix KNIP is the 2 x 2 wavefront curvature matrix of the NIP wave, containing three independent elements. In the context of the CRS stack, the emergence angle a and the azimuth 0, as well as the six independent elements of the curvature matrices KNIP and KN are also referred to as CRS attributes or kinematic wavefield attributes. The normal wave and the NIP wave are also known as eigenwaves (e.g. Reference 25), and have a number of applications, e.g. in the estimation of velocity models for depth imaging. The second-order traveltime approximation of t2, given in equation (21.6), can be rewritten in terms of the NIP and normal wave curvature matrices [26]:
(21.7) The eight independent parameters are then the two elements of p m (or alternatively the emergence angle a and azimuth 0), the three curvature values of KN, and the three curvature values of KNIP (as already noted, T also depends on a, <j> and the orientation of the ray-centred coordinate system). In the case of data recorded on a single profile, the midpoint and half-offset vectors reduce to scalars xm and h. Equation (21.7) then reduces to:
(21.8) This expression contains three independent parameters, a, ^TN and ^fNIPEquations (21.7) and (21.8) can also be derived in different ways, e.g. with a geometrical approach [27] or based on paraxial ray theory, using a ray propagator matrix formalism (e.g. Reference 28). The hypothetical NIP and normal wave experiments and their relation to ray propagator matrices have been described by Hubral [25]. If only CMP gathers (xm = xo) are considered, equation (21.8) reduces to the wellknown CMP traveltime formula (21.3). The relation between the stacking velocity and the remaining CRS attributes reads: (21.9)
21.3.4
Implementation
The kinematic wavefield attributes, i.e. the parameters of the CRS operator, have to be determined directly from the prestack data such that the CRS operator fits the actual reflection event in the vicinity of a point (xo,*o) as closely as possible. The fit of a given CRS operator to a reflection event in the data can be evaluated by calculating the coherence of the prestack data along the CRS operator. An established coherence measure in reflection seismics is the coherence criterion semblance [29] which reads:
(21.10) where #[/][y] denotes the yth sample of the /th seismic trace. The CRS operator is here represented by the function k(i), and Af is the number of contributing traces. Semblance is evaluated within a temporal window with a width of W time samples to account for the limited bandwidth of the data. Semblance can be seen as the ratio of correlated and uncorrelated energy within the considered subset of the data. Various other coherence criteria have been discussed by Neidell and Taner [29] which are, e.g. based on cross-correlation of seismic traces or on the stacked amplitude. The operator yielding the highest coherence value provides the optimum fit to the reflection event around point (xo,£o)- To find this optimum operator, we have to vary the kinematic wavefield attributes within physically reasonable bounds and to calculate the coherence along each operator. Consequently, we have to solve a non-linear global optimisation problem, in the particular case of two-dimensional ZO simulation with three independent parameters. This kind of optimisation task requires a lot of computational effort. Therefore, it is advantageous to decompose it into a set of simpler optimisation problems which are applied to subsets of the entire prestack data volume. This strategy not only reduces the amount of data used at a time, but also the number of parameters that have to be determined simultaneously. One possible approach of this kind will be described in Section 21.3.5. As an additional complication, different reflection events might contribute to the same ZO location such that additional local coherence maxima have to be considered, too. Details on efficient implementation strategies for global and local coherence maxima can be found in Reference 30. An important task in the implementation and application of the CRS stack is the appropriate choice of the spatial aperture in which the search for the wavefield attributes and the summation of the amplitudes is performed. Several aspects have to be considered for the aperture: first of all, the aperture should cover the time domain counterpart of the Fresnel zone at the reflector, the so-called projected Fresnel zone. A sufficiently large aperture ensures that all relevant contributions to a particular CRP enter into the summation. In addition, we have to honour the approximate nature of the CRS operator. In too large an aperture, the second-order approximation might no longer be applicable and the resolution as well as the accuracy of the results might suffer. On the other hand, the determination of the attributes is, due to the presence
of noise, more stable the more traces contribute to the coherence analysis, and the SNR of the stacked result also increases with an increasing number of traces. Thus, there is a trade-off between stability and high SNR on the one side and resolution and accuracy on the other side. A detailed discussion of this topic and theoretical as well as practical aperture considerations can be found in Reference 30. The search for the optimum kinematic wavefield attributes and the stack are performed for each ZO location to be simulated, irrespective of whether there is a reflection event or not. In this way, entire sections containing the kinematic wavefield attributes, the coherence measure and the ZO amplitudes are obtained.
21.3.5 Practical aspects Although a comprehensive treatment of implementation strategies and the appropriate choice of processing parameters for the CRS stack is beyond the scope of this text, we will here briefly discuss the basic steps of the CRS procedure for one possible parameter search strategy. Efficient strategies for the CRS parameter search are essential for handling large data sets. This is usually achieved by splitting the multiparameter search, required for each ZO sample to be simulated, into a number of separate searches with fewer parameters, either by using subsets of the data during each search, or by making additional approximations. In the following, we assume that the data processing steps described in Section 21.2.3 have been performed up to the CMP sorting and that a value for the near-surface velocity VQ is given. For the application of the CRS stack, the following steps need to be performed: •
definition of aperture parameters under consideration of frequency content, acquisition geometry and complexity of the data (to be provided by the user) • definition of processing parameters, if possible based on physical constraints, e.g. expected stacking velocities, to limit the range of CRS attribute values to be searched (to be provided by the user) • transformation of the acquisition geometry into midpoint/half-offset coordinates (automatic) • determination of the CRS attributes for each ZO sample based on coherence analysis, e.g. using the semblance criterion (21.10) (automatic). In the simplest case, the following search strategy can be used: (i) Perform an automatic CMP stack, i.e. determine the stacking velocities i>stack (see equation (21.9)) in each CMP gather by restricting the CRS operator (21.8) to xm = JCo- The resulting CMP stack section serves as a first approximation of a ZO section. (ii) In the obtained CMP stack section, the remaining searches are performed in two steps: first, the emergence angle a is determined with a linear approximation of the CRS operator (K^ = O and h = O). Second, K^ is determined with the second-order approximation (21.8), still restricted to h = 0 and using the previously determined value of a. (iii) Calculate A^NIP from i>stack and a according to equation (21.9).
(iv)
The thus obtained attributes a, K^i? and K^ can be refined by an optional local optimisation with the full CRS operator in the entire prestack data. (v) Stack the prestack data along the full spatial CRS operator within the previously defined stacking aperture and assign the result to the respective ZO sample.
Note that the described search strategy requires a sufficient coverage (number of traces) of the used data subsets. If this is not the case, other strategies have to be applied. Also, situations with more than one relevant coherence maximum (e.g. so-called conflicting dip situations where events associated with different emergence angles intersect each other) require an extended search strategy.
21.3.6 A synthetic data example To illustrate the application of the CRS stack and the resulting CRS attribute sections, the process has been applied to a synthetic multicoverage data set. The CRS stack has already been successfully applied to real data sets. Examples of such applications can, for example, be found in References 1 to 3. Here, we prefer a synthetic model as it allows us to compare the CRS results with the forward calculated data. The same applies to the reconstructed velocity model and the structural image in the depth domain (see Section 21.4). The synthetic data (only specular primary P-wave reflections, no diffractions) were modelled by ray tracing in the blocky model displayed in Figure R29 (see colour signature), using a zero-phase wavelet with a dominant frequency of 20 Hz. A total of 600 shots with a shot spacing of 20 m and shot locations offset, m
Figure 21.4
time, s
time, s
location, km
Synthetic data example, diffraction events have not been modelled a near-offset section extracted from the prestack data b common midpoint gather for midpoint location 2000 m
time, s
location, km
Figure 21.5
Synthetic data example: ZO section simulated by means of the CRS stack method
ranging from JC = — 2000 m to x = 9980 m were modelled. Each shot was recorded by 95 receivers with a regular receiver spacing of 20 m, a minimum offset of 100 m and a maximum offset of 1980 m. The resulting CMP spacing is 10 m with a maximum of 48 traces per CMP location. Band-limited random noise was then added to the prestack data. The resulting common offset section for the minimum occurring offset of 100 m is displayed in Figure 21.4a, and a CMP gather for the CMP location 2000 m is shown in Figure 21.4b. A number of hyperbolic reflection events is clearly visible. The CRS stack was applied to this dataset resulting in the stacked simulated zero offset section in Figure 21.5. As might be expected, the SNR is significantly increased, as compared with the unstacked near offset section of Figure 21.4a, and the traveltimes of reflection events in both sections are roughly the same. This is not surprising, given that the offset of 100 m of the section in Figure 21.4a is very small compared with the maximum depth in the model. Together with the stacked section, a number of additional sections is obtained. These are displayed in Figure P.30 (see colour signature). The coherence section (Figure P. 3 0a) shows the maximum semblance value obtained along the CRS operator for each simulated ZO sample. This value is close to zero, where there are no reflection events in the data. At the location of actual reflection events, its value is influenced by the signal strength relative to the random noise along the reflection event, by the number of contributing traces (constant in this example) and by the fit of the CRS operator to the actual reflection event. All CRS attribute sections only have meaningful values where the coherence value is sufficiently high, i.e. on actual reflection events. In principle, a mask could be obtained from the coherence section, so that CRS attributes are only displayed where their values are meaningful. This has not been done here. Figure P.30b shows the emergence angle section. It has been obtained
using an assumed near-surface velocity value of VQ = 2000 m/s, which in this case is known exactly. The emergence angle varies roughly between —30° and +30°. The NIP wave curvature is displayed in Figure R 3 Oc as K^p = /?NIP- In the present example, meaningful values of /?NIP vary between about 200 m and 8 km. In a constant velocity medium the NIP wave radius of curvature would coincide with the normal ray length (i.e. the distance to the NIP). Figure P.3Od shows the normal wave curvature. Its relation to the curvature of reflection events in the zero offset section is obvious: it has positive values at convex parts of reflection events, negative values at concave parts of reflection events and is close to zero when the reflection event's curvature goes to zero. As will be shown in the next section, the emergence angle section and the NIP wave curvature section can be used together with the CRS stack section itself for the construction of a velocity model for depth imaging.
21.4
CRS attributes and velocity model estimation
In the previous section, the CRS stack has been applied to a synthetic multicoverage data set. Here, we will show an example for the application of the resulting CRS attributes, in particular the emergence angle a and the NIP wave radius of curvature m me /?NIP = ^NiPfollowing, it is assumed that a correct value for the near-surface velocity ^o has been used during the application of the CRS stack. If the NIP wave is reinterpreted as a hypothetical upwards travelling one-way wave due to a point source at the NIP, arriving at xo at the one-way traveltime to/2, it can be used for the estimation of the subsurface distribution of seismic velocities: in a correct velocity model, the NIP wave radius of curvature /?NIP? when propagated back into the subsurface, should shrink to zero at its hypothetical source location (the NIP on the reflector) at zero traveltime. Based on this criterion, the CRS attributes /?NIP and a can be applied in inversion methods in various ways: if a velocity model consisting of constant velocity layers separated by curved interfaces is assumed, the velocities and interface shapes can be determined layer by layer (e.g. References 24 and 31). If a smooth velocity model without discontinuities is assumed, the model can be determined with a tomographic approach by minimising the difference between the measured and modelled values of the quantities characterising the considered NIP waves (e.g. Reference 32). The subsurface locations of the corresponding normal incidence points and the local reflector orientations are obtained simultaneously with the velocity model. This approach has advantages over the layer-based approach, when it is not possible (as is often the case) to identify reflection events continuously across the stacked seismic section. An example of the application of the CRS-stack-based tomographic velocity model estimation to the synthetic data shown in Section 21.3.6 is given in Figures P.31a~c (see colour signature). A total of 505 data points consisting of the values /?NIP and a at a number of locations (JCO,*O) n a v e been obtained from the corresponding sections (Figures P.30b and c). This step of picking input data points for the inversion can, in principle, be automatised. In practice, however, multiple
offset, m
depth, m
depth, m
offset, m
depth, m
location, m
Figure 21.6
Result of prestack depth migration using the smooth velocity model obtainedfrom inversion of the CRS attributes a Common-image gather (CIG) at x = 3000 m b CIG at x = 6000 m. Most of the events are flat, i.e. independent of the source/receiver offset. Thus, the reconstructed velocity model (Figure P.3 Ib) is kinematically consistent with the entire prestack data c Stack of the migration results for different offsets. The artifacts are due to shortcomings of the ray-tracing modelling of the synthetic data (no diffractions were modelled)
reflections may still be present in the simulated zero offset section, which have to be avoided during picking. Therefore, manual picking will usually be necessary. Figure P.31a shows the inversion result, consisting of a smooth velocity model (described by B-splines) and the reconstructed normal rays. In Figure P.31b, the same smooth inversion result is shown together with the reflection point locations and local reflector dips (displayed as plane reflector elements) associated with the input data. The reconstructed model resembles a smoothed version of the true blocky velocity model (Figure P.29 (see colour signature)) that was used during modelling of the synthetic seismic data. The reconstructed smooth model is kinematically correct, i.e. reflector elements fall into the correct subsurface locations. This can be seen when plotting the reconstructed reflector elements into the true model as was done in Figure P.3 Ic. To demonstrate that the reconstructed velocity model and the prestack data are consistent not only for the ZO configuration, but for arbitrary source/receiver offsets, a prestack depth migration (see Section 21.2.3) was performed with the reconstructed model. As stated earlier, the kinematic aspects of the migration result should be independent of the source/receiver configuration in the case of a correct velocity model. This can be easily verified by displaying the migration result as a function of the source/receiver offset for a fixed horizontal location. Two such subsets of the depth-migrated prestack data, so-called common image gathers (CIG), are shown in Figures 21.6a and b. Indeed, virtually all events in this CIG are flat and, thus, independent of the acquisition geometry. The stack of the migration results for different offsets is shown in Figure 21.6c. This is the desired structural image of the subsurface. Migration artifacts in the lower part of the Figure are due to shortcomings of the ray-tracing modelling used to create the synthetic prestack data. The CRS stack result itself can also be used as the input for a considerably less costly poststack depth migration. The poststack migration result is not shown here.
21.5
Conclusions
Space-time adaptive processing techniques play an important role in seismic reflection imaging. Their main applications are the suppression of coherent noise and random ambient noise in the recorded data to amplify primary reflection events, as well as obtaining kinematic information required for transforming the data into a depth image of subsurface structures. The depth imaging process itself, i.e. depth migration, can be seen as a model-based STAP procedure. One of the main difficulties in seismic reflection imaging stems from the fact that the shape of recorded reflection events depends not only on the subsurface reflector structure itself, but also on the, a priori unknown, distribution of seismic velocities between reflectors and the measurement surface. We have focussed here on a new data-driven processing technique, the common reflection surface stack. The method makes use of data redundancy to obtain a simulated zero offset section with a significantly improved SNR while reducing the amount of data for further processing and imaging. In addition, kinematic information
is extracted from the data, which can be used in subsequent steps to construct a seismic velocity model, required for transforming the data into the depth domain. Apart from the simulated zero offset section itself, one obtains a number of associated CRS attribute sections: in the two-dimensional case, there is one section containing the emergence angle of the zero offset ray associated with each zero offset sample and two sections containing wavefront curvatures A^NIP and A^N associated with two hypothetical eigenwave experiments. We have demonstrated the CRS stack on a synthetic data example and shown the application of the CRS attributes in the estimation of a smooth velocity model for depth migration. Although the CRS stack method has here been applied to synthetic twodimensional data only, it is already being used in industrial practice with real seismic data in two dimensions and in three dimensions. An extension of the theory to the finite offset case (i.e. the simulation of non-zero offset sections) exists and has been implemented. 21.6 2 L 6.1
Glossary List of variables
(AfW. j \ discrete representation of recorded scalar wavefield Ctjki fourth-order elasticity tensor with up to 21 independent elements f body force vector per unit volume; source term in equation of motion for elastic media F transient high-frequency analytical signal h half-offset; half of vector connecting source and receiver h half-offset between source and receiver in two-dimensional case ho reference half-offset vector for Taylor expansion ki discrete representation of stacking operator KN curvature matrix of the normal wavefront K^ curvature of the normal wave in two-dimensional case KNIP curvature matrix of the NIP wavefront ^NiP curvature of the NIP wave in two-dimensional case Mh matrix of second derivatives of half traveltime with respect to h Mm matrix of second derivatives of half traveltime with respect to \m MN same as M m , but expressed in the local Cartesian ray-centred coordinate system MNIP same as M ^ , but expressed in the local Cartesian ray-centred coordinate system Af total number of traces in semblance calculation p pressure field pm vector of first derivatives of half traveltime with respect to x m Rm? = 1 / X N I P , radius of curvature of the NIP wave S semblance; a normalised coherence measure T 2 x 2 upper left submatrix of the transformation matrix from the ray-centred to the global Cartesian coordinate system
t to ^CMP Un u t>0 URMS ^stack W x x xo XQ Xg xm xm xs a A \x p r 4>
21.6.2
time zero offset traveltime traveltime in C M P configuration vectorial complex-valued amplitude particle displacement vector field near-surface velocity root-mean-square velocity stacking velocity temporal window length for semblance measure spatial location vector spatial coordinate in two-dimensional case reference midpoint vector for Taylor expansion reference midpoint for Taylor expansion in two-dimensional case two-component location vector of receiver on the acquisition surface two-component location vector of source/receiver midpoint on the acquisition surface midpoint of source and receiver in two-dimensional case two-component location vector of source on the acquisition surface emergence angle of the central ray with respect to the acquisition surface normal first Lame constant, a linear combination of bulk and shear moduli in isotropic elastic media second Lame constant, shear modulus in isotropic elastic media density of the elastic medium eikonal azimuth of the central ray with respect to the x-axis of the global Cartesian coordinate system
Specific terminology
depth migration transformation of time-domain seismic data into a depth image direct wave body wave propagating along the acquisition surface, appears as linear event in the shot seismogram eigenwave hypothetical wave propagating along the central ray with coincident incident and emerging wavefronts eikonal equation non-linear first-order differential equation describing the kinematics of a wave in the high-frequency limit ground roll see Rayleigh wave head wave wave reaching a layer boundary under the critical reflection angle, appears as linear event in the shot seismogram at larger offsets moveout temporal shift of a reflection event between neighbouring traces
multicoverage data seismic data set consisting of a multitude of separate seismic experiments NIP wave hypothetical eigenwave focusing in the NIP normal wave hypothetical eigenwave with normal incidence on the reflector poststack after stack prestack before stack primary reflection wave reflected only once on the way from source to receiver projected Fresnel zone projection of the Fresnel zone at the reflector to the acquisition surface along rays associated with the considered experiment Rayleigh wave surface wave propagating along the free surface of an elastic medium seismic traces discrete time series of pressure (marine case) or particle displacement (land case) recorded at the receiver seismogram ensemble of seismic traces associated with a certain source/receiver configuration semblance a normalised coherence measure stacking velocity moveout parameter in the CMP gather stack process of summing amplitudes across seismic traces; the resulting stacked section transport equation second-order differential equation describing the amplitudes of a wave in the high-frequency limit zero order ray theory approximate high-frequency approach for solving the wave equation
References 1 MANN, J., JAGER, R., MULLER, T., HOCHT, G., andHUBRAL, R: 'Commonreflection-surface stack - a real data example', J. Appl. Geophys., 1999,42, (3,4), pp. 301-318 2 TRAPPE, H., GIERSE, G., and PRUESSMANN, J.: 'Case studies show potential of common reflection surface stack - structural resolution in the time domain beyond the conventional NMO/DMO stack', First Break, 2001,19, (11), pp. 625-633 3 BERGLER, S., HUBRAL, P., MARCHETTI, P., CRISTINI, A., and CARDONE, G.: '3D common-reflection-surface stack and kinematic wavefield attributes', The Leading Edge, 2002, 21, (10), pp. 1010-1015 4 AKI, K. and RICHARDS, R G.: 'Quantitative seismology - theory and methods' (W. H. Freeman & Co., 1980, vol. 1) 5 CERVENY, V: 'Seismic ray theory' (Cambridge University Press, 2001)
6 SHERIFF, R. E. and GELDART, L. R: 'Exploration seismology' (Cambridge University Press, 1982, vol. 1) 7 VERMEER, G. J. 0.: 3-D seismic survey design (Society of Exploration Geophysicists, Tulsa, OK, USA, 2002) 8 YILMAZ, O.: Seismic data analysis (Society of Exploration Geophysicists, Tulsa, OK, USA, 2001) 9 BUTTKUS, B.: 'Spectral analysis and filter theory in applied geophysics' (Springer Verlag, 2000) 10 ROBINSON, E. A. and TREITEL, S.: 'Geophysical signal analysis', (PrenticeHall, Inc., 1980) 11 TANER, M. T. and KOEHLER, F.: 'Velocity spectra - digital computer derivation and applications of velocity functions', Geophysics, 1969, 34, (6), pp. 859-881 12 BEYLKIN, G.: 'Discrete radon transform', IEEE Trans. Acoust. Speech Signal Process., 1987, AASP35, pp. 162-172 13 VERSCHUUR, D. J., BERKHOUT, A. J., and WAPENAAR, C. P. A.: 'Adaptive surface-related multiple elimination', Geophysics, 1992, 57, (09), pp. 1166-1177 14 DEREGOWSKI, S. M.: 'What is DMO?', First Break, 1986, 4, (7), pp. 7-24 15 SCHNEIDER, W.: 'Integral formulation for migration in two and three dimensions', Geophysics, 1978, 43, pp. 49-76 16 SCHLEICHER, J., TYGEL, M., and HUBRAL, P.: '3-D true-amplitude finiteoffset migration', Geophysics, 1993, 58, (08), pp. 1112-1126 17 CLAERBOUT, J. F.: 'Imaging the earth's interior' (Blackwell Scientific Publications, 1985) 18 STOLT, R. H.: 'Migration by Fourier transform', Geophysics, 1978, 43, (1), pp. 23-48 19 GAZDAG, J.: 'Wave-equation migration by phase shift', Geophysics, 1978, 43, pp. 1342-1351 20 STOFFA, P. L., FOKKEMA, J. T., DE LUNA FREIRE, R. M., and KESSINGER, W. P.: 'Split-step Fourier migration', Geophysics, 1990, 55, (04), pp. 410^21 21 MAYNE, W. H.: 'Common reflection point horizontal data stacking techniques', Geophysics, 1962, 27, (6), pp. 927-938 22 DEREGOWSKI, S. M. and ROCCA, F.: 'Geometrical optics and wave theory of constant offset sections in layered media', Geophysics, 1981,29, (3), pp. 384-406 23 SCHLEICHER, J., TYGEL, M., and HUBRAL, P.: 'Parabolic and hyperbolic paraxial two-point traveltimes in 3D media', Geophys. Prospect, 1993, 41, (4), pp. 495-514 24 HUBRAL, P. and KREY, T.: Interval velocities from seismic reflection traveltime measurements (Society of Exploration Geophysicists, Tulsa, OK, USA, 1980) 25 HUBRAL, P.: 'Computing true amplitude reflections in a laterally inhomogeneous earth', Geophysics, 1983, 48, (8), pp. 1051-1062 26 HOCHT, G.: 'Traveltime approximations for 2D and 3D media and kinematic wavefield attributes'. PhD thesis, Universitat Karlsruhe, 2002. http://www.ubka.uni-karlsruhe.de/vvv/2002/physik/3/3.pdf
27 HOCHT, G., DE BAZELAIRE, E., MAJER, R, and HUBRAL, R: 'Seismics and optics: hyperbolae and curvatures', J. Appl Geophys., 1999, 42, (3,4), pp.261-281 28 BORTFELD, R.: 'Geometrical ray theory: rays and traveltimes in seismic systems (second-order approximations of the traveltimes)', Geophysics, 1989, 54, (3), pp. 342-349 29 NEIDELL, N. S. and TANER, M. T.: 'Semblance and other coherency measures for multichannel data', Geophysics, 1971, 36, (3), pp. 482-497 30 MANN, J.: 'Extensions and applications of the common-reflection-surface stack method' (Logos Verlag, Berlin, 2002) 31 BILOTI, R., SANTOS, L. T., and TYGEL, M.: 'Multiparametric traveltime inversion', Stud. Geophys. Geod., 2002, 46, pp. 177-192 32 DUVENECK, E.: 'Determination of velocity models from data-derived wavefront attributes'. Extended abstracts of the 65th meeting of the European Association of Geoscientists and Engineers, 2003
Part IX
Space-time techniques in communications
Chapter 22
STAP for space/code/time division multiple access systems Christoph M. Walke
22.1
Introduction
In recent years, STAP techniques for deployment in mobile radio systems have earned intense scientific interest. Apart from applications such as mobile station (MS) localisation for location-based services, the two major objectives of STAP in mobile radio communications are the reduction of required transmission powers by exploitation of antenna or spatial diversity gains and the use of spatial multiplexing, i.e. the provision of spatial communication channels based on the spatial inhomogeneity of the propagation medium. Both objectives will be examined in detail in this chapter. The examinations are based on a state-of-the-art cellular mobile radio system with isotropic antennas at the MS and a uniform linear antenna array with N isotropic elements at each base station (BS). The area that is to be covered by the radio system is assumed to be seamlessly paved with cells of hexagonal shape, with one fixed BS in the centre of each cell providing radio coverage for an average of K MS in each cell. The overall available transmission bandwidth Bs can either be used in each cell of the system or be shared among r cells constituting a frequency cluster (see Chapter 19 of Reference 9), with r denoting the size of a cluster counted in cells. The introduction of clusters with r > 1 reduces the available transmission bandwidth B = Bs/r per cell, but increases the distance between cells that use the same frequency band. With the increase of the distance between cells using the same frequency band, mutual intercell interference (ICI) between cells is significantly reduced. The cellular radio system is assumed to be interference-limited, i.e. thermal noise in the receive signals is negligible as compared with the impact of ICI. Furthermore, time division duplex (TDD) operation is assumed, i.e. the uplink (UL) direction where the MS transmit to their assigned BS is separated from the downlink (DL) direction where the BS transmits to the MS in the time domain. As a medium access technique, a
code Q
iV-fold spatial code reuse
code
code 1
code 1
Af-fold spatial code reuse
power per code
MS
MS
time slot 1 time slot duration frequency
bandwidth per cell B for frequency-reuse cluster size r total system bandwidth B8 = r B
Figure 22.1
Architecture of an SD/TD/CDMA system with N BS receive antenna elements, Q orthogonal spreading codes per cell and a total system bandwidth Bs shared among the r cells of one frequency-reuse cluster. k{ and k\ with i — 7 , . . . , NQ denote the different MS per cell
hybrid space division/time division/code division multiple access (SD/TD/CDMA) technique is considered,1 which controls the concurrent access of the K MS in each cell to the shared radio resource on the UL and DL, Figure 22.1. According to Figure 22.1, the time division multiple access (TDMA) component divides the time axis into fixed length time slots of duration Thurst that can be flexibly assigned to different MS and to the two duplex directions UL/DL. In each time This type of mobile radio system is standardised as universal mobile telecommunications system terrestrial radio access (UTRA) TDD mode in the international mobile telecommunications (IMT 2000) family of harmonised standards for third generation mobile radio systems [19]
slot, a burst containing phase-modulated (and optionally spread and scrambled) data symbols is transmitted from/to one MS. Furthermore, for the purpose of adaptive channel estimation, a midamble consisting of training symbols that are known to the receiver is also periodically transmitted within the bursts. The required frequency of such a channel estimation procedure strongly depends on the coherence time of the physical propagation medium, i.e. the channel between the MS and the BS antennas. With increasing velocities of MSs, of scatterers located in the propagation scenario and even of the BSs, if BSs are imagined to be located on trains, planes, busses etc., the channel coherence time decreases due to the increased Doppler spread. In the rest of this chapter, it is assumed that the channel coherence time is sufficiently larger than the duration of a burst (time-invariant channel), and that midamble-based channel estimation at the receiver takes place frequently enough to adaptively provide the receiver with up-to-date channel estimates (perfectly known channels). This adaptation process is thus much faster than any possible changes in the location of the MS or changes in the propagation scenario between MS and BS. The aforementioned assumption can be forced in each mobile communications system by fixing an upper limit of allowable velocities of objects in the propagation scenario and by setting the burst length and the frequency of channel estimation accordingly. The code division multiple access (CDMA) component additionally separates up to Q MS within one time slot by imprinting an MS-specific, binary spreading code on each phase-modulated data symbol of duration T3. Each binary spreading code consists of a sequence of Q binary elements ±1 of duration Tc — T8/ Q, where said binary elements are denoted as chips. The transmitted sequence of phase-modulated and spread data symbols then does not only change at symbol level, but at chip level. At the receiver, despreading is performed by applying a code-matched filter at chip level to recover the phase-modulated symbol. This direct sequence (DS) CDMA operation in combination with orthogonal or pseudo-orthogonal2 spreading codes creates code channels for the concurrent transmission of the data symbols of up to K = Q MS on the one hand and allows for the exploitation of frequency diversity on the other hand, because the original symbol bandwidth B/Q is g-times spread to the available transmission bandwidth per cell B and the transmitted symbols thus become more robust against frequency-selective fading. The reason for frequency-selective fading in the mobile radio system as considered in this chapter is multipath propagation, as it occurs in densely built pico- and microcellular propagation scenarios with low MS and BS antenna heights. Transmitted electromagnetic waves propagate on the line-ofsight path between transmitter and receiver and also on paths that stem from reflection, diffraction and scattering of the transmitted waves at obstacles in the propagation scenario. The received wave thus is a sum of differentially delayed, Doppler-shifted and attenuated waves that interfere constructively and destructively. The coherence bandwidth of the channel between transmitter and receiver is then significantly smaller
Even if orthogonal spreading codes are used at the K MS, the orthogonality of the signals received at the BS is destroyed when the K spread and scrambled bursts are transmitted over K different frequencyselective channels
than the transmission bandwidth per cell. On the contrary, in macrocells, it is possible that only line-of-sight propagation takes place between transmitter and receiver, and the channel then becomes flat (non-frequency-selective). The assignment of the orthogonal or pseudo-orthogonal spreading codes to the MSs that transmit within the same time slot (Figure 22.1) is handled by the BS, and the assigned codes are periodically made known to each MS, accordingly. In a pure CDMA system, if all available spreading codes within one time slot are already assigned to MSs, no further MS can be allowed within that time slot. However, in a CDMA system with an antenna array at least at the BS, it becomes possible to spatially separate two or more MSs even if they use the same spreading codes. Note that the spreading codes are further chip-level multiplied with a cell-specific scrambling code also consisting of Q chips. This scrambling operation (and the corresponding descrambling operation at the receiver) does not affect the cross-correlation between the spreading codes that are used by different MS during one time slot and within one cell, but reduces the cross-correlation between codes used by MS in different cells that use the same frequency band. Thus the scrambling operation further decreases the detrimental effect of ICI. The space division multiple access (SDMA) component theoretically allows us to assign each of the Q available spreading codes per time slot to up to N MS 3 per cell, which are separated by spatially selective transmission/reception of the A/-element BS antenna array. Per single time slot and cell, an overall number of NQ MS can be supported by the hybrid SD/TD/CDMA architecture. The number of MS sharing one spreading code is specified by the code reuse factor R = KfQ. As a performance measure for an SD/TD/CDMA system, the spectral efficiency: overall data rate per cell K r—A .A- p = — •m (22.1) system bandwidth Bs rQ in bit/s/Hz is widely used, where 770 denotes the spectral efficiency of a pure TDMA system with K=I active MS per time slot without spreading (Q = 1) and without frequency reuse in the cellular system (cluster size r = 1). 77 is conditioned on quality of transmission (QOT) parameters such as a desired bit error ratio (BER) for the transmitted data symbols of each burst, which is not exceeded within a certain outage probability. For example, increasing K increases the spectral efficiency r), but due to the increased amount of interference the BER also increases, so that the QOT is no longer met. In the rest of this chapter, we will exclusively concentrate on the UL of an SD/TD/CDMA system. Section 22.2 initially introduces a discrete-time system model to describe the spreading, transmission, propagation and reception of the data bursts of K concurrently active single-antenna MS in one cell of the cellular system, where a multi-antenna receiving BS being located in the centre of the cell is presumed and where only one time slot of the TDMA component of Figure 22.1 is considered. V=
The number of spatially separable MS per cell heavily depends on the spatial selectivity of the channel and/or the positions of the MS, see Section 22.5 for a detailed analysis
In Section 22.3, joint detection (JD) algorithms are presented, which perform STAP on the received signals in order to jointly recover the transmitted data symbols of the K MS. JD can be considered as spatio-temporal equalisation of the receive signals, which contain: • • •
multiple access interference (MAI) due to non-ideal separation of the signals of the K MS in the space and code domain inter-symbol interference (ISI) between the transmitted symbols of all K MS due to the frequency selectivity of the channel intercell interference (ICI) stemming from signals transmitted from MS of other cells in the cellular system that use the same frequency band.
The computational complexity of the JD algorithms can be significantly reduced by transforming the time domain (TD) system model into a block-diagonal frequency domain (FD) system model, which is presented in Section 22.4. In Section 22.5, the performance of the presented JD algorithms with respect to exploitation of spatial and frequency diversity, cancellation of MAI and ISI arising from MS within the considered cell (intracell interference cancellation) and cancellation of ICI arising from MS of neighbouring cells that use the same frequency band (ICI cancellation) is examined. Section 22.6 finally sums up the most relevant results on STAP in SD/TD/CDMA systems.
22.2
System model
The time-discrete single-time-slot chip-rate system model presented in this section is suitable for describing the reception of data bursts originating from K single-antenna MS of one isolated cell (the so-called reference cell) of a cellular radio system at a BS that uses an TV-element antenna array. The ICI stemming from concurrent transmission of data bursts of MS in neighbouring cells is modelled as noise. A slow fading channel is assumed throughout this chapter, i.e. the channel coherence time is assumed to be sufficiently larger than the duration Tburst of one time slot, for which the system model is valid. Then, a time-invariant channel impulse response can be applied. Furthermore, perfect frequency and time synchronisation is postulated. Aiming at the development of a stacked multiuser system model, the Nd phase(k)
modulated symbols d\ of MS k = 1 , . . . , K at symbol index n = 1 , . . . , Nd, which are transmitted in one burst during a time slot, are gathered in the A^-dimensional vector: (22.2) and then stacked into the Nd ^-dimensional total symbol vector: (22.3)
The receive signals xf1 sampled at each BS receive antenna element m = 1 , . . . , N at chip instance / are gathered in the (NdQ + L — 1)-dimensional vector: (22.4) The size of the receive vector x ^ is enlarged as compared with the data vector d(*) which contains Nd symbols due to Q-fold spreading/scrambling of each data symbol before transmission and because the spread and scrambled data symbols are transmitted over a frequency-selective physical propagation channel from the single MS antenna to the multiple BS receive antennas. This channel is modelled by a filter Y^mJc) w ^jj ^ chip-level spaced filter taps, which is different for each pairing of the receive antenna m = I,... ,N and transmitting MS k = 1 , . . . , K. The Af vectors x(m) a r e backed i n to the N(NdQ + L — 1)-dimensional total receive vector: (22.5) Similarly, a corresponding noise contribution n™ as received at each BS receive antenna element m = 1 , . . . , N is arranged in: (22.6) and (22.7) respectively, where the noise n models the ICI that is created by transmitting MS of nearby cells of the cellular system that use the same frequency band as the reference cell. Spread and scrambling of the data symbols d ^ of each MS k is performed by the NdQ x Nd matrix: (22.8) where c ^ denotes the combined spread and scrambling code applied by MS k and I(Nd) denotes the Nd x Nd unit matrix. Stacking the K matrices C ^ on a blockdiagonal yields the total multiuser spread- and scrambling matrix: C = Blockdiag[C (1) ,..., C ( / °]
(22.9)
The spatio-temporal channel matrix H is composed of Toeplitz subblocks (see p. 70 of Reference 6) according to:
(22.10)
The (NdQ + L — I) x NdQ Toeplitz subblocks of H are defined as: (22.11) and h^m'^ e C L with m = 1 , . . . , N and k = 1 , . . . , K represents the time-invariant, frequency-selective propagation channel between MS k and receive antenna element m sampled at chip rate and cut to a total length of L taps. The stacked system model for JD can then be expressed as x = HCd + n
(22.12)
= Ad + n
(22.13)
where the N(NdQ + L — I) x NdK JD system matrix A is introduced as a product of the total spread and scrambling matrix C and spatio-temporal channel matrix H.
22.3
Time domain linear joint detection
JD algorithms determine an estimate d of the total data vector d by removing the MAI and ISI contained in the total receive vector x according to equation (22.12). At least knowledge of the total system matrix A in equation (22.13) is required for JD, i.e. the receiving BS of each cell has to know the combined spread and scrambling codes of all its assigned K MS and needs to estimate the spatio-temporal channel matrix H, which contains the channel impulse responses between each MS k = 1 , . . . , K and each receive antenna array element m = 1 , . . . , N, equations (22.10) and (22.11). A spatiotemporal channel matrix estimate may be adaptively acquired by blind [15, 25], semi-blind [4] or non-blind [23] multiuser channel estimation algorithms. In mobile communications systems, most commonly non-blind channel estimation is performed based on midambles (training sequences), which are transmitted within data bursts and contain symbols that are known to the receiver, so that the channel impulse response can be easily determined by solving a set of equations at the receiver, with the channel taps as unknowns. In general, the length of such a training sequence depends on the length of the channel impulse responses that have to be estimated (this depends mainly on the maximum delay spread of the channels) and the number of different channel impulse responses that are required (this depends mainly on the number of transmit antennas, whereas the number of receive antennas is without influence). In state-of-the-art mobile radio systems, the midamble may consume up to 25 per cent of each data burst. For notational simplicity, perfect channel estimation is assumed. JD algorithms jointly estimate the data symbols of all K concurrently transmitted data bursts, instead of estimating only the data symbols of one data burst k and considering the K — 1 remaining data bursts as interference, as is the case in single detection algorithms (see Chapter 3 of Reference 26). JD algorithms are both suited for single-antenna receivers, which are not able to exploit the spatial diversity of the
MIMO channel, and multi-antenna receivers with options to include spatial maximum ratio combining (MRC) [10, 21] and optimum combining (OC) [28, 32, 21]. 4 If an estimate of the spatio-temporal noise (ICI) covariance matrix is available, JD algorithms can be further extended to suppress dominant intercell interferers. Three major classes of JD algorithms can be identified for application in the time domain: non-linear algorithms such as the maximum likelihood sequence estimator (MLSE, see Chapter 4.1 of Reference 26, Chapter 5.1.4 of Reference 20 and Chapter 2.4 of Reference 13), linear algorithms (see Chapter 2.5 of Reference 13) based on the zero forcing (ZF) or minimum mean square error (MMSE) criterion and decision feedback (DF) algorithms (see Chapter 2.6 of Reference 13, and Reference 33) as successive ZF/MMSE MAI and/or ISI cancellation approaches. Only linear JD algorithms will be further discussed, for they represent a reasonable tradeoff between performance and computational complexity and easily lend themselves to analytical description.
22.3.1
Zero forcing block linear equalisation
The zero forcing block linear equaliser (ZF-BLE) stems from the classical Gauss-Markov theorem (see Chapter 4.4 of Reference 17) and is specified by the equalisation criterion: d = arg min
((x - AdO 77 Q" 1 (x - AdO)
(22.14)
where Q n = Efnn77} denotes the noise covariance matrix. The total symbol vector estimate is yielded by multiplying the stacked total receive vector x with the spatiotemporal filter matrix T as: d= T x
(22.15)
= (AHQ-1A)"1A//Q;1x
(22.16)
= d + (A 7 7 Q^A)- 1 A 7 7 Q- 1 Ii
(22.17)
The ZF-BLE produces an unbiased estimate with minimum variance (see Chapter 11.5 of Reference 31 and Chapter 4.4 of Reference 17), i.e. structured ISI and MAI in the data symbol estimate d are completely removed. The price to be paid for the lack of bias in symbol estimate d is the introduction of additional correlations in the ZF-BLE output noise term in equation (22.17), where the output noise covariance matrix is given as (A 7 7 Q- 1 A)" 1 (see Chapter 4.4 of Reference 17).
4
In MRC, the outputs of the multiple antennas are considered as spatial diversity branches and are combined in the sense of a spatial matched filter in order to maximise the symbol-to-noise ratio of the combined signal that is yielded by combining the branch signals. In contrast, in OC, the branch signals are combined in order to maximise the symbol-to-noise-and-interference ratio of the combined signal by modifying the spatial matched filter in a way in which dominant intracell interferers are spatially cancelled
22.3.2
Minimum mean square error block linear equalisation
A generally improved minimum variance estimate (see Chapter 4.5 of Reference 17) d is yielded by extending the ZF-BLE by a Wiener estimator (see Chapter 11.5 of Reference 31) to a minimum mean square error block linear equaliser (MMSE-BLE) according to: (22.18) (22.19) Wiener estimator
ZF-BLE
where Qd = Efdd77} denotes the data covariance matrix. The MMSE equalisation criterion is then given as: (22.20)
The Wiener estimator exploits knowledge of the noise covariance (A^Q" 1 A)" 1 in the ZF-BLE estimate according to equation (22.17) and on the data covariance matrix Qd in order to improve the ZF-BLE estimate. Note that the Wiener estimator causes the MMSE-BLE estimate d to become biased.
22.4
Frequency-domain linear joint detection
A significant reduction of the computational complexity associated with the inversion of the NdKxNdK matrices A ^ Q - 1 A (ZF-BLE, equation (22.17)) and (A^Q" 1 A + Q^ ) (MMSE-BLE, equation (22.18)) is achieved by transforming the time domain (TD) JD system matrix A into the block-diagonal frequency domain (FD) system matrix T) A. Owing to the block structure of T) A, inversion complexity is reduced from O(NjK^) to Nd - O (K^) (neglecting the permutation and discrete Fourier transform (DFT) overhead). Furthermore, to preserve the block-diagonal structure in the FD, uncorrelated data symbols with covariance matrix Q^ = CTJI(A^/Q and a decoupled spatio-temporal noise covariance matrix: Qn=Qa®l(NdQ+L-l)
(22.21)
composed of an Af x iV spatial noise covariance matrix Q^ and an (Nj Q + L — 1) x (NdQ + L - I ) uncorrelated temporal noise covariance matrix are assumed. 22.4.1
Block-diagonal FD system model
The basis of the considerations on FD-JD is the transformation of the TD system matrix A in equation (22.13) with blocked block-Sylvester form into a blockSylvester matrix A&y with Nd blocks of size N(Q + L — \) x K and finally into a block-diagonal matrix T) A with block-size NQ x K, Figure 22.2.
A
A4, block-Sylvester structure
blocked blockSylvester structure
Figure 22.2
block-circulant structure
Exemplary structure of TD-JD system matrices A, A^ and A^ for K = 3andN = 2
D e f i n e t h e N(NdQ m a t r i x P ( ^ 1 ,A^2):
+ L -
1) x N(NjQ
[P(NuN2)]Ni(n2-l)+ni,N2(ni-l)+n2
fl: = 1
[O:
+ L - I ) unitary permutation
/ii = l , . . . , M 2 = 1,...,A^
W
(22.22)
else
The system matrix in block-Sylvester form A ^ then can be written as: Afo = P(Ar,^(2+L-i) • A • ¥(Nd,K)
(22.23)
where the N(Q + L — I) x K block A^ in the left upper corner of A^5 carries the complete matrix information, Figure 22.2.
The block-Sylvester matrix A ^ of dimension N(NdQ + L — 1) x NdK can be extended to a NQNd,be x Nd,beK block-circulant matrix (see Chapter 5.6 of Reference 6) with block size NQ x K by increasing5 the number of block columns from Nd to: Ndjbc = Nd + W
(22.24)
with (22.25)
and wrapping around the surplus rows of the last W blocks Ab, as visualised in Figure 22.2. A&c is then yielded as:
(22.26)
The relationship between the blocked block-Sylvester TD system matrix A and the block-circulant TD system matrix A&c can be expressed by introducing the NNd,bc Q x N(NdQ + L - I ) combined row selection and permutation matrix P r : (22.27) and the NdK x Nd,be K combined column selection and permutation matrix P c : (22.28) as (22.29) Block-circulant matrices lend themselves to block diagonalisation via block discrete Fourier transform (DFT) matrices (see Chapter 2.5 of Reference 6), a property evolving from the fact that the eigenvectors of a circulant matrix are simply the inverse Fourier vectors contained columnwise in the JVi x N\ DFT matrix F(N1)5 Increasing the number of block columns requires the appendage of W zeros at the end of each data vector d^ ^ (equation (22.2)), i.e. a guard period of minimum length W is assumed after the transmission of each data burst. Alternatively, as well a cyclic transmission model with the last W symbols in d ^ overwriting the first W data symbols can be applied to force the block circulance of the system matrix
A unitary block-DFT matrix can then be defined as: F(NuN2) = F(N1) ® 1(N2)
(22.30)
and the block diagonalisation of A^c follows as: ^bc = FfNdb^NQ) • T)A • F(NdJc9K)
(22.31)
The block-diagonal FD system matrix T) A is composed of NQ x K block eigenvalues T) ^ ? w * m n — IJ • • • > Nd,bc> and is determined as:
(22.32)
where A^ denotes the first Nd,bcNQ x K block column of A ^ . Plugging together equations (22.29) and (22.31), the desired block diagonalisation of the TD system matrix A, yielding the FD system matrix T) A , follows as: (22.33)
The block diagonalisation of the TD system matrix A via permutation and DFT matrices turns the transmission of the total symbol vector d over a frequency-selective system channel into the transmission of N^bc representations of d at different frequency indices over N^bc frequency-flat NQ x K system channels stemming from the DFT of the block-circulant system matrix A&c. This is easily seen by introducing the transformed vectors: (22.34) (22.35) and (22.36) and rewriting equation (22.12) with A of equation (22.33) ?
= VA • D + n
(22.37)
or, equivalently in decoupled manner: (22.38) This block-diagonal FD system model is depicted in Figure 22.3.
22.4.2 FD ZF-BLE and MMSE-BLE The decoupled transmission model for each frequency index n = 1 , . . . , Nd,bc given by equation (22.38) can be interpreted as multiuser detection problem with NQ equations and K unknowns. Thus at each frequency index n, the TD-JD algorithms as
Next Page
Figure 22.3
Block-diagonal FD-JD system model (with Nbc = ^d,bc)
specified in Section 22.3 are applicable, yielding the equalisation matrix T according to equation (22.15) for the FD ZF-BLE: T
=VcF?NdM,K){?>%d(Nd,bcQ) ® Q ; 1 ) ^ ) " 1 X V%V{NdJbcQ) ® Qal)F(NdM,NQ)Vr
(22.39)
and the FD MMSE-BLE: T
=
P
^ ) W ( I ( %
c
0 ® Qal)^A +
X VHA{l{Nd,bcQ) ® Q^)J7MjV9NQ)Vr
22.5
(22.40)
Performance of FD joint detection
In the decoupled FD system model of equation (22.38), the K columns of each NQ x K block eigenvalue T>^ at frequency index n = 1 , . . . , Nd,bc can be interpreted as K vectors in an NQ-dimensional space, where the vector magnitude represents the amount of available channel and spreading code power at frequency index n = 1 , . . . , Nd,be- If a JD is applied to equalise each block eigenvalue, as seen from one specific MS, only the amount of channel and spreading code power lying in the subspace that is orthogonal to the subspace spanned by the vectors of the K - 1 other MS can be exploited.6 For K = 1, both ZF/MMSE-BLE according to If in addition to the JD of K data bursts stemming from MS within the cell ICI cancellation of KJCI dominant intercell interferers is to be performed, the amount of exploitable spatial and frequency diversity is further reduced
Previous Page
Figure 22.3
Block-diagonal FD-JD system model (with Nbc = ^d,bc)
specified in Section 22.3 are applicable, yielding the equalisation matrix T according to equation (22.15) for the FD ZF-BLE: T
=VcF?NdM,K){?>%d(Nd,bcQ) ® Q ; 1 ) ^ ) " 1 X V%V{NdJbcQ) ® Qal)F(NdM,NQ)Vr
(22.39)
and the FD MMSE-BLE: T
=
P
^ ) W ( I ( %
c
0 ® Qal)^A +
X VHA{l{Nd,bcQ) ® Q^)J7MjV9NQ)Vr
22.5
(22.40)
Performance of FD joint detection
In the decoupled FD system model of equation (22.38), the K columns of each NQ x K block eigenvalue T>^ at frequency index n = 1 , . . . , Nd,bc can be interpreted as K vectors in an NQ-dimensional space, where the vector magnitude represents the amount of available channel and spreading code power at frequency index n = 1 , . . . , Nd,be- If a JD is applied to equalise each block eigenvalue, as seen from one specific MS, only the amount of channel and spreading code power lying in the subspace that is orthogonal to the subspace spanned by the vectors of the K - 1 other MS can be exploited.6 For K = 1, both ZF/MMSE-BLE according to If in addition to the JD of K data bursts stemming from MS within the cell ICI cancellation of KJCI dominant intercell interferers is to be performed, the amount of exploitable spatial and frequency diversity is further reduced
equations (22.39) and (22.40) reduce to a scaled matched filter that uses the power of all NQ dimensions and thus performs maximum ratio combining (MRC) of the NQ available space/code branches under exploitation of all spatial and frequency diversity that is available at that frequency index n. For K > 1, i.e. increasing intracell interference, the amount of exploitable power and diversity decreases accordingly. If an FD ZF-BLE is used and only a small amount of power can be exploited in the remaining orthogonal subspace, the system channel of MS k at the considered DFT index n is powered up by the ZF-BLE, so that the product of the block eigenvalue *D1^ and the FD equalisation matrix T ^ , which is applied to each block eigenvalue at DFT index n and represents the FD equivalent of the TD equalisation matrix T, equals the K x K unit matrix. Note that powering up the system channels also increases the noise power at the output of the ZF-BLE, so that the SNIR deteriorates. The trade-off between deploying the NQ degrees of freedom (DoF) in the space and code domain to either exploit spatial and frequency diversity or to cancel intra- and intercell interference will be examined in the following three subsections with respect to the efficient application of JD algorithms in a cellular SD/TD/CDMA system. 22.5.1
Exploitation of spatial and frequency diversity
Each column of the NQ x K FD-JD block eigenvalues T>^ at frequency index n = 1 , . . . , Nd,bc (equation (22.32)) can be decomposed into a sum of contributions stemming from single physical signal propagation paths m = 1 , . . . , M ^ . M^ denotes the number of multipaths between MS k = 1 , . . . , K and the receiver antenna array, which are characterised by delay r^m\ Doppler7 frequency / ^ , attenuation and phase shift /3^ and antenna array response vectors8 S^1J1 e C ^ and Sy^ G C 1 for the TV-element BS antenna array and the single-antenna MS k — 1 , . . . , K, respectively. The convolution of each MS-specific combined spreadand scrambling code: c(k)(t)
= {[c(*)]l5(0
+ . . . + [c<%5(* - (Q - \)TC)}
(22.41)
with the pulse shaping filter pr(t) at MS site and the (matched) filter pR(t) at the BS receiver site yields the continuous-time signature: g{h\t) = pR(t) * c(k\t) * pT(t)
(22.42)
The delayed and Doppler-shifted echo caused by propagation path m then reads as: g(k>m)(t) = eJ2*fDt. g(M0(j _ r ( m >)
(22A3)
7 Owing to the assumption of a slow fading channel throughout this chapter, the effect of Doppler shifts are negligible. However, for completeness of presentation, the Doppler shifts are included at this stage of the presentation Under the far-field and narrowband assumption, the antenna array response vectors can be modelled as steering vectors, i.e. the receive signal at different receive antenna elements only differs by a phase shift which depends on array geometry, carrier wave length and arrival and departure angles of each propagation path
Synchronous baseband sampling of g^k^ at chip rate rc = 1/TC, windowing to a total length of Q + L — 1 discrete samples starting from the discrete-time instance —&o and padding N^bcQ ~ (L + Q — 1) zeros at the end of g^m\nTc) produces the Nd,be Q -dimensional vector:
(22.44) k m
Selection of each Qth contribution in g( > \ starting with offsets q = 1 , . . . , Q, yields Q columns of Nd,be -dimensional vectors according to: (22.45) which are subject to an Nd ,be -point DFT (equation (22.30)):
The kth column, k = 1 , . . . , K of each NQ x K block eigenvalue uy index n — 1 , . . . , Nd,be then can be rewritten as:
at frequency
(22.47)
where $%$
€ C^ denotes the nth column of the Q x Nd,bc matrix
It is instructive to note that T)A products of: • •
represents the sum over M^
Gf^y Kronecker
frequency-index-independent rank-1 N x 1 outer products of the receive and transmit antenna array responses S ^ and s ^ frequency-index-dependent <2-dimensional vectors &\N™£)9 which are composed of the nth points of the DFTs of samples of the delayed and Doppler-shifted signature g(*'m)(0>
where each Kronecker product is additionally weighted with the complex path amplitude £ (m) . The degree of spatial and frequency diversity offered by the two Kronecker factors in equation (22.47) will be investigated in the following two subsections. 22.5.1.1 Flat fading In the frequency flat fading case, the delay spread Ox of all occurring paths M^ between MS k and the BS is small compared to the chip duration Tc. For notational simplicity, a root-raised-cosine pulse shape filter pj(t) and corresponding matched receive filter pR(t) is assumed in the sequel. Furthermore, slow fading channels and perfect temporal synchronisation for each MS k is assumed, i.e. the mean path delay
scattering region around single-antenna MS A:
BS receiver antenna array
Figure 22.4
Gaussian angle of arrival channel model
of all M^ multipath echoes received from MS k is assumed as a multiple of the chip duration Tc in equation (22.43), and g^'m) in equation (22.44) reduces to the spreading and scrambling code C ^ embedded into an {Ndjbc Q)-dimensional zero vector. The Mi column of each block eigenvalue according to equation (22.47) then can be rewritten as: jy(k,n) _ ej2n{n{\-nd)-l)/Najbc
. c(*) ^
g(*)
(22.48)
where rid e N + is determined by the mean path delay. The spatial signature (see Chapter 3.3 of Reference 14) s^ between MS k and the BS antenna array is defined as: (22.49) As can be deduced from equation (22.48) for the flat fading case, the block eigenvalues at different frequency indices n = 1 , . . . , N^bc only differ by a phase shift. Furthermore, obviously no frequency diversity is available. The degree of spatial diversity offered in each block eigenvalue T) A by the spatial signature s ^ in equation (22.49) can be investigated by considering a Gaussian angle of arrival (GAA)9 channel model (see Chapter 6.2.8 of Reference 14, and Reference 7) as depicted in Figure 22.4. A single MS with azimuth angle
A channel model with uniform angle distribution is discussed in References 21 and 7, leading to oscillating interantenna correlations and the need to differentiate between the real and imaginary parts of the impinging signals. Here, the GAA model is found to be more suited for discussion of correlation issues
deviation O^^GAA- For simplicity of presentation, a uniform linear array (ULA) with interelement spacing d is assumed as the antenna array configuration. Note that for large CT^GAA, the scattering region also surrounds the BS antenna array. For normally distributed path attenuations P^ with zero mean and variance °\ GAA^ m e n o r m a lised magnitude of the correlation: (22.50)
between the elements of the spatial signature s^ ^ that are associated to antenna element m\ and W2 can be given in closed form as: (22.51) If maximum ratio combining (MRC) is applied at each block eigenvalue in order to exploit spatial diversity, the correlation between the elements in each column of *DA has to be as low as possible.10 According to equation (22.51), the largest correlation appears for adjacent receive antenna elements in the ULA, e.g. r\^ and is depicted exemplarily in Figure 22.5 for different angles of arrival (po,GAA and standard deviations O^^GAA as a function of d (normalised to the carrier wave length kc). For a fixed normalised interelement spacing d/Xc, correlation rapidly decreases for increasing standard deviations CF^GAA (increasing spatial selectivity of the channel) and in general is smallest for signals impinging on the array from broadside (^\ Consider the single antenna MS and BS case first. The multipath delays r ^ then become temporally resolvable at different chip instances, causing shifts r > Tc of the pulse-shaped combined spreading and scrambling codes c ^ in g(*»m). The consequence of these shifts and the selection of each Qth sample of g^'m) with different offsets q = 1 , . . . , Q in equation (22.45) is as follows: the summation over all M^ <2-dimensional vectors 0(^7^) m equation (22.47) yields not only the nth point of the DFT of the sum of the delayed, Doppler-shifted and attenuated pulse shapes of each path m, as would be the case without spreading (Q = 1), but Q times the nth point of the DFT of different sums of delayed and attenuated pulse shapes. These sums at each discrete-time index are weighted with different combinations of the spreading code elements. If the spectrum of the channel impulse response has deep frequency-selective fades, the probability of all Q DFTs 10 For a detailed investigation of the degradation of MRC caused by correlation see Chapter 5.2.5 of Reference 10, and Reference 21
Figure 22.5
Magnitude of receive antenna element correlation for GAA model as a function of the antenna interelement spacing d/Xc, parameterised with MS azimuth angle cpo,GAA and angle standard deviation CT^GAA
of differently weighted channel taps fading away at one specific frequency index n is significantly decreased. Frequency diversity can then be exploited by applying MRC over the Q entries of *D^ , as it is exemplarily depicted in Figure 22.6 for a channel impulse response with M^ = 3 paths and 2 = 1,2 and 4. In the upper plot of Figure 22.6, where Q = 1, summation over the Q entries of each column of a block eigenvalue D £ in the sense of MRC only yields a single entry with index q = 1. The power available from MRC at each DFT index n can be interpreted as the transmission factor of a frequency domain propagation channel. With Q = 1, no frequency diversity can be exploited, so that deep fading of the transmission factor for some of the DFT indices n can not be combated, leading to distortions of the received symbols when transferred back to the time domain. However, for Q > 1 (middle and lower plot), the transmission factors for single indices q still tend to fade away for some of the DFT indices n, but due to the power collection performed by MRC over the Q column entries of each block eigenvalue D^ , the overall MRC transmission factor can be prevented from deep fades, resulting in less symbol distortions in the time domain. The same observations hold for the multi-antenna receiver case. At each frequency index n, X>^ 'n* according to equation (22.47) can be interpreted to contain the stacked
power, dB power, dB power, dB
DFT index n
Figure 22.6
Power for each block eigenvalue element YDA 'n ]q, q = U • • • > Q and for MRC of the Q elements with k = 1, n = / , . . . , N^bc and fixed burst size Nd,be Q = 1024. Exemplary frequency-selective channel withM^k) = 3paths, path delays r (1) = Tp, r{2) = 4.4TC, r (5) = 8.2TC and amplitudes ft]) = LO, p& = 0.89eJ'0-46, ft3) = 0.56e-J'0J9
results of Q summations over M^ A^-dimensional array response vectors S ^ sTm » which are independent of the frequency index n but, within each summation, weighted with different frequency-index dependent and spreading-dependent elements of the Q-dimensional vector B^^y For 2 = 1 , only one Af-dimensional summation over the M ^ array response vectors is performed, allowing for the exploitation of spatial diversity at each frequency index n only. For Q > I5 Q differently weighted Af-dimensional summations are available at each frequency index n, combining spatial and frequency diversity. Thus in contrast to the flat fading case, where only N different (spatial) fading branches for MRC are available at each block eigenvalue (equation (22.47)), in the frequency-selective case, NQ different fading branches can be exploited.
22.5.2 Intracell interference cancellation As performance criterion for the intracell interference cancellation performance of the FD ZF/MMSE-BLE, the signal-to-noise-and-interference ratio SNIR of each estimated data symbol [d]n, n = 1 , . . . , Nd of the data burst of MS k — 1 , . . . , K at the output of the spatio-temporal equalisation matrix T (equations (22.39) and (22.40)): desired symbol
(M) =
_E{l[diag(TA)d];|2} E{|[diag(TA)d]/j+E{|[Tn]y|2} ISI and MAI
noise
is applied, where the operators diag(-) and diag(-) return the diagonal of a matrix and a matrix with zeros on the diagonal, respectively, the operator [•]# returns the kth element of a vector and the symbol selection index j is defined as j = (k—l)Nd + n. y(k,n) -s con( iitioned on the system channel A (or T>A) and further depends on the data and chip transmission powers as well as the noise (ICI) power. y^k^ lends itself to examination based on a statistical description or Monte Carlo simulation of the system matrix and the noise and data covariance matrices. Also, the average uncoded bit error ratio (BER) of each detected data burst of MS k = 1 , . . . , K will be frequently considered as a performance measure under the assumption of quaternary phase shift keying (QPSK) modulated data symbols. To initially investigate the performance of intracell interference (MAI and ISI) cancellation, the ICI contained in the noise vector n is modelled as spatio-temporally uncorrelated process in this subsection, which is a common assumption for cellular radio systems with large numbers of MS in each cell. The ICI model will be refined in subsection 22.5.3. In addition to the uncorrelated temporal noise covariance matrix as introduced in equation (22.21), the spatial noise covariance matrix Q^ in equation (22.39) then equals the scaled identity matrix Cr2I(N), where a2 denotes the noise power. Plugging equation (22.39) into equation (22.52) directly11 yields the FD ZF-BLE SNIR: (22.53) Owing to the assumption of a circulant system model, the SNIR for all Nd transmitted symbols in a burst of MS k is equal, and the symbol index n can be skipped. The SNIR y^ then is determined by the symbol power a j , the noise power a2 and the average value of the £th diagonal element12 of the inverted Grammian13 of the block eigenvalue V^ taken over all frequency indices n = 1 , . . . , Ndjbc
11 The matrix F(Nd t,c,NQ) Pr is assumed to be unitary here. This holds for (L — 1)/ Q e N and can be approximated for Nc » L — 1, which in general is given 12 The notation [ - ] ^ denotes the kth diagonal element of a square matrix 13 The Grammian of a matrix X is defined as X ^ X
22.5.2.1 Flat fading In the flat fading case, each column of the block eigenvalues at different frequency indices n = 1,.. .,Nd,bc is equal up to a phase rotor, equation (22.48), so that taking the average over Nd^bc inverted Grammians of block eigenvalues according to equation (22.53) can be replaced by the inverted Grammian of the first block eigenvalue T>A . The SDMA component of the SD/TD/CDMA architecture becomes vital in the case of code reuse, i.e. when two or more MS within the same cell use exactly the same combined spread- and scrambling code and thus have to be spatially separated. For the sake of simplicity, in the rest of this chapter orthogonal binary spreading codes taken from a Hadamard tree (see page 161 of Reference 16) will be used. The choice of orthogonal spreading codes allows for a decoupled investigation of MS separation in the domains code and space, because the orthogonality of the transmitted signals is not destroyed by flat fading channels. This, however, does not hold for frequency-selective channels, as will be discussed in subsection 22.5.2.2. As seen from MS &, reuse of its code c ^ splits the overall number of MS K into two subsets of spatially interfering MS K1 and spatially non-interfering MS K.NI according to: K f = [k' I c « " - c^) = Qa2c} Kf1 = Ik1Ic^" -C^=O)
(22.54) (22.55)
where o% is the code transmission power and K^ denotes the number of spatially interfering MS, i.e. the number of elements of Kj . For a statistical description of the FD ZF-BLE14 performance in the flat fading case, it is sufficient to define an Af x K{k) matrix S^ } containing the spatial signatures sf] of all K{k) MS Id e Kf} that spatially interfere:
Sf = [sp...s(y]
(22.56)
Assuming equal statistics15 of the spatial signatures contained in S^ , the FD ZF-BLE SNIR of equation (22.53) simplifies to the flat fading expression: (22.57) with the average SNR per receive antenna element being defined as: (22.58) 14 In contrast to the FD MMSE-BLE, the FD ZF-BLE lends itself easily to analytical treatment, and thus will be considered exclusively in the rest of this chapter. The FD ZF-BLE performance can always be regarded as an upper bound on the FD MMSE-BLE performance [32] 15 Slow power control of each MS k is assumed here, so that the spatial signatures S ^ only contain the fast fading attenuation and not the attenuation due to path loss and shadowing
and a\ and a2 denoting the average channel and combined spread- and scrambling code power, respectively. Note that y^ is conditioned on the time-variant channel characteristics contained in the spatial signatures of S ^ . The probability density function (PDF) py{y^) of the SNIR y^ can be given in closed form for the following marginal cases of the GAA model (Section 22.5.1.1): •
Fading-large angle spread (F-LAS): assumptions: o^ ^> 0, M^ -> oo, arbitrary MS position. In the GAA model, low correlations rmi?m2 between the signals impinging on two adjacent BS antenna elements are achieved for large angle spreads o^ (corresponding to large standard deviation G^GAA, equation (22.51) and Figure 22.5), which imply the presence of scatterers around both MS and BS. This type of scattering scenario is modelled by the Rayleigh channel model (see Chapter 9.9 of Reference 9), where the entries in the spatial signature S^ are assumed i.i.d. N(O, a%)-distributed16 irrespective of the position of the MS and BS antenna elements, y^ then is x2-distributed17 with: N
M]RC
=
N
~
Kik)
(22-59)
+ 1
degrees of freedom (DoF) (see Reference 32 and Chapter 5.1.3.1 of Reference 30). Note that the DoF represent the diversity order, i.e. the number of independently fading spatial branches, available at the receiver.18 The density, expectation and variance o f / ^ are summarised in Table 22.1. Exactly the same density is yielded for MRC in a flat Rayleigh fading environment with A ^ c BS antenna elements and only one MS (see Chapter 5.2.2 of Reference 10, and Reference 32). Thus, as seen from one MS k, the suppression of each of the K^ — 1 other MS (intracell interferers) that deploy the same combined spread and scrambling code reduces the effective number of diversity branches by one, which degrades the diversity gain of the FD ZF-BLE. Fading-small angle spread (F-SAS): assumptions: a^ —> 0, M^ —>• oo, orthogonal spatial signatures s^\ For small angle spreads a^ (small standard deviation CT^GAA), the signals impinging on the BS antenna elements are completely correlated (equation (22.51) and Figure 22.5), i.e. undergo the same fading process stemming from local scattering at the MS k. No independently fading branches are available, reception with Af antenna elements only causes Af-fold
•
6
The complex-valued normal distribution N(O, o ) is related to the real-valued normal distribution
N(0,a2/2) via N(O,a2) = N(0,a2/2) + j • N(0,o2/2) 17
The complex-valued
x2 -distribution
is related to the real-valued
x2 -distribution
via
x\N){x) = 2x\2N)(2x) ° The DoF in the F-LAS are easily derived from the vector space model introduced at the beginning of this Section [2]: despreading projects each of the K MS-specific iV^-dimensional vectors into an iV-dimensional subspace, which is shared only among the K^ spatially interfering MS. With HD complex normally distributed entries in the spatial signatures of each MS, each MS on average has l/N of its power in each dimension of the ^-dimensional vector subspace. The subspace orthogonal to the subspace spanned by the K^ - 1 other MS contains only (N - (K^k) - X))/N of the overall power of each MS k, so that the maximum diversity gain of iV reduces to Nj^RC
Table 22.1
Normalised PDF, expectation and variance of the SNIR at the output of the FD ZF-BLE in the F/NF-SAS and F-LAS channel with uncorrelated noise
GAA scenario
F-LAS (Rayleigh) F-SAS NF-SAS (LOS)
SNR improvement at the FD ZF-BLE output, defined as antenna gain: (22.60)
•
The density of y^ in Table 22.1 is x?n with only one DoF, resulting in increased variance Af2 of y^ compared with the single MS F-LAS case with Nj^RC = N. Several spatially interfering MS k can only be spatially separated when their distance in azimuth is larger than the BS antenna beam width (psw^ so that all spatial signatures s^ become pseudo-orthogonal. However, spatial separation of MS does not further decrease the antenna gain as in the F-LAS case. No fading-small angle spread (NF-SAS): assumptions: o^ —>> O, M^ = 1, orthogonal spatial signatures s^ . The signal between MS k and the BS propagates only on the line-of-sight (LOS), and no fading occurs at all. As in the F-SAS case, the antenna gain ga equals the number of receive antenna elements Af, but with zero variance due to the lack of fading (Table 22.1). Orthogonal spatial signatures are required for spatial separation of K > 1 MS, causing no further antenna gain reduction.
To illustrate the difference between antenna gain in the F-SAS and NF-SAS cases on the one hand, and diversity gain in the F-LAS case on the other hand, closed-form expressions for the average probability of bit error: (22.61) for coherent QPSK modulation are summarised in Table 22.2, with F(-), pFq(-) and erfc(-) denoting the gamma function, Gaussian hypergeometric series and
Table 22.2
Average BER achieved by the FD ZF-BLE for different flat fading channel models (coherent QPSK, uncorrelated Gaussian noise)
GAA scenario
F-LAS (i.i.d. Rayleigh)
F-SAS
NF-SAS (LOS)
complementary error function, respectively (see page 255, 556 and 297 of Reference 1, respectively, and Reference 11). Note that the demand for uncorrelated noise required to derive the FD ZF-BLE SNIR in equation (22.53) has to be extended to the demand for uncorrelated Gaussian noise, so that the conditional BER for coherent QPSK PQpSK\y«) = Q(Vr^) = (l/2)erfc(vV fc )/2) can be applied (see Chapter 5.2.7 of Reference 20). Taking PQPSK(P) as a function of the SNR p, parameterised with Q and Af or NMRC> ** *s r e a ( hly s e e n m a t increasing the spreading gain Q only shifts the abscissa without altering the shape of PQPSK(P)- The same holds for increasing the antenna gain ga = N in the two considered SAS cases. In contrast, changing N^0 in the F-LAS case does alter the shape of PQPSK(P)> The diversity gain is then defined as the reduction of SNR required to meet a target PQPSK when increasing the number of receive antenna elements. Note that the diversity gain depends on the target PQPSK, whereas the antenna gain ga does not.19 This dependence is sketched in Figure 22.7 for the F-LAS, F-SAS and NF-SAS cases. Note that, compared with the F-SAS case, the reduced variance of y^ in the F-LAS case (Table 22.1) leads to considerable performance improvement for N = 4 receive antenna elements and one spatially interfering MS K{k\ so that N$RC = 4. However, if K{k) = 4 MS have to be spatially separated, NMRC = 1 holds, and the performance of the F-LAS reduces to the single receive antenna case with one spatially interfering MS and NMRC = 1. Figure 22.8 depicts FD-JD results for an F-LAS channel and both FD ZF- and MMSE-BLE. As performance measure, the average received energy per bit per noise spectral power density: (22.62)
19
A diversity measure independent of the target PQPSK is introduced in Chapter 3.4 of Reference 5
F-LAS F-SAS NF-SAS
diversity gain F-SASiV= 4 antenna
NF-SAS N= 4
NF-SAS JV= 1 antenna gain
Figure 22.7
Average BER as function of the average SNR per receive antenna elementfor the F-LAS andFVNF-SAS channel models and N or NMRC receive antenna elements
required by the FD-JD algorithms to achieve a target raw BER20 PQPSK = 10~3 is evaluated rather than presenting the BER plots they stem from, Figure 22.7. The Q orthogonal spreading codes are randomly assigned to the K MS under the constraint that the size K{k) of the set of spatially interfering MS Kf} is equal for alU = 1 , . . . , K MS. For R = K/Q > 1, then K{k) = R MS spatially interfere and have to be separated under loss of spatial microdiversity, where R represents the code reuse factor. N= 1, 2, 4 and 8 BS antenna elements are considered. For R= 1, FD ZF-BLE and FD MMSE-BLE perform exactly equal, because the spatio-temporal equalisation matrices T according to equations (22.39) and (22.40) reduce to a scaled matched filter in both cases. Note that the data points for R = 1 represent the single-user or matched filter bound, so that the degradation in required Et>/ No when increasing the number of spatially interfering MS K^ = R qualifies the multiuser detection algorithm performance. For R > 1, the FD MMSE-BLE always achieves the lower required Eb/NQ as compared with the FD ZF-BLE. The required Eb/No for the FD ZF-BLE is yielded from the closed-form average BER expression in Table 22.2. For a fixed number of receive antenna elements N, the required Eb/NQ of both algorithms increases when increasing the number of spatially interfering MS K(k) _ ^ because DoF for spatial microdiversity are sacrificed for spatial user 20 The rather demanding choice of a raw BER of PQPSK = 10~ 3 ensures evaluation of algorithm performance in the interference-limited regime
ZF-BLE MMSE-BLE
Figure 22.8
Average Eb/No requiredfor an average BER O/PQPSK = 10 3 by the FD ZF/MMSE in an F-LAS channel with uncorrelated Gaussian noise, N receive antenna elements and code reuse factor R = K/ Q
separation. On the other hand, for fixed R, the required Et,/No decreases when increasing the number of receive antenna elements (DoF) N. The relative difference between ZF and MMSE criterion increases with the number of spatially interfering MS K^ = R and decreases with the number of receive antenna elements N. 22.5.2.2 Frequency-selective fading In analogy to the examination of spatial and frequency diversity as performed in subsection 22.5.1.2, a decoupled investigation of intracell interference cancellation in the space and code domain is not possible for frequency-selective fading, even if orthogonal spreading codes are deployed at the transmitting MS. However, the performance trade-off between intracell interference cancellation and exploitation of both spatial microdiversity and frequency diversity by the NQ DoF of the linear FD-JD algorithms lends itself to examination by means of Monte Carlo link level computer simulation. The simulations are based on a stochastic scatter channel model as presented in Chapter 3 of Reference 3, parameterised to model a microcellular rich scattering and frequency-selective propagation scenario with delay spread l.25Tc, maximum excess delay TM = 12.8rc and 10Om distance between MS and BS. In each channel realisation, the scatterer constellation surrounds both MS and BS and is assumed fixed for the duration of one time slot Tburst, corresponding to the transmission of NdQ = 1024 chips per burst. The line-of-sight path is assumed to
ZF-BLE MMSE-BLE
Figure 22.9
Average Et,/No required for an average BER of PQPSK = 10 3 by the FD ZF/MMSE in a microcellular frequency-selective channel with uncorrelated Gaussian noise, N receive antenna elements, K = 8 MS and code reusefactor R = K/ Q. NQ/K denotes the ratio of equations to unknowns at each block eigenvalue uy. The hyperbolic dotted lines are meant to illustrate the data points from curves with different numbers of receive antennas N that have the same ratios NQ/K
be obstructed, and the BS antenna interelement spacing d equals one carrier wavelength (spatial microdiversity). Simulations are performed at a carrier frequency of fc = 2GHz with chip-rate rc = 3.84 Mchip/s and a root-raised-cosine pulse shape with zero excess bandwidth.21 Furthermore, spatio-temporally uncorrelated Gaussian noise is assumed as intercell interference. Figure 22.9 depicts ZF/MMSE FD-JD results for the frequency-selective fading channel. The results are based on the simulation of K = 8 single antenna MS, each with spreading gain Q = 1, 2, 4, 8 and 16. The Q orthogonal spreading codes are randomly distributed on the K MS under the constraint that, for K > Q, each code
The overall length of the channel impulse response h*-m' -* including the pulse shape is truncated to L = 28 chips
is reused by the same number R = K^ of MS. Four different BS receive antenna array configurations N = 1, 2, 4 and 8 are considered. For a fixed number of receive antenna elements N, the required Eb/NQ of both algorithms increases with increasing code reuse factor R, because the ratio of DoF NQ compared with the number of MS K in each block eigenvalue TXy decreases. In the vector space model introduced at the beginning of Section 22.5, the probability that the subspace orthogonal to the subspace spanned by the K—\ other MS contains only a few power of MS k increases with decreasing ratio NQ/K. The spatio-temporal equaliser then powers up the system channel of MS k and enhances the noise in the symbol estimate. As in the flat fading case, this Eb/No increase is more pronounced for the FD ZF-BLE. The effect of exploiting frequency diversity is best spotted for the single antenna BS case N=I. For R=I, each of the K = 8 MS uses its own spreading code with spreading gain Q, so that the number of equations NQ in each block eigenvalue T>^ equals the number of unknowns K. Increasing the spreading gain to Q = 16, so that R = 0.5 holds, halves the spectral efficiency r\ according to equation (22.1), but also vastly reduces the required Eb/Wo, because the number of independent equations (or vector space dimensions) NQ at each frequency index n doubles; compare the composition of the block eigenvalues T>^ in equation (22.47) and Section 22.5.1.2. Note that, as already stated in Section 22.5.1.2, increasing the spreading gain Q in the frequency flat fading case does not yield any performance improvement due to the lack of frequency diversity. Taking into account larger numbers of receive antenna elements N, the Eb/No reduction (exploitation of frequency diversity) when increasing the spreading gain from 8 to 16 diminishes quickly even for small apertures N = 2. Largest gains for fixed K and N by doubling the spreading factor Q are achieved for the transition from NQ/K = 1 to NQ/K = 2, whereas the gains for transition between states with NQ/K > 1 is less pronounced. For both the flat and frequency-selective fading case, the same desired Eb/No can be achieved with different combinations of N and Q. In the flat fading case, the choice of Q for fixed K determines the size of the set of spatially interfering MS K1 and thus the spatial DoF that have to be sacrificed for spatial user separation, whereas in the frequency-selective case the possible exploitation of frequency diversity has also to be kept in mind when selecting Q. With regard to Figure 22.9, obviously the benefit of exploiting frequency diversity by choosing large spreading factors vanishes with increasing ratios NQ/K. On the other hand, note that when considering the average required SNR (equation (22.62)) per receive antenna element p ~ 1/Q-Eb/No, each doubling of the spreading factor Q leads to another decrease of the required SNR by 3 dB irrespective of the amount of exploited frequency diversity. Operation at low SNR allows us to reduce the required cluster size r, so that the spectral efficiency r) (equation (22.1)) increases. This mechanism leads to the two dominant operational points of SD/TD/CDMA systems with high spectral efficiencies: 1
Deployment of the DoF NQ for exploitation of spatial and frequency diversity (large spreading factors Q > K without code reuse, small frequency cluster sizes r owing to small required SNRs).
2
Deployment of the DoF NQ for intracell interference cancellation (small spreading factors Q < K (code reuse), large frequency cluster sizes r owing to large required SNRs).
22.5.3 Intra- and intercell interference cancellation In the previous section, the performance of FD-JD algorithms with regard to intracell interference cancellation and exploitation of spatial and frequency diversity was examined. The noise (ICI) contribution n was modelled as uncorrelated Gaussian noise, which is a popular assumption for noise-limited communication systems. However, cellular radio systems are in general interference-limited. This is due to the fact that, because of the scarce radio spectrum, the transmission bandwidth is reused by several cells of a system. With decreasing cluster size (causing decreasing distances between cells that use the same transmission bandwidth), mutual interference between the cells increases. The detrimental effect of increased ICI can be kept low by applying spatial ICI cancellation techniques in addition to the intracell interference cancellation techniques. Dominant intercell interferers are then spatially suppressed by exploiting knowledge of the N x N spatial noise (ICI) covariance matrix Qa according to equation (22.21) in the FD-JD process (see Section 22.4). Qa can, e.g. be adaptively determined from noise samples at each receive antenna element estimated during midamble-based channel estimation, without requiring any knowledge of the intercell interferer's midambles and spreading codes. After channel estimation, which is based on a transmitted midamble of symbols that are a priori known at the receiver, the part of the received signal that is due to the transmitted midamble is reconstructed based on the estimated channels and the a priori known symbols of the midamble and then subtracted from the actually received signal, yielding an estimate of the noise received at each antenna element. From this estimated noise, an estimated spatial noise covariance matrix Qa can be calculated, so that adaptive estimation of both the channels of the intracell interferers and the spatial noise covariance matrix of the intercell interferers (considered as noise) on a burstto-burst basis is possible, irrespective of the form of the intercell interference. The estimated channels of the intracell interferers and the estimated spatial noise covariance matrix then are used for an enhanced JD of the intracell interferers' symbols, now comprising intercell interferer cancellation. 22.5.3.1 Flat fading Aiming at an analytical examination of the performance of spatial ICI cancellation, the flat fading case will be treated in this subsection. In subsection 22.5.1.1, the transition from the frequency-selective system model to the flat fading system model was performed by assuming ideal synchronisation and by assigning each MS k a spatial signature s ^ containing the complete propagation channel information. The assumption of ideal synchronisation is justifiable for the intracell interferers, but not for the intercell interferers. Furthermore, carrier phase variations of intercell interferers have to be taken into account in interference modelling. To this aim,
redefine the total noise vector n of equation (22.12) as: (22.63)
where s^ denotes the spatial signature (compare equation (22.49)) of each intercell interferer / = 1 , . . . , KJCI extended to contain the carrier phase variations, o^ is the interferer's transmission power and the (N^Q + L — 1)-dimensional vectors z ^ with elements: (22.64) model the spread and scrambled symbol stream22 of intercell interferer / sampled without perfect synchronisation (the average path delay r ^ is not compensated). The intercell interferers' data symbols contained in d/c/ are assumed QPSK modulated. Furthermore, random binary scrambling codes of length Q are assumed. The pulse shape g^ (t) in equation (22.64) is then defined by equation (22.42) with the combined intercell interferer's spread and scrambling code c^l\t). The average time delay r ^ is assumed to be uniformly distributed in the interval [0, Tc]. For uncorrelated data symbols [djci]j with power crj, uncorrelated elements of the combined spread- and scrambling codes c (l) with power the sampled symbols [z^]^, n = 1 , . . . , (Nd Q + L — 1) can be shown to possess the following properties [22]: 1 E{[z®U = 0 2 E{|[z«] n | 2 } = acV2 3 E{[z«]JzW] m } = o*o*8ijSnm. The flat fading noise/ICI covariance matrix then is given by equation (22.21) with the spatial intercell interference covariance matrix: (22.65) The flat fading FD ZF-BLE SNIR with spatial ICI cancellation then is yielded from equation (22.52) as: (22.66)
22 In contrast to the limited burst length of Nj Q chips of intracell interferers, for the intercell interferers a transmitted burst of infinite length is assumed, so that each element [z^]n has the same distribution
In compliance to equation (22.57), equation (22.66) can be further simplified to: (22.67)
with the average SNR per receive antenna element p as defined in equation (22.58). In what follows, the impact of different intercell interference constellations on the performance of the FD ZF-BLE SNIR in an F-LAS channel will be examined. Analytical treatment is only possible for the cases of uncorrelated spatial ICI covariance matrices Q^ ~ I(#) or correlated spatial ICI covariance matrices composed of equal-power interferers with a^ = 0 and K -> oo [29]. The results are listed in Table 22.3. For an uncorrelated spatial ICI covariance matrix Qa with K —» O, the SNIR y^ equals the expression in equation (22.57), and the corresponding PDF, expectation and variance are given in Table 22.1. As expected, spatial ICI cancellation does not improve the SNIR when no spatial correlations in the received noise vector can be exploited. Correspondingly, the spatial noise covariance matrix then is of diagonal shape as defined by the first addend in equation (22.68). For a completely correlated spatial ICI covariance matrix with K -» oo, the cases of resolvable and non-resolvable ICI have to be considered. The structure of the spatial ICI covariance matrix then is defined by the second addend in equation (22.68); no further structural properties are known apart from the fact that the matrix is necessarily a Hermitian. Resolvable ICI occurs for K^ H- KJCI < N. As deduced in Reference 29, KJCI intercell interferers with known joint ICI covariance matrix Qa can be suppressed by a ZF-BLE as if they were intracell interferers with known combined spreadand scrambling codes c ^ and spatial signatures s^R\ as long as K^ + KJCI < N
Table 22.3
PDF of SNIR at the output of the FD ZF-BLE with spatial ICI cancellation for the frequency flat F-LAS channel and correlated/uncorrelated ICI
Degree of correlation
PDF
Py(y^)
Uncorrelated: K -> 0 Correlated: /c —»• oo, K(k) + K1Ci < N
Correlated: K —> oo, *<*> + K1Ci > N
holds.23 The PDF of the SNIR at the output of the ZF-BLE as listed in Table 22.3 is X2 distributed with N — K^ — KJCI + 1 DoF and expectation: E{y{k)] = (K + I)Qp • (N - K{k) - Kici + 1)
(22.69)
Note that, compared with the case of spatially uncorrelated interference, the spatial DoF available for exploitation of microdiversity are reduced by Kici when spatial ICI cancellation is applied. However, with K - • oo, the expectation of the SNIR becomes infinite, accounting for the fact that the major source of interference, the Kici intercell interferers, are completely suppressed. Non-resolvable ICI occurs for K^ + Kici > N. In this case, the intercell interferers can no longer be completely suppressed as if they were intracell interferers. As shown in Reference 29, the PDF of y^ then is based on the null distribution of Hotelling's T2 statistic (see pages 96-99 of Reference 18), and is given in Table 22.3. The rather simple expectation follows as: (22.70)
Note that equation (22.70) only holds for K{k) + KICi > N + 1. A closed-form expression for the average BER PQPSK under exploitation of a modified central limit 23 In this context, the term resolvable is understood in a sense that resolvable intercell interferers can be completely cancelled by the antenna array irrespective of their position. This works even if the intercell interferers are situated closer than the Rayleigh criterion of the antenna allows (these closely situated interferers are then cancelled as if they were one interferer). In contrast, to spatially separate the intracell interferers by means of JD, the Rayleigh criterion has to be met, because otherwise no proper reconstruction of the data symbols transmitted by two closely situated intracell interferers was possible
ZF-BLE MMSE-BLE
Figure 22.10
ZF-BLE MMSE-BLE
Normalised expectation of output SNIR for FD ZF/MMSE-BLE with spatial ICI cancellation in F-LAS channel for SNR J) = OdB a Only K —• 0 (spatially uncorrelated ICI) and K —> oo (Kici equal-power intercell interferers transmitting unsynchronised QPSK symbols) for K{k) =4, 8. b Different factors K = O9 2.33, 9, 99, oo for K(k) = 4
theorem to ensure the Gaussian assumption for the intercell interference is derived in Appendix 3 of Reference 30. Figure 22.1 Oa depicts the normalised expectation of the SNIR y^ for FD ZF-BLE (dot sign) and MMSE-BLE (plus sign) for an F-LAS channel. The spatial DoF ofN = 8 receive antennas are available to suppress KJCI equal-power intercell interferers and K^ = 4 or K^ = 8 intracell interferers. In Figure 22.10a, only the cases of completely correlated ICI (solid lines) and completely uncorrelated ICI (dashed lines) are considered, whereas in Figure 22.10b, for K^ = 4, different factors /c are also depicted. For convenience, the average SNR per receive antenna element p is set equal to 0 dB so that, for Q = 1, the ordinate directly represents the average SNIR at the equaliser output.24 Regarding Figure 22.10a, consider first the case K = 0. The SNIRs y^ then are independent of the number of intercell interferers KJCI. In compliance to the results of Figure 22.8, the FD MMSE-BLE outperforms the FD ZF-BLE, with E{y(k)}/(Qik)p) = N(^RC = N- K^ - 1 as predicted in Table 22.1. Furthermore, 24
Note that only the MMSE BLE depends on the noise power a2 and thus on the SNR p
the performance improvement by applying the MMSE criterion instead of the ZF criterion is larger for K{k) = 8 than for K{k) = 4. For K —>- oo, the noise is entirely composed of the signals of KJCI intercell interferers, and ICI cancellation becomes possible. Consider the FD ZF-BLE first. For g(k) _ 4^ Up ^0 KICI — N — K^ = 4 intercell interferers can be completely spatially cancelled so that the SNIR y^ tends towards infinity (outside the presentable range), compare equation (22.69). For KICI > N — K^ = 4, complete ICI cancellation is no longer possible, but still significant gains in the SNIR as compared with the case K = O are possible. The FD ZF-BLE SNIR then follows the null distribution of Hotelling's T2 statistic with expectation as presented in equation (22.70). For K^ — 8, Kici = N — K^ = O intercell interferers are spatially resolvable, and the ZF-BLE y{k) of the range KICi > 1 is described by equation (22.70). Note that, due to the central limit theorem, the performance curves for K —• oo of both algorithms converge towards the performance curves for /c = O, when Kici -> oo. Figure 22.10b depicts the normalised expectation of the SNIR y^ for both FD ZF/MMSE-BLE for an F-LAS channel, Af = 8 BS receive antennas and K{k) = 4 intracell interferers for different factors /c, thus partially extending the results of Figure 22.10a. The rationale behind the introduction of the factor K was the modelling of ICI as being composed of a large number of low-power interferers, yielding an uncorrelated spatial ICI covariance matrix, and a few Kici dominant equal-power intercell interferers contributing to a correlated spatial ICI covariance matrix. With regard to the results in Figure 22.10, large SNIR gains are achievable not only for the case of a completely correlated spatial ICI covariance matrix (K -+ oo), but also for various mixes of ICI. For example, the FD MMSE-BLE with spatial ICI cancellation achieves an output SNIR y^ = 10 dB for the following cases: • • •
Kici = 4 dominant intercell interferers constituting 70 per cent of the interference power and 30 percent spatially uncorrelated interference Kici = 8 dominant intercell interferers constituting 90 per cent of the interference power and 10 percent spatially uncorrelated interference Kici = 11 dominant intercell interferers constituting 99 percent of the interference power and 1 percent spatially uncorrelated interference.
As shown in Reference 29, both JD algorithms with spatial ICI cancellation use the remaining spatial DoF that do not have to be sacrificed for intracell interference cancellation adaptively to either cancel ICI (K -+ oo) or to combat fading (K -> 0). Figure 22.11 depicts the Eb/No required to achieve a raw BER PQPSK = 10~3 for both FD ZF/MMSE-BLE with and without ICI cancellation for different ICI mixes, characterised by the factor K and the number of equal-power intercell interferers Kici. For both considered algorithms with spatial ICI cancellation (solid lines), large gains in Eb/NQ are achieved when comparing spatially correlated with spatially uncorrelated noise. With increasing numbers Kici of intercell interferers and decreasing factor /c, the Eb/NQ gain rapidly deteriorates towards the performance in spatially
w/o spatial ICI cane.
w/o spatial ICI cane.
w/spatial ICI cane. w/spatial ICI cane.
Figure 22.11
Eb/N0 required by FD ZF/MMSE-BLEfor PQPSK = 10~3 with (solid lines) and without (dashed lines) spatial ICI cancellation of KJCI intercell interferers in flat Rayleigh fading channel, parameterised with ratio K ofspatially correlated to uncorrelated ICIpower; N = 8, Q = 1, K^ = 4 intracell interferers a ZF-BLE b MMSE-BLE
uncorrelated noise (/c = 0), compare Figure 22.10. The algorithms without spatial ICI cancellation (dashed lines), i.e. assuming a spatially uncorrelated ICI covariance matrix Q^ = (J2I(N) irrespective of the current ICI situation, show significant performance degradations for large factors K and small numbers KJCI of intercell interferers as compared with the case of spatially uncorrelated noise (/c = 0). 22.5.3.2 Frequency-selective fading In the frequency-selective case, the temporal correlations in the ICI covariance matrix are neglected when performing JD with spatial ICI cancellation, so that spatial ICI cancellation becomes suboptimum. However, an estimate of the spatio-temporal ICI covariance matrix is impossible to achieve in state-of-the-art cellular radio systems. To gain more insight into the performance of FD-JD algorithms with combined intraand spatial intercell interference cancellation in frequency-selective channels, the
same simulation environment as in Section 22.5.2.2 is used. However, depending on the choice of /c, all states between a completely structured spatio-temporal ICI covariance matrix, which is yielded from the complete spreading, scrambling and propagation of QPSK symbols of KICI equal-power intercell interferers over a frequency-selective propagation channel A/c/d/c/, and a completely uncorrelated spatio-temporal ICI covariance matrix, can be simulated. Simulations were performed for K^ =4 intracell interferers and N = 8 BS antenna elements. The resulting performance of an FD MMSE-BLE in terms of required Eb/No for a BER of 10~3 with and without spatial ICI cancellation is depicted in Figure 22.12. Note that, in contrast to the results of Figure 22.9, the effect of non-blind channel [23], and spatial ICI covariance matrix estimation (see Chapter 3.5 of Reference 30) is contained in the results of Figure 22.12 to point out the robustness of the considered JD algorithms against channel estimation errors.
w/o spatial ICI cancellation
w/spatial ICI cancellation
Figure 22.12
Eb/No required by FD MMSE-BLE for an average BER PQPSK = 10~3 with (solid lines) and without (dashed lines) spatial ICI cancellation of KJCI intercell interferers in micro cellular frequency-selective fading channel, parameterised with ratio K of spatially correlated to uncorrelated ICIpower. N — 8, Q — 1, K^ = 4 intracell interferers. Effect of channel and spatial ICI covariance matrix estimation included
As in the flat fading case, algorithm performance with spatial ICI cancellation improves with increasing spatial correlations in the noise term, whereas algorithm performance without spatial ICI cancellation deteriorates, each effect being more pronounced for small numbers KJCI of intercell interferers and large factors /c. Note that, due to the assumption of the same frequency-selective channel model for both intra- and intercell interferers, the spatial ICI covariance matrix Q^ not only contains one spatial signature S^ per intercell interferer i = I9.. .,KICI as in the flat fading case, see equation (22.65), but also several temporally resolvable spatial signatures per intercell interferer, so that the effective number K'IC1 of spatial signatures stemming from intercell interferers increases proportionally to the maximum excess delay TM of the channel impulse responses. Concluding, the achievable Eb/No gain for a fixed number KJCI of intercell interferers with spatial ICI cancellation in a frequency-selective channel is reduced as compared with the flat fading case, but still amounts to several dB. Reconsidering the two dominant operational points of SD/TD/CDMA systems as motivated in subsection 22.5.2.2, it is obvious that the application of spatial ICI cancellation is especially suited to the second operational point, which is characterised by small spreading factors, code reuse, large frequency cluster sizes and comparatively small numbers of MS per cell. As shown in Chapter 4.3 of Reference 3 and Chapter 6.3 of Reference 30, with increasing frequency reuse factor the main portion of ICI power is caused by only a few intercell interferers. These dominant intercell interferers can be efficiently suppressed by spatial ICI cancellation.
22.6
Conclusions
In this chapter, linear JD techniques were presented as an application of STAP in the field of mobile radio communications. STAP was performed on the UL of a cellular SD/TD/CDMA system by adaptively equalising the multiuser system channel matrix, which contains the spatio-temporal channel impulse responses between all transmitting MS and the receiving BS antenna array and the code signatures of the MS. Perfect channel state information at the receiver was assumed, whereas in real-world systems blind/semi-blind/non-blind tracking of the spatio-temporal channel impulse responses has to be performed on a time slot basis. The same holds for the estimation of the spatial ICI covariance matrix. In particular, the trade-off between deploying the available DoF in the space and code domain for exploitation of spatial and frequency diversity (to reduce fading) or for intra- and intercell interference cancellation (to create spatial transmission channels) was examined. Further performance improvements can be achieved by applying computationally more demanding decision feedback and maximum likelihood JD techniques (see Chapter 4.2 of Reference 13 and Chapter 5 of Reference 30). It should also be mentioned that the channel state information available at the BS on the UL can be exploited to perform a spatio-temporal predistortion of the DL multiuser system channel, if the channel coherence time is sufficiently large as compared with the dwell time between UL channel estimation and DL predistortion and if the Tx/Rx front ends of
both the BS antenna array and the MS are accurately calibrated (see References 28 and 12, and Chapter 4.2 of Reference 30). The SD/TD/CDMA system, on which the investigations in this chapter are based, is flexibly parameterised to model a wide range of mobile radio systems with TDMA, CD/TDMA or SD/TDMA as access techniques. For instance, TD/CDMA systems are standardised as the air interface for the international mobile telecommunications (IMT-2000) family of harmonised 3G standards [19]. Strong efforts are presently undertaken by the time division-synchronous code division multiple access (TDSCDMA) Forum [8,24], a consortium of international telecommunication companies including Siemens, Motorola, Nortel Networks, China Mobile and China Telecom, to push further the development and deployment of TD/CDMA in 3G systems. The results as presented in this chapter are either of analytical or simulative nature. However, it is worth mentioning that computationally more efficient overlapsave versions [27, 28] of the FD-JD algorithms as presented in this chapter have been implemented in the X-band SABA25 hardware demonstrator at the Institute of High Frequency Technology of RWTH26 Aachen in order to examine both real-time feasibility and performance under real-world conditions.
22.7
List of variables
22.7.1
Variables with roman/calligraphic letters
A, Afo, Abe B Bs c C d d d EbINo fc fo g H I K KJCi K^ L
multiuser system matrices bandwidth per cell bandwidth of cellular system combined spread and scrambling code combined multiuser spread and scrambling matrix antenna spacing in uniform linear array stacked multiuser data symbol vector estimated stacked multiuser data symbol vector average received energy per bit per noise spectral power density carrier frequency Doppler frequency combined code/pulse shape signature spatio-temporal multiuser channel matrix unit matrix number of users per cell number of intercell interferers number of users per cell that use the same code as user k number of taps in the spatio-temporal channel impulse response
25
Smart Antennas for Broadband Access
26
Rheinisch-Westfalische Technische Hochschule
M n N ^d, Nd,bc NMRC PQPSK P Q Qd Qn r rm l ,ml R s S Tburst Tc Ts T W x
22.7.2 D 7>A T n £
r\ K Xc p Gx Oy a2 T (p
Variables with calligraphic letters
frequency-domain frequency-domain DFT matrix frequency-domain frequency-domain
22.7.3 /3 y
number of propagation paths stacked multiantenna noise vector number of receive antenna elements number of data symbols per MS in one transmitted burst spatial degrees of freedom for optimum combining average bit error ratio for coherent QPSK modulation unitary permutation matrix spreading gain, length of combined spread and scrambling codes data symbol covariance matrix noise/intercell interference covariance matrix frequency reuse factor correlation between two antenna elements code reuse factor steering vector, spatial signature multiuser spatial signature matrix time duration of one burst time duration of one chip time duration of one data symbol spatio-temporal equalisation matrix postfix length for block-circulant system model stacked multiantenna receive vector
stacked multiuser data symbol vector multiuser system matrix stacked multiantenna noise vector stacked multiantenna receive vector
Variables with greek letters
propagation path amplitude symbol-to-noise-and-interference ratio at the output of the spatio-temporal equaliser spectral efficiency of cellular system ratio of correlated to uncorrelated intercell interference carrier wave length average signal-to-noise ratio per receive antenna delay spread angle spread signal or noise power propagation path delay azimuth angle
References 1 ABRAMOWITZ, M. and STEGUN, I. A.: 'Handbook of mathematical functions' (Dover Publications, Inc., New York, 1974) 2 BAINES, S. J., BURR, A. G., and TOZER, T. C : 'Performance limits for multi-user decorrelating detectors in DS-CDMA cellular radio systems'. Proceedings of the IEEE fourth international symposium on Spread spectrum techniques and applications (ISSSTA '96), Mainz, Germany, 1996, pp. 486^91 3 BLANZ, J. J.: 'Empfangsantennendiversitat in CDMA-Mobilfimksystemen mit gemeinsamer Detektion der Teilnehmersignale'. (Fortschrittsberichte VDI series 10, (535), VDI-Verlag, Diisseldorf, 1998) 4 BONEK, E. and LAURILA, J.: 'Semi-blind signal separation and detection. Proceedings European Wireless 99 workshop MW-WeW2, (1), October 1999, pp. 1-15 5 BRUNNER, C : 'Efficient space-time processing schemes for WCDMA'. Forschungsberichte Lehrstuhl fur Netzwerktheorie und Schaltungstechnik, Technische Universitat Miinchen (Shaker Verlag, 2000) 6 DAVIS, R J.: 'Circulant matrices' (Chelsea Publishing, New York, N.Y, 1994) 7 ERTEL, R. B., CARDIERI5P., SOWERBY, K. W., RAPPAPORT, T. S., and REED, J. H.: 'Overview of spatial channel models for antenna array communication systems', IEEEPers. Commun., February 1998,5, (1), pp. 10-22 8 TD-SCDMA Forum. Information about TD-SCDMA. Technical report, http://www.tdscdma-forum.org, 2001 9 GIBSON, J. D.: 'The mobile communications handbook' (CRC Handbook published in cooperation with IEEE Press, 1999, 2nd edn.) 10 JAKES, W. C : 'Microwave mobile communications' (John Wiley & Sons, New York, 1974) 11 JAMES, A. T.: 'Distributions of matrix variates and latent roots derived from normal samples', Annals of Mathematical Statistics, 1964, (35), pp. 475-501 12 KEUSGEN, W. and WALKE, C. M.: 'A system model considering the influence of front-end imperfections on the reciprocity of up- and downlink system impulse responses'. Proceedings of ASST 2001, Aachen, Germany, September 2001, pp. 243-248 13 KLEIN, A.: Multi-user detection of CDMA signals - algorithms and their application to cellular mobile radio (Fortschrittsberichte VDI series 10, (425), VDI-Verlag, Diisseldorf, 1996) 14 LIBERTI, J. C. and RAPPAPORT, T. S.: 'Smart antennas for wireless communications' (Prentice Hall, Englewood Cliffs, New Jersey, 1999) 15 LIU, H. and XU, G.: 'Multiuser blind channel estimation and spatial channel pre-equalization'. Proceedings of ICASSP, Detroit, USA, May 1995, pp. 1756-1759 16 LUKE, H. D.: 'Korrelationssignale' (Springer-Verlag, Berlin Heidelberg, 1992)
17 LUENBERGER, D. G.: 'Optimization by vector space methods (series in decision and control)' (John Wiley & Sons, Wiley Professional Paperback Series, New York, London, Sydney, Toronto, 1997) 18 MUIRHEAD, R. J.: 'Aspects of multivariate statistical theory' (John Wiley & Sons, Inc., Wiley Series in Probability and Mathematical Statistics, New York, 1982) 19 OJANPERA, T. and PRASAD, R.: 'An overview of air interface multiple access for IMT-2000/UMTS', IEEE Commun. Mag., September 1998, pp. 82-95 20 PROAKIS, J. G.: 'Digital communications' (McGraw-Hill Book Company, New York, 1983, 2nd edn.) 21 SALZ, J. and WINTERS, J. H.: 'Effect of fading correlation on adaptive arrays in digital mobile radio', IEEE Trans. Veh. Technol, 1994, 43, (4), pp. 1049-1057 22 SHAH, A. and HAIMOVICH, A. M.: 'Performance analysis of maximal ratio combining and comparison with optimum combining for mobile radio communications with cochannel interference', IEEE Trans. Veh. Technol, 2000, 49, (4), pp. 1454-1463 23 STEINER, B. and JUNG, P.: 'Optimum and suboptimum channel estimation for the uplink of CDMA mobile radio systems with joint detection', Eur. Trans. Telecommun. Relat. Technol, January 1994, 5, pp. 39-50 24 TSG RAN WG2: 'Overview of the TDD harmonisation and the key features of TD-SCDMA'. Technical report TSGR2#6(99)782, 3GPP, available at http://www.3gpp.org, August 1999 25 VAN DER VEEN, A. J., TALWAR, S., and PAULRAJ, A. J.: 'A subspace approach to blind space-time signal processing for wireless communication systems', IEEE Trans. Signal Process., January 1997, 45, pp. 173-190 26 VERDU, S.: 'Multiuser detection' (Cambridge University Press, Cambridge, UK, 1998) 27 VOLLMER, M., GOTZE, J., and HAARDT, M.: 'Efficient joint detection techniques in the frequency domain for third generation mobile radio systems'. Proceedings of IEEE international conference on Third generation wireless communications, Silicon Valley, USA, 2000 28 WALKE, C. M.: 'Joint detection and joint pre-distortion techniques for SD/TD/CDMA systems', Frequenz - Zeitschrift fur Telekommunikation, July 2001, 55, pp. 204-213 29 WALKE, C. M.: 'The impact of inter-cell interference cancellation on the performance of SD/TD/CDMA systems'. Proceedings of ASST 2001, Aachen, Germany, September 2001, pp. 229-236 30 WALKE, C. M.: 'On the spectral efficiency of interference-limited mobile radio systems with space/time/code division multiple access' (Shaker Verlag, Berichte aus der Hochfrequenztechnik, D 82 (Diss. RWTH Aachen), 2003) 31 WHALEN, A. D.: 'Detection of signals in noise' (Academic Press, Inc., New York, 1971)
32 WINTERS, J. H., SALZ, J., and GITLIN, R. D.: 'The impact of antenna diversity on the capacity of wireless communication systems', IEEE Trans. Commun., 1994, 42, (2), pp. 1740-1751 33 WOLNIANSKY, G. J., FOSCHINI, G. J., GOLDEN, G. D., and VALENZUELA, R. A.: 'V-BLAST: an architecture for realizing very high data rates over the rich-scattering wireless channel'. Proceedings of ISSSE-98, Pisa, Italy, 1998
Chapter 23
Underwater communication with vertical receiver arrays Johann F. Bohme and Rolf Weber
23.1
Introduction
Since the early 1990s, there has been a tremendous increase in the research and development of underwater acoustic (UWA) communication systems. This has been the response to the growing demand for wireless underwater communications, which was initiated by a shift in applications from almost exclusively military to commercial ones. Examples for such commercial applications of UWA communications are remote control of autonomous underwater vehicles, ocean monitoring, manned and unmanned oceanographic exploration, and communication between submersibles. This development has been accompanied by an ever growing need for higher data rates to cope with the huge amount of data to be transmitted over long distances. Typical data rates range from a few kilobits per second for simple command and control tasks up to hundreds of kilobits per second for video image transmission [I]. As efficient communication systems are emerging, the scope of their applications continues to grow, allowing for real-time communication not only in point-to-point links but also in network configurations. The desired high data rates are in contrast to the transmission conditions induced by the underwater acoustic communication channel, which is bandlimited and highly reverberant, thus posing many obstacles to reliable high-speed digital communication. Therefore, combating time-varying multipath induced by the transmission channel especially the technically interesting horizontal shallow-water acoustic channel - is considered the most challenging task [22]. Although underwater acoustic channels and systems share many features with their radiofrequency counterparts, the adverse effects that underwater propagation has on the digital acoustic signals often require the development of specialised communication techniques.
This chapter presents blind space-time adaptive processing methods for receiver architectures with application to UWA communication systems.
23.2
The underwater acoustic channel
The underwater acoustic channel is regarded as one of the most difficult communication channels, making special signal processing techniques necessary to ensure reliable, high-rate data transmission. This section provides a brief overview of the characteristics of underwater acoustic channels and their effects on the communication problem. A detailed treatment of underwater sound transmission may be found in References 3, 5 and 27.
23.2.1
Transmission loss and ambient noise
For a given source level, the received signal-to-noise ratio (SNR) is influenced by the losses encountered by propagating sound and the ambient noise. The main sources for transmission loss are geometric spreading, absorption and scattering. Assume a sound source being located in an unbounded, homogeneous and lossless medium. The power generated by the source is then radiated equally in all directions. Since there is no loss in the medium, the power crossing an arbitrary sphere with the source in its centre will be constant for all directions. Thus, the decrease in sound intensity, which is defined as power per unit area, is proportional to the distance squared from the source. In reality, the medium will be bounded. Assuming ideal upper and lower bounds, the spreading is no longer spherical because sound cannot cross the bounding planes. Beyond a certain range, the power radiated by the source is therefore distributed instead over the surface of a cylinder of constant height. This leads to a decreasing sound intensity that is proportional to the distance from the source. Absorption losses result from the conversion of acoustic energy into heat. The observed significant losses in sea water are due to shear viscosity, volume viscosity and ionic relaxation. As an example, Figure 23.1 shows the attenuation coefficient in dB/km as a function of frequency for sound in sea water at a temperature of 14 degrees Celsius, a salinity of 35 parts per thousand and zero depth. For non-zero depth, the hydrostatic pressure has to be taken into consideration, leading to lower values of the attenuation coefficient. It can be seen from Figure 23.1 that the absorption of sound by sea water is a strong function of frequency. The attenuation coefficient rises rapidly with frequency, effectively limiting either the maximum operating range of a system or the maximum usable frequency for a particular given range. The sea surface is both a reflector and a scatterer of sound waves and has a profound effect on propagation especially in shallow water. Its influence depends on the roughness of the sea surface relative to the wavelength of the signal. If the sea surface were perfectly smooth, it would form an almost perfect reflector. When the sea is rough, measurements demonstrate that the loss on reflection is no longer zero. In this case, the surface acts as a scatterer, sending energy in all directions.
attenuation coefficient, dB/km
frequency, kHz
Figure 23.1
Attenuation of sound in sea water close to the surface
However, energy that is scattered in a direction different from that of the receiver is effectively lost. The sea bottom is also a reflecting and scattering boundary of the sea, and its effects are usually more complicated because of its diverse and multilayered composition. Due to the imperfect reflection of energy at the rough sea bottom, sound will penetrate into the sediment layers. The absorption of sound in the sediment is significantly higher than that in the water, resulting in additional transmission losses. Besides transmission losses, signal detection and reception at the receiver are also affected by various types of noise. There are many sources of ambient noise present in the ocean, with turbulence, ship traffic noise, breaking waves and rain being the dominant ones. Significant noise can also be generated by marine life such as fish, whales and shrimp. The ambient noise level decreases with frequency over the operating range of most acoustic communication systems. The noise level at the hydrophone is additionally increased by self-noise due to machinery noise, propeller noise and hydrodynamic noise. Generally, these man-made noises are localised, highly directional and concentrated at low frequencies. Therefore, one wants to use as high a centre frequency as possible consistent with transmission loss considerations.
23.2.2
Sound speed variability
When the spatial scale of inhomogeneities in the ocean is larger than the wavelength of the sound, acoustic waves can be reasonably modelled as propagating along rays
through the ocean. In a homogeneous medium, the rays would be straight lines emanating from the source. The ocean, however, constitutes no homogeneous medium as, for example, the speed of sound in the ocean varies with depths, the season, geographic location and time. More precisely, at a given location it is determined by three basic physical quantities: temperature, salinity and pressure. According to Snell's law, the spatial variability of the speed of sound induces a bending of the rays towards regions of lower sound speed, which is referred to as refraction. When a negative gradient of the speed of sound exists just beneath the sea surface, the rays are refracted downwards and a shadow zone close to the surface is produced. A similar behaviour can be observed close to the sea bottom when the rays are upward refracted due to a speed of sound that increases with depth. This demonstrates the possible existence of regions with low sound intensity from the source, making signal reception more complicated or even impossible if the receiver is placed in such a zone. The speed of sound in the sea increases with temperature, salinity and depth. In particular, the shallow waters of coastal regions show a great variability of the sound speed profile due to the effects of surface heating and cooling, salinity changes and water currents. For example, after the decline of winter storms, the water is well mixed, leading to a sound speed profile almost constant over depth. As a typical shallow-water sound speed profile shows no distinct minimum that would focus a large portion of the power radiated from the source at the corresponding depth, a sound channel is created between the sea surface and the bottom instead. The inevitable numerous interactions with the boundaries result in high signal losses and an increased temporal variability of the sound field.
23.2.3 Multipath propagation Multipath propagation occurs whenever there is more than one transmission path between the source and the receiver, especially if an omnidirectional source is used. In shallow water, the numerous propagation paths between the source and the receiver are due to reflections of sound energy at the surface and bottom boundaries. Since both boundaries support scattering of energy, and sound energy can also penetrate into the bottom and reflect off buried sediment layers, additional paths appear. These effects result in multiple versions of the transmitted signal that arrive at the receiving hydrophone, displaced with respect to one another in time and spatial orientation. The impulse response of the channel consists therefore not of a single spike but is dispersed in time. Pulses that are subsequently transmitted from the source thus overlap and affect each other at the receiver, causing a severe degradation of the acoustic communication signal. This effect is referred to as intersymbol interference (ISI). The delay spread is defined as the time span between the first and the last arrival. Typically encountered delay spreads range from a few milliseconds up to several tens of milliseconds in shallow water, and even seconds in deep sea. Such delay spreads translate into several tens, or a hundred, of symbol intervals for moderate to high data rates, and typical delay spreads in the commonly used radio channels are of the order of several symbols only [19].
The reciprocal of the delay spread is an approximative measure of the coherence bandwidth of the channel. More specifically, it is a measure of the maximum frequency difference for which signals are still strongly correlated in amplitude. If the coherence bandwidth of the channel is greater than the bandwidth of the transmitted signal, the received signal will undergo flat fading: it varies in gain, but the spectrum of the transmitted signal is preserved. If the coherence bandwidth of the channel is smaller than the bandwidth of the transmitted signal, the channel causes frequency selective fading on the received signal. Due to the adverse relationship between the available signal bandwidth and the commonly encountered delay spreads, the technically interesting horizontal underwater acoustic channels are typically frequency selective. Therefore, combating the underwater multipath to achieve a high data throughput is considered to be the most challenging task of an underwater acoustic communication system.
23.2.4 Doppler effect Delay spread and coherence bandwidth describe the slowly time-varying dispersive nature of sound propagation in the ocean. However, they do not offer information about the temporal fluctuations of the acoustic channel caused by source/receiver motion and medium variability. The relative motion between the transmitter and the receiver or the motion of water in the sound channel change the length of a propagation path between the transmitter and the receiver. This corresponds to an expansion or compression of the time axis for the received signal, which is referred to as the Doppler effect. Since there is a multitude of possible propagation paths, the individual (differential) Doppler effects of each path superimpose at the receiver. When a pure sinusoidal tone is transmitted, it is subject to a mean frequency shift - the Doppler shift - and an additional spectral broadening - the Doppler spread. Typical values for the Doppler spread range from a few hertz up to several tens of hertz. The reciprocal of the Doppler spread is used to characterise the time-varying nature of the channel in the time domain. It is a measure of the time interval over which the channel impulse response is essentially invariant, thus determining the adaptation rate of digital signal processing algorithms implemented at the receiver. If the reciprocal of the Doppler spread is of the order of the symbol interval, the channel will change during the transmission of a symbol, thus causing distortion at the receiver. In this case, the channel is referred to as fast fading as opposed to slow fading, dominating transmission over the underwater acoustic channel especially for low data rates and a moving source. Until the early 1990s, it was thought that the rapid variations of the shallow water channel prevent adaptive signal processing algorithms at the receiver from converging. In 1994 Stojanovic, Catipovic and Proakis [24] showed that in many cases the channel variations can be decomposed into a slowly varying input delayspread function, and a fast varying instantaneous phase that is, however, strongly correlated for different delay taps. Recognising this fact has led to a suitable receiver architecture for phase-coherent communications consisting of jointly optimised ISI
and phase-offset compensation. This multichannel architecture is reported on in the next section.
23.2.5
Summary
The dominant features of the underwater acoustic channel are that it is bandlimited with most systems operating below 30 kHz, and that it is reverberant with spreading both in travel time and Doppler. For underwater acoustic channels, the product of delay spread and Doppler spread typically exceeds unity. In this case, the channel is said to be overspread [29], while common mobile radio channels are underspread. The rapid temporal and spectral fluctuations that come along with an overspread channel prohibit the direct adoption of techniques developed for the mobile radio channel. Special signal processing algorithms are necessary instead, which must be based on space-time processing to allow for reliable, high-speed communication links.
23.3
Underwater acoustic communications - a brief overview
The last few decades have witnessed considerable progress in improving communications over underwater acoustic channels characterised in the previous section. Approaches to system design vary according to the techniques used for compensating the effects of multipath and phase variations. These techniques may be classified according to the choice of modulation scheme, and the signal processing method employed at the receiver. This section is not intended to be an exhaustive review of all past and current developments in underwater acoustic communications but serve as an introduction to the key contributions and employed signal processing techniques. The interested reader is referred to References 1,13 and 22 for further information.
23.3.1 Incoherent digital receivers Although the communication signal is subject to amplitude distortions due to fading and fast phase variations during transmission, its frequency content is largely preserved owing to the predominantly linear nature of sound propagation in the underwater acoustic channel. This naturally led to the use of multiple frequency-shift keying (FSK) [18] as the incoherent modulation scheme of choice. In an FSK system, the bit stream of the source is mapped onto distinct frequencies. Decisions regarding which frequency is sent are based on energy detected at the output of narrowband filters. This incoherent energy detection eliminates the task of carrier-phase tracking at the receiver, but it does not solve the problem of multipath. To overcome ISI, incoherent systems insert guard time between successive tones to ensure that all reverberations expire before each subsequent pulse is received. To combat frequency-selective fading on any single tone, typically several tones that carry the same information are simultaneously transmitted. These different frequencies must be separated by more than the coherence bandwidth to ensure independent fading. The requirements on guard time and frequency spacing together lead to a low available data rate, typically
below 2500bit/s, and to a low spectral efficiency [13]. Despite this fact, incoherent FSK is a good solution for applications where moderate data rates and robust performance are required. Therefore, these methods are still receiving attention. A representative example for an FSK-based communication link that uses incoherent detection is the digital acoustic telemetry system (DATS) [4], which offers data rates up to 1200bit/s. The DATS transmitter consists of a digitally controlled oscillator that can simultaneously generate up to 16 tones in the frequency range between 45 and 55 kHz. To allow for Doppler tracking and coarse time synchronisation at the receiver, a pilot tone at 60 kHz and a short pulse at the beginning of each data block, centred at 30 kHz, are additionally transmitted. At the receiver, the data is quadrature demodulated and Fourier transformed using the fast Fourier transform (FFT) algorithm, with the FFT outputs being the statistics supplied to the decoder.
23.3.2
Coherent digital receivers
With the goal of increasing the bandwidth efficiency of an underwater acoustic communication system, research focus since the early 1990s has shifted towards phase-coherent modulation techniques. Although there has been a number of phasecoherent systems that strive to avoid or suppress intersymbol interference [13, 22], more sophisticated systems allow for ISI in the received signal to achieve higher data rates. Therefore, these systems have to employ special signal processing techniques to compensate for the intersymbol interference. The use of spatial arrays of receivers, which we refer to as multichannel receivers, can significantly improve the demodulation capabilities of communication systems. If the fluctuations in the channels from the transmitter to different sensors are not independent, the coherent spatial structure of the received signal can be exploited. Array processing is then typically used for multipath suppression employing traditional beamforming methods. Such a system has been investigated in References 11 and 25. The front-end beamformer uses a least mean square (LMS) algorithm to adaptively steer nulls in the direction of surface or bottom reflected waves. It has been found, however, that the proposed technique is more effective at shorter ranges with difficulties arising as the range increases relative to depth [25]. To compensate for the residual ISI, the use of a single-channel decision-feedback equaliser has therefore been suggested to complement the performance of the beamformer. The system has been tested in shallow water at lOkbit/s showing an estimated bit error rate of 10~2 without, and 10~3 with the equaliser. If the receivers are widely separated so that the fluctuations in the channel from the transmitter to one receiver are independent of those to another receiver, diversity techniques can be exploited to enable the system to combat fading. This approach attempts to capture all of the signal energy that has propagated from the transmitter to the receivers. A technique based on simultaneous multichannel equalisation, spatial diversity combining and synchronisation has been presented in Reference 23. All sensor output signals are processed by the multichannel, fractionally-spaced feedforward section of a decision-directed equaliser [18] and are phase corrected. After
coherent combining, the intersymbol interference resulting from previously transmitted symbols is cancelled in the feedback section of the equaliser. The receiver parameters that are adaptively adjusted are the tap weights of the feed-forward filters, the carrier-phase estimates and the tap weights of the feedback filter. For the joint adaptation of all these parameters, the difference between the estimated data symbol and its true value, which is assumed to be known to the receiver during training, is used. This joint optimisation of the combiner and the equaliser usually leads to better results compared with the separately adjusted beamformer and equaliser structure. Long-range transmission experiments at rates between 0.6 and 3 kbit/s have been conducted in both deep- and shallow-water environments, leading to estimated bit error rates below 10~4. Although the receiver structure of References 23 and 24 is canonical nowadays, its major drawback is the large training overhead necessary for adapting the equaliser filter coefficients and the carrier-phase recovery unit, especially in rapidly timevarying environments. A fundamental problem also arises from new applications like underwater acoustic communication networks, which are the focus of current research. It is inefficient to retrain all receivers if a new node joins the network or if an existing receiver temporarily loses convergence. Therefore, receiver architectures that allow for efficient blind space-time adaptive processing techniques to equalise communication channels are of current interest. The notion blind means that no training symbols periodically inserted into the data stream are used to adjust the receiver to the current channel conditions. This task shall be performed solely based on the received signal and some known (statistical) properties of the transmitted symbol sequence a(n), e.g. certain moments like the variance a\ or the kurtosis y%. Suitable algorithms will be the topic of the following sections.
23.4
Spatial-temporal receiver architecture
Starting from the model of a linearly distorting communication channel plus additive noise, a fully digital receiver structure is presented in the following. The proposed receiver will be capable of simultaneously processing the signals received at a group of sensors (hydrophones), thus allowing for efficient space-time processing. For the interested reader, there is a vast body of literature on digital communication systems, e.g. Reference 18, and synchronisation techniques in particular, e.g. Reference 16.
23.4.1
Communication over channels with ISI
The model of a digital communication link that is affected by intersymbol interference and additive noise is shown in Figure 23.2. All signals and systems are represented in the complex baseband [18], making the model independent of a specific choice for the carrier frequency a>c. The transmitted symbols a(n) are drawn from a finite alphabet A and are modelled by independent and identically distributed (HD) random variables. To transmit the discrete-time sequence a(n) over a channel with continuous-time impulse response
receiver
Figure 23.2
Model of a communication link with ISI and additive noise
hc(t), the signal: OO
s(t)= J2 a(n)gT(t-nT)
(23.1)
« = — OO
is generated, applying an element of a(n) every T seconds to the transmit filter gr(t). The time interval T is called the symbol duration. The channel is represented by a linear filter with impulse response hc(t) and additive noise w O), which is usually modelled as a stationary Gaussian process. In Figure 23.2, for simplicity neither an offset 8 between the local clocks at the transmitter and the receiver nor a possible carrier-phase offset 6 due to deviations from the nominal carrier frequency coc are taken into account. Including both effects, the signal at the input to the baseband receiver system is given as: OO
r(t) = J^
a{
^ 80 ~ sT ~ nT^ QJm + W(O
(23.2)
n——oo
Here, the impulse response g(t) comprises the effects of the cascade formed by the transmit filter and the possibly slowly varying channel impulse response.
23.4.2 Multichannel digital receiver Signals r^v\t), 1 < v < Nc, received at Nc sensors that are well separated in space are typically subject to independent distortion. Thus, the probability that all sensor output signals simultaneously undergo a deep fade is low, so that proper combining of all available signals improves the quality of transmit symbol sequence recovery. In 1992, Balaban and SaIz [2] derived the space-time receiver structure optimal in a minimum mean-squared error (MMSE) sense for the case of known transmission channels from one source to the Nc sensors assuming perfect time and carrier-phase synchronisation. It consists of a bank of matched filters g^p(t), each matched to the impulse response of the cascade of the common transmit filter gj (t) and the individual transmission channel from the source to the respective sensor v, and the additive noise. The signals at the output of the matched filters are combined into one signal that is subsequently sampled at multiples of the symbol interval T. The discrete-time signal is then equalised with the help of a transversal filter, possibly of infinite order. Finally, an estimate a(n) of the transmitted symbol sequence is obtained at the output of a decision device which utilises the finite alphabet property of the underlying signal constellation. This structure is summarised in Figure 23.3.
equalizer
Figure 23.3
decision device
Optimal space-time receiver structure without explicit synchronisation units timing recovery
multichannel equalisation and carrier-phase recovery
Figure 23.4
Multichannel receiver front-end, synchronisation and equalisation
To perform its primary task of detecting the transmitted symbol sequence, the receiver must additionally estimate the synchronisation parameters (e, 0) and compensate for their influence on the received signal. Furthermore, the structure in Figure 23.3 implies that all channel impulse responses are known - an assumption not fulfilled in typical communication scenarios. Therefore, an alternative all-digital receiver structure is considered [16], whose multichannel extension is shown in Figure 23.4. The signals impinging at the sensors 1 to Nc are first converted down to baseband using the nominal carrier frequency and are band-limited using the lowpass filter gR(t). The band-limited baseband signals are subsequently sampled at multiples of Ts, which is typically a fraction of the symbol interval T. Bandwidth of the lowpass filter and sampling rate I/Ts have to be chosen such that the sampling theorem is fulfilled for the desired signal component contained in r^v\t) [16]. Instead of individual matched filters in each subchannel, only the common component matched to the known transmit filter gr(t) is realised in gMF(&)5 leaving the
remaining channel-dependent part to later processing steps. Prior to further processing, it is, however, advantageous to adjust the average amplitude dynamics in each subchannel to a common level by using adaptive gain control (AGC). Then, the normalised offset e between the timing grids of the transmitter and the receiver is determined and compensated for using digital interpolation techniques characterised by the interpolator h\nt(t) and downsampling. A detailed treatment of suitable algorithms for the application at hand is beyond the scope of this chapter and the reader is referred to References 15 and 16. The outputs of the timing recovery unit are signals x (y) (/), 1 < v < Nc, at an intermediate rate 1/T1, with 1/7) = p/T, p e N. Thus, p represents the oversampling with respect to the symbol duration T. The signals x^v\l) are input to a processing unit performing joint multichannel equalisation and carrier-phase recovery. Possible algorithms for efficient blind space-time adaptive processing that are able to recover the transmitted symbol sequence without explicit training will be treated in detail in the following sections.
23.4.3
Signal model
The transmission systems from the source to the input of the Nc intermediate rate sampling units are characterised by impulse responses h^v\t), 1 < v < Nc- The input sequences x^ (/) to the multichannel equaliser are then given as: (23.3)
with w^v\l) representing the additive noise component of each subchannel. For the model (23.3), the following assumptions are made: The physical channels h^v\t) are approximated as finite impulse response filters with a maximal length of Lh symbol intervals. Shorter impulse responses are zero padded to reach Lf1. 2 For arbitrary oversampling factors /?, the additive noise processes w^v\l) are assumed to be white Gaussian noise of equal power o\, being uncorrelated from sensor to sensor. 3 The carrier-phase offset O^ (t) of each subchannel is approximated by a first-order polynomial: 1
6{v\t) = Acot + 0^\
1 < v < Nc
(23.4)
The term O^ represents a constant phase offset that may vary from subchannel to subchannel. The constant carrier frequency offset Aco, however, is assumed equal to all subchannels. With these assumptions, the sequences x^v\l), 1 < v < Nc, are jointly cyclostationary with period p [9]. In the following, the scalar processes are summarised into a single vector process that turns out to be wide-sense stationary. As a first step, the intermediate rate sequences x^v\l) are decomposed into p symbol rate sequences x\ (n), 0 < i < p, in the following manner
(polyphase decomposition): x^\n)=x^\np + i)
(23.5) n =v
This defines p virtual channels h\v\nT) := h^v\iT'/p+nT) per physical subchannel v that are corrupted by additive noise processes w\ {nT) := w^v\iT/p + nT). Gathering all samples per symbol interval into the (p • A^c)-variate vector:
x(«) = [x$\n), • • • ,x^c)(n),.. .,*£,(»),.. .,xl^(n)f
(23.6)
and defining the (p • Nc) x (p • Nc) dimensional unitary diagonal matrix:
(23.7) leads to the vector model: (23.8)
where h(n) and w(rc) are defined equivalent to equation (23.6). As long as p • Nc > 1, this vector model results from oversampling in space or time, allowing a compact and efficient representation of joint space-time signal processing. Additionally, the vector: h = [hr(0),...,hr(L^-l)f
(23.9)
is introduced, summarising all coefficients of the Nc impulse responses. Stacking M consecutive snapshots \(n) to \(n — M + 1) into one (p • Nc • Af)variate vector: xM(n) = [xT(n),..., xT(n - M + l ) f
(23.10)
the matrix model: XAf(/i) = ejnAa)T GM TM(h) a L / r + M -i(") + wM(n)
(23.11)
follows immediately. In equation (23.11), O M denotes the (p- Nc M) x (/? • Nc M) dimensional unitary diagonal matrix: 0 M = diag(0, Q-jAa)T 6, . . . , Q-JW-I)AvT 0)
( 2 31 2 )
TM(H) the (p • Nc • M) x (Lh H- M — 1) dimensional block-Toeplitz convolution matrix:
(
h(0)
h(l)
...
h(La-l)
0
0 : 0
h(0) : ...
h(l)
... . h(0)
ML/,-1) . h(l)
0
...
0
\
... 0 . : • • • h(Lh - I)/ (23.13)
2LLh+M-\(n) the (Lh + M — l)-variate vector: *Lh+M-\(n) = [a(n),... ,a(n - Lh - M + 2 ) ] r (23.14) of information-bearing symbols, and the vector WM(H) is defined equivalent to equation (23.10). Using the special structure of the vectors &Lh+M-\(n) and W M ( « ) , it readily follows that the covariance E {XM(« + Arc) x^(n)} is a function of An only. Hence, the vector process x(n) turns out to be wide-sense stationary.
23.4.4 Multichannel equalisation According to Figure 23.4, the Nc intermediate rate sequences x^v\l) enter a multichannel equaliser, which is composed of a bank of Nc - possibly fractionally-spaced equalisers. Each filter acts on one sensor output sequence as shown in Figure 23.5, but typically all filters are jointly optimised to enhance efficiency. For simplicity, a common length of Lf symbol intervals for all filters is assumed. Introducing the polyphase decomposition f[v\n) = f^v\np + i), 0 < / < /?, similar to equation (23.5), the symbol rate output y^v\n) of a single physical channel v is given as: (23.15) In order to coherently combine all sequences y^l\n) to y^Nc\n), a phase offset compensation has to be performed in all physical subchannels. If the phase offsets
carrier-phase recovery
Figure 23.5
decision device
Multichannel equalisation and carrier-phase recovery
show enough correlation, their compensation can be performed after subchannel combining [23]. This is also shown in Figure 23.5. Using equation (23.15) and setting M = Lf, the combined output of the multichannel equaliser may be written as:
(23.16) The (p • A^c)-variate vectors f(0) to f ( L / — 1) are constructed corresponding to equation (23.6) and are summarised in the (p • Nc • L/)-variate vector: fL/ = [ f r ( 0 ) , . . . , f r ( L / - l ) ] r
(23.17)
Neglecting for the moment the influence of additive noise and of a carrier-phase offset, the output sequence y(n) is given as: y(n) = f[fTLf(h)aLh+Lf-i(n)
(23.18)
The transmission of symbols a(n) to the output y(n) is thus completely characterised by the coefficients of the (Lf1 +Lf — l)-variate vector: g = Tl1(M) fLf
(23.19)
Without additive noise being present, it is well known that an optimal equaliser should completely cancel ISI [18], eventually allowing for a constant delay 0 < no < Lh + Lf — 2. This zero-forcing (ZF) equalisation requires: Tlf(h)fLf,no = In^1
(23.20)
with the (Lh +Lf- l)-variate vector lWo+i being all zero except for the (no + l)th element, which reads one. Equation (23.20) can be solved for the unknown equaliser coefficients if the matrix Tif(h) has full column rank. This requirement is met if the so-called length-and-zero conditions are fulfilled [7]: 1 The length M of the observation window satisfies M > Lh — 1. 2 The components of the vectorial Z-transform H(z) := ^2n^o ^(n^ z~n common zeros.
nave no
Therefore, ZF equalisation is feasible with a finite-length multichannel equaliser. This fact was first noticed by Slock [21].
23.5
Multichannel constant modulus algorithm
The conventional equalisation of communication channels employs transmitted symbols as a reference, which are also known to the receiver. The signal at the output of the equaliser is compared with the reference symbols to extract an error signal. The criterion for adaptation of the equaliser filter coefficients is then the minimisation of the expected power of this error signal, with the actual minimisation step being usually performed recursively using an LMS algorithm or a recursive least-squares (RLS) algorithm [18]. The first blind equalisation algorithms proposed in the literature tried to mimic this approach: the coefficients of the equaliser filter are determined by optimisation of a cost function, which, however, is not derived from an Euclidean measure of distance. For the recursive optimisation of such a generalised cost function, usually an LMS-like stochastic gradient descent (SGD) algorithm is used [7]. The most prominent example from this class of algorithms is the constant modulus algorithm, developed by Treichler and Agee in 1983 [26]. In the following, an extension of their basic algorithm is treated that allows for efficient blind space-time adaptive processing [14].
23.5.1 Blind stochastic gradient descent algorithms The goal of blind equalisation is the design of adaptive algorithms that converge to the optimal ZF or MMSE equaliser setting without the assistance of training symbols. A common approach is based on the optimisation of generalised cost functions J(fLf), which can be written as:
J(fLf) = E{V(y(n))}
(23.21)
with *I> being a scalar, real-valued function of the signal: y(n)=f[fxLf(n)
(23.22)
at the output of the equaliser. The cost function should represent the actual amount of intersymbol interference. Thus, minima of J(fLf) also correspond to minima of the intersymbol interference. Without noise, the overall impulse response should therefore be: g = e ^ • l, 0 + i
(23.23)
allowing for a constant delay 0 < no < Lh+Lf—2, and a phase offset 0. A stochastic gradient descent algorithm may be used for iteratively optimising equation (23.21). Assuming that differentiation and expectation may be interchanged, the gradient of the cost function J(JLf) is given as:
(23.24)
where \jr denotes the derivative of ^ . Replacing the expected value in equation (23.24) by the current realisation, and substituting the iteration index by the time index - as
is common with stochastic approximation methods - the recursion: hf(n)
= fLf(n
- 1) - tif1r(f[fXLf№\tLf=fLf{n-\)X*Lf(n)
(23-25)
for the filter coefficients is found. The small positive step size [if determines the convergence behaviour of the recursion (23.25), which is initialised by fL/(0). Equilibria of the cost function (23.21) must satisfy: E(IKfZ7 x L / (n)) • x*Lf(n)} = 0
(23.26)
Since the cost functions of blind algorithms are usually complex non-linear functions of both the channel and equaliser parameters, equation: T L / (h) • E W ( g r a L / k + L / - i ( / i ) ) • *Lh+Lf-\(n)}
=0
(23.27)
is usually evaluated instead of equation (23.26). Thus, all minima of adaptive blind equalisers must satisfy: E{V(g r a L / j + L / _! W) • *Lh+Lf-\(n)} e N(TLf(M))
(23.28)
where Af (TLf (H)) denotes the nullspace of the matrix TL 7 (II). According to this representation, Ding [6] classifies all algorithm minima into two categories. Those minima g that satisfy the trivial nullspace condition: E(V(g r a L , + L / - i (n)) • a L , + L / _ i (n)} = 0
(23.29)
depend solely on the cost function and are named cost-dependent minima. Minima fLf, on the other hand, that satisfy the non-trivial nullspace condition of equation (23.27) and: E M f J 7 TLf QL) a u + L / _ i ( n ) ) • 2nh+Lf-x(n)} £ 0
(23.30)
also depend on the channel convolution matrix TL1 (h) and are referred to as lengthdependent minima. Because of analytical simplicity, most analysis of SGD blind equalisers was carried out for cost-dependent minima only. Stable length-dependent minima, however, typically represent local minima of blind equalisers. Although for T-spaced equalisers length-dependent minima only disappear if the corresponding filter exhibits a doubly infinite parameterisation, it is sufficient for fractionally-spaced equalisers to satisfy the length-and-zero conditions, as this makes the nullspace of T L / (h) trivial [7].
23.5.2
The constant modulus algorithm
In 1980, Godard [10] introduced a class of algorithms that are specified by the cost functions: (23.31)
indexed by the positive integer q, where: _ E{|a(n)l2^} For J-spaced equalisers, the special cost function for q = 2 was independently developed by Treichler and Agee [26] as the constant modulus algorithm (CMA) using the philosophy of property restoral. For input signals to the channel that have a constant modulus \a{n)\2 = R2, the CMA penalises output samples y(n) that do not have the desired constant modulus characteristics. This approach had been purely heuristic and was later justified by the convergence analysis performed by Foschini [8], who demonstrated that all cost-dependent minima of the CMA cost function were global minima corresponding to zero ISI. The multichannel version of the CMA regarded here is theoretically free of length-dependent minima as long as the lengthand-zero conditions are fulfilled, making this algorithm one of the most frequently used blind equalisation algorithms [12]. The gradient of the cost function (23.31) for q = 2 reads: VJG(fLf) = EKIvWI - R2)y(n)xlf(n)}
(23.33)
Introducing the a priori signal: y(n,n - 1) = f[f(n - l)xLf(n)
(23.34)
that is formed with the equaliser filter coefficients fif of the previous iteration step n — 1, the recursion: hf(n) =fLf(n - D - M/ (\y(n,n - 1)|2 - R2)y(n,n - l)v*Lf(n) =fLf(n - 1) - nfe(n,n
- l)x*Lf(n)
(23.35)
for the filter coefficients is obtained. In equation (23.35), the a priori error signal: e(n9n - 1) = (\y(n9n - 1)|2 - ^ 7 ^ 5 7 ) y{n9n - 1) \ E{\a(n)n/
(23.36)
is used as an abbreviation. To initialise the recursion, usually a single non-zero value I/Nc is placed close to the middle of the impulse response of each physical subchannel. This is referred to as centre-spike initialisation, a generalisation of the strategy that was already employed by Godard [10] in the single-channel case. So far, the influence of additive noise on the convergence behaviour of the CMA has been neglected. If additive noise is present, a ZF equaliser is no longer optimal but has to be replaced by a Wiener filter minimising the MMSE cost function: J(tLf) = E(If^ xLf(n) - a(n - no)\2}
(23.37)
for a given delay no [18]. It has been shown - see Reference 12 and the references therein - that the constant modulus cost function exhibits global minima in the vicinity of the optimal Wiener solution. However, the additive noise causes additional local
equaliser i{n)
decision device
DD-PLL CMA
Figure 23.6
Blind adaptive CMA equalisation with subsequent carrier-phase recovery unit
minima so that ill-convergence of the recursion (23.35) can be avoided only through proper initialisation. Length-dependent minima of the constant modulus cost function vanish if the length-and-zero conditions are met. It is, however, sufficient for the transfer functions of the subchannels to have almost common zeros to lead to additional local minima. Therefore, the proper initialisation becomes an important issue again. Beside a region where most of the energy is concentrated, impulse responses of typical communication channels show small pre- and post-cursors making the definition of an exact channel length impossible. As a rule of thumb, the length of the equaliser filter is chosen two to three times the length of the significant part of the channel impulse response, representing a compromise between equalisation capabilities on the one hand and misadjustment of the adaptive process on the other hand. It follows from the special form of the cost function (23.31) for q = 2 that the CMA is invariant to a possible carrier frequency offset Aco. Therefore, the equaliser parameter adaptation can occur independent of the operation of the carrier-phase recovery unit. This results in an already reconstructed signal constellation at the output of the equaliser, which rotates with a frequency Aco in the complex plane. This rotation can be compensated for by a subsequent decision-directed phase-locked loop as proposed in Reference 10. The resulting combined structure is summarised in Figure 23.6.
23.5.3 Experimental results Within the EC-funded project ROBLINKS, in spring 1999, a sea trial for underwater communication was conducted in the North Sea about 10 km off the Dutch coast. At the transmitting side, an almost omnidirectional acoustic source was deployed from the stern of a support ship and lowered to a depth of approximately 9 m. The total water depth in this shallow-water area is about 18 m. The signals were received by a vertical array consisting of Nc = 3 pairs of hydrophones, separated horizontally by 15 cm at depths of approximately 7,11 and 15m below the sea surface. The array was fixed to an oceanographic platform, which also hosted all recording facilities. The support ship anchored at various positions with distances of 1, 2, 5 and 10 km from the platform. In addition, the ship was sailing at a moderate speed during the last two
sensor
time, ms
Figure 23.7
Initial channel impulse responses
days to obtain measurements with a moving source. More details on the ROBLINKS experiments can be found in Reference 28. To examine the performance of the described receiver structure, OQPSKmodulated [18] communication data at a rate of 1560.67 bit/s that had been recorded while the ship moored at a distance of 2 km away from the platform was analysed. The carrier frequency was 3079 Hz and approximately 100 000 data bits were continuously transmitted. The signals received at three vertically displaced hydrophones of the array were jointly processed by the multichannel receiver with an oversampling factor /7 = 2. Prior to the information-bearing signal, a linear FM sweep of 200 ms duration spanning the frequency band from 1 to 5 kHz was transmitted to obtain initial information about the channel impulse responses. Figure 23.7 shows the results of matched filter analysis of the FM sweeps received at all six hydrophones. It is apparent that the delay spread of the channels is of the order of 5 to 7 ms, corresponding to 8 to 11 symbol intervals. The length of the adaptive equaliser has been chosen to Lf = 21 so that the filter spans approximately two times the length of the significant part of the channel impulse response. The step size /x/ of the CMA has been set to 2 • 10~2. The results obtained from applying the proposed receiver to the experimental data are depicted in Figure 23.8. The estimated bit error rate (BER) over time is shown in Figure 23.8a. Each bar corresponds to the number of erroneous bits relative to 500 transmitted symbols. As can be seen, an initial BER of six per cent is reduced to values below one per cent within seconds. For the remaining transmission time, however, a considerable error floor remains leading to an average BER of 0.5 per cent during the last 13 seconds. This slow convergence behaviour of the receiver is also reflected by the mean-square error (MSE), measured between the signals prior and after the decision device, which is shown in Figure 23.8b. Again, within seconds, the MSE reaches values of —9 dB, which is not further reduced in the sequel.
BER, % MSE, dB
Figure 23.8
Results of multichannel blind CMA equalisation a estimated BER b estimated MSE
Additional results may be found in Reference 35. They all suggest that the multichannel constant modulus algorithm is well suited for blind space-time adaptive equalisation of underwater acoustic communication channels. For low signal-to-noise ratios or severe linear distortions, however, it suffers from slow convergence due to saddle points of the underlying cost function. Therefore, to both assure and accelerate convergence to a global minimum, a model-based initialisation strategy has been pursued [30]. It replaces the simple centre-spike initialisation by a coarse estimate of the inverse channel impulse response obtained from a simple numerical propagation model that uses the basic environmental parameters usually available. In a second approach [34], blind non-linear equalisation techniques have been examined as their
superior behaviour in the case of trained equalisers is well known. As a further alternative, Section 23.6 introduces the multichannel Shalvi-Weinstein algorithm that is closely related to the CMA but exhibits far better convergence properties.
23.6
Super-exponential blind equalisation
In Reference 20, Shalvi and Weinstein have proposed a single-channel T-spaced algorithm for blind equalisation based on higher-order statistics. It is closely related to the constant modulus algorithm covered in the previous section and shares many of its positive attributes while exhibiting a super-exponential convergence rate [7]. The originally iterative T-spaced algorithm has also been converted to a sequential algorithm, allowing an adaptive implementation using empirical cumulants. An extension towards an iterative single-channel fractionally spaced equalisation algorithm has been presented by Ding [6]. Based on this approach, an algorithm for blind adaptive multichannel equalisation has been developed in Reference 32, which is presented in the following.
23.6.1 Iterative Shalvi- Weinstein algorithm Assume that a (p • Afc)-variate equaliser f (n), which spans Lf symbol intervals, is applied to the fractionally-spaced sensor output signal of equation (23.6), which we assume to be noise free for simplicity. According to equations (23.16) and (23.18), the output of the equaliser then is:
(23.38) The basic idea of the iterative super-exponential algorithm is to determine the equaliser coefficients fLf(n -f 1) at iteration step n + 1 such that the overall impulse response becomes: (23.39)
(23.40)
with q\ G N and [•]/ being the /th element of the vector in brackets. The vector g(n) of the combined impulse response of channel and equaliser has already been defined in equation (23.19), and the index n highlights the fact that the equaliser coefficients ^LfW available at iteration step n are used to compute g(n). It has been shown in
Reference 20 that such an algorithm would converge super-exponentially fast, which means exponential to the power, if the overall impulse response were accessible. As the overall impulse response is unavailable for adjustment, the algorithm has to be implemented based on the equaliser coefficients only. As the number of equaliser taps shall be finite, the equality requirement in equation (23.39) should be relaxed and the Euclidean distance between TT (h) iif (n + 1) and g(n + 1) should be minimised instead: (23.41)
This leads to the normal equation: (23.42) with the (p - Nc -Lf) x (p • Nc • Lf) dimensional matrix Tj* (h) T^ (h) being rank deficient with rank Ly1 + Lf — 1 if the length-and-zero conditions are met. The minimum norm solution of equation (23.42) is then obtained using the Moore-Penrose pseudo inverse [6]: (23.43) (23.44)
Observing that under the length-and-zero conditions, T ^ ( h ) (T^ (h)T^(h)) 1 " Tj* (h) = I, it should be evident that solving the least-squares problem of equation (23.41) together with the subsequent normalisation of equation (23.44) is equivalent to one step of the iteration (23.39) to (23.40). The algorithm in equations (23.43) and (23.44) is still implicit since both T^ (h) T[ (h) and T£ (h) g(n + 1) are unknown. In the following, we show how at each iteration cycle - both quantities can be expressed by joint cumulants of the input and the output of the equaliser. In the noise-free case and in the absence of a carrier-phase offset, the covariance matrix of XLf(n) is given by:
(23.45) so that (23.46)
with known variance a\. Using the linearity property of cumulants [17] together with the HD property of the symbol sequence a(n), the cumulant: cum(v(«) : q\',y*(n) : q\ - l;x*(n))
(23.47)
of order 2q\ can be formed. Comparing expression (23.47) with equation (23.39) shows that it corresponds to the multiplication of the first block row of T^ (h) with the vector g(n + 1) up to a constant factor. Extending this result leads to: cum(y(n) : q\;y*(n) : q\ - l;x*LAn)) T* (h) g(* + 1) = — — ^(23.48) L f cum(a(n) \q\\a*{n) : q\) It is assumed that the cumulant of order 2q\ of the symbol sequence a(n) required as denominator in equation (23.48) is known and non-zero. As in practice the remaining cumulants in equations (23.45) and (23.48) are unknown, they have to be replaced by sample estimates [20]. 23.6.2
Recursive Shalvi-Weinstein algorithm
The multichannel, fractionally-spaced super-exponential algorithm derived in Section 23.6.1 is an iterative batch algorithm that needs all available data for joint processing. In a time-varying scenario, however, this is not desirable since different data segments may correspond to different channel impulse responses. It is therefore necessary to obtain an adaptive algorithm that sequentially processes the data and is thus also capable of tracking channel variations. We therefore propose an intuitive adaptive algorithm for multichannel, fractionally-spaced, super-exponential blind equalisation that works purely sequentially and is capable of converging without any initial training [32]. In the following, the special value q\ = 2 is chosen. Then, according to equation (23.48), the expression T£ (h)g(n + 1) is determined by the fourth-order cumulant [17]:
(23.49) With the special choice of q\, the remaining constant factor cum (a(n) : q\;a*(n) : q\) in equation (23.48) reduces to the kurtosis y% of the sequence a(n). Equation (23.49) can be further modified: due to the normalisation step (23.40), we have:
E(IKn)I2) = al
(23.50)
and for circular-symmetric signal constellations typically encountered in digital communications: E{y2(n)} = 0
(23.51)
This leads to the simplified expression:
cum(y(n),y(n),yHn)^lf(n))
=E{(\y(n)\2 -2a2a)y(n)xlf(n)}
(23.52)
With the above result in mind, the quadratic cost function: (23.53) is formed and minimised at each time step n, with /3n representing a general weight function [20]. Calculation of the gradient of equation (23.53) with respect to fLf(n) leads to the normal equation:
(23.54) With a properly chosen weight function fin, it may be inferred from equations (23.45) and (23.52) that the two sides of equation (23.54) converge to T* (h) T^ (h) and Tj* (h) g(n + 1), respectively. Thus, the optimisation of the quadratic cost function (23.53) leads asymptotically to the normal equation (23.42) and therefore to the same solution fLf(n + 1) as in equation (23.43). So far the influence of additive noise has been neglected. It has been shown in Reference 31, however, that - under benign conditions - minimising the same cost function (23.53) formed from noisy quantities asymptotically leads to the optimal Wiener solution.
23.6.3 Adaptive implementation The equaliser coefficients iif (n) that minimise the quadratic cost function (23.53) at each processing step n can be recursively obtained by a stabilised RLS algorithm with proper initialisation. As in the derivation of the recursive single-channel algorithm in Reference 20, the weight function fin is replaced by a constant 0 < fi < 1, effectively allowing for tracking of time-varying scenarios. The resulting algorithm is summarised in Table 23.1, and a detailed derivation is found in Reference 31. The a priori error signal: (23.55)
Table 23.1 Adaptive multichannel Shalvi-Weinstein algorithm Input: xLf(n) Output: y(n) Parameters: 0 < p < I9S > 0 Initialisation: P(O) = 5"1I, f(0): 'centre-spike'
of the Shalvi-Weinstein algorithm equals the a priori error signal of the constant modulus algorithm as given in equation (23.36). This demonstrates the close relationship between both algorithms. However, comparing the update equation: (23.56) of the Shalvi-Weinstein algorithm with the corresponding equation (23.35) of the CMA shows the typical difference between an RLS algorithm and an LMS algorithm. Although in the latter case the current data vector x£ (n) determines the direction of correction, the gain vector k(n) of the RLS algorithm is obtained by more complex operations and also depends on prior data vectors. It is therefore assumed that the RLS algorithm leads to a better update estimation and thus a better convergence behaviour than does an SGD algorithm. Up to now, the influence of a carrier-phase offset has been neglected. From the formal correspondence between the constant modulus algorithm and the ShalviWeinstein algorithm it can be inferred that the latter is also invariant to a carrier-phase offset 0. Under the assumptions made, an already reconstructed signal constellation appears at the output of the equaliser, however rotating with a frequency Aco. This remaining rotation is again compensated for by a decision-directed phase-locked loop. The resulting structure is shown in Figure 23.9.
equaliser f(«)
decision device
DD-PLL SWA
Blind adaptive SWA equalisation with subsequent carrier-phase recovery unit
MSE, dB
BER, %
Figure 23.9
Figure 23.10
Results of multichannel blind SWA equalisation a estimated BER b estimated MSE
23.6.4 Experimental results To enable comparison with the corresponding results in Section 23.5.3 obtained with the multichannel CMA, the same data set is now analysed with the Shalvi-Weinstein algorithm. Furthermore, the number of sensors (Nc = 3) and the length of the equaliser filter (Lf — 21) are kept constant. The forgetting factor fi is chosen to 3 • 10~3. The thus obtained results are summarised in Figure 23.10. The estimated bit error rate is shown in Figure 23.10a. The comparison with Figure 23.8a shows the enhanced convergence properties of the proposed multichannel Shalvi-Weinstein algorithm: within one second the BER drops from an initial value of approximately three per cent to virtually zero. This rapid convergence is also reflected by the meansquare error shown in Figure 23.10b. Again, within a few seconds, the MSE reaches values of—14 dB and below. Further results may be found in Reference 3 3. Compared with the results obtained in Reference 35, they all prove that the proposed multichannel fractionally spaced super-exponential algorithm outperforms the previously used multichannel constant-modulus algorithm.
23.7
Concluding remarks
In this chapter, we have focused on space-time adaptive processing in the context of blind equalisation of underwater acoustic channels. To justify the great efforts in signal processing necessary to establish a reliable underwater communication link, special emphasis has been put on the characterisation of the transmission medium. The typical underwater acoustic channel has been found to be overspread due to rapid temporal and spectral variations, therefore calling for special signal processing techniques. Their development with time has been summarised in a brief history of underwater acoustic communication, highlighting only key contributions. Subsequently, a spatial-temporal receiver architecture has been introduced that efficiently allows for joint processing of signals received on many sensors. Based on this structure, a signal model suitable for the description of blind space-time adaptive equalisation algorithms has been developed. Then, the well known constant modulus algorithm for blind channel equalisation has been treated as an example for the class of stochastic gradient descent algorithms. The results obtained with measured shallow water communication data have demonstrated the general applicability of this algorithm but have also shown its slow convergence as a major drawback. As an alternative, an adaptive multichannel version of the Shalvi-Weinstein algorithm for blind equalisation has been derived. This algorithm is closely related to the constant modulus algorithm but shows a better convergence behaviour. This theoretically obtained result has been exemplarily verified by analysing the same data set as with the CMA. Although in this chapter we have presented two successful strategies for blind spacetime adaptive equalisation with applications to underwater acoustic communication, blind equalisation is still an active area of research. In particular, for many proposed algorithms the bridge between theory and application is still lacking, constituting a rich set of research challenges for many years to come.
References 1 BAGGEROER, A. B.: 'Acoustic telemetry - an overview', IEEEJ. Ocean. Eng., October 1984, 9, (4), pp. 229-235 2 BALABAN, P. and SALZ, J.: 'Optimum diversity combining and equalization in digital data transmission with applications to cellular mobile radio part I: theoretical considerations', IEEE Trans. Commun., May 1992, 40, (5), pp. 885-894 3 BREKHOVSKIKH, L. M. and LYSANOV, YU. P.: 'Fundamentals of ocean acoustics' (Springer-Verlag, New York, 1991, 2nd edn.) 4 CATIPOVIC, J., BAGGEROER, A. B., VON DER HEYDT, K., and KOELSCH, D.: 'Design and performance analysis of a digital acoustic telemetry system for the short range underwater channel', IEEEJ. Ocean. Eng., October 1984, 9, (4), pp. 242-252 5 CLAY, C. S. and MEDWIN, H.: 'Acoustical oceanography' (John Wiley & Sons, New York, 1977) 6 DING, Z.: 'On convergence analysis of fractionally spaced adaptive blind equalisers', IEEE Trans. Signal Process., March 1997, 45, (3), pp. 650-657 7 DING, Z. and LI, Y: 'Blind equalization and identification' (Marcel Dekker, New York, 2001) 8 FOSCHINI, G. J.: 'Equalizing without altering or detecting data', AT&T Tech. J, October 1985, 64, (8), pp. 1885-1911 9 GARDNER, W. A.:' Introduction to random processes with applications to signals and systems' (McGraw-Hill, Inc., New York, 1990, 2nd edn.) 10 GODARD, D. N.: 'Self-recovering equalization and carrier tracking in twodimensional data communication systems', IEEE Trans. Commun., November 1980, 28, (11), pp. 1867-1875 11 HOWE, G. S., TARBIT, P., HINTON, 0., SHARIF, B., and ADAMS, A.: 'Subsea acoustic remote communications utilising an adaptive receiving beam former for multipath suppression'. Proceedings of Oceans 1994 MTS/IEEE conference, Brest, September 1994, pp. 313-316 12 JOHNSON, C. R., Jr., SCHNITER, P., ENDRES, TJ., BEHM, J.D., BROWN, D.R., and CASAS, R. A.: 'Blind equalization using the constant modulus criterion: a review', Proc. IEEE, October 1998, 86, (10), pp. 1927-1950 13 KILFOYLE, D. B. and BAGGEROER, A. B: 'The state of the art in underwater acoustic telemetry', IEEEJ. Ocean. Eng, January 2000, 25, (1), pp. 4-27 14 MAYRARGUE, S.:' A blind spatio-temporal equalizer for a radio-mobile channel using the constant modulus algorithm (cma)'. Proceedings of the IEEE international conference on Acoustics, speech and signal processing, ICASSP 1994, Adelaide, April 1994, 3, pp. 317-320 15 MENGALI, U. and D'ANDREA, A. N.: 'Synchronization techniques for digital receivers' (Plenum Press, New York, 1997) 16 MEYR, H., MOENECLAEY, M., and FECHTEL, S. A.: 'Digital communication receivers' (John Wiley & Sons, New York, 1998)
17 PORAT, B.: 'Digital processing of random signals' (Prentice Hall, Englewood Cliffs, 1994) 18 PROAKIS, J. G.: 'Digital communications' (McGraw-Hill, Inc., Boston, 1995, 3rd edn.) 19 RAPPAPORT, T. S.: 'Wireless communications principles & practice' (PrenticeHall, Upper Saddle River, 1996) 20 SHALVI, O. and WEINSTEIN, E.: 'Super-exponential methods for blind decoloration', IEEE Trans. Inf. Theory, March 1993 39, (2), pp. 504-519 21 SLOCK, D. T. M.: 'Blind fractionally spaced equalization, perfect reconstruction filter banks and multichannel linear prediction', Proceedings of IEEE international conference on Acoustics, speech and signal processing, ICASSP 1994, Adelaide, April 1994, IV, pp. 585-588 22 STOJANOVIC, M.: 'Recent advances in high-speed underwater acoustic communications', IEEEJ. Ocean. Eng., April 1996, 21, (2), pp. 125-136 23 STOJANOVIC, M., CATIPOVIC, J. A., and PROAKIS, J. G.: 'Adaptive multichannel combining and equalisation for underwater acoustic communications', J. Acoustic. Soc. Am., September 1993, 94, (3), pp. 1621-1631 24 STOJANOVIC, M., CATIPOVIC, J. A., and PROAKIS, J. G.: 'Phase-coherent digital communications for underwater acoustic channels', IEEEJ. Ocean. Eng., January 1994,19, (1), pp. 100-111 25 TARBIT, P. S. D., HOWE, G., HINTON, 0., ADAMS, A., and SHARIF, B.: 'Development of a real-time adaptive equalizer for a high-rate underwater acoustic communicating link'. Proceedings of Oceans 1994 MTS/IEEE conference, Brest, September 1994, pp. 307-312 26 TREICHLER J. R. and AGEE, B. G.: 'A new approach to multipath correction of constant modulus signals', IEEE Trans. Acoust. Speech Signal Process., April 1983, 31, (2), pp. 459-472 27 URICK, R. J.: 'Principles of underwater sound' (Peninsula Publishing, Los Altos, 1986, 3rd edn.) 28 VAN GIJZEN, M. B., VAN WALREE, P. A., CANO, D., PASSERIEUX, J.-M., WALDHORST, A., WEBER, R., and MAILLARD, C : 'The roblinks underwater acoustic communication experiments'. Proceedings of the fifth European conference on Underwater acoustics, ECUA 2000, Lyon, July 2000, pp. 555-560 29 VAN TREES, H. L.: 'Detection, estimation, and modulation theory, part III' (John Wiley & Sons, New York, 1971) 30 WALDHORST, A., WEBER, R., SCHULZ, R, and BOHME, J. R: 'Initializing blind adaptive equalizers using a-priori environmental knowledge'. Proceedings of the sixth European conference on Underwater acoustics, ECUA 2002, Gdansk, June 2002, pp. 555-560 31 WEBER, R.: Mehrkanalige referenzdatenfreie Entzerrer fur die akustische Dateniibertragung in seichtem Wasser. Dissertation, Ruhr-Universitat Bochum, Shaker Verlag, Aachen, 2003 32 WEBER, R. and BOHME, J. R: 'Adaptive super-exponential methods for blind multichannel equalization'. Proceedings of the Second IEEE Sensor array and multichannel signal processing workshop, Rosslyn, August 2002
33 WEBER, R., SCHULZ, R, WALDHORST, A., and BOHME, J. R: 'Adaptive multichannel super-exponential blind equalization of underwater acoustic channels' . Proceedings of Oceans 2002 MTS/IEEE conference, Biloxi, October 2002, pp. 2429-2437 34 WEBER, R., SCHULZ, R, WALDHORST, A., and BOHME, J. R: 'Blind decision-feedback equalization of shallow water acoustic channels'. Proceedings of the 6th European conference on Underwater acoustics, ECUA 2002, Gdansk, June 2002, pp. 549-554 35 WEBER, R., WALDHORST, A., SCHULZ, R, and BOHME, J. R: 'Blind receivers for MSK signals transmitted through shallow water'. Proceedings of Oceans 2001 MTS/IEEE conference, Honolulu, November 2001, pp. 2183-2190
Chapter 24
Reduced-rank interference suppression and equalisation for GPS and downlink CDMA Wilbur L. Myrick and Michael D. Zoltowski
24.1
Reduced-rank interference suppression and equalisation
Minimum mean square error (MMSE) based interference suppression and equalisation techniques continue to interest researchers seeking to improve detection performance while minimising computational complexity [4-7,17]. Both MMSE equalisation and interference suppression filters have been applied to a variety of interfering environments requiring rapid adaptation to mitigate non-stationary interference. Rapid adaptation requires the MMSE filter to converge in a short period of time at the expense of minimising computational complexity. In recent years, reduced-rank signal processing has been discovered as being a possible solution which provides the rapid convergence that is necessary to suppress non-stationary interference [3, 12, 13]. This chapter explores the use of reduced-rank MMSE processing in two types of interference environment. The first environment involves multiple access interference (MAI) caused by multipath delay spread of Walsh-Hadamard codes in a code division multiple access (CDMA) cellular system [3]. The second environment involves both intentional and unintentional jamming found in the global positioning system (GPS) receiver band [15, 16].
24.1.1 Motivation for reduced-rank MMSE processing The need for utilising reduced-rank MMSE filtering in non-stationary environments is motivated by the need to increase convergence speed when solving a set of Wiener filter equations involving an optimisation criteria that seeks to minimise interference. For example, in equalisation the length of an MMSE equaliser should be at least equal to the channel length to achieve the desired performance, and longer equalisers yield better error rates [18]. Hence, equalisers in the high-speed CDMA downlink will by necessity span many chips in length with a corresponding large number of degrees
of freedom. In order to reduce the number of filter coefficients to be estimated, the received signal vector may be projected onto a lower dimensional subspace where the Wiener filter is constrained to exist. This dramatically increases the speed of convergence for adaptive methods, if the subspace is chosen properly, but the overall MMSE for the reduced-rank filter may be higher than the MMSE for the full-rank filter. The example involving the mitigation of jammers targeted at GPS receivers addresses the issue of large space-time Wiener filters associated with joint spacetime processors. Space-time processors are able to effectively null both wideband and narrowband jammers by leveraging degrees of freedom available in two dimensions. This implies that the space-time processor has the added advantage of maintaining as many degrees of freedom as possible compared with temporal or spatial processing alone. However, a disadvantage of space-time processing is the large dimensionality of the space-time weight vector. In the GPS receiver environment, narrowband jammers may have the ability to change frequencies dynamically. In addition, the rapid dynamics of the aircraft during manoeuvring can cause arrival angles of wideband jammers to vary rapidly. Thus, an interference suppression algorithm will only be effective if it can rapidly converge. This chapter discusses the application of the multistage Wiener filter (MSWF) algorithm to both CDMA downlink MMSE processing as well as space-time power minimisation requiring rapid adaptation. For both CDMA and GPS jammer interference suppression the most widely known reducedrank techniques in signal processing are the principal components method [11] and the cross spectral method [11], which are based on the eigendecomposition of the covariance matrix associated with the observed statistics. The next sections develop the framework of reduced-rank processing as it applies to the MSWF and its application to interference suppression. Eventually this development will provide enough insight into understanding the differences between the MSWF and other reduced-rank processing techniques.
24.1.2
Understanding the multistage Wiener filter
The structure of the MSWF is depicted in Figure 24.1. At each stage, a rank one basis is selected based on mutual correlation between a desired scalar signal, dn-\ and the observed (N — 1) x 1 vector signal xn-\ where n = \,...,N — 1. N defines the dimension of the MSWF weight vector accounting for both dn-\ and Xn-I as time series data or a combination of multisensor and time series data. In either case, the data is initially supplied to the MSWF as do and Xo. An analysis filter bank is then formed based on the sequence of cross correlation vectors that are estimated using the desired and filtered vector signals at each stage. Goldstein and Reed [8-10] first formulated the concept of utilising mutual correlation information at consecutive stages to determine a unique subspace that maximises the signal-tointerference noise ratio (SINR) associated with the desired signal do. This innovative analysis and synthesis filter bank provides a unique way to tune into the subspace partition that provides the best SINR based on the interference environment and data collection platform. Much research has gone into determining how one can find the best way to tune the MSWF to maximise SINR based on statistical analysis
analysis filter bank
Figure 24.1
synthesis filter bank
Structure of successive stages of the multistage Wiener filter
of the data. One notable feature about the MSWF is that it can always replace the standard Wiener filter while outperforming the standard Wiener filter in terms of SINR when a lack of data samples, e.g. caused by non-stationary interference, forces the standard Wiener filter solution to diverge from maximising the SINR. Hence, with reduced-rank processing based on the MSWF an inherent robustness is achieved as well as computational reduction compared with eigenvector methods. Although the computational reduction is not necessarily obvious from Figure 24.1, an alternative structure will provide further insight into justifying such a statement. The observed vector process in Figure 24.1 can be thought of as being decomposed by a sequence of (N — 1) x 1 nested filters pi, P2, • •, PiV-2 where D < N — 2 such that D is defined to be the order of the MSWF. Now pn can be found recursively using: Pw =
!№„_, ^1]II
(24
'1}
and dn = p^\n-\. The input process to the (n + l)th stage is xn[k] = Bnxn-\[k], where Bn is an (N — 1) x (N-I) blocking matrix such that Bnpn = 0. Now Bn does not necessarily have to be a square blocking matrix. A second formulation of the MSWF discussed in a later section describes an alternate formulation of the blocking matrices yielding blocking matrices which reduce in dimension as more stages are selected in the MSWF. The original formulation of the MSWF as found in Reference 8 discusses various methods to determine this blocking matrix as well. The outputs of the various stages are linearly combined via the scalar weights w\9W2,...9wp-\, chosen so that the mean square error at each stage is minimised. If the decomposition is carried out for the full N stages, then the MSWF is exactly equivalent to the full-rank classical Wiener filter [8]. The filter-bank structure whitens the error residue at each stage, and compresses the coloured portion of the observed data subspace and hence it is optimal in terms of reducing the MSE for a given rank. The MSWF does not require any eigendecomposition or inversion of the covariance matrix, and so represents a significant reduction in complexity
Table 24.1 Basic MSWF algorithm initialisation:
where null(pn) implies an orthogonal projection onto pn
backward recursion:
over the full-rank Wiener solution and other reduced-rank techniques. This is very important for practical implementations, particularly if the rank one decomposition can be stopped after a few stages. The basic algorithm, based on Reference 8, is listed in Table 24.1. Notice that the desired signal at each stage, dn is the output of a length Af filter. A notable feature is that the first filter is orthogonal to filters of all the following stages, i.e.: p f p i =8n\,
n=
l,29...9N
8ni denotes the Kronecker delta function which is 1 for n = i and 0 otherwise. However, the filters p n ,n = 2,3,...,iV are not mutually orthogonal in general. Honig and Xiao [14]firstproposed choosing a projection matrix onto the subspace orthogonal to p n as the blocking matrix at each stage, hence retaining the length N of the filter and observed signal Xn [k] composed of both dn and Xn at time k forming an NxI vector. With this choice for the blocking matrix, T^ = [pi p2 . . . PD] forms an orthonormal basis for the reduced dimension subspace. Moreover, w is constrained to lie in the Krylov subspace spanned by ( r ^ R ^ r ^ R ^ r ^ , . . . ,Rj^." 1T^x) [14]. In this case:
*+1
=
II(I-P^)R,- 1 P ,II
(24>2)
where (24.3) pi = Tdx and R0 = R**.
Joham and Zoltowski [19] proved that this choice of the blocking matrix, i.e. Bn = I — p«p^ is optimum in terms of maximising the correlation between the scalar signals dn[k] and dn-\[k] at each stage. They developed a covariance-level order recursive form of the MSWF working within the Krylov subspace, in which the backwards recursion coefficients and hence the weight vector and the mean square error, may be updated at each stage via a simple recursion.
24.1.3 Lattice structure of the MSWF The blocking matrix is a very useful concept to develop and analyse the performance of the MSWF, but in practice there is no need to calculate or store these NxN matrices. A new reduced-complexity implementation was presented by Ricks and Goldstein in Reference 20 based on the following substitution: at the nth stage: (24.4)
(24.5)
This leads to the lattice-type structure for a £>-stage MSWF, as shown in Figure 24.2. This architecture has the benefit of being modular and scalable for hardware implementation, as well as being computationally more efficient than the structure depicted in Figure 24.1.
the filter can be truncated at any stage D
modular structure
Figure 24.2
MSWF as a lattice filter
24.1.4 MSWF related to Wiener-Hopffilter weights To give additional insight into the formulation of the MSWF, a derivation is now developed that relates the Wiener-Hopf filter weights to the MSWF weights applied to the space-time power minimisation problem. Our original problem dealt with reducing the large dimensionality of the space-time correlation matrix while minimising: Ji77Kh st hHSx = 1
(24.6) r
where ^i is a M x 1 vector, ^i = [ 1 , 0 , . . . , 0 ] , h is the NMxI full rank Wiener weight vector solution and K is the NMxNM correlation matrix of the NMxI observation vector containing both do and xo as illustrated in 24.1. N is defined as the number of taps behind each of the M antennas. An example using this solution is illustrated in a later section. The well known solution to equation (24.6) is: ^ 0 K h = Sx
(24.7)
found using the method of Lagrange multipliers with wo being a scalar. Knowing this solution, the h can be rewritten in terms of an orthogonal decomposition: h = Si - B 0 Ii x where Bo NMx(NM rewritten as:
(24.8) — 1) is chosen such that B^Si = 0 and equation (24.7) can be
WoK[Si - B 0 Ii x ] = Si
(24.9)
One can solve for K xx described earlier by multiplying both sides of equation (24.9) by B ^ yielding: K^hx
= ILdx 7
(24.10) an
7
where K xx = BQ KBO d Ktx — BQ K(SI. Let us define K xx = KXXl9 hx = hXl and kdx = k^xi where the subscript 1 indicates the stage of the MSWF. Notice that both KXXl and k ^ have been reduced in dimension, respectively. Repeating the orthogonal decomposition as above, the hXl solution can now be rewritten as: hXl = wi[ki-BihX2]
(24.11)
where w\ is a scalar Wiener filter parameter of the first stage, ki is a normalised kdXl and Bi (NM - l)x(NM - 2) is chosen such that B f ki = 0. This orthogonal decomposition is the first stage of the MSWF. Substituting equation (24.11) into equation (24.10) yields: wiKXXl[ki
- B I h x J = R^ 1
(24.12)
and solving for K ^ 2 by multiplying both sides of equation (24.12) by B f yields: KXX2hX2 = k^X2
(24.13)
where Kx^2 = BfK x ^ 1 Bi and k^^2 = B f K ^ 1 ki. Notice again that both K 0 J 2 and kdX2 have been reduced in dimension, respectively and represent the set of equations that must be solved at the second stage of the MSWF. Continuing this
type of orthogonal decomposition reduces the dimension of the corresponding covariance matrix and cross correlation vector at each stage allowing a generalisation of K 0 ;. = B^ 1 Kj 0 7 - 1 Bi-I and k^. = B ^ 1 K 0 7 - 1 k;_i. This facilitates the use of a second generalised equation of the orthogonal decomposition at any stage as: u>,IWk,- - B,hx.+1 ] - k*,
(24.14)
If both sides of equation (24.14) are multiplied by k ^ then equation (24.14) can be solved for u>; as: (24.15) indicating that tu;, the desired MSWF weight at the /th stage, can be calculated at any stage without calculating the wts of the successive stages. It is now possible to form a closed-form expression for the Wiener-Hopf filter weights based on the MSWF parameters generated at each stage of the decomposition. Using the above generalised orthogonal decomposition at each stage to the (NM- l)th stage, hx can be written as: hx = W\[k\ - Bi[W2Ek2 - B 2 [. . . [wNM-2[kNM-2
- BATM-2WiVM-ill . ..]]]] (24.16)
where Y?NM-I — ^x-(NM-2)- After some rearranging, a closed-form expression of hx can be written as: (24.17) If the truncation of at least the last stage occurs, hx is reduced to: (24.18) since WMM-\ = 0. This expression provides insight into the effects of truncation on the Wiener-Hopf filter weights since we can now rewrite the above expression as: (24.19) where the Uhstage < NM — 1. This reduced-rank interpretation of the MSWF can be defined as the truncation of stages after the /th stage, thus the more stages that are added to h^, the better an estimate of hx. Unfortunately this is only true if hx is being estimated with an adequate amount of sample support. If hx is being estimated in a nonstationary interference environment then additional stages could in fact degrade the filter performance. An essential property of the MSWF is that its subspace formation (based on k ^ at each stage) extracts the most correlated information necessary to
estimate hx at the initial stages of the filter. This eliminates the need to fully decompose the space-time correlation matrix to find a solution associated with the specified set of Wiener-Hopf equations. In fact, these equations may not yield a well defined solution due to the lack of sample support. The next sections illustrate how we apply the MSWF to two types of interference environments involving a CDMA cellular communication system and mitigation of jammers at a GPS receiver.
24.2
Application of MSWF to CDMA downlink
24.2.1 Introduction Mobile units in current CDMA cellular systems use a Rake receiver, which is a maximal-ratio combiner and can be interpreted as a bank of filters matched to the channel that combines the energy from multiple paths [I]. The Rake filter is the optimum (maximum likelihood) demodulator when there is no interference from other users [I]. In IS-95 and the proposed third generation (3G) systems, orthogonal Walsh-Hadamard codes are used to spread the different users' data symbols on the forward link. At the downlink receiver, after removing the coherent carrier, the signal is multiplied by the synchronised basestation long code and then decorrelated with the desired user's spreading code. In a flat fading environment (i.e. transmitter spectral signal characteristics are preserved at the receiver), this will ensure that any interference due to other users in the same cell is eliminated. However, in urban wireless systems, the fading is often not flat and the orthogonality of the underlying Walsh-Hadamard codes is destroyed at the receiver, resulting in multiple access interference (MAI) at the receiver. Furthermore, if the multipath delay spread is a significant portion of the symbol period, there will be considerable intersymbol interference (ISI) in addition to the MAI. There are also major interference issues if the mobile unit is near the edge of a cell and is receiving significant out-of-cell transmission, regardless of whether the fading is flat or not. In such environments, the Rake receiver is suboptimal, because it inherently treats MAI as uncorrelated noise. The multipath-induced MAI also necessitates very tight power control. When many or all users are active in the cell, the bit error rate curve of the standard Rake receiver flattens out at higher SNR [2]. Thus, in situations where the number of active users approaches the spreading gain, the Rake receiver does not provide adequate performance. In this section, we present reduced-rank, chip-level (i.e. at spreading rate) MMSE estimators based on the MSWF algorithm. The main goal of this section is to demonstrate the superior performance of reduced-rank MMSE equalisation for the CDMA downlink. We show that, with perfect knowledge of the channel statistics, the MSWF requires only a small number of stages to achieve near full-rank MMSE performance over a wide range of signal-to-noise ratios (SNRs). The performance results are for the CDMA forward link with synchronous users, saturated loading, frequency selective fading, long code scrambling and employing two antennas at the mobile receiver.
The channel is assumed to be unvarying with time, which is generally true over a short time interval. We also assume that synchronisation with the basestation long code has been achieved.
24.2.2
Data and channel model
The channel model is shown in Figure 24.3. If the transmission is from only one basestation, the impulse response for the /th antenna channel between the transmitter and receiver (mobile station) is given by: AU-I
A1-(O = J^
h
aWPrc(t
i = 1,2
- TkX
(24.20)
where hCi [k] is the time-invariant complex gain associated with the kth multipath at the /th antenna. prc(t) is the composite chip waveform (including the matched lowpass filters on the transmit and receive end). The chip waveform is assumed to have a raised cosine spectrum. Nm is the total number of delayed paths, i.e. multipath arrivals, some of which may have zero or negligible power, so that the channel impulse response is sparse. The transmitted sum signal may be described as: Nu N5-I
sin] = cbs[n] J^ Y^
b
jMc№
(24.21)
~ Ncm]
j=\ m=0
where Cbs[ri] is the basestation-dependent long code, bj[m] is the bit/symbol sequence of the jth user, Cj[n] is the jth user's channel (short) code of length Nc, Nu is the total number of active users and N5 is the number of bit/symbols transmitted during a given time window. The signal received at the /th antenna (after convolving with a matched filter having a square-root raised cosine impulse response) is given by: xtif) = X > [ / i ] W - nTc) + I71-(O,
/ = 1,2
(24.22)
n
where Tc is the duration of one chip and rn(t) is a noise process assumed white and Gaussian prior to colouration by the receiver chip-pulse matched filter.
antenna 1
symbol estimate antenna 2
Figure 24.3
Chip-level MMSE equalisation for CDMA downlink with 1 basestation
24.2.3 Edge of cell/soft hand-off We consider the interference problem when the desired user is at the edge of a cell so that the total received signal at the mobile station is the sum of the contributions from two basestations, plus noise: Xf (t) = jcf } (0 + x- 2) (0 + r\i(t),
i = l,2 antennas at receiver
(24.23)
where x\ it) denotes the signal received at the /th antenna due to transmission from the /th basestation, and Jc1-(O denotes the total received signal at the zth antenna. At each antenna, we oversample the signal Jc1-(O to obtain r\i[n\ = xt(nTc) and
r2i[n]=Xi((Tc/2)+nTc). In the soft hand-off mode, the desired user's data is transmitted simultaneously from the two basestations. At the receiver, two equalisers are designed, one for each basestation. The output of each of the two chip-level equalisers is correlated with the desired user's channel code times the corresponding basestation's long code. These two symbol estimates are averaged to get the symbol estimate for the soft hand-off mode. The block diagram for the chip-level MMSE equaliser employing soft handoff is shown in Figure 24.4. In the normal mode, the second basestation is treated as interference.
24.2.4
Chip-level MMSE estimator
The chip-level MMSE equaliser is designed to minimise the mean squared error between the multiuser synchronous sum signal, s[n] and the sum of the equaliser outputs, as depicted in Figures 24.3 and 24.4. Given the orthogonality of the channel codes, an estimate of the symbol, bj[m] can be obtained via a correlate and sum with
basestation 1
antenna 1 antenna 2 symbol estimate basestation 2
Figure 24.4
Chip-level MMSE equalisation for CDMA downlink, two basestations, soft hand-off
Cj and the basestation-dependent long code at the output of the chip-level MMSE equaliser, once per symbol. Equation (24.23) can be written more compactly in vector form as: x[n] = H(1)s(1)[>z] + H ( 2 ) s ( 2 ) M + rf[n] = WsM + jy[n]
(24.24)
where H ^ is the 2Ng x (L H- Ng — 1) channel convolution matrix, Ng is the length of the equaliser and L is the length of the channel in chips: (24.25)
(24.26)
s(/) [n] is an (L+N g — 1) vector of the transmitted signal, with the superscript denoting the corresponding basestation. The corresponding formulae for only one transmitting basestation can be found by simply removing the term involving H ^ from equation (24.24). Krauss and Zoltowski [2] made some simplifying assumptions to derive a chiplevel MMSE equaliser that can be easily implemented. The sequence values for the multiuser sum signal are assumed to be HD random variables. Otherwise the covariance matrix of the sum signal s[n] is a complicated expression involving the Walsh-Hadamard spreading codes that vary from index to index. The HD assumption is valid if the (long) scrambling code is viewed as a random HD sequence and/or all users are active with equal power. With this assumption, the covariance matrix of the signal is £{s[n]^s[/i]} = or21 and the MMSE equaliser was shown to be: gf
= [o2snnH + -Rrmr1H(l)sDc
(24.27)
where SDC is a column vector of all zeros except 1 in the (Dc + l)th position, Dc is the combined delay of the equaliser and channel, aj is the signal power and "JZr1J1 = E[rjH[n]ri[n]]. In Reference 2 and this section, the delay D c , 0 < Dc < (Ng + L — 2) that yields the smallest MMSE is calculated using the actual channel statistics and that Dc is used in the simulations. Equation (24.27) has the form of the well known Wiener-Hopf solution: W = R x - 1 I^
(24.28)
where Rxx is the channel covariance matrix and r ^ is the cross-correlation vector.
In References 2 and 17, Krauss and Zoltowski showed that the MMSE significantly outperforms the Rake receiver - especially when a large number of channel codes are active relative to the spreading factor. The difference is more pronounced when soft hand-off is unavailable. In general, the length of an MMSE equaliser should be at least equal to the channel length to achieve the desired performance, and longer equalisers yield better error rates [18]. Hence, equalisers in the high-speed CDMA downlink will by necessity span many chips in length with a corresponding large number of degrees of freedom. In order to reduce the number of filter coefficients to be estimated, the received signal vector may be projected onto a lower dimensional subspace, and the Wiener filter given by equation (24.28) can be constrained to lie in this subspace. This increases the speed of convergence dramatically for adaptive methods, if the subspace is chosen properly. But the overall MMSE for the reduced-rank filter may be higher than the MMSE for the full-rank filter. The most widely known reduced-rank techniques in signal processing are the principal components method [11] and the cross spectral methods [11], both based on eigendecomposition of the channel covariance matrix.
24.2.5 Performance examples Our first set of results solves for chip-level MMSE equalisation when only one basestation is transmitting and finds the ideal MSWF solution after various stages, assuming R xx and Tdx are known (perfect channel estimation). The same simulations are also done for the edge of cell situation, when two basestations of equal power are received at the mobile station and the receiver uses soft hand-off. A wideband CDMA forward link was simulated similar to one of the options in the US cdma2000 proposal. The chip rate was 3.6864MHz (Tc = 0.27 |Jis), three times that of IS-95. Simulations were performed for a saturated cell, i.e. all 64 possible channel codes are active, and all users have equal power. The spreading factor was Nc = 64 chips per bit. The data symbols were BPSK, and spread with one of 64 Walsh-Hadamard codes. The signals were summed synchronously and then multiplied with a QPSK scrambling code of length 32 678, similar to the IS-95 standard. The channels were modelled to have four equal power multipaths, uniformly distributed within a delay spread of 10|xs (corresponding to about 37 chips). The multipath coefficients are complex normal, independent random variables. The arrival times at antenna 1 and 2 are the same, but the multipath coefficients are independent. Figure 24.5 shows a typical channel's impulse response, sampled at the chip rate. In the two basestation case, the maximum delay spread of the downlink channel from the second basestation is also 10|xs, with four dominant multipath arrivals at random. The channels are scaled so that the total energy from each of the two basestations is equal at the receiver. Specifically: (24.29)
normalised magnitude
antenna 1 antenna 2
time in chips
Figure 24.5
Typical channel impulse response for simulated model
SNR is defined to be the ratio of the sum of the average power of the received signals over all the channels, to the average noise power, after chip-matched filtering. Since the spreading factor (number of chips per symbol) is equal to the number of users, and each user contributes the same amount of power, this chip signal SNR is equal to the post-correlation (or despread) SNR per user per symbol. The curves were generated by averaging 100 or more different channels. Note that the abscissa in the graphs is the post-correlation SNR for each user which includes a processing gain of 101og(64) « 18dB. Figure 24.6 plots the mean square error for the different reduced-rank methods as a function of the subspace dimension, D. The channel statistics and noise power are assumed to be known. In the single basestation case, Figure 24.6a, the dimension of the full space is 114 (the equaliser length is 57 at each of two antennas, as multipath delay spread is 37 chips and the chip pulse waveform is cut off after five chips at both ends). The MSE for MSWF is seen to drop dramatically with D, and achieves the performance of the full-rank Wiener filter at dimension approximately 7! In contrast, the principal components method takes longer than twice the delay spread, and the cross-spectral method does only slightly better. Figure 24.7a displays the BER curves obtained with the MSWF for different sizes of the reduced-dimension subspace. For comparison, the BER for a conventional Rake and full-rank chip-level MMSE equaliser are also shown. The channel statistics are assumed to be known perfectly, so these curves serve as an informative upper bound on the performance. It is observed that even a two-stage reduced-rank filter outperforms the Rake at all and only a small number of stages of the MSWF are needed in order to achieve near full-rank MMSE performance over a practical range of SNRs.
mean squared error, dB
mean squared error, dB
multistage nested Wiener filter principal components cross-spectral components
normal hand-off
dimension of reduced-rank subspace forg c
dimension of reduced-rank subspace forg c
Figure 24.6
multistage nested Wiener filter principal components cross-spectral components
MSE versus rank of reduced dimension subspace (post-correlation SNR = WdB)
RAKE MSNWF, stage 2 MSNWF, stage 5 MSNWF, stage 10 exact MMSE, rank 114
Figure 24.7
average BER
average BER
soft hand-off
RAKE MSNWF, stage 5 MSNWF, stage 10 MSNWF, stage 20 MMSE, rank 4*57
BER for different chip-level equalisersfor CDMA downlink with known channels (note: typically these channels are estimated using known training sequences)
Figures 24.6b and 24.7b display similar plots, but for the edge of cell scenario. In this case, there are four effective channels at the receiver, because we sampled the received signal at twice the chip rate at each antenna. It can be shown that the two polyphase channels created from either antenna are nearly linearly dependent in the case of a sparse multipath channel as in our simulations. The dimension of the full space is 228 which makes the full-rank calculations very cumbersome. Amazingly, the MSE for MSWF still goes down very steeply with rank and achieves the full-rank value for a subspace dimension of only eight or so. Compared with the PC and CS methods, this is a huge difference in effective rank reduction. In the BER plots of Figure 24.7b, the bit error is calculated for the soft hand-off mode. With perfect channel estimation, the MSWF can achieve error rates similar to the full-rank MMSE over practical SNR range after stopping at stage as low as 5!
24.3 24.3.1
Application of MSWF to GPS jammer suppression Introduction
GPS is known to provide significant force enhancement capability. This force enhancement capability has been demonstrated in every US military operation since (and including) the 1991 Gulf War, but with this capability is a concern about the vulnerability of the GPS signal to jamming. The jamming threat is serious because of the physical design of the GPS system. The received power from the GPS satellites is approximately —157 dBW [31]. Many jammers available on the arms market today either already cover the GPS frequencies, or can be modified to do so. Therefore, in recent years there has been an effort to develop algorithms to suppress these various types of jammer. Current methods to mitigate jamming issues are primarily focused on temporal processing, spatial processing and DFT frequency excision techniques [39]. The Mayflower Company holds two patents [27, 34] illustrating the use of temporal processing and utilising the technique of preprocessing GPS signals. One of the patents illustrates an antijam temporal-based preprocessor that can be attached to a GPS receiver without modification to that receiver. It is stated as expected in their patent that the preprocessor is not effective against wideband RF interference. Papers such as References 21 to 25 present temporal processing methods to null out narrowband jammers in DSSS type systems. Spatial processing is very effective against cancelling wideband jammers. This type of processing makes use of antennas to generate diversity. Papers such as References 26, 29, 33 and 30 use such techniques to cancel both wideband and narrowband jammers. The drawbacks of the spatial processing alone is that you need a large number of elements in order to cancel numerous narrowband and wideband jammers. This motivated the need to develop space-time antijam algorithms that can effectively null both wideband and narrowband jammers simultaneously. References 28 and 35 to 38 describe the advantages of using spatial and temporal processing jointly for nulling jammers. The disadvantages of space-time processing is that the associated computation and processing speed requirements are substantially greater than those needed for spatial or temporal processing alone. In this section, an innovative antijam (AJ) filter is presented that circumvents the disadvantages of space-time processing by operating in a reduced dimension based on the MSWF. The filter also utilises the preprocessing technique so that the antijam filter can be used with a GPS receiver without modification to that receiver. The already patented temporal preprocessing filter by Mayflower Company motivates the feasibility of using the preprocessing technique in the presented space-time filter approach.
24.3.2 Power minimisation and joint space-time preprocessing In the joint processing approach, each sample value input to the GPS receiver is formed from a linear combination of samples across both space and time. The spacetime weights are realised through a tapped delay line behind each digitised baseband antenna, as shown in Figure 24.8. The output of the preprocessor is then fed to a
baseband I & Q demod
baseband I & Q demod
GPS receiver
baseband I & Q demod
Figure 24.8
Power minimisation-based joint space-time preprocessor
standard digital GPS receiver. The goal of the preprocessor is to suppress jammers as well as possible while simultaneously passing as many undistorted GPS signals. Note that the antijam space-time filter will not be optimised for any one GPS satellite signal in terms of maximising the SINR. The advantage of this approach is that the antijam space-time filter remains a separate component so that a standard digital GPS receiver may be employed. The criterion for determining the optimal set of space-time weights is premised on the fact that the respective power levels of the desired GPS signals are significantly below the noise floor, as well as below the respective power levels of the potential jammers. The goal then is to drive the power of the preprocessor output down to the noise floor. This approach serves to place notches at the respective centre frequencies of strong narrowband interferers and spatial nulls in the respective directions of broadband interferers.
24.3.3
Space-time filter characteristics
In order for the GPS receiver to provide accurate navigation information, it is necessary to track the signals from at least four different GPS satellites. Given the parallax
error associated with GPS satellites at near-horizon relative to the aircraft, it is generally desirable to track the respective signals from a larger number of GPS satellites, e.g. 12. It is desired then that the preprocessor passes unaltered as many GPS signals as possible. Thus, the magnitude of the multi-dimensional Fourier transform of the space-time weights should be as flat as possible in the spectrum as a function of frequency and angular dimensions. The goal then is to simultaneously have a point-like null in the multi-dimensional spectrum at the frequency-angle coordinates of each strong narrowband interferer and a sharp line null (i.e. a spatial null across the entire frequency band) along the angular coordinates of each broadband interferer. To achieve the objective above, a space-time extension to the standard spatial power minimisation scheme using a null steering antenna [30, 29] is proposed. For analysis purposes, a standard linear array geometry consisting of a reference antenna and M — 1 equispaced auxiliary antennas is considered. The set of weights which form the beam whose data is sent to the GPS receiver is chosen to minimise the output power of the beamformer under the constraint that the weight for the first tap of the reference element be unity. In the case of strong interference, the space-only power minimisation scheme [30, 29] attempts to place spatial nulls in the direction of each strong interferer, whether it is narrowband or broadband. The maximum number of interferers that can be spatially nulled is M — 1, where M is the number of antennas. Of course, if two interfering sources are relatively closely spaced in angle, a single null may be sufficient to cancel them. If there is a mixture of narrowband and broadband interferers, however, space-time processing offers the capability of cancelling more interferers than does space-only processing. Cancellation of a narrowband interferer with space-time processing only requires a point null in the multi-dimensional Fourier transform at the frequency-angle coordinates of the interferer. If there are N taps per each of M antennas as shown in Figure 24.8, up to NM — 1 such point nulls may be effected for the purpose of cancelling narrowband interferers. Space-time processing also averts an unnecessary spatial null in the direction of a narrowband interferer which may potentially blot out an angular sector of space where a GPS satellite lies. The disadvantage of space-time processing is that the associated computation and processing speed requirements are substantially greater than those needed for space-only preprocessing. To decrease the required computation and processing speed, a reduced complexity implementation of the space-time filter based on the MSWF is developed.
24.3.4 Data and channel model In a direct space-time extension of space-only power minimisation, the space-time weights are determined by minimising the output power of the space-time preprocessor under the constraint that the value of the first tap of the tapped delay line behind the reference element be unity. In the special case where there are no jammers and the receiver generated noise is uncorrelated across both time and space, power minimisation will asymptotically force the values of all the other space-time weights to zero since the preprocessor output power under these conditions will be the noise
power at the reference antenna (the GPS signals are significantly below the noise floor). That is, the space-time weights converge to a multi-dimensional Kronecker delta function. Note that the magnitude of the multi-dimensional discrete Fourier transform of a Kronecker delta function is perfectly flat along both the frequency and angular dimensions. This is a desirable feature as under these very special conditions we would like the space-time preprocessor to pass unaltered all the GPS signals in the field of view. Referring to Figure 24.8, \m(n) is an N x 1 vector containing N successive samples of the output of the mth antenna sampled at a rate above or equal to the Nyquist rate for the P(Y) code: xm(«) = [xm(n),xm(n - 1),... ,xm(n - N + l ) f The NM x 1 space-time snapshot, x(n), is formed from concatenating m = 1,2,..., M as:
(24.30) xm(n),
(24.31)
Similarly, the N tap weights for the mth antenna are placed as the components of an NxI vector as: hm = [A 1n (O),MD, • • • >hm(N - l ) f ,
m = 1,2,..., M
(24.32)
and the entire set of space-time weights is formed from a concatenation of h m , m = 1,.. . , M a s :
(24.33)
The output power of the space-time preprocessor is: E{ \hHx(n) | 2 } = h H Kh (24.34) where K = E{x(n)xH (n)}. Assume that the first antenna of the linear array is the reference antenna. To incorporate the unity weight constraint on the first tap of the reference antenna, define x(n) as the (NM - l ) x l subvector of x(n) containing all but the first element of x(n). Similarly, hx is defined as the (NM — 1) x 1 subvector of h containing all but the first element of h: (24.35) (24.36)
With these definitions, the power at the preprocessor output may be expressed as: E{\hHx(n)\2} = E{\xx(n) + hf x(n)\2}
(24.37)
Expressing the preprocessor output power in this fashion facilitates an adaptive filtering formulation where the output of the first tap of the reference antenna serves as the desired signal and the error signal is x\(n) + h f x(n). As a result, LMS and/or RLS-based adaptations are possible, as developed previously for the case of space-only processing [29]. To determine the asymptotic form of the optimal space-time weights, partition the NM x NM space-time correlation matrix as follows:
The output power of the preprocessor may then be expressed as: E{\hHx(n)\2} = kxx(0) + k £ h x + h f k ^ + h H K x x h
(24.38)
Since K xx is positive definite, it follows that the optimal solution is h x = - K x ^ k J x and the minimum output power of the preprocessor is kxx(0) — k ^ K ^ k ^ . One practical approach using a finite number of samples and batch mode processing over L samples at each antenna, estimates Kxx and kj x by: (24.39) Recall that x(n) contains all the elements of x(n) except for the first element corresponding to x\ (n).
24.3.5 Dimensionality reduction techniques To reduce the dimensionality of the associated space-time preprocessor, rankreduction techniques such as the cross-spectral metric or principal components could be used. A brief overview of these methods is therefore necessary to motivate the use of the MSWF. The matrix K xx can be spectrally decomposed as Kx* = Yli=z\~ ^/e*ef\ where A./ are the eigenvalues of Kxx indexed in descending order and e/ are the corresponding eigenvectors. One can then seek dimensionality reduction through the transformation y(n) = T f x(ra), where Tx = [e/(i)ie/(2)i • • • '-Vi(K)] is an (NM — 1) x K matrix containing K < NM eigenvectors of Kxx and {/(1), i(2),..., i(K)} is a subset of the integers {1,2,..., NM]. Recall that the minimum power at the preprocessor output is ^ ( 0 ) — k ^ K ^ k ^ ^ . The minimum power in the reduced dimension subspace is kxx (0) - k^.T x (Tf K x ^T x ) ~ 1TxKix. Given that the columns of T x are eigenvectors of K xx , it follows that the K < NM eigenvectors of K xx comprising T x can be selected as those which maximise the cross-spectral
metric [32] defined as: (24.40) The principal components technique would instead select the K largest eigenvectors of Kxx to form T*. Both techniques are quite computationally intensive since it is necessary to generate the eigenvectors of K 0 ; before finding the reduced dimensioned matrix T x as well as compute (T^K 0 -T x )" 1 . 24.3.6
Performance examples
24.3.6.1 Linear arrays with adaptive MSWF Two scenarios are presented to illustrate the performance of the reduced-rank MSWF preprocessor in terms of nulling both wideband and narrowband jammers while operating in a reduced-rank mode. Consider M = N = I. These definitions imply an M = I element equispaced linear array with N = I taps at each antenna. We first constrain Zi1(O) = 1 so that x\(n) (first tap behind first antenna) is our reference signal. The other taps behind each antenna element form the column data vector x(n) entering the stages of the MSWF as illustrated in Figure 24.8. Table 24.2 summarises the values used in the first and second scenarios. Five of the six jammers for this simulation are narrowband jammers with different angles of arrival (AoA). In both scenarios, the narrowband jammers have different frequency offsets relative to the Ll frequency. Since we are assuming a 20MHz receiver bandwidth at each Table 24.2
Simulation parameters
Jammer type
SNR, (dBW)
AoA scenario 1
AoA scenario 2
Bandwidth, (MHz)
Wideband Wideband Wideband Wideband Wideband
-100 -110 -100 -100 -110
20° n/a n/a n/a n/a
20° 0° -20° -40° -60°
20 20 20 20 20
Jammer type
SNR, (dBW)
AoA
AoA
Frequency, (MHz)
Narrowband Narrowband Narrowband Narrowband Narrowband
-100 -100 -100 -100 -110
60° 15° -10° -30° -55°
60° n/a n/a n/a n/a
-10 -5 0 5 10
antenna, the noise floor was determined to be approximately — 128dBW after filtering at each antenna. There are four satellites in the field of view each generating its own 1023 length C/A code. All satellite C/A codes are assumed to be received at-157.6dBW. 24.3.6.2 Reduced-rank performance Figures 24.9 and 24.10 indicate the impressive power minimisation of the MSWF as a function of rank. Observe the exceptional power minimisation performance of the MSWF after stopping at only the 10th and 6th stages, respectively. Both Figures illustrate that the principal components (PC) and cross-spectral (CS) metric are unable to effectively null the jammers at lower ranks. Even when there are many wideband jammers, the MSWF performance is exceptional. The eigenvector-based methods do not account for the information in kj x and thus require a higher rank to completely null the jammers. Note that the MSWF is able to null the jammers effectively at low ranks with the added advantage of not requiring the computation of eigenvectors or estimates of K or its inverse. Figures 24.11 and 24.12 indicate that the MSWF in fact generated a point null and a line null for the narrowband and wideband jammers, respectively. The full-rank Wiener filter labelled as 'full rank' in both Figures is the optimal solution relative to an infinite amount of sample support before covariance matrix inversion. The significance of all three algorithms operating in a
noise floor full rank sol. MSNWF PC CS
power, dBW
5 NB and 1 WB
rank
Figure 24.9
Power output versus rank (scenario 1)
power, dBW
noise floor full rank sol. MSNWF PC CS
Figure 24.10
Power output versus rank (scenario 2)
baseband frequency, MHz
2D FFT contour
theta, angle of arrival
Figure 24.11 Nulling performance of MSWF (scenario 1)
baseband frequency, MHz
2D FFT contour
theta, angle of arrival
Figure 24.12
Nulling performance of MSWF (scenario 2)
lower dimension (i.e. at lower ranks) as illustrated in Figures 24.9 and 24.10 reveals the amount of subspace compression achieved relative to the full-rank solution. The level of subspace compression can be related to how well a particular algorithm is able to suppress interference that appears non-stationary to a standard Wiener filter.
24.4
Summary of concepts involving reduced-rank filtering
We presented reduced-rank chip-level MMSE equalisers for the CDMA downlink with frequency-selective multipath based on the MSWF, for a known channel case and also for a training-based adaptation. The performance for the single basestation case, and for the edge-of-cell scenario with soft hand-off are very satisfactory. The convergence rate for MSWF operating in a very low-rank subspace was significantly better than LMS, and somewhat better than RLS. The BER performance showed improvement over the full-rank methods for practical SNR range. This excellent performance is achieved at a computational complexity in between LMS and RLS due to the lattice-type structure which allows block-adaptive implementation through simple filtering operations. A space-time preprocessor algorithm was also presented based on the MSWF. A closed-form expression for the Wiener-Hopf weights based on the MSWF was derived which provides insight into the dependency of the Wiener-Hopf filter weights on the number of stages and blocking matrices at each stage. The
preprocessor was shown to effectively null out both narrowband and wideband jammers while operating in a reduced-rank mode and minimising computational complexity.
References 1 PROAKIS, J. G.: 'Digital communications' (McGraw-Hill, New York City, NY, 1995, 3rd edn.) 2 KRAUSS, T. R, ZOLTOWSKI, M. D., and LEUS, G.: 'Simple MMSE equalizers for CDMA downlink to restore chip sequence: comparison to zero-forcing and rake'. Presented at international conference on Acoustics, speech and signal processing, Istanbul, Turkey, 5-9 June 2000 3 CHOWDURY, S. and ZOLTOWSKI, M. D.: 'Adaptive MMSE equalization for wideband CDMA forward link with time-varying frequency selective channels'. Presented at international conference on Acoustics, speech and signal processing, Orlando, Florida, 13-17 May 2002 4 FRANK, C. D. and VISOTSKY, E.: 'Adaptive interference suppression for direct-sequence CDMA systems with long spreading codes'. Proceedings of 36th Allerton conference on Communication, control, and computing, Monticello, IL, 23-25 September 1998 5 GHAURI, I. and SLOCK, D. T. M.: 'Linear receivers for the DS-CDMA downlink exploiting orthogonality of spreading sequences'. Proceedings of 32nd Asilomar conference on Signals, systems, and computers, Pacific Grove, CA, November 1998 6 HOOLI, K., LATVA-AHO, M., and JUNTTI, M.: 'Multiple access interference suppression with linear chip equalizers in WCDMA downlink receivers'. Proceedings of Global telecomm. conference, Rio de Janero, Brazil, December 1999 7 ZOLTOWSKI, M. D. and KRAUSS, T. P: 'Two-channel zero forcing equalization on CDMA forward link: trade-offs between multi-user access interference and diversity gains'. Proceedings of 33rd Asilomar conference on Signals, systems and computing, Pacific Grove, CA, 25-27 October 1998 8 GOLDSTEIN, J. S., REED, I. S., and SCHARF, L. L.: 'A multistage representation of the Wiener filter based on orthogonal projections', IEEE Trans. Inf. Theory, November 1998, 44, (7), pp. 2943-2959 9 GOLDSTEIN, J. S. and REED, I. S.: 'A new method of Wiener filtering and its application to interference mitigation for communications'. Proceedings of IEEE MILCOM, Monterey, CA, November 1997 10 GOLDSTEIN, J. S., GUERCI, J. R., and REED, I. S.: 'An optimal generalized theory of signal representation'. Proceedings of IEEE ICASSP, Phoenix, AZ, March 1999 11 GOLDSTEIN, J. S. and REED, I. S.: 'Reduced-rank adaptive filtering', IEEE Trans. Signal Process., February 1997, 43, (2), pp. 492^96
12 KANSAL, A., BATALAMA, S. N., and PADOS, D. A.: 'Adaptive maximum SINR rake filtering for DS/CDMA multipath fading channels', IEEEJ. SeI. Areas Commun., December 1998,16, pp. 1831-1839 13 HONIG, M. L. and GOLDSTEIN, J. S.: 'Adaptive reduced-rank residual correlation algorithms for DS-CDMA interference suppression'. Proceedings of 32nd Asilomar conference on Signals, systems and computing, Pacific Grove, CA, November 1998 14 HONIG, M. L. and XIAO, W.: 'Large system performance of reduced-rank linear interference suppression for DS-CDMA'. Proceedings of Allerton conference on Comm., control and computing, UIUC, October 1999 15 MYRICK, W. L., ZOLTOWSKI, M. D., and GOLDSTEIN, J. S.: 'Anti-jam space-time preprocessor for GPS based on multistage nested Wiener filter'. Presented at IEEE Military communications (MILCOM '99), Atlantic City, NJ, 3-6 October 1999 16 MYRICK, W. L., ZOLTOWSKI, M. D., and GOLDSTEIN, J. S.: 'Low-sample performance of reduced-rank power minimization based jammer suppression for GPS'. Presented at IEEE 6th international symposium on Spread-spectrum Techniques and applications, New Jersey, 6-8 September 2000 17 KRAUSS, T. P. and ZOLTOWSKI, M. D.: 'MMSE equalization under conditions of soft hand-off. Presented at IEEE sixth international symposium on Spread spectrum techniques & applications (ISSSTA 2000), NJIT, New Jersey, 6-8 September 2000 18 KRAUSS,T.P.,HILLERY,W.J.,andZOLTOWSKI,M.D.: 'MMSEequalization for forward link in 3G CDMA: symbol-level versus chip-level'. Proceedings of the tenth IEEE workshop on Statistical signal and array processing (SSAP 2000), Pocono Manor, PA, 14-16 August 2000 19 JOHAM, M. and ZOLTOWSKI, M. D.: 'Interpretation of the multi-stage nested Wiener filter in the Krylov subspace framework'. Technical report TR-ECE-00-51, Purdue University, W. Lafayette, IN, November 2000 20 RICKS, D. and GOLDSTEIN, J. S.: 'Efficient architectures for implementing adaptive algorithms'. Presented at Allerton Antenna arrays symposium, UIUC, 20-22 September 2000 21 HSU, F. M. and GIORDANO, A. A.: 'Digital whitening techniques for improving spread spectrum communications performance in the presence of narrowband jamming and interference', IEEE Trans. Commun., February 1978, COM-26, (2), pp. 209-216 22 RODGERS, W. E. and COMPTON, R. T.: 'Adaptive array bandwidth with tapped delay-line processing', IEEE Trans. Aerosp. Electron. Syst., January 1979, AES-15,(1), pp. 21-28 23 KETCHUM, J. W. and PROAKIS, J. G.: 'Adaptive algorithms for estimating and suppressing narrow-band interference in PN spread-spectrum systems', IEEE Trans. Commun., May 1982, COM-30, pp. 1169-1177 24 MASRY, E.: 'Closed-form analytical results for the rejection of narrow-band interference in PN spread-spectrum systems - Part I: linear prediction filters', IEEE Trans. Commun., August 1984, COM-33, pp. 10-19
25 MASRY, E.: 'Closed-form analytical results for the rejection of narrow-band interference in PN spread-spectrum systems - Part II: linear interpolation filters', IEEE Trans. Commun., January 1985, COM-32, pp. 888-896 26 LO, K. W. and VU, T. B.: 'Directional constrained adaptive array using symmetric amplitude control', IEEProc, April 1989,136, pp. 90-97 27 DIMOS, G. and UPADHYAY, T. N.: 'Digital adaptive transversal filter for spread spectrum receivers'. United States patent 5,268,927 to Mayflower Communications, December 1993 28 FANTE, R. L. and TORRES, J. A.: 'Cancellation of diffuse jammer multipath by an airborne adaptive radar', IEEE Trans. Aerosp. Electron. SySt., April 1995, 31, pp. 805-820 29 GECAN, A. S. and ZOLTOWSKI, M. D.: 'Power minimization techniques for GPS null steering antennas'. Institute of Navigation (ION) conference, Palm Springs, CA, 13-15 September 1995 30 ZOLTOWSKI, M. D. and GECAN, A. S.: 'Advanced adaptive null steering concepts for GPS'. Milcom '95, 5-8 November 1995, 3, pp. 1214-1218 31 KAPLAN, E. D. (Ed.): 'Understanding GPS principles and applications' (Artech House, Norwood, MA, 1996) 32 SCOTT GOLDSTEIN, J., REED, I. S., and SMITH, R. N.: 'Low-complexity subspace selection for partial adaptivity'. Milcom '96, 21-24 October 1996, 2, pp. 597-601 33 RAMOS, J., ZOLTOWSKI, M., and BURGOS, M.: 'Robust blind adaptive array: a prototype for GPS'. Digest of 1996 IEEE international symposium on Phased array systems and technology, Boston, MA, 15-18 October 1996, pp. 406-409 34 DIMOS, G. and UPADHYAY, T. N.: 'Standalone canceller of narrowband interference for spread spectrum receivers'. United States patent 5,596,600 to Mayflower Communications, January 1997 35 FANTE, R. L. and VACARRO, J. J.: 'Cancellation of jammers and jammer multipath in a GPS receiver', IEEE Aerosp. Electron. Syst. Mag., November 1998 36 MYRICK, W. L., ZOLTOWSKI, M. D., and SCOTT GOLDSTEIN, J.: 'Anti-jam space-time preprocessor for GPS based on multistage nested Wiener filter'. Milcom '99 37 MYRICK, W. L., ZOLTOWSKI, M. D., and SCOTT GOLDSTEIN, J.: 'Smoothing of space-time power minimization based preprocessor for GPS'. DASP '99 38 MYRICK, W. L., ZOLTOWSKI, M. D., and SCOTT GOLDSTEIN, J.: 'Interference suppression for CDMA via a space-time power minimization based preprocessor with applications to GPS'. Allerton '99 39 LI, C , HU, G., and LIU, M.: 'Narrow-band interference excision in spreadspectrum systems using self-orthogonalizing transform-domain adaptive filters', IEEE J. SeI Areas Commun., March 2000,18, pp. 403-406
Chapter 25
Introduction to space-time coding Sumeet Sandhu, Dhananjay Gore, Rohit Nabar and Arogyaswami Paulraj
25.1
Introduction
Optimal design and successful deployment of high-performance wireless networks present a number of technical challenges. These include regulatory limits on usable radiofrequency spectrum and a complex time-varying propagation environment affected by fading and multipath. In order to meet the growing demand for higher data rates at better quality of service (QoS) with fewer dropped connections, boldly innovative techniques that improve both spectral efficiency and link reliability are called for. Use of multiple antennas at the receiver and transmitter in a wireless network is a rapidly emerging technology that promises higher data rates at longer ranges without consuming extra bandwidth or transmit power. This technology, popularly known as smart antenna technology, offers a variety of leverages which if exploited correctly can enable multiplicative gains in network performance. Smart antenna technology provides a wide variety of options, ranging from singleinput, multiple-output (SIMO) architectures that collect more energy to improve the signal-to-noise ratio (SNR) at the receiver, to multiple-input, multiple-output (MIMO) architectures that open up multiple data pipes over a link. The number of inputs and outputs here refers to the number of antennas used at the transmitter and receiver, respectively. Figure 25.1 shows a typical MIMO system with Mt transmit antennas and Mr receive antennas. The space-time (S-T) modem at the transmitter (Tx) encodes and modulates the information bits to be conveyed to the receiver and maps the signals to be transmitted across space (Mt transmit antennas) and time. The S-T modem at the receiver (Rx) processes the signals received on each of the Mr receive antennas according to the transmitter's signalling strategy and demodulates and decodes the received signal.
transmit antenna array
receive antenna array
output bit stream
input bit stream space-time modem
Figure 25.1
space-time modem
Schematic of a MIMO communication system
Different smart antenna architectures provide different benefits which can be broadly classified as array gain, diversity gain, multiplexing gain and interference reduction. The signalling strategy at the transmitter and the corresponding processing at the receiver are designed based on link requirements (data rate, range, reliability etc.). For example, in order to increase the point-to-point spectral efficiency (in bits/s/Hz) between a transmitter and receiver, multiplexing gain is required which is provided by the MIMO architecture. The signalling strategy also depends on the availability of channel information at the transmitter. For example, MIMO does not require channel knowledge at the transmitter, although it enjoys improved performance if channel information is available. On the other hand, spatial division multiple access (SDMA) does require channel information at the transmitter which is used to increase the network throughput at the media access (MAC) layer. The advantage of point-to-multipoint SDMA over point-to-point MIMO is that SDMA deploys multiple antennas only at the cellular basestation or wireless local area network (LAN) access point, thus reducing the cost of the cellphone or network interface card (NIC). The basic smart antenna architectures are summarised in Table 25.1 along with different algorithms that can be implemented on each architecture. Each combination of algorithm and architecture provides a key differentiating advantage and the corresponding improvement in network performance. The baseline architecture used for comparison is single-input, single-output (SISO), i.e. Mt — Mr = 1, where Mt is the number of transmit antennas and Mr is the number of receive antennas. The newly introduced acronyms in Table 25.1 are as follows: cochannel interference (CCI) and spatial multiplexing (SM). Note that Table 25.1 assumes that in order to maintain low-cost analogue components, the maximum constellation size per transmit antenna cannot be increased when multiple antennas are added. It also assumes that Mr — 1 interferers are jointly nulled for CCI reduction, Mt users are simultaneously served by SDMA and Mt data streams are transmitted over an Mr x Mt MIMO link. Finally, the benefits listed are direct gains achievable with the smart antenna techniques listed in the left-most column. Depending on the channel conditions and adaptation algorithms implemented at the medium access control (MAC) and physical (PHY) layers,
Table 25.1 Key benefits (X) of different smart antenna architectures Metric
Max range and reliability
Max data rate/user
Max network throughput
gain over SISO
array gain/diversity gain
multiplexing gain
interference reduction
SIMO (Mr x 1): Rx diversity CCI nulling MISO (1 x Mt): Tx diversity beamforming SDMA (Mt users) MIMO (Mr x Mt): Tx/Rx diversity CCI nulling SDMA (Mt users) SM (Mt streams)
X X X X X X X X X
these direct benefits may trigger indirect cumulative gains such as improved network throughput. Note that although array gain, diversity gain and interference reduction are all provided by simple SIMO and MISO systems, multiplexing gain which is required to increase point-to-point throughput is only provided by MIMO systems. In fact, SIMO architectures can increase the network throughput only if the basestation uses SDMA technology. In the next few sections we will explore the subtleties of smart antenna gains in greater depth. Starting with a simple signal model, the basic smart antenna benefits, namely array gain, diversity gain, multiplexing gain and interference reduction, will then be discussed in greater detail.
25.2
Multiple antenna channel model
Consider a MIMO system with Mt transmit antennas and Mr receive antennas as shown in Figure 25.2. For simplicity we consider only flat fading, i.e. the fading is not frequency selective. When a continuous wave (CW) probing signal, s, is launched from the 7th transmit antenna, each of the Mr receive antennas sees a complexweighted version of the transmitted signal. We denote the signal received at the /th receive antenna by h^s, where hy is the channel response between the yth transmit antenna and the /th receive antenna. The vector [h\jli2j • • • hMRj]Tl is the signature 1
The superscript T stands for matrix transpose
Figure 25.2
A rray ga in
induced by the yth transmit antenna across the receive antenna array. It is convenient to denote the MIMO channel (H) in matrix notation as shown below:
(25.1)
The channel matrix H defines the input-output relation of the MIMO system and is also known as the channel transfer function. If a signal vector x = [x\X2 • • • XMt]T is launched from the transmit antenna array (XJ is launched from the yth transmit antenna) then the signal received at the receive antenna array, y = [y\y2 • • • yMr]T can be written as: y = Hx + v
(25.2)
where v is the Mr x 1 noise vector consisting of independent complex-Gaussiandistributed elements with zero mean and variance a\ (white noise). Note that the above channel matrix can be interpreted as a snapshot of the wireless channel at a particular frequency and at a specific instant of time. When there is rich multipath with a large delay spread, H varies as a function of frequency. Likewise, when the scatterers are mobile and there is a large Doppler spread, H varies as a function of time. With sufficient antenna separation at the transmit and receive arrays, the elements of the channel matrix H can be assumed to be independent, zero-mean, complexGaussian random variables (Rayleigh fading) with unit variance in sufficiently rich multipath. This model is popularly referred to as the HD Gaussian MIMO channel. In general, if antennas are separated by more than half the carrier wavelength (X/2) [1], the channel fades can be modelled as independent Gaussian random variables.
This point-to-point model can be extended to multiple users by indexing H as HM where HM is the Mr x Mx channel from the wth user to the receiver, as shown below:
Y = [H1...H1,] h i +v
(25.3)
where xu is the Mt x 1 signal transmitted from the uth user. This system model can be easily generalised to unequal numbers of transmit antennas at different users. In this chapter we will focus on the single user case.
25.3
Benefits of smart antenna technology
Equipped with the mathematical system description via equations (25.2) and (25.3), we will now describe different smart antenna gains in detail.
25.3.1 Array gain Consider a SIMO system with one transmit antenna and two receive antennas as shown in Figure 25.3. The two receive antennas see different versions, y\ and y2, of the same transmitted signal, x. The signals y\ and yi have different amplitudes and phases as determined by the propagation conditions. If the channel is known to the receiver, appropriate signal processing techniques can be applied to combine the signals y\ and y2 coherently so that the resultant power of the signal at the receiver is enhanced, leading to an improvement in signal quality. More specifically, the SNR at the output is equal to the sum of the SNR on the individual links. This result can be extended to systems with one transmit antenna and more than two receive
Figure 25.3
Receive diversity
antennas as follows:2 w*y = w*hx + w*v
(25.4)
where the optimal Mr x 1 linear receive filter is w = h, and the maximum SNR is proportional to the channel norm ||h|| 2 = J^m=I №™\2, where ||h|| 2 is the Frobenius norm.3 The average increase in receive signal power at the receiver = E || h || 2 is defined as array gain and is proportional to the number of receive antennas. Array gain can also be exploited in systems with multiple antennas at the transmitter by using beamforming. Extracting the maximum possible array gain in such systems requires channel knowledge at the transmitter, so that the signals may be optimally processed before transmission. An example of transmit beamforming for 1 x Mt MISO systems is shown below: y = h*(wjc) + u
(25.5)
The optimal normalised Mt x 1 transmit filter is w = h/||h||. Analogous to the SIMO case, the array gain in MISO systems with channel knowledge at the transmitter is equal to E||h|| 2 and is proportional to the number of transmit antennas. The array gain in MIMO systems depends on the number of transmit and receive antennas and is a function of the dominant singular value of the channel.
25.3.2
Diversity gain
Signal power in a wireless channel fluctuates (or fades) with time/frequency/space. When the signal power drops dramatically, the channel is said to be in a fade. Diversity is used in wireless systems to combat fading. The basic principle behind diversity is to provide the receiver with several looks at the transmitted signal over independently fading links (or diversity branches). As the number of diversity branches increases, the probability that at any instant of time one or more branch is not in a fade increases. Thus diversity helps stabilise a wireless link. Diversity is available in SISO links in the form of time or frequency diversity. The use of time or frequency diversity in SISO systems often incurs a penalty in data rate due to the utilisation of time or bandwidth to introduce redundancy. The introduction of multiple antennas at the transmitter and/or receiver provides spatial diversity, the use of which does not incur a penalty in the data rate while providing the array gain advantage discussed earlier. In this chapter we are concerned with this form of diversity. There are two forms of spatial diversity - receive and transmit diversity. Receive diversity applies to systems with multiple antennas only at the receiver (SIMO systems) [2]. Figure 25.3 illustrates a system with receive diversity. Signal x is transmitted from a single antenna at the transmitter. The two receive antennas see independently faded versions, y\ and V2, of the transmitted signal, x. The receiver
3
The superscript * stands for conjugate transpose The Frobenius norm of a matrix A is defined to be ||A|| 2 = J^i j l a i j l 2
combines these signals using appropriate signal processing techniques so that the resultant signal exhibits much reduced amplitude variability (fading) as compared with either y\ or V2- The amplitude variability can be further reduced by adding more antennas to the receiver. The diversity in a system is characterised by the number of independently fading diversity branches, also known as the diversity order. The diversity order of the system in Figure 25.3 is two and in general is equal to the number of receive antennas, M r , in a SIMO system. Transmit diversity is applicable when multiple antennas are used at the transmitter and has become an active area for research since the early 1990s [3, 16, 4]. Extracting diversity in such systems does not necessarily require channel knowledge at the transmitter. However, suitable design of the transmitted signal is required to extract diversity. Space-time coding [5, 6] is a powerful transmit diversity technique that relies on coding across space (transmit antennas) and time to extract diversity. Figure 25.4 shows a generic transmit diversity scheme for a system with two transmit antennas and one receive antenna. At the transmitter, signals x\ and X2 are derived from the original signal to be transmitted, x, such that the signal x can be recovered from either of the received signals y\\ or V21. The receiver combines the received signals in such a manner that the resultant output exhibits reduced fading when compared with yn or yn. The diversity order of this system is two and in general is equal to the number of transmit antennas, Mt, in a MISO system. Utilisation of diversity in MIMO systems requires a combination of receive and transmit diversity described above. A MIMO system consists of Mt x Mr SISO links. If the signals transmitted over each of these links experience independent fading, then the diversity order of the system is given by Mt x Mr. Thus the diversity order in a MIMO system scales linearly with the product of the number of receive and transmit antennas. Mathematically, diversity is defined to be equal to the slope of the symbol error rate (SER) versus SNR graph. This will be shown in greater detail in the following derivation.
Figure 25.4
Transmit diversity
The vector equation in (25.2) can be written as the following matrix equation: Y = HX + V
(25.6)
where the channel input X is an Mt x T codeword spanning T sample times, the channel output is the Mr x T matrix Y observed on Mr receive antennas over T sample times and the receiver noise is the Mr x T matrix V. Consider two Mt x T codewords X ^ and X ^ that are transmitted over Mt transmit antennas across T sample times. If X ^ was transmitted, the probability that X ^ ^ X ^ is detected for a given realisation of the channel H is equal to the following:
(25.7) where Q(JC) = / x °°dr(l/\/27r)exp(—t 1 /2) is the complementary error function, Dij = IH (X^ — X ^ ) | | F is the pairwise Euclidean distance at the receiver and SNR = Es/No is the ratio of the total transmitted signal power to the noise power per receive antenna. This conditional pairwise error probability (PEP) is a function of the channel realisation. Since the transmitter does not know the channel, the best it can do is optimise a criterion that takes channel statistics into account. One popular criterion is the average PEP, i.e. the average of the conditional PEP over channel statistics. It is difficult to compute the expectation of the expression in (25.7). A simpler alternative is to compute the average of a tight upper bound, in particular the Chernoff upper bound: (25.8) For the HD Gaussian channel, the average Chernoff bound simplifies to the following as derived in References 4 and 5:
(25.9)
where det is the determinant of a square matrix, {GI}\LX are the non-zero singular values of the difference matrix A^ = X ^ — X ^ and L is its rank. Taking the limit
at high SNR: (25.10) and taking the logarithm of both sides, we have: (25.11) Consider the logarithm of the PEP in equation (25.11). The right-hand side is clearly linear in the logarithms of SNR and the product of squared singular values of the difference matrix. In addition, the slope of the right-hand side is a product of the number of receive antennas and the rank of the difference matrix. The diversity gain of the space-time codebook is defined to be the minimum value of L over all pairs of codewords. For a given diversity gain, the coding gain is defined to be the minimum of the product (Wi= i af)^^L o v e r aU pairs of codewords. Performance of space-time codes is usually illustrated by plotting the SER versus SNR on a logarithmic scale. Since the PEP is closely related to SER, equation (25.11) is a good approximation to SER especially at high SNRs. Figure 25.5 illustrates the effect of each code metric on the SER curve. Diversity gain affects the asymptotic slope of the SER versus SNR graph - the greater the diversity, the faster the SER drops with SNR. Coding gain affects the horizontal shift of the graph - the greater the coding gain, the greater the shift to the left.
25.3.3 Multiplexing gain The key differentiating advantage of MIMO systems is practical throughput enhancement which is not provided by SIMO or MISO systems. We refer to this leverage
SER (log scale)
uncoded
coding gain
diversitKgain
SNR, dB
Figure 25.5
Diversity gain and coding gain
scatterers
Figure 25.6
Spatial multiplexing
as multiplexing gain and it can be realised through a technique known as spatial multiplexing [7, 8]. Figure 25.6 shows the basic principle of spatial multiplexing for a system with two transmit and two receive antennas. The symbol stream to be transmitted is split into two half-rate substreams and modulated to form the signals x\ and X2 that are transmitted simultaneously from separate antennas. Under favourable channel conditions, the spatial signatures of these signals (denoted by [ynyi2] r and [yixyiiV) induced at the receive antennas are well separated (ideally orthogonal). The receiver can then extract the two substreams, x\ and X2, which it combines to give the original symbol stream, x. This can be mathematically expressed as the theoretical channel capacity as derived in References 9 and 10. Channel capacity of the memoryless MIMO channel in equation (25.2) is defined to be the instantaneous mutual information which is a function of the channel realisation as follows: C| H - logdet (lMr + SNR HK x H*)
(25.12)
When the channel is square and orthogonal (HH* = I), then with an HD input distribution (Kx = (\/Mt)\Mt\ equation (25.12) reduces to: C± = Mt log (l + — SNR^
(25.13)
Hence M = M1 = Mr parallel channels are created within the same frequency bandwidth for no additional transmit power. Capacity scales linearly with number of antennas for increasing SNR, i.e. capacity increases by M bits/s/Hz for every 3 dB increase in SNR. In general, it can be shown that an orthogonal channel of the form described above maximises the Shannon capacity of a MIMO system. For the HD fading MIMO channel model described earlier, the channel realisations become approximately orthogonal when the number of antennas used is very large. When the number of transmit and receive antennas is not equal, Mt ^ Mr, the increase in capacity is limited by the minimum of Mt and Mr. This increase in channel capacity is called multiplexing gain. The capacity-maximising input X is a zero-mean, complex Gaussian vector with covariance Kx = EXX*. For example, when the channel H is fully known at the transmitter, C\u is maximised by Kx that waterpours power over the dominant singular values of H [9]. The conditional capacity can be achieved by coding over
Capacity, bits/s/Hz
SNRS, dB
Figure 25.7
Average channel capacity - HD matrix fading channels
longer and longer block lengths, assuming that the channel is time-invariant. In practice, the fading channel does vary with time, and a commonly used measure of rate in this case is the average channel capacity. Average capacity of the memoryless MIMO channel is defined to be the average mutual information described as follows: C = EHC,H = E H log det(I Mr + SNR HK x H*)
(25.14)
where the average is computed over the channel distribution function. It has been shown in Reference 9 that the HD input distribution is optimal for the HD Gaussian channel. Figure 25.7 shows the average capacity as a function of the SNR for the HD fading channel model for different MIMO configurations. It is clear that the average capacity increases with the number of antennas in the system. At very low SNR, the MIMO multiplexing gain is low, but it increases with increasing SNR becoming asymptotically constant.
25.3.4 Interference reduction In contrast to copper or optical fibre, the wireless medium is an unguided communication link as a result of which cochannel interference is a frequent problem due to the reuse of the frequency spectrum in wireless networks. CCI adds to the overall noise in the system and deteriorates performance. Figure 25.8 illustrates the general
desired signal
interference
Figure 25.8
Interference reduction
principle of interference reduction for a receiver with two antennas. Typically, the desired signal (s) and the interference (/) arrive at the receiver with well separated spatial signatures - [s\S2]T and [i\i2]T, respectively. The receiver can exploit the difference in spatial signatures to reduce the interference, thereby enhancing the signal-to-interference ratio (SIR). Interference reduction usually requires knowledge of the desired signal's spatial channel and benefits from knowledge of the interferers' spatial channels. Interference reduction can also be implemented at the transmitter via SDMA, where the goal is to enhance the signal power at the intended receiver and minimise the interference energy sent towards the cochannel users. Interference reduction allows the use of aggressive reuse factors and improves network capacity. Having discussed the key advantages of smart antenna technology we note that it may not be possible to exploit all the leverages simultaneously in a smart antenna system. This is because the spatial degrees of freedom are limited and engineering trade-offs must be made between each of the desired benefits. The optimal spatiotemporal signalling strategy is a function of the wireless channel properties and network requirements.
25.4
Background on space-time codes
As described in the previous section, MIMO systems promise much higher spectral efficiency than do SISO systems. MIMO systems can also be leveraged to improve the quality of transmission (reduce error rate). This section will focus on MIMO signalling schemes that assume perfect channel knowledge at the receiver and no channel knowledge at the transmitter.
STBC STBC + 4-state TCM Grimm 4-state Yan 4-state
SNR, dB
Figure 25.9
25.4.1
STBC STBC + 8-state TCM Grimm 8-state Yan 8-state
SNR, dB
One receive antenna a 4 state codes b 8 state codes
Space-time trellis codes
For a given number of transmit antennas, the code design objective from equation (25.11) is to construct the largest possible codebook with full diversity gain (L — Mt) and the maximum possible coding gain. A number of hand-crafted STTCs (space-time trellis codes) with full diversity gain were first provided in Reference 5. Full diversity codes with greater coding gain were then reported in Reference 13, where codes were found through exhaustive computer searches over a feedforward convolutional coding (FFC) generator. New codes were then presented in Reference 15 by searching for the codes with the best distance spectrum properties. The distance spectrum of a codebook counts how many pairs of codewords are located at a given product distance (defined as (Y\i= \ crf)l^L for the ij codeword pair in equation (25.11)). To provide some insight into all these codes, Figures 25.9 and 25.10 show the frame error rate for two transmit antennas over an HD Gaussian channel. As a reference, a linear space-time block code (STBC) from Reference 12 is also shown, with
STBC STBC + 4-state TCM Grimm 4-state Yan 4-state SNR, dB
Figure 25.10
STBC STBC+8-state TCM Grimm 8-state Yan 8-state SNR, dB
Three receive antennas a 4 state codes b 8 state codes
and without a concatenated AWGN trellis code (STBC + TCM). The STBC does not provide coding gain but does provide full diversity, which is of the order of 2Mr for both the STBC and the STTCs considered here. In Figure 25.9a note that STBC by itself performs slightly better than all the STTCs, even though it provides no coding gain. This is explained by the multidimensional structure of STBC, i.e. each input symbol is spread over two time samples which improves performance against AWGN. With one receive antenna, concatenated STBC performs significantly better than do all the four-state STTCs. The performance gap reduces with increasing receive antennas in Figure 25.10. With three receive antennas and eight trellis states, in fact, STTC-Yan outperforms STBC-TCM. This performance loss is explained by the well known capacity loss incurred by STBC [18], and will be addressed in greater detail in the next section. The performance gap between STBC-TCM and STTC is also less noticeable with eight state codes for all receive antennas. We conjecture that the distance spectrum of the concatenated scheme degrades in comparison with the distance spectrum of STTC with increasing number of TCM states. Although we will not directly address the distance spectra of space-time codes here, we will introduce a tighter error criterion
than the worst-case PEP which reflects the effects of the distance spectrum on error probability. Let us summarise the three main observations made so far. Space-time block codes which are linear in the input information symbols are simple to encode and decode. Since STBCs outperform STTCs for a small number of antennas, they are of great interest in such MIMO architectures. In general, the distance spectrum of the codebook is a better measure of error performance than the maximum pairwise error probability, and should be incorporated into code design metrics. Finally, linear block codes with the best error performance do not always demonstrate the best capacity performance.
25.4.2 Linear space-time block codes Since linear codes are easier to encode and decode, we will focus on the design of linear codes here. A linear code is defined as a set of codewords that are linear in the scalar input symbols. Complex valued Mt x T modulation matrices {A^ J^l 1 are used to spread the input information symbols over Mt T spatio-temporal dimensions. The real and imaginary part of each input symbol s^ is modulated separately with the matrices A^ and Ak+(K/2)- Define Xk = $isk and Xk+(K/2) = %Sk where 1 S k < K. The modulated matrices are summed to obtain the Mt x T codeword X as follows: (25.15) The number of modulation matrices K is usually upper bounded by the total number of spatio-temporal degrees of freedom 2MtT. When K < 2MtT, the modulation matrices can be designed to be orthogonal (as defined in equations (25.20) and (25.21)) in order to optimise error performance as shown in the next section. For optimal capacity performance, however, in general K = 2MtT and the modulation matrices are not orthogonal. In order to normalise transmit power, the modulation matrices are scaled as follows:
(25.16)
for different input constellations such as quadrature amplitude modulation (QAM), phase shift keying (PSK) and pulse amplitude modulation (PAM). Most of the current spatial modulation techniques can be interpreted as linear codes, the key exceptions being space-time trellis codes. Example 1 - spatial multiplexing: Spatial multiplexing [7], also called BLAST [10], is the simplest example of linear codes. Each incoming symbol is transmitted only once on one antenna and at one symbol time. The modulation matrices are Kronecker
delta matrices, i.e. each Mt xT matrix is equal to 1 in the ith row and the jth column and zero elsewhere. For example, for the case Mt = 2, T = 1, the modulation matrices are as follows:
Example 2 - orthogonal space-time block codes: Orthogonal space-time block codes are a special example of linear codes, where the codeword X is designed to be a unitary matrix. Such modulation matrices exist for limited values of K, Mt and T [6]. For example, the orthogonal block code for Mt = 2, T = 2 is the Alamouti code [12] consisting of four matrices shown below:
(25.17) Example 3 - delay diversity: Delay diversity [3, 16] is really a trellis code that can also be classified as a linear code. The Mt x T modulation matrices are proportional to [0/Mf0], where the order Mt identity matrix is shifted right by k — 1 columns in the &th modulation matrix. For any T > Mt, the total number of complex symbols thus encoded is Mt + T — 1. The case Mt = 2, T = 2 is shown below:
This framework can be extended to non-linear codes by modulating the modulation matrices with non-linear functions of the input symbols. For example, for K = 3, the following vector of input symbols leads to a non-linear space-time code:
This is similar to the analytic representation of TCM [17], except that the coefficients of the analytical expansion are now modulation matrices instead of scalars.
25.5
New design criteria
In this section we will first discuss design criteria for optimising error performance and capacity performance separately. Then we will bring the two together to design capacity-efficient codes that also provide good error performance.
25.5.1 Error performance Evaluating the distance spectrum of space-time codes is difficult. Optimising their distance spectrum is even harder, and possibly not very insightful. We propose a simple new criterion that efficiently incorporates the effects of the distance spectrum, i.e. the union bound on error probability. The union bound is an upper bound on average error probability, where the average is taken over the entire matrix constellation. Moreover, the union bound is a tighter upper bound than the worst case PEP. This follows because the union bound is an average whereas the worst case PEP is the maximum, and can be seen from the following inequalities: (25.18) for a constellation consisting of R equally likely codewords, where pt is the probability that the ith codeword was transmitted. Therefore it is advisable to minimise the scaled union bound, i.e. 1/(R — I)Pu, rather than the maximum PEP, i.e. max/y PEP. To provide a contrast we illustrate the two bounds in Figure 25.11 for the Alamouti code and spatial multiplexing.
union bound/(R-l) and maximum PEP
Alamouti code spatial multiplexing
SNR, dB
Figure 25.11
Union bound (solid) versus maximum PEP (dotted) (1 x2, 4 bits/s/Hz)
The union bound on error probability for linear codes can be averaged over the HD Gaussian channel and written as follows [19]: (25.19) where R is the size of the codebook (R = MK^2 for an input M-QAM constellation), and e^ = jc^ — xkJ is the difference of the input sequences x^ and x^ at the kth location. See Reference 19 for a detailed proof. Let the modulation matrices be unitary,4 i.e. A^A£ = IM, for Mt < T and A^A/: = IT for Mt > T. Then we have the following necessary and sufficient condition for code design. Theorem 1: A linear code consisting of wide unitary modulation matrices minimises PA if it satisfies equation (25.20): AkAf + AiAl = 0
for 1 < k ^ / < K
(25.20)
A linear code consisting of tall unitary modulation matrices minimises PA if it satisfies equation (25.21): A£A/ + A; A* = 0
forl
(25.21)
See Reference 19 for a detailed proof. This gives us the conditions required for optimal error performance. Note that these conditions precisely describe orthogonal space-time block codes [6]. 25.5.2
Capacity performance
We can rewrite the complex channel in equation (25.6) as the following real channel: y
= HAx + v
(25.22)
where y(2MrT x 1) is the channel output vector, Ti (2MrT x 2MtT) is the block diagonal channel, A (2MtT x 2K) is the linear code matrix to be designed, x (2K x 1) is a block of uncoded input symbols with each entry of power E5/2 and v (2MrT x 1) is the noise vector with each entry of power No/2. These quantities are defined as follows:5
4 Unitary modulation matrices achieve the matched filter bound, i.e. they minimise error probability when K — 1 5 vec(A) is the column-by-column vectorisation of matrix A, ® is the matrix Kronecker product and H is the entrywise complex conjugate of H
(25.23) and are all real. This reformulation cleanly separates the channel from the linear code, thereby enabling us to analytically maximise channel capacity. Average capacity of this input-output equation can be written as: C = Efl
^f
l0g det(l2M r + SNR HAATnT)
'
(25 24)
'
Theorem 2: In order to maximise the average channel capacity over all normalised linear codes A for any number of receive antennas, it is sufficient that the optimal linear code A satisfy the following factorisation: A=(ITQB)Q
(25.25)
where B is a (2Mt x r) factor of the input covariance /C^ = AAJ = BBT', and Q is any (rT x 2K) unitary matrix such that QQT = I (where rT < 2K). See Reference 19 for a detailed proof. If the channel is HD Gaussian, then average capacity is maximised by choosing /C = (l/Mt)l2MtT [9]. In this case, B is a unitary matrix and the optimal linear code is unitary. A code that maximises average channel capacity is called capacity-efficient. For the IID Gaussian channel, any code that satisfies AAT = 1/MfI is capacityefficient. Note that Theorem 2 is a sufficient condition to design capacity-efficient codes for any number of receive antennas. This condition is not necessary for limited cases, e.g. the Alamouti code does not satisfy Theorem 2 but is capacity-efficient for one receive antenna (it is not capacity-efficient for two or more receive antennas).
25.5.3
Unified design
In this section we will put together the criteria derived in the previous two sections to design new linear space-time block codes. First, define the following metrics for code design: •
Deviation from unitarity is defined as follows: (25.26)
•
where K is the condition number with respect to the spectral norm. When each of the modulation matrices is unitary, d\ — 0. Deviation from pairwise skew-Hermitianity is defined as follows: (25.27)
•
where \\.\\2F is the squared Frobenius norm. When all the modulation matrices are pairwise skew-Hermitian, di = 0, and when they also satisfy d\ = 0, the code is error-optimal. Deviation from capacity-efficiency is defined as follows: (25.28) When d?, = 0, the code is capacity-efficient for any number of receive antennas. For codes that are capacity-efficient for limited values of Mr, such as the Alamouti code with one receive antenna, it is possible that di, > 0.
Using these metrics, we will design capacity-efficient codes that also perform well with respect to error probability. Our method of code generation is via random search, which is explained in detail in the following. We will focus on the case of two transmit antennas and block length of two. The minimum number of modulation matrices required for capacity-efficient codes is K = 2MtT = 8. A large number of real, 8 x 8 random matrices with zero-mean, unit-variance Gaussian entries were generated, their singular value decompositions
BER
STBC SM LD random CE
SNR, dB
Figure 25.12
BER for two receive antennas, Mr = 2, Mt = 2, 8 bits/codeword
were computed, and their left singular matrix was extracted as a candidate for the code matrix A. Since the left singular matrix is orthogonal by definition, it is automatically capacity-efficient and J3 = 0. For each such code, its non-zero deviations d\ and di were evaluated. After evaluating all candidate codes, we chose a good code that had the lowest values of both d\ and dj. over the ensemble generated. The performance of this chosen code is demonstrated over the HD Gaussian channel by means of simulated BER in Figure 25.12 for two receive antennas and a data rate of four bits per sample time. The new code is labelled 'random CE' and is evaluated against other capacity-efficient codes such as spatial multiplexing (SM) and linear dispersion (LD). For reference we have also plotted the performance of the Alamouti code (STBC). With two receive antennas and a rate of eight bits per codeword, CE outperforms both LD and SM at high SNRs. At a BER of 10~5, CE is at an advantage of about 5 dB. It therefore has a much higher diversity than LD and SM, although not as much as STBC which outperforms all these codes at high enough SNRs. In Figure 25.13, with three receive antennas, CE outperforms STBC by about 1 dB at a BER of 1(T 3 . What this demonstrates is that all capacity-efficient codes don't perform equally well with respect to BER, and that the modulation matrices must also be pairwise
BER
STBC a 2 = 7/8 a 2 = 6/8 a 2 = 5/8 a 2 =4/8 LD random CE SM
SNR, dB
Figure 25.13
BER for three receive antennas Mr = 3, Mx — 2, 8 bits/codeword
skew-Hermitian and unitary in order to minimise BER. Note that, in general, the relative performance of these codes is a function of SNR, number of receive antennas and data rate, and a code that is the best for a given set of conditions may not be the best under different conditions. For reference we have reproduced the LD code [14]:
(25.29)
and our new CE code below:
(25.30)
What is interesting about this code is that the singular values of all modulation matrices are very similar and are listed in Table 25.2. This suggests that non-unitary modulation matrices may outperform unitary modulation matrices in some cases. The relative importance of different design metrics may be determined by formal minimisation of the average union bound on error probability and will not be addressed here.
Table 25.2
25.6
Singular values for random CE
Matrix
o\
02
A1 A2 A3 A4 A5 A6 A7 A8
0.6843 0.6847 0.6867 0.6642 0.6472 0.6691 0.6754 0.6358
0.1784 0.1769 0.1685 0.2428 0.2846 0.2288 0.2094 0.3094
K = o\ 102
3.84 3.87 4.08 2.74 2.27 2.93 3.23 2.06
Receiver design
For a general MIMO channel, the receiver receives a superposition of the transmitted signals and must separate the constituent signals based on channel knowledge. The method of spatial deconvolution determines the computational complexity of the receiver. This problem is similar in nature to the multiuser detection problem in CDMA and parallels can be drawn between the receiver architectures in these two areas. The signal design in this chapter assumed maximum likelihood (ML) decoding, which amounts to exhaustive comparisons of the received signal to all possible transmitted signals. This is computationally prohibitive for higher order constellations such as 64-QAM, which for example would require 642 = 4096 complex multiplications, Euclidean distance computations and comparisons for two transmit antennas. Although ML detection is optimal, receiver complexity grows exponentially with the number of transmit antennas making this scheme impractical. Lower complexity suboptimal receivers include the zero-forcing receiver (ZF) or the minimum mean-square error (MMSE) receiver, the design principles of which are similar to equalisation principles for SISO links with intersymbol interference (ISI). An attractive alternative to ZF and MMSE receivers is the vertical BLAST (V-BLAST) algorithm described in Reference 11, which is essentially a successive cancellation technique. An exciting new algorithm that yields ML-like performance with cubic instead of exponential complexity is the sphere decoding algorithm [20].
25.6.1 Modulation and coding for MIMO The signal design in this chapter did not include the effects of concatenated coding. MIMO technology is compatible with a wide variety of coding and modulation schemes. In general, the best performance in achieved by generalising standard (scalar) modulation and coding techniques to matrix channels. MIMO has been proposed for single-carrier (SC) modulation, direct-sequence code division multiple
access (DS-CDMA) and orthogonal frequency division multiplexing (OFDM) modulation techniques. MIMO has also been considered in conjunction with concatenated coding schemes. Application of turbo codes and low density parity codes to MIMO has recently generated a great deal of interest, as have simpler coding and interleaving techniques such as bit-interleaved coded modulation (BICM) along with iterative decoding. Inclusion of concatenated codes along with soft Viterbi decoding is in fact essential for realising the full diversity gain of practical MIMO systems.
25.7
Concluding remarks
Smart antenna wireless communication systems provide significant gains in terms of spectral efficiency and link reliability. These benefits translate to wireless networks in the form of improved coverage and capacity. MIMO communication theory is an emerging area and full of challenging problems. Some promising research areas in the field of MIMO technology include channel estimation, new coding and modulation schemes, low complexity receivers, MIMO channel modelling and multiuser SDMA network design. References 1 LEE, W. C. Y.: 'Mobile communications engineering' (Mc-Graw Hill, New York, 1982) 2 JAKES, W. C : 'Microwave mobile communications' (Wiley, New York, 1974) 3 WITTNEBEN, A.: 'Base station modulation diversity for digital SIMULCAST'. Proceedings of IEEE Vehicular Technology Conference: 'Gateway to the future technology in motion', St. Louis, USA, May 1991, pp. 848-853 4 GUEY, J., FITZ, M., BELL, M., and KUO, W.: 'Signal design for transmitter diversity wireless communication systems over Rayleigh fading channels'. Proceedings of IEEE Vehicular Technology Conference: 'Mobile technology for the human race', Atlanta, USA, May 1996, pp. 136-140 5 TAROKH, V, SESHADRI, N., and CALDERBANK, A. R.: 'Space-time codes for high data rate wireless communication: performance criterion and code construction', IEEE Trans. Inf. Theory, March 1998, 44, pp. 744-765 6 TAROKH, V, JAFARKHANI, H., and CALDERBANK, A. R.: 'Space-time block codes from orthogonal designs', IEEE Trans. Inf. Theory, July 1999, 45, pp.1456-1467 7 PAULRAJ, A. J. and KAILATH, T.: 'Increasing capacity in wireless broadcast systems using distributed transmission/directional reception', US patent 5,345,599 8 FOSCHINI, G. J.: 'Layered space-time architecture for wireless communication in a fading environment when using multi-element antennas', Bell Labs Tech. J., Autumn 1996,1, (2), pp. 41-59 9 TELATAR, I. E.: 'Capacity of multi-antenna gaussian channels'. Technical report #BLO 112170-950615-07TM, AT&T Bell Laboratories, 1995
10 FOSCHINI, G. J. and GANS, M. J.: 4On limits of wireless communications in a fading environment when using multiple antennas', Wirel. Per. Comm., 1998, 6, (3), pp. 311-335 11 GOLDEN, G. D., FOSCHINI, G. J., VALENZUELA, R. A., and WOLNIANSKY, P. W.: 'Detection algorithm and initial laboratory results using the V-BLAST space-time communication architecture', Electron. Lett., 1999, 35,(1), pp. 14-15 12 ALAMOUTI, S. M.: 4A simple transmit diversity technique for wireless communications', IEEEJ. SeI Areas Commun., October 1998,16, pp. 1451-1458 13 GRIMM, J.: Transmitter diversity code design for achieving full diversity on Rayleigh fading channels'. Ph.D. thesis, Purdue University, 1998 14 HASSIBI, B. and HOCHWALD, B. M.: 'High-rate codes that are linear in space and time'. IEEE Trans. Inf. Theory, 2002, 48, (7), pp. 1804-1824 15 YAN, Q. and BLUM, R. S.: 'Optimum space-time convolutional codes'. Proceedings of IEEE Wireless Communications and Networking Conference (WCNC) 2000, Chicago, USA, September 2000, pp. 1351-1355, volume 3 16 SESHADRI, N. and WINTERS, J. H.: 'Two signaling schemes for improving the error performance of frequency-division-duplex (FDD) transmission systems using transmitter antenna diversity'. Proceedings of IEEE Vehicular Technology Conference, Secaucus, USA, May 1993, pp. 508-511 17 BIGLIERI, E., DIVSALAR, D., McLANE, P. J., and SIMON, M. K.: 'Introduction to trellis-coded modulation with applications', 1991 18 SANDHU, S. and PAULRAJ, A.: 'Space-time block codes: a capacity perspective', IEEE Commun. Lett., 2000, 4, pp. 384-386 19 SANDHU, S.: 'Signal design for multiple-input multiple-output wireless: a unified perspective'. Ph.D. thesis, Stanford University, 2002 20 VITERBO, E. and BOUROS, J.: 'A universal lattice code decoder for fading channels', IEEE Trans. Inf. Theory, 1999, 45, pp. 1639-1642
Index Index terms
Links
3 3G systems
822
A across-track interferometry
185
active towed array sonar
712
medium frequency
727
very low frequency
727
adaptive beamforming
583
715
broadband
588
optimisation
583
space and time adaptation
590
spatial-only adaptation
589
592
subarrays
585
594
subband adaptation
590
adaptive coherence estimator algorithm
429
adaptive coherence estimator test
415
adaptive linear quadratic detector
432
adaptive matched filter test
415
adaptive sidelobe blanker
350
429
adaptive weight training
359
365
adaptivity
307
additive white Gaussian noise
896
affordability
143
Agrawal method
563
581
37
265
208
211
airborne early warning surveillance airborne moving target indication
727
215
This page has been reformatted by Knovel to provide easier navigation.
909
910
Index terms airborne radar 3
Links 6
D LS space-time processing
396
deterministic techniques
375
forward-looking
305
omnidirectional antenna arrays
149
phase and power-selected STAP training
367 73
sideways looking
361
376
131
149
501
422
430
12
heterogeneous clutter
SAR
37
93
5
see also MCARM airborne radar, manoeuvring diving
11
ground target tracking straight and level flight
467 9
see also forward-looking airborne radar see also SLAR airborne reconnaissance radar
149
airborne surveillance radar
123
MCARM
131
aircraft crabbing
167
aircraft pitch angle
430
Alamouti code
898
901
algorithms adaptive coherence estimator
429
baseline systolic
265
blind stochastic gradient descent
841
constant modulus
842
846
CORDIC
273
289
3
D LS
379
data domain
266
development
414
EM
733
277
736
This page has been reformatted by Knovel to provide easier navigation.
608
911
Index terms
Links
algorithms (Continued) IQRD
271
joint detection
791
lattice
269
MSWF
860
multifrequency
440
MVDR
268
post-Doppler STAP
364
power domain
266
recursive
734
robustness
428
SAGE
733
736
SC STAP
611
656
Shalvi-Weistein
847
sphere decoding
905
systolic
277
V-BLAST
905
vectorial lattice
271
Σ∆-STAP
125
along-track interferometry SAR probability of detection
87
742 739
95
185
194
429
431
195
AMF
421
427
amplitude heterogeneity
315
421
clutter discretes
315
clutter edges
322
range-angle varying clutter
320
angle-Doppler mismatch
337
antenna array errors
430
antenna arrays
149
array tilt effects
167
173
circular planar
157
159
circular ring
151
172
This page has been reformatted by Knovel to provide easier navigation.
912
Index terms
Links
antenna arrays (Continued) directivity patterns
164
displaced circular rings
156
172
linear
153
170
octagonal planar
160
172
omnidirectional
152
473
range ambiguity
165
subarrays
150
antenna coordinates
377
472
508
antenna design electronic steering
210
antenna pattern
549
broadband
551
antijam filter
871
antimodulation prefilters
435
ARMA model fitting
574
array gain
885
array interpolation
570
array tilt
167
173
autoregressive models
619
623
auxiliary sensor/echo processor
471
482
average channel capacity
893
AWACS
208
arrays: see antenna arrays 646
B bandpass filter errors
555
BASS-ALE spatial-spectrum estimation
575
Bayesian view
514
beam steering, motion-compensated
33
This page has been reformatted by Knovel to provide easier navigation.
913
Index terms
Links
beamforming
547
adaptive
583
digital
548
subarrays
548
BLAST
897
blind equalisation
841
super-exponential
853
847
blind stochastic gradient descent algorithms
841
blind velocities
245
broadband antenna pattern
551
broadband arrays
543
channel imperfections
553
jammer suppression
582
subarrays
548
superresolution
559
broadband band-limited signals
846
247
581
546
broadband data processing space and time
566
spatial-only
561
C calibration
144
capacity efficiency
902
CDMA
787
CDMA downlink
864
channel model
865
chip-level MMSE estimator
866
edge of cell
866
soft hand-off
866
cellular mobile radio system
785
channel mismatch
430
864
This page has been reformatted by Knovel to provide easier navigation.
914
Index terms
Links
Chernoff bound
890
chip-level MMSE equalisation
865
circular dipole
152
circular planar array circular ring array clutter angle-Doppler relationships clutter bowl
15
17
151 9 10
18
21
clutter discretes
315
350
360
clutter edges
322
341
37
307
amplitude
315
421
angle-Doppler mismatch
337
classes
312
CNR-induced spectral mismatch
327
real-world detection environments
418
site-specific
339
spectral
325
clutter heterogeneity
clutter internal motion
216
clutter mitigation, range ambiguous
247
long single pulse-encoded waveforms clutter modelling clutter scatterer
421
422
422
250 428 9
clutter suppression forward-looking radar
12
one-dimensional techniques
482
Σ∆-STAP
123
CNR-induced spectral mismatch
327
341
cochannel interference
884
893
code design
897
901
codebook
895
codes, concatenated
905
422
This page has been reformatted by Knovel to provide easier navigation.
23
915
Index terms
Links
codesign technology
276
codewords
895
coding gain
891
coherent integration
228
coherent processing interval
607
coherent subspace transformation
567
573
cold clutter
609
629
coloured loading
352
common image gather
776
common midpoint gather
761
770
common midpoint stack
767
772
common reflection point
767
common reflection surface stack
756
synthetic data example
773
traveltime
768
concatenated coding
905
conditional loss factor
665
897
766
777
conditional loss factor analysis multiple stochastic constraints
685
SC SAP
667
SCSTAP
678
single stochastic constraint
681
conjugate gradient method
382
constant false alarm rate
333
415
constant modulus algorithm
842
846
constraining detection schemes
434
constraint convoy tracking
8 494
echelon formation
498
march formation
497
CORDIC algorithm
265
512
289
This page has been reformatted by Knovel to provide easier navigation.
916
Index terms CORDIC processor covariance matrix
Links 273
291
7
28
errors
311
ICM
110
SCNR optimum processing
188
true hot clutter
633
velocity mismatch
114
covariance matrix tapers
351
438
Cramer-Rao bound
375
474
cross spectral method
868
875
cross-spectral metric
437
crow’s nest antenna
152
CRS attributes
775
CRS operator
771
295
477
479
D data tapering
590
data-dependent training techniques
344
decoding
905
delay diversity
898
delay spread
830
demodulation errors
553
deterministic target model
186
deterministic techniques
375
diagonal loading
349
difference beams
123
desired characteristics
135
digital acoustic telemetry system
833
digital communication links
834
dimension of the signal subspace
562
dimensionality reduction techniques
875
190 440 13
9
565
This page has been reformatted by Knovel to provide easier navigation.
917
Index terms dip moveout correction
Links 768
direct data domain least-squares algorithm degrees of freedom
394
main beam constraints
385
non-homogeneity detection
428
one dimension
379
STAP
387
direct uniform model
561
directivity patterns
473
displaced phase centre antenna
86
SAR
193
velocity mismatch
114
Σ∆-implementation
123
displaced phase centre antenna technique
5
566
570
94
108
178
213
distance spectrum
895
distributed target detection
424
diversity gain
808
885
888
Doppler ambiguity
240
488
513
GMTI data processing
524
Doppler blindness
501
506
Doppler centroid
74
89
92
238
241
362
89
93
Doppler frequency Doppler rate
442
Doppler spread
831
Doppler velocity resolution
251
Doppler warping
257
Doppler-domain localised processing
128
DSW
587
430
This page has been reformatted by Knovel to provide easier navigation.
918
Index terms
Links
E Earth’s rotation effects
216
echogram
724
729
automatic detection
726
730
image enhancement
726
730
eigencanceller
438
eigenvector projection method
193
eigenwaves
770
eikonal equation
757
elastodynamic wave equation
756
EM algorithm
733
736
experimental results
747
fast
741
749
recursive
742
749
equalisation
857
extended beam-augmented STAP
436
F factored approach STAP
133
factored signal processing
379
feedback loop white-noise-constrained method
705
filters, adaptive linear
38
filters, AR-based non-linear non-adaptive
56
filters, AR-based two-dimensional FIR
45
improvement factor
47
non-adaptive
57
non-linear combination of adaptive
61
performance
49
filters, non-linear combination of adaptive ARbased two-dimensional FIR MRP
441
61 61
This page has been reformatted by Knovel to provide easier navigation.
919
Index terms
Links
filters, non-linear combination of adaptive AR-based two-dimensional FIR (Continued) MTO
62
filters, non-linear combination of non-adaptive
51
AR-based
56
detection threshold
55
filter bank design
52
flat fading
799
fading-large angle spread
806
fading-small angle spread
806
no fading-small angle spread
807
force enhancement capability
871
forward-backward averaging
439
805
813
473
482
801
810
191
194
forward-looking airborne radar mainlobe clutter suppression
12
sidelobe clutter suppression
18
forward-looking array
470
frequency dependent signal subspaces
578
frequency domain processing MSAR frequency-selective fading ICI cancellation frequency-shift keying
85 787 810 832
G Gaussian assumption, deviation from Gaussian filter bank
431 54
Gaussian target model
187
generalised eigenrelation
429
generalised inner product
344
non-homogeneity detector
427
427
This page has been reformatted by Knovel to provide easier navigation.
920
Index terms generalised likelihood ratio generalised likelihood ratio test modified
Links 127
131
39
307
415
421
431
181
235
467
501
535
425
generalised sidelobe canceller configuration
585
geophones
758
GEOS-C
207
Givens rotations
267
286
global positioning system: see GPS GMTI
208
signal processing
212
GMTI radar systems
178
coordinate systems
507
data processing
514
idealised scenario
502
road map information
502
506
528
sensor data fusion
507
521
536
sensor model
506
510
534
simulation
533
GPS jammer suppression
871
power minimisation
871
preprocessing
871
reduced-rank MSWF preprocessor
876
space-time filter
872
877
Gram Schmidt
267
ground clutter
278
281
413
ground target tracking
467
474
501
convoy detection
494
see also GMTI ground-based radars
285
guard bands
424
This page has been reformatted by Knovel to provide easier navigation.
921
Index terms
Links
H heterogeneous clutter
305
post-Doppler STAP algorithms
359
STAP techniques
344
taxonomy
314
364
see also clutter heterogeneity HF OTHR radar
608
frequency modulated continuous waveform
648
pulse-waveform
624
HF skywave radars
604
ionosphere
604
sea clutter
613
HF surface-wave radars sea clutter
606
610
609
624
624
613
high range resolution radar
419
hot clutter
609
covariance matrix
633
ionospherically propagated
627
models
627
simulation
630
hot clutter mitigation
603
hot-clutter-to-cold-clutter ratio
659
hot-clutter-to-noise ratio
615
Householder reflections
267
hydrocarbon industry
755
758
hydrophones
734
736
758
hypothesis testing problem
417
284
311
829
I improvement factor optimum processor
281 469
This page has been reformatted by Knovel to provide easier navigation.
922
Index terms intercell interference
Links 804
non-resolvable
816
resolvable
815
813
interference reduction
885
interference suppression
857
internal clutter motion
107
110
intersymbol interference
830
833
intracell and intercell interference cancellation
813
flat fading
813
frequency-selective fading
819
intracell interference cancellation
805
frequency-selective fading
810
intrinsic clutter motion
422
invariance tests
416
inverse QRD
265
platform motion
864
804
flat fading
jammer rejection
893
271
27 28
31
broadband
543
582
GPS
871
two-stage processors
434
jammer suppression
jammers
217
coherent repeater
435
sonar
718
jamming
280
with clutter
420
433
283
414
492
492
JDL-GLR processor
441
joint detection algorithms
791
frequency domain
793
MMSE-BLE
793
796
This page has been reformatted by Knovel to provide easier navigation.
923
Index terms
Links
joint detection algorithms (Continued) performance
797
ZF-BLE
792
796
39
379
joint domain localised technique clutter cancellation
42
interfering targets
43
MSAR
84
Joint-STARS
94
177
208
K KASSPER
428
knowledge-aided STAP
352
knowledge-based signal processing
428
Kronecker delta function
874
Kronecker matrix product
7
L land-sea clutter
278
lattice filter
861
lattice processing architecture
269
lattice processor
288
lean matrix conversion
585
likelihood function
515
limited sample support
439
linear codes
897
linear dispersion
903
live data processing
277
data files
278
performance evaluation
280
systolic algorithm
277
This page has been reformatted by Knovel to provide easier navigation.
924
Index terms
Links
loaded SMI
191
SC STAP
678
long single pulse phase-encoded waveforms
668
673
250
ambiguity surface
252
integrated sidelobe clutter level
254
properties
252
STAP simulations
257
loss factor
665
low-earth orbit
236
low-flying targets
525
M map-dependent training selection
348
matched field processing
703
matched filtering
101
maximum likelihood decoding
905
maximum likelihood estimation
307
two-dimensional AR parameters
69
maximum likelihood processing
215
733
MCARM
131
339
heterogeneous clutter
339
STAP techniques
345
mechanical steering
244
media access layer
884
median test output detector
62
MIMD parallel processors
272
MIMO communication system
883
modulation and coding
905
multiple antenna channel model
885
receiver design
905
signalling schemes
894
888
891
906
This page has been reformatted by Knovel to provide easier navigation.
925
Index terms
Links
minimal sample support STAP
348
minimum detectable velocity
211
240
506
513
GMTI modelling minimum mean square error minimum residual power detector
310
363
704
707
857 61
minimum variance distortionless response
265
268
MISO
885
888
MMSE equaliser
866
MMSE receiver
905
MMSE-BLE
793
796
mobile radio communications
785
791
864
modified GLRT
425 130
415
76
186
474
probability of detection
478
481
SNIR
476
480
target stop
480
modified sample matrix inverse detector
38
modified sample matrix inversion
127
modulation schemes
905
Moore’s law
275
moving target detection and imaging moving target indication moving targets
MSWF
87 359
858
algorithm
860
CDMA downlink
864
GPS jammer suppression
871
lattice structure
861
nulling performance
878
Wiener-Hopf filter weights
862
multichannel airborne radar measurement system: see MCARM This page has been reformatted by Knovel to provide easier navigation.
926
Index terms multichannel digital receiver signal model multichannel equalisation
Links 833 837 839
multichannel SAR echoes
76
optimum processing
107
processing schemes
82
multicoverage data
760
multifrequency algorithms
440
multifunction radar
543
multiple access interference
864
multiple antenna channel model
885
multiplexing gain
885
891
MUSE
268
294
MUSIC
560
562
572
567
572
574
multistage Wiener filter: see MSWF
space and time processing MVDR processor lattice
574
268 269
N network interface card
884
NIP wave
770
non-Gaussian noise
431
non-homogeneity detection
425
non-homogeneity detector
344
non-linear adaptive detector
775
38
non-stationarity
423
non-stationary hot clutter cancellation
603
normal moveout correction
768
Norton surface wave
606
609
This page has been reformatted by Knovel to provide easier navigation.
927
Index terms
Links
O omnidirectional array
152
circular planar array
157
displaced circular rings
156
linear
152
octagonal planar array
160
optimum detector optimum filter
473
112
117
83
189
optimum processor
468
orthogonal block code
898
over-the-horizon radar
603
193
305
365
426
900
see also HF OTHR radar overspread channel
832
P pairwise error probability
890
parallel processing architectures
265
parallel processors
272
COTS
899
273
parametric adaptive matched filter methodology
441
partially adaptive techniques
420
phase and power-selected training
366
phased array receive antenna subarrays
6 6
pings
720
post-Doppler STAP algorithms
364
PRI-staggered
365
sonar
723
power-selected deemphasis
347
power-selected training
346
experimental results
371
729
360
This page has been reformatted by Knovel to provide easier navigation.
928
Index terms
Links
Predator UAV
396
prediction
514
prestack data
760
PRF diversity
247
PRI-staggered STAP
213
principal component inverse
437
principal components method
868
processing techniques
468
processors for STAP
293
projection deemphasis
347
projection methods
192
pseudo-median canceller
433
pulse Doppler mode
215
pulse-Doppler waveform
240
766 365
369
875
Q QRD
265
268
265
271
radar data cube
377
417
radar data processing
265
inverse
R radar: see airborne radar, GMTI radar systems, ground-based radars, HF surface wave radars, SAR, ship-borne radars, spaceborne radar see also sonar
live
277
performance evaluation
280
radar platform orientation
11
radar sensor bandwidth effects
467
502
486
This page has been reformatted by Knovel to provide easier navigation.
929
Index terms
Links
radar sensor (Continued) directivity patterns
473
Doppler ambiguities
488
range ambiguities
489
system dimension aspects
480
radar systems
413
RADARSAT-2
179
SAR-STAP scheme
196
Radarsim™ SBR
218
radial target velocity
474
Radon transform
726
rain clutter
216
Rake receiver
864
random sidelobe effects
216
range ambiguity
235
range cells
413
range migration
103
range-ambiguous clutter
242
long single pulse phase-encoded waveforms
250
mitigation with pulse-Doppler waveforms
247
STAP performance
244
range-Doppler ambiguity function
491
730 868 242
489
491
240
range-Doppler image
19
range-Doppler maps
13
real-world detection environments
418
receive diversity
887
recursive algorithms
734
reduced-dimension STAP
348
25
742
reduced-rank interference suppression and equalisation
857
reduced-rank methods
437
868
This page has been reformatted by Knovel to provide easier navigation.
930
Index terms
Links
reduced-rank MMSE equalisation
864
reduced-rank MMSE processing
857
reduced-rank STAP
349
Reed-Mallett-Brennan rule
308
reference cells
413
limited
439
retrodiction
514
522
road map information
528
535
road modelling
529
densities
530
ROBLINKS
844
S SAGE algorithm
733
736
739
experimental results
747
fast
741
749
recursive
742
745
749
sample matrix inversion
191
363
378
SAP
610
615
SAR
73
215
along-track interferometry
87
95
194
autofocus
93
displaced phase centre antenna
86
94
193
Doppler centroid
74
89
92
geometry
88
moving target imaging
182
RADARSAT-2
196
SEASAT
94
space-time-frequency domain
98
spaceborne
178
statistical models of measured signals
184
667
This page has been reformatted by Knovel to provide easier navigation.
931
Index terms
Links
SAR (Continued) target motion measurement velocity SAR
183 75
85
see also multichannel SAR SAR-GMTI
177
SAR-MTI
215
SC SAP conditional loss factor analysis SC STAP unsupervised training
667 603
642
649
662
SC STAP algorithm analytic solution
611
efficiency analysis
630
finite sample size considerations
635
operational routines
624
true hot clutter covariance matrix
633
unsupervised training
650
SC STAP convergence analysis loss factor
638 633 654
665 665
scalar moving average model
619
SCNR optimum processing
188
198
SD/TD/CDMA system
786
812
821
spectral efficiency
788
system model
789 884
894
611
638
SDMA
788
906
sea attenuation of sound
828
speed of sound
829
sea clutter
432
skywave data
645
surface wave data
643
This page has been reformatted by Knovel to provide easier navigation.
932
Index terms SEASAT SEASAT SAR
Links 207 94
seismic depth migration
765
seismic reflection imaging
755
data acquisition seismic reflection processing
758 762
CMP sorting
763
deconvolution
763
multiple attenuation
764
normal moveout correction
764
preprocessing
763
velocity analysis
764
seismic velocity model
765
775
seismic wavefield
756
seismic waves
755
seismogram
758
self-rejectability
638
sensor coordinates
508
sensor data fusion
502
505
shallow water acoustic channel
827
830
521
Shalvi-Weinstein algorithm adaptive multichannel
850
experimental results
852
iterative
847
recursive
849
Shannon capacity
892
ship-borne radars
285
side-looking array
469
473
sidelobe canceller
286
292
419
423
sideways looking airborne radar: see SLAR signal contamination
This page has been reformatted by Knovel to provide easier navigation.
933
Index terms
Links
signal diversity
305
signal model
734
signal processing technology
275
signal-to-hot-clutter ratio
631
SIMD parallel processors
272
SIMO architecture
883
singular value decomposition
267
SINR
310
SINR loss
240
STAP performance metrics
885
887
244
248
363
715
310
skywave radars: see HF skywave radars SLAR
5
sliding hole slow moving target detection motion compensation optimum MSAR
442 12
20
24 107
platform manoeuvre effects
23
platform orientation dependence
15
18
883
906
smart antenna technology array gain
887
diversity gain
888
interference reduction
893
multiplexing gain
891
sonar
560 734
702 746
711
active towed array
712
715
727
FM processing
724
fully adaptive CW processing
721
hull-mounted
715
partially adaptive processing
723
reverberation
717
728
This page has been reformatted by Knovel to provide easier navigation.
727
934
Index terms
Links
space-time adaptive matched field processing: see STAMP space-time adaptive processing: see STAP space-time characteristics
199
space-time codes
891
capacity performance
900
error performance
899
linear
897
linear space-time block codes
895
trellis
895
unified design
901
space-time coding
889
3
392
3
space-time D LS eigenvalue processor
389
space-time D3LS forward processor
space-time D LS backward processor
894
901
399
401
403
390
399
403
406
404
406
space-time D LS forward-backward processor
393
399
402
405
space-time data snapshot
389
steering vector misaligned 3
space-time filter space-time modem space-time snapshot vector space-time-frequency analysis space-time-frequency space spaceborne radar
12
387
883 6 103 82
103
177
207
aperture
249
clutter ambiguities
240
clutter spectrum
212
Doppler ambiguities
241
GMTI
179
GMTI signal processing
212
ground target tracking
467
235
202
This page has been reformatted by Knovel to provide easier navigation.
935
Index terms
Links
spaceborne radar (Continued) long single pulse waveform
251
mechanical steering
237
moving target detection
236
MTI applications
208
MTI design
209
range ambiguity
235
242
94
178
SAR simulation
217
STAP processing
212
238
217
231
spaceborne radar simulators antenna parameters
220
clutter patch parameters
221
environment display
221
environment models
225
evaluation
229
generation parameters
221
graphical user interface
218
jammer parameters
222
orbit parameters
219
processing
228
radar models
224
receiver parameters
220
signal generation
227
target parameters
222
waveform parameters
219
spatial adaptive processing: see SAP spatial multiplexing
884
spatial non-stationarity
609
spatial smoothing
439
spectral heterogeneity
325
sphere decoding algorithm
905
892
897
902
422
This page has been reformatted by Knovel to provide easier navigation.
936
Index terms STAMP
Links 701
forward sector processing
709
711
STAP algorithm development
414
algorithm training
361
Doppler-dependent
6
extended beam-augmented
436
factored approach
133
fundamentals
22 441
6
knowledge-aided
352
minimal sample support
348
mobile radio communications
785
performance metrics
308
post-Doppler
7
22
pre-Doppler
7
701
213
365
6
21
PRI-staggered range-dependent reduced-dimension
348
reduced-rank
349
reduction in computational complexity
437
sonar
722
spectral estimation
575
three-dimensional
436
Σ∆
123
359
364
369
728
see also SC STAP STAP algorithm training
361
phase and power-selected
366
power-selected
365
target-free
370
STAP techniques
371
468
heterogeneous environments
344
subspace
192
This page has been reformatted by Knovel to provide easier navigation.
701
937
Index terms
Links
stationary clutter
184
stationary world matched filter
182
steering vector
199
215
mismatch
419
430
stochastically constrained STAP: see SC STAP subarray configuration, generic errors
550 556
subclutter visibility
657
659
subspace methods
192
471
sum beam
123
superarray
550
superresolution
543
595
broadband arrays
559
581
multidimensional method
563
MUSIC
560
single-dimensional method
564
supervised training
562
603
662
123
131
37
265
surface wave radars: see HF surface wave radars surveillance radar airborne early warning MCARM
131
wide-area
207
211
Swerling I target
131
187
Swerling II case
187
symbol error rate
891
149
359
synthetic aperture radar: see SAR systolic array computer
286
T target dynamics model
509
target seeding
335
This page has been reformatted by Knovel to provide easier navigation.
501
938
Index terms
Links
target-free training
370
targets in the secondary data
330
MCARM
340
signal cancellation
332
targets, moving
76
TD/CDMA
822
TDMA
786
terrain scattered interference
420
test cell
413
theatre defence
208
thermal noise
413
three-dimensional STAP
436
TIGER/Line database
342
time-frequency analysis
350
433 211
99
transmit diversity
889
travelling ionospheric disturbance
610
trellis coded modulation
896
trellis codes
895
two-stage processors
434
U UESA radar
151
underwater acoustic channel
827
ambient noise
829
Doppler effect
831
multipath propagation
830
sound speed
829
transmission loss
828
underwater acoustic communication networks
834
underwater acoustic communication systems
827
coherent digital receivers
832
853
833
This page has been reformatted by Knovel to provide easier navigation.
939
Index terms
Links
underwater acoustic communication systems (Continued) incoherent digital receivers
832
sea trials
844
underwater environment sound propagation models
701
707
711
649
662
703
union bound
899
United States Geological Survey database
342
unmanned air vehicle
396
unsupervised training
603
known cold clutter model
649
known order of cold clutter model
652
operational SC STAP algorithm
656
V V-BLAST algorithm
905
vectorial lattice processing architecture
271
vehicle detection
284
vehicle mapping
177
velocity mismatch
114
velocity SAR
75
vertical receiver arrays
845
visibility curve
281
VLSI
273
85 283
W Walsh-Hadamard codes
864
Watterson model
627
WAVES
571
white-noise-gain-constrained method
705
whitening filter
440
wide-area surveillance
207
211
This page has been reformatted by Knovel to provide easier navigation.
940
Index terms
Links
wideband-narrowband processing
706
Wiener estimator
793
Wiener filter
436
GPS jammer suppression
438
843
859
877
multistage: see MSWF Wiener-Hopf filter weights
862
Wiener-Hopf solution
867
Wigner-Ville distribution wireless communication systems
76
99
906
Y Yule-Walker equation
623
Z zero-forcing equalisation
840
zero-forcing receiver
905
zero-order ray theory
756
757
ZF-BLE
792
796
123
442
Sigma Σ∆-STAP advantages
143
algorithms
125
difference beam desired characteristics
135
limitations
145
MCARM system
131
performance
129
probability of detection
130
probability of false alarm
130
SINR potential
129
subarraying
142
139
135
This page has been reformatted by Knovel to provide easier navigation.
877