Principal Component Analysis in Linear Systems Controllability Observability and Model Reduction

IEEE TRANSACTIONS ON AUTOMATICCONTROL. VOL. AC-26, NO.
1, FEBRUARY 1981 17
PrincipalComponentAnalysis in Linear
Systems:Controllability,Observability,and
ModelReduction
BRUCE C. MOORE
Abstmct--Knlmnn’s minimal realization theory involves geometric o b but arbitrarily small perturbations in an uncontrollable
jeds (controUabk, uuobsewable subspaces) which are snbject to stradural modelmaymake the subspace technically not proper.
instability.SpedkaUy, arbitrarily small pertnrbations in a model may Hence, for the perturbed model,’the theory, taken liter-
cause a change in the dimensions of the associated subspaces. This
situation is manifested in computatiooaldiffiities which arise in attempts
ally, says that (assuming observability) there is no lower
to apply textbmk algorithms for computing a minimal realization. order model with the same impulse response matrix.There
Structural instabiity associated with geometric theories is not unique to may well exist, however, a lower order model which has
control; it arises in the theory of hear eqoatiors as well. In thif setting, effectively the same impulse response matrix. There is a
ttse computational problems have been studied for decades and excellent gap between minimal realization theory and the problem
tools have been developed for coping with the situation. One of the main
goals of this paper is to Can attention to p&zipal component analysis
of finding a lower order approximation, which we shall
(Hotelling, l933),and an algorithm (Golub and Reinsch, 1970) for comput- refer to as the “model reduction problem.”
i n g t h e ~ w h r e ~ ~ s ~ o l a m a t r i x . T o g e t b e r t h e y f o r mThea purpose of this paper is to show that there are some
powerful tool for coping with structural instability in dynamic system. veryusefultoolswhich can be used to cope with these
As developed in this p a pe r
,principal mmponeot analysis is a technique structural instabilities.Specifically, the tools w i be a p
l
for analyzing signals. ( S i a r value decomposition provides the computa-
tional machinery.) For this reason, Kalman’s minimal realization theory is plied to the model reduction problem. We shall draw
recast in t e m of
~ responses to injected signals.Application of the signal heavily from the work of others in statistics and computer
am@& to contrdlability and observabii leads to a eoordinate system in science, where theproblem of structural instability associ-
m special properties For asymp
which the ‘’ininternally b a l a n d ’ model t ated with geometric theories has been studied intensely.
totically stable systems, this yields working approximations of X,, X;, the
Principal component analysis, introduced in statistics
controllable and unobservable subspaces. It is proposed that a natural f i i
stepinmodelreductioniotoapplythemechanicsofminimal~oa (1933) by Hotelling [4], [ 5 ] will be used together with the
*~workingsnbspaces. algorithm by Golub and Reinsch [6] (see [7] for working
code) for computing the singular value decomposition of
matrix.Dempster [8] gives an excellent geometric treat-
I. INTRODUCTION ment of principal component analysis as well as an over-
view of its history. A thorough discussion of the singular
A COMMON and quite legitimate complaint directed
toward multivariable control literature is that the value
paper
decomposition and its history is given in a recent
by Klema and Laub [9]. There are excellent books
apparent strength of the theory is not accompanied by
strong numerical tools. Kalman’s minimal realization the- [lo]-[ 151 within the area of numerical linear algebra which
ory [2], [3], for example, offers a beautifully clear picture explain how structural instabilities arise and are dealt with
of the structure of linear systems. Practically every linear in the theory of linear equations.
systems text gives a discussion of controllability, observa- The material given in SectionsI1 and I11 of this paper is
bility, and minimalrealization. The associatedtextbook more general than that appearing in the remaining sec-
algorithms are far from satisfactory, however,serving tions. In Section I1 minimal realization theory is reviewed
mainly tp illustrate the theory with textbook examples. from a “signal injection” viewpoint. The main advantage
of this viewpoint is that the relevant subspaces are char-
The problemwith textbook algorithmsforminimal
acterized in terms of responses to injected signals rather
realizationtheoryis that they are based on the literal
than in terms of the model parameters ( A , B, C ) . The full
content of the theory and cannot cope with structural
discontinuities (commonly called “structural instabilities”) power of the ability to’inject signals of various types is not
which arise. Uncontrollability corresponds to the situation fullyexploitedin this paper. Section I11 contains very
where a certain subspace (controllable subspace) is proper, general results which are valid whenever one is trying to
find approximate linear relationships that exist among a
set of time variables. In no other way is linearity required.
Manuscript received July 17, 1978; revised October 25, 1979 and June (See [16] for ideas about nonlinear applications.)
5, 1980. This work was supported by the Natural Sciences and Engineer-
ing Research Council of CanadaunderOperatingGrant A3925. This In Section IV controllability and observability analysis
work represents a major revisionof [ 11.
The author is with the Department of Electrical Engineering, Univer- is discussed. Most of the effort is spent coming to grips
sity of Toronto, Toronto, Ont, Canada. with the problem of internal coordinate transformations.
0018-9286/81/0200-0017$00.75 0,1981 IEEE
Authorized licensed use limited to: Chalmers University of Technology Sweden. Downloaded on February 09,2023 at 17:13:18 UTC from IEEE Xplore. Restrictions apply.
18 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, NO.
VOL. AC-26, 1. FEBRUARY 1981
This leads to the description of a coordinate system n

(genericallyuniquewithinsign changes) in which the
Plant ...
model has special properties. This “internally balanced” Fig. 1.
model is essentiallythe same as the “principal axis realiza-
tion” defined in a filtering context by Mullis and Roberts
~71.
For asymptotically stable systems,onecanview the
controllability,observability analysis as a process by which ( x ( t )E W”) which, when started at rest ( x ( 0 )= 0), simu-
onecomputesworkingapproximations of X,X, the lates exactly the small signal input-output characteristics
controllable and unobservable subspaces. In Section V we of the plant with the coordinate system translated to
pursue the natural idea of applying the mechanics of a,, ye.
minimal realization theory using these working subspaces The model (1) is required to simulate only the input-
instead of the exact subspaces, which are subject to struct- output characteristics of the plant, and it is often true that
ural instability. Examples are given in Section VI. little can be said about the relationship of x ( t ) , A , B, C to
the physicalsystem.Furthermore,since(1)is only a
Notation model, we are free to insert a “test” input vector d ( t ) as
follows:
The symbols R, P C [ t , ,t 2 ] represent the field of
real numbers and the ring of piecewise continuous i(t)=Ax(t)+Bu(t)+d(t)
functions on the interval [ t l , t 2 ] ,respectively. The cor-
respondingvectorspaces RXR X R--- and y(t)=Cx(t). (2)
m
P C [ t , ,t 2 ] X X P C [ t l ,t 2 ] will be denoted by R”, The vector d ( t ) is introduced to aid in the discussion of
observability of the model and has no connection with the
P C ” [ t , , t 2 ]F. t r a subspace SER“, S’- wi represent its
l physical plant.
orthogonal complement in R“ withrespect to a basis Minimal realization theory gives a clear geometric
defined by the context. A matrix U will be referred to as answer to the following questions. Under what conditions
an orthonomzal basis matrix for S if its columns form an does there exist a lower order model which, when started
orthonormal basis. at rest, also simulates exactly the small signal characteris-
For a map M : Rk+ F i n , ker( M ) and im( M ) will repre- tics of the plant? If there is such a lower order model, in
sent the kernel and the image of the map. The symbol M whatwaydoesit relate to the original model? The re
wilI also be used to represent a matrix representation, in mainder of this section offers a review of minimal realiza-
which case we shall write M E R n x k . For such a matrix, tion theory.
MT w li represent its transpose, and IIMIIF, 11 MII, will
represent, respectively, the Frobenius and spectral norm. Basic Geometric Review
For a vector u E R“, 11 u 11 will represent the Euclidean
norm a. The geometric picture which goes along with minimal
realization theory is very simple and well known. There
are two important subspaces of the state space (R”): X,
11. A “SIGNALINJECTION”VIEWOF MINIMAL the controllable subspace, and X, the unobservable sub-
REALIZATION THEORY space. The subspace X, is the smallestsubspacewhich
contains the state response ( x ( 0 )= 0) to every piecewise
The two tools, principal component analysis and singu- continuous vector signal injected at the model input termi-
lar value decomposition, are ideal for analyzing vector nals ( u ( t ) ) . Thesubspace X, is the largestsubspace in
time signals. In an effort to make later use of these tools
which arbitrary piecewise continuous
signals
can
be
more transparent, minimal reahation theory w i be
l
injected [through d ( t ) ] with no output response.
reviewed from a “signal injection” point of view. Specifi-
With x(O)=O every state response can be decomposed
cally, the controllable subspace and the unobservable
into the sum of two orthogonal vector signals,one in
subspace w li be characterized in terms of vector time A
responses of the model to test signals injected at ap- X, = (X,nx,)l nX, and one in X k . The signals in X,,
propriate points. completely determine the input-output response of the
To avoid confusion, special care mustbe taken to model. If U is an orthonormal basis matrix for X, then
establish the state space setting. The assumed situation is
the following. There is a plant, illustrated in Fig. 1, with rn I1(t)= UTAUxl(t)+U=Bu(t)
inputs (ti)and r outputs (f), operating quietly (at rest) at
an equilibrium point (ye,ti,). Furthermore, there is a A t ) =C U x , ( t )
corresponding model
is a minimal order modelwith the sameinput-output
I(t)=Ax(t)+Bu(t) characteristics as (1).
MOORE: PRINCIPAL COMPONENT ANALYSIS IN LINEARSYSTEMS 19
dftl=O injecttestsignals
measure state (x(0)=0)
response (x(O)=Ol
responses
signals
Fig. 2. Fig. 3.
111. PRINCIPAL. COhiPOhTENT ANALYSIS

Characterization of X ,
The resultsgivenin this section support the analysis
It isconvenient to think of X, in terms of state re-
techniques to be used in later sections. The organization
sponses to test signals, as indicated in Fig. 2. In this paper
given here is self-contained and, to this author's knowl-
we shall consider a sequence of test signals ui(t), 1 < i < m
edge, unique. Similar mathematical constructions appear
given by u'(t)= e$(?) where e, is the ith column of the
in control literature, but the material is closest in spirit to
m X m identity matrix and a(?) is the scalar unit impulse
Hotelling's work in statistics [4], [5].
- -
function.' Let x]( t ), * ,xm(t ) be functions corresponding
Let I? R+RnXm be a piecewise continuous map repre-
to the state responses of the model to the respective test
sented in matrix form by F(t). One can think of F ( t ) as a
input signals, and let
set of m vectorsignalsinvolving n variables, i.e., each
columnrepresents a singlevectorsignal in R". The
X ( t ) L (XI( t ) x'( t ) . x y t )) . Gramian2
For every instant of time we have X ( ? ) € RnXm,and X, w2L p ( f ) F T ( t ) d t

can be characterized as follows.
Proposition I: The controllable subspace X, is the sub- is a positive semidefinite matrix with a set of nonnegative
space of least dimension which contains i m ( X ( t ) ) for all real eigenvalues u: >:u > -
> u," > 0 and corresponding
t E[0,TI, T> 0. mutually orthogonal unit eigenvectors u l , u,, ,v,,. -
Pruo$ With impulses used as test signals, X ( ? )=eAfB The map F may be represented with v , , ,v, used as --
and the proof is straightforward. orthonormal basis vectors for R", i.e.,
Characterization of X , 4 t ) = u , f i T ( t ) + v 2 f 2 T ( t ) + . . .+u,fT(r) .
For this we apply a series of tests as indicated by Fig. 3. where AT( t ) L uTF(t) for 1 < i < n. Throughout the paper
Within this paper the test signals d'(t) 1 < i < n are we shall refer to the "ith"
A
given by d'(t)= e&?) where e, is the i th column of the principal component = vifiT(t),
n X n identity matrix and a(?) is the scalar unit impulse A
componentvector = vi,
function.*
'( -
Let y t), ,y "(t ) be the output responses correspond- component magnitude = a,, and
A
ing to the respective test signals, and let

component function vector L A ( t ) .
Y(t) ( y ' ( t )y 2 ( t ) . - - y " ( t ) ) . Proposition 3: The following relationships hold for 1 <
( i , j ) < n:
For every instant of time we have Y ( t ) E R r X " ,and X, is
characterized as follows.
Proposition 2: The unobservablesubspace Xa is the
subspace of greatest dimension which is contained in
ker( Y(t ) ) for all t E [0,TI, T> 0.
Proufi With impulses usedas test signals, Y ( t )= CeAf
and the
is proof straightforward. rn
Remark: Since ker(Y(t))= im(YT(t))", it is also true
that X: is the subspace of least dimension which contains
im(YT(t))for a11 ~ E [ O , T I , T > O .
'Theresultsof this sectionarevalid with 8 ( t ) selected to be other

scalar functions, such as the unit step. In this paper we shall not consider
testsother than m
i pulses. but prelimhay work indicates thatfuture
r e f i i e n t of the model reduction framework developed here will take 'The Gramian appears in deterministic control literature [18], [19], but
advantage of this freedom to select from a large class of test signals. its eigenvector structure appears not to have been fully exploited.
20 IEEE TRANSACTIONS ON AUTOMATIC COhTROL, VOL. AC-26. NO. I. FEBRUARY 1981
from which the first two properties follow easily.The third

is a consequence of the fact that
l,f211F(f)ll$dt=tr(l ? ( t ) F T ( t ) dt). response m a t r i x F(t)

Fig. 4.
There is anicevisualimagewhich can beuseful in
interpreting this result. One can think of Jt:z 11 F(t)ll dt as If Jt:2FA(t)&T(t)dt has rank k, it follows from perturba-
the total energy in thesignalsetover [ t l ,t 2 ] , and the tion properties of singular values that
component vectors and magnitudes reflect the spatial
distribution of thisenergy.Specifically,assuming
a,, > 0, the set
that
2EA(t)E:(t)dt>
i=k+ 1
x
n
U
:
( ~ e l " : ~ ~ / ~ u ~ F ( t ~ ~ ~( 3~) d t = l )
is an ellipsoid with semi-axes(l/uj)ui, 1 Q i Q n.

Princ@aI Components of Impulse Response Matrices
Linear Dependence and Least Squares Approximations
If F ( t ) is the impulse response matrix of a linear time-
It is often important to determine fixed (independent of invariant system, the principal components over [0,T ]can
t ) linear relationships which exist among the rows of F ( t ) be given sharp systemic interpretations. Consider the sys-
over [ t l ,t 2 ] .From a geometric point of view, this amounts tem depicted in Fig. 4 where w ( t ) E R" and z ( t ) E R", and
to finding the subspace S,' where let G? represent the class of all input functions a(-)which
are piecewise continuous on [0,TI and satisfy the norm
sF: {u: uEim(F(t)), t ~ [ t t, 2, ] } . bound ( J : I I ~ ( t ) l l ~ d t ) '</ ~1.
It is well known (see [19, p. 75D that the image of the
It is clear that S F is spanned by the component vectors convolution map
corresponding to nonzero magnitudes.
In many cases exact linear dependence is not the case ( z E R " : Z = L ' F ( ~ - T ) ~ ( T ) ~ TT,,a ( - ) € P C " [ O , T ] )
(SF = Rn)and one looks instead for "approximate" linear
relationships over [ t l ,t 2 ] .The following result shows that
the component magnitudes and vectors reveal the possibil- is precisely the space S F . The set (contained in SF)
ities with respect to such approximations.
Proposition 4: Let k be a fixed integer, 1 < k Q n. Over
the class of piecewise continuous FA(t) satisfying
dim{SFA}=k, the residuals providesmore detailed structural information than the
space SF.In the next few paragraphs we shall show that S
J F ~~, f 2 ~ ~ ~ ( t ) - ~ ( t ) ~ ~ ~ ~ t ; isaregionin Rn whose surface is an ellipsoid[closely
related to (3)] defined by the component magnitudes and
JS max / " l l L ) ~ ( ~ ( t ) - - ~ ~ ( t ) ) l l 2 d t vectors of F ( t ) over [0,TI.
Iloll=l t, Let a, o j , 1 < i Q n be the component magnitudes and
vectors of F ( t ) over [0,TI, and let Z, V E Rnx" be defined
are minimized with as follows:
k A
FA(t)=Fk(t) 2 uj&'(t). Z = diag{aI,a2;..,an)
i= 1
A
The error residuals are v = (ul 0 2 " ' u,,).
n
JF= x
i=k+l
a;; JS=U~+~. The following proposition shows that the set s"={ z : z =
V Z p , 11 p 11 = l } (an ellipsoid with semi-axesq u i , 1 Q i Q n )
corresponds to the surface of S.
Proo$ It is easy to verify that Fk(t) gives the stated Proposition 5: The set S can be characterized as follows
residuals. For an approximation FA which minimizesJF or
JS, it is necessarily true that Jt:2FA(t)E:((t) dt =0, where
EA(t ) = F ( t ) - F'(t). Hence, for such an approximation
Proo$ For simplicity we shall assume that a, >O. The
general
W 2 = ~ i f ' F A ( t ) F ~ ( t )tid t + ~ ' 2 E A ( f ) E ~ ( t ) ~ t . proofisbasically the same onlymoretedious.
First it will be shown that 9 is contained in S. For every
MOORE: PRINCIPALCOMPONENTANALYSISIN LINEAR SYSTEMS 21
is a vector p, IIp 11 = 1, satisfying 2 = vzp.

, ? E ithere It is not necessary or desirable, however, to compute such
From thedefinition of principal components, we know an approximation for W 2 .Imagine (it need not be actu-
that ally constructed) a data matrix
w2= I T F ( t ) F T ( t )dt= VZ2VT.

0
This means that q A VE-'p satisfies 2= W2q,which im- Since W; = DND,', it follows that the singular values and
plies (see [19,p. 76D thattheinput w(t)=FT(T-t)q leftsingularvectors ofDN approximate thecomponent
drives z( f ) to 2 at time T. Furthermore, simple manipula- magnitudes and vectors of F( t ) over [t,,t 2 ] .
tion gives Jzo(t)oT(t)dt=l whichmeans that u(.)EQ It is not necessary to store DNinmemory-thisis
and ZES. important because the data matrices may contain many
To complete the proof it is sufficient (because of linear- column vectors. Instead, one may preprocess the data by
ity) to show that the input o( .) constructed in the previ- recursively (treating as few as one column at a time)
ous paragraph is the minimum norm input which drives reducing DN to a unitarily equivalent matrix (see [12, p.
the system to 2 at time T. Let h ( t ) be any input which 3831):
drives the system to 2 at time T. With A(?)=&(?)-w(t),
R = DNQ ( Q unitary-not stored).
simple manipulation gives Jor;jT(t)A(t) = 0 and it follows
that The matrix R E Rnx" has singular values and left singular
vectors equal (to machine precision) to those of DN. The
algorithm (SVD) developedby Golub and Reinsch[6]
(working code in [7D can be used to compute the singular
It is interesting to note that if 2=ujvi, then the mini- values and left singular vectors of R.
mum norm input function is The main advantage in using SVD, instead of comput-
ing the eigenvalues and eigenvectors of W;, is in reduced
o(t)=FT(T-t)vioi-1= resolution requirements. The "squaring process" doubles
which is the ith component function vector, normalized the demand for resolution in the computations. Specifi-
and reflected in time. cally, supposethere is 12 bit resolution associated with the
samples 4 7 , ) . To preserve this same resolution with the
Computation of Component Magnitudes and Vectors computed singular values,
a, -2- *2alQ {computed value of ai}
First let us deal with the special situation where one has
an asymptotically stable model ( A , ByC ) and wishes to Qo, +2-'2a,
compute the components of eArB(eAT'CT) over [O,co). In
this case it is often convenient to use the fact that requires at least 12 bit resolution using SVD and at least
24 bit reeolution using the squared up version.
w,' A &meA'BBTeAT'dt
Perturbation Properties of Component Magnitudes
and Vectors
Wt &meATfCTCeA'
dt
Suppose F( t) is perturbed by AF(t) so that
are the uniquesymmetricpositivesemidefinitematrices F,(t) F(t) + AF(t) is piecewisecontinuous. As one
which satisfy might expect fromthe preceding paragraphs, the perturba-
tion of component magnitudes maybe bound in much the
A we'+ w;A==-B B ~ same way as singular values of a matrix.
Proposition 6: Let a,, u) be the ith component magni-
A'WO' + W,2A= -cTc.
tudes of F(t), G(t),respectively. Then
One can firstsolve for w,"(w,") (see [20] for one algo-
rithm), and then use a specialized routine for computing
the eigenvalues and eigenvectors of a symmetric matrix.
For the more general case it is necessary to compute
Pro08 From the discussion of the computation of
approximate component magnitudes and vectors by sam-
components, the perturbed Gramian is
pling F ( t ) . With evenly spaced sample points T ~r,,
A
, ,7N -- e
(70 A t,, rN = t2), it follows from rectangular approxima- W 2- l im (ON+ADN)(DN +ADN)T

A -iv+Oo
tion that if N is large, then
where AD, =((t2 -tl)/N)1'2(AF(Tl) AF(r2).- mAF(7,)).
Standard perturbation results for singular values implies
that each singular value ofDN +AD, is perturbed from
22 IEEE TRANSACTIONS ON Auromnc CONTROL. VOL. AC-26.NO. 1. FEBRCARY 1981
that of DN by no more than Relationships Between (6G,C) and Components of eA'B,

e art^ T
Let Q,(t,), Q,(t,) represent the extended controllability
and observability matrices corresponding to ( F , G , C ) :
Q,(t,) ( G FG * - * FNG);
In the limit the right-
hand expression
approaches
11 lz:2AF(r)AFT(t)dt11 ;I2.
As is the case with singular vectors, the perturbation of
component vectors requires some explanation. Let uf rep-
resent the i th component vector of FA( t ) and visuahe the
two ellipsoids E , EA, the first with semi-axes uiui, and the
perturbed one with u,%:. If AF(r) has very small principal The matrix Q,(t,) is a data matrix which is closely related
components, then EA is close to E ; that is, no point in E is to eA'B, and QZ(t,) is one formed by sampling CeA' every
perturbed by more than 11 Jz:zAF(t)AFT(t)dt 11 ;I2.If, how- t, seconds over the time interval [0, TI. To aid in the
ever, E has two or more semi-axes of nearly equal lengths discussion, we shall adopt the following notation.
(circular cross section), then theseaxesmaybevery A A
1) V, = (uclu,,~ * ucn); Z, = diag{u,,; . . ,u,,} where
sensitive to small perturbations. The space spannedby
them is not sensitive, however, and this presents no real uti, ucirepresent the ith component vector and magnitude
difficulty. The algorithm SVD gives singular vectors which of eA'B.
A A
are orthogonal (to machine precision), even with repeated 2) V, = ( U ~ , O ~* Zu,,); ; ~ ~ Z,- ~ = diag{u,,,- . . ,uon} where
singular values. uoi, uOirepresent the ith component vector and magnitude
of e A r z .'~
IV. CONTROLLABILITY
AND OBSERVABILITY 3) T ( r , ) Z : ( t , ) ~ T ( t s ) the singular valuedecomposi-
ANALYSIS tion of Q,(r,).
4) ~ ( t s ) Z ; ( t , ) ~ T ( r sthe ) singular valuedecomposi-
We are now prepared to consider the application of
principal component analysis to responses of the model tion of QZ(t,).
Proposition 7: The singular values satisfy
x( t ) =Ax( t ) +Bu( t )
If the diagonal elements of E,, Z, are distinct, then

with the idea of combining it with thesignal-injection
view of minimal realization. The basic idea is close to one lim T ( t , ) = K ; lim c ( t , ) = V , .
f,+O I,+O
advancedby Friedland [21] and, as mentioned in the
Introduction, is very closely related to the filtering work Proof: It is easy to see, using rectangular approxima-
of Mullis and Roberts [17]. tion of integration, that
A central problem to be dealt withis the factthat
internal responses eA'B,eAr"CT depend on the internal
coordinate system. This means that unless there is some
iTeArtCTCeA'dt=t,+O lim (< Q,(t,))'( Q,(t,)) fi
specialsignificance attached to the internal coordinate and we shall prove that
system, the existence of "small"components in eA'B or T
eAr'CT implies nothing about their importancewith re-. iTeA'BBTeA'dt=
spect to input-output properties of the model. To over-
come this problem, we derive a special coordinate system
where input-output properties are reflectedby internal
(5)
principal components. These two relationships imply that the claimed limiting
In the development we shall find it useful to carry along relationships hold.
the discrete time subordinate obtained by sampling (and Now to see that (5) holds, note that
holding inputs) every t, seconds.
and
GGT=r:BBT+-(ABBT+BBqT)+-e-.
t:
2
In analyzing responses of ( A , B, C ) over an interval [0,TI,
it will be assumed that N = T / t , is an integer. Since eA' is bounded on [0,TI, there exists finite K,S > O
MOORE: PlUNClPAL COMPONENT ANALYSIS IN LINEAR SYSIFMS 23
such that for t, < 6

IIFi(t:BBT-GGT)FirII <Kt:
for all i > 0.Hence,

1 .
lltsFiBBTFiT- -F'GGTFiTll <Kt:,
t,
and for t , <6,
N
1
11 2 tseAtskBBTeATtsk
--Qc(t,)Q~(ts)~~ <KNt: =KTt,.
k=O tS The precedingexample illustrates the fact that eAtB,
Since T, K are fixed constants, the resultfollows. E CeAfdepend upon the internal coordinate system. A coor-
dinate transformation x ( t )= P 2 ( t ) gives the model
Principal Components of eAtB, e ATfCT
i=h(t)+iu(t)
The component vectors corresponding to the nonzero y(t)=Cx(t)
principal components of eAtB span X,, the controllable
subspace. For simplicity we shall assume that there are no where A^=P-'AP, B^=P-'B, c = C P . It is important to
components which are identically zero. For this author, observe that
the mental imagedeveloped in the previoussectionis
helpful. The ellipsoid with
semi-axes uclvcl, eati=p-leAtB; eeit =CeAfp.
uC2vc2,* --
,ucnvcnis the surface of the region in the state In discussing coordinate transformations, thefollowing
space corresponding to points which can be reached from notation will be adopted:
the origin with input vectors satisfying j:ll w(t)ll dt Q 1.
In some respects the ratio y, = uCl/ucnserves as a condi-
tion number with respect to pointwise state contr01.~
The component vectors corresponding to the nonzero
principal components of eArfCTspan X i . Again we shall
rule out the trivial case where there are components which
are identically zero. Here one can imagine an ellipsoid in
R" with semi-axes (descending order according to length)
-1 -1 -I
v o n ~ ' o o n - ~ v o n - l ~ " ' ~ u o l '01
which corresponds to the set of all initial conditions which

satisfy J:IICeAtxg 11 2 d t = 1 . The ratio p o =uol/uon acts in For the case where P = I (original coordinate system), we
somerespects as a condition number withrespect to shall simply write W,', W,'.
zero-input state observation.
It may be tempting to treat very small components of Coordinate Invariant Values-Second-Order Modes
eAtB or eAT" as though they were identically zero (i.e.,
approximate X,, X, using least squares). A little reflection Looking again at the example used to illustrate the
reveals that this is not generally appropriate. Consider the dependence on internal coordinates, one can see that
model small components of eATtCTmay be offset by large com-
ponents of eAtB. Intuitively, it seems this problem could
be alleviatedby transforming to a coordinate system
where the components of eAtB all have unit magnitude,
i.e., select P so that W,'( P ) = I . Since W,' = V,Z:V,T it is
clear that this can be achieved by setting P = KZ,.A little
y(t)=(106 10-6)( X I ) . algebra gives W,'( P ) =HTH where
x2
H A ZoVTV,Z,.
It is easy to see that eAtB, eATrCTboth have one very
s m d principal component with yo~ pM 1, 0 ' ~In. this case Note that the singular values of H are the component
the highly distorted ellipsoids simply reflect an internal magnitudes of eiTtcT. Although the matrix H depends on
the initial coordinate system, one can easily verify that its
3This ratio depends on the selected coordinate system-a poor condi- singular values do not. These values, which will be shown
tion numbermaysimplyreflect apoorinternalcoordinatesystem.
Discussion of this will appear later in this section. to reflect input-output properties of thesystem,playa
24 IEEE TRANSACTIONS ON ALTOMATIC C O N ~ O L VOL.
, AC-26, NO. 1, FEBRUARY 1981
central role in the remainder of this paper. For this reason Let &,represent a transformation which gives an inter-
we shall attach special termin~logy.~ nally balanced model. It easy is to verify that
Definition: The singular values of H will be represented Po, = &x-', pi, = C b 2 give,respectively,
A A
output-
by ut > u: > ---
>u,' > O and willbe referred to as
normal and input-normal models. The three coordinate
second-order modes of the system.
systems are, therefore, related by simple scale factors. The
There is an interesting connection between the second-
internally balanced model w li receive the most attention
order modes and discrete-time Hankel matrices corre-
in Iater sections, largely because of the property given in
sponding to ( F ,G , C). Recall that for a given interval
the next paragraph.
[0, TI and sampling time t,, we have the extended control-
Let p:( P ) , p t ( P ) represent the condition numbers (with
lability and observability matrices Q,(t,), Q,(t,) defined
respect to inversion) of w,'( P ) , w , ~ ( P ) .Then, as dis-
by (4). With T fixed, the corresponding Hankel matrix
cussed in previous paragraphs, p c ( P ) ,p , ( P ) act as condi-
M,(t,) = Q,(t,)Q,( t,) grows in size (number of rows and
columns) without bound as ts+O, but MH has at most n tion numbers of the model (2,i, withrespect to e)
nonzero singular values. pointwise state control and zero input state observation,
Proposition 8: Let u;(t,), 1 Q i < n be the ordered singu- respectively. The followingresult shows that, in one sense,
lar values of M H ( t r ) .Then for 1 Q i < n, the internally balanced model gives the best compromise
between the two condition numbers.
lim a,*( t , ) =Ui' Proposition 9: The quantity m a ( pc( P ) , p,( P ) ) achieves
1, -0
its minimum with P = < b .
where u; is the ith second-order mode. Proof: Let p ( M ) represent the condition number of
Pro08 Proposition 7 implies that the matrix M with respect to inversion. It is easy to show
that
1
-Qc(t,)=(Kzc +Rl(t,))U:'(ts)
fi
Furthermore, it is a simple matter6 toshow that
fiQ , < t , ) = K+(t,)(z,V~+Rz(t,>)
where R , ( t , ) , R 2 ( t s are
) n ~ matrices
n satisfying for square nonsingular matrices MI, M2.It follows that
lim II R2(ts)ll =o.
lim II R1(ts)ll= t,+o
1, -0
It follows, therefore, that
~~(t,)=Q~<t~)Q~(t,)
= v*(t,)(zov~v,z,)utcr(t,) and that
+ u,'(t,>E(t,)~:T(t,>
where E ( t , ) is n X n and satisfies l i m t s 4 11 E( t,)ll= 0. Since
U,*(t,), q(t,) each has orthonormal column vectors, and Since pc(
the second-order modes are the singular values of
eb)=pL,(
Pib) = (u1/u,), this completes the proof.
w
H Z,VTV,Z,, the follows.
result w Thefollowing result shows that if the second-order
modes are distinct, then the internally balanced model is
Models with Normalized and Balanced Internal Dynamics essentially unique.
Proposition 10: If the second-order modes are distinct,
It isevident that the condition W:(P)=I does not then the basisvectors defining the internally balanced
define a unique coordinate system. We shall now define model are unique within a change of sign.
and discuss three closely related, essentially unique coor- Proof: Suppose A , B, C is internally balanced so that
dinate systems' in which the structure associated with the W,' = W,' =' 2 and assume that P satisfies W,'( P ) =
second-order modes is displayed clearly. Let Z2= w,'( P ) = 2'. This implies that
diag{u:, u;: . ,u,'}.
Definitions: Themodel (a,&e) is input-normal on p-Iz2p-lT=pT~2p=~2
[O, TI if w,'( P ) = I ; w,'( P ) = E4;oulput-nomi on [o,T I which further implies that P -'Z4P= Z4. These equations
if W,'( P ) = E4; W,'( P ) =I; and internal& balanced on constrain P to be a diagonal matrix with 2 1 in each
[0, TI if w,'(P ) = w,'( P ) = z2. diagonal entry.
4 T h i s terminology is borrowed from Mullis and Roberts [ 171. 61t is not a simple matter, as one reviewer observed, to show that this
5 T h e s e correspond to "principal axis realizations'' introduced by Mul- result holds in general for nonsquare matrices, but the general result is
lis and Roberts [ 171. not needed here.
MOORE: PRINCIPAL COMPONENT ANALYSIS IN LINEAR SYSTEMS 25
The transformation P;.6 can be executed withthe follow- The symmetric,positive semidefinite matrix Jie211rl?,l?r
ing algorithm. e'T1'd.r is, therefore, norm bounded and it follows that
limt,,eaI1'.kl = O and that lim,,,eAII'Z, exists. Since 2,
- A
Step I ) Compute V,, Z, and apply the transformation
P , = V,Z, to give w:(P,>= I. Let 2 P; 'AP,; is nonsingular, limt,meA1lt must exist. A straightforward
B" P r ' B ; ?t CP,.
argument (omitted) shows that controllability of 2, is a,,,
a generic property, and this implies further that asymp-
Step 2) Compute F,, Zocorresponding to the compo- -_- totic stability of a,,
is a generic property. Similar argu-
nent vectors and magnitudes of eiTrET.Apply to ( A , ByC ) ments apply for a,.
the transformation P2 = f,z;1/2
to give the internally The previous stability result is really more general than
balanced model we need here. Consider the case where the state variables
of the internally balanced model ( A , B , C ) are simply
a=P,-'P,'AP,P,; partitioned and not reordered. Specifically, suppose the
j = p - 1 -1B. matrix 2 ' of second-order modesis partitioned where
2 P1 9
e=CP,P,. Z1=diag{u,,---,uk}, Z2=diag{uk+l,~-~,un}. This i m p

ses a corresponding partition of the state variables of the
internally balanced model
Properties of AJymptotically Stable, Internal&Balanced
Models
Let us now limit the discussion to asymptotically stable
systems and assume that ( A , ByC ) is internally balanced
over [0,ao), i.e.,
eArBBTeATIdt= L" eATI C TC e A t d t = Z 2

where we have again inserted the test injection input d ( t ) .
or, equivalently, Wemayview this as two interconnected subsystems:
( A , , , B , , C , ) and (A,,, B,, C2). Using Lemma 7, it is not
+
AX2 Z2AT= - BBT (6) difficult to prove the following.
ATE2+ Z 2 A = - C T C . (7) Proposition 11: The two subsystems ( A , , , B , ,C , ) ,
(A,,, B,, q )are generically asymptotically stable and in-
Under these conditions the model ( A , ByC ) has a rather ternally balanced with
extraordinary property involving the stability of subsys-
tems. Consider arbitrary reorganization of the internally dt= 1 00
eArt T,
II C C eA11tdt=Z2
1
balance2 system into two interconnected subsystems, i.e.,

let (8,B, C ) be obtained by reordering the state variables
and partition (a, e)
i?, as follows:
U Prooj Stability was proved in Lemma 7; the second

part followsfromsimple inspection of (6) and (7) in
partitioned form.
Prelude to Model Reduction
The basic model reduction idea to be pursued in the

Lemma The matrices ill, satisfy limt+me'llt next section follows intuitively from Proposition 11. Con-
= K , , lim,+meAUf=K , where genericallyK , = 0, K2 ~ 0 . sider the case where u ~ > > u ~ +Then
, . the subspace
Proof: It isas@ple matter to show that AX2 +
e2A7= - l?iT where Z is a nonsingular diagonal matrix
(reordered version of X). This implies that
X , im( ):
al,e:+e;AF, = -z,i?T, (where Ik is the k x k identity matrix) in some sense "acts
like" both X, and X$. If the mechanics of Kalman's
and itfollows (integration by parts, or see [22, p. 2991) minimalrealizationtheory are applied to the internally
that for t >0 balanced model with X, used as a working approximation
of X,,, the resulting lower order model is ( A , , , B , , C , ) .
This model is generically asymptotically stable and intern-
ally balanced.
'During theprocess of revising this paper, Parnebo and Silverman This idea isgiven strength by Proposition 10which
proved a stronger stability result [U]. shows that the second-order modes are the singular values
26 IEEE TRANSACTIONS
CONTROL,
AUTOMAIIC
ON NO.
VOL. AC-26, 1. FEBRUARY 1981
of the (infinite) Hankel matrix of the discrete-time sub-

ordinate with t,+O. It is well known that the number of
nonzerosingularvalues of this matrix determines the
order of the model.Hence, if u i then the Hankel
matrix is nearly singular. In some model reduction work
[24]-[27], model reduction is based on near singularity of
the Hankel matrix.
V. TOOLS
FOR MODEL
REDUCTION
Model reduction involvesatradeoffbetweenmodel

order and the degree to which the characteristics of the
plant are reflectedby the model.Because the relative
l/i (subsystem which contributes nothing
to impulse response matrix
importance of various plant characteristics ishighlyde- Fig. 5.
pendent upon the application, there can be no universal
model reduction algorithm. The best one can hope for is a
good set of tools and some reliable guidelines for using d o m i n q t subsystem
them. 1
In this section we shall illustrate one way that principal n f
component analysis can be applied to the model reduction 1
problem for the case where the full model is asymptoti-
cally stable. The intention is to convey a way of thinking
about model reduction which lends itself well to the signal
injection viewof minimal realization theory. The results reduced model obtained
are promising but incomplete; future research w lihope-
. ... by severing these
fully refine the viewpoint and sharpen the tools. , . connections
v
In this section a reduced-order model ( A , , B,, C
), will /
be judged by its impulse response matrix. The error im-

pulse response matrix
weak subsystem
He( t ) A CeArB- CReAR‘BR
Fig. 6.
characterizes the error. We shall say that a reduced order
modelis“good” if the largest principal component of
Reduction by Subsystem Elimination
H e ( t ) over [O,co) is“small” compared to the smallest
principal component of CeArB,i.e., if Minimal realization theory says that there is an exact
lower order model if and only if in some coordinate
system the full model can be organized as follows.
{ 1/2
min1 v T ( ~ r n C e A r B B T e A ~ r C ~ d.r )(8)
<< IIL.ll= v}
This viewpointforces an assumption: the map CeA‘B

may have pathologically small or zero components (e.g.,
two rows of C are identical), and we must assume that this
where the subsystem (A,, BR,C,) has the same impulse
situation has been corrected byaprojection onto an
response matrix as the full model. This is illustrated by
output space of appropriate dimension. To makethe
Fig. 5.
condition (8) simpler, it w i be further assumed that an
l
The mainideaunderlying the model reduction work
output coordinate transformation has been applied so that
here is to eliminate any weak subsystem which contributes
imCeArBBTeA“CT =I
little to the impulse response matrix. In other words, we
shalltry to reorganize the full modelwith an internal
coordinate transformation as illustrated byFig. 6. This
in which case (8) may be replaced by implicitly defines the meaning of a “dominant” subsys-
tem: it is one whose impulse response matrix is close (in
the sense discussed in the beginning of this paragraph) to
that of the full model.
It is here that we must face up to a theoretical gap in

the paper. We are not able to use this definition of y ( t )= (CJ1 C,T,) (10)
dominance directly, and we introduce instead the concept
of internal dominance which is natural within the frame-
work developed in Sections II-IV. Toward the end of this where I ~ , represent k x k and ( n - k ) x ( n -k ) iden-
section, a conjecture will be advanced concerning the t i v matrices,respectively. With
relationship between this concept and actual dominance.
Internai Dominance
VI=(;);
'- 1 nO- k )
v-(
injection of impulses at w ( t ) , d l ( t ) ,respectively, gives
Consider now the organization of ( A , B ,C ) shown in
Fig. 7. Loosely speaking, internal dominance of
( A , , B,, C,) means that signal injection tests involving
d l , x1 give much stronger signal components than corre- and it follows that
sponding tests at the second set of terminals d,, x 2 .
Care must be taken here for the same reasons which imZl(t)glr(l)dt= Tl-'(V~W,2Vl)Tl-1T
called for balancing inSection IV. Clearly, if internal
dominance is to relate to actual dominance, it should be
invariant under coordinate transformations of the type im?:(t)?(t)dt= T;(V;K~VJT~.
x l ( t ) = T l 2 . , ( t )x, 2 ( t ) = T 2 2 , ( t ) ,but the responses to tests
are not invariant under such transformations. Again the Similar expressions hold for 2,(t ) , f2(t). ( 5 ,T2 replace
idea of balancing may be used to resolve this problem. VI, TlJ
In matrix form, we mayexpressthemodelinFig. 1 These expressionsindicate tbat weAmay selectTl so that
with arbitrary transformation x l ( t ) = T I 2 , ( t ) , x 2 ( t ) = the principal components of XI(t ) , Y'(t) are aligned with
&z2(t)as follows. equal magnitudes, i.e.,
28 IEEE TRANSACTIONS ON AUTOMATIC CONTROL. VOL. AC-26. NO. 1, FEBRUARY 1981
Definition: The model (10) is said to be “balanced with Hence, weneed only show that internal dominance im-
respect to X,” iff (1 1) holds and “balanced with respect to plies that
X,” iff (12) holds.
If the model ( A , B, C) in Fig. 1 is transformed to (lo), I I ~ P ~ Q T ~ I I F ~ I I ~ ~ ~ Q , ‘ ~
which is balanced’ with respect to both X,, X2, then we To verify this relationship, observe that
shall say that ( A , ByC) in Fig. 1 “has been balanced with
,,
respect to X X2.” We are now prepared to give a defini-
tion of internal dominance.
Definition: Thesystem ( A R ,BR,CR) is an internal&
dominant subsystem iff in some coordinate system the full
model ( A , B, C) can be organized as in Fig. 1 such that,
when balanced with respect to X,, X,
ll~~llF~ll~;llF- and since (a,2,e) is balanced with respect to X X2,
2: =P T Z ~ P =
, QTX~Q,
Internal Dominance and Second-Order Modes
The following result shows that the second-order modes

2; =p,Tx2p2= Q;X~Q,.
reflect the existence of internallv dominant subsystems.
Proposition 12: Thereexists an internally dominant m s means that Z P , = u12,, QTZ
=elw,, ZP, = u22,,
subsystem of order k if and only if Q:Z=e2W2 for some orthogonal matrices U,,W,, U2,
W,. It follows, therefore, that
XP,QTX= U , ~ : W , ; ZP,Q,’X=U2e;W2.
(13)
i= 1 i=k+l n u s I I ~ P ~ Q ~ ~ I I ~ = I I ~ ~ I ~ and ~ ~ ~
the proof is complete.
where u:, 1 Q i < n are the second-order modes.
This result, Proposition 12, suggests a natural first step
Pro08 First sufficiency. Suppose ( A , B , C) is intern-
in model reduction: compute the internally balanced
ally balancedand
A
-
let Z , = diag{a,, u2, . . , u k } , model and inspect the second-order modes. If condition
Z2 2 diag{uk+,;.. , o n } . It canbeeasilyverified that (11) is not satisfied, there is no internally dominant sub-
( A , B, C) is balanced with respect to X,. X 2 with 2,=E,, system. If (1 1) is satisfied, then the subsystem correspond-
2, = E,. Hence, if (1 1) holds, then ing to the first k state variables of the model is internally
dominant and, generically, internally balanced and
asymptotically stable. This subsystem is precisely that
obtained by applying the mechanics of minimal realiza-
To establishnecessity,assume that B”, (4, e)

is bal-
tion (Section 11) using ( t)as a working basis for Xco.
.> (1 (1 F .
anced with respectto X,, X 2 and satisfies (1 Z: (1 > e: \ “ I
Now let us return to the issue of dominanceversus

internal dominance. An internally dominantsubsystem
This model is related to the internally balancedmodel
( A , B, C ) by a coordinate transformation may be tested for dominance by applying principal com-
ponent analysis tothe impulse response error matrix H,(t).
The interesting question iswhetherit is possible for a
dominantsubsystem of order k to existwith no corre-
sponding internally dominant subsystem of order k.
Conjecture: Everydominantsubsystemis internally
dominant.
and since PQT = I, we may write On the Question of Opiimaiity
X2=ZCPQTZ=ZP,QrZ+ZP2QrX.
Wilson [28], [29] and Applevich [30] have given neces-
Since EPIQrZ has rank k , it followsfrom perturbation sary conditions for optimality of the reduced order model,
.(
properties of singular values that i.e.,necessary conditions for JFIIHe(t)lli d t to be a
minimum. It is naturalto question the relationshp be-
1/ 2
tween their work and the model reduction results given
IlzP2QTZII 0;) . here.
i=k+ 1
The first point to be made is that Wilson and Applevich
‘Note that this does n p imply that the model (8) is internally balanced consider the entire class of k th-order r-output, m-input
or that t h e elements of X:, X: are second-order modes. linear systems as possible reduced-order models of order
k. This is a largerg class than that considered here: those

which can be obtained by subsystem elimination. It is not
surprising, then, that the reduced model of order k defined
by the internally balanced model is not optimal. Computa-
x,= [ 0.461
0.04183
0.01456
0.0112
tional experience of this author suggests, however, that if
it corresponds to an internally dominant subsystem, it is and those of eATzCT
are
near optimal.”
In the situation where one is trying to find an optimal 0.899 -0.0991
0.426
0.0225
kth-order model (from the large class) where there is no -0.361
0.824
-0.0738
1 0.43
internally dominant subsystem of order k, it seems rea- v, =
sonable to use the kth-order subsystem defined by the -0.00891
0.927
0.368
0.0721
internally balanced model as a startingpoint in the model -0.0206
0.997
0.0738
0.0122
reduction computations.
47.95
2.96
VI. ILLUSTRATIVE EXAMPLES 0.667
0.244,
To illustrate internal balancing and the proposed model
reduction framework, we shall consider two single input, The underlined entries in V, show that it is essentially
single output examples. The examples are not particularly antidiagonal (the higher derivatives of the output are most
important other than for the purpose of illustration. excitedby an impulse). For V, it isthe opposite; V, is
As stated in Section V, we shall judge potential reduced essentially diagonal (initial conditions involving lower de-
models on the basis of the relative error in the impulse rivatives cause the greatest output response). The second-
response order modes are
(/owHe(t)’ dt)’/* {a~,u~,u~,u~}={0.576,0.147,0.0904,0.0192}.

The internally balanced model” for this system is given
(i m H ( t ) ’ dt)”’ bY
0.3501
-0.3911
1.45
-0.5183
Example I : The firstexample illustrates that phase i=[
-2.195
-1.218
4.753
-1.45
-4.753 -0.6297 1.196
canonical models, often chosen for convenience, may be -0.3911
extremely poorly balanced. -0.3501 -1.218 -1.196 -1.657
iO*_l
0.7729
0 0 1 0 0.3373
0 0 0 j = 0.2523
?=(0.7729 -0.8047
0.3373
-0.2523).
C=(50 15 1 0) Table I summarizes the respective reduced models found
by subsystem elimination.
(s+ 10)(s+5) Example 2: The following example was considered by
C(sI-A)-’B=
(s+ 1+j4.9)(s+ 1-j4.9)(s+ l)(s+2) Wilson [28].
(0 0 0 -150)
The principal component vectors and magnitudes of eA*B
are given by
0 - 0.0643 0 0.998 C=(O 0 0

1)
- 0.0335 0 0.999 0 s+4
c(sI-A)-’B=
v, = (s+ l)(s+3)(s+5)(s+ 10) *
0 0.998 0 0.0643
0.999 0 0.0335 0
For this system the component vectors and magnitudes of
91t is intuitively clear that the classof models of order k which can be eAzBare
achieved by subsystem elimination does not contain all models of order
k. This fact is shown by example in [31].
‘Ohdeed, Example 2 was considered by Wilson who reported a larger ”It is interesting to observethe“absolutevalue”symmetryofthe
residual. This undoubtedly arises fromthenature of his search algo- model. This propertyhasappeared in every single input, single output
rithm. example tested by this author.
30 IEEE TRANSACTIONS ON AUTOMATICCONTROL, VOL. AC-26, NO. 1, FEBRUARY 1981
TABUI
REDUCED ORDERMODELpRo~rnm--ExAMpLE 1
-3 I .03193
-.05014(s-54.4)
1553 .3332 (s+1.357+j1.183)(~+1.357-jl.l83)
- .5974
s+.5183
- 0.8897 -0.0769
0.4495 0.002
0.8385
-0.2951 -0.008 .’
%=[ 0.0706
0.45
0.3073
10
0.8981
0.3065
0.9483,
0.3169
0.01903
0.359X
14.13 I
0.367
2.35 x 10 - 3
2.5 x 10
0 - 0.0809 0 +0.9967
v, = -0.987x 0 0.999
0 0.9967 0 +0.0809
+0.9999 0 0.987X 0
0.174
0.0173
x, = 4.48 X 10 - 3
3.68 X 10 - 3
-
The second-order modes are VII. CONCLUDING REMARKS
There are several points in this paperwhich, in this
author’s opinion, are natural points of departure for fur-
= {0.0159,0.272~ 10-2,0.127x 10-’,0.8x
ther study. Withrespect to model reduction, the signal
and the internally balanced model is injection resultsgiven in Section I1provide one such
point. The results in Sections IV and V were developed
-0.4378 - 1.168
-0.4143
0.05098’ for impulse injection, but this may not be appropriate if,
2=[ 0.3288
-2.835
-3.135
1.168 say, one is modeling in an environment where computer
- 0.4143 2.835 - 12.48 3.249 control is to be used to implement low gain feedback for
-0.05098 0.3288 -3.249 -2.952 , disturbance rejection. Preliminary resultsshow that one
f-0.1181 1 can “tune” the model reduction tools to certain classes of
inputs.
Another point of departure that is probably more im-
I- 0.006875 J portant than the one discussed in the preceding paragraph
e=(-0.1181 -0.1307
-0.05634
0.006875). is Section 111, which describes verygeneraltools for
detecting near linear dependence. The focus of this paper,
Table I1 summarizes properties oflower order models. model reduction as it relates to minimal realization the-
It is interesting to note that the relative error of Wilson’s ory, is t’ery narrow compared to the domainwhere the
second-order model is 0.04097.The fact that his “optimal” tools are applicable.
modelgiveshigherrelative error than the one obtained One point in this paper might better be described as a
here must be due to the details of his search algorithm. “loose end” than a “point of departure.” The relationshp
MOORE: PRINCIPAL COMPONENT ANALYSIS IN LINEAR SYSTEMS
31
TABLE I1
REDUCED ORDERMODELF’ROPERTIES--EXAMPLE 2
actual
order (k)
[
i=k+l o : v [ i=l of]‘
relative
error
transfer function
3 l
007882 1 .03938 1
1 I .1711 I .4321 I .01395
s+ .4378
between subsystem dominance and internal dominance is G. B. Garbow et al., EISPACK Guide, Lecture Notes in Conputer
Science, 2nd d, vol.6.Berlin:Springer-Verlag,1976.
not clear; the latter concept was clearly created for A. P. Dempster, EkmenB of Continuous Multioar’ate A d s i s .
pragmatic reasons. The relationship between general Reading, MA: Addison-Wesley, 1969.
V. C. Klema and A. J. Laub, “The singular value decomposition:
model reduction and reduction by subsystem elimination Its computation and some applications,” ZEEE Trans. Automat.
is not well understood, either. (A discussion of this is Contr., vol. AC-25, pp. 164-176, Apr. 1980.
given in [31].)
As a final comment, this work has left theauthor with a
Oxford Univ. Pres, 1965. ’
J. H.Wilkinson, The Algebraic Eigenvalue Problem. New Yo&:
G. E. Forsythe and C.B. Moler, Computer Solution of Linear
Algebraic @stem. Englewood Cliffs, NJ: Prentice-Hall,1967.
strong bias toward operating directly on signah whenever G. W. Stewart, Z n t r k t i o n to Matrix Computntionr. New York
possible. This means, of course, a biasagainstworking Academic, 1973.
C. L. Lawson and R J. Hanson, SoIving k a s t Squarm Problems.
with secondary objects such as model parameters. There Englewood Cliffs, NJ: Prentice-Hall, 1974.
are at leasttworeasonswhich support this bias. First, G. Strang, Linear Algebra and its Applications.
.. New York
Academic,1976.
limitations of physical hardware (measurement accuracies, G . Forsythe, M.A. Malcolm, and C. B. Moler, Computer Methodr
regions of linearity, etc.) can usually be stated directly in for Mathematical Computations, EnglewoodCliffs, NJ:Prentice
Hall.1977.
terms of signals. Second, tools for coping with multiple B. 6.Moore, “Principal component analysis in nodinear systems:
signals (principal component analysis +singular value de- Preliminaryresults,” in Prm. 1979 ZEEECon$ Decision Contr.,
Fort Lauderdale, FL., Dec. 1979, pp. 1057-1060.
composition) are available. C. T. Mullis and R A. Roberts, “Synthesis of minimum roundoff
noise fixed point digital filters,” ZEEETrans. Circuits *st., vol.
ACKNOWLEDGMENT CAS-23, pp. 551-562, Sept. 1976.
E. G. Lee and L. Marcus, Foundations of Qtimal Control Theoty.
New York: Wiley, 1967.
For discussions during the writing of the original draft R W.Brockett, FiniteDimensional Linear System. New York
Wiley,1970.
of this paper, the author is indebted to A. J. h u b and W. R H. Bartels and G. W. Stewart, “Solutionof the matrix equation
M. Wonham. Of considerable help in the revision process AX+XB= C,” C o m a Ass. Comput. Mach., vol. 15, pp. 820-826,
1972.
were constructive comments by L. M. Silverman, M. G . B. Friedland, “Controllability index based on conditioning num-
Safonov, S . Y. Kung, and the reviewers of the first draft ber,” Trans. ASME, J. @ m i c Syst., Mensurement, Contr., pp.
444-445, Dec. 1975.
[ 11. W. M.Wonham, Linear Multivariable Control: A Geometric Ap-
proach, 2nd ed. New York: Springer-Verlag, 1979.
L. Pernebo and L. M. Silverman, “Balanced systems and model
REFERENCES reduction,” in Proc. 18th ZEEE Conf:
Decision Contr., Fort
Lauderdale, FL, Dec. 12-14, 1979.
[l] B. C. Moore, “Singular value analysis of linearsystems, parts I, J. Rissanen, “Recursiveidentification of linear systems,” SZAM J.
II,” Dep. Elec.Eng., Univ. Toronto, Toronto, Ont.,Syst. Contr. Contr., voL 9, pp. 420-430, 1971.
Rep.7801 and 7802, July 1978; also in Proc.ZEEE Con$ Dec. H. P. Zeiger and J. McEwen, “Approximate linear realizations of
Contr., pp. 66-73. given dimensionvia Ho’s algorithm,” IEEE Trans. Automat. Contr.,
[2] R. E.Kalman, “Irreducible realizations and the degree of a ra- V O ~ .AC-19, p. 153, 1974.
tional matrix,‘‘ SZAM J. AppL Math., vol. 13, no. 2, pp. 520-544, L. S. Ddong, “Numerical aspects of realizationalgorithms in
1965. linear systems theory,” Ph.D. dissertation, Eindhoven Univ. Tech-
[3] -, “Algebraic structure of finite dimensional dynamical sys- nol., Emdhoven, The Netherlands, 1975.
tern,” Proc. Nut. Acad Sci., U.S.A., vol. 54, pp. 1503-1508, 1965. S. Kung, “A new identification and model reduction algorithm via
[4] H., Hotelling, “Analysis of acomplex of statistical variables into singular value decompositions,”in Proc. 12th Ann. Asilomar Con$
pnnclpal components,” J. E&. Pvch., vol.24, pp. 417-441, Circuits, Syst. Comput., Nov. 1978.
498-520,1933. D.A. Wilson, “Optimum solution of model-reduction problem,”
151 -, “SimpK~edcalculation of principal components,” P v c h e Proc. ZEEE, vol. 117, pp. 1161- 1165, 1970.
metrica, vol. 1, pp. 27-35, 1935. _. , “Model reduction for multivariable systems,” Znt. J. Contr,
[6] G.H. Golub and C. Reinsch, “Singular value decomposition and V O ~ .20, pp. 57-64, 1974.
least squares solutions,” Numer. Math., vol. 14, pp. 403-420, 1970. J. D. Applevich, “Approximation of discrete linear systems,” Z n t
32 IEEE TRANSACTIONS ON AUTOMATIC CONTROL, VOL. AC-26.NO. 1, FEBRUARY 1981
J. Con@., vol. 17, pp. 565-575, 1973. electricalengineeringfrom the University of

P. J. M. Pinquet,“Statespace formulation of a class of model Southern California, Los Angela, in 1970 and
reductionmethods,” M. Phil. thesis (supervised by K. Glover), 1973, respectively.
Dep. Eng., Univ. Cambridge, Cambridge, England, 1978. From 1972 to 1975he was on the faculty of
the Department of Computer Science, Louisiana
State University,BatonRouge.Since 1975he
has been with the Department of Electrical En-
gineering,
University of Toronto, Toronto,
Canada, currently as an Associate Professor. His
B m Moore was born in Russellville, AR, on January 8,1947. He interests include multivariable signal analysis,
received the B.S. degree in electrical engineering from Louisiana State computer process control, and numerical linear
University, Baton Rouge, in 1969, and the M.S. and Ph.D.degreesin systems theory.
PrincipalGains and Principal Phases in the

Analysis of Linear Multivariable
Feedback Systems
Abshcrct-ne concepts of principal gain and prineipalphase are intro-tent, it lends itself well to graphical representations which
dn d for linear multivariable systems, and their use in the analysis of
are a way of presenting results that engineers have found
feedback behavior is demonstrated. A sufficient Nyquist-type stability
helpful in the pasf, as shown by the success of the classic
criterion is presented in terms of thesequantities and is used to char-
acterize the r0bwhw-s oftheclosed-loop W i property when the Nyquist and Bode plots. Technically, frequency-response
methods have the advantageof being largely insensitive to
system madel is subjected to a linear perturbation (either multiplicative or
small errors in a system model. Should the actual system
additive) at any point in the feedback configuration. The results preseoted
are less conservative than tbose obtained via the small gain theorem. suffer from large parameter variations, however, or should
the modelbevery inaccurate because of various ap-
I. INTRODUCTION proximations and uncertainties, then the control system
should naturally bedesigned to have a large degree of
N recent years there has been a revival of interest in the stability. The mere presence of feedback is not sufficient
I development and application of frequency-response to gtiarantee the robustness of the stability property, and
techniques to the design and analysis of linear multivaria- so techniques for assessing the relative stability of a multi-
blefeedback control systems(see, for example,[1]-[3]). variable design are required. This problem has been
One of the major reasons for this has been the availability studied by Doyle [4]who characterizes the robustness of
of increasingly inexpensive computers and the consequent the closed-loop stability property in terms of the spectral
increase in all branches of engineering of interactive com- norm of an appropriate frequency-responsematrix. In this
puting facilities to assist in design and analysis. The paper we gve a less conservative characterization of the
frequency-response approach is particularly attractive in robustness of the closed-loop stability property in terms of
this context since, having a strong complex-variable con- a Nyquist-type plot by introducing phase information.
Thephase information usedterms from the polar de-
Manuscript received March 11, 1980; revised September22, 1980. This composition of a complex matrix [5] which is defined as
work was supported by the Science Research Council.
I. Postlethwaite was with the Research and DevelopmentCenter, follows. Analogous to the polar form of a complex num-
General Electric Company, Schenectady,NY 12301. He is now with the ber, a complex matrix T can be represented in the forms
Department of Engineering, Controland Management Systems Division,
Cambridge University, Cambridge, England.
J. M. Edmunds and A. G. J. MacFarlane are with the Department of T = UHR (1.1)
Engineering,Control and ManagementSystemsDivision,Cambridge
University, Cambridge, England. T= HLU (1 -2)
0018-9286~81/0200-0032$00.75 C 1981 IEEE

Principal Component Analysis in Linear Systems Controllability Observability and Model Reduction

Uploaded by

Document Information

Original Description:

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Principal Component Analysis in Linear Systems Controllability Observability and Model Reduction

Uploaded by

Copyright:

Available Formats

IEEE TRANSACTIONS ON AUTOMATICCONTROL. VOL. AC-26, NO.

0018-9286/81/0200-0017$00.75 0,1981 IEEE

This leads to the description of a coordinate system n

111. PRINCIPAL. COhiPOhTENT ANALYSIS

For every instant of time we have X ( ? ) € RnXm,and X, w2L p ( f ) F T ( t ) d t

ing to the respective test signals, and let

'Theresultsof this sectionarevalid with 8 ( t ) selected to be other

from which the first two properties follow easily.The third

l,f211F(f)ll$dt=tr(l ? ( t ) F T ( t ) dt). response m a t r i x F(t)

is an ellipsoid with semi-axes(l/uj)ui, 1 Q i Q n.

is a vector p, IIp 11 = 1, satisfying 2 = vzp.

w2= I T F ( t ) F T ( t )dt= VZ2VT.

(70 A t,, rN = t2), it follows from rectangular approxima- W 2- l im (ON+ADN)(DN +ADN)T

that of DN by no more than Relationships Between (6G,C) and Components of eA'B,

If the diagonal elements of E,, Z, are distinct, then

such that for t, < 6

for all i > 0.Hence,

which corresponds to the set of all initial conditions which

It follows, therefore, that

e=CP,P,. Z1=diag{u,,---,uk}, Z2=diag{uk+l,~-~,un}. This i m p

eArBBTeATIdt= L" eATI C TC e A t d t = Z 2

balance2 system into two interconnected subsystems, i.e.,

U Prooj Stability was proved in Lemma 7; the second

Prelude to Model Reduction

The basic model reduction idea to be pursued in the

of the (infinite) Hankel matrix of the discrete-time sub-

Model reduction involvesatradeoffbetweenmodel

be judged by its impulse response matrix. The error im-

This viewpointforces an assumption: the map CeA‘B

It is here that we must face up to a theoretical gap in

The following result shows that the second-order modes

To establishnecessity,assume that B”, (4, e)

Now let us return to the issue of dominanceversus

k. This is a largerg class than that considered here: those

(/owHe(t)’ dt)’/* {a~,u~,u~,u~}={0.576,0.147,0.0904,0.0192}.

0 - 0.0643 0 0.998 C=(O 0 0

J. Con@., vol. 17, pp. 565-575, 1973. electricalengineeringfrom the University of

PrincipalGains and Principal Phases in the

0018-9286~81/0200-0032$00.75 C 1981 IEEE

You might also like