Professional Documents
Culture Documents
Automatica: Peter Nauclér Torsten Söderström
Automatica: Peter Nauclér Torsten Söderström
Automatica: Peter Nauclér Torsten Söderström
Automatica
journal homepage: www.elsevier.com/locate/automatica
1
yk = A + Ak (x0 + xk ) , k = 1, . . . , M , (6)
= ... . (9)
where Ak C nn
is a disturbance. The way that Ak enters M
the system makes the estimation problem trickier and far more
Before proceeding, an assumption regarding the uncertainty
nontrivial to handle than the case when the presence of sensor
matrix is needed:
noise is the main random effect.
The model (6) is also considered by Li, Lin, Untaroiu and Allaire Assumption 1. The uncertainty matrix is zero mean and Ak is
(2003), but then the perturbations Ak are assumed to be bounded uncorrelated with Al for k = l. It has an associated covariance
and deterministic. Also, an estimate of A is assumed to have matrix
been estimated beforehand. The unbalance determination problem
Ek l = R k,l ,
T
is formulated as a certain convex optimization problem which (10)
includes upper bounds for the the perturbations. In contrast we
where E denotes the expectation operator and k,l is the Kronecker
will in this paper use the model (6) but treat the perturbations
delta function.
as random variables. In addition, no previous estimate of the
influence coefficient matrix is needed. Recall that the reason for the error Ak in the influence matrix is
In the current context it is assumed that sensor noise is that the separator is dismantled and rebuilt between the test using
negligible compared to the system disturbance Ak . In fact, the a new trial mass. Therefore it is reasonable to assume that this type
effect of measurement noise has been checked in previous practical of error is independent from one experiment to another.
studies on separators, Hillstrm (2008), and found to be negligible The fact that the uncertainty is independent between experi-
to as compared to the effects of the varying dynamics from ments implies that
one experiment to another. To the best of our knowledge there
R,M , cov () = IM R , (11)
is no statistical analysis associated with estimation of (1) and
no algorithms proposed that are devoted to a sound statistical where IM is the identity matrix of dimension M and is the
treatment of the disturbing variable. Kronecker product.
In the separator system, equations of the type (1) can be set The vec operator has many useful properties. One that will be
up for several angular frequencies. The measured quantity yk and extensively employed in this context is
the matrix A then become functions of frequency, while x0 and xk
vec (ABC) = CT A vec (B) .
are frequency independent. Still, the number of experiments must (12)
fulfill (2). Thus, the core of the problem is to be able to perform Application of this result on the system equation (1) yields
unbalance estimation at a single frequency, which is considered in
vec (yk ) = yk = (x0 + xk )T In
this paper.
The paper is organized as follows. The next section contains + (x0 + xk )T In k .
(13)
some preliminary mathematical notations and basic results.
Section 3 contains an analysis of a deterministic approach for Let B(x) and C(x) be matrices whose entries are functions of a real
determining the unbalance based on least-squares. The resulting valued vector x. Furthermore, let [x]k be the k-th element of the
estimate is evaluated for the case when there are random errors vector x. Then we define
in the influence matrix, as in the model (6). Section 4 develops B(x) 2 B(x)
a more advanced method, where the structure of the model (6) B(k) = , B(kl) = . (14)
[x]k [x]k [x]l
is exploited, leading to a nonlinear estimation procedure. The
resulting estimate is shown to have much better (statistical) For products of matrices the chain rule applies,
performance. Both statistical and computational aspects of the
estimator are analyzed. A detailed numerical example, based on (BC)(k) = B(k) C + BC(k) , (15)
the separator model above, is treated in Section 5, showing again where the x-argument is dropped for notational convenience. For
superior behavior of treating the disturbance terms Ak as random differentiation of matrix inverses it holds that
variables. Most details of the statistical analysis of the treated (k)
B(k) , B1 = B1 B(k) B1 .
methods are placed in the appendices. (16)
P. Nauclr, T. Sderstrm / Automatica 46 (2010) 17521761 1755
zk = Axk , k = 2, . . . , M (19) and its covariance matrix is for large SNR given by
if the disturbance is neglected. Both zk and xk are known and
cov x0 = A1 C1 R,M C1 A ,
therefore it is straightforward to compute an estimate of the (25)
nuisance variable A. This can be performed in different ways. One
where
option is apply the vec operator to (19), which gives
C1 = (xT0 In ) (In2 + 81 C1b ) 81 C1a ,
zk = xTk In (26)
(20)
(x0 + x2 )T In
and upon stacking the experiments in a tall vector 0
.. ,
C1a = . (27)
T
z = zT1 ... zTM 1 = 81 ,
(21)
0 (x0 + xM ) In
T
where
1
xT2 In
= ... xT0 In ,
81 =
.. . (22)
C1b (28)
. 1
T
xM In
C1 Cn(M 1)n (M 1) , C1b Cn(M 1)n .
2 2
4. Approach 3: nonlinear regression The covariance matrix of the residual term C(x0 )
is denoted by
In this section we derive a loss function that handles the Q (x0 ) = C(x0 )R,M CT (x0 ) R2nM 2nM , (40)
stochastic uncertainty A in a more sophisticated fashion. This
leads to a problem formulation with a loss function that is which is a function of the unknown variable x0 . Similarly to the
nonlinear in x0 . Thus, there exists no closed form solution and a approaches A1 and A2 an estimate of x0 is found by minimizing
numerical search procedure is required. In order to use standard a quadratic criterion. However, in order to make the covariance
optimization routines, the system equation (1) is reformulated as matrix of the estimation error minimal, the equations should be
a real valued problem. This is done by representing the complex weighted with the inverse of Q (Sderstrm & Stoica, 1989). The
valued quantities with their real and imaginary parts separated. criterion then reads
This operation is denoted here with ()
and we let
V (x, ) = y B(x)2 1 . (41)
Q (x)
Re (yk )
[ ] [ ] [ ]
y x0R
yk = kR = , x0 = ,
ykI Im (yk ) x0I Minimization of V with respect to is straightforward. For a fixed
[ ] value of x = x , the minimum is (Sderstrm & Stoica, 1989)
xkR
xk = , R2n1 , (30)
xkI 1 T
= BT (x )Q
(x )B(x )
1
B (x )Q
(x )y
1
(42)
where Rnm denotes the set of real valued n m matrices and
where Re(yk ) and Im(yk ) are the real and imaginary parts of yk , and insertion of (42) into (41) yields a concentrated loss function
respectively. The corresponding convention with subscripts R and 1 2
W (x) = min V (x, ) = y B BT Q 1
BT Q 1
B y
I will be employed in the sequel. Furthermore, the vectorized
1
Q
matrices with separated real and imaginary parts are defined as 1
= yT yT Q 1 T 1
BT Q 1
B B Q B
Re (vec (A))
[ ] [ ]
2
= R = R2n 1 , (31)
I Im (vec (A))
1 T 1
y B BT Q 1
B B Q y
Re vec Ak
= yT Q1/2 I2nM Q 1/2
1
B BT Q 1
[ ]
k = kR =
2
R2n 1 , B
kI
Im vec Ak
BT Q1/2 Q1/2 y
(43)
1
where the dependence on x is dropped for brevity. The concen-
= ... R2n M 1 ,
2
(32) trated loss function (43) can be formulated as
M W (x) = yT Q 1/2
(x)5 (x)Q 1/2
(x)y, (44)
and the corresponding covariance matrices are defined as
where 5 is the orthogonal projector onto the null-space of
1/2
cov
k = R , cov
= R,M = IM R . (33) B T Q and it is given by
1/2 1/2
1
5 = I2nM Q B B T Q 1
BT Q .
A given complex valued equation B (45)
y = Ax, yR + iyI = (AR + iAI ) (xR + ixI ) (34) The parameter estimation problem becomes a two-step proce-
can be reformulated as a real valued relation dure:
[ ] [ ][ ][ ]
yR
=
AR AI xR yR x 0 = min W (x) (46)
yI AI AR xI yI x
1
xTR In xTI In R = BT (x 0 )Q
(x0 )B(x0 )
1 BT (x 0 )Q
(x0 )y.
1
[ ][ ]
(47)
= . (35)
xTI In xTR In I
By the separation into two estimation steps the complexity
Using this fact, the system equation (1) can be rewritten as of the optimization problem has been significantly reduced.
yk = Dk (x0 ) + Dk (x0 )k , k = 1, . . . , M , (36) Minimization of the original loss function (41) would require a
nonlinear search over 2(n2 + n) unknown parameters. By use of
where the concentrated loss function (43), the problem is reduced to a
nonlinear minimization over 2n variables and a simple weighted
(x0R + xkR )T In (x0I + xkI )T In
[ ]
Dk (x0 ) = (37) linear least squares fit to find the remaining 2n2 unknown
(x0I + xkI )T In (x0R + xkR )T In parameters. The second step is only needed if the nuisance variable
2 A is of any importance.
R2n2n . If all experiments are stacked in a tall vector
The optimization problem (46) is often referred to as a variable
T projection problem (Golub & Pereyra, 1973). Such optimization
y = yT1 yT2 . . . yTM = B(x0 ) + C(x0 ),
(38)
problems frequently appear in sensor array processing (Viberg &
where Ottersten, 1991) and in many other applications (Golub & Pereyra,
2003). However, the fact that Q in (44) is a function of the
D1 (x0 ) D1 (x0 )
0
unknown variable is quite uncommon. Notice that Q depends on
B(x0 ) =
.. ..
, C(x0 ) = . . (39)
. the uncertainty covariance matrix R through (40). Therefore, R
DM (x0 ) 0 DM (x0 ) needs to be a priori known or estimated.
P. Nauclr, T. Sderstrm / Automatica 46 (2010) 17521761 1757
W (k) (x 0 ) = 0 (48)
I
a neighborhood close to the true value x = x0 , i.e. x 0 = x0 + x ,
where x is small. Then (see e.g. Ljung (1999) and Sderstrm and
Stoica (1989)),
T W T W
0 = =
x x=x 0 x x=x0 +x
T W 2 W
+ x . (49)
x x=x0 x2 x=x0
R
Remember that W (k) = W /[x]k , where [x]k is the k-th element
Fig. 2. Level curves of the loss function. The true parameter value is x0 = 0.55.
of x, see Section 2. Eq. (49) implies that the estimation error
approximately is
1 The figure shows that at least in this case the loss function is
2W T W
well behaved.
x = , (50)
x2 x In most applications, the covariance matrix Q should be
positive definite. However, situations where it is ill conditioned,
where the derivatives should be evaluated at x = x0 . The accuracy
or rank deficient may occur. Such situations need to be taken care
of the estimate then becomes
of. It can be done using regularization,
1 1
2W T W 2W
cov (x ) = cov . (51) Q = CR,M CT + I2nM ,
x2 x x2
We are now ready to give the main result of this section: where is a small real number.
In order to use approach A3, the statistics of the uncertainty
Lemma 2. The estimation procedure A3 yields must be known or estimated beforehand. The good news is that
only the structure of R and not its absolute value is of importance.
Ex0 = x0 + O EAk 2 (52) A scaling of the covariance matrix will only scale the loss function
(44). Thus, the value of x 0 that minimizes the criterion (44) will
and the accuracy is for large SNR given by remain the same.
When the projection matrix 5 is computed, the effects of
cov x 0 = H 1 GR,M GT H 1 , (53) rounding errors may become significant. Therefore, it should be
computed in a numerically sound way. First, rewrite (45) as
where
(Mahata, 2003)
[H ]kl = 2T BT (k) Q
1/2 1/2 (l)
5 Q B , (k, l) = 1, . . . , 2n, (54)
5 = I2nM MM , M = Q1/2 B (57)
T (k) 1/2 1/2
[G]k,: = 2 B T
Q
Q C, k = 1, . . . , 2n, (55)
and perform the QR factorization
where [G]k,: means row k of the matrix G.
[ ]
R1
Proof. See Appendix B. = Q1 R 1 ,
M = QR = Q1 Q2 (58)
0
4.2. Computational aspects where Q is an orthogonal matrix and R1 is upper triangular. This
gives
The loss function (44) is a nonlinear function of the unknown
variable x0 . Therefore, numerical optimization is needed in order 1
M = RT1 QT1 Q1 R1 1 Q1 ,
RT1 QT1 = R 1 T
(59)
to compute the estimate x 0 . For this purpose, there are some
computational issues that need to be addressed. where the last equality follows from the orthogonality of Q1 . Using
Any optimization routine need to be started with an initial this result, the projection matrix can be written as
guess of the minimizing variable. Instead of just choosing e.g.
1 Q1 = I2nM Q1 Q1 = Q2 Q2 .
x 0 = 0, the optimization is initialized with the outcome from the 5 = I2nM Q1 R1 R 1 T T T
(60)
procedure A2.
It is not easily seen if there exist local minima from the Eq. (60) is less sensitive to rounding errors compared to direct
expression (44). So far, no problems with convergence to computation of (45). In addition, the use of Q2 forces (60) to be
inaccurate estimates have been experienced. If n = 1, it is positive semidefinite. Therefore, the QR decomposition approach
possible to visually depict the level curves of the concentrated loss should be used for the numerical computations.
function. Such an example is shown in Fig. 2. Here, the number of Many optimization routines converge in fewer iterations if in
experiments is M = 7 and each step the analytical value of the gradient of the loss function
is supplied. Such expressions are given in Nauclr (2008), for any
A = 1 + 0.78i, x0 = 0.55, R = cov
k = 103 I2 . (56) x = x0 .
1758 P. Nauclr, T. Sderstrm / Automatica 46 (2010) 17521761
row #
The benefit of adding additional experiments is, however, much 0.5
5
lesser for A1 compared to the other two approaches. Monte 0.4
Carlo simulations studies produces results that for all approaches 6 0.3
are very similar to the results predicted by the theory (such as
Lemmas 1 and 2). 7 0.2
Consider a model of a separator as described in Section 1 and
0.1
shown in Fig. 1. It is a 2-dimensional model with 12 degrees 8
of freedom. The beam at which the separator bowl is attached
is however modeled with the EulerBernoulli partial differential Fig. 3. The structure of R . Each square shows the magnitude of the corresponding
equation. The masses of the bowl and the frames are in the order element in R . The matrix is scaled so that the greatest element have unit
hundreds of kilograms. The stiffnesses are modeled using the magnitude.
concept of hysteretic damping. It means that they are modeled
as complex valued stiffnesses, which is a is a way to introduce dependent. If M > 3, the further experiments are drawn from a
damping in the system. The damping does not change with statistical distribution
frequency, in contrast to viscous damping.
m1 e1 i
[ ] [ ]
[xk ]1
The complex valued stiffnesses are subject to change between xk = = i , k 4, (64)
[xk ]2 m2 e 2
experiments, which leads to the uncertainty term. Between
each experiment, each stiffness varies uniformly 1% around its where
nominal value. The modeling is quite extensive and the details are
by purpose left out in order to make the presentation compact. The mi {30, 40, 50, 60} [g], i U(0, 2 ) [rad]. (65)
system model becomes All values of mi are equally probable and U(0, 2 ) is a discrete
uniform distribution with resolution 1 degree. Not too much effort
yk = A + Ak (x0 + xk ) , k = 1, . . . , M , (61) is put on choosing good candidates for trial masses. Instead, the
masses are changed according to (65) for each new Monte Carlo
where realization. The purpose with this procedure is to diminish the
effect of specific choices of xk and instead put the focus on the
0.0095 0.5335i 0.0036 + 0.1743i
[ ]
A = 104 , [m/(sg)], performance of the estimators.
0.0089 0.4344i 0.0017 + 0.1932i
Monte Carlo simulations are used to evaluate the performance
]
21e37 180 i of the three estimation algorithms. The covariance matrix of the
[
x0 = 111 i , [g]. (62) estimates are computed using 300 realizations for each value of
17e 180
M. The result is shown in Fig. 4. The figure shows that if the
The unit of A depends on the fact that the measured quantity is in true covariance matrix of the uncertainty is known, the nonlinear
[m/s] and the applied masses are in grams [g]. The quantities are estimation method A3 outperforms A1 and A2. Even with the ad
complex valued since they are associated with a magnitude and = I, A3 gives better performance compared to A1
hoc choice R
an angular position. The structure of the covariance matrix R is and A2. Such a choice is probably natural if the statistics of the
depicted in Fig. 3.
uncertainty is completely unknown. In reality, user choices of R
In order to use A3, the statistics of the uncertainty must be
would probably lead to a performance of A3 that lies somewhere in
known or estimated somehow. Two scenarios here are evaluated.
between the curves marked with squares. Thus, better knowledge
The first is that the statistics of the uncertainty is fully known.
about the system at hand is expected to yield better estimates.
The other scenario is that it is completely unknown and therefore
Finally, we show in Fig. 5 a histogram plot of the estimation
= I is employed. The latter choice clearly deviates from the
R 8 error for M = 14. The error of [x 0 ]1 = Re([x0 ]1 ) is shown. It
true covariance matrix as depicted in Fig. 3. Still, the algorithm A3 can be seen that the estimation error is centered around zero and
can be used, but the weighting is no longer optimal. Therefore, it = R is
the distribution is by far most narrow when A3 with R
is not necessarily so that A3 should perform better than the other
employed.
two approaches in this case.
Each trial weight [xk ]i has certain mass mi and angular position
i , relative to a reference position in the bowl. Typically, x1 = 0, 6. Conclusions
since in the first experiment it is decided if balancing is at all
needed. Thus, if balancing is needed, the first experiment is for An estimation problem which is motivated by the application of
unbalance estimation of rotating machinery has been considered.
free. In this example M 3 is required and it is chosen to use
Two different estimation techniques (A1 and A3) are derived and
analyzed with respect to their respective statistical property. In
[ ]
0 30 30
x1 x2 x3 = (63) addition, an approach (A2) based on A1, is discussed and compared
0 30 30
to the other two approaches using a numerical example. The
as the trial masses (in grams) for the first three experiments. This estimation problem is special in the way that the disturbance is
is done to ensure that the trial masses do not become too linearly entering the system equations. Instead of noisy measurements
P. Nauclr, T. Sderstrm / Automatica 46 (2010) 17521761 1759
(x0 + x2 )T In 2
0
.. ..
+ . . (A.2)
0 (x0 + xM )T In M
= 81 + C1b C1a ,
(A.3)
with C1a and C1b as defined in (27) and (28), respectively.
The first step of the estimation procedure is to compute an
estimate of , as in (23)
M = 81 z (A.4)
= + 81 C1b C1a
(A.5)
Fig. 4. Performance of the different estimators for the separator example.
, + , (A.6)
where
= 81 C1b C1a .
(A.7)
Thus, the estimate of A can be written as
A = A + A , (A.8)
where A is formed from , i.e. vec(A ) = .
Next, let
m = Ax0 . (A.9)
The use of (17) implies that
m = y1 = m + A1 x0 , (A.10)
which follows from (1). Eq. (A.10) can be rewritten as
m = m + m , (A.11)
where
Fig. 5. Histogram plot of the estimation error of the real part of [x0 ]1 . The number m = (xT0 In )1 . (A.12)
of realizations is 300. We will next use the series expansion
(ordinary least squares problems) or noisy inputs (errors in (A + A)1 A1 A1 AA1 + (A.13)
variables problems), the main source of uncertainty is here
considered to act on the system parameters in a stochastic fashion. where the second order terms can be omitted if A is small
An example of unbalance estimation of a separator is consid- compared to A in the sense A A.
ered for evaluation of the estimators. Here, it is shown that the ac- Using (24), (A.8), (A.11) and (A.13) the estimate of x0 can be
curacy can be significantly improved if the nonlinear estimation written as
approach A3 is employed. This is particularly so if the number of x0 = (A + A )1 (m + m )
experiments is increased. In such circumstances, it matters very = (A1 A1 A A1 + )(m + m )
much how the estimation is performed. The nonlinear approach
A3 may then perform considerably much better than the linear es- = x0 + A1 (m A x0 ) +
timators A1 and A2. The analytical accuracy expressions could be = x0 + A1 ((xT0 In )1 (xT0 In ) ) +
employed as a basis for experiment design, i.e. the problem of find-
= x0 + A1 xT0 In (1 81 C1b C1a ) +
ing a sequence of xk that minimizes the estimation error. (A.14)
We are grateful to Dr. Lars Hillstrm at Alfa Laval Machine where the approximation in (A.15) follows from the fact that Ak
Dynamics for fruitful discussions and for letting us use the is assumed to be much smaller than A. Therefore, also A is
separator model. much smaller than A. From (A.14) it is concluded that
Appendix A. Proof of Lemma 1 Ex0 = x0 + O EAk 2 (A.16)
The identification procedure is derived while neglecting the
since the error term in (A.15) is linear in , which has zero mean.
effects of Ak . In the presence of this disturbance (19) and (20) For large SNR, (A.15) is a valid approximation. Then, covariance
modify to matrix of x0 becomes
zk = Axk A1 x0 + Ak (x0 + xk )
cov(x0 ) = A1 C1 cov()C1 A , (A.17)
= xTk In xT0 In 1 + (x0 + xk )T In k ,
with C1 given by (26). Furthermore, the covariance matrix of is
k = 2, . . . , M . (A.1) given by (11), which concludes the proof.
1760 P. Nauclr, T. Sderstrm / Automatica 46 (2010) 17521761
Appendix B. Proofs of Lemma 2 The final proposition is related to the second derivatives of P:
T
Proposition 3. + CT Q(k) C.
(B.17)
(k) (k)
P B = PB . (B.7) By use of Propositions 1 and 3 it follows that BT Q(k) B = 0, so the
first term vanishes. Next, it is argued that if Ak A, then
Proof. Application of the chain rule on (B.5) yields T
the term CT Q(k) C is negligible compared to the middle term of
(B.17). It remains to compute
P(k) B + PB(k) = 0 P(k) B = PB(k) , (B.8)
T
which is the desired result. 2T BT Q(k) C
= 2 CT Q(k) B (B.18)
P. Nauclr, T. Sderstrm / Automatica 46 (2010) 17521761 1761
and again Propositions 1 and 3 give Darlow, M. S. (1989). Balancing of high speed machinery. New York, NY: Springer-
Verlag.
T Foiles, W. C, Allaire, P. E., & Gunter, E. J. (1998). Review: rotor balancing. Shock and
W (k) (x0 ) 2
CT Q1 (I2nM BP) B(k) ,
(B.19) Vibration, 5, 325336.
Golub, G. H., & Pereyra, V. (1973). The differentiation of pseudo-inverses and
which can be equivalently written as (B.13). nonlinear least squares problems whose variables separate. SIAM Journal of
Next, we want to find an expression for W (kl) . Differentiation of Numerical Analysis, 10(2), 413432.
Golub, G. H., & Pereyra, V. (2003). Separable nonlinear least squares: the variable
(B.15) with respect to [x]l yields projection method and its applications. Inverse Problems, 19(2), R1R26.
Goodman, T. P. (1964). A least-squares method for computing balance corrections.
W (kl) = yT Q(kl) y, (B.20) Journal of Engineering for Industry, 86(3), 273279.
Hillstrm, L. (2008). Personal communication.
(k) Kang, Y., Chang, Y. P., Tseng, M. H., Tang, P. H., & Chang, Y. F. (2000). A modified
where Q is given by (B.16). If the model (38) for y is inserted, one
approach based on influence coefficient method for balancing crank-shafts.
obtains Journal of Sound and Vibration, 234(2), 277296.
Kang, Y., Tseng, M.-H., Wang, S.-M., Chiang, C.-P., & Wang, C.-C. (2003). An accuracy
W (kl) = T BT Q(kl) B + 2T BT Q(kl) C
improvement for balancing crankshafts. Mechanism and Machine Theory, 38,
14491467.
T
+ CT Q(kl) C T BT Q(kl) B. (B.21) Larsson, L. O. (1976). On the determination of the influence coefficients in rotor
balancing, using linear regression analysis. In Vibrations in rotating machinery,
Cambridge, UK, September (pp. 9397).
The approximation follows from that the term that is quadratic in Li, G., Lin, Z., Untaroiu, C., & Allaire, P. E. (2003). Balancing of high-speed rotating
is nonzero, and the assumption Ak A. machinery using convex optimization. In IEEE conference on decision and control,
Differentiation of (B.16) with respect to [x]l yields Maui, Hawaii, USA, December (pp. 43514356).
Ljung, L. (1999). System identification (2nd edition). Upper Saddle River, NJ, USA:
Q(kl) = Q(kl)
(I2nM BP) Q(k)
B(l) P + BP(l) PrenticeHall.
Lund, J. W., & Tonnesen, J. (1972). Analysis and experiments on multi-plane
l)
(k)
B P + BP(k)
balancing of a flexible rotor. Journal of Engineering for Industry, 94(1), 233242.
Q(
Mahata, K. (2003). Estimation using low rank signal models. Ph.D. thesis, Department
(k) (l) of Information Technology, Uppsala University, Uppsala, Sweden.
Q 1
B P + B(l) P(k) + BP(kl) .
(B.22) Nauclr, P. (2008). Estimation and control of resonant systems with stochastic
disturbances. Ph.D. thesis, Department of Information Technology, Faculty of
Computation of (B.21) with application of Propositions 14, B(kl) = Science and Technology, Uppsala University, Uppsala, Sweden.
Schneider, H. (1991). Balancing technology. Technical report, Carl Schenck AG.
0, and evaluation at x = x0 yields Sinha, J. K., Friswell, M. I., & Lees, A. W. (2002). The identification of the unbalance
and the foundation model of a flexible rotating machine from a single run-down.
W (kl) T BT Q(kl) B Mechanical Systems and Signal Processing, 16(23), 255271.
k)
(l) Sinha, J. K., Lees, A. W., & Friswell, M. I. (2004). Estimating unbalance and
= T 0 BT Q( B BPB(l)
misalignment of a flexible rotating machine from a single run-down. Journal
l)
(k) of Sound and Vibration, 272, 967989.
BT Q( B BPB(k)
Sderstrm, T., & Stoica, P. (1989). System identification. Hemel Hempstead, United
(k) (l) Kingdom: Prentice Hall International.
BT Q 1
B PB B(l) PB(k) BT Q 1 (kl)
B
BP Tiwari, R., & Chakravarthy, V. (2006). Simultaneous identification of residual
unbalances and bearing dynamic parameters from impulse responses of rotor-
= T BT (k) Q (I2nM BP) B
1 (l)
bearing systems. Mechanical Systems and Signal Processing, 20, 15901614.
Viberg, M., & Ottersten, B. (1991). Sensor array processing based on subspace fitting.
+ BT (l) Q
(I2nM BP) B
1 (k)
(B.23) IEEE Transactions on Signal Processing, 39(5), 11101121.
Zhou, S., & Shi, J. (2001). Active balancing and vibration control of rotating
machinery: a survey. The Shock and Vibration Digest, 33(5), 361371.
= 2T BT (k) Q (l)
(I2nM BP) B ,
1
(B.24)
where (B.23) follows from Proposition 4 and some algebra.
Peter Naucltr received the M.Sc degree in engineering
(I2nM BP) is a symmetric matrix. The
Eq. (B.24) follows since Q 1
physics and the Ph.D. degree in electrical engineering with
expression (B.24) can be equivalently written as (B.14). specialization in automatic control from Uppsala Univer-
sity, Uppsala, Sweden, in 2003 and 2008, respectively. His
Proof of Lemma 2. Using (49) and Lemma 3, the estimation error doctorate work mainly concerned modeling and control
can be written as of mechanical systems with stochastic disturbances. Since
2008 he is with Ericsson AB, Stockholm, Sweden, where he
works with radio access technologies for the fourth gener-
x = H 1 G
+ O
2 , (B.25) ation telecommunication systems.