Professional Documents
Culture Documents
Nonlinear Model Validation Using Correlation Tests - II
Nonlinear Model Validation Using Correlation Tests - II
To cite this article: S. A. BILLINGS & Q. M. ZHU (1994) Nonlinear model validation using correlation tests, International
Journal of Control, 60:6, 1107-1120, DOI: 10.1080/00207179408921513
Taylor & Francis makes every effort to ensure the accuracy of all the information (the “Content”) contained
in the publications on our platform. However, Taylor & Francis, our agents, and our licensors make no
representations or warranties whatsoever as to the accuracy, completeness, or suitability for any purpose of the
Content. Any opinions and views expressed in this publication are the opinions and views of the authors, and
are not the views of or endorsed by Taylor & Francis. The accuracy of the Content should not be relied upon and
should be independently verified with primary sources of information. Taylor and Francis shall not be liable for
any losses, actions, claims, proceedings, demands, costs, expenses, damages, and other liabilities whatsoever
or howsoever caused arising directly or indirectly in connection with, in relation to or arising out of the use of
the Content.
This article may be used for research, teaching, and private study purposes. Any substantial or systematic
reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in any
form to anyone is expressly forbidden. Terms & Conditions of access and use can be found at http://
www.tandfonline.com/page/terms-and-conditions
INT. J. CONTROL, 1994, VOL. 60, No.6, 1107-1120
New higher order correlation tests which use model residuals combined with
system inputs and outputs are presented to check the validity of a general class
of nonlinear models. The new method is illustrated by testing both simple and
complex nonlinear system models.
1. Introduction
Downloaded by [University of Bristol] at 07:45 22 February 2015
by e(t) with zero mean and finite variance. Therefore, correlation-based model
validity tests are used to check if
e(t) = e(t) (2.5)
This can be done by testing if all the correlation functions are within the preset
confidence intervals. When (2.5) is true, Bohlin's test (1971) shows that
N-r
2: (e(t) - e)(e(t - r) - e)
1=1
<Pu( r) = -'---=
N
---,-,------
2: (e(t) - e)2
1=1
{I, r=O
Downloaded by [University of Bristol] at 07:45 22 February 2015
=
0, otherwise
N-r
2: (u(t) - u)(e(t - r) - e)
1=1
<Pur(r) = N N 1/2 = 0, Vr (2.6)
[(~I(U(t) - U)2)(~?(t) - ef )]
where <Pu( r) and <Pur( r) are the estimated normalized residual auto-correlation
function and cross-correlation function between the input and residuals respect-.
ively. The overbar denotes the time average operation to give
_ 1 N
e = - 2:e(t)
N'~l
(2.7)
_ 1 N
U =- 2:u(t)
N'~l
For large N the correlation function estimates given in (2.6) are asymptotically
normal with zero mean and finite variance (Soderstrom and Stoica 1990), the
standard deviations are 1/'1/N and the 95% confidence limits are therefore
approximately 1·95/'1/ N. To illustrate the method, consider three typical res-
iduals, from an ARMAX model, as examples
<PEIEI(') =
I,
p,
r
,=
0
>
1
j (2.9)
f 0, otherwise
<PUE I(') = 0, 'iT
where p = 1/2.
j
For the second example
<PE,E,( r) = g: ,= 0
otherwise
(2.10)
, =1
<PUE,( r) = {b: otherwise
Downloaded by [University of Bristol] at 07:45 22 February 2015
where
1
? )1/1
(1 +-;?-
is a constant.
In summary, Bohlin's and Box and Jenkins' approach is to determine
whether there is evidence of an inadequate model and also to suggest ways in
which the model may be modified or improved.
For nonlinear models the validity tests are not as simple as in the linear case
because nonlinear terms can exist in the residuals. Consider a SISO nonlinear
discrete model
yet) = In(y'-I, U'-I, £H) + £(/) (2.11)
where the output, input and residual vectors have been defined in (2.2).
However, the nonlinear function In(') will not, in general, satisfy the superposi-
tion and homogeneity principle. A typical parametric expression for (2.11) is the
polynomial NARMAX (Nonlinear AutoRegressive Moving Average with eX-
ogenous input) model (Billings and Leontaritis 1981, Leontaritis and Billings
1985)
N
yet) = 2.CYjp/t) + Bet) (2.12)
j=1
where Pj(t) denotes nonlinear terms such as PI(t) = yet - I)u(t - 1), P2(t) =
u 2(t)£(t - 1) and P3(t) = £3(t - 3).
Simple nonlinear residuals may take the form of terms such as
£](t) = e(t - 2)e(t - 5) + e(t)
Bit) = u(t - l)u(t - 3) + e(t)
(2.13)
£3(/) = u(t - l)e(t - 2) + e(t)
£4(t) = yet - I)e(1 - 2) + e(t)
and so on. The simple ACF and CCF test are now no longer sufficient (Billings
and Voon 1983) and new tests have to be developed. One approach would be to
use multidimensional correlation functions such as <PEEE('10 '2)' <PuuE('I, '2) and
<PuEE('1o '2) to check for £I(t), £2(t), and £3(t) respectively in (2.13). This
Nonlinear model validation using correlation tests 1111
- £2)2
1=1
(2.15)
where
(2.16)
I, r = 1 )
ct>€,€,( r) ., { 0, otherwise (2.17)
ct>u'€'( r) - 0, Vi
This follows because for large N and assuming ergodicity
ct>€,€,(i) = £[£2'(/)£2'(t - i)]}
(2.18)
ct>u'€'(i) = £[u 2'(t)£2'(t - i)]
where £[ .] is the expectation operator and
To illustrate these tests consider the first two examples in (2.13). For the first
case
Downloaded by [University of Bristol] at 07:45 22 February 2015
1, i=O
a '(i) - PI> i=2
'A../"£,£, - Pz = PI'
<Pu'£~( i) =
1 0,
0, Vi
i =5
otherwise
(2.21)
where
_ e4 ? _ (e 2 ) 3
Pl- O<lpll<I and O<IPzI<I
[(e4)2 + 4(?)3 + e 4 _ (?)4 _ (e 2)2jl/2 '
are constants. For the second example
r = I,
22 ()
<P£,£, { 0,
r = °
otherwise
1, i=O (2.22)
A.. , '(i) = PI, r = 1
wu £, Pz = PI> i=3
10, otherwise
where
is a constant.
The tests correctly identify the missing terms in both cases since either
"* ° "*
<Pt:',,'(r) 0, i"* or <Pu't:'( r) 0, V r. The two tests can also be used to
diagnose omitted linear model terms in the residuals. Applying the tests to the
first two linear residual examples of (2.8) gives
<Pt:~t:~( r) = {~;, ~ : ~
0, otherwise
) (2.23)
<Pu'Z;~(i) = 0, Vi
where PI = (e 4 - (?)2)/2(e'l + (e 2f ) is a constant. For the second example
Nonlinear model validation using correlation tests 1113
¢dd( T) = {~: T = 0
otherwise )
(2.24)
= {~:'
T =1
¢u'd( T) otherwise
where
1
PI = [1 + ? - (ezy + 47?] 1/1
u4 _ (7)2
is a constan t.
The disadvantage of these tests is that the higher order correlation functions
can sometimes exhibit less power when the noise and input variances are small
Downloaded by [University of Bristol] at 07:45 22 February 2015
because the fourth and higher moments become small. Billings and Voon (1986)
used a combination of five tests to try to increase the discriminatory power. An
alternative solution may be to introduce delayed output terms to form more
powerful higher order correlation functions while maintaining the computational
simplicity. This is presented in the following section.
(3.1)
where
(\'(t) = y(t)E(t) )
iY = yE = ~ ~ y(t)E(t). (3.2)
N t= !
In the ideal case where the residuals are zero mean and uncorrelated with all
linear and nonlinear combinations of past inputs and outputs these tests yield
(
¢a.', r) = k 2'
{ 0,
T = 0
otherwise
)
(3.3)
¢aU"( T) - 0, Vi
1114 S. A. Billings and Q. M. Zhu
( ~l(y(t)€(t) - y€)2
(3.6)
(3.7)
and
Nonlinear model validation using correlation tests 1115
1 N
/-I, =- La(t)w,(t) (3.10)
N ,= !
where
aCt) = [aCt - 1), a(t - 2), ... , a(t - sW
aCt) = y(t)E(t)
1 N
- L~(t) = r2 (3.11)
N'=1
Assuming all odd order moments of the random variable E(t) are zero gives
1 [N
E[/-I,] = -E La(t)w,(t) = -
] IN
L
E[a(t)]E[w,(t)] = 0SX!
Downloaded by [University of Bristol] at 07:45 22 February 2015
N 1=1 N,~l
E [ /-1,/-1,T] -_ 1 2I
-r (3.12)
sx s
N
From the central limit theorem (Bowker and Lieberman 1972) the random
vector is asymptotically normal with zero mean and variance given by (3.12).
The random vector /-I, can be normalized as
~
_ VN
- r/-l,Is x s (3.13)
which is asymptotically zero mean with unit variance. Then the variable
T
d ,-
- T -
~,~, -
N/-I,/-I,
~
(3.14)
(3.17)
1116 S. A. Billings and Q. M. Zhu
The previous X- test (Leontaritis and Billings 1985) for nonlinear models
suffered from the necessity to test several possible missing model terms before
any confidence that the model had been properly validated could be established.
The new test gives a rule to choose test terms y(t)e(t), u 2(t), e2( t) only.
4. Applications
Three simulated systems were selected to demonstrate the new model
validity test methods. Each data sequence was of length 1000, and the input for
the first two simulated systems was a uniformly distributed uncorrelated se-
quence with zero mean and variance 1·33 and the noise sequence was an
uncorrelated normally distributed sequence with zero mean and variance 0·36.
Example 1: This simulated system consisted of the model
Downloaded by [University of Bristol] at 07:45 22 February 2015
20 - 1 10
-1. -1.
(a)
20 - 1 10
20 - 1 10
20 - 1 10
Downloaded by [University of Bristol] at 07:45 22 February 2015
Once again, the linear ACF and CCF tests, Fig. 2(a), fail to detect the nonlinear
residuals. The new tests <P(y£)£2(r) and <p(y£).,,(r) in Fig. 2(b) are plotted to show
that <P(y£)<2( r) and <p(y<)u2( r) are outside confidence intervals at r = 5, 2 respect-
ively.
Example 3: Consider the nonlinear rational model
yet - 1) + u(t - 1) + yet - l)u(t - 1) + yet - I)e(t - 1) ( )
yt=
() +et
1 + y2(t - 1) + u(t - l)e(t - 1)
(4.5)
which is of a form that often appears in the chemical and related fields (Ford et
al. 1989, Dimitrov and Kamenski 1991) and was used by Narendra and
Parthasarathy (1990) to study the approximation of severe nonlinearities using
neural networks.
Simulated output data were generated using the model of (4.5) with an
uncorrelated uniformly distributed (zero mean and variance of 1·0) input
sequence and an uncorrelated normally distributed (zero mean and variance of
0,01) noise sequence. Only the input and output data were available for
identification. The resultant identification using the nonlinear rational model
identification algorithm (Zhu and Billings 1993, Billings and Zhu 1994), which
included model term selection and associated parameter estimation, produced
the results in the Table. Figure 3(a) and 3(b) show the measured output, the
one-step ahead model predictions and residuals respectively and Figs 4(a) and
4(b) show the linear ACF and CCF tests and the new tests respectively.
S. Conclusions
The important problem of validating nonlinear models has been investigated
and two new correlation-based tests have been proposed. The new tests are
based on correlation functions defined in terms of the system outputs and these
1118 S. A. Billings and Q. M. Zhu
0,25[+ 1
0.25[+ 1
~ ~
-,20[+ I 1000
(b)
0.3&E+ 0 ,
-',
~\ ~ I ~~j~~.~~~ ~ ~
39[+~---------·--------------'-~
(c)
Figure 3. System outputs, model predictions and residuals: (a) measured outputs; (b) one
step ahead model predictions; (c) residuals.
20 - 1 10
-1. -1.
(a)
20 - 1 10
Downloaded by [University of Bristol] at 07:45 22 February 2015
Figure 4. Validation tests for example three: (a) ACF and CCF tests; (b) new tests.
ACKNOWLEDGMENT
REFERENCES
AKAIKE, H., 1974, A new look at the statistical model identification. IEEE Transactions on
Automatic Control, 19, 716-723.
BAGLIVO, J., OLIVIER, D., and PAGANO, M., 1992, Methods for exact goodness of fit tests.
Journal of the American Statistical Association, 87, 464-469.
BILLINGS, S. A., and I. J. LEONTARITIS, I. J., 1981, Identification of nonlinear systems using
parameter estimation techniques. lEE Conference Proceedings on Control and its
Applications, University of Warwick, Coventry, U.K., pp. 183-187.
BILLINGS, S. A., and VOON, W. S. F., 1983, Structure detection and model validity tests in the
identification of nonlinear systems. Proceedings of the Institution of Electrical Engi-
neers, Pt D, 130, 193-199; 1986, Correlation based model validity tests for nonlinear
models. International Journal of Control, 44, 235-244.
BILLINGS, S. A., and ZHU, Q. M., 1994, A structure detection algorithm for nonlinear rational
models. International Journal of Control, 59, 1439-1463.
1120 Nonlinear model validation using correlation tests
LEONTARITIS, l. J., and BILLINGS, S. A., 1985, Input-output parametric models for nonlinear
systems, Part I and II. International Journal of Control , 41, 303-328.329-344; 1987,
Model selection and validation methods for nonlinear systems. International Journal of
Control; 45, 311-341.
NARENDRA, K. S., and K. PARTHASARATHY, K., 1990. Identification and control of dynamical
systems using neural networks. IEEE Transactions on Neural Networks, 1, 4-27.
PEARSON, K., 1900, On the criterion that a given system of deviations from the probable in the
case of a correlated system of variables is such that it can be reasonably supposed to
have arisen from random sampling. Philosophical Magazine, 50, 157-172.
SODERSTROM, T., and STO'CA, P., 1900, On covariance function tests used in system
identification. Automarica, 26, 125-133.
WADSWORTH, G. P., and BRYAN, J. G .• 1974. Applications of Probability and Random
Variables (New York: McGraw-Hill).
ZI-IU, Q. M., and BILLINGS, S. A., 1993, Parameter estimation for stochastic nonlinear rational
models. lntemational Journal of Control, 57, 309-333.