Bayesian Updating of Structural Models and Reliability Using Markov Chain Monte Carlo Simulation

Bayesian Updating of Structural Models and Reliability using
Markov Chain Monte Carlo Simulation

James L. Beck1 and Siu-Kui Au2
Downloaded from ascelibrary.org by "Indian Institute of Science, Bangalore" on 09/18/21. Copyright ASCE. For personal use only; all rights reserved.
Abstract: In a full Bayesian probabilistic framework for ‘‘robust’’ system identification, structural response predictions and performance
reliability are updated using structural test data D by considering the predictions of a whole set of possible structural models that are
weighted by their updated probability. This involves integrating h(␪)p(␪兩 D) over the whole parameter space, where ␪ is a parameter
vector defining each model within the set of possible models of the structure, h(␪) is a model prediction of a response quantity of interest,
and p(␪兩 D) is the updated probability density for ␪, which provides a measure of how plausible each model is given the data D. The
evaluation of this integral is difficult because the dimension of the parameter space is usually too large for direct numerical integration and
p(␪兩 D) is concentrated in a small region in the parameter space and only known up to a scaling constant. An adaptive Markov chain
Monte Carlo simulation approach is proposed to evaluate the desired integral that is based on the Metropolis-Hastings algorithm and a
concept similar to simulated annealing. By carrying out a series of Markov chain simulations with limiting stationary distributions equal
to a sequence of intermediate probability densities that converge on p(␪兩 D), the region of concentration of p(␪兩 D) is gradually portrayed.
The Markov chain samples are used to estimate the desired integral by statistical averaging. The method is illustrated using simulated
dynamic test data to update the robust response variance and reliability of a moment-resisting frame for two cases: one where the model
is only locally identifiable based on the data and the other where it is unidentifiable.
DOI: 10.1061/共ASCE兲0733-9399共2002兲128:4共380兲
CE Database keywords: Structural models; Markov chain; Reliability; Bayesian analysis; Monte Carlo method; Robust; Simulation.
Introduction flexibility; errors because of the spatial discretization of distrib-

uted structural systems and loads; variation of material properties
There has been increasing interest in using a Bayesian statistical during manufacture; and uncertainties introduced by the construc-
framework 共Box and Tiao 1992; Sivia 1996兲 to treat the challeng- tion process. Because of these modeling errors and uncertainties,
ing problem of updating a structural model based on test data model updating is best tackled as a Bayesian statistical inference
from a structure 共Beck and Katafygiotis 1998; Engelund and So- problem 共Beck 1989兲.
rensen 1998; Geyskens et al. 1998; Katafygiotis and Beck 1998; The paper by Collins et al. 共1974兲 is a pioneering effort in
Katafygiotis et al. 1998; Singhal and Kiremidjian 1998; Zheng Bayesian structural model updating using identified modal param-
and Ellingwood 1998; Enright and Frangopol 1999; Papadimi- eters. A more rigorous and comprehensive Bayesian framework
triou et al. 2000; Papadimitriou et al. 2001; Vanik et al. 2000兲. for model updating is described by Beck and Katafygiotis 共1998兲.
The need for model updating arises because there are always They define the concept of system identifiability and show how to
modeling errors associated with constructing a theoretical model treat the ill-conditioning and lack of identifiability that often char-
of the behavior of a structure, and this leads to uncertain accuracy
acterizes the updating problem. Using this methodology, one does
in the predicted response. There are many sources of modeling
not identify a single ‘‘best’’ model but instead updates a probabil-
errors, such as inexact modeling of material constitutive behavior
ity distribution over a specified set of structural models and a
and boundary conditions 共e.g., there are no perfectly pinned or
specified set of prediction-error probability models for the uncer-
fixed joints兲; unmodeled features such as in-plane diaphragm
tain error between the actual structural response and the corre-
flexibility, neglected ‘‘nonstructural’’ components and foundation
sponding model predictions.
1
Model updating is not an end in itself; it is usually motivated
Professor of Applied Mechanics and Civil Engineering, Division of
by the desire to improve the accuracy of predictions of the system
Engineering and Applied Science, California Institute of Technology,
Pasadena, CA 91125 共corresponding author兲. response or its current condition or ‘‘health.’’ In this paper, we
2
Assistant Professor, School of Civil and Environmental Engineering, focus on updating the ‘‘robust’’ response moments and the ‘‘ro-
Nanyang Technological Univ., Singapore 639798; formerly, PhD candi- bust’’ reliability that a specified system performance is achieved
date, Division of Engineering and Applied Science, California Institute of under some set of possible excitations. As in robust structural
Technology, Pasadena, CA 91125. control, the use of the term ‘‘robust’’ here implies that a set of
Note. Associate Editor: Stein Sture. Discussion open until September possible structural models is used to represent the structure rather
1, 2002. Separate discussions must be submitted for individual papers. To than a single nominal model 共Papadimitriou et al. 2001兲.
extend the closing date by one month, a written request must be filed with
The problem of interest is defined as follows. Let D denote
the ASCE Managing Editor. The manuscript for this paper was submitted
for review and possible publication on October 13, 2000; approved on some test data from a structure and consider a set of possible
July 18, 2001 . This paper is part of the Journal of Engineering Me- structural models specified by M that has been chosen to repre-
chanics, Vol. 128, No. 4, April 1, 2002. ©ASCE, ISSN 0733-9399/ sent the behavior of the structure. Here, M specifies the modeling
2002/4-380–391/$8.00⫹$.50 per page. assumptions used in the analysis 共both structural and probabilis-
380 / JOURNAL OF ENGINEERING MECHANICS / APRIL 2002
J. Eng. Mech., 2002, 128(4): 380-391

tic兲. The essence of Bayesian statistical system identification is and so methods for evaluating R D are differentiated according to
that it gives a rigorous method of using D to update an initial the shape of p D , which depends on the information that the avail-
description of how plausible each model is as a representation of able data D produce about the model parameters ␪.
the structure, that is, the information in D modifies the knowledge The characterization of p D has been studied for model identi-
about the relative plausibilities of the different models specified fication of structures using as dynamic data the measured time
by M 共Beck and Katafygiotis 1998兲. The plausibility of a model histories of response and excitation 共Beck and Katafygiotis 1998;
is quantified by a probability distribution over the model param- Katafygiotis and Beck 1998; Katafygiotis et al. 1998兲, but the
eters ␪⫽ 关 ␪ 1 ,...,␪ n 兴 that define a model within the set of possible results can be applied in general. For a large amount of data 共e.g.,
models 共Cox 1961; Jaynes 1978兲. The updated probability distri- the number of data points M in the measured time histories is
bution p D(␪)⫽p(␪兩 D,M) is obtained using Bayes’ theorem large兲, p D is concentrated in the neighborhood N of a lower-
dimensional manifold S in the parameter space on which p(D兩 ␪)
p D共 ␪兲 ⫽p 共 D兩 ␪,M兲 p 0 共 ␪兩 M兲 /p 共 D兩 M兲 ⫽cp 共 D兩 ␪,M兲 p 0 共 ␪兩 M兲 is globally maximized. The thickness of N around S is of the

(1)
order of ⑀⫽1/冑M . The characterization of p D can be made ac-
where p 0 (␪兩 M)⫽initial 共‘‘prior’’兲 probability distribution speci- cording to the dimension of S, which depends on the amount of
fied by M, which reflects the relative plausibility of each model data available compared to the information to be extracted from
before utilizing the data D, and c ⫺1 ⫽p(D兩 M) them. In the ‘‘identifiable’’ case where the number of model pa-
⫽ 兰 p(D兩 ␪,M)p 0 (␪兩 M)d␪⫽normalizing constant. The term rameters is less than or equal to the number of ‘‘effective con-
p(D兩 ␪,M) gives the probability of obtaining the data D based on straints’’ from the data D, the dimension of S is zero, that is, the
a model specified by the model parameters ␪. It is formulated by updated PDF is concentrated in the close neighborhood N of a
using a probability model for the prediction error relating the finite number of isolated points, referred to as ‘‘optimal param-
‘‘output’’ of the model specified by ␪ to the actual measured eter’’ points. In this case, p D can be well approximated by a
structural output D 共Beck and Katafygiotis 1998兲. The condition- weighted sum of Gaussian PDFs with spread of O(⑀) centered at
ing on M is included here to stress the fact that all probabilities the optimal parameter points 共Beck and Katafygiotis 1998兲. Con-
involved in model updating are always conditional on the choice sequently, an asymptotic approximation of R D can be expressed
of the modeling assumptions. as a weighted sum of values of h evaluated at the optimal param-
Better response predictions can be made by utilizing the infor- eter points, and the problem of calculating R D in the identifiable
mation in the data D through the use of p D given by Eq. 共1兲. For case is reduced to finding the optimal parameter points and their
example, if h(␪) is a structural response quantity of interest, then associated probability weights. This leads to a nonconvex global
many useful performance measures can be formulated as optimization problem, which is not trivial to solve.
In the locally identifiable case 共Katafygiotis and Beck 1998兲,
R D⫽ 冕 h 共 ␪兲 p D共 ␪兲 d␪ (2)
there are multiple optimal parameter points and ‘‘global’’ optimi-
zation algorithms must be used to find them 共Törn and Zilinskas
where the theorem of total probability is used. One example is 1989; Yang and Beck 1998兲. The computational effort is often
h(␪)⫽Var关 y 兩 ␪,M兴 , the conditional variance of a given structural much greater than is needed in a local optimization problem, es-
response y computed with specified model parameters ␪ when the pecially when the dimension of the parameter space is not small
structure is subjected to an uncertain excitation. In this case, R D and the number of optimal parameters is unknown, as is usually
in Eq. 共2兲 is the variance Var关 y 兩 D,M兴 of the response given the the case.
set of structural models M and the data D, which addresses the In the ‘‘unidentifiable’’ case where the number of model pa-
uncertainty in which model best represents the structure while rameters is larger than the number of effective data constraints,
incorporating information from the data D. Another example is the dimension of the manifold S is greater than zero, and there
h(␪)⫽ P(F 兩 ␪,M), the probability of a failure event F of the exists a continuum of optimal parameter points lying on S which
structure, predicted using model parameters ␪; then R D is the give the same global maximum value of p(D兩 ␪). Deterministic
robust failure probability P(F 兩 D,M) which incorporates knowl- search methods for computing an asymptotic approximation of
edge from M and updated information from D 共Papadimitriou R D have been developed 共Katafygiotis et al. 1998兲, which dis-
et al. 2001兲. cretize the manifold S using a finite number of representative
Suppose the model parameters ␪ belong to a continuous pa- points and then approximate p D as a discrete probability mass
rameter space of dimension 3 or more, then the evaluation of R D distributed among the representative points. The main computa-
by direct numerical integration is inefficient or infeasible. Also, tional effort is spent on locating the representative points on S,
the updated probability density function 共PDF兲 p D is known only which requires a series of local optimizations in the parameter
up to a multiplicative constant because the normalizing constant c space. The representative points have to be located over the re-
in Eq. 共1兲 is given by an integral that is also difficult to evaluate. gion of S where the prior PDF p 0 is significant so that the con-
In fact, in terms of quantities whose values can be computed tributions from different parts of S, and hence N, are accounted
explicitly for a given ␪, R D can be expressed as 共leaving depen- for. Consequently, the complexity and computational effort are
dence on M implicit兲 expected to grow with the dimension of the manifold in a similar
manner to that of direct numerical integration, making the method
兰 h 共 ␪兲 p 共 D兩 ␪兲 p 0 共 ␪兲 d␪ practical only when this dimension is no more than 2.
R D⫽ (3) This paper presents a Markov chain simulation method to
兰 p 共 D兩 ␪兲 p 0 共 ␪兲 d␪
evaluate the integral in Eq. 共2兲 for R D without the need for opti-
and so its evaluation involves two multidimensional integrals. Ad- mization to find the manifold S. It is based on the Metropolis-
ditional difficulties come from the nature of the updated PDF p D , Hastings 共MH兲 algorithm and a concept similar to simulated an-
which is usually concentrated in a small volume of the parameter nealing 共Fishman 1996兲 to gain information about the manifold in
space. In practical applications, the variation of p D in the param- an iterative manner. By carrying out a series of Markov chain
eter space is more dominant than that of the response quantity h, simulations with limiting stationary distributions equal to a se-
JOURNAL OF ENGINEERING MECHANICS / APRIL 2002 / 381
J. Eng. Mech., 2002, 128(4): 380-391

quence of intermediate PDFs that converge on p D(␪), the region equal to the target PDF. In other words, the PDF of the Markov
N of significant probability density of p D is gradually portrayed. chain sample ␪k simulated at the kth Markov step tends to the
The Markov chain samples can be used to estimate R D by statis- target PDF as k→⬁. The Markov chain samples, which are de-
tical averaging. pendent in general, can be used for statistical averaging as if they
The difficulties in applying simulation methods to evaluate the were independent, although with some reduction of efficiency in
integral in Eq. 共2兲 for R D are discussed in the next section, fol- the estimator.
lowed by the proposed Markov chain simulation method. The Let p * (␰兩 ␪) be a chosen PDF, called the ‘‘proposal PDF,’’
statistical properties of the estimator for R D are then studied. Fi- which is a PDF for ␰ that depends on ␪. The role of p * will
nally, the method is illustrated with the problem of updating the become clear shortly. For convenience in notation, let q(␪)
robust response variance and reliability of a moment-resisting ⫽p(D兩 ␪)p 0 (␪)⫽c ⫺1 p D(␪). Note that the value of q can be com-
frame with measured modal properties. puted readily for a given ␪, while the same is not true for p D . The
MH algorithm to simulate Markov chain samples 兵 ␪1 ,...,␪N 其

with limiting stationary distribution equal to the target PDF p D is
Difficulties in Evaluating R D by Simulation described as follows. To start the Markov chain, let ␪1 be a point
either chosen deterministically or simulated according to some
Eqs. 共2兲 and 共3兲 suggest two ways of evaluating R D by simulation. PDF that approximates p D . In general, to simulate the next
The former suggests estimating R D as the average of h over sample ␪k⫹1 from the current sample ␪k , k⫽1,...,N⫺1, first
samples simulated from p D , while the latter indicates that the simulate a ‘‘candidate state’’ ␰ from the proposal PDF p * (␰兩 ␪k ).
integrals in the numerator and denominator could be estimated Compute the ratio
individually and then combined to give an estimate for R D . Since
p D is known only up to a multiplicative constant and in general a q 共 ␰兲 p * 共 ␪k 兩 ␰兲
method for simulating independent samples from p D is not avail- r⫽ (4)
q 共 ␪k 兲 p * 共 ␰兩 ␪k 兲
able, the first option of using Eq. 共2兲 is not feasible for existing
methods such as Monte Carlo simulation 共MCS兲 or importance Then, accept the candidate state ␰ with probability min兵1,r 其 and
sampling 共Rubinstein 1981兲. Therefore, consider the application reject with the remaining probability 1⫺min兵1,r 其 . If accepted,
of these existing methods to evaluate R D based on Eq. 共3兲. Using the candidate state will be taken as the next state of the Markov
MCS, the numerator is estimated as the average of h(␪)p(D兩 ␪) chain, i.e., ␪k⫹1 ⫽␰. Otherwise, the current state is taken as the
over samples drawn from the prior PDF p 0 共assuming a method next state, i.e., ␪k⫹1 ⫽␪k . The process is repeated until N Markov
for simulating samples from p 0 is available兲. The resulting esti- chain samples have been simulated.
mate, however, is very likely to be biased, since p(D兩 ␪) is con- We now show that the next sample ␪k⫹1 will be distributed as
centrated in the small neighborhood N of thickness O(⑀) where ⑀ p D if the current sample ␪k is. From the theorem of total prob-
is very small, and hence the chance of generating a sample from ability, the PDF of the next sample is given by
p 0 that lies in N is extremely small. Similar difficulties will be
encountered in evaluating the denominator. Using importance
sampling, it is necessary to choose a sampling density that is
p 共 ␪k⫹1 兲 ⫽ 冕 p 共 ␪k⫹1 兩 ␪k 兲 p 共 ␪k 兲 d␪k (5)
concentrated in N, otherwise similar problems as in a MCS will where p(␪k⫹1 兩 ␪k ) is the transition PDF governing the probabilis-
be encountered. However, this is extremely difficult since infor- tic properties of the Markov chain. From the MH algorithm, the
mation about the manifold S where the probability density is transition PDF is given by, for ␪k⫹1 ⫽␪k
concentrated is not directly available.
Next, consider again evaluating R D based on Eq. 共2兲. Although p 共 ␪k⫹1 兩 ␪k 兲 ⫽p * 共 ␪k⫹1 兩 ␪k 兲 min兵 1,r 共 ␪k⫹1 ,␪k 兲其 (6)
a method for generating independent samples according to p D is
generally not available, it is noted that Markov chain Monte Carlo Using Eqs. 共4兲 and 共6兲, along with the identity min兵1,a/b 其 b
simulation, in particular, the MH algorithm, offers a feasible way ⫽min兵1,b/a 其 a for any positive numbers a and b, and the fact that
to simulate samples according to an arbitrary distribution, at the q differs from p D only by a normalizing constant, one can readily
expense of introducing dependence among the samples. However, show the following ‘‘reversibility’’ condition:
direct application of the MH algorithm to simulate Markov chain
samples according to p D is not feasible due to the small region N p 共 ␪k⫹1 兩 ␪k 兲 p D共 ␪k 兲 ⫽p 共 ␪k 兩 ␪k⫹1 兲 p D共 ␪k⫹1 兲 (7)
of probability concentration of p D . Nevertheless, as our proposed which basically says that the transition rates between the two
method is built on the MH algorithm, we first discuss its imple- states ␪k and ␪k⫹1 are equal when p(␪k )⫽p D(␪k ) and p(␪k⫹1 )
mentation. ⫽p D(␪k⫹1 ), that is, the Markov chain is stationary with station-
ary PDF p D . Note that Eq. 共7兲 is trivial for ␪k⫹1 ⫽␪k . Assuming
the current sample ␪k is distributed as p D , i.e., p(␪k )⫽p D(␪k ),
Metropolis-Hastings Algorithm and using the reversibility condition in Eq. 共7兲, p(␪k⫹1 ) in Eq. 共5兲
The Metropolis-Hastings algorithm is a simple procedure to simu- becomes
冕
late samples according to an arbitrary PDF where the target PDF
need only be known up to a scaling constant. It was originally p 共 ␪k⫹1 兲 ⫽ p 共 ␪k 兩 ␪k⫹1 兲 p D共 ␪k⫹1 兲 d␪k
developed by Metropolis and his co-workers for computing ca-
nonical ensembles in statistical physics 共Metropolis et al. 1953兲
and later generalized by Hastings in Bayesian statistics 共Hastings
1970兲. Its potential use for solving reliability problems has been
⫽p D共 ␪k⫹1 兲冕 p 共 ␪k 兩 ␪k⫹1 兲 d␪k ⫽p D共 ␪k⫹1 兲 (8)
recently demonstrated by Au and Beck 共1999兲. since 兰 p(␪k 兩 ␪k⫹1 )d␪k ⫽1. This means that if the current sample
In the MH method, samples are simulated as the states of a ␪k is distributed as the target PDF p D , then so is the next sample
special Markov chain whose limiting stationary distribution is ␪k⫹1 , and hence p D is the stationary PDF of the Markov chain.
J. Eng. Mech., 2002, 128(4): 380-391

In an actual implementation, the Markov chain is started with Direct application of the MH algorithm to simulate samples
the initial state ␪1 simulated from a PDF different from p D , so according to the target PDF p D is not feasible, however, due to
the Markov chain is in a transient state and its samples will not be the problems arising from the small region N of probability con-
distributed exactly as p D . Under the assumption of ergodicity, centration of p D , as described for the other methods discussed in
however, the Markov chain will converge to the stationary state, the last section. In particular, it is difficult to choose the proposal
and so the PDF of ␪k will tend to p D as k→⬁. Theoretically, the PDF p * so that the acceptance rate of the candidate state is not
assumption of ergodicity is usually satisfied if the proposal PDF too small while at the same time the Markov chain samples ef-
p * has a positive density on the support of the target PDF. It is fectively explore N. To see this, first note that it is not possible to
also usually satisfied even if p * has a restricted support. With a choose a nonadaptive p * that can generate samples lying in N,
finite sample size N used in an actual implementation, ergodicity since the information about the manifold S which localizes N is
often becomes an issue of whether the N Markov chain samples not directly available. Thus, consider choosing an adaptive p * ,
can populate sufficiently well the region of significant probability such as a symmetric one in the original Metropolis algorithm. If
of the target PDF p D . See Au and Beck 共1999兲 for a more de- the Markov chain is started in a region not near N, then the
tailed discussion of ergodicity in applying the MH method to chance of generating a candidate state from p * that visits N is
reliability problems. extremely small, and most of the candidate states will be rejected.
As a result, most of the Markov chain samples are repeated, and
Difficulties in Applying Metropolis-Hastings clearly they cannot be used to estimate R D . On the other hand, if
Algorithm to Reliability Updating the Markov chain is started in N, then in order for the simulated
candidate state to remain in N and have a high probability of
Using the Markov chain samples 兵 ␪1 ,...,␪N 其 generated from the being accepted, the spread of p * around the current sample has to
MH procedure, R D is estimated as the average R̃ D of h over the be O(⑀), because the direction along which N extends around the
samples, which is the same as the usual MCS estimator, except current sample is not known. But this means the candidate state
that the samples are simulated from a Markov chain instead of will be very close to the current sample, and as a result the Mar-
being independent and identically distributed 共i.i.d.兲. Neverthe- kov chain samples will not efficiently explore N. In all these
less, the estimator R̃ D has similar statistical properties to those of cases, the region visited by the Markov chain samples will be
MCS estimators 共see later兲. In order to reduce the initial transient small compared to N, leading to significant bias in the estimate
effect of the Markov chain on the estimate, the first few samples for R D .
共say 10兲 are often not used to compute the estimate R̃ D . In this
work, the Markov chain samples 兵 ␪1 ,...,␪N 其 used for computing Proposed Adaptive Simulation Method
the estimate R̃ D are those simulated after the initial transient
The problems encountered in applying the simulation methods
stage.
discussed in the previous sections, including the MH algorithm,
The proposal PDF p * in the MH algorithm affects the distri-
arise from the fact that the updated PDF p D is concentrated in a
bution of the candidate state ␰ given the current state, and conse-
small neighborhood N of the manifold S containing the maxima
quently the convergence rate of the estimator R̃ D to R D . If the of p D . The process of adapting samples to N in the MH algo-
candidate state is rejected too often because small values of r in rithm is inhibited by the small scale ⑀ of the thickness of N
Eq. 共4兲 are encountered during simulation, the Markov chain will compared to the size of the proposal PDF required to cover N
consist of many repeated samples. As a result, the correlation with an affordable number of samples. This suggests that direct
among samples will be increased, slowing down the convergence adaptation using a proposal PDF which varies with a vastly dif-
of R̃ D . To understand how p * affects the acceptance rate of the ferent length scale from that of the target PDF will not be effec-
candidate state, first note that, if p * (␰兩 ␪)⫽p D(␰), then according tive. In view of this, we introduce a sequence of intermediate
to Eq. 共4兲, r⫽1 and hence the candidate state is always accepted. PDFs which bridge the gap in length scale between the prior PDF
In this case, the MH algorithm reduces to a standard Monte Carlo p 0 and the target updated PDF p D . By successively using the MH
procedure with i.i.d. samples simulated from p D . Although this algorithm to utilize the information from the previous adapted
choice is not possible, it indicates that, if p * is chosen to be intermediate PDFs, the region populated by the Markov chain
‘‘nonadaptive,’’ i.e., p * (␰兩 ␪)⫽p * (␰), then the closer the p * is to samples gradually evolves into the desired region N. Conceptu-
the target PDF, the better the acceptance rate of the candidate ally, this is similar to ‘‘simulated annealing’’ 共Fishman 1996兲.
state, and the faster the convergence. Let p 1 ,...,p m be a chosen sequence of PDFs converging to
Choosing a nonadaptive proposal PDF p * means that the in- p D(⫽p m ) so that their region of significant probability content
formation from the current sample is not used to explore the gradually diminishes to that of p D . For example, p i may be cho-
important region of significant probability density of the target sen as the updated PDF from Bayes’ theorem based on an increas-
PDF during simulation. When information about the important ing amount of data, i.e., p i ⫽p Di , where D1 傺¯傺Dm ⫽D. The
region is not available, constructing a nonadaptive p * so that the proposed method is described as follows. Starting with the prior
candidate state simulated from it will lie in N is similar to con- PDF p 0 as the proposal PDF, the MH algorithm is carried out to
structing an importance sampling density concentrated in N, and simulate samples 兵 ␪(1) 1 ,...,␪N 其 with target PDF p 1 . A kernel
(1)
is thus very difficult. It is therefore more desirable to choose an sampling density ␬ 1 is constructed as a weighted sum of Gaussian
‘‘adaptive’’ p * which depends on the current sample. One popular PDFs centered among these samples to approximate p 1 共Silver-
choice is to have p * localized and symmetric, i.e., p * (␰兩 ␪) man 1986; Ang et al. 1992; Au and Beck 1999兲.
⫽p * (␪兩 ␰), which corresponds to the one used in the original
N
Metropolis algorithm 共Metropolis et al. 1953兲. In this case, p *
can be interpreted as a PDF localized at the current sample. The ␬ 1 共 ␪兲 ⫽ 兺 w k ␾ 共 ␪;␪共k1 兲 ,Ck 兲
k⫽1
(9)
Markov chain simulation process can then be viewed as a ‘‘local
random walk’’ in which the region of probability concentration of where ␾(␪;␪(1)
k ,Ck )⫽multidimensional Gaussian PDF evaluated
the target PDF is adaptively explored. at ␪ with mean ␪(1)k and covariance matrix Ck ; and
J. Eng. Mech., 2002, 128(4): 380-391

w k ⫽probability weights associated with the Gaussian PDFs, global covering of the important region of the target PDF at each
which satisfy w k ⭓0 and 兺 k⫽1
N
w k ⫽1. The choice of w k and Ck is simulation level even with a relatively small number of samples,
completely defined by the samples 兵 ␪(1) k 其共Au and Beck 1999兲, since the Markov chain develops globally rather than locally.
and therefore so is the choice of ␬ 1 . Since ␬ 1 is a weighted sum The success of the proposed strategy relies on effective appli-
of Gaussian PDFs, the kernel marginal PDF for a particular com- cation of the MH algorithm at each simulation level, which re-
ponent ␪ j ( j⫽1,...,n) of ␪ can be obtained by analytically inte- quires that p i⫺1 共which is approximated by k i⫺1 兲 varies with a
grating Eq. 共9兲 with respect to all other components ␪ l , l⫽ j similar length scale to p i for i⫽1,...,m. The choice of the se-
quence 兵 p i 其共corresponding to the ‘‘annealing schedule’’ in the
N
simulated annealing algorithm兲 is thus important to the success of
␬ 1共 ␪ j 兲 ⫽ 兺 w k ␾„␪ j ;␪共k1 兲共 j 兲 ,Ck共 j, j 兲 …
k⫽1
(10) the proposed method. The slower the ‘‘adaptation’’ schedule is,
that is, the less p i⫹1 is changed compared to p i , the higher the
where ␾„␪ j ;␪(1) acceptance rate of the candidate state, and the better the result at
k ( j),Ck ( j, j)…⫽one-dimensional Gaussian PDF
evaluated at ␪ j with mean ␪(1) each simulation level. However, a slow schedule requires more
k ( j) and variance Ck ( j, j);
␪(1) ( j)⫽jth component of the vector ␪(1) simulation levels to reach the final level for the updated PDF, and
k k ; and Ck ( j, j)⫽jth diag-
onal entry of the covariance matrix Ck . Similarly, the kernel mar- hence more Markov chain samples to be simulated. The prudent
ginal cumulative distribution function 共CDF兲 of ␪ j can be readily choice of an adaptation schedule is a trade-off between the effi-
computed in terms of a weighted sum of Gaussian CDFs. More ciency of the MH algorithm at each simulation level and the total
importantly, independent samples distributed as ␬ 1 can be readily number of simulation levels required.
simulated that are approximately distributed as p 1 and so lie in The choice of the adaptation schedule can be made by inves-
the region of significant probability of p 1 . In particular, to simu- tigating the probabilistic properties of the ratio r in the MH algo-
late a sample from ␬ 1 in Eq. 共9兲, first draw an index I from the set rithm appearing in Eq. 共4兲, which governs the acceptance rate of
兵 1,...,N 其 with corresponding probabilities 兵 w 1 ,...,w N 其 , and then the candidate states. It is found that a good choice for 兵 p i 其 is
simulate the sample as a Gaussian vector with mean vector ␪I(1) p i⫹1 ⬃p 2i 共up to a normalizing constant兲. In this case, the size of
and covariance matrix CI . the region of probability concentration for p i⫹1 is roughly 1/&
To proceed, ␬ 1 is used as the proposal PDF for simulating that of p i . When the data D consist of measured response time
Markov chain samples 兵 ␪(2) 1 ,...,␪N 其 with target PDF p 2 . These
(2)
histories 共Beck and Katafygiotis 1998兲, this schedule can be
samples are then used to construct the kernel sampling density ␬ 2 achieved by choosing p i ⫽p Di , where the duration of time history
as in Eq. 共9兲, which gives an approximation to p 2 . In general, the
data is doubled in successive simulation levels from Di to Di⫹1 .
kernel sampling density ␬ i 共which approximates p i 兲 is con-
structed using the Markov chain samples at the ith simulation On the other hand, when the updated PDF with data D is of the
level, which is then used as the proposal PDF for simulating form p D(␪)⫽c exp关⫺J(␪)/2⑀ 2 兴共Vanik et al. 2000兲, where J(␪)
Markov chain samples for the next level with target PDF p i⫹1 . is a measure-of-fit function between data and model, and ⑀ is a
This is continued until the mth simulation level, where Markov measure of the size of the prediction error, then the sequence 兵 p i 其
chain samples for the target updated PDF p D⫽p m are simulated. can be obtained by p i ⫽c i exp关⫺J(␪)/2⑀ 2i 兴 where ⑀ 2i ⫽2 m⫺i ⑀ 2 , i
Let R i be the expectation of the response quantity h(␪) when ⫽1,...,m, with 2 m ⬇⑀ ⫺2 , if the length scale of the prior PDF
␪ is distributed as p i , that is p 0 (␪) is O(1).
R i⫽ 冕 h 共 ␪兲 p i 共 ␪兲 d␪ (11)
Statistical Properties of Estimators
Since p i converges to p D as the simulation level i increases to m, The statistical properties of the estimator R̃ i in Eq. 共12兲 are pre-
R i converges to R D defined by Eq. 共2兲. Using the Markov chain sented in this section, assuming the Markov chain generated ac-
samples 兵 ␪(i)
1 ,...,␪N 其 , R i is estimated at the ith simulation level
(i)
cording to the MH algorithm at each simulation level is ergodic.
by R̃ i where In spite of the fact that R̃ i is computed using dependent
samples from a Markov chain, it still has the usual properties of
N
1 MCS estimators using i.i.d. samples 共Doob 1953兲. For example,
R i ⬇R̃ i ⫽ 兺
N k⫽1
h 共 ␪共ki 兲兲 (12)
R̃ i converges to R i with probability 1 as N→⬁ 共strong law of
large numbers兲, and under similar conditions as those for Monte
The proposed strategy makes use of the fact that, if a nonadap- Carlo estimators, R̃ i is normally distributed as N→⬁ 共central
tive proposed PDF is similar in shape to the target PDF, then the limit theorem兲. If the Markov chain is started with the initial state
acceptance rate of the candidate state will not be small, and the ␪(i)
1 distributed as the target PDF p i , then the Markov chain is
MH algorithm will be effective in generating samples populating stationary, and R̃ i is unbiased, that is, E 关 R̃ i 兴 ⫽R i . Otherwise, R̃ i
the important region of the target PDF. Thus, instead of using an is only asymptotically unbiased, although the bias decays expo-
adaptive proposal PDF within each simulation level, the adapta- nentially with the number of Markov steps.
tion is done from one simulation level to the next, where the latest
An expression for the coefficient of variation 共COV兲 of R̃ i ,
kernel sampling density transfers the information about the im-
portant region from the current level to the next level. At each defined as the ratio of the standard deviation to the mean of R̃ i , is
simulation level, since the candidate state is simulated indepen- next derived assuming that the Markov chain has settled into its
dently of the current sample, successive Markov chain samples stationary state and the proposal PDF for the ith simulation level
are correlated only if the candidate state is rejected. With a rea- is fixed. As a result of stationarity, E 关 R̃ i 兴 ⫽R i . The variance of R̃ i
sonable acceptance rate of the candidate states, this leads to a is thus given by
J. Eng. Mech., 2002, 128(4): 380-391

E 关 R̃ i ⫺R i 兴 2 ⫽E 冋 1
兺
N k⫽1
N
关 h 共 ␪共ki 兲兲 ⫺R i 兴册 2 that this additional variability is of the order of the bias in R̃ i , and
hence is often small. The numerical example shows that the COV
predicted by Eq. 共15兲 is quite close to the actual COV, and hence
N N it can be used for assessing the variability of R̃ i .
1
⫽ 2
N j⫽1 兺 k⫽1
兺 E 兵共 h 共 ␪共j i 兲兲 ⫺R i 兲关 h 共 ␪共ki 兲兲 ⫺R i 兴其
Illustrative Example
N N
1
⫽ 2
N j⫽1 兺 k⫽1
兺 c i共 j,k 兲 (13) The proposed methodology is applied to updating the response
variance and reliability of a structure using its identified natural
where c i ( j,k)⫽covariance between h(␪(i) (i)
j ) and h(␪k ). Since the
frequencies for two cases: locally identifiable and unidentifiable.
Markov chain is stationary and c i ( j,k) is symmetric in j and k, The actual structure is a two-dimensional finite-element model of
c i ( j,k) depends only on the absolute difference between the in- a two-story one-bay moment-resisting frame with a bay width of
dexes j and k, that is, c i ( j,k)⫽c i ( 兩 j⫺k 兩 ). Using this result and 8 m and equal story heights of 3 m. The columns have cross-
carrying out the double sum in Eq. 共13兲 with respect to ( j⫺k), sectional area of 18.8⫻10⫺3 m2 and moment of inertia 0.167
we have ⫻10⫺3 m4 . The beams have cross-sectional area of 10.5
⫻10⫺3 m2 and moment of inertia 0.562⫻10⫺3 m4 . The elastic
E 关 R̃ i ⫺R i 兴 2 ⫽
1
N2 冋
Nc i 共 0 兲 ⫹2
k⫽1
N⫺1
兺
共 N⫺k 兲 c i 共 k 兲册 modulus and mass density of steel are assumed to be 2
⫻1011 N/m2 and 7,850 kg/m3, respectively. Lumped masses of
15⫻103 kg are put on the first and second floors. The natural
冋冉冊册
N⫺1 frequencies of the first two modes are computed to be 3.16 and
c i共 0 兲 k
⫽
N
1⫹2
k⫽1
兺 1⫺ ␳ 共k兲
N i
(14) 9.67 Hz, respectively. Using noisy simulated response time histo-
ries, the identified natural frequencies are f̃ 1 ⫽3.13 Hz and f̃ 2
⫽9.83 Hz, which are used as the data D in the updating.
where ␳ i (k)⫽c i (k)/c i (0)⫽correlation coefficient at lag k, k
⫽1,...,N⫺1. The COV of R̃ i , denoted by ␦ i , is then given by
Locally Identifiable Case: Uncertain Stiffness Param-
E 关 R̃ i ⫺R i 兴 2 ⌬ i2 eters
␦ i2 ⫽ ⫽ 共 1⫹␥ i 兲 (15)
R i2 N A two-degree-of-freedom shear building model is used to model
the moment-resisting frame structure in order to identify the in-
where ⌬ i ⫽ 冑c i (0)/R i is the COV of h(␪) when ␪ is distributed as terstory stiffnesses and to predict the response of the structure.
p i (␪), and ␥ i is a correlation factor The story masses are assumed to be 16.5⫻103 and 16.1⫻103 kg
冉冊
N⫺1 for the first and second stories, respectively, which are computed
k
␥ i ⫽2 兺
k⫽1
1⫺ ␳ 共k兲
N i
(16)
based on lumping half of the mass of the columns on the floors
they are connected to. The interstory stiffnesses are parameterized
as k 1 ⫽k̄ 1 ␪ 1 and k 2 ⫽k̄ 2 ␪ 2 , where ␪ 1 and ␪ 2 are the stiffness
By estimating the covariance sequence 兵 c i (k):k⫽0,...,N⫺1 其
parameters to be identified, and k̄ 1 ⫽k̄ 2 ⫽29.7⫻106 N/m are the
from the Markov chain samples 兵 ␪(i)
k :k⫽1,...,N 其 nominal values for the interstory stiffnesses of the first and second
N⫺k stories, respectively. The nominal values are computed assuming
1
c i共 k 兲 ⬇
N⫺k 兺
j⫽1
关 h 共 ␪共j i 兲兲 ⫺R̃ i 兴关 h 共 ␪共j⫹k
i兲
兲 ⫺R̃ i 兴 (17) the beams are rigid 共shear building assumption兲. They do not
correspond to the interstory stiffnesses of the actual structure,
however, due to the flexibility of its beams.
the COV ⌬ i and the correlation sequence 兵 ␳ i (k):k⫽1,...,N⫺1 其 The prior PDF p 0 for ␪ 1 and ␪ 2 is given by the product of two
can be estimated. Consequently, ␥ i in Eq. 共16兲 and hence ␦ i in Eq. lognormal PDFs with most probable values 共MPVs兲 1.3 and 0.8
共15兲 can be estimated, providing a means for assessing the vari- for ␪ 1 and ␪ 2 , respectively, and unit standard deviation for both
ability of the estimate R̃ i using information from a single simula- ␪ 1 and ␪ 2 . These MPVs for the prior PDF reflect a 30% overes-
tion run. timation of the first-story stiffness and a 20% underestimation of
The term ⌬ 2i /N in Eq. 共15兲 is the familiar term for the square the second-story stiffness relative to the nominal shear building
of the COV in MCS with N independent samples. The COV of R̃ i model. The standard deviations for the prior PDF are chosen rela-
is thus equivalent to the COV in MCS with an effective number tively large to express large initial uncertainty in the structural
of independent samples N/(1⫹␥ i ). The efficiency of the estima- model.
tor using correlated samples of a Markov chain (␥ i ⬎0) is re- Using the modal data D, the updated PDF for the stiffness
duced compared to the case when the samples are uncorrelated parameters ␪⫽ 关 ␪ 1 ,␪ 2 兴 is formulated as 共Vanik et al. 2000兲
(␥ i ⫽0), and smaller values of ␥ i imply higher efficiency.
The result for ␦ i in Eq. 共15兲 is derived assuming that the pro- p D共 ␪兲 ⫽c exp关 ⫺J 共 ␪兲 /2⑀ 2 兴 p 0 共 ␪兲 (18)
posal PDF is fixed in independent simulation runs. According to where c⫽normalizing constant; and J(␪)⫽modal measure-of-fit
the proposed methodology, however, the proposal PDF is chosen function
as the kernel sampling density ␬ i constructed using the Markov
chain samples from the previous simulation level 共except for the 2
first simulation level where the prior PDF is used as the proposal J 共 ␪兲 ⫽ 兺 ␭ 2j 关 f 2j 共 ␪兲 / f̃ 2j ⫺1 兴 2 (19)
PDF兲, and so ␬ i is different for each independent simulation run. j⫽1
This gives rise to additional variability in R̃ i , and the actual COV In Eqs. 共18兲 and 共19兲, ⑀/␭ j is the coefficient of variation of the
of R̃ i will be greater than that given by Eq. 共15兲. It can be argued prediction error between each f̃ 2j and the corresponding model
J. Eng. Mech., 2002, 128(4): 380-391

Fig. 1. Markov chain samples for ␪ 1 and ␪ 2 at simulation levels i

⫽1, 5, 9
squared frequency f 2j (␪). The prediction error includes the uncer-

tainty in the identified frequencies as well as the uncertainty in
predicting the natural frequencies of the structure using the shear
building model. We take ␭ 1 ⫽␭ 2 ⫽1 and explore the effect of
different values for ⑀ on the updated PDF for the model param-
eters and the corresponding updated robust reliability. Decreasing
prediction error levels ⑀ 2i ⫽1/2i⫺1 for successive simulation levels
i⫽1,...,9 are investigated. The sequence of intermediate PDFs
兵 p i :i⫽1,...,9 其 is constructed by successively substituting the se-
quence of values ⑀ 2i ⫽1/2i⫺1 into Eq. 共18兲.
Updated Joint Probability Density Function for Model

Parameters ␪ 1 and ␪ 2
Fig. 1 shows the Markov chain samples at simulation levels i Fig. 2. Prior probability distribution function and updated marginal
⫽1, 5, 9, corresponding to ⑀⫽1, 1/4, 1/16 in Eq. 共18兲. For each probability distribution functions for ␪ 1 and ␪ 2 at simulation levels
simulation level, after the first 10 Markov chain samples are ig- i⫽1, 5, 9
nored, N⫽300 Markov chain samples are simulated, which are
shown with dots in Fig. 1. Note that the Markov chain samples
are not all distinct. To show the population of samples consis- resulting estimators. The reason for this is that, in order to main-
tently, the area of the dots is shown proportional to the number of tain a reasonable acceptance rate of the candidate state, the sup-
samples at each location. The numbers of distinct samples are port size of the proposal PDF has to be similar to the size of the
149, 187, and 142 for simulation levels i⫽1, 4, and 9, respec- important region where the Markov chain is initiated. However,
tively. The contour lines for each updated PDF are plotted at this results in a small chance of generating a sample lying in the
levels 0.01, 0.1, 0.5, and 0.9 relative to the peak value of the other important region, since the spread of the proposal PDF is
updated PDF in each figure. It can be seen that the area enclosed not large enough. In contrast, in the proposed strategy, the pro-
by the outermost contour 共0.01兲, and hence the region of prob- posal PDF has significant probability content in the two important
ability concentration of the updated PDF, diminishes as ⑀ de- regions since it is constructed using the Markov chain samples
creases. The samples populate well the important region of the from the previous simulation level (i⫽8) which are clustered
updated PDFs. More importantly, the density of the population is around the two regions. Consequently, the candidate states and
consistent with the exact updated PDF. hence the Markov chain samples at simulation level i⫽9 are also
The updated PDF bifurcates into two peaks as ⑀ decreases clustered around the two important regions. Here, one sees the
because there are two ‘‘optimal’’ models that give the identified advantage of adapting from one simulation level to the next
frequencies, showing that the class of model is only locally iden- 共using a kernel sampling density兲 rather than within each simula-
tifiable based on the modal data. In spite of this complexity, the tion level 共using an adaptive proposal PDF兲 for updating robust
samples still populate the important neighborhood of the two reliability problems.
peaks, which are essentially disconnected from each other. This
ability to populate samples in disconnected regions of significant Updated Marginal Distributions
probability density offers a robust and powerful way of locating The marginal kernel PDFs for ␪ 1 and ␪ 2 constructed from the
the domain of influence of the optimal parameter points in Baye- Markov chain samples are shown in Fig. 2. The corresponding
sian model updating problems, even when the model is not 共glo- marginal kernel CDFs are shown in Fig. 3. The prior PDFs and
bally兲 identifiable based on the available data, as in the present CDFs in Figs. 2 and 3, respectively, are included for reference.
example. It should be noted that, if the Markov chain samples is The results from three independent simulation runs are shown
generated using an adaptive proposal PDF, either one of the im- with dashed lines. Recall that the estimated marginal PDFs and
portant regions at simulation level i⫽9 is likely to be missed by CDFs are readily obtained from the joint kernel PDF without
the Markov chain samples, resulting in significant bias in the numerical integration. For comparison purposes, the exact mar-
J. Eng. Mech., 2002, 128(4): 380-391

Fig. 4. Estimates ˜␴ y j (D), j⫽1, 2, for updated robust drift standard

deviations 共uncertain stiffness兲
Lyapunov equation 共Soong and Grigoriu 1993兲. Using the up-

dated PDF p D , the robust variances of the interstory drifts are
given by Eq. 共2兲 with h(␪)⫽␴ 2y j (␪)
␴ 2y 共 D兲 ⫽
j
冕 ␴ 2y 共 ␪兲 p D共 ␪兲 d␪
j
(20)
where j⫽1, 2 for the first and second stories, respectively.

The robust failure probability that the interstory drift of the jth
story exceeds the threshold level b⫽15 mm within a duration of
T⫽10 s is also computed
冕
Fig. 3. Prior cumulative distribution function and updated marginal
cumulative distribution functions for ␪ 1 and ␪ 2 at simulation levels P 共 F j 兩 D兲 ⫽ P 共 F j 兩 ␪兲 p D共 ␪兲 d␪ (21)
i⫽1, 5, 9
where P(F j 兩 ␪) is the first excursion probability for given model
parameters ␪. Assuming the out-crossing events follow a Poisson
ginal PDFs and CDFs obtained by numerical integration are also process, P(F j 兩 ␪) can be approximated by 共Soong and Grigoriu
plotted with solid lines in Figs. 2 and 3, respectively. The results 1993兲
for the PDFs can be considered acceptable if fine detail is not
required. The results for the CDFs shown in Fig. 3 exhibit a better P 共 F j 兩 D兲 ⫽1⫺exp关 ⫺2␯ j 共 ␪兲 T 兴 (22)
match with the exact results, as the spurious noise in the PDFs is where ␯ j (␪) is the up-crossing rate for given model parameters ␪,
filtered out by integration in the CDFs. The errors in the marginal given by Rice’s formula
PDFs and CDFs come from two sources. The first source is that
the finite number of Markov chain samples is not distributed ex-
actly as the target PDF, which introduces bias in the estimates.
The second is due to the approximate nature of the kernel PDF
␯ j 共 ␪兲 ⫽
␴ y j 共 ␪兲
2␲␴ y j 共 ␪兲
exp ⫺冉 b2
2␴ y j 共 ␪兲 2 冊 (23)
using a finite number of samples. Thus, even if the samples are The estimates for updated robust standard deviations ˜␴ y j (D)
exactly distributed like the target PDF, there will still be discrep- ( j⫽1,2) are shown in Fig. 4 and the estimates for the updated
ancies between the kernel density estimates and the exact results. robust failure probabilities P̃(F j 兩 D) ( j⫽1,2) are shown in Fig.
The dependent nature of the Markov chain samples in general 5. In these figures, three sample estimates, corresponding to the
does not bias the kernel PDF, but it does slow down the conver- same simulation runs as in Figs. 2 and 3, are shown with circles.
gence of the kernel PDF compared to the case when the samples The exact results obtained by numerical integration are shown
are independent. with solid lines. Simulation level i⫽0 refers to the case when no
data are available and the robust response quantities are computed
Updated Robust Variance and Reliability based on the prior PDF p 0 only, that is, by Eq. 共2兲 with p 0 re-
The response of the structure when it is subjected to stochastic placing p D . To investigate the bias of the simulation results, the
excitations is predicted using the shear building model. Of inter- average of the estimates over 50 independent simulation runs is
est are the interstory drift responses y 1 and y 2 of the first and computed and shown with dashed lines in the figures. In both
second stories, respectively. The structure is assumed to be sub- Figs. 4 and 5, the results from averaging 50 simulation runs
jected to earthquake motion modeled by stationary Gaussian 共dashed lines兲 almost overlap with the exact results 共solid lines兲,
white noise with spectral intensity S⫽1⫻10⫺2 m2 /s3 . The sta- showing that the bias from the initial transient stage of the simu-
tionary variances ␴ 2y 1 (␪) and ␴ 2y 2 (␪) of the interstory drifts for lated Markov chains is negligible, and the estimates are practi-
given model parameters ␪ can be obtained by solving the cally unbiased.
J. Eng. Mech., 2002, 128(4): 380-391

Fig. 5. Estimates P̃(F j 兩 D), j⫽1, 2, for updated robust failure prob-
Fig. 7. Coefficient of variation of updated robust failure probability
abilities 共uncertain stiffness兲
estimates 共uncertain stiffness兲
Variability of Estimates
The sample COV of the updated robust drift variance and failure there is an infinite set of ‘‘optimal’’ mass and stiffness parameter
probability estimates computed from the 50 independent runs are values that give the two identified frequencies.
shown in Figs. 6 and 7. Also, the COV estimates based on Eq. The prior PDF p 0 for ␪ 1 to ␪ 4 is given by the product of four
共15兲, which does not include the variability of the proposal PDF, lognormal PDFs with MPVs 1.3,0.8,0.95,0.95, and standard de-
are computed and shown in these figures: the COV estimates for viations 1,1,0.1,0.1, respectively. Relatively small values of the
the three simulation runs in Figs. 2 and 3 are shown with circles, standard deviations are assumed for the mass parameters to reflect
while the averages of the COV estimates based on Eq. 共15兲 over that, in practice, they can usually be determined more accurately
50 simulation runs are shown with dashed lines. From Figs. 6 and from the structural drawings than the stiffness parameters.
7, it can be seen that the actual COV of the robust drift variance Fig. 8 shows the Markov chain samples at simulation levels
and failure probability estimates 共solid lines兲 are quite close to i⫽1, 5, 9 corresponding to ⑀⫽1, 1/4, 1/16 in Eq. 共18兲. At each
those predicted by Eq. 共15兲共circles and dashed lines兲, indicating level, N⫽500 Markov chain samples are simulated. The four
that the additional variability due to the randomness of the pro- components of the samples are shown in two groups: ␪ 2 versus ␪ 1
posal PDF is negligible and Eq. 共15兲 is useful for assessing the in the first column and ␪ 4 versus ␪ 3 in the second column of Fig.
variability of the updated robust estimates. 8. The distribution of the samples for the stiffness parameters as
shown in the first column of the figure is qualitatively similar to
that shown in Fig. 1 but with more spread. There is no significant
Unidentifiable Case: Uncertain Stiffness and Mass Pa- pattern in the distribution of the mass parameters as shown in the
rameters second column of Fig. 8, as a result of the small uncertainty
Next, the story masses are also updated, in addition to the stiff- assumed in their prior distribution. In fact, the prior distributions
nesses. The story masses are parametrized as m 1 ⫽␪ 3 m̄ 1 and m 2 on the mass and stiffness parameters serve as a ‘‘regularizer’’ for
⫽␪ 4 m̄ 2 , where ␪ 3 and ␪ 4 are the mass parameters to be updated, this unidentifiable problem 共in the sense of Tikhonov and Arsenin
and m̄ 1 ⫽16.5⫻103 kg and m̄ 2 ⫽16.1⫻103 kg are the nominal 1977兲.
values for the first- and second-story masses, respectively. There
are thus four parameters 共two stiffness parameters ␪ 1 and ␪ 2 , and
two mass parameters ␪ 3 and ␪ 4 兲 to be updated. In this example,
the class of models is unidentifiable based on the data because
Fig. 6. Coefficient of variation of updated robust drift variance esti- Fig. 8. Markov chain samples for ␪ 1 to ␪ 4 at simulation levels i
mates 共uncertain stiffness兲 ⫽1, 5, 9
J. Eng. Mech., 2002, 128(4): 380-391

Fig. 9. Prior probability distribution function and updated marginal probability distribution functions for ␪ 1 to ␪ 4 at simulation levels i⫽1, 5, 9
共solid lines: average over 50 runs; dashed lines: three runs兲
The marginal kernel PDFs for ␪ 1 to ␪ 4 constructed using the shown in Fig. 11 and their COV is shown in Fig. 12. These results
Markov chain samples are shown in Fig. 9. The corresponding demonstrate the robustness of the proposed method to problems
marginal kernel CDFs are shown in Fig. 10. Calculation of the of higher dimensions.
exact marginal PDFs and CDFs involves three-dimensional nu-
merical integration, and was not attempted. Instead, the sample
average over 50 simulation runs of the marginal kernel distribu- Conclusions
tions are computed and are shown as solid lines in Figs. 9 and 10.
Finally, the estimates for the updated failure probabilities of The usual practice in system identification is to use system data to
the interstory drifts for the first and second stories are computed identify one model from a set of possible models and then to use
using the Markov chain samples. The results for the estimates are this model for predicting system behavior. In contrast, the present
Fig. 10. Prior cumulative distribution function and updated marginal cumulative distribution functions for ␪ 1 to ␪ 4 at simulation levels i⫽1, 5,
9 共solid lines: average over 50 runs; dashed lines: three runs兲
J. Eng. Mech., 2002, 128(4): 380-391

samples are practically unbiased, and their variability can be con-
sistently estimated by an available formula using information
from a single simulation run.
The success of the proposed methodology relies on a good
choice of the intermediate PDFs and their approximation by ker-
nel PDFs. Future research may focus on developing better inter-
mediate PDFs as well as enhancing the performance of the kernel
PDF approximations.
Acknowledgments
This paper is based upon work partly supported by the Pacific

Earthquake Engineering Research Center under National Science
Fig. 11. Estimates P̃(F j 兩 D), j⫽1, 2, for updated robust failure prob- Foundation Cooperative Agreement No. CMS-9701568. This sup-
abilities 共uncertain stiffness and mass兲
port is gratefully acknowledged.
robust predictive approach rigorously combines the predictions of References

all models, appropriately weighted by their updated probabilities
based on the data. This approach works regardless of whether the Ang, G. L., Ang, A. H.-S., and Tang, W. H. 共1992兲. ‘‘Optimal importance
class of models is identifiable for not. sampling density estimator.’’ J. Eng. Mech., 118共6兲, 1146 –1163.
The updated predictions can give useful information about the Au, S. K., and Beck, J. L. 共1999兲. ‘‘A new adaptive importance sampling
expected performance of a structure. However, the integrals in- scheme.’’ Struct. Safety, 21, 135–158.
volved are difficult to evaluate because they are generally multi- Beck, J. L. 共1989兲. ‘‘Statistical system identification of structures.’’ Proc.,
dimensional, and the updated PDF for the model parameters is 5th Int. Conf. Structural Safety and Reliability, ASCE, New York,
1395–1402.
known only up to a scaling constant and concentrated in a small
Beck, J. L., and Katafygiotis, L. S. 共1998兲. ‘‘Updating models and their
region of the parameter space. Conventional simulation methods
uncertainties. Part I: Bayesian statistical framework.’’ J. Eng. Mech.,
such as standard Monte Carlo or importance sampling simulation
124共4兲, 455– 461.
are found to be inapplicable. Markov chain simulation based on Box, G. E. P., and Tiao, G. C. 共1992兲. Bayesian inference in statistical
direct application of the Metropolis-Hastings algorithm is not ap- analysis, Wiley, New York.
plicable either, since it is difficult to choose a suitable proposal Collins, J. D., Hart, G. C., Hasselman, T. K., and Kennedy, B. 共1974兲.
distribution in the absence of knowledge about the region of con- ‘‘Statistical identification of structures.’’ AIAA J., 12, 185–190.
centration of the updated PDF for the model parameters. Cox, R. T. 共1961兲. The algebra of probable inference, Johns Hopkins,
The proposed methodology combines the Metropolis-Hastings Baltimore.
algorithm with a concept similar to that used in simulated anneal- Doob, J. L. 共1953兲. Stochastic processes, Wiley, New York.
ing: it uses successive simulation levels to adapt to the region of Engelund, S., and Sorensen, J. D. 共1998兲. ‘‘A probabilistic model for
probability concentration of the updated PDF. Theoretical and chloride-ingress and initiation of corrosion in reinforced concrete
numerical investigations show that the Markov chain samples structures.’’ Struct. Safety, 20共1兲, 69– 89.
populate the region of concentration of the updated PDF in a Enright, M., and Frangopol, D. M. 共1999兲. ‘‘Condition prediction of de-
probabilistically correct manner. The resulting robust reliability teriorating concrete bridges.’’ J. Struct. Eng., 125共10兲, 1118 –1125.
and response variance integrals estimated using the Markov chain Fishman, G. S. 共1996兲. Monte Carlo: Concepts, algorithms, and applica-
tions, Springer-Verlag, New York.
Geyskens, P., Der Kiureghian, A., and Monteiro, P. 共1998兲. ‘‘Bayesian
prediction of elastic modulus of concrete.’’ J. Struct. Eng., 124共1兲,
89–95.
Hastings, W. K. 共1970兲. ‘‘Monte Carlo sampling methods using Markov
chains and their applications.’’ Biometrika, 57, 97–109.
Jaynes, E. T. 共1978兲. ‘‘Where do we stand on maximum entropy?’’ The
maximum entropy formalism, R. D. Levine and M. Tribus, eds., MIT,
Cambridge, Mass.
Katafygiotis, L. S., and Beck, J. L. 共1998兲. ‘‘Updating models and their
uncertainties—model identifiability.’’ J. Eng. Mech., 124共4兲, 463–
467.
Katafygiotis, L. S., Papadimitriou, C., and Lam, H. F. 共1998兲. ‘‘A proba-
bilistic approach to structural model updating.’’ Soil Dyn. Earthquake
Eng., 17共7-8兲, 495–507.
Metropolis, N., Rosenbluth, A. W., Rosenbluth, M. N., and Teller, A. H.
共1953兲. ‘‘Equations of state calculations by fast computing machines.’’
J. Chem. Phys., 21共6兲, 1087–1092.
Papadimitriou, C., Beck, J. L., and Au, S. K. 共2000兲. ‘‘Entropy-based
Fig. 12. Coefficient of variation of updated robust failure probability
optimal sensor location for structural model updating.’’ J. Vib. Con-
estimates 共uncertain stiffness and mass兲
trol, 6共5兲, 781– 800.
J. Eng. Mech., 2002, 128(4): 380-391

Papadimitriou, C., Beck, J. L., and Katafygiotis, L. S. 共2001兲. ‘‘Updating Tikhonov, A. N., and Arsenin, V. Y. 共1977兲. Solution of ill-posed prob-
robust reliability using structural test data.’’ Probab. Eng. Mech., lems, Wiley, New York.
16共2兲, 103–113. Törn, A., and Zilinskas, A. 共1989兲. Global optimization, Vol. 350 of Lec-
Rubinstein, R. Y. 共1981兲. Simulation and the Monte-Carlo method, Wiley, ture Notes in Computer Science, Springer-Verlag, Berlin.
New York. Vanik, M. W., Beck, J. L., and Au, S. K. 共2000兲. ‘‘Bayesian probabilistic
Silverman, B. W. 共1986兲. Density estimators, Chapman and Hall, New approach to structural health monitoring.’’ J. Eng. Mech., 126共7兲,
York. 738 –745.
Singhal, A., and Kiremidjian, A. S. 共1998兲. ‘‘Bayesian updating of fra- Yang, C. M., and Beck, J. L. 共1998兲. ‘‘Generalized trajectory methods for
gilities with application to RC frames.’’ J. Struct. Eng., 124共8兲, 922– finding multiple extrema and roots of functions.’’ J. Optim. Theory
929.
Appl., 97共4兲, 211–227.
Sivia, D. S. 共1996兲. Data analysis: A Bayesian tutorial, Clarendon, Ox-
Zheng, R. H., and Ellingwood, B. R. 共1998兲. ‘‘Role of non-destructive
ford.
evaluation in time-dependent reliability analysis.’’ Struct. Safety,
Soong, T. T., and Grigoriu, M. 共1993兲. Random vibration of mechanical

and structural systems, Prentice Hall, Englewood Cliffs, N.J. 20共4兲, 325–339.
J. Eng. Mech., 2002, 128(4): 380-391

Bayesian Updating of Structural Models and Reliability Using Markov Chain Monte Carlo Simulation

Uploaded by

Document Information

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Bayesian Updating of Structural Models and Reliability Using Markov Chain Monte Carlo Simulation

Uploaded by

Copyright:

Available Formats

Bayesian Updating of Structural Models and Reliability using

Markov Chain Monte Carlo Simulation

Introduction flexibility; errors because of the spatial discretization of distrib-

380 / JOURNAL OF ENGINEERING MECHANICS / APRIL 2002

J. Eng. Mech., 2002, 128(4): 380-391

p D共 ␪兲 ⫽p 共 D兩 ␪,M兲 p 0 共 ␪兩 M兲 /p 共 D兩 M兲 ⫽cp 共 D兩 ␪,M兲 p 0 共 ␪兩 M兲 is globally maximized. The thickness of N around S is of the

JOURNAL OF ENGINEERING MECHANICS / APRIL 2002 / 381

J. Eng. Mech., 2002, 128(4): 380-391

MH algorithm to simulate Markov chain samples 兵 ␪1 ,...,␪N 其

382 / JOURNAL OF ENGINEERING MECHANICS / APRIL 2002

J. Eng. Mech., 2002, 128(4): 380-391

JOURNAL OF ENGINEERING MECHANICS / APRIL 2002 / 383

J. Eng. Mech., 2002, 128(4): 380-391

384 / JOURNAL OF ENGINEERING MECHANICS / APRIL 2002

J. Eng. Mech., 2002, 128(4): 380-391

JOURNAL OF ENGINEERING MECHANICS / APRIL 2002 / 385

J. Eng. Mech., 2002, 128(4): 380-391

Fig. 1. Markov chain samples for ␪ 1 and ␪ 2 at simulation levels i

squared frequency f 2j (␪). The prediction error includes the uncer-

Updated Joint Probability Density Function for Model

386 / JOURNAL OF ENGINEERING MECHANICS / APRIL 2002

J. Eng. Mech., 2002, 128(4): 380-391

Fig. 4. Estimates ˜␴ y j (D), j⫽1, 2, for updated robust drift standard

Lyapunov equation 共Soong and Grigoriu 1993兲. Using the up-

where j⫽1, 2 for the first and second stories, respectively.

JOURNAL OF ENGINEERING MECHANICS / APRIL 2002 / 387

J. Eng. Mech., 2002, 128(4): 380-391

388 / JOURNAL OF ENGINEERING MECHANICS / APRIL 2002

J. Eng. Mech., 2002, 128(4): 380-391

JOURNAL OF ENGINEERING MECHANICS / APRIL 2002 / 389

J. Eng. Mech., 2002, 128(4): 380-391

This paper is based upon work partly supported by the Pacific

robust predictive approach rigorously combines the predictions of References

390 / JOURNAL OF ENGINEERING MECHANICS / APRIL 2002

J. Eng. Mech., 2002, 128(4): 380-391

Soong, T. T., and Grigoriu, M. 共1993兲. Random vibration of mechanical

JOURNAL OF ENGINEERING MECHANICS / APRIL 2002 / 391

J. Eng. Mech., 2002, 128(4): 380-391

You might also like