Professional Documents
Culture Documents
Kang 2020
Kang 2020
Abstract: Structural health monitoring models provide important information for safety control of large dams. The main challenge in
developing an accurate dam behavior prediction model lies in the modeling of extreme temperature effect. This paper presents a Gaussian
process regression-based displacement model for health monitoring of concrete gravity dams, which can model the temperature effect by
using long-term air temperature data. Important attractions of Gaussian processes include accurate simulation results, convenient training, and
so forth. Different covariance functions and temperature variable sets are tested on the horizontal displacement prediction problem of concrete
dams. Results show that segmented air temperature based Gaussian process regression models can reflect the extreme air temperature effect
on displacements of concrete gravity dams, considering the prediction accuracy is much better than that of a mathematical model based on
periodic functions. DOI: 10.1061/(ASCE)ST.1943-541X.0002467. © 2019 American Society of Civil Engineers.
Author keywords: Dam behavior modeling; Displacements; Structural health monitoring; Temperature simulation; Gaussian processes.
temperature effect to displacements of concrete gravity dams using are coefficients to be determined.
measured air temperature with a GPR tool. Practical approaches The time effect in Eq. (1) is mainly caused by creep. The asymp-
to consider the influence of extreme environmental temperature to totic variation of time effect can be described as (Léger and Leclerc
concrete dam displacement are proposed. Nonlinear mapping ca- 2007; Mata et al. 2013) or Eq. (5) (Gu and Wu 2006; Su et al. 2015;
pability of GPR models with various covariance functions is studied Li et al. 2013)
to make an appropriate selection. The proposed approach is verified
on a real-world concrete gravity dam. yt ¼ c0 þ c1 θ þ c2 e−θ ð4Þ
yt ¼ c0 þ c1 θ þ c2 lnðθ þ 1Þ ð5Þ
Hydrostatic-Season-Time Model
where θ ¼ t=100 (Gu and Wu 2006) or θ ¼ t=365 (Salazar et al.
Displacement (or deformation) is an overall index of the dam health 2015), t represents the number of days from analysis beginning
status and it is also the most intuitive reflection of dam behaviors. date, c0 , c1 , and c2 are coefficients. Exponential functions or log-
Therefore, the displacement monitoring modeling problem is stud- arithmic functions are generally adopted to represent the evolution
ied. As shown in Fig. 1, the concrete dam displacement is affected of concrete creep at early days. A linear function yt ¼ c0 þ c1 θ is
by several environmental variables. The displacement data obtained recommended to dams running for many years (Tatin et al. 2015).
by an instrument can be formulated by a general statistical model as
30 30
Air temperature ( C)
Air temperature ( C)
20 20
10 10
0 0
-10 -10
-20 -20
-30 -30
2000 2001 2002 2003 2005 2006 2007 2008
(a) Time (year) (b) Time (year)
Downloaded from ascelibrary.org by East Carolina University on 12/06/19. Copyright ASCE. For personal use only; all rights reserved.
Fig. 2. Temperature variations in different years: (a) winter; and (b) summer.
recorded dam temperature for a concrete dam that has been run for S5 ¼ fT 0 ; T 1−2 ; T 3−7 ; T 8−15 ; T 16−30 ; T 31−60 ; T 61−90 ; T 91−120 ;
many years.
It is convenient to simulate the temperature effect by using har- T 121−180 ; T 181−240 ; T 241−300 ; T 301−365 ; T 366−430 ; T 431−490 ;
monic sinusoidal functions as described in the HST model. How- T 491−550 ; T 551−610 ; T 611−670 ; T 671−730 ; T 731−790 ; T 791−850 ;
ever, the effect of extreme low or extreme high temperatures on the
dam displacement cannot be reflected by this type of periodic func- T 851−910 ; T 911−970 ; T 971−1030 ; T 1031−1095 g ð31Þ
tion. The internal temperature changes are mainly influenced by
environmental temperature changes. Except for simulation by har- Eqs. (27)–(31), S1 , and S2 contain data of one-third year and
monic functions, the temperature effect can also be described by air half a year before the displacement observation date. S3 , S4 , and S5
temperature. Considering there is a lag effect existing between the contain data of one to three years before the displacement meas-
internal and outside temperature changes (He et al. 2010; Shi et al. urement, respectively. Variable sets using additional years’ data can
2018), the displacement caused by temperature changes can be be obtained in this manner. The proposed variable sets are adopted
simulated as together with other variables for displacement modeling (Kang et al.
2019b). For example, when S4 is adopted, the complete variable set
yT ¼ b1 T 0 þ b2 T 1−2 þ b3 T 3−7 þ b4 T 8−15 can be summarized as {H1 , H 2 , H3 , T 0 , T 1−2 , T 3−7 , T 8−15 , T 16−30 ,
T 31−60 , T 61−90 , T 91−120 , T 121−180 , T 181−240 , T 241−300 , T 301−365 ,
þ b5 T 16−30 þ b6 T 31−60 þ · · · ð26Þ T 366−430 , T 431−490 , T 491−550 , T 551−610 , T 611−670 , T 671−730 , θ}.
where T p−q is the mean value of air temperature during the p
to q days before the displacement measurement date (Kang et al. Dam Health Monitoring Modeling Using GPR
2019b). Considering Extreme Temperatures
Mata et al. (2014) proposed a model using internal tempera- A MLR model is unable to simulate dam responses accurately
tures and better performance is obtained and compared with a HST when segmented air temperature data are adopted. Therefore, a
model. Considering the internal temperature is correlated to the out- flexible GPR model is preferable to be adopted to model nonlinear
side temperature, it is promising to obtain reliable results using air interactions between many input variables. According to Eq. (11),
temperature for yT modeling. In the literature, no more than one- the model based on GPR for displacement modeling of concrete
third year temperature data is adopted, and the proper length of time dams can be expressed as
still needs further discussion. Fig. 2 shows an example of temper-
ature variations around a reservoir. There is a difference of 10°C yD ¼ GPRðxÞ ¼ Kðx; XÞ½KðX; XÞ þ σ2 I−1 y ð32Þ
between the temperatures of different years. More reasonable sim-
ulation is expected by using long-period air temperature data. There- For displacement modeling, different input variables should be
fore, the following variable sets are proposed (Kang et al. 2019a) taken into account when different temperature effect simulation ap-
proaches are adopted (Kang et al. 2019b). When periodic functions
S1 ¼ fT 0 ; T 1−2 ; T 3−7 ; T 8−15 ; T 16−30 ; T 31−60 ; T 61−90 ; T 91−120 g are adopted, the input can be given as
ð27Þ
x ¼ fH1 ; H 2 ; H 3 ; sinðsÞ; cosðsÞ; sin2 ðsÞ; sinðsÞ cosðsÞ; θg ð33Þ
S2 ¼ fT 0 ; T 1−2 ; T 3−7 ; T 8−15 ; T 16−30 ; T 31−60 ; When the proposed approach is adopted, using simulation with
T 61−90 ; T 91−120 ; T 121−180 g ð28Þ S4 as an example, the input can be given as
S4 ¼ fT 0 ; T 1−2 ; T 3−7 ; T 8−15 ; T 16−30 ; T 31−60 ; T 61−90 ; T 91−120 ; The GPR model is established based on data selected from the
T 121−180 ; T 181−240 ; T 241−300 ; T 301−365 ; T 366−430 ; T 431−490 ; dam safety monitoring system. Several performance criterions are
adopted to evaluate different models. These criterions include the
T 491−550 ; T 551−610 ; T 611−670 ; T 671−730 g ð30Þ root mean squared error (RMSE), the mean absolute error (AEmean ),
PN
ðyD ðiÞ − ȳD ÞðyðiÞ − ȳÞ The proposed methodology was verified on a real-world concrete
R ¼ pffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffiffi
PN i¼1 2
PN 2
ð38Þ gravity dam. The performance was also compared with that of
i¼1 ðy D ðiÞ − ȳD Þ i¼1 ðyðiÞ − ȳÞ
conventional MLR models and harmonic sinusoidal function-
where yD and ȳD are the prediction values and the average value, y based GPR models. All the modeling problems were performed in
and ȳ are the measured values and the average value, and N is the MATLAB (version R2014a) environment.
number of observations. The desired model should have the mini- The open-source package Gaussian Packages for Machine
mum RMSE, AEmax , AEmean and an R close to 1. The RMSE, Learning (GPML) (Rasmussen and Nickisch 2010) was employed
AEmax , and AEmean have the same unit mm as the displacement of to perform the analyses of GPR. The hyperparameters were initial-
the concrete gravity dam. ized before running the GPR model (Rasmussen and Williams
The main steps of the proposed approach are summarized as 2006). Except for special statements, the parameters were initialized
follows. The flowchart is shown in Fig. 3. as ln σf ¼ 0, ln σn ¼ 0.1, ln α ¼ 0, ln c ¼ 0, ln ω ¼ 2, and ln li ¼ 0
1. Select the input vector x. The output variable y is the horizontal according to the experience (Kang et al. 2015, 2017b).
displacement.
2. When the air temperature variables are selected, the values of Concrete Gravity Dam Description and
these segmented variables can be calculated based on measured Data Preparation
air temperature series.
3. Select data from the dam safety monitoring system, and estab- The example shown in Fig. 4 is a 91-m high and 1,080-m long
lish sample sets for the GPR model training and test. Divide the concrete gravity dam. The whole dam is divided into 60 sections
selected data into the training and test sets. Training samples with the same width along the axis. The dam construction was com-
account for about 80%, and test samples account for about 20% pleted in 1953. A safety monitoring system was also installed to
of the total samples. If there is enough data, more test samples know the dam status. The measured items include reservoir water
can be adopted. levels, air temperatures, displacements, seepages, and so on. At the
4. Normalize the original data into [0, 1]. beginning, horizontal displacements at the top of the dam are
measured by using tension wire alignment, which is replaced by
a vacuum laser collimation system with higher accuracy in 1984.
Using Section 32 of the dam as an example, the effectiveness of
Start the proposed displacement model is studied. Data from Section 32
are sufficient for displacement modeling. The water level curve is
Prepare training samples and test samples
No
Accuracy Satisfied?
Yes
Structural health monitoring of dams
Fig. 3. Flowchart of proposed approach for dam health monitoring Fig. 4. Downstream view of concrete gravity dam. (Image by
modeling. Fei Kang.)
4
Training Test
2
Displacement (mm)
0
-2
-4
-6
-8
-10
-12
1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009
Time (year)
40
30
Air temperature ( C)
20
10
0
-10
-20
-30
1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009
Time (year)
shown in Fig. 5. The horizontal displacement curve of Section 32 is Experimental Studies of GPR Models Using
illustrated in Fig. 6. The measured mean daily air temperature curve Harmonic Functions
is shown in Fig. 7. Data are selected from 1997 to 2008, and a total First, the proposed GPR model for dam displacement prediction is
of 156 samples are formed. The training set contains 127 samples compared with multiple linear regression and stepwise regression
within the period 1997–2006, and the test set contains 29 samples using harmonic sinusoidal functions. The adopted environmental
in 2007–2008. variables (input variables) are listed in Table 1, according to section
Before the experiments, outlier detection was performed. The “GPR-Based Displacement Model Considering Extreme Tempera-
standardized residual (ei =σ) is adopted to judge whether the obser- ture.” Initial values and optimization results for hyperparameters of
vation is an outlier, where ei is the residual of data i, σ is the stan- GPR models with representative covariance functions are given in
dard deviation. The points outside the boundary of 3 are regarded Table 2. Three components of dam horizontal displacement are an-
as outliers since the probability of standardized residuals within the alyzed by statistical analysis and presented in Fig. 9. The temper-
range of ½−3; þ3 is 99.7%. The results are shown in Fig. 8, and one ature effect takes the most important part in the dam horizontal
sample is identified as the outlier. The SEard covariance function displacement variations.
is adopted in Fig. 8, because it performs the best for the studied The comparison of prediction values by MLR using periodic
problem. In the following sections “Experimental Studies of GPR functions (MLR-HST) and measured values are illustrated in
Models Using Harmonic Functions” and “Experimental Studies of Fig. 10. Convergence process curves for optimizing hyperpara-
GPR Models Using Air Temperature,” the outlier is removed. In the meters of GPR using harmonic functions are illustrated in Fig. 11.
section “Effect of Outliers to Prediction Accuracy,” the outlier is The adopted optimization algorithm can efficiently find the optimal
kept as it is to observe the influence. parameters. The performance of GPR using periodic functions
Standardized residual 3 3
Standardized residual
2 2
1 1
0 0
-1 -1
-2 -2
-3 -3
-4 -4
Downloaded from ascelibrary.org by East Carolina University on 12/06/19. Copyright ASCE. For personal use only; all rights reserved.
-5 -5
1996 1998 2000 2002 2004 2006 2008 1996 1998 2000 2002 2004 2006 2008
(a) Time (year) (b) Time (year)
Fig. 8. Outlier detection results: (a) MLR-HST model; and (b) GPR-S4 using SEard covariance function.
0 0
Displacement (mm)
Displacement (mm)
-2 -2
-4 -4
-6 -6
-8 -8
-10 -10
Downloaded from ascelibrary.org by East Carolina University on 12/06/19. Copyright ASCE. For personal use only; all rights reserved.
-12 -12
1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2007 2008 2009
(a) Time (year) (b) Time (year)
Fig. 10. Performance of harmonic functions based MLR model: (a) training; and (b) test.
-60 The results of multiple linear regression models using these vari-
GRP (SEiso) - 1 able sets are given in Table 6. However, air temperature based MLR
-80
GRP (SEiso) - 2 models do not perform as well as the MLR-HST models in terms of
Negative marginal likelihood
-100 GPR (SEard) - 1 the prediction errors. The reason is that MLR models are not able to
GPR (SEard) - 2 simulate the nonlinear relationship between many air temperature
-120 variables and the studied dam response.
-140 Prediction results of GPR models using air temperature variable
sets are given in Table 7. The best results are highlighted in bold-
-160 face. As provided in Table 7, with the increase of temperature data
-180
adopted, the prediction errors become smaller and smaller. But
when the period of temperature data increased from two years to
-200 three years, the prediction errors are not reducing any more. That
means two years’ air temperature data is enough to the studied
-220
problem. The prediction results of GPR are much better than that
-240 of MLR when air temperature data are adopted for modeling. Con-
0 20 40 60 80 100 vergence process curves for optimizing hyperparameters of GPR
Number of function evaluations using two years’ air temperature (GPR-S4 ) are illustrated in Fig. 13.
The adopted optimization algorithm converges quickly even as the
Fig. 11. Convergence process curves for optimizing hyperparameters
of GPR using harmonic functions by setting initial parameters as
dimension of input variables is much increased. The performance
fθ; ln σn g ¼ f0; : : : ; 0; lnð0.1Þg.
of GPR-S4 model is also shown in Fig. 14. The GPR model fits
better than the MLR-HST model.
4 4
Predicted value
Predicted value
2 2 Measured value
Measured value
0 0
Displacement (mm)
Displacement (mm)
-2 -2
-4 -4
-6 -6
-8 -8
-10 -10
-12 -12
1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2007 2008 2009
(a) Time (year) (b) Time (year)
Fig. 12. Performance of harmonic functions based GPR (SEard)-1 model: (a) training; and (b) test.
Table 4. Displacement prediction errors of GPR models when harmonic functions are adopted for temperature effect simulation
Training Testing
Covariance functions AEmax AEmean RMSE R AEmax AEmean RMSE R
SEiso 0.5528 0.1555 0.1973 0.9980 1.2696 0.5418 0.6538 0.9895
SEard 0.5667 0.1604 0.2044 0.9978 1.1657 0.4871 0.5858 0.9897
RQiso 0.5444 0.1518 0.1935 0.9981 1.3034 0.5777 0.6857 0.9893
RQard 0.5676 0.1604 0.2043 0.9978 1.1742 0.4913 0.5904 0.9897
Matérn (d ¼ 1) 0.0000 0.0000 0.0000 1.0000 1.7103 0.6224 0.7665 0.9853
Matérn (d ¼ 3) 0.4661 0.1292 0.1668 0.9986 1.3744 0.5770 0.6835 0.9881
Matérn (d ¼ 5) 0.5231 0.1453 0.1864 0.9982 1.3089 0.5749 0.6794 0.9889
PPiso (v ¼ 0) 0.0000 0.0000 0.0000 1.0000 1.7099 0.6221 0.7662 0.9853
PPiso (v ¼ 1) 0.4671 0.1295 0.1671 0.9985 1.3693 0.5744 0.6805 0.9881
PPiso (v ¼ 2) 0.5288 0.1470 0.1883 0.9982 1.2955 0.5647 0.6697 0.9891
PPiso (v ¼ 3) 0.5453 0.1522 0.1939 0.9980 1.2817 0.5539 0.6623 0.9894
Periodic 0.5496 0.1546 0.1963 0.9980 1.2921 0.5703 0.6812 0.9894
LINard 1.9706 0.5181 0.6738 0.9783 3.0911 1.2200 1.5365 0.9126
LINone 0.8766 0.2487 0.3080 0.9951 1.3896 0.6200 0.7278 0.9858
Poly (d ¼ 1) 0.8729 0.2490 0.3082 0.9951 1.3811 0.6193 0.7270 0.9859
Poly (d ¼ 2) 0.5781 0.1642 0.2071 0.9978 1.1882 0.4965 0.6062 0.9900
Poly (d ¼ 3) 0.5712 0.1585 0.2019 0.9979 1.1784 0.5049 0.6084 0.9896
Poly (d ¼ 4) 0.5719 0.1577 0.2013 0.9979 1.1819 0.5052 0.6074 0.9896
Poly (d ¼ 11) 0.5693 0.1563 0.2000 0.9979 1.1929 0.5089 0.6091 0.9894
Poly (d ¼ 12) 0.5744 0.1573 0.2011 0.9979 1.1834 0.5035 0.6044 0.9896
NNone 0.5621 0.1598 0.2020 0.9979 1.1923 0.4847 0.5910 0.9901
Note: For Poly (d ¼ 12), the hyperparameters were initialized as fθ; ln σn g ¼ f−1; −1; lnð0.1Þg. The best results are highlighted in boldface. The unit of
AEmax , AEmean , and RMSE is mm.
Performances of different covariance functions are also studied boldface and are obtained by the SEard covariance function. The
when air temperature is adopted. Initial values and optimization tests AEmax , AEmean , and RMSE of air temperature based GPR
results for hyperparameters of GPR models with typical covariance models are 0.8924, 0.2947, and 0.3726, which are much smaller
functions are given in Table 8. Performances of various covariance than the values 1.1657, 0.4871, and 0.5858 obtained by the GPR
functions are listed in Table 9. The best results are highlighted in model utilizing harmonic sinusoidal functions.
Table 7. Performance of GPR (SEard) models using different air temperature variable sets
Training Test
Air temperature
variable sets AEmax AEmean RMSE R AEmax AEmean RMSE R
S01 0.6817 0.1196 0.1603 0.9987 1.3119 0.6262 0.7348 0.9918
S02 0.7146 0.1254 0.1649 0.9986 1.3375 0.6461 0.7531 0.9928
S03 0.6496 0.1171 0.1553 0.9987 1.2650 0.6472 0.7499 0.9926
S1 (data for 1/3 year) 0.6677 0.1141 0.1532 0.9988 1.3908 0.5620 0.6850 0.9910
S2 (data for 1/2 year) 0.5098 0.0924 0.1217 0.9992 1.1568 0.4099 0.5239 0.9922
S3 (data for 1 year) 0.4988 0.0912 0.1209 0.9992 1.0656 0.4299 0.5204 0.9929
S4 (data for 2 years) 0.2153 0.0611 0.0774 0.9997 0.8924 0.2947 0.3726 0.9951
S5 (data for 3 years) 0.1448 0.0443 0.0557 0.9998 0.9966 0.3359 0.4359 0.9932
Note: The best results are highlighted in boldface. The unit of AEmax , AEmean , and RMSE is mm.
GPR (SEard)
-80 GPR (RQard) provement rates of the proposed approach to other models are
given in Table 11. The GPR-HST model performs much better than
-120 MLR-HST model. The GPR-S4 model performs the best since the
obtained residuals and prediction errors are much smaller than that
-160 of GPR-HST and MLR-HST models.
-240
In the section “Concrete Gravity Dam Description and Data Prepa-
ration,” one sample is identified as outlier. To see the robustness of
-280
different models to outliers, different models are established using
0 20 40 60 80 100 data with the outlier. The simulation results of different models us-
Number of function evaluations ing data with and without the outlier are listed in Table 12, and the
performance of the GPR–S4 model using data with the outlier is
Fig. 13. Convergence process curves for optimizing hyperparameters shown in Figs. 17 and 18. The best result in Table 12 is highlighted
of GPR using two years’ air temperature by setting initial parameters as in boldface. The prediction accuracy comparisons among different
fθ; ln σn g ¼ f0; : : : ; 0; lnð0.1Þg. models are given in Table 13.
The difference between different versions of models is small.
Although certain data are identified as outliers, they may contain
some important information about the dam conditions. But a gen-
In Table 9, the RMSE in the test period is nearly five times that eral conclusion cannot be drawn from the analysis carried out, as
obtained for training. The test error is much larger than the training only one outlier with a small deviation was detected. In practice, the
error mainly because the test samples are not involved in the model models established using data with and without outliers can be
establishment. The GPR (SEard) model performance can be af- compared to see whether the outliers should be deleted from the
fected by the initial hyperparameters. To test if there was an over- original data set.
fitting problem, different initial parameters were tested. The results
are given in Table 10. The best results are highlighted in boldface. Conclusion
Most of the parameter sets can obtain a similar test accuracy. In the
last two rows, it seems the model is overfitting with unsuitable ini- In this paper, a long-term air temperature based GPR model is pre-
tialization. The initialization method for hyperparameters described sented for displacement prediction of concrete gravity dams. First,
0 0
Displacement (mm)
Displacement (mm)
-2 -2
-4 -4
-6 -6
-8 -8
-10 -10
Downloaded from ascelibrary.org by East Carolina University on 12/06/19. Copyright ASCE. For personal use only; all rights reserved.
-12 -12
1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2007 2008 2009
(a) Time (year) (b) Time (year)
Fig. 14. Performance of GPR model using two’ years air temperature: (a) training; and (b) test.
Table 8. Initial values and optimization results for hyperparameters of GPR using two years’ air temperature
Models Initial values for fθ; ln σn g Optimized values
GRP (SEiso) fln l; ln σf ; ln σn g ¼ f0; 0; lnð0.1Þg f0.532; −1.085; −4.240g
GRP (RQiso) fln l; ln σf ; ln α; ln σn g ¼ f0; 0; 0; lnð0.1Þg f1.026; −0.631; −0.717; −4.299g
GPR (SEard) fln l1 ; : : : ; ln l22 ; ln σf ; ln σn g ¼ f−1; : : : ; −1; 0; lnð0.1Þg f3.920; 3.770; −0.660; 1.756; 0.287; 0.652; 5.846; 1.033; −0.427;
5.617; 6.192; −0.392; 2.676; 0.103; 4.703; 3.181; 2.935; −0.544;
0.029; 0.428; 1.847; 0.502; −1.454; −4.374g
GRP (RQard) fln l1 ; : : : ; ln l22 ; ln σf ; ln α; ln σn g ¼ f0; : : : ; 0; 0; 0; lnð0.1Þg f2.301; 1.911; −0.370; 2.422; 0.629; 1.049; 1.776; 3.440; −0.025;
3.439; 2.504; −0.071; 0.195; 0.533; 3.316; 1.159; 0.940; −0.234;
0.741; 3.300; 2.809; 0.545; −1.170; 0.805; −4.416g
Matérn (d ¼ 5) fln l; ln σf ; ln σn g ¼ f0; 0; lnð0.1Þg f1.117; −0.797; −4.290g
Note: For the SEard function, the parameters are initialized as fln l1 ; : : : ; ln l22 ; ln σf ; ln σn g ¼ ½−1; : : : ; −1; 0; lnð0.1Þ.
Table 9. Displacement prediction performances of GPR models when S4 is adopted for temperature effect simulation
Training Testing
Covariance
functions AEmax AEmean RMSE R AEmax AEmean RMSE R
SEiso 0.3043 0.0688 0.0887 0.9996 1.1218 0.3810 0.4876 0.9947
SEard 0.2153 0.0611 0.0774 0.9997 0.8924 0.2947 0.3726 0.9951
RQiso 0.2628 0.0617 0.0802 0.9997 1.1069 0.4217 0.5159 0.9939
RQard 0.2462 0.0584 0.0753 0.9997 1.1049 0.3480 0.4416 0.9949
Matérn (d ¼ 1) 0.0000 0.0000 0.0000 1.0000 1.2602 0.5234 0.6323 0.9926
Matérn (d ¼ 3) 0.2269 0.0502 0.0645 0.9998 1.1068 0.4251 0.5263 0.9936
Matérn (d ¼ 5) 0.2536 0.0618 0.0799 0.9997 1.0737 0.4112 0.5059 0.9941
PPiso (v ¼ 0) 0.0000 0.0000 0.0000 1.0000 1.2600 0.5233 0.6322 0.9926
PPiso (v ¼ 1) 0.2287 0.0506 0.0651 0.9998 1.1031 0.4235 0.5244 0.9936
PPiso (v ¼ 2) 0.2586 0.0627 0.0810 0.9997 1.0757 0.4054 0.5006 0.9943
PPiso (v ¼ 3) 0.2790 0.0644 0.0836 0.9996 1.0928 0.3961 0.4937 0.9945
Periodic 0.2758 0.0624 0.0802 0.9997 1.1444 0.4195 0.5179 0.9938
LINard 0.7883 0.2553 0.3191 0.9947 1.9032 0.7923 0.9385 0.9877
LINone 0.7843 0.2436 0.3073 0.9951 1.9415 0.7812 0.9276 0.9882
Poly (d ¼ 1) 0.7703 0.2410 0.3041 0.9952 1.9558 0.7811 0.9280 0.9893
Poly (d ¼ 2) 0.4162 0.1198 0.1519 0.9988 1.1328 0.4317 0.5194 0.9937
Poly (d ¼ 3) 0.4211 0.0993 0.1289 0.9991 1.2080 0.4263 0.5265 0.9936
Poly (d ¼ 4) 0.4051 0.0964 0.1249 0.9992 1.2018 0.4237 0.5242 0.9935
Poly (d ¼ 5) 0.3978 0.0960 0.1241 0.9992 1.1970 0.4214 0.5221 0.9935
Poly (d ¼ 11) 0.2545 0.0553 0.0736 0.9997 1.0328 0.3994 0.5049 0.9921
Poly (d ¼ 12) 0.3872 0.0969 0.1248 0.9992 1.1882 0.4165 0.5176 0.9936
NNone 0.2709 0.0657 0.0855 0.9996 1.1548 0.4142 0.5254 0.9952
Note: For the SEard function, the initial parameters are set as (ln l1 ; : : : ; ln l22 ; ln σf ; ln σn Þ ¼ ½−1; : : : ; −1; 0; lnð0.1Þ. For Poly (d ¼ 12), the initial
parameters are set as ðln c; ln σf ; ln σn Þ ¼ ½−1; 0; lnð0.1Þ. The best results are highlighted in boldface. The unit of AEmax , AEmean , and RMSE is mm.
2 1.5
MLR - HST MLR - HST
1.5 GPR - HST GPR - HST
1 GPR - S4
GPR - S4
1
0.5
Residual (mm)
Residual (mm)
0.5
0
0
-0.5
-0.5
-1 -1
-1.5 -1.5
1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2007 2008 2009
(a) Time (year) (b) Time (year)
Fig. 15. Prediction residuals of different models: (a) training; and (b) test.
2.0 2.0
1.790 MLR - HST MLR - HST
GPR - HST GPR - HST
1.6 GPR - S4 1.6 GPR - S4
Value of prediction error
1.421
0.892
0.8 0.8 0.726
0.615 0.616 0.586
0.567
0.496 0.487
0.4 0.4 0.373
0.295
0.215 0.204
0.160
0.061 0.077
0.0 0.0
AE max AE mean RMSE AE max AE mean RMSE
(a) (b)
Fig. 16. Performance comparison of different models using data without outlier: (a) on training data; and (b) on test data.
GPR is introduced to establish the displacement model for safety verified on displacement monitoring data of a real case. Mean-
monitoring of concrete gravity dams. Then, long-period air tem- while, the influence of covariance function to GPR performance
perature based variable sets are adopted to simulate the thermal ef- is also studied. The findings through the research are summarized
fect of displacement more accurately. The proposed approach is as follows:
4 4
Predicted value Predicted value
2 Measured value 2 Measured value
0 0
Displacement (mm)
Displacement (mm)
-2 -2
-4 -4
-6 -6
-8 -8
-10 -10
-12 -12
1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2007 2008 2009
(a) Time (year) (b) Time (year)
Fig. 17. Performance of GPR–S4 model using data with outlier: (a) training; and (b) test.
1. GPR is the state-of-the-art tool for nonlinear regression pro- four years, and so on, considering the dam heights, dam shapes,
blems. The introduction of GPR can make dam behavior mod- water depths, and so forth, are noticeably changed for different
eling more effective and convenient. It is demonstrated that GPR dams.
models can capture nonlinear relationships between environmen- 4. Simulation results show that the GPR model using squared
tal variables and the horizontal displacement at the top of the exponential covariance function obtains the best results. For a
concrete dam. preliminary trial, covariance functions SEard and RQard are re-
2. Different from the HST model, long-term air temperature is commended. Other covariance functions, such as polynomial
adopted in the proposed approach to make more precise displa- covariance function, Matérn class of covariance functions, neur-
cement predictions of concrete gravity dams. The air tempera- al network covariance function, and so on, can also be tested to
ture can be obtained from the meteorological station near the obtain the most applicable one for the studied problem.
dam. More accurate prediction results than that of the HST mod- 5. Compared with GPR-HST, the proposed approach improves
el can be obtained without more sensors. the tests AEmax , AEmean , and RMSE by 23.45%, 39.50%, and
3. Although the GPR-S4 model using two years’ data performs the 36.39%, respectively. Compared with MLR-HST, the proposed
best for the studied dam, for another concrete dam, the best fit approach improves the tests AEmax , AEmean , and RMSE by
models may adopt air temperate data of one year, three years, 37.19%, 52.19%, and 48.65%, respectively. The study focuses
Residual (mm)
0.1 0
0 -0.2
-0.4
-0.1
-0.6
-0.2
-0.8
Downloaded from ascelibrary.org by East Carolina University on 12/06/19. Copyright ASCE. For personal use only; all rights reserved.
-0.3 -1
1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2007 2008 2009
(a) Time (year) (b) Time (year)
Fig. 18. Effect of outlier to prediction residuals of GPR–S4 model: (a) training; and (b) test.
Table 13. Prediction accuracy comparison among three models using data with one outlier
Performance Performance Performance
improvement of improvement improvement
Prediction GPR-HST to of GPR-S4 to of GPR-S4 to
accuracy MLR-HST GPR-HST GPR-S4 MLR-HST (%) GPR-HST (%) MLR-HST (%)
Training AEmax 2.668 0.9477 0.3158 64.48 66.68 88.16
Training AEmean 0.5164 0.1676 0.0757 67.54 54.83 85.34
Training RMSE 0.6593 0.2219 0.0976 66.34 56.02 85.20
Test AEmax 1.4353 1.1838 0.849 17.52 28.28 40.85
Test AEmean 0.6345 0.4494 0.2984 29.17 33.60 52.97
Test RMSE 0.7374 0.5419 0.3806 26.51 29.77 48.39
Note: The unit of AEmax , AEmean , and RMSE is mm.
on the horizontal displacement modeling of a concrete gravity (DUT17ZD205 and DUT19LK14), and the Open Research Fund of
dam. The applicability of the proposed method on modeling the State Key Laboratory of Structural Analysis for Industrial Equip-
other device measurements related to the behavior of the dam ment (GZ15207). The open-source toolbox GPML is employed to
needs further verification. perform the analyses of GPR.
6. The horizontal displacement is measured by a vacuum laser
collimation system. According to the technical specification for
concrete dam safety monitoring (Wang et al. 2013), the accuracy References
of displacements measured by the vertical line, the extension
line, and the vacuum laser alignment system can reach from 0.1 Ahmadi-Nedushan, B. 2002. “Multivariate statistical analysis of monitor-
to 0.3 mm. The proposed approach improves the test AEmax ing data for concrete dams.” Ph.D. thesis, Dept. of Civil Engineering
from 1.4208–0.8924 mm, and the RMSE from 0.7256 to and Applied Mechanics, McGill Univ.
0.3726 mm. Therefore, the improvement is meaningful consid- Ardito, R., G. Maier, and G. Massalongo. 2008. “Diagnostic analysis of
ering the resolution of the measuring device. concrete dams based on seasonal hydrostatic loading.” Eng. Struct.
7. The proposed model is tested on horizontal displacement mod- 30 (11): 3176–3185. https://doi.org/10.1016/j.engstruct.2008.04.008.
eling of a concrete gravity dam. The test set has 29 samples that Bui, K. T., D. T. Bui, J. Zou, C. V. Doan, and I. Revhaug. 2018. “A novel
are almost evenly distributed over two years. Therefore, the test hybrid artificial intelligent approach based on neural fuzzy inference
model and particle swarm optimization for horizontal displacement
sample has a certain representativeness. In the future, the pro-
modeling of hydropower dam.” Neural Comput. Appl. 29 (12):
posed approach will require further verification and research for
1495–1506. https://doi.org/10.1007/s00521-016-2666-0.
the cases of more test samples, modeling problems of other mon-
Bukenya, P., P. Moyo, H. Beushausen, and C. Oosthuizen. 2014. “Health
itoring variables and other types of dams. monitoring of concrete dams: A literature review.” J. Civ. Struct. Health
Monit. 4 (4): 235–244. https://doi.org/10.1007/s13349-014-0079-2.
Cheng, L., and D. Zheng. 2013. “Two online dam safety monitoring
Acknowledgments models based on the process of extracting environmental effect.” Adv.
Eng. Software 57 (Mar): 48–56. https://doi.org/10.1016/j.advengsoft
The research is supported by the National Key R & D Program of .2012.11.015.
China (2016YFC0401600 and 2017YFC0404900), the National De Sortis, A., and P. Paoliani. 2007. “Statistical analysis and structural iden-
Natural Science Foundation of China (51769033 and 51779035), tification in concrete dam monitoring.” Eng. Struct. 29 (1): 110–120.
the Fundamental Research Funds for the Central Universities https://doi.org/10.1016/j.engstruct.2006.04.022.
.org/10.1016/j.compgeo.2010.07.012.
-dams-by-country-members. Pérez-Cruz, F., S. V. Vaerenbergh, J. J. Murillo-Fuentes, M. Lazaro-
Jeon, J., J. Lee, D. Shin, and H. Park. 2009. “Development of dam safety Gredilla, and I. Santamaria. 2013. “Gaussian processes for nonlinear
management system.” Adv. Eng. Software 40 (8): 554–563. https://doi signal processing: An overview of recent advances.” IEEE Signal Pro-
.org/10.1016/j.advengsoft.2008.10.009. cess Mag. 30 (4): 40–50. https://doi.org/10.1109/MSP.2013.2250352.
Jia, J. 2016. “A technical review of hydro-project development in China.” En- Prakash, G., A. Sadhu, S. Narasimhan, and J. Brehe. 2018. “Initial service
gineering 2 (3): 302–312. https://doi.org/10.1016/J.ENG.2016.03.008. life data towards structural health monitoring of a concrete arch dam.”
Kang, F., S. Han, R. Salgado, and J. Li. 2015. “System probabilistic Struct. Control Health Monit. 25 (1): e2036. https://doi.org/10.1002/stc
stability analysis of soil slopes using Gaussian process regression with .2036.
Latin hypercube sampling.” Comput. Geotech. 63 (Jan): 13–25. https:// Ranković, V., N. Grujović, D. Divac, and N. Milivojević. 2014. “Develop-
doi.org/10.1016/j.compgeo.2014.08.010. ment of support vector regression identification model for prediction of
Kang, F., J. Li, and J. Dai. 2019b. “Prediction of long-term temperature dam structural behavior.” Struct. Saf. 48 (Mar): 33–39. https://doi.org
effect in structural health monitoring of concrete dams using support /10.1016/j.strusafe.2014.02.004.
vector machines with Jaya optimizer and salp swarm algorithms.” Adv. Ranković, V., N. Grujović, D. Divac, N. Milivojević, and A. Novaković.
Eng. Software 131 (May): 60–76. https://doi.org/10.1016/j.advengsoft 2012. “Modelling of dam behaviour based on neuro-fuzzy iden-
.2019.03.003.
tification.” Eng. Struct. 35 (Feb): 107–113. https://doi.org/10.1016/j
Kang, F., J. Li, and Q. Xu. 2009. “Structural inverse analysis by hybrid .engstruct.2011.11.011.
simplex artificial bee colony algorithms.” Comput. Struct. 87 (13):
Rasmussen, C. E., and H. Nickisch. 2010. “Gaussian processes for machine
861–870. https://doi.org/10.1016/j.compstruc.2009.03.001.
learning (GPML) toolbox.” J. Mach. Learn. Res. 11 (Nov): 3011–3015.
Kang, F., J. Li, S. Zhao, and Y. Wang. 2019a. “Structural health monitoring
Rasmussen, C. E., and C. K. I. Williams. 2006. Gaussian processes for
of concrete dams using long-term air temperature for thermal effect sim-
machine learning. Cambridge, MA: MIT Press.
ulation.” Eng. Struct. 180 (Feb): 642–653. https://doi.org/10.1016/j
Salazar, F., R. Morán, M. Á. Toledo, and E. Oñate. 2017a. “Data-based
.engstruct.2018.11.065.
models for the prediction of dam behaviour: A review and some meth-
Kang, F., J. Liu, J. Li, and S. Li. 2017a. “Concrete dam deformation
odological considerations.” Arch. Comput. Methods Eng. 24 (1): 1–21.
prediction model for health monitoring based on extreme learning
https://doi.org/10.1007/s11831-015-9157-9.
machine.” Struct. Control Health Monit. 24 (10): e1997. https://doi
.org/10.1002/stc.1997. Salazar, F., M. A. Toledo, E. Oñate, and R. Morán. 2015. “An empirical
Kang, F., B. Xu, J. Li, and S. Zhao. 2017b. “Slope stability evaluation using comparison of machine learning techniques for dam behaviour model-
Gaussian processes with various covariance functions.” Appl. Soft Com- ing.” Struct. Saf. 56 (Sep): 9–17. https://doi.org/10.1016/j.strusafe.2015
put. 60 (Nov): 387–396. https://doi.org/10.1016/j.asoc.2017.07.011. .05.001.
Kao, C. Y., and C. H. Loh. 2013. “Monitoring of long-term static defor- Salazar, F., M. Á. Toledo, J. M. González, and E. Oñate. 2017b. “Early
mation data of Fei-Tsui arch dam using artificial neural network-based detection of anomalies in dam performance: A methodology based
approaches.” Struct. Control Health Monit. 20 (3): 282–303. https://doi on boosted regression trees.” Struct. Control Health Monit. 24 (11):
.org/10.1002/stc.492. e2012. https://doi.org/10.1002/stc.2012.
Léger, P., and M. Leclerc. 2007. “Hydrostatic, temperature, time- Shi, Y., J. Yang, J. Wu, and J. He. 2018. “A statistical model of deformation
displacement model for concrete dams.” J. Eng. Mech. 133 (3): during the construction of a concrete face rockfill dam.” Struct. Control
267–277. https://doi.org/10.1061/(ASCE)0733-9399(2007)133:3(267). Health Monit. 25 (2): e2074. https://doi.org/10.1002/stc.2074.
Li, F., Z. Wang, and G. Liu. 2013. “Towards an error correction model Snelson, E. L. 2007. “Flexible and efficient Gaussian process models for
for dam monitoring data analysis based on cointegration theory.” Struct. machine learning.” Ph.D. thesis, Gatsby Computational Neuroscience
Saf. 43 (Jul): 12–20. https://doi.org/10.1016/j.strusafe.2013.02.005. Unit, Univ. College London.
Li, F., Z. Wang, and G. Liu. 2015. “Hydrostatic seasonal state model for Stojanovic, B., M. Milivojevic, M. Ivanovic, N. Milivojevic, and D. Divac.
monitoring data analysis of concrete dams.” Struct. Infrastruct. Eng. 2013. “Adaptive system for dam behavior modeling based on linear
11 (12): 1616–1631. https://doi.org/10.1080/15732479.2014.983528. regression and genetic algorithms.” Adv. Eng. Software 65 (Nov):
Liang, G., Y. Hu, and Q. Li. 2018. “Safety monitoring of high arch dams in 182–190. https://doi.org/10.1016/j.advengsoft.2013.06.019.
initial operation period using vector error correction model.” Rock Stojanovic, B., M. Milivojevic, N. Milivojevic, and D. Antonijevic. 2016.
Mech. Rock Eng. 51 (8): 2469–2481. https://doi.org/10.1007/s00603 “A self-tuning system for dam behavior modeling based on evolving
-017-1287-y. artificial neural networks.” Adv. Eng. Software 97 (Jul): 85–95. https://
Loh, C. H., C. H. Chen, and T. Y. Hsu. 2011. “Application of advanced doi.org/10.1016/j.advengsoft.2016.02.010.
statistical methods for extracting long-term trends in static monitoring Su, H., Z. Wen, X. Sun, and M. Yang. 2015. “Time-varying identification
data from an arch dam.” Struct. Health Monit. 10 (6): 587–601. https:// model for dam behavior considering structural reinforcement.” Struct.
doi.org/10.1177/1475921710395807. Saf. 57 (Nov): 1–7. https://doi.org/10.1016/j.strusafe.2015.07.002.
Mata, J. 2011. “Interpretation of concrete dam behaviour with artificial neu- Su, H. Z., Z. P. Wen, Z. R. Wu, and D. P. Lai. 2005. “Nonlinear com-
ral network and multiple linear regression models.” Eng. Struct. 33 (3): bined monitoring of dam safety.” [In Chinese.] J. Hydraul. Eng. 36 (2):
903–910. https://doi.org/10.1016/j.engstruct.2010.12.011. 197–202.
Mata, J., A. T. de Castro, and J. S. da Costa. 2013. “Time-frequency analy- Tatin, M., M. Briffaut, F. Dufour, A. Simon, and J. P. Fabre. 2015. “Thermal
sis for concrete dam safety control: Correlation between the daily displacements of concrete dams: Accounting for water temperature in