Professional Documents
Culture Documents
Young 2017
Young 2017
Young 2017
a r t i c l e i n f o a b s t r a c t
Article history: Accurate rainfall-runoff modeling during typhoon events is an essential task for natural disaster reduc-
Received 16 May 2016 tion. In this study, a novel hybrid model which integrates the outputs of physically based hydrologic
Received in revised form modeling system into support vector machine is developed to predict hourly runoff discharges in Chis-
30 November 2016
han Creek basin in southern Taiwan. Seven storms (with a total of 1200 data sets) are used for model
Accepted 29 December 2016
Available online 3 January 2017
calibration (training) and validation. Six statistical indices (mean absolute error, root mean square error,
correlation coefficient, error of time to peak discharge, error of peak discharge, and coefficient of effi-
ciency) are employed to assess prediction performance. Overall, superiority of the present approach
Keywords:
Rainfall-runoff especially for a longer (6-h) lead time prediction is revealed through a systematic comparison among
Typhoon events three individual methods (i.e., the physically based hydrologic model, artificial neural network, and
Hydrologic modeling system (HEC-HMS) support vector machine) as well as their two hybrid combinations. Besides, our analysis and in-depth
Support vector regression (SVR) discussions further clarify the roles of physically based and data-driven components in the proposed
Artificial neural network (ANN) framework.
Hybrid approach © 2017 Elsevier B.V. All rights reserved.
http://dx.doi.org/10.1016/j.asoc.2016.12.052
1568-4946/© 2017 Elsevier B.V. All rights reserved.
206 C.-C. Young et al. / Applied Soft Computing 53 (2017) 205–216
2. Description of study site and data collection where Pe and P are the precipitation excess and accumulated rain-
fall depth at time t, respectively, Ia is the initial abstraction (e.g.,
The Chishan Creek basin, located in Kaoping River watershed, infiltration loss), and S is the potential maximum retention. The
southern Taiwan, is an important study area. In typhoon seasons, Soil Conservation Service (SCS) suggested good estimations for Ia
this region frequently suffers from natural hazards (e.g., floods, and S (in SI unit, cm) based on the curve number (CN) derived from
debris flows, and landslides). One of the most serious tragedies empirical data, i.e., Ia = 0.2 S and
is the destruction of an entire village (500 people) in 2009 [40]. 2, 540
S= − 25.4. (2)
The Chishan Creek basin has a total area of 842 km2 with a mean CN
elevation of 824 m and an average slope of 0.4374 m/m. The main Note that the curve number (CN) with a range from 1 to 100 is a
stream is about 118 km long while the stream order of the Chis- function of land use, land cover types, and hydrological soil group
han Creek is up to 5. The rainfall-runoff data were collected by [42].
the Water Resources Agency, Taiwan. While the Sin-Gao-Kou and The storm hydrograph (Q) is obtained by convolving the pre-
Min-Zu stations record the rainfall, the Nan-Fong Bridge station cipitation increments (P) with unit hydrograph ordinates (U), i.e.,
measures the discharge at the outlet of Chishan Creek basin (see
n≤M
Fig. 1). Fig. 2 presents the rainfall and discharge variations dur- Qn = Pm Un−m+1 , where m increases from 1 to n. In practice, the
ing Typhoon Bilis in 2006 and Typhoon Sepat in 2007. The different m=1
patterns at two rain-gauge stations indicate the significant regional translation of excess precipitation to direct runoff can be mod-
and topographic effects on precipitation in this basin [22]. eled using modified Clark unit hydrograph (ModClark) method [43].
C.-C. Young et al. / Applied Soft Computing 53 (2017) 205–216 207
Fig. 2. Measured rainfall and discharge data during the Bilis and Sepat typhoons: (a) and (d) are the rainfall at the Sin-Gao-Kou station, (b) and (e) are the rainfall at the
Min-Zu station, and (c) and (f) are the runoff (discharge) at the Nan-Fong Bridge station.
Based upon the time of concentration (tc ) and maximum traveling outflow at time level t is advanced from the information at the
distance (dmax ), travel time (or translation lag) to each watershed previous time step t − 1 (see Eq. (3)). To obtain reasonable predic-
cell (tcell ) can be calculated, i.e., tcell = tc (dcell /dmax ), where dcell is the tion performance, parameters (constants) in the equations above
distance from the cell to basin outlet. The cell outflow hydrograph should be properly estimated or tuned based upon the watershed
is then routed using a linear reservoir concept [41], i.e., characteristics and the observed data during model calibration. For
example, a larger CN indicates smaller retention (e.g., CN = 98 for
t t
Q (t) = I(t) + [1 − ]Q (t − 1), (3) an almost impervious condition), yielding higher excess rainfall to
R + 0.5t R + 0.5t form surface runoff. Among various watersheds in Taiwan, the time
where Q(t) and Q(t − 1) are the outflows at current and previous of concentration typically ranges from 1.5 to 5.5 h [22,25]. The stor-
time levels t and t − 1, respectively; I(t) is the average inflow at age coefficient is tuned to reflect the rainfall excess partly stored
time t; R is the storage coefficient used to represent discharge in a watershed. The baseflow has an exponential decay constant
attenuation; and t is the time increment. Besides, the baseflow with the Order of O(0.01)–O(0.5). After model calibration, all the
is taken into account using an exponential decrease function [44], hydrological simulations were carried out with the fixed parame-
i.e., Q = Q0 e−kt , where Q0 is the averaged initial baseflow before a ters CN = 83.5, tc = 3.2 h, R = 4.15, and k = 0.35. The time step was set
storm; k is an exponential decay constant. The Muskingum-Cunge as 1 h. Besides, sensitivity analysis indicated that CN is the major
Standard Section method was employed for channel routing. factor affecting the simulated results.
The HEC-HMS model can be applied to simulate or forecast the
river runoff for the historical or current typhoon events as the pre- 3.2. The HEC-HMS-ANN hybrid model
cipitation data can be obtained from the field measurement records
or numerical weather predictions. A whole streamflow hydrograph In this study, a HEC-HMS-ANN hybrid model was developed by
is then obtained after the recursive time integration, where the combining the popular back-propagation artificial neural network
208 C.-C. Young et al. / Applied Soft Computing 53 (2017) 205–216
Fig. 3. Architecture of (a) the neural network (NN) and (b) support vector machine
(SVM). 1 T Nd
1 2
P L Nd
CNN = el (p), (5) − yi (˛i − ˛j ) − ε(˛i + ˛j ) , (8a)
P
p=1 l=1 i=1
C.-C. Young et al. / Applied Soft Computing 53 (2017) 205–216 209
subject to
Nd
(˛i − ˛j ) = 0 , (8b)
i=1
0 ≤ ˛i , ˛j ≤ C, i = 1, 2, ..., Nd
where ˛ and ˛ are the Lagrange multipliers that can be solved using
the quadratic programming algorithm. Finally, the estimation can
be rewritten as
Nd
Nd
1
N
MAE = |(Qm )i − (Qo )i |, (10)
N 4. Results and discussion
i=1
N In this study, we applied several rainfall-runoff prediction
1
RMSE =
approaches in the Chishan Creek basin in southern Taiwan during
[(Qm )i − (Qo )i ]2 , (11)
N the historical storm events. The prediction lead time ranges from
i=1
t + 1 to t + 6 h (based on a recursive method). Note that the rainfalls
were adopted from the field measured data (and can be obtained
N
1
[(Qm )i − Qm ][(Qo )i − Qo ] from the numerical weather predictions in the practical operation).
N
In the following, the results predicted from each individual model
R= i=1
, (12) (i.e., HEC-HMS, ANN, and SVR) and the hybrid models (i.e., HEC-
N N
2 2 HMS-ANN and HEC-HMS-SVR) are presented and discussed.
1 [(Qm )i − Qm ] 1 [(Qo )i − Qo ]
N N
i=1 i=1 4.1. Event-based rainfall-runoff modeling using HEC-HMS
ETp = |Tm,p − To,p |, (13)
We considered a total of seven storm events, where the June
(Qm,p − Qo,p ) 9 Flood (2006), Typhoon Sepat (2007), Typhoon Kamegi (2008),
EQp = × 100%, (14)
Qo,p and Typhoon Jangmi (2008) were chosen for model calibration; and
Typhoon Bilis (2006), Typhoon Krosa (2007), and Typhoon Sinlaku
N
N
2
(2008) were adopted for model validation.
CE = 1 − [(Qo )i − (Qm )i ]2 / [(Qo )i − Qo ] , (15)
The model calibration and validation results for the HEC-HMS
i=1 i=1
model are shown in Figs. 4(a) and 5 respectively. During the cal-
where N is the total number of data points; Qm and Qo is the mod- ibration phase, the reasonable runoff prediction can be obtained.
eled and observed runoff discharge, respectively; Tm,p and To,p are For the validation events, the HEC-HMS model did not fully cap-
the peak time for the modeled and observed peak runoff discharge, ture the runoff patterns (rising/falling limbs), although the time
respectively; Qm,p and Qo,p are the modeled and observed peak and discharge of predicted peak flow are still acceptable. Further,
runoff discharges, respectively; the overbar indicates the mean Table 1 evaluates the prediction performance for model validation
value. events (since excellent results should be achieved through the cali-
210 C.-C. Young et al. / Applied Soft Computing 53 (2017) 205–216
Table 1
Prediction performance of HEC-HMS, ANN, and hybrid models.
MAE (m3 /s) RMSE (m3 /s) R ETP (h) EQP (%) CE
bration process). The MAE, RMSE, R, ETp , EQp , and CE are 50.8 m3 /s,
70.3 m3 /s, 0.937, 0 h, −0.5%, and 0.841 for Typhoon Bilis, respec-
tively. Notice that the worst CE occurs in Typhoon Sinlaku, i.e.,
CE = 0.826.
Table 2
Prediction performance of HEC-HMS, SVR, and hybrid models.
MAE (m3 /s) RMSE (m3 /s) R ETP (h) EQP (%) CE
Fig. 6. Comparison of the observed and simulated runoff discharge for the Nan-Fong
Bridge gauge station for the validation phase using the ANN model with 1h-ahead
prediction (a) Typhoon Bilis, (b) Typhoon Krosa, and (c) Typhoon Sinlaku.
Fig. 7. Comparison of the observed and simulated runoff discharge for the Nan-Fong
Bridge gauge station during Typhoon Bilis using the ANN model with (a) 1 h-, (b)
2 h-, (c) 4 h-, and (d) 6 h-ahead prediction.
212 C.-C. Young et al. / Applied Soft Computing 53 (2017) 205–216
Fig. 8. Comparison of the observed and simulated runoff discharge for the Nan-Fong
Bridge gauge station for the validation phase using the SVR model with 1h-ahead
prediction (a) Typhoon Bilis, (b) Typhoon Krosa, and (c) Typhoon Sinlaku. Fig. 9. Comparison of the observed and simulated runoff discharge for the Nan-Fong
Bridge gauge station during Typhoon Bilis using the SVR model with (a) 1 h-, (b) 2 h-,
(c) 4 h-, and (d) 6 h-ahead prediction.
the SVR model demonstrates a superior prediction capability, i.e.,
less accumulated errors with the increased lead time (as shown in
Fig. 9) The MAE, RMSE, R, ETp , EQp , and CE for the 6-h-ahead runoff additional input from HEC-HMS predictions effectively reduces the
prediction during Typhoon Bilis are 26.6 m3 /s, 49.1 m3 /s, 0.960, 1 h, accumulated errors in the data-driven models associated with the
−5.1%, and 0.922, respectively (see Table 2). increased lead time. When integrating a physically based compo-
nent into ANN (or SVR), the maximum improvements in the root
4.3. Runoff prediction using the hybrid models mean square error (RMSE), correlation coefficient (R) and model
efficiency coefficient (CE) are 28.5%, 3.4%, and 6.8% (or 20.8%, 1.4%,
Last, Figs. 4(d) and 10 compare the 1-h-ahead forecasts and 2.5%), respectively.
from HEC-HMS-ANN and HEC-HMS-SVR hybrid models in train-
ing and validation phases. The prediction and the observation 4.4. Comparison and discussion of model predictions
are in excellent agreement. The ranges of MAE, RMSE, R, ETp ,
EQp , and CE among the validation events (see Tables 1 and 2) To apparently show the forecasting performance, Figs. 12 and 13
are 7.3–15.0 m3 /s, 15.1–29.9 m3 /s, 0.986–0.997, 1 h, 1.1–10.3%, further compare the observation-prediction pairs of hourly runoff
and 0.971–0.994 for the HEC-HMS-SVR model, respectively discharge for all the calibration (or training) and validation events,
(7.8–16.3 m3 /s, 15.3–31.4 m3 /s, 0.984–0.997, 1 h, 2.5–7.9%, and respectively. According to the spread of the scattering, the 1-h-
0.968–0.994 for the HEC-HMS-ANN model). ahead runoff discharge predictions by the data-driven and hybrid
Fig. 11 also shows the 1, 2, 4, and 6-h-ahead runoff predic- models (i.e., ANN, SVR, HEC-HMS-ANN, and HEC-HMS-SVR) are
tions from both hybrid models during Typhoon Bilis. In general, superior to that obtained by the HEC-HMS model. While the HEC-
the simulated patterns remain quite close to the measured data, HMS model yields the correlation coefficient R = 0.946 (0.931) in
reasonably capturing the discharge and rise time of the peak calibration (validation), the rest models give an almost perfect
flow. The HEC-HMS-SVR model presents great accuracy even in R over 0.990. In terms of their linear regressions, the predic-
the 6-h-ahead forecasting with MAE = 24.6 m3 /s, RMSE = 41.4 m3 /s, tions of HEC-HMS model averagely give a 5–9% underestimation
R = 0.973, ETp = 1 h, EQp = −5.6%, and CE = 0.945 (see Table 2 for the of the observations, i.e., y = 0.948 x + 0.64 for calibration and
performance assessment). In addition, the results from the HEC- y = 0.912 x − 3.9 for validation. The slopes of regression mR for the
HMS-SVR model is much better than those by the HEC-HMS-ANN data-driven and hybrid models are 0.986–0.998 (i.e., around or less
model (i.e., MAE = 32.3 m3 /s, RMSE = 53.9 m3 /s, R = 0.954, ETp = 27 h, 1% error).
EQp = 0.5%, and CE = 0.906). Figs. 14–16 show the scatter plots for the n-h-ahead forecast-
Further, Table 3 compares the performances of 6-h-ahead pre- ing results from the ANN, SVR, hybrid models, respectively. The
dictions among the hybrid and data-driven only models. The ANN and SVR algorithms present different generalization capabil-
C.-C. Young et al. / Applied Soft Computing 53 (2017) 205–216 213
Table 3
Assessment of prediction improvement for 6-h ahead runoff discharge.
Fig. 10. Comparison of the observed and simulated runoff discharge for the Nan-
Fong Bridge gauge station for the validation phase using the HEC-HMS-ANN and
HEC-HMS-SVR hybrid models with 1h-ahead prediction: (a) Typhoon Bilis, (b) Fig. 11. Comparison of the observed and simulated runoff discharge for the Nan-
Typhoon Krosa, and (c) Typhoon Sinlaku. Fong Bridge gauge station during Typhoon Bilis using the HEC-HMS-ANN and HEC-
HMS-SVR hybrid models with (a) 1 h-, (b) 2 h-, (c) 4 h-, and (d) 6 h-ahead prediction.
Fig. 14. Scatter plots of predicted and measured runoff discharges using the ANN
model with (a) 2 h-, (b) 4 h-, and (c) 6 h-ahead prediction, where the solid lines
Fig. 12. Scatter plots of predicted and measured runoff discharges for (a) calibration
represent linear regression of the prediction-measurement pairs (circles).
of the HEC-HMS model, (b) training of the ANN model, (c) training of the SVR model,
and (d) training of the HEC-HMS-ANN and HEC-HMS-SVR hybrid models, where
the lines indicate linear regression of the prediction-measurement pairs (symbols).
(HEC-HMS-ANN: x-marks & dashed lines and HEC-HMS-SVR: circles and solid lines).
Fig. 15. Scatter plots of predicted and measured runoff discharges using the SVR
model with (a) 2 h-, (b) 4 h-, and (c) 6 h-ahead prediction, where the solid lines
represent linear regression of the prediction-measurement pairs (circles).
Fig. 13. Scatter plots of predicted and measured runoff discharges for (a) validation were implemented into the structure of ANN and SVR (as connec-
of the HEC-HMS model, (b) validation of the ANN model, (c) validation of the SVR tion weights between hidden and output layers rather than inputs
model, and (d) validation of the HEC-HMS-ANN and HEC-HMS-SVR models, where in this study). Overall, our thorough analysis indicated that the
the lines indicate linear regression of the prediction-measurement pairs (symbols).
combination of HEC-HMS and SVR models yields the most accurate
(HEC-HMS-ANN: x-marks & dashed lines and HEC-HMS-SVR: circles and solid lines).
runoff discharge predictions for the Chishan Creek basin.
C.-C. Young et al. / Applied Soft Computing 53 (2017) 205–216 215
Acknowledgments
References
[27] M. Bray, D. Han, Identification of support vector machines for runoff [39] G. Napolitano, L. See, B. Calvo, F. Savi, A. Heppenstall, A conceptual and neural
modeling, J. Hydroinform. 6 (4) (2004) 265–280. network model for real-time flood forecasting of the Tiber River in Rome,
[28] C.L. Wu, K.W. Chau, Y.S. Li, River stage prediction based on a distributed Phys. Chem. Earth 35 (3–5) (2010) 187–194.
support vector regression, J. Hydrol. 358 (1–2) (2008) 96–111. [40] C.Y. Tsou, Z.Y. Feng, M. Chigira, Catastrophic landslide induced by Typhoon
[29] R. Maity, P.P. Bhagwat, A. Bhatnagar, Potential of support vector regression for Morakot, Shiaolin, Taiwan, Geomorphology 127 (2011) 166–178.
prediction of monthly streamflow using endogenous property, Hydrol. [41] US Army Corps Engineers, Hydrologic Modeling System (HEC-HMS)
Process. 24 (7) (2010) 917–923. Application Guide: Version 3.1.0, Institute for Water Resources, Hydrologic
[30] M.C. Wu, G.F. Lin, H.Y. Lin, Improving the forecasts of extreme streamflow by Engineering Center, Davis, CA, 2008.
support vector regression with the data extracted by self organizing map, [42] US Soil Conservation Service Urban Hydrology for Small Watershed (technical
Hydrol. Process. 28 (2) (2014) 386–397. Release 55), US Department of Agriculture, 1986.
[31] S.M. Hosseini, N. Mahjouri, Integrating support vector regression and a [43] C.O. Clark, Storage and the unite hydrograph, Trans. Am. Soc. Civ. Eng. 110
geomorphologic artificial neural network for daily rainfall-runoff modeling, (1945) 1419–1488.
Appl. Soft Comput. 38 (2016) 329–345. [44] V.T. Chow, D.R. Maidment, L.W. Mays, Applied Hydrology, McGraw-Hill, New
[32] V.N. Vapnik, An overview of statistical learning theory, IEEE Trans. Neural York, 1988.
Netw. 10 (5) (1999) 988–999. [45] D.E. Rumelhart, G.E. Hinton, R.J. Williams, Learning representations by
[33] A.Y. Shamseldin, K.M. O’Connor, A non-linear neural network technique for back-propagating errors, Nature 323 (1986) 533–536.
updating of river flow forecasts, Hydrol. Earth Syst. Sci. 5 (4) (2001) 577–597. [46] M.T. Hagan, M. Menhaj, Training feedforward networks with the Marquardt
[34] L. Xiong, K.M. O’Connor, Comparison of four updating models for real-time algorithm, IEEE Trans. Neural Netw. 5 (6) (1994) 989–993.
river flow forecasting, Hydrol. Sci. J. 47 (4) (2002) 621–639. [47] C.C. Young, C.H. Wu, An efficient and accurate non-hydrostatic model with
[35] A.Y. Shamseldin, K.M. O’Connor, A.E. Nasr, A comparative study of three embedded Boussinesq-type like equations for surface wave modeling, Int. J.
neural network forecast combination methods for simulated river flows of Numer. Methods Fluids 60 (1) (2009) 27–53.
different rainfall-runoff models, Hydrol. Sci. J. 52 (5) (2007) 896–916. [48] M.P. Clark, D.E. Rupp, R.A. Woods, X. Zheng, R.P. Ibbitt, A.G. Slater, J. Schmidt,
[36] A.K. Fernando, A.Y. Shamseldin, R.J. Abrahart, Use of gene expression M.J. Uddstrom, Hydrological data assimilation with the ensemble Kalman
programming for multimodel combination of rainfall-runoff models, J. filter: use of streamflow observations to update states in a distributed
Hydrol. Eng. ASCE 17 (9) (2012) 975–985. hydrological model, Adv. Water Resour. 31 (10) (2008) 1309–1324.
[37] B. Zhang, R.S. Govindaraju, Geomorphology-based artificial neural networks [49] H.K. McMillan, E.O. Hreinsson, M.P. Clark, S.K. Singh, C. Zammit, M.J.
(GANNs) for estimation of direct runoff over watersheds, J. Hydrol. 273 (2003) Uddstrom, Operational hydrological data assimilation with the recursive
18–34. ensemble Kalman filter, Hydrol. Earth Syst. Sci. 17 (1) (2013) 21–38.
[38] R.K. Panda, N. Pramanik, B. Bala, Simulation of river stage using artificial
neural network and MIKE 11 hydrodynamic model, Comput. Geosci. 36 (6)
(2010) 735–745.