Using Non-Linear Garch Model To Predict Silicon Content in Blast Furnace Hot Metal

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 6

Asian Journal of Control, Vol. 10, No. 6, pp.

632 637, November 2008


Published online in Wiley InterScience (www.interscience.wiley.com) DOI: 10.1002/asjc.064

USING NON-LINEAR GARCH MODEL TO PREDICT SILICON


CONTENT IN BLAST FURNACE HOT METAL
Jiu-sun Zeng, Chuan-hou Gao, Xiang-guan Liu, Ke-ping Yang, and Shi-hua Luo

ABSTRACT

Forecasting of silicon content in blast furnace (BF) hot metal has always
been an important tool in the control of iron-making process. To get an accurate
prediction of silicon content is an urgent task for BF operators. The approach
based on generalized autoregressive conditional heteroskedastic (GARCH)
has been introduced to predict step-ahead silicon content in BF hot metal.
The algorithm has been explained in detail and simulation results have been
analyzed from different criteria. It is shown that the algorithm gives good
results and is helpful for practical production.

Key Words: Blast Furnace, silicon content, ARMA, generalized autoregres-


sive conditional heteroskedastic (GARCH).

I. INTRODUCTION has been applied to the same problem. Most of these


models can give good results when the volatility of sili-
Forecasting of silicon content in Blast Furnace con content is low, but in real applications, silicon con-
(BF) hot metal has always been an important tool in tent changes very fast and it is found that time series
control of iron-making process. It is a key index to eval- methods are among the most effective.
uating the status of BF and the quality of iron. To in- An advanced time series method, the generalized
crease the output of iron and save energy, BF operators autoregressive conditional heteroskedastic (GARCH)
make extensive use of silicon content prediction in their model [4] proposed by Bollerslev in 1986, has become
decision-making. So it is of great importance to make popular in recent years. It considers the moments of a
precise predictions of silicon content. time series as variant (i.e., the error term: real value
In recent years, several methods have been applied minus forecasted value does not have zero mean and
to predict silicon content in hot metal. For example, constant variance as with an ARMA process). The er-
fuzzy controller [1] and artificial neural networks [2] ror term is assumed to be serially correlated that can be
have been tested. In addition, the chaotic method [3] modeled by an ARMA process. Thus, a GARCH pro-
cess measuring the conditional mean and conditional
variance can be used to do forecasts.
In this paper we focus our attention on one-step-
ahead forecast of silicon content. To illustrate our
Manuscript received April 18, 2006; accepted October 29, model, data of silicon content from Handan Steel are
2007. presented for analysis. The forecasts of the GARCH
Jiu-sun Zeng, Chuan-hou Gao, Xiang-guan Liu, and Ke- model are evaluated from different criteria.
ping Yang are with the Department of Mathematics, Zhejiang
The outline of our paper is as follows. In Section
University, Hangzhou 310027, China. Chuan-hou Gao is the
corresponding author (e-mail: gaochou@zju.edu.cn).
II, the general GARCH method is discussed. Section III
Shi-hua Luo is with the School of Information Management, describes the predictive algorithm in detail. Section IV
Jiangxi University of Finance and Economics, Nanchang is our numerical results and analysis and Section V
331305, China. states our conclusion.

q 2008 John Wiley and Sons Asia Pte Ltd and Chinese Automatic Control Society
J.-s. Zeng et al.: Using Non-linear GARCH Model to Predict Silicon Content in Blast Furnace Hot Metal 633

II. GARCH METHOD where 2 = 1 and t is basically a white noise process.

2.1 The GARCH (p, q) process p  0, q>0


A primary feature of the autoregressive condi- 0 > 0, i 0, i = 1, . . . , q
tional heteroskedasticity (ARCH) model, as developed
by Engle [5], is that the conditional variances change i  0, i = 1, . . . , p.
over time. Following the similar idea, numerous mod-
els incorporating this feature have been proposed. To perform predictions, this model is extended to
Among these models, Bollerslev’s generalized ARCH ARMA models with GARCH ( p, q) errors. We define
(GARCH) model is certainly the most popular and the ARMA-GARCH model as:
successful because it is easy to estimate and interpret
by analogy with the autoregressive moving average 
r 
s
yt = i yt−i + i t−1 + t
(ARMA) time series model. Using the ARCH and i=1 i=1
GARCH models to do empirical research has become 
t =  t h t (2)
popular in recent years. For example, Franses and
Ghijsels [6] applied the GARCH model to analyzing 
q 
p
h t = 0 + i 2t−i + i h t−i
daily returns in Europe’s main stock markets. Brooks i=1 i=1
[7] used the GARCH model to analyze the French
Franc and Deutsch Mark exchange rates but, to our where yt is an ARMA process and i , i are unknown
knowledge, predictions from a general ARMA model parameters, r is the order of autoregressive part and s the
with GARCH errors hasn’t received as much attention. order of moving average part. It is important to note that
Among the research were Garcia and Contreras [8] who in the presence of GARCH effects the optimal predictor
used GARCH model to predict day-ahead electricity for the dependent variable is a function of past values
prices in several European markets. not only of the observations and the errors (yt−i , t−i )
The information recovery process in time-series but of the conditional variances and the squared errors
analysis uses historical observations to derive estimates (h t−i , 2t−i ) as well.
of current and future values of the dependent variable.
Among the most popular estimation techniques are the
maximum-likelihood (ML) approach, which requires 2.2 GARCH (1, 1)
the availability of information on the entire probability
The simplest, but often very useful, GARCH pro-
distribution, generalized methods of moments (GMM),
cess is, of course, the GARCH (1, 1) process, which is
which reduces the informational requirements to spe-
also the model we use here. It follows as:
cific moments of the data, and nonparametric proce-
dures [9]. Given the Gaussian distribution for the time t |t−1 ∼ N (0, h t )
series is satisfied, we adhere to the Box-Jenkins mod- 
eling approach of parsimony, i.e. using the fewest t = t h t
model parameters as supported by the data, to estimate (3)
h t = 0 + 1 2t−1 + 1 h t−1
an ARMA process with conditional-heteroskedastic
(GARCH) error components. 0 >0, 1 0, >0
Let t denote a real-valued discrete-time stochas-
tic process, and t the information set (-field) of all For GARCH (1, 1) model, it is required that 1 +
information through time t. The GARCH ( p, q) pro- 1 <1, the closer 1 + 1 is to 1, the effect on volatility
cess is then given by is longer.
Considering the fact that the silicon content se-
ries is significantly auto correlated only for 2 lags, the
t |t−1 ∼ N (0, h t )
ARMA process we use here is ARMA (2, 0). Thus the

t =  t h t model becomes:


q 
p (1) yt = ayt−1 + byt−2 + t
h t = 0 + i 2t−i + i h t−i 
i=1 i=1 t =  t h t (4)
= 0 + A(L)2t + B(L)h t h t = 0 + 1 2t−1 + 1 h t−1

q 2008 John Wiley and Sons Asia Pte Ltd and Chinese Automatic Control Society
634 Asian Journal of Control, Vol. 10, No. 6, pp. 632 637, November 2008

A GARCH model is used to fit data with heteroskedas- Table II. Result of Engle’s ARCH-test.
ticity, and another important feature is that it can be fit-
Lag P-Value Statistics
ted to data with excess kurtosis; in successive analysis
we will find that the data of silicon content has both of 10 2.00e − 07 50.68
these features. 15 1.30e − 06 55.81
20 6.20e − 06 60.38

III. GARCH FORECASTING METHOD are known to be asymptotically unbiased and efficient.
The parameter estimation is based on maximizing a like-
Most silicon content series are characteristic of lihood function for the available data.
high frequency, non-constant mean and variance and It should be noted that during the parameter es-
high volatility, among others. These factors are the main timation procedure, it is possible that additive outliers
reasons to apply the GARCH method. Due to the high appear. But since we cannot predict when the BF oper-
volatility of silicon content, a difference logarithmic ators exert a strong control measure we have decided to
transformation, neglect it and use the real data without any adjustment.
In this step, another testing has been applied to the
dsi(n) = log(si(n)) − log(si(n − 1)), (5) fitted residuals (actual values of silicon content minus
fitted values) of the estimated models. The Lagrange
is applied to smooth the volatility effect. Multiplier (LM) test is used to test whether there is
Before using GARCH to perform forecasts, rig- additional GARCH effect in the fitted residuals. If the
orous statistical hypothesis testing should be applied hypothesis testing is not significant, it shows that the
to validate the model assumptions of the GARCH fitted residuals have no additional GARCH effect and
specification. The diagnosis checks verify the statisti- also the parameter estimation is valid.
cal significance and assumptions of the parameters in With all the statistical significance and hypothe-
the GARCH model and its residuals (actual values of ses tests on parameters and residuals are validated, the
silicon content minus mean value). model can be used to forecast silicon content. Our pre-
Statistical tests on the residuals include the Ljung- dictive procedure is as follows. We estimate the param-
Box statistic, and plots, such as the autocorrelation and eters in the GARCH (1, 1) model (from (3) to (4)) and
partial correlation of the residuals (see [6, 7, 10]). The the ARMA (2, 0) process for samples and generate a
computed statistical values for the Ljung-Box statistic one-step ahead forecast for t . With the forecasted t ,
together with corresponding p-values are presented (see we get the corresponding forecast for yt . Next, the first
Table I). The null hypothesis is that no serial correlation data point is deleted from the sample and a new one is
exists and the hypothesis is accepted when the p-values added at the end of sample, then we estimate the pa-
are high. In this case we reject the hypothesis. rameters and generate a one-step ahead forecast again.
Another essential test is that of the ARCH ef- The GARCH (1, 1) is applied time and time again and
fect (i.e., heteroskedasticity). Engle’s ARCH statistic the forecasts are generated.
has been used. This test supports that there exists het-
eroskedasticity. P-values are close to zero for all of the
three lags. The result is presented in Table II. With these IV. SIMULATION OF PREDICTIVE
P-values we also reject the hypothesis that no ARCH- ALGORITHM
effects exist.
After these statistical hypotheses testing applied it In our empirical analysis, we use data collected
is time to estimate the unobservable parameters through from the Intelligent Control System on 2000 m3 BF,
Maximum Likelihood (ML) estimation. ML estimates Handan Iron & Steel Co., China, as sample data to
implement the algorithm. The data set contains 1001
data points. After difference logarithmic transform (5)
Table I. Result of Ljung-Box-Pierce Q-test. it turns into a sample with 1000 data points. Fig. 1 gives
Lag P-Value Statistics the original data series of silicon content.
We divide the 1000 data points into two groups;
10 0 206.86 the first group consists of data points from 1 to 500, the
15 0 217.15
20 0 220.49 second from 501 to 1000. The two groups are then used
to test the new algorithm. For each group, the first 450

q 2008 John Wiley and Sons Asia Pte Ltd and Chinese Automatic Control Society
J.-s. Zeng et al.: Using Non-linear GARCH Model to Predict Silicon Content in Blast Furnace Hot Metal 635

Fig. 1. Time series graph for original data of silicon content.

Fig. 2. The predictive result of the first group.

Fig. 3. The predictive results of second group.

data points are used as training data, and the final 50 Figs 2 and 3 show the result of the predictive algo-
data points are used as testing data. After the necessary rithm: From Figs 2 and 3 we can see that most predicted
estimation and diagnostic test, the estimated parameters values of both groups follow the change of silicon con-
are applied to predict. The predictive results are got tent and give good forecasts.
through the inverse transform of (5): To further evaluate our predictive model, several
important criteria are considered. The first criterion is
ˆ = exp(dsi(n))
si(n) ˆ ∗ si(n − 1) (6) model’s relative error, which we are trying to minimize.

q 2008 John Wiley and Sons Asia Pte Ltd and Chinese Automatic Control Society
636 Asian Journal of Control, Vol. 10, No. 6, pp. 632 637, November 2008

Table III. Results of prediction.

Group No Number for Training Standard Deviation Kurtosis Perr H

1 450 0.060 2.63 0.032 86%


2 450 0.075 3.86 0.034 84%

It is denoted to be Perr and computed as follows: the results previously reported in technical literature.
 N But there are still short-comings. The GARCH model
N 
Perr = ˆ
(si(k) − si(k)) 2
si(k)2 (7) is only part of a solution since BF operators’ decision-
k=1 k=1 making is not solely based on expected forecasts of
silicon content. Improvements to the models like the in-
ˆ is the predicted value, si(k) the observed
where si(k) clusion of other dependent variables should be consid-
value and N the total predicted numbers. ered in future research.
The second criterion is the relation between orig-
inal values and predicted values. It is denoted to be H
and computed as follows: ACKNOWLEDGEMENTS
 N 
1  The authors thank the Handan Steel Corporation in China
H= Sk × 100% (8) for providing field data and expert knowledge. The second
N k=1 author would like to thank sponsorship and financial support
from Zhejiang Provincial Natural Science Foundation of China
where
 under Grant No. Y107110, Research Fund for the Doctoral
1 ˆ − si(k)))<0.1
((si(k) Program of Higher Education of China (for new teachers) under
Sk = (9) Grant No. 20070335161. The fourth author would like to thank
0 else. sponsor and financial support from the Office of Education,
Jiangxi Province under Grant No. GJJ08358.
Generally, if Sk equals to one, namely, ((si(k)ˆ −
si(k)))<0.1, we say that the prediction hits the tar-
get. And H is the percentage of prediction that hits the REFERENCES
target. 1. Luo, S.-H., X.-G. Liu, and M. Zhao, “Prediction
Table III gives some statistical features of sample for silicon content in molten iron using a combined
data and the results of prediction. fuzzy-associative-rules bank”, Proc. Int. Conf.
It is seen from the data in Table III that, for both Fuzzy Syst. Knowl. Discov., pp. 667–676 (2005).
groups of data, the GARCH model gives more than 84% 2. Zhao, M., X.-G. Liu, and S.-H. Luo, “An
hit rate at the range of [si] ± 0.1, and the accuracy of evolutionary artificial neural networks approach for
prediction is below 0.04. The result is helpful for BF BF hot metal silicon content prediction”, Proc. Int.
operators to judge the situation of the BF and make cor- Conf. Nat. Comp., pp. 374–377 (2005).
responding control measures. What should be noted is 3. Gao, C. H., Z. M. Zhou, and Z. J. Shao, “Chaotic
that the second data group has a kurtosis of 3.86 and un- local-region linear prediction of silicon content
der the condition of excess kurtosis, the GARCH model in hot metal of blast furnace”, Acta Metall. Sin.,
is also able to achieve a good result. Comparing to other Vol. 41, pp. 433–436 (2005).
methods [11], the GARCH model works better under 4. Bollerslev, T., “Generalized autoregressive
situations with higher standard deviation and kurtosis. conditional heteroskedasticity”, J. Econ., Vol. 31,
pp. 307–327 (1986).
5. Engle, R. F., “Autoregressive conditional hetero-
V. CONCLUSION skedasticity with estimates of the variance of
UK inflation”, Econometrica, Vol. 50, pp. 987–
Since its introduction by Bollerslev in 1986, the 1007 (1982).
GARCH models have been widely used in forecast- 6. Franses, P. H. and H. Ghijsels, “Additive
ing financial time series and have fully shown its use- outliers, GARCH and forecasting volatility”, Int. J.
fulness. This paper applies the GARCH (1, 1) model Forecast., Vol. 15, pp. 1–9 (1999).
to predict silicon content in hot metal to control the 7. Brooks, C., “A double-threshold GARCH model for
BF iron-making and good result is achieved, taking ac- the French-Franc/Deutsch-mark exchange rate”, J.
countof the complex nature of silicon content series and Forecast., Vol. 20, pp. 135–143 (2001).

q 2008 John Wiley and Sons Asia Pte Ltd and Chinese Automatic Control Society
J.-s. Zeng et al.: Using Non-linear GARCH Model to Predict Silicon Content in Blast Furnace Hot Metal 637

8. Garcia, R. C., Contreras J. et al., “A GARCH applied to these areas, and prediction and modeling of
forecasting model to predict day-ahead electricity noise from industrial process.
prices”, IEEE Trans. Power Syst., Vol. 20, No. 2,
pp. 867–874 (2005). Xiang-guan Liu was born in Fu-
9. Ljung, G. and G. Box, “A measure of lack of fit in jian, China, in 1943. He received
time series models”, Biometrika, Vol. 66, No. 2, pp. the B.S. degree in Mathemat-
65–70 (1986). ics from University of Science
10. Jarque, C. and A. Bera, “Efficient tests for and Technology of China (USTC)
normality, homoskedasticity, and serial indepen- in 1966. After graduation from
dence of regression residuals”, Econ. Lett., Vol. 6, USTC, he worked in Pangang Iron
No. 2, pp. 55–59 (1980). and Steel Corporation of China and
11. Meade, N., “A comparison of the accuracy of short won the First Level Award for
term foreign exchange forecasting methods”, Int. J. Progress on Science and Technol-
Forecast., Vol. 18, pp. 67–83 (2002). ogy of China and became a faculty of the Department of
Mathematics at Zhejiang University since 1995, where
he is currently a Professor.
His research interests are in the modeling and con-
Jiu-sun Zeng was born in Jiangxi, trol of metallurgical process, in particular in the devel-
China, in 1982. He received the opment of expert system and optimization theory. The
B.S. degree in Mathematics from expert system he developed was successfully applied to
Zhejiang University, China, in several blast furnaces in China.
2004, and is currently a Ph.D. can-
Ke-ping Yang was born in
didate of Zhejiang University, ma-
Zhejiang, China, in 1982. She
joring in Operational Research and
received the B.S. degree in In-
Control Theory. He is expected to
formation and Computer Science
graduate in 2009.
from Ningbo University, China,
His research interests are in the areas of system in 2004, and the M.S. degree in
identification and control theory, with application to Operational Research and Control
complex industrial system, specifically on metallurgical Theory from Zhejiang University,
process. He is currently working on subspace identifi- China, in 2006. She is currently
cation methods and model predictive controller design a statistician in Annpro Analytic
for blast furnace ironmaking process, other interests in- of Hangzhou, China. Her research interests are in the
clude application of statistical method to control. field of statistical algorithms and application to indus-
trial systems and clinical study. She is an expert on
Chuan-hou Gao was born in An- statistical theory and algorithms.
hui, China, in 1975. He received
the B.S. degree in Chemical En- Shi-hua Luo was born in Jiangxi,
gineering from Zhejiang Univer- China, in 1975. He received the
sity of Technology, China, in 1998, B.S. degree and M.S. degree in
and the Ph.D. degree in Opera- Mathematics from Jiangxi Normal
tional Research and Cybernetics University, China, in 1998 and
from Zhejiang University, China, 2001, and the Ph.D. degree in
in 2004. From June 2004 until May Operational Research and Control
2006, he was a Postdoctor in the Theory from Zhejiang University,
Department of Control Science and Engineering at Zhe- China, in 2007. He is currently an
jiang University. Since June 2006, he has joined the De- associate professor of Jiangxi Uni-
partment of Mathematics at Zhejiang University, where versity of Finance and Economics. His research inter-
he is currently an Associate Professor. ests are in the field of modelling and optimization of
His research interests are in the areas of optimiza- complex systems, especially on the extraction of non-
tion and control for complex industrial process, in par- linear characteristics of chemical process and financial
ticular in chaos and model predictive control techniques process.

q 2008 John Wiley and Sons Asia Pte Ltd and Chinese Automatic Control Society

You might also like