Professional Documents
Culture Documents
Microsoft PowerPoint - Time series Analysis - Week_10_171110898455171261865fd7378c16dc.03
Microsoft PowerPoint - Time series Analysis - Week_10_171110898455171261865fd7378c16dc.03
• Yt depends on one previous Yt−1 value and one previous error term et−1.
• Series is assumed stationary in the mean and variance
• An ARMA model with higher-order terms
1
Examples of ARIMA(1,0,1) models
2
ARIMA models
• ARIMA(p, d, q) model yields a variety of patterns in the ACF and PACF, so that it is
unwise to state rules for identifying general ARIMA models
• Simpler AR(p) and MA(q) models do provide some identifying features that can help
zero in on a particular ARIMA model identification
• In practice, it is hardly necessary to deal with values p, d, or q that are other than 0,
1, or 2 3
Seasonality and ARIMA models
• Data separated by a whole season may exhibit AR, MA, mixed ARMA, or mixed
ARIMA properties, just as consecutive data points may exhibit in case of non
seasonal ARMA models
4
Seasonal ARIMA model
• ARIMA(1, 1, 1)(1, 1, 1)4 model
• Coefficients φ1, Φ1, θ1, and Θ1 are estimated from the data, and the equation is used for
forecasting
• A seasonal MA model eg., ARIMA(0,0,0)(0,0,1)12 will show a spike at lag 12 in the ACF
but no other significant spikes;
• PACF will show exponential decay in the seasonal lags; that is, at lags 12, 24, 36, …. 5
An example dataset for ARIMA model identification
• Dataset of Monthly French industry sales of printing and writing paper (in thousands
of francs) between 1963–1972
6
7
An example dataset for ARIMA model
identification
• Plot shows a very clear seasonal pattern in the data plot and a general
increasing trend
• Autocorrelations are almost all positive, and the dominant seasonal
pattern shows clearly in the large values or r12, r24, and r36
• So, take the first difference to address the linear trend, and the 12th
difference to address the periodicity
8
9
Identification of ARIMA model
11
• Fluctuations increase as one moves from left to right on the graph – non-
stationarity in the variance
• For achieving stationarity in variance, a logarithmic or power transformation of
the data may be done
12
• After logarithmic transformation, the series has achieved stationarity in variance
• For achieving stationarity in mean, differencing may be done
• As there is no strong seasonality, and so we take a first difference rather than a seasonal difference
13
• Series has achieved stationarity in variance and also mean
14
Tentative ARIMA model
15
Steps to identify a Box-Jenkins ARIMA model
• Make the series stationary
• Differencing (non-seasonal and/or seasonal) usually takes care of non-stationarity
in the mean
• Logarithmic or power transformations often take care of non-stationary in the
variance
• Consider non-seasonal aspects
• An examination of the ACF and PACF of the stationary series obtained in the
previous step can reveal whether a MA or AR model is feasible
• Consider seasonal aspects
• Examination of the ACF and PACF at the seasonal lags can help identify AR and
MA models for the seasonal aspects of the data
• Not as easy to identify as in the case of the non-seasonal aspects
16
Estimating the ARIMA model parameters
• Suppose the class of model identified is ARIMA (0, 1, 1). This is a family of models
depending on one MA coefficient θ1
• If some of the estimated parameters are insignificant (p-values larger than 0.05),
then, a revised model with the insignificant terms omitted may be considered
• Mixed models are generally harder to identify than pure AR or MA models. So, we
may begin with either a pure AR or a pure MA model and extend the selected model
to a mixed ARMA/ARIMA model
18
Criteria for selecting the preferred ARIMA model
• Can we choose the model which gives the smallest sum of squared errors or the largest
value for the likelihood?
• Does not always work—often the MSE can be made smaller and the likelihood made
larger simply by increasing the number of terms in the model
• This is analogous to the problem of selecting a regression model by maximizing the R2
value – R2 value can be increased by adding another explanatory variable
• For ARIMA models, the likelihood may be penalized for each additional term in the model.
If the extra term does not improve the likelihood more than the penalty amount, it is not
worth adding.
• Akaike’s Information Criterion or AIC (Akaike, 1974)
22
Portmanteau tests
• Rather than study the rk values one at a time, an alternative approach is to
consider a whole set of rk values, (say, k = 1 to 15/ 20) all at one time, and
see whether the set is significantly different from a zero set
• Box-Pierce test
25