Professional Documents
Culture Documents
Arima Model
Arima Model
Lecture 8
ARIMA Forecasting II
March 16 and 21, 2011
Content
The Box-Jenkins Modeling Process
Seasonal ARIMA Models
Concluding comments on ARIMA models
Identification
Fitting
Diagnostics
Refitting if necessary
Forecasting
Identification
What does the data look like?
What patterns exist?
Is the data stationary?
Tools
Plots of data
PACF
ACF
Model Fitting
Trial model is proposed
e.g. ARIMA(0,1,2)
Parameter estimates
Test statistics
Goodness of fit measures
Residuals
Diagnostics
Martin L. Puterman 2008
Diagnostics
Determines whether model fits data adequately.
The goal is to extract all information and ensure that
residuals are white noise
Key measures
ACF of Residuals
PACF of Residuals
Ljung-Box-Pierce Q Statistic (Portmanteau Test)
Tests whether a set of residual autocorrelations is
significantly different than zero.
See next slide for details
rj
Q(k ) N ( N 2)
j 1
N j
Forecast
423.5
421.3
420.8
420.7
420.7
Lower
360.7
340.4
324.8
311.6
299.9
Prob
Level
0.009382
Fitted Model
Xt+1-Xt = .238 (Xt Xt-1) or Xt+1 = Xt + .238 (Xt Xt-1)
10
Wages Data
Autocorrelations of Wages (0,0,12,0,0)
Partial Autocorrelations
1.0
0.5
0.0
-0.5
-1.0
0.0
10.3
20.5
30.8
0.5
0.0
-0.5
-1.0
0.0
41.0
Time
10.3
20.5
30.8
41.0
Time
Plot of Wages
6.0
5.7
Wages
Autocorrelations
1.0
5.4
5.1
4.8
0.9
18.9
36.9
54.9
72.9
Time
11
Partial Autocorrelations
Autocorrelations
1.0
0.5
0.0
-0.5
-1.0
0.0
0.5
0.0
-0.5
-1.0
0.0
10.3
20.5
30.8
10.3
41.0
Time
20.5
30.8
41.0
Time
Correlation
Lag
Correlation
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
0.013874
0.152615
0.077908
0.013874
0.118463
0.328709
-0.086446
0.028815
0.205977
-0.066169
0.200640
0.016009
0.008538
0.494130
-0.024546
0.058698
0.200640
0.003202
0.036286
0.091782
12
Standard
Error
0.1200761
0.1202638
0.1170662
0.1019158
Prob
Level
0.247006
0.897645
0.075128
0.000000
T-Value
1.1577
0.1286
-1.7797
-5.3252
Forecasts of Wages
Autocorrelations of Residuals
6.4
6.0
0.5
Wages
Autocorrelations
1.0
0.0
-0.5
-1.0
0.0
5.6
5.2
12.3
24.5
36.8
49.0
4.8
1982.9
Lag
1984.9
1986.8
1988.8
1990.8
Time
Ljung-Box 24
20
34.96
0.020343
Inadequate Model
13
Standard
Error
0.1150536
0.1161695
0.1119487
0.1181367
6.011709E-02
Prob
Level
0.06361
0.49744
0.00269
0.00028
0.00000
T-Value
1.8549
0.6785
-3.000
-3.625
-14.23
Wages Chart
Autocorrelations of Residuals
6.2
1.0
Wages
Autocorrelations
5.9
0.5
0.0
5.2
-0.5
-1.0
0.0
5.5
12.3
24.5
36.8
49.0
4.8
1982.9
Lag
1984.9
1986.8
1988.8
1990.8
Time
14
Section
Parameter
Estimate
-0.5495576
Standard
Error
8.447082E-02
T-Value
-6.5059
Prob
Level
0.000000
Wages Chart
Autocorrelations of Residuals
6.2
1.0
Wages
Autocorrelations
5.9
0.5
0.0
5.5
5.2
-0.5
-1.0
0.0
12.3
24.5
36.8
49.0
4.8
1982.9
Lag
1984.9
1986.8
1988.8
1990.8
Time
15
Model Comparison
Model
RMSE
LjungBox (24)
Residual
ACF
Autocorrelations of Residuals
34.96
Autocorrelations
(0,1,3)x(0,0,1)12 .0316
1.0
0.5
0.0
-0.5
-1.0
0.0
12.3
24.5
36.8
49.0
Lag
11.08
Autocorrelations
(0,1,3)x(0,0,2)12 .0245
Autocorrelations of Residuals
1.0
0.5
0.0
-0.5
-1.0
0.0
12.3
24.5
36.8
49.0
Lag
Autocorrelations of Residuals
(0,1,0)x(1,1,0)12 .0239
15.84
Autocorrelations
1.0
0.5
0.0
-0.5
-1.0
0.0
12.3
24.5
36.8
49.0
Lag
16
(I a1B) xt = et
which becomes xt a1xt-1 = et
which becomes xt = a1xt-1 + et
(I a1B a2B2) xt = et
17
18
Concluding Comments
The ARIMA models are not designed for models with multiplicative seasonality. In
such cases;
Models with persistent trends can be de-trended and ARIMA applied to the detrended series.
Several automatic fitting programs do a good job fitting ARIMA models (Not NCSS)
Need to compare all models on the basis of out-of sample forecasts on holdout data.
Simpler ARIMA models seem to work better out of sample even though they may not give
the best fit.
Recall from early slides that fitting is different than forecasting.
ARIMA models forecasts can be pooled with those from one or more other models.
Martin L. Puterman 2011
19