Professional Documents
Culture Documents
Isengildinamassa2011 PDF
Isengildinamassa2011 PDF
Isengildinamassa2011 PDF
Applied Economics
Publication details, including instructions for authors and subscription information:
http://www.tandfonline.com/loi/raec20
To cite this article: Olga Isengildina-Massa , Scott Irwin , Darrel L. Good & Luca Massa (2011) Empirical confidence intervals
for USDA commodity price forecasts, Applied Economics, 43:26, 3789-3803, DOI: 10.1080/00036841003724429
Taylor & Francis makes every effort to ensure the accuracy of all the information (the “Content”) contained
in the publications on our platform. However, Taylor & Francis, our agents, and our licensors make no
representations or warranties whatsoever as to the accuracy, completeness, or suitability for any purpose of the
Content. Any opinions and views expressed in this publication are the opinions and views of the authors, and
are not the views of or endorsed by Taylor & Francis. The accuracy of the Content should not be relied upon and
should be independently verified with primary sources of information. Taylor and Francis shall not be liable for
any losses, actions, claims, proceedings, demands, costs, expenses, damages, and other liabilities whatsoever
or howsoever caused arising directly or indirectly in connection with, in relation to or arising out of the use of
the Content.
This article may be used for research, teaching, and private study purposes. Any substantial or systematic
reproduction, redistribution, reselling, loan, sub-licensing, systematic supply, or distribution in any
form to anyone is expressly forbidden. Terms & Conditions of access and use can be found at http://
www.tandfonline.com/page/terms-and-conditions
Applied Economics, 2011, 43, 3789–3803
sion and misunderstanding in forecast interpretation. research will provide valuable information that can
As numerous procedures have been proposed to be used to more accurately estimate confidence limits
calculate confidence limits of forecasts generated for USDA price interval forecasts.
by statistical models, these procedures provide little
guidance for forecasts based on a combination or a
consensus process rather than formal models, as is the
case with USDA forecasts. In reviewing the predic- II. Empirical Methods
tion interval literature, Chatfield (1993) observes that
when theoretical formulae are not available or there The empirical approach to the estimation of forecast
are doubts about model assumptions the use of confidence intervals is based on the notion that
empirically based methods should be considered as confidence limits for future forecasts can be based
a general purpose alternative. Empirical methods are on the distribution of historical forecast errors. The
based on the notion that forecast confidence limits empirical approach was first applied to the construc-
may be estimated by evaluating historical forecast tion of confidence limits for economic forecasts by
errors. An empirical method was first applied to Williams and Goodman (1971). Their approach
construction of forecast confidence limits by Williams consists of splitting the data in two parts and fitting
and Goodman (1971). The method has been used the method or model to the first part in order to find
successfully in a variety of fields since the initial forecast errors. The model is then refitted each year
work of Williams and Goodman (e.g. Murphy and adding an additional observation in the first part and
Winkler, 1977; Stoto, 1983; Keilman, 1990; increasing the part of the sample used to estimate
Zarnowitz, 1992; Shlyakhter et al., 1994; Jorgensen forecast errors. The key assumption of this method is
and Sjoberg, 2003). that future forecast errors belong to approximately
The purpose of this study is to investigate alterna- the same distribution as past forecast errors.1
tive approaches to estimate empirical forecast error Williams and Goodman (1971) argue that this
distributions and the associated confidence limits for assumption is less restrictive than the standard
USDA forecasts of marketing year average corn, assumption that a forecasting model describes the
soybean and wheat prices. USDA interval price series adequately in the future. Therefore, by accu-
forecasts over the 1980/81 to 2006/07 marketing mulating forecast errors through time one can obtain
years are included in the analysis. Empirical error an empirical distribution of forecast errors and
distributions are estimated using nonparametric determine the confidence limits for the future fore-
approaches, parametric distributions and quantile casts by using the percentage points of the empirical
regressions. The procedures are compared based on distribution generated from the past errors. The
out-of-sample performance, where the first 15 obser- benefit of this method is that it can be applied in a
vations (1980/81–1994/95) are used to generate con- straightforward manner to any type of error distri-
fidence limits for the 16th year (1995/96); the first 16 bution, including fat-tailed and/or asymmetric dis-
observations are used to generate confidence limits tributions. This method is also particularly useful
1
It is worth noting that most theoretical variance expressions are based on the same assumption.
Empirical confidence intervals 3791
when the forecasting model is not fully identified and Weibull distribution produced better calibrated
based on consensus or a combination of several post-sample forecasts for their data set. Williams and
sources, which makes the standard model-based Goodman (1971) found that for their dataset on the
methods for confidence limit calculation impossible number of telephone lines in use in designated cities
to apply. a gamma distribution provided the best fit. Several
Empirical methods have also been recommended other studies (e.g. Stoto, 1983; Cohen, 1986) found
for cases when theoretical formulae yield prediction that forecast error distributions of population growth
intervals that are too narrow. For example, Williams forecasts approximately followed a normal distribu-
and Goodman (1971) found that between 20% and tion. Shlyakhter et al. (1994) proposed the use of
26% of 18-month-ahead forecasts fell outside the exponential distributions for data with large devia-
90% prediction intervals. Makridakis et al. (1987) tions from normality, such as population and energy
found that 13 of the 15 univariate forecasting data. Jorgensen and Sjoberg (2003) used a nonpara-
methods had more than 30% of the 4-year-ahead metric histogram approach to find the desired points
forecasts outside the 95% confidence interval. of the software development effort distribution.
Chatfield (1993) argues that one of the reasons for Taylor and Bunn (1999a, b) suggested an approach
narrow theoretical prediction intervals may be that to empirical confidence interval estimation that
innovations are not normally distributed. This short- addresses the small sample problem by pooling data
Downloaded by [Stony Brook University] at 06:23 20 December 2014
coming may be easily corrected with an empirical across time and forecasting horizons and estimating
method which does not rely on the assumption of forecast error distributions via quantile regression.
normality. The authors develop forecast error quantile models
The empirical methods of constructing confidence that are functions of lead time, k, as suggested by
intervals have been used successfully in a variety of theoretically derived variance expressions. The use of
fields (e.g. Murphy and Winkler, 1977; Stoto, 1983; quantile regression avoids the normality and optim-
Keilman, 1990; Zarnowitz, 1992; Shlyakhter et al., ality assumptions underlying theoretical forecast
1994; Jorgensen and Sjoberg, 2003). One of the main variance expressions. Another benefit of this
limitations of empirical methods is the data require- approach is that it relaxes the assumption that error
ment. Therefore, these methods have been most distributions for each forecast month are indepen-
widely used in areas where data limitations are less dent. This assumption is likely to be violated in the
common, such as weather, population and software case of USDA price forecasts, since forecast errors
development forecasting. Recently, the importance tend to decline from the beginning to the end of the
of empirical control of a model’s probability assess- forecasting cycle as more information becomes
ments has been recognized in engineering applica- available.
tions (e.g. Rebba and Mahadevan, 2006). The empirical methods described in this section
While empirical methods are gaining popularity, use historical forecast errors for the construction of
there is no consensus on how the forecast error confidence intervals based on the assumption that
distributions should be estimated. Gardner (1988) future forecast errors will follow approximately the
proposed the use of Chebychev’s inequality to derive same distribution as the past errors. These methods
prediction intervals as it provides an upper bound for differ in the way that historical error distributions are
any distribution with known mean and SD. Based on estimated. Our study will provide insight regarding
the 1001 series of the M-competition, Gardner found the relative accuracy of the alternative methods
that Chebychev’s inequality produced more accurate by providing a side-by-side comparison of their
confidence intervals than when normally distributed out-of-sample performance.
errors are assumed. However, Bowerman and
Koehler (1989) objected to the use of Chebychev’s
inequality because it gives unnecessary wide predic-
tion intervals for a correctly specified and unchanging III. Data
model. Makridakis and Winkler (1989) provide the
evidence that actual forecast errors based on this USDA corn, soybean and wheat interval price
approach tend to be larger, on average, than in-sample forecasts are released monthly, usually between the
fit errors. With data on daily electricity peak loads, 9th and 12th of the month within World Agricultural
Allen and Morzuch (1995) demonstrated that proba- Supply and Demand Estimates (WASDE, selected
bility forecasts calculated from Chebychev’s inequal- issues from 1980 to 2007) reports. USDA price
ity were the worst-calibrated. Their study also forecasts are published in the form of an interval
included four parametric distributions, normal, and generated using a balance sheet approach, with
gamma, Cauchy and Weibull. They found that the published intervals reflecting uncertainty associated
3792 O. Isengildina-Massa et al.
Downloaded by [Stony Brook University] at 06:23 20 December 2014
Fig. 1. The 2006/07 WASDE forecasting cycle for corn, soybeans and wheat relative to the US marketing year
with prices in the future (see Vogel and Bange (1999) where fm ¼ 1, . . . , FM, and i ¼ 1980/81, . . . , 2006/07.
for a detailed description of the forecast generation Thus, each subsequent forecast is essentially an
process). The price in these forecasts refers to the update of the previous forecast as it describes the
average marketing year farm price, which is an same terminal event. The WASDE forecasting cycle
average of the price received by farmers weighted generates 18 updates (FM ¼ 19) of marketing year
by the monthly amounts marketed. The first price average price for both corn and soybeans and 15
forecast for a marketing year is released in May updates (FM ¼ 16) for wheat. While the forecasts are
preceding the US marketing year (September–August published in the form of an interval, the probability
for corn and soybeans and June–May for wheat). with which the realized price is expected to fall within
Estimates are typically finalized by August (for the forecast interval is not specified.
wheat) and November (for corn and soybeans) of The descriptive statistics for USDA interval price
the following marketing year (Fig. 1). Note that forecasts for corn, soybeans and wheat over 1980/81
USDA price forecasts are considered fixed-event through 2006/07 are presented in Table 1. Prior to
forecasts because the series of forecasts is related to harvest (the first six forecasts from May to October
the same terminal event ðqiFM Þ, where FM is the for corn and soybeans and the first three forecasts
release month (November for corn and soybeans and from May to July for wheat), soybean price interval
September for wheat) of the final estimate of average forecasts were the widest, averaging about 19% of the
farm price for the ith marketing year. The forecast of forecast price compared to 17% for corn and 14% for
the terminal event for month fm is denoted as qifm , wheat.2 Soybean interval forecasts prior to harvest
2
Isengildina et al. (2004) provide survey evidence that USDA price intervals are symmetric. That is, a midpoint is forecast and
then an equal interval is added to each side of the midpoint. Therefore, the average forecast price is computed in this study by
taking an average of the midpoint of forecast prices for each month.
Empirical confidence intervals 3793
Table 1. Descriptive and accuracy statistics for published USDA interval forecasts of corn, soybean and
wheat prices, 1980/81–2006/07 marketing years
Notes: Average interval is calculated by dividing interval range by its midpoint. Hit rate is the proportion
of times the interval contained the final estimate. Misses above and below describe cases when the final
estimate fell above or below the forecast interval. The sum of hit rate, misses below and misses above may
not add up to 100 due to rounding.
Downloaded by [Stony Brook University] at 06:23 20 December 2014
were also the most accurate with an average hit rate The other two commodities did not exhibit such
(proportion of times forecast intervals contain the persistent tendencies.
final or ‘true’ value) of 67% compared to 46% An important basic assumption of empirical
for both corn and wheat. After harvest (the last 10 approaches to the estimation of confidence limits is
forecasts from November to August for corn and that the distribution of forecast errors remains stable
soybeans and from August to May for wheat),3 corn over time. Forecast errors are calculated in this study
interval forecasts were the widest and the most as percentage errors, which allows us to compare
accurate, averaging about 7% of the forecasted results across commodities and may be preferred to
price (relative to 6% and 5% for wheat and soybeans, unit errors when mean price levels change (as they
respectively) with an average hit rate of 71% (relative did for most agricultural commodities after 2006).
to 65% for both soybeans and wheat). Poor forecast The validity of this assumption for corn, soybean
accuracy after harvest is associated with the fact and wheat price forecast errors is tested by dividing
that forecast intervals were usually collapsed to point the sample in two parts, from 1980/81 to 1994/95 and
estimates in May after harvest for wheat and from 1995/96 to 2006/07 and examining whether the
soybeans and in August after harvest for corn. first two moments of forecast error distributions
No trends in the magnitude of interval forecasts differed between two sub-periods. An independent
over time were detected. Thus, intervals in the same sample t-test was used to test the difference in the
months did not become smaller (or larger) from the mean forecast errors across the two sub-samples and
beginning to the end of the sample period. Levene’s F-statistic was used to test the difference
Another issue is whether forecast intervals accu- in the variance of forecast errors.
rately reflect the shape of the underlying price The results of this analysis are presented in Table 2
distribution. Statistics on the proportion of misses and demonstrate that the mean forecast error for
above and below the forecast interval reported in soybean price forecasts prior to harvest was signifi-
Table 1 provide an insight on this issue. If the forecast cantly higher in the more recent sub-period. These
intervals accurately reflected the shape of the under- results also suggest that the variance of corn price
lying price distribution, one would expect equal forecast errors after harvest was significantly smaller
probability of misses above and below the forecast and the variance of wheat price forecast errors prior
interval, which is about what we observe for corn to harvest was significantly larger in the 1995/96
price forecasts after harvest. Table 1 demonstrates through 2006/07 sub-period. Forecast errors were not
that for soybean price forecast intervals the propor- statistically different across the two sub-periods in all
tion of misses above the interval was 2.2 times greater other cases, indicating that error distributions
than the proportion of misses below the interval remained relatively stable over time. These results
prior to harvest and 2.9 times greater after harvest. suggest that empirical approach to calculating
3
The last several months (17 and 18 for corn and soybeans and 14 and 15 for wheat) were not included in the analysis because
errors were often zero, so the distributions were impossible to estimate.
3794 O. Isengildina-Massa et al.
Table 2. Comparison of the first two moments of percentage error distributions of USDA interval forecasts
of corn, soybean, and wheat prices, 1980/81–1994/95, and 1995/96–2006/07 marketing years.
Variance
Levene’s F-Statistic 0.36 2.89* 0.60 0.50 10.05*** 0.43
Notes: Forecast error is calculated as the percentage difference between the final estimate and the midpoint
of the forecast interval times 100.
* and *** indicate significance at 10 and 1% levels, respectively.
confidence intervals would be most efficient for corn the first part of the sample to estimate forecast error
price forecasts prior to harvest and soybean and distributions for the second part of the sample.
wheat price forecasts after harvest. In order to determine a split point between the first
This section described several major concerns and the second part of the sample, data availability
regarding WASDE interval forecasts of corn, soy- issues should be taken into account. In this study,
beans and wheat prices: (1) these intervals are 27 annual observations are available for each
characterized by relatively low hit rates; (2) in monthly forecast for each commodity. Dividing the
soybeans, symmetric forecast intervals do not accu- sample in two parts, from 1980/81 to 1994/95 and
rately reflect the shape of the underlying price from 1995/96 to 2006/07 provides us with a minimum
distribution and (3) confidence levels associated of 15 years of data to estimate forecast error
with these forecast intervals are not specified. Since distributions and 12 years out-of-sample to evaluate
error distributions of these forecasts remained rela- the performance of the proposed approaches. One
tively stable over time, confidence limits calculated drawback of an empirical method is the requirement
using empirical methods described in this study may of enough historical data in order to estimate forecast
provide a powerful tool for improving WASDE price error distributions. Taylor and Bunn (1999b) argue
forecast intervals. that ‘. . . the number of forecast errors needed
depends on the accuracy required, but it is probably
fair to say that empirical methods need at least fifty
observations to produce reasonable results’ (p. 328).
It has been shown that variance estimates using
IV. Forecast Error Distribution Estimation Gardner’s approach are unreliable for small sample
sizes and are based on model-fitting errors rather
The empirical methods of calculating confidence than true post-sample forecast errors (e.g. Bowerman
intervals are based on splitting the data in two parts and Koehler, 1989). At the same time Williams and
and fitting the method or model (e.g. Williams and Goodman (1971) demonstrate that empirical confi-
Goodman, 1971; Gardner, 1988) to the first part dence intervals using 24 observations were satisfac-
in order to find the forecast error distribution, which tory at the 80% level, but 90% and 95% levels
is then used to provide confidence limits for the improved when more observations were included.
forecast intervals in the second part of the sample. These arguments suggest that some confidence
This study uses actual USDA forecast errors from intervals calculated within this study may not be
Empirical confidence intervals 3795
very precise due to small sample sizes, but the use
of the larger sample for the calculation of the most
recent forecast intervals should provide reasonable
estimates.
Another issue is the selection of the confidence
level for interval forecasts. As was mentioned, con-
fidence levels associated with published USDA inter-
val price forecasts are not specified. Isengildina et al.
(2004) conducted a survey of USDA officials
involved in compiling USDA corn and soybean
price interval forecasts inquiring about the confidence
levels associated with published forecasts. Analyst
responses were variable across respondents (by as
much as 30% in the beginning of the season) and
over the forecasting cycle (from 65% in May prior to
harvest to 95% in April after harvest). The average
Downloaded by [Stony Brook University] at 06:23 20 December 2014
where 2 ^
¼ 1=FðxÞ½1 ^
FðxÞ, f^ ðxÞ is the hypothesized
^
density function, FðxÞ is the hypothesized cumulative
distribution function, FN ðxÞ ¼ Nx =N, and Nx is the
number of observations less than x. The Anderson–
Darling statistic is preferred to the chi-squared
statistic because it does not require binning.8
Furthermore, the Anderson–Darling statistic is
selected over the Kolmogorov–Smirnov statistic
because it concentrates on the differences between Fig. 4. Logistic distribution estimates for USDA soybean
the tails of the fitted distribution and input data price forecasts made in November based on 1980/81–1994/95
rather than the middle of the distribution. (a) and on 1980/81–2006/07 (b) sub-samples
Figure 4 presents the best fitting probability
distributions for the November soybean price fore-
casts based on the first 15 and the first 26 observa-
tions. The logistic distribution was selected as is selected. For example, Williams and Goodman
best-fitting in both cases. The main drawback of (1971) report that forecast errors from their data set
this approach is that a different best-fitting distribu- approximately followed a gamma distribution. It was
tion may be chosen from one year to the next as more found that the forecast errors from the data set used
observations are added. Allowing distributions to in the present study most often followed a logistic
vary over time can be viewed as a violation of the distribution. The logistic distribution was selected as
underlying assumption of the empirical approach best-fitting using an Anderson–Darling statistic in
that future forecast errors belong to approximately 72% of cases in corn and soybeans, and in 86% of
the same distribution as past forecast errors. To avoid cases in wheat.9 Therefore, this distribution was
this shortcoming, the probability distribution that is applied in all cases for the calculation of confidence
most often ranked as the best fitting distribution interval. The upper and lower bounds for the 80%
7
Other distributions, such as gamma and Weibull were not included in this analysis because they are not defined for negative
values.
8
To calculate the chi-squared statistic, one must break up the x-axis domain into ‘bins’ or classes of data. However, there are
no clear guidelines for selecting the number and location of the bins and different conclusions can be reached from the same
data depending on how bins are specified.
9
Computations were also made where the best-fitting distribution was allowed to change from year-to-year for corn. These
alternative computations yielded quite similar results to those limited to the logistic distribution and are available from the
authors upon request.
3798 O. Isengildina-Massa et al.
and 90% confidence levels were calculated by
integrating the logistic density function using the
Lobatto adaptive quadrature method. The empirical
forecast intervals were constructed by adding these
upper and lower bounds to the midpoint of the
published forecasts. A benefit of this approach is that
the shape and properties of the distribution are
well-known and do not change from one year to the
next as observations are added/dropped. A disadvan-
tage of this approach is that logistic distribution is
not always the best fitting parametric distribution.
Finally, since the logistic distribution is not as flexible
as kernel density estimator, it may not fit the data as
well as the kernel density function. Logistic distribu-
tion confidence intervals were computed using
MATLAB computing library.
Downloaded by [Stony Brook University] at 06:23 20 December 2014
Notes: Average interval is calculated by dividing interval range by its midpoint. Average below refers to the average size of the
interval below the midpoint, average above refers to the average size of the interval above midpoint.
Quantile regression relaxes the assumption that 80% confidence level prior to harvest and 90%
forecast errors for separate months are uncorrelated. confidence level after harvest would be wider than
All quantile regressions were estimated using Eviews published intervals. Table 3 demonstrates that prior to
econometric software. harvest, empirical confidence intervals were 1.6 times
(histogram) to 2 times (kernel density) wider on aver-
age than published forecast intervals of corn prices.10
V. Results After harvest, the difference was not as dramatic with
the histogram-based intervals about 25% wider, and
Forecast error distributions estimated using histo- kernel density-based intervals about 63% wider,
gram, kernel density, logistic distribution and quantile on average, than published intervals. Logistic distri-
regression procedures described in the previous sec- bution and quantile regression-based procedures
tion were used to calculate empirical confidence generated intervals that were wider than the
intervals for USDA corn, soybean and wheat price histogram-based intervals but narrower than kernel
forecasts over the 1995/96 through 2006/07 marketing density-based intervals. A similar pattern is observed
years. These empirical confidence intervals are com- for both soybean and wheat price forecast intervals.
puted using the first 15 observations (1980/81–1994/ Another issue with the USDA price forecasts is
95) to generate confidence limits for the 16th year whether the symmetric forecast intervals accurately
(1995/96); the first 16 observations to generate confi- reflected the distributions of the underlying commod-
dence limits for the 17th year (1996/97) and so on. ity prices. Because empirical confidence intervals were
Table 3 presents descriptive statistics of empirical calculated using actual forecast errors, they take into
confidence intervals as well as those of published account the asymmetries of error distributions. If the
USDA forecast intervals of corn, soybean and wheat forecast intervals accurately reflected the shape of the
prices over 1995/96 through 2006/07 marketing years. underlying price distribution, one would expect equal
Because of the generally low hit rates of the published probability of misses above and below the forecast
forecasts shown in Table 1, it was expected that interval. Table 4 demonstrates that for corn, soy-
the empirical confidence intervals calibrated at the beans and wheat the price forecast intervals prior to
10
It is important to keep in mind that these interval widths are conditional on the selected confidence levels (80–90%). If the
selected confidence level was equal to the actual hit rates, the widths of the empirical confidence intervals would be
approximately equal to that of published intervals.
3800 O. Isengildina-Massa et al.
Table 4. Accuracy statistics for empirical confidence intervals for USDA corn, soybean and wheat price forecasts,
1995/96–2006/07 marketing years
Notes: Empirical price forecast intervals were calculated using percentage errors from the 1980/81 marketing year forward.
Target confidence level is 80% prior to harvest and 90% after harvest.
*, ** and *** indicate significance at 10, 5 and 1% levels, respectively.
harvest the proportion of misses above the interval The next step of the analysis is to assess and
was much greater than the proportion of misses compare the performance of the empirical confidence
below the interval. This is at least partially attribut- intervals to the target confidence level. Accuracy of
able to the fact that the mean of the forecast error forecast intervals is traditionally examined in terms
distribution of soybean prices prior to harvest shifted of hit rates and forecast coverage. Hit rates, which as
to the right in the prediction sub-period as shown noted earlier describe the proportion of times forecast
in Table 2.11 Another violation of constant error intervals contain the final or ‘true’ value ( yt), may be
distribution assumption highlighted in Table 2 was defined as an indicator variable, Itk ,
the increase in variance of the forecast errors of wheat
1 if yt 2 ½lt=k ðÞ, ut=k ðÞ
prices prior to harvest in the prediction sub-sample. Itk ¼ ð8Þ
0 if yt 2
= ½lt=k ðÞ, ut=k ðÞ
This resulted in low accuracy of empirical confidence
intervals for wheat prior to harvest that was where ½lt=k ðÞ, ut=k ðÞ are the lower and upper limits
significantly different from the target level of 80% of the interval forecast for yt made at time k with
across all methods. These findings emphasize that confidence level . The closer the hit rate to the stated
empirical methods may be sensitive to the changes confidence level, the more accurate is the forecast.
in variance of the forecast error distribution. Forecast coverage is based on the expectation of the
11
There is some evidence that mean forecast error of corn prices prior to harvest also shifted to the right, but the shift was not
statistically significant.
Empirical confidence intervals 3801
indicator variable, Itk and examines whether the levels in four out of six cases. The other three
proportion of times the forecast interval includes approaches performed relatively well with only minor
the true value is equal to the target (stated ) confi- deviations from the target level in all cases except
dence level. Thus, forecast coverage may be examined wheat prior to harvest. It appears that logistic
by testing the hypothesis H0 : EðItk Þ ¼ against distribution did not capture the asymmetry in soy-
H1: EðItk Þ 6¼ . If H0 is not rejected and the interval bean price distribution after harvest as well as the
hit rate is equal to the stated confidence level, other two methods. The accuracy of the kernel
forecasts are said to be calibrated. Since the indi- density and logistic distribution methods was sig-
cator variable Itk has a binomial distribution nificantly higher than the target level in wheat after
(Christoffersen, 1998), the likelihood function under harvest.
the null hypothesis is
LðÞ ¼ ð1 Þn0 n1 ð9Þ
where L is a likelihood function. Under the alterna- VI. Summary and Conclusions
tive hypothesis, the likelihood function is
This study applied and compared several procedures
Lð pÞ ¼ ð1 pÞn0 pn1 ð10Þ
Downloaded by [Stony Brook University] at 06:23 20 December 2014
rate of 77% before harvest and 92% after harvest, of forecast intervals is an interesting topic for the
followed by quantile regression (73% and 91% before future research.
and after harvest, respectively), logistic distribution
(73% and 90% before and after harvest, respectively)
and histogram (66% and 84% before and after Acknowledgements
harvest, respectively). The kernel density-based The funding support of the USDA under cooperative
approach generated the best fitting distribution for Agreement 43-3AEK-5-80076 is gratefully acknowl-
the empirical approach, as the shape of the kernel edged. Any opinions, findings, conclusions or recom-
distribution is dictated entirely by data. This mendations expressed in this publication are those
approach, however, may be unstable in small samples of the authors and do not necessarily reflect the view
as additional observations are added or dropped. of the USDA.
Quantile regression combines the benefits of the
kernel density approach with the ability to pool the
data across years and forecasting cycles thus greatly References
increasing the number of observations used for
Allen, P. G. and Morzuch, B. J. (1995) Comparing
distribution estimation. The best fitting parametric probability forecasts derived from theoretical distribu-
distribution (logistic in this case) also provided tions, International Journal of Forecasting, 11, 147–57.
reasonable results and is attractive due to the Bowerman, B. L. and Koehler, A. B. (1989) The appropri-
known properties of the distribution. The histogram ateness of Gardner’s simple approach and Chebychev
approach, on the other hand, demonstrated poor prediction intervals, unpublished paper presented at
The 9th International Symposium on Forecasting,
performance and should be avoided in construction Vancouver, BC, 18–20 June 1989.
of empirical confidence intervals. While the results Chatfield, C. (1993) Calculating interval forecasts, Journal
of the accuracy tests do not provide a clear indication of Business and Economics Statistics, 11, 121–35.
about the preferred method, it appears that the Christoffersen, P. F. (1998) Evaluating interval forecasts,
benefits offered by quantile regression approach International Economic Review, 39, 841–62.
Cohen, J. E. (1986) Population forecasts and confidence
(flexibility of the distribution estimation and ability intervals for Sweden: a comparison of model-based
to pool data) lend it to be somewhat superior to the and empirical approaches, Demography, 23, 105–26.
other methods. Further research on the details of Gardner Jr, E. S. (1988) A simple method of computing
quantile regression use for empirical confidence prediction intervals for time-series forecasts,
interval estimation is currently under way. Management Science, 34, 541–6.
Isengildina, O., Irwin, S. H. and Good, D. L. (2004)
The findings of this study can be used to estimate Evaluation of USDA interval forecasts of corn and
confidence intervals for corn, soybean and wheat soybean prices, American Journal of Agricultural
price forecasts. Specifically, confidence limits for Economics, 84, 990–1004.
future forecasts can be calculated based on percent- Jorgensen, M. and Sjoberg, D. I. K. (2003) An effort
age points of the distributions of past forecast errors. prediction interval approach based on the empirical
distribution of previous estimation accuracy,
The use of an empirical approach to confidence Information and Software Technology, 45, 123–36.
interval calculation provides accurate intervals that Keilman, N. W. (1990) Uncertainty in National Population
reflect the properties of the underlying price Forecasting: Issues, Backgrounds, Analyses,
Empirical confidence intervals 3803
Recommendations, Vol. 3, Publications of the Taylor, J. W. and Bunn, D. W. (1999b) Investigating
Netherlands Interdisciplinary Demographic Institute improvements in the accuracy of prediction
(NIDI) and the Population and Family Study Centre intervals for combinations of forecasts: a simulation
(CBGS), Swets and Zeitlinger, Amsterdam. study, International Journal of Forecasting, 15,
Makridakis, S., Hibon, M., Lusk, E. and Belhadjali, M. 325–39.
(1987) Confidence intervals: an empirical investigation Tufano, P. (1996) Who manages risk? An empirical
of the series in the M-competition, International examination of risk management practices in the
Journal of Forecasting, 3, 489–508. gold mining industry, Journal of Finance, 51,
Makridakis, S. and Winkler, R. L. (1989) Sampling 1097–137.
distributions of post-sample forecasting errors, Ueberhuber, C. W. (1997) Numerical Computation 2:
Applied Statistics, 38, 331–42. Methods, Software, and Analysis, Springer-Verlag,
Murphy, A. H. and Winkler, R. L. (1977) Reliability of Berlin.
subjective probability forecasts of precipitation and
Vogel, F. A. and Bange, G. A. (1999) Understanding
temperature, Journal of the Royal Statistical Society,
USDA crop forecasts, Miscellaneous Publication
Series C, 26, 41–7.
No. 1554, US Department of Agriculture, National
Rebba, R. and Mahadevan, S. (2006) Model predictive
capability assessment under uncertainty, American Agricultural Statistics Service and Office of the Chief
Institute of Aeronautics and Astronautics Journal, 44, Economist, World Agricultural Outlook Board,
2376–84. Washington, DC.
Shlyakhter, A. I., Kammen, D. M., Broido, C. L. and Williams, W. H. and Goodman, M. L. (1971) A simple
Downloaded by [Stony Brook University] at 06:23 20 December 2014
Wilson, R. (1994) Quantifying the credibility of energy method for the construction of empirical confidence
projections from trends in past data, Energy Policy, 22, limits for economic forecasts, Journal of the American
119–30. Statistical Association, 66, 752–4.
Silverman, B. W. (1986) Density Estimation for Statistics World Agricultural Supply and Demand Estimates
and Data Analysis, J.W. Arrowsmith Ltd., Bristol. (WASDE). Selected issues from 1980 through 2007.
Stoto, M. A. (1983) The accuracy of population projections, Available at: http://www.usda.gov/oce/commodity/
Journal of American Statistical Association, 78, 13–20. wasde/index.htm (accessed 22 March 2010).
Taylor, J. W. and Bunn, D. W. (1999a) A quantile Zarnowitz, V. (1992) Business Cycles: Theory, History,
regression approach to generating prediction intervals, Indicators, and Forecasting, The University of Chicago
Management Science, 45, 225–37. Press, Chicago.