Professional Documents
Culture Documents
Testbank KTLTC Trac Nghiem Kinh Te Luong Tai Chinh
Testbank KTLTC Trac Nghiem Kinh Te Luong Tai Chinh
Testbank KTLTC Trac Nghiem Kinh Te Luong Tai Chinh
kinh tế lượng tài chính (Trường Đại học Kinh tế Thành phố Hồ Chí Minh)
Chapter 1
Correct answers denoted by an asterisk.
1. The linear relationship between two variables (y and x) can be represented by the
equation . Which of the following statements is true?
(I) Parameter a is termed the intercept
(II) Parameter a is termed the slope
(III) Parameter b is termed the gradient
(IV) Parameter b is termed the constant.
(a) I and IV only
(b)* I and III only
(c) II and III only
(d) II and IV only.
2. Assume that the relationship between a company’s stock price (y) and dividends paid
per share (x) is linear. If the slope of the equation is 0.50 and the intercept is 30, what
would be the expected stock price if the dividend paid was 3?
(a) 33
(b) 30.50
(c)* 31.5
(d) 30.
3. Which of the following values are closes to the roots of the following quadratic
equation: ?
(a) 0 and 4
(b) 1 and 4
(c) 0.5 and 3
(d)* 0.3 and 3.7.
(A) (B)
b. log(x)
c. ex
d. x.
12. Writing out all the terms in the expression would lead to:
a. * x11 + x12 + x21 + x22
b. x11 × x12 × x21 × x22
c. x1 + x2
d. x11 + x22.
a. 5/x
b. * 1/x
c. 5log(x)
d. 5/log(x).
(a)*
(b)
(c)
(d) .
21. For two conformable matrices A and B, expanding the parentheses of (AB)-1 gives:
a. A-1B-1
b. * B-1A-1
c. BA
d. AB.
(a)
(b)*
(c)
(d) .
25. The point where the capital market line is tangential to the efficient frontier is
(a) The point where the portfolio returns are minimised
(b) The point where the portfolio returns are maximised
(c) The point where the portfolio’s Sharpe ratio is minimised
(d)* The point where the portfolio’s Sharpe ratio is maximised.
(a)* The sampling distribution of the mean of any random sample of observations will
tend towards the normal distribution with mean equal to the population mean as the
sample size tends to infinity
(b) The sampling distribution of the mean of any random sample of observations will
tend towards the normal distribution with mean equal to the population mean as the
sample size tends to zero
(c) The cumulative distribution function of the mean of any random sample of
observations will tend towards the normal distribution with mean equal to the population
mean as the sample size tends to infinity
(d) The probability distribution function of the mean of any random sample of
observations will tend towards the normal distribution with mean equal to the population
mean as the sample size tends to infinity.
27. Consider the following data series: 11, 10, 6, 8, 4, 3, 7. What is its semi-interquartile
range of this series?
(a) 6
(b) 5
(c) 4
(d)* 3.
(A) (B)
29. Suppose that we have a function given by y = a1 + a2x. a1 and a2 are, respectively,
(a) The domain and range of the function
(b) The order and power of the function
(c) * The intercept and slope of the function
(d) The slope and intercept of the function.
32. Which of the following three equations is/are correct regarding the summation
operator?
i=1Kxi+i=1Kzi=i=1Kxi+zi (i)
i=1Kxizi=i=1Kxii=1Kzi (ii)
i=1Kcxi=ci=1Kxi (iii)
a. (iii) only
b. * (i) and (iii) only
c. (i) and (ii) only
d. (i), (ii) and (iii).
(d) * A constant.
35. Using the chain rule or otherwise, what is the (first) derivative of the following
function? y = (2x2 + 4x – 6)3
(a) 3(2x2 + 4x – 6)2
(b) (4x + 4)2
(c) * 3(2x2 + 4x – 6)2(4x + 4)
(d) 3(2x2 + 4x – 6)3(4x + 4)2.
Chapter 2
4. Data that have been collected over a period of time on one or more variables is referred
to as
(a) Cross-sectional data
(b) Time-cross-sectional data
(c)* Time-series data
(d) Panel data.
5. Data that have been collected on one or more variables at a single point in time is
referred to as
(a)* Cross-sectional data
7. An individual invested £106.40 in the stock market and the value of his investment two
years later is £138.22. What are the simple and continuously compounded returns on his
investment?
(a) 26% and 30%, respectively
(b) –29% and -34%, respectively
(c)* 30% and 26%, respectively
(d) 30% and 30%, respectively.
8. An individual has £10000 capital to invest in the stock market. He invests 30% of his
capital in stock A, 25% in stock B and 45% in Stock C. What is the return on his/her
portfolio assuming that the simple returns on stocks A, B and C are 5%, 10% and 12%,
respectively?
(a)* 9.0%
(b) 9.7%
(c) 9.3%
(d) 9%.
The average nominal annual rent in the US denominated in dollars and the CPI (2008
levels) are given in the table below:
Year Average annual rent (US Dollars) CPI (2008 levels)
200 9908 100
8
200 9998 99.7
9
201 10012 101.3
0
2011 10180 104.5
201 10396 106.7
2
(d) 11093.
11. The numerical score assigned to the credit rating of a bond is best described as what
type of number?
(a) Continuous
(b) Cardinal
(c)* Ordinal
(d) Nominal.
12. Suppose that we wanted to sum the 2007 returns on ten shares to calculate the return
on a portfolio over that year. What method of calculating the individual stock returns
would enable us to do this?
(a)* Simple
(b) Continuously compounded
(c) Neither approach would allow us to do this validly
(d) Either approach could be used and they would both give the same portfolio return.
13. If we wish to compare the spread of two series with considerably different mean
values, which of the following measures would be the most appropriate?
(a) The semi-interquartile range
(b) The standard deviation
(c) The range
(d) * The coefficient of variation.
14. For a series with a negative skew in its distribution (a long left tail), which of the
following best describes the relationship between its measures of central tendency?
(a) mean > median > mode
(b) * mode > median > mean
(c) mode > mean > median
(d) median > mode > mean.
15. Which of the following statements is TRUE concerning the correlation between two
series?
(a) * It is unit-free
(b) It scales with the product of the units of the two series
(c) It scales with the ratio of the units of the two series
(d) It will take the value –1 if there is no association between the two series.
16. What is the sum of the following infinite set of terms? 5, 2.5, 1.25, 0.625, …
(a) Infinity
(b) 5
(c) 20
(d) * 10.
17. What is the sum of the first 12 terms in the following sequence? 12, 24, 48, …
(a) * 49,140
(b) 24,576
(c) 768
(d) 98,292.
18. If I have £10,000 now and I want it to grow by 50% within eight years, what interest
rate, compounded annually, is required (to one decimal place)?
(a) * 5.2%
(b) 6.2%
(c) 4.6%
(d) 7.4%.
19. If a savings account pays a nominal interest rate of 10% per year, compounded
monthly, what is the effective interest rate to one decimal place?
(a) 11.2%
(b) 9.5%
(c) 10.0%
(d) * 10.5%.
20. If you place £10,000 in a savings account, how long would it take to reach £20,000
assuming an annual interest rate of 3%, continuously compounded, rounded to the nearest
year?
(a) 26
(b) 34
(c) *23
(d) 20.
21. What would be a fair price to pay today, to the nearest dollar, for a zero coupon bond
having exactly six years to maturity and to be redeemed at $1000 if the annual discount
rate is 6%?
(a) $1000
(b) * $747
(c) $864
(d) $553.
22. Which of the following statements is FALSE concerning the internal rate of return?
(a) For projects where the cashflow payments change sign, there can be more than one
internal rate of return
(b) The internal rate of return is the discount rate that sets the net present value of all of
the cashflows to be received equal to the asset’s purchase price
(c) * In order to calculate an internal rate of return, all of the incoming cashflows must be
identical
(d) We cannot calculate a different internal rate of return for each cashflow.
Chapter 3
2. What does a positive linear relationship between x and y in a simple regression imply?
(a) Increases in the independent variable are usually accompanied by increases in the
regressor
(b) The relationship between x and y cannot be explained by a straight line
(c) Decreases in the independent variable is usually accompanied by increases in the
regressors
(d)* Increases in the regressor are usually accompanied by increases in the dependent
variable.
3. Which of these is NOT a reason for adding a disturbance term to a regression model
?
(a) Some determinants of the effect variable may be omitted from the model
(b) Some determinants of the effect variable may be unobservable
(c)* Some determinants of the independent variable may be omitted from the model
(d) There may be errors in the way that the dependent variable is measured which cannot
be modelled.
5. The method of estimating econometric models which involves fitting a line to the data
by minimising the sum of squared residuals is the
(a)* Method of ordinary least squares
(b) Method of moments
(c) Method of generalised squared moments
(d) Method of maximum likelihood.
6. Suppose you have 5-year annual data on the excess returns on a fund manager’s
portfolio (‘fund ABC’) and the excess returns on a market index (where is the return
on fund ABC, is the risk-free rate and is the return on the market index):
Year t Excess return on fund ABC Excess return on market index
1 14.0 16.0
2 32.0 21.7
3 11.6 6.0
4 21.2 16.2
5 17.4 11.0
7. Given the data in Question 6, what is the estimated beta ( ) of Fund ABC?
(a) 3.1
(b) 2.1
(c)* 1.1
(d) None of the above.
8. Suppose that the unbiased estimator of the standard deviation of the disturbance (s) is
5.1. What is the nearest value to the standard errors of the estimated CAPM alpha ( ) of
Fund ABC from Question 6?
(a) 3.5
(b) 4.5
(c) 5.5
(d)* 6.5.
9. The estimated alpha ( ) and beta ( ) of a rival fund, Fund DEF, are 2.3 and 3.1,
respectively. If the expected market risk premium is 12%, what would we expect the
excess return of Fund DEF to be?
(a)* 39.5%
(b) 30.7%
(c) 5.4%
(d) 64.8%.
11. The estimators and determined by OLS will be the Best Linear Unbiased
Estimators (BLUE) if which of the following assumptions hold?
(I) The errors have zero mean
(II) The variance of the errors is constant and finite over all values of the independent
variable(s)
(III) The errors are linearly independent of one another
(IV)There is no relationship between the error and corresponding independent variables
(a) I and II only
(b) I, II and III only
(c) II, III and IV only
(d)* I, II, III, and IV.
13. Using the test of significance approach, what is the test statistic value of a hypothesis
to test whether the true value of statistically different from zero?
(a)* 1.10
(b) 0.91
(c) –0.62
(d) Cannot say without more information.
14. Assuming there are 1000 observations in your sample, what are the test statistic and
critical value of a two-sided hypothesis test of whether the true value of statistically
different from zero be given a 5% significance level?
(a)* 1.10 and 1.96, respectively
(b) 0.91 and 1.65, respectively
(c) –0.62 and 1.96, respectively
(d) Cannot say without more information.
15. Consider a bivariate regression model with coefficient standard errors calculated
using the usual formulae. Which of the following statements is/are correct regarding the
standard error estimator for the slope coefficient?
i. It varies positively with the square root of the residual variance (s)
ii. It varies positively with the spread of X about its mean value
iii. It varies positively with the spread of X about zero
iv. It varies positively with the sample size T
a. * (i) only
b. (i) and (iv) only
c. (i), (ii) and (iv) only
d. (i), (ii), (iii) and (iv).
16. In a time-series regression of the excess return of a mutual fund on a constant and the
excess return on a market index, which of the following statements should be true for the
fund manager to be considered to have ‘beaten the market’ in a statistical sense?
a. * The estimate for α should be positive and statistically significant
b. The estimate for α should be positive and statistically significantly greater than the
risk-free rate of return
c. The estimate for β should be positive and statistically significant
d. The estimate for α should be negative and statistically significant.
(c) * That OLS gives minimum variance coefficient estimates only among the class of
linear unbiased estimators
(d) That OLS ensures that the errors are distributed normally.
19. Which of the following is a correct interpretation of a ‘95% confidence interval’ for a
regression parameter?
(a) * We are 95% sure that the interval contains the true value of the parameter
(b) We are 95% sure that our estimate of the coefficient is correct
(c) We are 95% sure that the interval contains our estimate of the coefficient
(d) In repeated samples, we would derive the same estimate for the coefficient 95% of the
time.
20. Which of the following statements is correct concerning the conditions required for
OLS to be a usable estimation technique?
a. * The model must be linear in the parameters
b. The model must be linear in the variables
c. The model must be linear in the variables and the parameters
d. The model must be linear in the residuals.
21. Which of the following is NOT a good reason for including a disturbance term in a
regression equation?
(a) It captures omitted determinants of the dependent variable
(b) * To allow for the non-zero mean of the dependent variable
(c) To allow for errors in the measurement of the dependent variable
(d) To allow for random influences on the dependent variable.
22. Which of the following is NOT correct with regard to the p-value attached to a test
statistic?
(a) * p-values can only be used for two-sided tests
(b) It is the marginal significance level where we would be indifferent between rejecting
and not rejecting the null hypothesis
(c) It is the exact significance level for the test
(d) Given the p-value, we can make inferences without referring to statistical tables.
23. Which one of the following is NOT an assumption of the classical linear regression
model?
a. The explanatory variables are uncorrelated with the error terms.
b. The disturbance terms have zero mean
c. * The dependent variable is not correlated with the disturbance terms
d. The disturbance terms are independent of one another.
24. Which of the following is the most accurate definition of the term ‘the OLS
estimator’?
(a) It comprises the numerical values obtained from OLS estimation
(b) * It is a formula that, when applied to the data, will yield the parameter estimates
(c) It is equivalent to the term ‘the OLS estimate’
(d) It is a collection of all of the data used to estimate a linear regression model.
25. Two researchers have identical models, data, coefficients and standard error
estimates. They test the same hypothesis using a two-sided alternative, but researcher 1
uses a 5% size of test while researcher 2 uses a 10% test. Which one of the following
statements is correct?
a. Researcher 2 will use a larger critical value from the t-tables
b. * Researcher 2 will have a higher probability of type I error
c. Researcher 1 will be more likely to reject the null hypothesis
d. Both researchers will always reach the same conclusion.
26. Consider an increase in the size of the test used to examine a hypothesis from 5% to
10%. Which one of the following would be an implication?
a. * The probability of a Type I error is increased
b. The probability of a Type II error is increased
c. The rejection criterion has become more strict
d. The null hypothesis will be rejected less often.
27. What is the relationship, if any, between the normal and t-distributions?
a. A t-distribution with zero degrees of freedom is a normal
b. A t-distribution with one degree of freedom is a normal
c. * A t-distribution with infinite degrees of freedom is a normal
d. There is no relationship between the two distributions.
Chapter 4
yt = β1 + β2X2t + β3X3t + ut
Suppose that a researcher wishes to test the null hypothesis: β2 = 1 and β3 + β4 = 1. The
TABULATED value of the F-distribution that we would compare the result of testing this
hypothesis with at the 10% level would be approximately
(a) 19.48
(b) 2.76
(c) * 2.37
(d) 3.11.
7. Which one of the following statements must hold for EVERY CASE concerning the
residual sums of squares for the restricted and unrestricted regressions?
a. URSS > RRSS
b. URSS ≥ RRSS
c. RRSS > URSS
d. * RRSS ≥ URSS.
8. Which one of the following is the most appropriate as a definition of R2 in the context
that the term is usually used?
a. It is the proportion of the total variability of y that is explained by the model
b. * It is the proportion of the total variability of y about its mean value that is
explained by the model
c. It is the correlation between the fitted values and the residuals
d. It is the correlation between the fitted values and the mean.
9. Suppose that the value of R2 for an estimated regression model is exactly one. Which
of the following are true?
i. All of the data points must lie exactly on the line
ii. All of the residuals must be zero
iii. All of the variability of y about its mean has been explained by the model
i. The fitted line will be horizontal with respect to all of the explanatory variables.
11. Which of the following are often considered disadvantages of the use of adjusted R2
as a variable addition/variable deletion rule?
i. Adjusted R2 always rises as more variables are added
ii. Adjusted R2 often leads to large models with many marginally significant or
marginally insignificant variables
iii. Adjusted R2 cannot be compared for models with different explanatory variables
iv. Adjusted R2 cannot be compared for models with different explained variables.
(a) * I only
(b) I and II only
(c) I and III only
(d) I, II and III.
13. If you are interested in conducting a multiple hypotheses test to determine whether
and are both unity for a regression , what would the
restricted regression be?
(a)
(b)
(c)*
(d) .
14. What would the restricted regression be if you are interested in testing the null
hypothesis and against the alternative hypothesis or
for a regression ,?
(a)*
(b)
(c)
(d) .
15. Assuming that the restricted sum of squares of the restricted regression in Question 14
is 436.1 and the unrestricted sum of squares is 397.2, what would the conclusion of the
hypothesis test be? (The significance level is 5%.)
(a)* Reject the null hypothesis
(b) Do not reject the null hypothesis
(c) Reject the alternative hypothesis
(d) Cannot say.
17. Trying many variables in a regression without basing the selection of candidate
variables on a financial or economic theory is popularly referred to as
(a) Data fitting
(b) Data clipping
(c)* Data mining
(d) None of the above.
18. Why is R2 a commonly used and perhaps better measure of how well a regression
model fits the data than the residual sum of squares (RSS)?
(a) The RSS is often too large
(b) The RSS does not depend on the scale of the dependent variable whereas the R2 does
(c)* The RSS depends on the scale of the dependent variable whereas the R2 does not
(d) The RSS depends on the scale of the independent variable whereas the R2 does not.
19. How can the two models be validly compared to determine the model that better
represents the data yt?
(a) By observing their respective R2
(b) By observing their respective Adjusted R2
(c) By estimating an encompassing or hybrid model
(d)* All of the above.
20. What is the relevant encompassing model required to compare the two regression
models?
(a)*
(b)
(c)
(d) Encompassing models cannot be used to compare these specifications.
Chapter 5
2. Which of the following would NOT be a potential remedy for the problem of
multicollinearity between regressors?
(a) Removing one of the explanatory variables
(b) * Transforming the data into logarithms
(c) Transforming two of the explanatory variables into ratios
(d) Collecting higher frequency data on all of the variables.
3. Which of the following conditions must be fulfilled for the Durbin–Watson test to be
valid?
(i) The regression includes a constant term
(ii) The regressors are non-stochastic
(iii) There are no lags of the dependent variable in the regression
(iv) There are no lags of the independent variables in the regression.
6. Which of the following is NOT a good reason for including lagged variables in a
regression?
(a) Slow response of the dependent variable to changes in the independent variables
(b) Over-reactions of the dependent variables
(c) The dependent variable is a centred moving average of the past 4 values of the series
8. Which of the following would you expect to be a problem associated with adding
lagged values of the dependent variable into a regression equation?
(a) * The assumption that the regressors are non-stochastic is violated
(b) A model with many lags may lead to residual non-normality
(c) Adding lags may induce multicollinearity with current values of variables
(d) The standard errors of the coefficients will fall as a result of adding more explanatory
variables.
9. A normal distribution has coefficients of skewness and excess kurtosis which are,
respectively,
(a) * 0 and 0
(b) 0 and 3
(c) 3 and 0
(d) Will vary from one normal distribution to another.
10. Which of the following would probably NOT be a potential ‘cure’ for non-normal
residuals?
(a) * Transforming two explanatory variables into a ratio
(b) Removing large positive residuals
(c) Using a procedure for estimation and inference which did not assume normality
(d) Removing large negative residuals.
11. What would be the consequences for the OLS estimator if autocorrelation is present
in a regression model but ignored?
(a) It will be biased
b. It will be inconsistent
b. * It will be inefficient
b. All of (a), (b), and (c) will be true.
12. If OLS is used in the presence of heteroscedasticity, which of the following will be
likely consequences?
i. Coefficient estimates may be misleading
ii. Hypothesis tests could reach the wrong conclusions
iii. Forecasts made from the model could be biased
13. If a residual series is negatively autocorrelated, which one of the following is the
most likely value of the Durbin–Watson statistic?
a. Close to zero
b. Close to two
c. * Close to four
d. Close to one.
14. If the residuals of a model containing lags of the dependent variable are
autocorrelated, which one of the following could this lead to?
a. Biased but consistent coefficient estimates
b. * Biased and inconsistent coefficient estimates
c. Unbiased but inconsistent coefficient estimates
d. Unbiased and consistent but inefficient coefficient estimates.
16. Which one of the following would be the most appropriate auxiliary regression for a
Ramsey RESET test of functional form?
(a) *
b.
b.
b. .
17. If a regression equation contains an irrelevant variable, the parameter estimates will
be
a. * Consistent and unbiased but inefficient
b. Consistent and asymptotically efficient but biased
c. Inconsistent
d. Consistent, unbiased and efficient.
18. Put the following steps of the model-building process in the order in which it would
be statistically most appropriate to do them:
(i) Estimate model
(ii) Conduct hypothesis tests on coefficients
(iii) Remove irrelevant variables
(iv) Conduct diagnostic tests on the model residuals.
19. Test statistics for the LM test and the Wald test are usually constructed to follow a
(a)* χ2 distribution and F-distribution, respectively
(b) χ2 distribution and t-distribution, respectively
(c) F-distribution and χ2 distribution, respectively
(d) t-distribution and χ2 distribution, respectively.
(a) I only
(b) I and II
(c)* I, II, and III
(d) I, II, III, and IV.
(A) (B)
25. The graphs above are time series plots of residuals from two separate regressions.
Which of these combinations is true?
(a)* A shows negative autocorrelation and B shows positive autocorrelation
(b) A shows positive autocorrelation and B shows negative autocorrelation
(c) A shows heteroscedasticity and B shows homoscedasticity
(d) A shows homoscedasticity and B shows heteroscedasticity.
(a) I only
(b) I and II only
(c)* I, II, and III only
(d) I, II, III, and IV.
30. Which of the following statements are true about parameter stability tests?
(I) Parameter stability tests test the assumption that the estimated parameters of a model
are constant for the entire sample
(II) Chow test and predictive failure tests are two types of parameter stability tests
(III) Backward and forward predictive failure tests are two types of parameter stability
tests
(IV) Parameter stability tests examine violations of the classical linear regression model
assumptions.
(a) I only
(b) I and II only
(c)* I, II, and III only
(d) I, II, III, and IV.
(a) 0.2
(b) * 0.23
(c) 0.5
(d) There is insufficient information given in the question to form more than a one-step-
ahead forecast.
a. 0.4
b. 0.0
c. * 0.07
d. –0.1.
5. A process, xt, which has a constant mean and variance, and zero autocovariance for all
non-zero lags is best described as
6. Which of the following conditions must hold for the autoregressive part of an ARMA
model to be stationary?
(a) * All roots of the characteristic equation must lie outside the unit circle
(b) All roots of the characteristic equation must lie inside the unit circle
(c) All roots must be smaller than unity
(d) At least one of the roots must be bigger than one in absolute value.
8. If a series, yt, follows a random walk (with no drift), what is the optimal 1-step-ahead
forecast for y?
a. * The current value of y
b. Zero
c. The historical unweighted average of y
d. An exponentially weighted average of previous values of y.
9. Consider a series that follows an MA(1) with zero mean and a moving average
coefficient of 0.4. What is the value of the autocorrelation function at lag 1?
a. 0.4
b. 1
c. *0.34
d. It is not possible to determine the value of the autocovariances without knowing
the disturbance variance.
11. Consider the following picture and suggest the model from the following list that best
characterises the process:
a. An AR(1)
b. An AR(2)
c. * An ARMA(1,1)
d. An MA(3).
The acf is clearly declining very slowly in this case, which is consistent with their being an
autoregressive part to the appropriate model. The pacf is clearly significant for lags 1 and
2, but the question is: does it them become insignificant for lags 2 and 4, indicating an
AR(2) process, or does it remain significant, which would be more consistent with a
mixed ARMA process? Well, given the huge size of the sample that gave rise to this acf
and pacf, even a pacf value of 0.001 would still be statistically significant. Thus an
ARMA process is the most likely candidate, although note that it would not be possible to
tell from the acf and pacf which model from the ARMA family was more appropriate.
The DGP for the data that generated this plot was y_t = 0.9 y_(t–1) – 0.3 u_(t–1) + u_t.
12. Which of the following models can be estimated using ordinary least squares?
i. An AR(1)
(ii) An ARMA(2,0)
(iii) An MA(1)
(iv) An ARMA(1,1).
a. (i) only
b. * (i) and (ii) only
13. If a series, y, is described as ‘mean-reverting’, which model from the following list is
likely to produce the best long-term forecasts for that series y?
a. A random walk
b. * The long term mean of the series
c. A model from the ARMA family
d. A random walk with drift.
14. Consider the following AR(2) model. What is the optimal 2-step-ahead forecast for y
if all information available is up to and including time t, if the values of y at time t, t-1
and t-2 are –0.3, 0.4 and –0.1, respectively, and the value of u at time t-1 is 0.3?
yt = –0.1 + 0.75yt-1 – 0.125yt-2 + ut
a. –0.1
b. 0.27
c. * –0.34
d. 0.30.
15. What is the optimal three-step-ahead forecast from the AR(2) model given in
Question 14?
a. –0.1
b. 0.27
c. –0.34
d. * –0.31.
16. Suppose you had to guess at the most likely value of a one hundred-step-ahead
forecast for the AR(2) model given in Question 14 – what would your forecast be?
a. -0.1
b. 0.7
c. * –0.27
d. 0.75.
Use the following to answer Questions 19 and 20. Suppose that you have estimated the
first five autocorrelation coefficients using a series of length 81 observations and found
them to be
19. Which autocorrelation coefficients are significantly different from zero at the 5%
level?
(a) The first and fifth autocorrelation coefficient
(b) The first, second, third, and fifth autocorrelation coefficient
(c)* The first, third, and fifth autocorrelation coefficient (+/-1.96/căn T)
(d) The second and fourth autocorrelation coefficient.
21. Consider the following MA(2) process where the errors follow a
standard normal distribution. What is the variance of ?
(a)
(b)
(c)
(d)* All of the above.
22. A model where the current value of a variable depends upon only the values that the
variable took in previous periods plus an error term is called
(a)* An autoregressive model
(b) An autoregressive moving average model
(c) An autoregressive integrated moving average model
(d) A periodic lag model.
25. Which of these is an appropriate way to determine the order of an ARMA model
required to capture the dynamic features of a given data?
(a) Graphically plotting the time series of the data
(b) Determining the number of parameters that maximises the information criteria
(c)* Determining the number of parameters that minimises the information criteria
(d) None of the above.
(b)* The length of the in-sample period used to estimate the model is fixed so that the
start date and end date successively increase by one observation
(c) The initial estimation date changes as additional observations are added one at a time
to the estimation period
(d) The length of the out-of-sample period used to estimate the model is fixed so that the
start date and end date successively increase by one observation.
28. What are the closest to the mean squared errors for model A and B’s forecasts?
(a) 0.58 and 0.98, respectively
(b) 0.98 and 0.58, respectively
(c)* 0.45 and 1.95, respectively
(d) 1.95 and 0.45, respectively.
29. What are the closest to the mean absolute errors from models A and B?
(a)* 0.58 and 0.98, respectively
(b) 0.98 and 0.58, respectively
(c) 0.45 and 1.95, respectively
(d) 1.95 and 0.45, respectively.
30. Based on the MAE and MSE forecast evaluation metrics, which of these statements
are true?
(a)* Model A outperforms Model B at forecasting the house price index
(b) Model A underperforms Model B at forecasting the house price index
(c) Model A and Model B perform equally well at forecasting the house price index
(d) We cannot tell which model does best.
Chapter 7
Assume that the Y’s are endogenous and the X’s exogenous variables, and that the error
terms are uncorrelated.
(a) Equations that are part of a recursive system can be validly estimated using OLS
(b) Unnecessary use of two-stage least squares (2SLS) – i.e., on a set of right hand side
variables that are in fact exogenous – will result in consistent but inefficient coefficient
estimates
(c) 2SLS is just a special case of instrumental variables (IV) estimation
(d) * 2SLS and indirect least squares (ILS) are equivalent for over-identified systems.
(a) We do not need to specify which variables are endogenous and which are exogenous
(b) Standard form VARs can be estimated equation-by-equation using OLS
Which of the following coefficient significances are required to be able to say that y1
Granger-causes y2 but not the other way around?
a. α13 and α14 significant; α21 and α22 not significant
b. * α21 and α22 significant; α13 and α14 not significant
c. α21 and α23 significant; α11 and α13 not significant
d. α11 and α13 significant; α21 and α23 not significant.
7. Which of the following statements is TRUE concerning VAR impulse response
functions?
(i) Impulse responses help the researcher to investigate the interactions between the
variables in the VAR
(ii) An impulse response analysis is where we examine the effects of applying unit shocks
to all of the variables at the same time
(iii) Impulse responses involve calculating the proportion of the total forecast error
variance of a given variable that is explained by innovations to each variable
(iv) If the ±2 standard error bars around the impulse responses for a given lag span (i.e.,
include) the x-axis, it would be said that the response is statistically significant.
(a) (i), (ii), (iii), and (iv)
(b) (i), (ii), and (iii) only
(c) (i) only
(d) * (i) and (ii) only.
8. In the context of simultaneous equations modelling, which of the following statements
is TRUE concerning an exogenous variable?
a. The values of exogenous variables are determined within the system
b. * The exogenous variables are assumed to be fixed in repeated samples
c. Reduced form equations will not contain any exogenous variables on the RHS
d. Reduced form equations will contain only exogenous variables on the LHS.
9. Comparing the information criteria approach with the likelihood ratio test approach to
determining the optimal VAR lag length, which one of the following statements is true?
a. The choice of stiffness of penalty term will not affect the model choice
b. The validity of information criteria relies upon normal residuals
c. * Conducting a likelihood ratio test could lead to a sub-optimal model selection
d. An application of the univariate information criteria to each equation will give
identical results to the application of a multivariate version of the criteria to all of
the equations jointly.
10. The second stage in two-stage least squares estimation of a simultaneous system
would be to
(c)
(d) None of the above.
12. A variable x is defined as ________ if its value is determined outside of the equation
or system of equations. What is the blank?
(a) Endogenous
(b)* Exogenous
(c) Homogeneous
(d) Heterogeneous.
13. Which of these is not an appropriate method of estimating equations that are from a
simultaneous system?
(a) Indirect least squares
(b) Two-stage least squares
(c)* Aggregate least squares
(d) Instrumental variables.
14. Which of these statements is true about vector autoregressive models?
(I) They allow the value of a variable to depend on more than just its own lags
(II) All variables are endogenous
(III) The researcher does not need to specify which variables are endogenous or
exogenous
(IV) All variables are exogenous
(a) I only
(b) I and II only
(c)* I, II, and III only
(d) I, II, III, and IV.
15. Which of these is an approach used to determine the appropriate lag lengths of VAR
models?
(a) Graphically plotting the time series of the data
(b) Selecting the number of lags that maximises the information criteria
(c)* Selecting the number of lags that minimises the information criteria
Chapter 8
2. Which of the following are problems associated with the Engle–Granger approach to
modelling using cointegrated data?
(i) The coefficients in the cointegrating relationship are hard to calculate
(ii) This method requires the researcher to assume that one variable is the dependent
variable and the others are independent variables
(iii) The Engle–Granger technique can only detect one cointegrating relationship
(iv) The Engle-Granger technique does not allow the testing of hypotheses involving the
actual cointegrating relationship.
(a) Johansen’s test for cointegration centres on the rank of the matrix Γ1
(b) If the variables yt are cointegrated, Π will be of full rank
(c) If the rank of Π is zero, the variables are cointegrated
(d) * Provided that all of the series in y are non-stationary, the rank of Π can be at most
k-1.
5. You have the following data for Johansen’s λmax rank test for cointegration between 4
international equity market indices:
r λmax 5% Critical Value
0 40.03 30.26
1 26.81 23.84
2 13.42 17.72
3 8.66 10.71
(a) 0
(b) 1
(c) * 2
(d) 3.
Which one of the following most accurately describes the process for yt?
8. Which one of the following best describes most series of asset prices?
(a) An independently and identically distributed (iid, i.e., ‘completely random’) process
(b) * A random walk with drift
(c) An explosive process
(d) A deterministic trend process.
9. If there are three variables that are being tested for cointegration, what is the maximum
number of linearly independent cointegrating relationships that there could be?
(a) 0
(b) 1
(c) *2
(d) 3.
10. If the number of non-zero eigenvalues of the pi matrix under a Johansen test is 2, this
implies that
(a) * There are 2 linearly independent cointegrating vectors
(b) There are at most 2 linearly independent cointegrating vectors
(c) There are 3 variables in the system
(d) There are at least 2 linearly independent cointegrating vectors.
11. If a Johansen ‘max’ test for a null hypothesis of 1 cointegrating vectors is applied to a
system containing 4 variables, which eigenvalues would be used in the test?
(a) The largest 1
(b) * The second largest
(c) The second smallest
(d) The smallest.
12. Consider the testing of hypotheses concerning the cointegrating vector(s) under the
Johansen approach. Which of the following statements is correct?
(a) If the restriction is (are) rejected, the number of cointegrating vectors will rise
(b) If the restriction(s) is (are) rejected, the number of eigenvalues will fall
(c) Whether the restriction is supported by the data or not, the eigenvalues are likely to
change at least slightly upon imposing the restriction(s)
(d) * All linear combinations of the cointegrating vectors are themselves cointegrating
vectors.
14. Which of the following are consequences of using non-stationary data in regressions?
(I) Shocks will be persistent
(II) It can lead to spurious regressions
(III) t-ratios will not follow a t-distribution
(IV) The F-statistic will not follow an F-distribution.
(a) I only
(b) I and II only
(c) I, II, and III only
19. A researcher would like to test for a unit root in a series. She runs the regression
. What should her null hypothesis be assuming that she adopts the
Dickey–Fuller test approach?
(a)*
(b)
(c)
(d) .
20. Assuming the researcher in Question 19 would like to run an augmented Dickey–
Fuller test instead. What is the appropriate regression she would have to run and the null
hypothesis of the test?
22. Assume that you are trying to model the relationship between house prices and rents.
If you find that both series are non-stationary and a linear combination of the two series is
stationary, which of the following is true?
(I) Regressing the levels of house prices on the levels of rents could lead to spurious
regressions
(II) House prices and rents are cointegrated
(III) An appropriate linear combination of house prices and rents is I(1)
(IV) House prices and rents are not cointegrated.
(a) I only
(b)* I and II only
(c) I, II, and III only
(d) I, II, III, and IV only.
Chapter 9
1. Volatility clustering is
(a) The tendency for financial asset returns to have distributions that exhibit fat tails
(b)* The tendency for financial asset return volatility to appear in bunches
(c) The tendency for volatility to rise more following a large price fall than following a
price rise of the same magnitude
(d) All of the above.
(a) I only
(b)* I and II only
(c) I, II, and III only
(d) I, II, III, and IV.
(a) I only
(b) I and II only
(c) I, II, and III only
(d)* I, II, III, and IV.
9. GJR and EGARCH are types of GARCH models that allow for:
(a) An asymmetric response of returns to positive and negative shocks in the dependent
variable
(b) An asymmetric response of returns to positive and negative shocks to its lagged
values
(c) A symmetric response of volatility to positive and negative shocks
(d)* An asymmetric response of volatility to positive and negative shocks.
10. Assume that you have estimated a GJR model of monthly stock returns and you
obtain the following equations:
Suppose that , what would be the fitted conditional variance for time t if
and then if ?
(a) 1.62 and 1.67, respectively
(b) 1.64 and 1.59, respectively
(c)* 1.59 and 1.64, respectively
(d) 1.67 and 1.62, respectively.
11. Suppose that a researcher estimates a GARCH(1,1) model and obtains a log
likelihood function (LLF) value of 71.22. She is interested in testing whether an
ARCH(1) model is a better model at describing volatility. If she estimates a model which
imposes the necessary restrictions and obtains an LLF value of 68.21, what would be the
conclusion of her likelihood ratio test (assuming a 5% significance level)?
(a) Statistical evidence suggesting that ARCH(1) is better than GARCH(1,1)
(b)* Statistical evidence suggesting that ARCH(1) is not better than GARCH(1,1)
(c) Statistical evidence suggesting that GARCH(1,1) is better than ARCH(1)
(d) We cannot say because we would need to know the number of observations.
12. What would typically be the shape of the news impact curve for a series that exactly
followed a GARCH(1,1) process?
(a) It would be asymmetric, with a steeper curve on the left than the right
(b) It would be asymmetric, with a steeper curve on the right than the left
(c) * It would be symmetric about zero
(d) It would be discontinuous about zero.
14. Which of the following would represent the most appropriate definition for implied
volatility?
(a) * It is the volatility of the underlying asset’s returns implied from the price of a traded
option and an option pricing model
(b) It is the volatility of the underlying asset’s returns implied from a statistical model
such as GARCH
(c) It is the volatility of an option price implied from a statistical model such as GARCH
(d) It is the volatility of an option price implied from the underlying asset volatility.
15. Suppose that a researcher wanted to obtain an estimate of realised (‘actual’) volatility.
Which one of the following is likely to be the most accurate measure of volatility of stock
returns for a particular day?
(a) The price range (high minus low) on that day
(b) The squared return on that day
(c) * The sum of the squares of hourly returns on that day
(d) The squared return on the previous day.
16. Which of the following is the most plausible test regression for determining whether a
series y contains ‘ARCH effects’?
a.
b. *
c.
d. .
17. Consider the following conditional variance equation for a GJR model.
ht = α0 + α1 +βht-1+γut-12It-1
where It-1 = 1 if ut-1 < 0
= 0 otherwise
For there to be evidence of a leverage effect, which ONE of the following conditions
must hold?
a. α0 positive and statistically significant
b. * γ positive and statistically significant
c. γ statistically significantly greater than α0
d. α1+β statistically significantly less than γ .
18. Consider the three approaches to conducting hypothesis tests under the maximum
likelihood framework. Which of the following statements are true?
i. The Wald test is based on estimation only under the null hypothesis
ii. The likelihood ratio test is based on estimation under both the null and the
alternative hypotheses
iii. The Lagrange multiplier test is based on estimation under the alternative
hypothesis only
19. Which one of the following problems in finance could not be usefully addressed by
either a univariate or a multivariate GARCH model?
a. Producing option prices
b. Producing dynamic hedge ratios
c. Producing time-varying beta estimates for a stock
d. * Producing forecasts of returns for use in trading models
e. Producing correlation forecasts for value at risk models.
Chapter 10
4. The unknown parameters of a Markov switching model are usually estimated using:
(a)* Maximum likelihood
(b) Instrumental variables
(c) Indirect least squares
(d) Ordinary least squares.
5. The key difference between threshold autoregressive and Markov switching models is
that:
(a) The latter can be estimated using ordinary least squares while the latter is estimated
using the indirect least squares estimation technique
(b) Under the latter, the state variable is assumed to be known and observable, while it is
latent under the former
(c)* Under the former, the state variable is assumed to be known and observable, while it
is latent under the latter
(d) None of the above.
(b) if
if
(c) if
if
(d) if
if .
8. Suppose that a researcher wishes to test for calendar (seasonal) effects using a dummy
variables approach. Which of the following regressions could be used to examine this?
i. A regression containing intercept dummies
ii. A regression containing slope dummies
iii. A regression containing intercept and slope dummies
iv. A regression containing a dummy variable taking the value 1 for one observation
and zero for all others.
a. * (ii) only
b. (i) and (ii) only
c. (i), (ii), and (iii) only
d. (i), (ii), (iii), and (iv).
10. Consider the following two equations in a state space model, where yt is the observed
series, and ut and ηt are noise terms.
11. The ratio of the variance of the error term ηt to the variance of the error term ut is used
as the basis of a test for:
(a) * Whether it is necessary to allow for time-varying parameters
(b) Whether the observed series, yt, is excessively noisy
(c) Whether the Markov property holds
(d) Whether the Kalman filter will be efficient.
12. Consider the following two equations in a state space model, where yt is the observed
series, and ut and ηt are noise terms.
Chapters 11-14
(a) I only
(b) I and II only
(c) I, II, and III only
(d)* I, II, III, and IV.
(a) I only
(b)* I and II only
(c) I, II, and III only
(d) I, II, III, and IV.
8. To test for unit roots in panel data, Levin, Lin and Chu (2002) develop a test based on
the equation . What is the appropriate null
hypothesis for this test?
(a) *
(b)
(c)
(d) .
9. Logit and probit models are more appropriate than linear probability models because:
(a) Logit and probit can estimate probabilities that are negative
(b) Logit and probit cannot estimate probabilities that are greater than one
(c) Logit and probit cannot estimate probabilities that are negative but not greater than
one
(d)* Logit and probit cannot estimate probabilities that are negative or greater than one.
10. Which of the following statements about logit and probit models is true?
(I) They cannot be estimated by ordinary least squares
(II) They can be estimated using maximum likelihood
(III) They can be estimated using non-linear least squares
(IV) They can be estimated using instrumental variables.
(a) I only
(b) I and II only
(c)* I, II, and III only
(d) I, II, III, and IV.
11. If the maximised value of the log-likelihood function for a logit model is 34.55 and
for a restricted model where all of the slope parameters are set to zero is 30.67, what is
the pseudo-R2?
(a) 0.13
(b)* –0.13
(c) 0.11
(d) –0.11.
12. Appropriate modelling of limited dependent variables that are assigned numerical
values having a natural ordering can be done using:
(I) Probit models
(II) Logit models
(III) Ordered probit models
(IV) Ordered logit models.
(a) I only
(b) I and II only
(c) II and III only
(d)* III and IV only.
13. In the context of an event study, a cumulative abnormal return:
(a) Is calculated by summing the individual returns over all the firms separately for each
day
(b) * Is calculated by summing the individual returns over time for each firm separately
(c) Takes the geometric product of the individual returns over all the firms separately for
each day
(d) Takes the geometric product of the individual returns over time for each firm
separately.
14. Which of the following statements is TRUE concerning the calendar time
methodology sometimes used in event studies?
(a) It will weight all the firms in the sample that underwent the event equally
(b) It can involve the calculation of a buy-and-hold abnormal return
(c) If the slope parameter in the test regression is positive and significant, this will
provide evidence of an abnormal return in the event study
(d) *It will give more weight in the sample to firms which underwent the event at a time
when few other firms did so.
15. The traditional Fama–MacBeth approach to tests of the CAPM involves:
(a) * A set of time-series regressions to estimate the betas for each stock and then a cross-
sectional regression to estimate the risk premium
(b) A set of cross-sectional regressions to estimate the betas and then a time-series
regression to estimate the risk premium
(c) A single cross-sectional regression to estimate the CAPM beta
(d) A time-series regression to estimate the betas for each stock and then another time-
series regression to estimate the risk premium.
16. In the Fama–MacBeth regressions, the parameter estimates in the second stage are
interpreted as:
(a) Factor loadings
(b) * Factor risk premia
(c) Average returns for each stock
(d) The volatilities of returns for each stock.
17. In the Fama–MacBeth regressions, the parameter estimates in the first stage are
interpreted as:
(a) * Factor loadings
(b) Factor risk premia
(c) Average returns for each stock
(d) The volatilities of returns for each stock.
18. In Fama–French (1993)- and Carhart (1994)-type models, for there to be no evidence
of outperformance by a fund manager, we would require:
(a) The intercept is positive but not statistically significant
(b) The intercept is not positive and significant and slope estimates are all insignificant
(c) * The intercept is not positive and significant
(d) The slope estimates are all insignificant.
19. If we use the block maximum approach to estimating the parameters of a member of
the extreme value family of distributions and we select a large number of short blocks,
which of the following is a likely disadvantage?
(a) * A number of data points would be classified as extreme when they are not, leading
to bias in the shape parameter estimate
(b) Too few data points would be classified as extreme, leading to excessive noise in the
shape parameter estimate
(c) Too few data points would be classified as extreme, leading to bias in the shape
parameter estimate
(d) A number of data points would be classified as extreme when they are not, leading to
excessive noise in the shape parameter estimate.
20. Which of the following distributions would be most appropriate for modelling the
central part of the distribution of a set of stock returns?
(a) Gumbel
(b) Fréchet
(c) Weibull
(d) * Normal.
If a researcher uses daily data to examine a particular problem and creates a variable that
assigns a numerical value of 1 to Monday observations, what term would best describe
this type of number?
Nominal
The price of a house is best described as what type of number?
Cardinal
Which of the following is NOT a feature of continuously compounded returns (i.e. log-
returns)?
They can be added across a portfolio of assets to give portfolio returns
Suppose that observations are available on the monthly bond prices of 100 companies for
5 years. What type of data are these?
Panel
The score associated with a credit rating is best described as a/an:
Ordinal number
In a cross-country study, a researcher codes the US as “1”, Europe as “2” and the rest of
the world as “3”. This is best described as a/an:
Nominal number
The yield to maturity on a bond is best described as a/an:
Cardinal number
A share price is quoted in units of pennies. Which term best describes this sort of data?
Discrete
Which of the following statements is FALSE concerning log-price relatives (log returns)?
They can be validly averaged cross-sectionally
Which of the following statements is TRUE concerning simple returns?
They can be validly averaged cross-sectionally
Suppose that the simple returns on a stock for each of four years are 10%, -6%, 13% and
-8%. The appropriately calculated aggregate return over the whole four-year period to the
nearest 1% is:
Return = [(1+0.1) (1+–0.06) (1+0.13) (1+–0.08)] -1 = 0.07495, which is 7%
Suppose that the log returns on a stock for each of four years are 10%, -6%, 13% and
-8%. The appropriately calculated aggregate return over the whole four-year period to the
nearest 1% is:
Return = 10 + –6 + 13 + –8 = 9%.
Suppose that we estimate a relationship between volatility, y in percent, and the number
of stocks in a portfolio, x given by y = 86.4 – 1.2x. If a portfolio contains twenty - five
stocks, what would be the volatility to the nearest 1%?
86.4 – (1.2 ´ 25) = 56.4
Given the scenario in question 1, How many stocks would be required to achieve a
volatility of 10%?
64
A straight line has a gradient of 1.4 and crosses the x-axis at -0.8. What is the value
of y when x = 3.2?
5.6
Writing out all the terms in the expression would lead to:
3
(i) 3x
(ii) x3
(iii) 27x3
(iv) 27x
18. The derivative of a function y with respect to x is:
(i) The gradient of the curve
(ii) The rate of change of y with respect to x
(iii) The area under the curve
(ii)
(iii)
(iv)
If A is of dimension 1 x 4 and B is of dimension 4 x 1, what is the most accurate term to
describe the matrix A?
A row vector
If A is of dimension 1 x 4 and B is of dimension 4 x 1, what is the most accurate term to
describe the result of the matrix multiplication BA?
A matrix
If A is of dimension 1 x 4 and B is of dimension 2 x 4, what will be the dimensions of the
matrix multiplication AB?
This matrix multiplication is undefined
(i)
(ii)
(iii)
(iv) Undefined
Consider a test of the null hypothesis that the true value of the slope coefficient is –1.
Using a 5% one-sided test, where the alternative is of the form H1: β < -1, what is the
appropriate conclusion?
(i) H0 is rejected
(ii) H0 is not rejected
(iii) H1 is rejected
(iv) There is insufficient information given in the question to reach a conclusion
17. Consider an identical situation to that of question 16, except that now a 2-sided
alternative is used. What would now be the appropriate conclusion?
(i) H0 is rejected
(ii) H0 is not rejected
(iii) H1 is rejected
(iv) There is insufficient information given in the question to reach a conclusion.
18. Which one of the following would be the most appropriate as a 95% (two-sided)
confidence interval for the intercept term of the model given in question 21?
(-5.46,2.86)
19. Which one of the following is the most appropriate definition of a 99% confidence
interval?
99% of the time in repeated samples, the interval would contain the true value of the
parameter
20. Which one of the following statements best describes a Type II error?
It is the probability of failing to reject a null hypothesis that was wrong
21. Suppose that a test statistic has associated with it a p-value of 0.08. Which one of the
following statements is true?
(i) If the size of the test were exactly 8%, we would be indifferent between rejecting and
not rejecting the null hypothesis
(ii) The null would be rejected if a 10% size of test were used
(iii) The null would not be rejected if a 1% size of test were used
(iv) The null would be rejected if a 5% size of test were used.
(i), (ii), and (iii) only
1. Which of the following is a typical characteristic of financial asset return time-series?
They have no trend
2. Which of the following is a DISADVANTAGE of using pure time-series models
(relative to structural models)?
They are not theoretically motivated
3. Which of the following conditions are necessary for a series to be classifiable as a
weakly stationary process?
(i) It must have a constant mean
(ii) It must have a constant variance
(iii) It must have constant autocovariances for given lags
(iv) It must have a constant probability distribution
0.33
13. The (unconditional) variance of the AR (1) process for y given in question 12 will be
1.19
14. The value of the autocovariance function at lag 3 for the AR (1) model given in
question 12 will be
0.076
15. The value of the autocorrelation function at lag 3 for the AR (1) model given in
question 12 will be
0.064
16. Which of the following statements are true concerning the autocorrelation function
(acf) and partial autocorrelation function (pacf)?
(i) The acf and pacf will always be identical at lag one whatever the model
(ii) The pacf for an MA(q) model will in general be non-zero beyond lag q
(iii) The pacf for an AR(p) model will be zero beyond lag p
(iv) The acf and pacf will be the same at lag two for an MA (1) model
(i), (ii), and (iii) only
17. An ARMA(p,q) (p, q are integers bigger than zero) model will have
An acf and pacf that both decline geometrically
18. The pacf is necessary for distinguishing between
An AR and an ARMA model
19. The characteristic roots of the MA process
yt = -3ut-1 + ut-2 + ut
are
1 and 0.5
20. Consider the following picture and suggest the model from the following list that best
characterises the process:
An AR (2)
21. Consider the following picture and suggest the model from the following list that best
characterises the process:
An MA (2)
22. Which of the following statements are true concerning the acf and pacf?
(i) The acf and pacf are often hard to interpret in practice
(ii) The acf and pacf can be difficult to calculate for some data sets
(iii) Information criteria represent an alternative approach to model order determination
(iv) If applied correctly, the acf and pacf will always deliver unique model selections
(i), (ii), and (iii) only
23. Which of the following statements are true concerning the Box-Jenkins approach to
diagnostic testing for ARMA models?
(i) The tests will show whether the identified model is either too large or too small
(ii) The tests involve checking the model residuals for autocorrelation, heteroscedasticity,
and non-normality
(iii) If the model suggested at the identification stage is appropriate, the acf and pacf for
the residuals should show no additional structure
(iv) If the model suggested at the identification stage is appropriate, the coefficients on
the additional variables under the overfitting approach will be statistically insignificant
(ii) and (iv) only
24. Which of the following statements are true concerning information criteria?
(i) Adjusted R-squared is an information criterion
(ii) If the residual sum of squares falls when an additional term is added, the value of the
information criterion will fall
(iii) Akaike’s information criterion always leads to model orders that are at least as large
as those of Schwarz’s information criterion
Which model do you think, given these results, is the most appropriate for the data?
The appropriate response to this set of diagnostic results would be to go back to the
identification stage and propose a larger model
26. Which of the following statements are true concerning the class of ARIMA(p,d,q)
models?
(i) The “I” stands for independent
(ii) An ARIMA(p,1,q) model estimated on a series of logs of prices is equivalent to an
ARIMA(p,0,q) model estimated on a set of continuously compounded returns
(iii) It is plausible for financial time series that the optimal value of d could be 2 or 3.
(iv) The estimation of ARIMA models is incompatible with the notion of cointegration
(ii) and (iv) only
27. Which of the following statements is true concerning forecasting in econometrics?
In-sample forecasting ability is a poor test of model adequacy
28. If a series, y, follows a random walk, what is the optimal one-step ahead forecast of
y?
The current value of y
29. If a series, y, follows a random walk with drift b, what is the optimal one-step ahead
forecast of the change in y?
The average value of the change in y over the in-sample period
30. An “ex ante” forecasting model is one which
Includes only previous values of variables on the RHS
31. Consider the following MA (2) model
yt = 0.3 + 0.5ut-1 - 0.4ut-2 + ut
What is the optimal two-step ahead forecast from this model, made at time t, if the values
of the residuals from the model at time t and t-1 were 0.6 and –0.1 respectively and the
values of the actual series y at time t-1 was –0.4?
0.24
32. What is the optimal three-step ahead forecast from the MA (2) model given in
question 31?
0.3
33. Which of the following statements are true concerning the estimation and forecasts of
an exponential smoothing model, St = a yt + (1-a) St-1?
(i) Using the standard notation, the larger the value of a, the less weight is attached to
more recent observations
(ii) If a = 0, there will be no updating as new observations become available
(iii) The one-step ahead forecast only from an exponential smoothing model will be the
most recently available smoothed value
(iv) If a = 1, the model is equivalent to a random walk for the series y
(ii) and (iv) only
34. Which one of the following statements is true concerning alternative forecast
accuracy measures?
Mean squared error penalizes large forecast errors disproportionately more than small
forecast errors
35. Which one of the following factors is likely to lead to a relatively high degree of out-
of-sample forecast accuracy?
A model that is based on financial theory
Which of the following are advantages of the use of panel data over pure cross-sectional
or pure time-series modelling?
(i) The use of panel data can increase the number of degrees of freedom and therefore the
power of tests
(ii) The use of panel data allows the average value of the dependent variable to vary
either cross-sectionally or over time or both
(iii) The use of panel data enables the researcher allows the estimated relationship
between the independent and dependent variables to vary either cross-sectionally or over
time or both
(i) and (ii) only
Consider the following equation and determine the class of model that it best represents:
In order to determine whether to use a fixed effects or random effects model, a researcher
conducts a Hausman test. Which of the following statements is false?
If the Hausman test is not satisfied, the random effects model is more appropriate
Suppose that we estimate a logit model based on an intercept and two explanatory
variables and the parameter estimates are respectively:
A 1-unit increase in x3 will cause an increase in the probability that the outcome
corresponding to y = 1 to:
Fall by 0.05
A dependent variable whose values are not observable outside a certain range but where
the corresponding values of the independent variables are still available would be most
accurately described as what kind of variable?
Censored
Which of the following statements will be true if the number of replications used in a
Monte Carlo study is small?
(i) The statistic of interest may be estimated imprecisely
(ii) The results may be affected by unrepresentative combinations of random draws
(iii) The standard errors on the estimated quantities may be unacceptably large
(iv) Variance reduction techniques can be used to reduce the standard errors
(i), (ii), (iii), and (iv)
(ii) If it is desired that the distributional properties of the data in the experiment are
known exactly
(iii) If the distributional properties of the actual data are unknown
(iv) If the sample of actual data available is very small
(i) and (iii) only
Suppose we have a panel of data with T = 250 and N = 2. What would be the most
appropriate procedure to test for unit roots?
Separate unit root tests on each individual time series
What is the appropriate critical value for a 2-sided 5% size of test of H0: β3 = 1?
2.06
Under the matrix notation for the classical linear regression model, y = Xβ + u, what are
the dimensions of u? T x 1
Suppose that a test that the true value of the intercept coefficient is zero results in non-
rejection. What would be the appropriate conclusion?
Retain the intercept
Suppose that 100 separate firms were tested to determine how many of them “beat the
market” using a Jensen-type regression, and it is found that 3 fund managers significantly
do so. Does this suggest prima facie evidence for stock market inefficiency? No
Consider the following regression equation estimated using 1,000 daily observations.
(1)
Which one of the following would be a possible restricted regression for a test of the null
hypothesis H0: β2 + β3 = 1?
(i) The restricted regression would be the one labelled as equation (1) above
(ii)
(iii)
(iv)
(iii)
Consider the following regression equation estimated using 1,000 daily observations.
(1)
Which of the following null hypotheses could be tested using an F-test?
(i) β2 = 1
(ii) β32 = 1
(iii) β4 = -β2
(iv) β3β4 = 0
(i) and (iii) only
Consider the following regression equation estimated using 1,000 daily observations.
(1)
Suppose that the test in question 9 were conducted, [Which one of the following would
be a possible restricted regression for a test of the null hypothesis H0: β2 + β3 = 1?] what
would be the relevant critical value from the statistical tables with which to compare the
test statistic? 3.84
Consider the following regression equation estimated using 1,000 daily observations.
(1)
Suppose that the test in question 9 were conducted, [Which one of the following would
be a possible restricted regression for a test of the null hypothesis H0: β2 + β3= 1?] and
the two required residual sums of squares are 30.2 and 28.1, what is the F-test statistic?
74.4
Consider the following regression equation estimated using 1,000 daily observations.
(1)
What would be the null hypothesis for the standard regression F-test for equation (1)
above?
(i) β2 = 0 and β3 = 0 and β4 = 0
(ii) β2 = 0 or β3 = 0 or β4 = 0
Which one of the following most accurately describes the process for yt?
A deterministic trend process
and the test statistic takes a value of +3.2. What is the appropriate conclusion?
(i) yt is stationary
(ii) yt contains exactly one unit root
(iii) yt contains at least one unit root
(iv) yt contains exactly two unit roots
10. Suppose that the following Dickey-Fuller test regression is conducted
and the value of the test statistic is –6.3. What is the appropriate conclusion?
(i) yt is stationary
(ii) yt contains exactly one unit root
(iii) yt contains at least one unit root
(iv) yt contains exactly two unit roots
11. If two variables, xt and yt are said to be cointegrated, which of the following
statements are true?
(i) xt and yt must both be stationary
(ii) Only one linear combination of xt and yt will be stationary
(iii) The cointegrating equation for xt and yt describes the short-run relationship between
the two series
(iv) The residuals of a regression of yt on xt must be stationary
(ii) and (iv) only
12. If the Engle-Granger test is applied to the residuals of a potentially cointegrating
regression, what would be the interpretation of the null hypothesis?
The variables are not cointegrated
13. Consider the following model for yt:
0 and 8
18. What is the rank of the pi matrix given in question 17?
1
19. An appropriate way to describe the pi matrix in question 17 would be to say that it is
A singular matrix
20. Consider a system containing 4 variables, and where the Johansen test has been
applied with the following results:
0 29.65 30.26
1 20.91 23.84
2 10.67 17.72
3 8.55 10.71
0
21. If a Johansen “trace” test for a null hypothesis of 2 cointegrating vectors is applied to
a system containing 4 variables is conducted, which eigenvalues would be used in the
test?
The smallest 2
22. What problems may arise if the Perron (1989) procedure that allows for a known
structural break is used when testing for a unit root?
(i) The actual break date may not be known in advance and the procedure does not
incorporate an approach to determine the break date
(ii) If the break date is determined by examining the data, then the critical values Perron
derived will no longer be appropriate
(iii) The test procedure can only allow for a break in the level of the series and not in the
trend growth rate
(i) and (ii) only
1. Which of the following features of financial asset return time-series could be captured
using a standard GARCH (1,1) model?
(i) Fat tails in the return distribution
(ii) Leverage effects
(iii) Volatility clustering
(iv) Volatility affecting returns
(i) and (iii) only
2. If the standard tools for time-series analysis, such as estimation of the acf, pacf and
spectral analysis, find no evidence of structure in the data, this implies that the data are
which of the following?
Uncorrelated
3. Consider the following ARCH (3) model for a time-series.
Which of the following parameter values would render the model nonsensical?
9. Consider the estimation of a GARCH-M model. If the data employed were a time-
series of daily corporate bond percentage returns, which of the following would you
expect the value of the GARCH-in-mean parameter estimate to be?
Between 0 and 1
10. Suppose that we have estimated a GARCH model for daily equity returns, and we are
interested in producing a 10-day forecast of the volatility (measured by the standard
deviation of returns) for use in a value at risk model. How could such a forecast most
validly be calculated?
Produce 1, 2, 3, …, 10 step ahead conditional variance forecasts and add them up and
take the square root
11. Suppose that we are interested in testing the null hypothesis that a GARCH (2,2)
model can be restricted to a process with a constant conditional variance using the
likelihood ratio test approach. Which of the following statements are true?
If the relevant values of the log-likelihood functions are –112.3 and –118.4, the value of
the test statistic is 12.2
12. What is the most appropriate value of a forecast (to 2 decimal places) of the
conditional variance for time t+2 if the conditional variance and residual at time t are 0.02
and –0.465 respectively and the model is as follows?
0.04
13. Suppose that you were asked to provide a guess at a 20-step ahead forecast for the
model in question 12. What would the most appropriate guess be (to 2 decimal places)?
0.03
14. Which one of the following equations is the form that is usually used for the
conditional covariance equation in a “diagonal VECH” model (using the standard
notation)?
(i)
15. What is the most important disadvantage of the diagonal VECH approach to building
multivariate GARCH models that is overcome by the BEKK formulation?
The diagonal VECH model does not ensure a positive-definite variance-covariance
matrix
16. If a variable or set of variables was described as chaotic, what would this mean?
There is a perfectly deterministic non-linear relationship describing the series or set of
series
17. Which of the following are true of artificial neural network models?
(i) Provided that there are a sufficient number of hidden layers, they can approximate
functions to any desired degree of precision
Câu Hỏi 15
If the residuals of a regression on a large sample are found to be heteroscedastic which of
the following might be a likely consequence? (1) The coefficient estimates are biased, (ii)
The standard error estimates for the slope coefficients may be too small, (iii) Statistical
inferences may be wrong
b. (ii) and (ii) only
Câu Hỏi 16
Which of these is a test for heteroscedasticity?
d. White test
Câu Hỏi 17
Entity fixed effects models
c. Allow the intercept in the regression model to differ cross-sectionally but not over
time, while all of the slope estimates are fixed
both cross-sectionally and over time
Câu Hỏi 18
The method of estimating econometric models which involves fitting a line to the data by
minimising the sum of squared residuals is the
b. Method of ordinary least squares
Câu Hỏi 19
Which one of the following is NOT an assumption of the classical linear regression
model?
d. The dependent variable is not correlated with the disturbance terms
20. To deal with endogeneity, which regression model can be applied?
d. instrumental variable regression
Câu Hỏi 21
If a regression equation contains an irrelevant variable, the parameter estimates will be
a. Consistent and asymptotically efficient but biased
Câu Hỏi 22
To select between fixed effect and random effects models, which test should be used?
C. Hausman test
Câu Hỏi 23
What is the most appropriate interpretation of the assumption cov(ui, uj)=0 concerning
the regression disturbance terms?
The errors are linearly independent of one another
Câu Hỏi 24
Standard errors
c. Give us an idea of the precision of estimates of alpha and beta
Câu Hỏi 25
Which of these is not a viable 'solution' for heteroscedasticity?
c. Taking the first differences of the series
Câu Hỏi 1
Suppose you have calculated the following regression results: y = 1.25 +0.64x. The
standard errors of alpha and beta are 1.22 and 0.58, respectively. Using the test of
significance approach, what is the test statistic value of a hypothesis to test whether the
true value of beta statistically different from zero?
b. 1.10
Câu Hỏi 2
Which of the following conditions must be fulfilled for the Durbin Watson test to be
valid? (i) The regression includes a constant term, (ii) The regressors are non-stochastic,
(iii) There are no lags of the dependent variable in the regression, (iv) There are no lags
of the independent variables in the regression
b. (i), (ii) and (iii) only
Câu Hỏi 3
Which of the following would probably NOT be a potential "cure" for non-normal
residuals?
b. Transforming two explanatory variables into a ratio
Câu Hỏi 4
Which one of the following is the most appropriate as a definition of R2 in the context
that the term is usually used?
d. It is the proportion of the total variability of y about its mean value that is explained by
the model
Câu Hỏi 5
Which of the following is a correct interpretation of a "95% confidence interval" for a
regression parameter?
d. We are 95% sure that the interval contains the true value of the parameter
Câu Hỏi 6
To test the validity of instruments in the 2SLS model, which test can be used?
d. Sargan/Hansen test test of overidentifying)
Chapter 1: Economic Questions and Data
KTL_001_C1_1: Analyzing the behavior of unemployment rates across U.S. states in
March
of 2010 is an example of using
O time series data.
O panel data.
cross-sectional data.
O experimental data.
KTL_001_C1_2: Studying inflation in the United States from 1970 to 2010 is an
example of
using
O randomized controlled experiments.
time series data.
O panel data.
O cross-sectional data.
is an example of
○ experimental data.
○ cross-sectional data.
a time series.
○ longitudinal data.
KTL_001_C1_8: The reason why economists do not use experimental data more
frequently is for all of the following reasons except that real-world experiments
○ with humans are difficult to administer.
○ are often unethical.
● cannot be executed in economics.
○ have flaws relative to ideal randomized controlled experiments.
KTL_001_C1_10: In the graph below, the vertical axis represents average real GDP
growth for 65 countries over the period 1960-1995, and the horizontal axis shows the
average trade share within these countries.
This is an is an example of
○ experimental data.
● cross-sectional data.
○ a time series.
○ longitudinal data.
○ is given by corr ( X , Y ) = c o v ( X , Y ) v a r ( X ) v a r ( Y )
KTL_001_C2_4: Assume that Y is normally distributed N ( μ , σ 2 ) N(μ,σ2). Moving
from the mean ( μ μ) 1.96 standard deviations to the left and 1.96 standard deviations to
the right, then the area under the normal p.d.f is
○ 0.67
○ 0.05
● 0.95
○ 0.33
KTL_001_C2_5: The variance of Y, σY2is given by the following formula:
KTL_001_C2_8: Assume that you assign the following subjective probabilities for your
final grade in your econometrics course (the standard GPA scale of 4 = A to 0 = F
applies):
KTL_001_C2_9: The mean and variance of a Bernoulli random variable are given as
○ cannot be calculated
○ np and np(1-p)
○ p and p(1-p)
● p and (1-p)
○
Chapter 3: Review of Statistics
KTL_001_C3_1: An estimator ^ μ Y μ^Y of the population value μ Y μY is consistent if
○ ^ μ Y → μ Y μ^Y→μY.
○ its mean square error is the smallest possible.
○ Y is normally distributed.
● ¯ Y → 0 Y¯→0.
KTL_001_C3_2: A type II error is
○ typically smaller than the type I error.
KTL_001_C3_7: When testing for differences of means, you can base statistical
inference on the
● Student t distribution in general
○ Normal distribution regardless of sample size
○ Student t distribution if the underlying population distribution of Y is normal, the two
groups have the same variances, and you use the pooled standard error formula
○ Chi-squared distribution with (nm+nw - 2) degrees of freedom
KTL_001_C3_8: Assume that you have 125 observations on the height (H) and weight
(W) of your peers in college. Let SE(HW) = 68, SE(H) = 3.5, SE(W) = 68. The sample
correlation coefficient is
○ 1.22
● 0.50
○ 0.67
○ Cannot be computed since males and females have not been separated out.
KTL_001_C3_9: You have collected data on the average weekly amount of studying
time (T) and grades (G) from the peers at your college. Changing the measurement from
minutes into hours has the following effect on the correlation coefficient:
○ decreases the r(TG) by dividing the original correlation coefficient by 60
○ results in a higher r(TG)
● cannot be computed since some students study less than an hour per week
○ does not change the r(TG)
KTL_001_C3_10: A low correlation coefficient implies that
○ the line always has a flat slope
○ in the scatterplot, the points fall quite far away from the line
● the two variables are unrelated
○ you should use a tighter scale of the vertical and horizontal axis to bring the
observations closer to the line