Professional Documents
Culture Documents
10.4324 9781351140409 Previewpdf
10.4324 9781351140409 Previewpdf
ECONOMETRICS
Volume 16
UNDERSTANDING
ECONOMETRICS
UNDERSTANDING
ECONOMETRICS
JON STEWART
First published in 1976 by Hutchinson
Second edition published 1984 by Unwin Hyman
This edition first published in 2018
by Routledge
2 Park Square, Milton Park, Abingdon, Oxon OX14 4RN
and by Routledge
711 Third Avenue, New York, NY 10017
Routledge is an imprint of the Taylor & Francis Group, an informa
business
© 1976, 1984 Jon Stewart
All rights reserved. No part of this book may be reprinted or
reproduced or utilised in any form or by any electronic, mechanical,
or other means, now known or hereafter invented, including
photocopying and recording, or in any information storage or
retrieval system, without permission in writing from the publishers.
Trademark notice: Product or corporate names may be trademarks or
registered trademarks, and are used only for identification and
explanation without intent to infringe.
British Library Cataloguing in Publication Data
A catalogue record for this book is available from the British Library
Publisher’s Note
The publisher has gone to great lengths to ensure the quality of this
reprint but points out that some imperfections in the original copies
may be apparent.
Disclaimer
The publisher has made every effort to trace copyright holders and
would welcome correspondence from those they have been unable to
trace.
Understanding
Econometrics
Jon Stewart
Senior Lecturer in Econometrics,
University of Manchester
London
UNWIN HYMAN
Boston Sydney Wellington
© Jon Stewart 1976, 1984
Stewart, Jon
Understanding econometrics- 2nd ed
1. Econometrics
I. Title
330'.028 HB139
ISBN 0 04 445663 8
Printed in Great Britain by Billing and Sons Ltd, London and Worcester
Contents
Introduction 11
1.1 The nature of econometrics 11
1.2 Economic models 11
1.3 A simple model 12
1.4 A model for a game of chance 17
1.5 Random disturbances 23
1.6 Sample statistics 29
1.7 Exercises 31
2 The two variable linear model 33
2.1 The least squares principle 33
2.2 The correlation coefficient 38
2.3 Least squares estimators: expectation 42
2.4 Least squares estimators: variance 46
2.5 Least squares estimators: distribution 50
2.6 Confidence intervals: normal distribution 53
2.7 Confidence intervals: t distribution 56
2.8 Hypothesis testing 61
2.9 Prediction 66
2.10 Asymptotic properties 71
2.11 Exercises 78
3 The linear model with further explanatory variables 80
3.1 Introduction 80
3.2 Solution of the normal equations 83
3.3 An interpretation of multiple regression 88
3.4 Multiple correlation 95
3.5 Nonlinear relationships 101
3.6 Properties of the estimators 108
3.7 Tests involving several parameters 113
6 Understanding Econometrics
3.8 The use of restrictions 119
3.9 Specification error 125
3.10 Multicollinearity 129
3.11 Dummy variables 138
3.12 Exercises 146
Interlude 149
4 Alternative disturbance specifications 151
4.1 Introduction 151
4.2 Heteroscedasticity 153
4.3 Testing for heteroscedasticity 163
4.4 Serial correlation 169
4.5 Testing for serial correlation 178
4.6 Two step and iterative methods 183
4.7 Some further disturbance problems 188
4.8 Exercises 194
5 Distributed lags and dynamic economic models 196
5.1 Introduction 196
5.2 Distributed lags 198
5.3 Estimation in dynamic models I 209
5.4 Estimation in dynamic models II 213
5.5 Alternative dynamic hypotheses 222
5.6 Exercises 227
6 Simultaneous equation models 229
6.1 Introduction 229
6.2 Identification 233
6.3 The reduced form 238
6.4 Estimation: single equation methods 241
6.5 Estimation: complete system methods 252
6.6 Simultaneous dynamic models 257
6.7 Forecasting and policy simulation 259
6.8 Finale 263
6.9 Exercises 264
I have written this book in the belief that it is possible to present the
methods of econometrics to a relatively wide audience. The topics
covered are those that one would expect to fmd in a comprehensive
introductory course, but very little mathematics is assumed and a prior
knowledge of statistical method is not essential.
The key features of the presentation are as follows. First, it is
necessary to explain what an economic model is and why such models
are used. It is then necessary to explain why random disturbances are
introduced and to show how this is achieved. In the course of this
discussion a statistical survival kit is provided, containing those com-
ponents which are essential to an understanding of the nature of
econometric methods. The various ideas can then be applied in the
context of the two variable regression model, and the extension to
multiple regression is achieved by interpreting a multiple regression
coefficient as a coefficient from a simple regression on adjusted data.
This interpretation allows a full discussion of the use of the single
equation linear model, and topics such as the use of restrictions, specifi-
cation error and multicollinearity are included. The later chapters cover
disturbance problems, the use of lags, dynamic models and simultaneous
equation methods.
It is explicitly recognized that the application of econometric
techniques does require the use of a computer and, where possible, the
calculations necessary under various extensions of the basic linear model
are expressed in a form which is suitable for the application of a
program designed for ordinary least squares estimation. Details which
are more appropriate to hand calculation are not given and matrix
algebra is not required. The emphasis is on understanding why parti-
cular techniques are used, and this is always very carefully explained.
To all those who have helped with the production of this book, I
extend my thanks. I received valuable comments from a number of
people, but I would like to make special mention of the contributions
made by Bill Farebrother, Ray O'Brien, Stuart Moore, George
8 Understanding Econometrics
Hadjimatheou, Michael Parkin and Martin Timbrell. Nina Roach made a
really excellent job of the typing and Anne Bennett gave invaluable
help in the preparation of diagrams. My wife and children had to
endure both my presence and my absence during long days of writing:
to Christine and the boys, my thanks once more.
Jon Stewart
August 1975
Preface to the second edition
In this equation, C and D are variables which can take different values
at different points of observation of the economic system. The numbers
50 and 0·8 are constants. To flx ideas, suppose that consumption and
income flows were observed for each of two time periods. Then,
whereas consumption and income would generally take distinct values
in each period, the existence of a fixed relationship would imply that
Introduction 13
the numbers 50 and 0·8 remain the same. Although the values of the
variables change, the relationship between the variables does not.
The graph corresponding to equation 1.3.1. is shown in Figure 1.
c
C=50+0.8D
o~------~~~----~,oo~------~,5~o------~2~oo~----~2~5o~o~
Figure 1
It can be seen from the graph that the equation represents a straight
line. If consumer behaviour in the (hypothetical) economy is described
by this equation, each pair of consumption and income values repre-
sents a single point which would lie on the straight line. A more general
representation for a straight line or linear relationship between C and
Dis:
C =ex+ {3D (1.3.2)
where ex and {3 are described as parameters of the relationship. In our
hypothetical economy, ex = 50 and {3 = 0·8. In a second case there might
again be a linear relationship, but with different parameter values.
14 Understanding Econometrics
Parameter a: is called the intercept and parameter {3 the slope. The
intercept represents the value of consumption which, according to the
equation, would hold if income were zero. The slope represents the
change in consumption resulting from a unit change in income. If this is
not obvious, use equation 1.3 .I with income levels of 0 and I:
C= 50+ 0·8(0) =50
C= 50+ 0·8(1) = 50·8
c c
400 400
C>-20+ 1Oy'D C>-20+ 1Oy'D
300 300
Figure 2
c
••
70
•
•
65
• •••
•
60
•
• 65 70 75 80 D
Units: £thousand million
Figure 3
The probabilities associated with the six values shown are exactly the
same as those associated with the original outcomes: although
[ V- .E(V)] 2 is a distinct random variable, the probabilities can be
deduced from the distribution of the original random variable V. Since
we now know the probabilities for the random variable (V- E(V)jl,
we can compute the variance of Vby computing the expected value of
((V-E(V)]l:
Technical note 1
When V1 and V2 are such that cov ( V1 , V2 ) = 0, the random variables
V1 and V2 are said to be uncorrelated. Although independent random
variables are always uncorrelated, uncorrelated random variables are
not always independent. Thus independence implies zero covariance,
but zerQ covariance does not necessarily imply independence.
Introduction 23
We now have a model which is similar, in many respects, to that
which will be used for the disturbance term in the consumption func-
tion. There is, however, a small problem of interpretation. As we shall
see from the discussion in the next section, the data taken from the
real economic system consist of a set of observations on consumption
and income, usually relating to different periods of time. The natural
representation for n observations on consumption is e 1 , e2 , ••• , e11 •
But we shall also find that, according to the model, each observation
on consumption is just one of the values that could have been assumed
by a corresponding random variable. The crucial distinction here is
between the use of the terms variable and random variable, and a
possible confusion arises from the use of the term variable in the dis-
cipline to which the probability model is applied. Thus consumption
is a single economic variable, but one can have a model which implies
that each observation on consumption corresponds to a different
random variable. To avoid a completely unworkable system of nota-
tion, the symbols e1 , e2 , . • . , G1 are used to represent a set of n
observations on consumption, but they are also taken to represent a
set of random variables,just as V1 , V2 , ••• , V11 is a set of random
variables in the model for throwing a die. Having made this point, no
confusion need arise, and it should be obvious from the context what
is intended. If the symbols e1 , e2 , ..• , e11 appear in a formula giving
instructions for a calculation to be performed on observed data then,
when the calculation is carried out, the symbols would be replaced by
the values actually observed. But, in discussing the theoretical impli-
cations of the model, we may also take e1 , e2 , .•• , e11 to represent a
set of random variables.
Figure 4
26 Understanding Econometrics
In Figure 4, the horizontal axis represents the possible values of u.
Above the horizontal axis is a curve, which captures the idea that
values in an interval under the centre of the curve are more likely than
values in an equal interval well away from the centre. The height of the
curve is certainly connected with probability, but it does not give the
probability of the corresponding point value on the horizontal axis. The
curve is actually the graph of the pdf, which is denoted as f(u ). The
probability of finding a value between any two points on the horizontal
axis is given by the area contained between the curve, the horizontal
axis and vertical lines drawn through the two points in question. This
is illustrated in Figure 4: the shaded area corresponds to the probability
of finding a value of u between the points u =a and u =b. The reader
with some knowledge of calculus will recognize that this area corres-
ponds to the definite integral of f(u), evaluated between u =a and
u =b. Fortunately, for the most frequently used distributions, there
are tables which enable one to obtain probabilities without having to
evaluate the integrals. The use of such tables is discussed further in
Sections 2.6-2.8, 3.7, 4.3 and 4.5.
A normal distribution is not completely defined until one specifies
values for the expectation and variance. In this context, the expectation
and variance are said to be parameters of the distribution. As in the
case of a discrete random variable, the expectation is a theoretical
average over the values that could occur and the variance is a measure
of spread. Calculation of these values from a known distribution now
requires integration rather than summation, but we need not pursue
this point. What is important is that the interpretation is the same as
that in the discrete case. A further property of the normal distribution
is that the expectation falls under the highest point on the graph of the
pdf, so the distribution shown in Figure 4 has an expectation equal
to zero. Remember that an exactly equivalent statement is that the
Technical note 2
which means 'add all the values of the variable, in this case X, starting
with X 1 and finishing with Xn'· Often it is obvious from the context
what the range of summation should be and, if so, one of the shortened
forms ~tXt or ~Xt is used instead. It is convenient to be able to carry
30 Understanding Econometrics
out algebraic manipulation of expressions which involve the summation
notation, and the following rules are useful in this respect:
If c is a constant,
t=ll
L:C=nc
t=l
Note, however, that it is not true that ~(XrYr) is the same as ~X,~Y,.
Each of these statements can easily be verified by inventing a few
numerical examples (see exercise 1.3).
Given the summation notation, the formula for the sample mean
of a set of observations on X is
(1.6.1)
This represents a simple average of the observations, and X is a conven-
tional notation for the sample mean of the variable written underneath
the bar. The sample variance is written as s} and defmed as
s} = ~(X, - X} 2/n ( 1.6.2)
The reader with some knowledge of elementary statistics may wonder
why the sample variance is not defined with a divisor of n- 1 rather
than n. The answer is that if the sample variance is used in a purely
descriptive way, there is no reason to do anything other than average
the squared deviations. For reasons best explained later, there is a
purpose in using n- 1 when the sample variance is used as an estimate
of an unknown variance, in the context of a certain type of probability
model. It is also worth noting that, for descriptive purposes, the sample
standard deviation may be a more effective measure of spread than the
sample variance. The sample standard deviation is the square root of the
sample variance:
Introduction 31
(1.6.3)
The advantage of this quantity is that the units of measurement are the
same as those for the original sample.
Although there are obvious analogies between the measures relating
to a probability distribution and those relating to a sample, there are
also some crucial differences. The expectation and the variance are
theoretical concepts and they can only be evaluated by making some
assumption about underlying probabilities. The sample mean and
sample variance can be calculated directly from a given set of observa-
tions. A more subtle difference is that the expectation and variance
of a random variable are not themselves random, but the formulae for
the sample mean and sample variance could each defme a random
variable. According to our model, this would be true of the measures
for a sample of consumption values, because observations on consump-
tion are considered to be values taken by a set of random variables. But
it would not be true of the corresponding measures for income, because
the observations on income are taken to be fixed nonrandom quantities.
In this case, the sample measures would be purely descriptive.
The discussion in this chapter has covered a number of important
ideas, and it is useful to present a brief summary. It has been argued
that an exact relationship between economic variables will not generally
correspond to what is observed in the real system, and so disturbances
are added to the model. Since the model is designed to explain the
behaviour of the dependent variable, a necessary preliminary is to have
some 'explanation' of how the disturbance values are generated. This is
achieved by constructing a probability model for the disturbances. To
follow up the implications of the probability model, it is necessary to
have some basic ideas of statistical method. To the reader who has not
previously encountered statistical arguments, the range of ideas covered
in this chapter might be somewhat daunting. However, we have now
covered some essential elements of a statistical survival kit, which
should be quite sufficient to enable the reader to understand the argu-
ments which follow.
1.3 The variable X takes the values 1, 2 and 3, the variable Y takes the
values 2, 4 and 6, and cis a constant equal to 10. Use these
examples to demonstrate the validity of the rules of summation
given in Section 1.6.
Year Ct Dt
1970 I 57·81 63·74
1971 2 59-72 64·54
1972 3 63-27 70·21
1973 4 66·33 75·06
1974 5 65-05 74·05
1975 6 64·65 74·00
1976 7 64·71 73·44
1977 8 64·52 72-29
1978 9 68·23 78·26
1979 10 71-60 83-67
1980 II 71-55 84-77
Source: Economic Trends. annual
supplement, 1983 edition
References
Christ, C. F., Econometric Models and Methods, Wiley, 1966
Harvey, A. C., Econometric Analysis of Time Series, Philip Allan, 1981
Johnston, J., Econometric Methods, 2nd ed., McGraw-Hill, 1972
Judge, G. G. et al., The Theory and Practice of Econometrics, Wiley, 1980
Maddala, G. S., Econometrics, McGraw-Hill, 1977
Pindyck, R. S., and Rubinfeld, D. L., Econometric Models and Economic
Forecasts, 2nd ed., McGraw-Hill, 1981
Hey, J. D., Statistics in Economics, Martin Robertson, 1974 or, at a more formal
level,
Mendenhall, W., Scheaffer, R. L., and Wackerly, D. D., Mathematical Statistics with
Applications, 2nd ed., Wadsworth, 1981
Chiang, A. C., Fundamental Methods of Mathematical Economics, 2nd ed.,
McGraw-Hill, 1974
Arestis, P., and Hadjimatheou, G., Introducing Macroeconomic Modelling: An
Econometric Study of the United Kingdom, Macmillan, 1982