L Sample Inference

Tests for General S
Estimation of S
Tests under conditional homoskedasticity
The Delta Method and Non-Linear Hypothesis
Large-Sample Based Inference in the Linear Model

Walter Sosa-Escudero
Econ 507. Econometric Analysis. Spring 2009
February 26, 2009

Tests for General S
Estimation of S
Motivation
In the classical linear model with finite n, we had to introduce

distributional assumptions to perform basic hypothesis tests
and confidence intervals.
A really important advantage of the large sample theory is to
provide a framework for inference, without distributional
assumptions.
We will derive simple test statistics to evaluate linear
hypothesis in the linear model.
We will discuss a simple strategy to handle the non-linear case.

Tests for General S
Estimation of S
Our starting point will be the asympototic normality result:

p

n n 0 N (0, 1 S1 )
x
where x = E(xi x0i ) and S = V (xi ui ).

Note that n1 Xi0 Xi is a consistent estimator for x .
Recall that our assumptions for asymptotic analysis allow for
conditional heteroskedasticity. Alternative consistent
estimators for S depend on what we are willing to assume on
this.
1
We will use the notation AV (n ) 1
x Sx and
d (n ) =
1 S
1 .
AV
x
x

Tests for General S
Estimation of S
Road Map
Road Map
1
Derive general test statistics assuming there is available a

p
consistent estimator S S
Derive consistent estimators for S with and without

conditional heteroskedasticity.
Obtain particular cases of the previous tests for the

conditional homoskedastic case.

Tests for General S
Estimation of S
Test for Linear Hypothesis under general S
Hypothesis about single coefficients:

H0 : j = j0 vs. HA : j 6= j0
From the asymptotic normality, using Slutzkys theorem, it easy to
check that when H0 : j = j0 holds:

n j j0 d
tj = q
N (0, 1)
d (j )
AV

Tests for General S
Estimation of S
Multiple Linear Hypothesis

H0 : R r = 0, R is a q K matrix with (R) = q, and r <q
Result: under all the assumptions for asymptotic analysis, and
p
S S, when H0 : R r = 0 holds:
h
i1
d
d (n ) R0
W = n (Rn r)0 R AV
(Rn r) 2 (q)

Tests for General S
Estimation of S
Proof: by asympototic normality, under H0

d
n(Rn r) N 0, R AV(n ) R0
or
h
i1/2
d
n(Rn r) N (0, Ik )
R AV(n ) R0
By Slutzkys theorem and linearity:

h
i1/2
d
c n ) R0
R AV(
n(Rn r) N (0, Ik )
The desired result follows by forming the quadratic form.

Tests for General S
Estimation of S
Estimation of S
A general estimator
S = E(xi ui ui x0i ) = E(u2i xi x0i )
We will need an additional assumption

Assumption 6 (fourth moments): E (xik xij )2 exists and is finite
for all k, j = 1, 2, . . . , K.
Result
1X 2 2 p
Sw
e i xi xi S
n
i=1
where ei are the OLS residuals.


Tests for General S
Estimation of S
Proof:
ei = yi x0i = yi x0i x0i ( ) = ui x0i ( )
e2i = u2i 2ui x0i ( ) + ( )0 xi x0i ( )
Replacing
n
1X 2 0
e i xi xi =
n
i=1
i=1
i=1
1X 2 0 2X
ui xi xi
ui x0i ( )xi x0i +
n
n
n
1X
( )0 xi x0i ( )xi x0i
+
n
i=1
First note that
1X 2 0 p
ui xi xi E(u2i xi x0i )
n
i=1
by Kolmogorovs LLN, since we assumed finite second moments

(expectaction exists) and iid.

Tests for General S
Estimation of S
We will show the other two terms converge to zero

I) A =
2
n
Pn
i=1 ui
x0i ( )xi x0i
"K
#
n
X
2X
ui
xik (k k ) xi x0i
A =
n
i=1
k=1
Pn
K
0
X
u
x
x
x
i
i
ik
i
i=1
= 2
(k k )
n
k=1
Note k k 0, by consistency. So if we can show

we are done.
h i
< ,

Tests for General S
Estimation of S
1 Pn
n
i=1 ui xik

xi x0i is a K K matrix with typical (h, j) element:
Pn
i=1 ui xik xih xij
n
By the Cauchy-Schwartz inequality:

1/2
1/2
E|xik xih xij ui | E |xik xih |2
E |xij ui |2
Both factors in the RHS are < , by our fourth moments
assumption 5 and by assumption 3. Hence, we can use the LLN:
n
1X
p
ui xik xi x0i E(ui xik xi x0i ) < ,
n
i=1
so by the product rule and continuity, A 0.


Tests for General S
Estimation of S
II) B =
1
n
Pn
)0 xi x0 ( ) xi x0
i
i
i=1 (
Using the same trick as before:

#" K
"K
#
n
X
1X X
xik (k k )
xik0 (k0 k0 ) xi x0i
B=
n
0
i=1
k=1
k =1
Now we have a sum of K 2 matrices. The (h, j) element of the

k, k 0 summand will be
n
1X
(k k )(k0 k0 )
xik xik0 xih xij
n
i=1
Using again the Cauchy Schwartz inequality and the finite fourth
moments assumption: E|xik xik0 xih xij | <
p
And again, by consistency and LLN, B 0. q.e.d.


Tests for General S
Estimation of S
Then, using Sw as an estimator for S and noting

n
1X 2 2
1
Sw =
ei xi xi = (X 0 BX)
n
n
i=1
with B diag(e21 , . . . , e2n ),

c w (n ) =
1
1
AV
x Sw x
= n(X 0 X)1 n1 (X 0 BX)n(X 0 X)1
= n(X 0 X)1 (X 0 BX)(X 0 X)1
This is Whites heteroskedasticity consistent estimator for the
asymptotic variance of n . Remember that in the derivation of all
result we never ruled out the possibility of conditional
heteroskedasticity, then its consistency does not depend on it.

Tests for General S
Estimation of S
Returns-to-scale revisited:
. reg ltc lq lpl lpf lpk
-----------------------------------------------------------------------------ltc |
Coef.
Std. Err.
t
P>|t|
[95% Conf. Interval]
-------------+---------------------------------------------------------------lq |
.7209135
.0174337
41.35
0.000
.6864462
.7553808
lpl |
.4559645
.299802
1.52
0.131
-.1367602
1.048689
lpf |
.4258137
.1003218
4.24
0.000
.2274721
.6241554
lpk | -.2151476
.3398295
-0.63
0.528
-.8870089
.4567136
_cons | -3.566513
1.779383
-2.00
0.047
-7.084448
-.0485779
-----------------------------------------------------------------------------. reg ltc lq lpl lpf lpk, robust
-----------------------------------------------------------------------------|
Robust
ltc |
Coef.
Std. Err.
t
P>|t|
-------------+---------------------------------------------------------------lq |
.7209135
.0325376
22.16
0.000
.656585
.785242
lpl |
.4559645
.260326
1.75
0.082
-.0587139
.9706429
lpf |
.4258137
.0740741
5.75
0.000
.2793653
.5722622
lpk | -.2151476
.3233711
-0.67
0.507
-.8544698
.4241745
_cons | -3.566513
1.718304
-2.08
0.040
-6.963693
-.1693331
------------------------------------------------------------------------------

Tests for General S
Estimation of S
Variance estimation under conditional homoskedasticity

If we further assume E(u2i |xi ) = 02 , then using LIE:
S = E(u2i xi x0i ) = 02 E(xi x0i ) = 02 x
So the asymptotic variance of n reduces to
1
2 1
1
2 1
1
x Sx = 0 x x x = x
So a consistent estimator for AV(n ) can be

c h = n s2 (X 0 X)1
AV
which is n times the finite sample estimator for the variance of n
in the classical linear model.

Tests for General S
Estimation of S

Under conditional homoskedasticity AV(n ) = 2 1
x and
2
0
1
c
AVh = n s (X X) provides a consistent estimator:
a) Single linear hypothesis: Our test for H0 : j = j0

n j j0
tj = q
d (j )
AV
reduces to:
j j0
tj = p
s2 ajj
where ajj is the jth element of the main diagonal of X 0 X, so we

get our old t test. Note that we do not get the t distribution in
finite samples, but instead we get the N (0, 1) distribution
asymptotically.

Tests for General S
Estimation of S
Multiple linear hypothesis

Our W statistic
h
i1
d (n ) R0
W = n (Rn r)0 R AV
(Rn r)
simplifies to:
i1
h
d (n ) R0
(Rn r)
= n (Rn r)0 R AV

1
= (Rn r)0 R s2 (X 0 X)1 R0
(Rn r)
= qF
This provides an approximation of our old F statistic for the

asymptotic case.

Tests for General S
Estimation of S
Summary of results
We derived general test for single coefficient (t) and multiple

(W ) hypothesis, in our asympotic framework that does not
assume normality and allows for conditional heteroskedasticity.
c w estimator provides a general consistent strategy
Whites AV
for estimating the AVAR of n , robust to the presence of
conditional heteroskedasticity.
Under the conditional homoskedasticity assumption, our the
general t test simplifies to the old one, but this is an
asympotic result.
W simplifies to nF .

Tests for General S
Estimation of S
The Delta Method

Suppose we want to perform inference about a non-linear function
of , say a().
Example 1
yi = 0 + 1 south + zi0 2 + ui
yi is log-wages, south is a dummy indicating if the person lives in the
southern region zi is a vector of control variables.
The percent difference between south/not south is given by
e1 1
and for small values of 1 is very similar to 1 . Suppose we are interested
exactly. A consistent estimator is e1 1. A natural problem is how to

construct a confidence interval for .

Tests for General S
Estimation of S
Example 2: consider now

yi = 1 exper + 2 exper2 + zi0 + ui
where exper is work experience in years. The level of experience
that maximizes expected wages is:
1
22
and a consistent estimate is provided by

=
1
22
How can we construct an estimate for the standard deviation of a

confidence interval for ?

Tests for General S
Estimation of S
Result (Delta Method): suppose xn is a sequence of random

vector of dimension K such that
p
d
xn and n(xn ) Z
and a(x) : <K <r is a function with continuous derivatives
A()
a()
0
(note A() is an r K matrix).
Then:
d

n a(xn ) a() A()Z

Tests for General S
Estimation of S
Proof: Take a first-order mean value expansion of a(xn ) around :

a(xn ) = a() + A(yn ) (xn )
where the mean value yn is a vector between xn and .
From this, get

n a(xn ) a() = A(yn ) (xn )
p
Now yn (why?) so A(yn ) A() by continuous

differentiability.
Then, by the hypothesis of the theorem and Slutzkys Theorem
d

n a(xn ) a() A()Z

Tests for General S
Estimation of S
As a simple corollary note that if
then
d
n(n ) N (0, AV(n ))

d

n a(n ) a() N 0, A()AV(n )A()0

Tests for General S
Estimation of S
Example 1 (Blackburn and Neumark, 1992, also in Wooldridge,

2002)
yi = 0 + 1 south + zi0 2 + ui
a(1 ) = e1 1
with
A(1 ) = e1
So, according to the delta-method

h i2
c e1 1 = e1 AV(1 )
AV

Tests for General S
Estimation of S
. reg lwage exper tenure married black south urban educ

-----------------------------------------------------------------------------lwage |
Coef.
Std. Err.
t
P>|t|
-------------+---------------------------------------------------------------exper |
.014043
.0031852
4.41
0.000
.007792
.020294
tenure |
.0117473
.002453
4.79
0.000
.0069333
.0165613
married |
.1994171
.0390502
5.11
0.000
.1227801
.276054
black | -.1883499
.0376666
-5.00
0.000
-.2622717
-.1144281
south | -.0909036
.0262485
-3.46
0.001
-.142417
-.0393903
urban |
.1839121
.0269583
6.82
0.000
.1310056
.2368185
educ |
.0654307
.0062504
10.47
0.000
.0531642
.0776973
_cons |
5.395497
.113225
47.65
0.000
5.17329
5.617704
-----------------------------------------------------------------------------. nlcom exp(_b[south])-1
lwage |
Coef.
Std. Err.
t
P>|t|
-------------+---------------------------------------------------------------_nl_1 | -.0868943
.0239677
-3.63
0.000
-.1339315
-.0398571

Tests for General S
Estimation of S
Example 2:
yi = 1 exper + 2 exper2 + zi0 + ui ,
a(1 , 2 ) = 1 /(22 )
. regress lwage edup edusi edus eduui eduu exper exper2 if muest==1
-----------------------------------------------------------------------------lwage |
Coef.
Std. Err.
t
P>|t|
-------------+---------------------------------------------------------------edup |
.2104513
.0629835
3.34
0.001
.0869123
.3339903
edusi |
.4148728
.0678469
6.11
0.000
.2817946
.547951
edus |
.7587112
.0695764
10.90
0.000
.6222406
.8951817
eduui |
1.018209
.077569
13.13
0.000
.866061
1.170356
eduu |
1.560496
.0769774
20.27
0.000
1.409509
1.711483
exper |
.0283668
.0071065
3.99
0.000
.0144279
.0423058
exper2 | -.0002502
.0001509
-1.66
0.098
-.0005462
.0000458
_cons |
.2130178
.0934142
2.28
0.023
.0297906
.3962449
-----------------------------------------------------------------------------. nlcom -_b[exper]/(2*_b[exper2])
_nl_1:
-_b[exper]/(2*_b[exper2])
-----------------------------------------------------------------------------lwage |
Coef.
Std. Err.
t
P>|t|
-------------+---------------------------------------------------------------_nl_1 |
56.6962
20.7191
2.74
0.006
16.05673
97.33567
------------------------------------------------------------------------------

Tests for General S
Estimation of S
Summary of variance results

Starting point:
p

n n 0 N (0, AV (n ))
1
0
2
0
AV (n ) = 1
x Sx , x = E(xi xi ), S = V (xi ui ) = E(ui xi xi ).
0
1
x = X X. S is any consistent estimator of S.
d (n ) =
1
1
AV
x S x
Tests
H0 : j = j0 , use tj
h
i1
d (n ) R0
H0 : R r = 0, use W = n (Rn r)0 R AV
(Rn r)
Allowing for conditional heteroskedasticity
P
2
2
0
1
Sw = n1 n
i=1 ei xi xi = n (X BX)
c w (n ) =
1
1
AV
x Sw x
Under homoskedasticity
1
2 1
S = E(u2i xi x0i ) = 02 E(xi x0i ) = 02 x , so AV (n ) = 1
x Sx = 0 x
d h (n ) = n s2 (X 0 X)1
AV

L Sample Inference

Uploaded by

Copyright:

Available Formats

You might also like

L Sample Inference

Uploaded by

Document Information

Original Description:

Original Title

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

L Sample Inference

Uploaded by

Copyright:

Available Formats

Tests for General S

Large-Sample Based Inference in the Linear Model

February 26, 2009

Large-Sample Based Inference in the Linear Model

In the classical linear model with finite n, we had to introduce

Large-Sample Based Inference in the Linear Model

Our starting point will be the asympototic normality result:

where x = E(xi x0i ) and S = V (xi ui ).

Large-Sample Based Inference in the Linear Model

Derive general test statistics assuming there is available a

Derive consistent estimators for S with and without

Obtain particular cases of the previous tests for the

Large-Sample Based Inference in the Linear Model

Test for Linear Hypothesis under general S

Hypothesis about single coefficients:

Large-Sample Based Inference in the Linear Model

Multiple Linear Hypothesis

Large-Sample Based Inference in the Linear Model

Proof: by asympototic normality, under H0

By Slutzkys theorem and linearity:

Large-Sample Based Inference in the Linear Model

where ei are the OLS residuals.

Large-Sample Based Inference in the Linear Model

First note that

by Kolmogorovs LLN, since we assumed finite second moments

Large-Sample Based Inference in the Linear Model

We will show the other two terms converge to zero

x0i ( )xi x0i

Note k k 0, by consistency. So if we can show

Large-Sample Based Inference in the Linear Model

By the Cauchy-Schwartz inequality:

so by the product rule and continuity, A 0.

Large-Sample Based Inference in the Linear Model

Using the same trick as before:

Now we have a sum of K 2 matrices. The (h, j) element of the

And again, by consistency and LLN, B 0. q.e.d.

Large-Sample Based Inference in the Linear Model

Then, using Sw as an estimator for S and noting

with B diag(e21 , . . . , e2n ),

Large-Sample Based Inference in the Linear Model

Large-Sample Based Inference in the Linear Model

Variance estimation under conditional homoskedasticity

So a consistent estimator for AV(n ) can be

Large-Sample Based Inference in the Linear Model

Tests under conditional homoskedasticity

where ajj is the jth element of the main diagonal of X 0 X, so we

Large-Sample Based Inference in the Linear Model

Multiple linear hypothesis

This provides an approximation of our old F statistic for the

Large-Sample Based Inference in the Linear Model

We derived general test for single coefficient (t) and multiple

Large-Sample Based Inference in the Linear Model

The Delta Method

exactly. A consistent estimator is e1 1. A natural problem is how to

Large-Sample Based Inference in the Linear Model

Example 2: consider now

and a consistent estimate is provided by