Professional Documents
Culture Documents
Instrumental Variables: 2-Stage and 3-Stage Least Squares Regression of A Linear Systems of Equations
Instrumental Variables: 2-Stage and 3-Stage Least Squares Regression of A Linear Systems of Equations
Data Description
Prize Winnings and Performance Statistics for n = 146
professional women (LPGA) golfers for 2009 season
Exogenous Performance Variables:
Average Driving Distance
Percentage of Fairways reached on Drive
Percentage of Greens Reached in Regulation
Percentage of Sand Saves (in hole in 2 shots from close
traps)
Average Putts per hole on greens reached in regulation
Numbers of Events, Events Completed, Rounds
The fitted (predicted) values for SCORE will be used in equation 2 in place of
SCORE, and the fitted values for RANK in equation 3. Equation 1 has no right
Model 3: ln(Prize)
-1
-1
PZ = Z Z'Z Z'
-1
V 2SLS s X'PZ X
2
-1
'
^
SSR
R2
SSR SSE
SSE
s
n rank ( X )
2
1
-1
21
V 22
M
2 n
V 2SLS V
2
21
2
0 22
L
M M O
0 L
0
22n
X'PZ PZ X X'PZ X
1
1
-1
-1
X'PZ X X'Z Z'Z Z'Z Z'Z Z'X X'PZ X
0
Z
M
e22n
2
0 e22
L
S = Z'
M M O
0 L
0
^
e z z'
i 1
2
2i i i
^
'
e2i Y2i xi 2SLS
1
-1
-1
z'
z'
2
Z = X =
M
'
z n
x'
x'
x'
Model Description - I
Y 21
Y 22
Y1
Y2
M
Y
Y
1,146
2,146
F1
G1
1 D1
1 D
F2
G2
2
X1
MM
M
M
eki Yki Y ki
S12
S 22
S32
Y1
Y2
Y3
1 SC1
1 SC
2
X2
MM
1 SC146
E 1
E 2
M
E 146
1 RA1
S13
S 23
S33
146
1
S12 =
e1ie2i
146 (7 3) / 2 i 1
W S 1 Z Z'Z Z' S 1 PZ
1
C1
C
2
M
1 RA2
X 3
MM
1 RA146 C146
1 146 2
S11 =
e1i
146 7 i 1
S11
S S 21
S31
Y31
Y
32
Y3
Y
Y
3,146
S1
P1
R1
S2
P2
R2
M
M M
Model Description - II
^
-1
V 3SLS X'WX
-1
-1
-1
where:
S 11
S 1 S 21
S 31
S 12
S 22
S 32
S 13
S 23
S 33
S 11X1'PZ X1
S 12 PZ
S 22 PZ
S 32 PZ
S 12 X1'PZ X 2
S 22 X 2'PZ X 2
S 32 X 3'PZ X 2
S 13 X1'PZ X 3
S 23 X 2'PZ X 3
S 33 X 3'PZ X 3
S 11PZ
21
W
S
PZ
S 31PZ
S 13 PZ
S 23 PZ
S 33 PZ
-1
Estimation Results
EQ
1
EQ
2
EQ
3
SAS Program
data lpga2009;
infile 'lpga2009.dat';
input golfer drive fairway green putts sandsv prize lnprize
events girputts complete aveposrank rounds strokes;
lnprize1=log(prize);
run;
proc syslin 2sls out=regout;
instruments drive fairway green girputts sandsv rounds events complete;
strokes: model strokes = drive fairway green girputts sandsv rounds; output
residual=e1;
rank: model aveposrank = strokes events; output residual=e2;
prize: model lnprize1 = aveposrank complete; output residual=e3;
run;
proc syslin 3sls data=lpga2009 itprint out=regout3;
instruments drive fairway green girputts sandsv rounds events complete;
strokes: model strokes = drive fairway green girputts sandsv rounds / xpx;
output residual=e1;
rank: model aveposrank = strokes events / xpx;
output residual=e2;
prize: model lnprize1 = aveposrank complete / xpx;
output residual=e3;
STATA Program
insheet using lpga_2009_meq.csv
generate lnprize=ln(prize)