Professional Documents
Culture Documents
Econ 3044: Introduction To Econometrics Chapter-4: MLR: Further Issues and Dummy Variables
Econ 3044: Introduction To Econometrics Chapter-4: MLR: Further Issues and Dummy Variables
Lemi Taye
Lemi Taye (AAUSC) Ch 4: MLR: Further Issues and Dummy VariablesDecember 28, 2019 1 / 43
Overview
Lemi Taye (AAUSC) Ch 4: MLR: Further Issues and Dummy VariablesDecember 28, 2019 2 / 43
More on Using Logarithmic Functional Forms
Lemi Taye (AAUSC) Ch 4: MLR: Further Issues and Dummy VariablesDecember 28, 2019 3 / 43
More on Using Logarithmic Functional Forms
Thus, when nox increases by 1%, price falls by .718%, holding only
rooms fixed.
When rooms increases by one, price increases by approximately
100(.306) = 30.6%.
The estimate that one more room increases price by about 30.6%
turns out to be somewhat inaccurate for this application.
The approximation error occurs because, as the change in log(y)
becomes larger and larger, the approximation %∆y ≈ 100 · ∆ log(y)
becomes more and more inaccurate.
Lemi Taye (AAUSC) Ch 4: MLR: Further Issues and Dummy VariablesDecember 28, 2019 4 / 43
More on Using Logarithmic Functional Forms
\ = β̂2 ∆x2 .
Now, fixing x1 , we have ∆log(y)
Using simple algebraic properties of the exponential and logarithmic
functions gives the exact percentage change in the predicted y as
Lemi Taye (AAUSC) Ch 4: MLR: Further Issues and Dummy VariablesDecember 28, 2019 5 / 43
More on Using Logarithmic Functional Forms
When ∆x2 = 1,
%∆y = 100 · [exp(β̂2 ) − 1]. (4)
Applied to the housing price example with x2 = rooms and
[ = 100[exp(.306) − 1] = 35.8%, which is notably
β̂2 = .306, %∆price
larger than the approximate percentage change, 30.6%, obtained
directly from (2).
The adjustment in equation (3) is not as crucial for small percentage
changes.
Lemi Taye (AAUSC) Ch 4: MLR: Further Issues and Dummy VariablesDecember 28, 2019 6 / 43
Models with Quadratics
y = β0 + β1 x + β2 x2 + u.
Lemi Taye (AAUSC) Ch 4: MLR: Further Issues and Dummy VariablesDecember 28, 2019 7 / 43
Models with Quadratics
This says that the slope of the relationship between x and y depends
on the value of x; the estimated slope is β̂1 + 2β̂2 x.
If we plug in x = 0, we see that β̂1 can be interpreted as the
approximate slope in going from x = 0 to x = 1. After that, the
second term, 2β̂2 x, must be accounted for.
Lemi Taye (AAUSC) Ch 4: MLR: Further Issues and Dummy VariablesDecember 28, 2019 8 / 43
Models with Quadratics
Lemi Taye (AAUSC) Ch 4: MLR: Further Issues and Dummy VariablesDecember 28, 2019 9 / 43
Models with Quadratics
In the estimated equation (5) with β̂1 > 0 and β̂2 < 0, the turning
point (or maximum of the function) is always achieved at the
coefficient on x over twice the absolute value of the coefficient on x2 :
We obtain the turnaround value of rooms using equation (8) (even though
β̂1 is negative and β̂2 is positive). The absolute value of the coefficient on
rooms, .545, divided by twice the coefficient on rooms2 , .062, gives
rooms∗ = .545/[2(.062)] ≈ 4.4; this point is labeled in Figure 2.
the partial effect of bdrms on price (holding all other variables fixed)
is
∆price
= β2 + β3 sqrf t. (10)
∆bdrms
If β3 > 0, then (10) implies that an additional bedroom yields a higher
increase in housing price for larger houses.
We must interpret this equation with extreme care. If we simply look at the
coefficient on atndrte, we will incorrectly conclude that attendance has a
negative effect on final exam score.
In model (12), only two observed factors affect wage: gender and
education.
Because f emale = 1 when the person is female, and f emale = 0
when the person is male, the parameter δ0 has the following
interpretation: δ0 is the difference in hourly wage between females and
males, given the same amount of education.
The key here is that the level of education is the same in both
expectations; the difference, δ0 , is due to gender only.
At this point, you may wonder why we do not also include in (12) a
dummy variable, say male, which is one for males and zero for females.
This would be redundant. In (12), the intercept for males is β0 , and
the intercept for females is β0 + δ0 . Because there are just two
groups, we only need two different intercepts.
This means that, in addition to β0 , we need to use only one dummy
variable; we have chosen to include the dummy variable for females.
Using two dummy variables would introduce perfect collinearity
because f emale + male = 1, which means that male is a perfect
linear function of f emale.
Including dummy variables for both genders is the simplest example of
the so-called dummy variable trap, which arises when too many
dummy variables describe a given number of groups.
where the intercept for females is α0 and the intercept for males is
α0 + γ0 ; this implies that α0 = β0 + δ0 and α0 + γ0 = β0 .
In any application, it does not matter how we choose the base group,
but it is important to keep track of which group is the base group.
The negative intercept—the intercept for men, in this case—is not very
meaningful because no one has zero values for all of educ, exper, and
tenure in the sample. The coefficient on f emale is interesting because it
measures the average difference in hourly wage between a man and a
woman who have the same levels of educ, exper, and tenure. If we take a
woman and a man with the same levels of education, experience, and
tenure, the woman earns, on average, $1.81 less per hour than the man.
Lemi Taye (AAUSC) Ch 4: MLR: Further Issues and Dummy Variables
December 28, 2019 30 / 43
A Single Dummy Independent Variable
\F − wage
(wage \M = exp(−.297) − 1 ≈ −.257.
\M )/wage
This more accurate estimate implies that a woman’s wage is, on average,
25.7% below a comparable man’s wage.
Lemi Taye (AAUSC) Ch 4: MLR: Further Issues and Dummy Variables
December 28, 2019 34 / 43
Using Dummy Variables for Multiple Categories
(The other factors are the standard ones for wage regressions, including
education, experience, gender, and marital status; see Krueger’s paper for
the exact list.)
Lemi Taye (AAUSC) Ch 4: MLR: Further Issues and Dummy Variables
December 28, 2019 40 / 43
Interactions Involving Dummy Variables