Professional Documents
Culture Documents
The Nature of Econometrics and Economic Data: - Steps in Econometric Analysis
The Nature of Econometrics and Economic Data: - Steps in Econometric Analysis
The Nature of Econometrics and Economic Data: - Steps in Econometric Analysis
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 1 password-protected website or school-approved learning management system for classroom use. 3
1 3
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Nature of Econometrics and Economic Data (1 of 22) The Nature of Econometrics and Economic Data (3 of 22)
• What is econometrics? • Economic model of crime (Becker (1968))
• Econometrics is the use of statistical methods to analyze economic data. • Derives equation for criminal activity based on utility maximization.
• Econometricians typically analyze nonexperimental data.
• Typical goals of econometric analysis:
• Estimating relationships between economic variables.
• Testing economic theories and hypotheses.
• Evaluating and implementing government and business policy.
• Common applications
• Forecasting macroeconomic variables (interest rates, inflation rates,
• Functional form of relationship not specified.
GDP).
• Equation could have been postulated without economic modeling.
• Forecasting non-macro variables (less visible).
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 2 password-protected website or school-approved learning management system for classroom use. 4
2 4
1 2
9/21/2021 9/21/2021
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Nature of Econometrics and Economic Data (4 of 22) The Nature of Econometrics and Economic Data (6 of 22)
• Model of job training and worker productivity • Econometric model of job training and worker productivity
• What is the effect of additional training on worker productivity?
• Formal economic theory not really needed to derive equation:
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 5 password-protected website or school-approved learning management system for classroom use. 7
5 7
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Nature of Econometrics and Economic Data (5 of 22) The Nature of Econometrics and Economic Data (7 of 22)
• Econometric model of criminal activity • Econometric analysis requires data.
• The functional form has to be specified. • There are several different kinds of economic data sets:
• Variables may have to be approximated by other quantities. • Cross-sectional data
• Time series data
• Pooled cross sections
• Panel/Longitudinal data
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 6 password-protected website or school-approved learning management system for classroom use. 8
6 8
3 4
9/21/2021 9/21/2021
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Nature of Econometrics and Economic Data (8 of 22) The Nature of Econometrics and Economic Data (10 of 22)
• Cross-sectional data sets • Table 1.2: Cross-sectional data on growth rates and country characteristics
• These may include samples of individuals, households, firms, cities,
states, countries, or other units of interest at a given point of time or in a
given period. obsno
1
country
Argentina
gpcrgdp
0.89
govcons60
9
second60
32
2 Austria 3.32 16 50
• Sometimes pure random sampling is violated, for example, people refuse 61 Zimbabwe 2.30 17 6
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 9 password-protected website or school-approved learning management system for classroom use. 11
9 11
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Nature of Econometrics and Economic Data (9 of 22) The Nature of Econometrics and Economic Data (11 of 22)
• Table 1.1: Cross-sectional data set on wages and other characteristics • Time series data
• This includes observations of a variable or several variables over time.
obsno
1
wage
3.10
educ
11
exper
2
female
1
married
0
• Examples include stock prices, money supply, consumer price index,
2
3
3.24
3.00
12
11
22
2
1
0
1
0
gross domestic product, annual homicide rates, automobile sales, and so
4
5
6.00
5.30
8
12
44
7
0
0
1
1
on.
• Time series observations are typically serially correlated.
. . . . . .
. . . . . .
. . . . . .
525
526
11.56
3.50
16
14
5
5
0
1
1
0
• Ordering of observations conveys important information.
• Data frequency may include daily, weekly, monthly, quarterly, annually,
and so on.
• Typical features of time series include trends and seasonality.
• Typical applications include applied macroeconomics and finance.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 10 password-protected website or school-approved learning management system for classroom use. 12
10 12
5 6
9/21/2021 9/21/2021
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Nature of Econometrics and Economic Data (12 of 22) The Nature of Econometrics and Economic Data (14 of 22)
• Table 1.3: Time series data on minimum wage, unemployment, and • Table 1.4: Pooled cross sections on two years of housing prices
related data for Puerto Rico
obsno year hprice proptax sqrft bdrms bthrms
1 1993 85,500 42 1600 3 2
obsno year avgmin avgcov prunemp prgnp 2 1993 67,300 36 1440 3 2
1 1950 0.20 20.1 15.4 878.7 3 1993 134,000 38 2000 4 2
2 1951 0.21 20.7 16.0 925.0 . . . . . . .
3 1952 0.23 22.6 14.8 1015.9 . . . . . . .
. . . . . . . . . . . . .
. . . . . . 250 1993 243,600 41 2600 4 3
. . . . . . 251 1995 65,000 16 1250 2 1
37 1986 3.35 58.1 18.9 4281.6 252 1995 182,400 20 2200 4 2
38 1987 3.35 58.2 16.8 4496.7 253 1995 97,500 15 1540 3 2
. . . . . . .
. . . . . . .
. . . . . . .
520 1995 57,200 16 1100 2 1
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 13 password-protected website or school-approved learning management system for classroom use. 15
13 15
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Nature of Econometrics and Economic Data (13 of 22) The Nature of Econometrics and Economic Data (15 of 22)
• Pooled cross sections • Panel or longitudinal data
• Two or more cross sections are combined in one data set. • The same cross-sectional units are followed over time.
• Cross sections are drawn independently of each other. • Panel data have a cross-sectional and a time series dimension.
• Pooled cross sections are often used to evaluate policy changes. • Panel data can be used to account for time-invariant unobservables.
• Example: • Panel data can be used to model lagged responses.
• Evaluating effect of change in property taxes on house prices. • Example:
• Random sample of house prices for the year 1993. • City crime statistics; each city is observed in two years.
• A new random sample of house prices for the year 1995. • Time-invariant unobserved city characteristics may be modeled.
• Compare before/after (1993: before reform, 1995: after reform). • Effect of police on crime rates may exhibit time lag.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 14 password-protected website or school-approved learning management system for classroom use. 16
14 16
7 8
9/21/2021 9/21/2021
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Nature of Econometrics and Economic Data (16 of 22) The Nature of Econometrics and Economic Data (18 of 22)
• Table 1.5: Two-year panel data set on city crime statistics • Causal effect of fertilizer on crop yield
• “By how much will the production of soybeans increase if one increases
obsno city year murders population unem police the amount of fertilizer applied to the ground.”
• Implicit assumption: all other factors that influence crop yield such as
1 1 1986 5 350,000 8.7 440
2 1 1990 8 359,200 7.2 471
3
4
2
2
1986
1990
2
1
64,300
65,100
5.4
5.5
75
75 quality of land, rainfall, presence of parasites, and so on are held fixed.
. . . . . . .
. . . . . . .
. . . . . . .
297
298
149
149
1986
1990
10
6
260,700
245,000
9.6
9.8
286
334
• Experiment = Feasible
299
300
150
150
1986
1990
25
32
543,000
546,200
4.3
5.2
520
493
• Choose several one-acre plots of land; randomly assign different amounts
of fertilizer to the different plots; compare yields.
• Experiment works because amount of fertilizer applied is unrelated to
other factors influencing crop yields.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 17 password-protected website or school-approved learning management system for classroom use. 19
17 19
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Nature of Econometrics and Economic Data (17 of 22) The Nature of Econometrics and Economic Data (19 of 22)
• Causality and the notion of ceteris paribus • Measuring the return to education
• “If a person is chosen from the population and given another year of
education, by how much will his or her wage increase?”
• Implicit assumption: all other factors that influence wages such as
experience, family background, intelligence, and so on are held fixed.
• Ceteris paribus: “other relevant factors being equal.”
• Most economic questions are ceteris paribus questions. • Experiment ≠ Infeasable
• It is important to define which causal effect one is interested in. • Choose a group of people; randomly assign different amounts of
• It is useful to describe how an experiment would have to be designed to education to them (infeasable!); compare wage outcomes.
infer the causal effect in question. • Problem without random assignment, amount of education is related to
other factors that influence wages (e.g. intelligence).
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 18 password-protected website or school-approved learning management system for classroom use. 20
18 20
9 10
9/21/2021 9/21/2021
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Nature of Econometrics and Economic Data (20 of 22) The Nature of Econometrics and Economic Data (22 of 22)
• Effect of law enforcement on city crime level • Testing predictions of economic theories
• “If a city is randomly chosen and given ten additional police officers, by • Economic theories are not always stated in terms of causal effects.
how much would its crime rate fall?” • For example, the expectations hypothesis states that long-term interest
• Alternatively: “If two cities are the same in all respects, except that city A rates equal compounded expected short-term interest rates.
has ten more police officers than city B, by how much would the two
cities’ crime rates differ?”
• Experiment ≠ Infeasable
• Randomly assign number of police officers to a large number of cities
(virtually impossible, as no two cities are alike in all respects except size • An implicaton is that the interest rate of a three-month T-bill should be equal to
of police force!). the expected interest rate for the first three months of a six-month T-bill; this
• More importantly, in reality, the number of police officers occurs can be tested using econometric methods.
contemoraneously with determination of crime rate.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 21 password-protected website or school-approved learning management system for classroom use. 23
21 23
• Experiment ≠ Infeasable
• Government randomly chooses minimum wage each year and observes
unemployment outcomes.
• Experiment would theoretically work because level of minimum wage is
unrelated to other factors determining unemployment.
• In reality, the level of the minimum wage will depend on political and
economic factors that also influence unemployment.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 22
22
11 12
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Chapter 2
The Simple Regression Model
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 1 password-protected website or school-approved learning management system for classroom use. 3
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Simple Regression Model (1 of 39) The Simple Regression Model (3 of 39)
• Definition of the simple regression model • Example: Soybean yield and fertilizer
• “Explains variable y in terms of variable x”
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 2 password-protected website or school-approved learning management system for classroom use. 4
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Simple Regression Model (4 of 39) The Simple Regression Model (6 of 39)
• When is there a causal interpretation?
• Conditional mean independence assumption
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 5 password-protected website or school-approved learning management system for classroom use. 7
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Simple Regression Model (5 of 39) The Simple Regression Model (7 of 39)
• Population regression function (PFR) • Deriving the ordinary least squares estimates
• The conditional mean independence assumption implies that • In order to estimate the regression model one needs data
• A random sample of n observations
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 6 password-protected website or school-approved learning management system for classroom use. 8
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Simple Regression Model (8 of 39) The Simple Regression Model (10 of 39)
• Deriving the ordinary least squares (OLS) estimators • Example of a simple regression
• Defining regression residuals • CEO salary and return on equity
• Fitted regression
• OLS estimators
• Causal interpretation?
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 9 password-protected website or school-approved learning management system for classroom use. 11
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Simple Regression Model (9 of 39) The Simple Regression Model (11 of 39)
• OLS fits as good as possible a regression line through the data points
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 10 password-protected website or school-approved learning management system for classroom use. 12
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Simple Regression Model (12 of 39) The Simple Regression Model (14 of 39)
• Example of a simple regression • Properties of OLS on any sample of data
• Wage and education • Fitted values and residuals
• Causal interpretation?
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 13 password-protected website or school-approved learning management system for classroom use. 15
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Simple Regression Model (13 of 39) The Simple Regression Model (15 of 39)
obsno roe salary salaryhat uhat
• Example of a simple regression 1 14.1 1095 1224.058 -129.058 • This table presents fitted
values and residuals for 15
• Voting outcomes and campaign expenditures (two parties) 2
3
10.9
23.5
1001
1122
1164.854
1397.960
-163.854
-275.969 CEOs.
4 5.9 578 1072.348 -494.348
5 13.8 1368 1218.508 149.493 • For example, the 12th CEO’s
6 20.0 1145 1333.215 -188.215 predicted salary is $526,023
7 16.4 1078 1266.611 188.611 higher than their actual
• Fitted regression
8 16.3 1094 1264.761 -170.761
salary.
9 10.5 1237 1157.454 79.546
10 26.3 833 1449.773 -616.773
• By contrast the 5th CEO’s
11 25.9 567 1442.372 -875.372
12 26.8 933 1459.023 -526.023
predicted salary is $149,493
13 14.8 1339 1237.009 101.991
lower than their actual
14 22.3 937 1375.768 -438.768
salary.
• Causal interpretation? 15 56.3 2011 2004.808 6.192
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 14 password-protected website or school-approved learning management system for classroom use. 16
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Simple Regression Model (16 of 39) The Simple Regression Model (18 of 39)
• Goodness of fit • CEO Salary and return on equity
• How well does an explanatory variable explain the dependent variable?
• Measures of variation:
• Caution: A high R-squared does not necessarily mean that the regression
has a causal interpretation!
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 17 password-protected website or school-approved learning management system for classroom use. 19
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Simple Regression Model (17 of 39) The Simple Regression Model (19 of 39)
• Decomposition of total variation • Incorporating nonlinearities: Semi-logarithmic form
• Regression of log wages on years of education
• Goodness-of-fit measure (R-squared) • This changes the interpretation of the regression coefficient:
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 18 password-protected website or school-approved learning management system for classroom use. 20
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Simple Regression Model (20 of 39) The Simple Regression Model (22 of 39)
• Fitted regression • CEO salary and firm sales: fitted regression
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 21 password-protected website or school-approved learning management system for classroom use. 23
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Simple Regression Model (21 of 39) The Simple Regression Model (23 of 39)
• Incorporating nonlinearities: Log-logarithmic form • Expected values and variances of the OLS estimators
• CEO salary and firm sales • The estimated regression coefficients are random variables because
they are calculated from a random sample
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 22 password-protected website or school-approved learning management system for classroom use. 24
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Simple Regression Model (24 of 39) The Simple Regression Model (26 of 39)
• Standard assumptions for the linear regression model
• Assumption SLR.1 (Linear in parameters)
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 25 password-protected website or school-approved learning management system for classroom use. 27
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Simple Regression Model (25 of 39) The Simple Regression Model (27 of 39)
• Discussion of random sampling: Wage and education • Assumptions for the linear regression model (cont.)
• The population consists, for example, of all workers of country A • Assumption SLR.3 (Sample variation in the explanatory variable)
• In the population, there is a linear relationship between wages (or log wages)
and years of education.
• Draw completely randomly a worker from the population
• The wage and the years of education of the worker drawn are random because
one does not know beforehand which worker is drawn.
• Throw that worker back into the population and repeat the random draw n
times. • Assumption SLR.4 (Zero conditional mean)
• The wages and years of education of the sampled workers are used to estimate
the linear relationship between wages and education.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 26 password-protected website or school-approved learning management system for classroom use. 28
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Simple Regression Model (28 of 39) The Simple Regression Model (30 of 39)
• Theorem 2.1 (Unbiasedness of OLS) • Graphical illustration of homoskedasticity
• Interpretation of unbiasedness
• The estimated coefficients may be smaller or larger, depending on the sample
that is the result of a random draw.
• However, on average, they will be equal to the values that characterize the true
relationship between y and x in the population.
• “On average” means if sampling was repeated, i.e. if drawing the random
sample and doing the estimation was repeated many times.
• In a given sample, estimates may differ considerably from true values.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 29 password-protected website or school-approved learning management system for classroom use. 31
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Simple Regression Model (29 of 39) The Simple Regression Model (31 of 39)
• Variances of the OLS estimators • An example for heteroskedasticity: Wage and education
• Depending on the sample, the estimates will be nearer or farther away from
the true population values.
• How far can we expect our estimates to be away from the true population
values on average (= sampling variability)?
• Sampling variability is measured by the estimator‘s variances
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 30 password-protected website or school-approved learning management system for classroom use. 32
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Simple Regression Model (32 of 39) The Simple Regression Model (34 of 39)
• Theorem 2.2 (Variances of the OLS estimators) • Theorem 2.3 (Unbiasedness of the error variance)
• Under assumptions SLR.1 – SLR.5:
• Conclusion:
• The sampling variability of the estimated regression coefficients will be the
higher, the larger the variability of the unobserved factors, and the lower, the The estimated standard deviations of the regression coefficients are called “standard errors.” They
higher the variation in the explanatory variable. measure how precisely the regression coefficients are estimated.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 33 password-protected website or school-approved learning management system for classroom use. 35
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Simple Regression Model (33 of 39) The Simple Regression Model (35 of 39)
• Estimating the error variance • Regression on a binary explanatory variable
• Suppose that x is either equal to 0 or 1
The Simple Regression Model (36 of 39) The Simple Regression Model (38 of 39)
• Counterfactual outcomes, causality and policy analysis • Random assignment
• In policy analysis, define a treatment effect as: • Subjects are randomly assigned into treatment and control groups such that
there are no systematic differences between the two groups other than the
treatment.
• In practice, randomized control trials (RCTs) are expensive to implement and
may raise ethical issues.
• Note that we will never actually observe this since we either observe • Though RCTs are often not feasible in economics, it is useful to think about the
yi(1) or yi(0) for a given i, but never both. kind of experiment you would run if random assignment was a possibility. This
helps in identifying the potential impediments to random assignment (that we
• Let the average treatment effect be defined as: could conceivable control for in a multivariate regression).
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 37 password-protected website or school-approved learning management system for classroom use. 39
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
The Simple Regression Model (37 of 39) The Simple Regression Model (39 of 39)
• Counterfactual outcomes, causality and policy analysis (contd.) • Example: The effects of a job training program on earnings
• Let xi be a binary policy variable. • Real earnings are regressed on a binary variable indicating
participation in a job training program.
• Therefore, regressing y on x will give us an estimate of the (constant) • Those who participated in the training program have earnings $1,790
treatment effect. higher than those who did not participate.
• As long as we have random assignment, OLS will yield an unbiased • This represents a 39.3% increase over the $4,550 average earnings
estimator for the treatment effect τ. from those who did not participate.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 38 password-protected website or school-approved learning management system for classroom use. 40
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 1 password-protected website or school-approved learning management system for classroom use. 3
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Estimation (1 of 37) Multiple Regression Analysis: Estimation (3 of 37)
• Definition of the multiple linear regression model • Example: Average test scores and per student spending
• “Explains variable y in terms of variables x1, x2,…, xk”
Multiple Regression Analysis: Estimation (4 of 37) Multiple Regression Analysis: Estimation (6 of 37)
• Example: Family income and family consumption • OLS Estimation of the multiple regression model
• Random sample
• Regression residuals
• Model has two explanatory variables: inome and income squared
• Consumption is explained as a quadratic function of income
• One has to be very careful when interpreting the coefficients: • Minimize sum of squared residuals
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 5 password-protected website or school-approved learning management system for classroom use. 7
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Estimation (5 of 37) Multiple Regression Analysis: Estimation (7 of 37)
• Example: CEO salary, sales and CEO tenure • Interpretation of the multiple regression model
• Model assumes a constant elasticity relationship between CEO salary and the • The multiple linear regression model manages to hold the values of
sales of his or her firm. other explanatory variables fixed even if, in reality, they are correlated
• Model assumes a quadratic relationship between CEO salary and his or her with the explanatory variable under consideration.
tenure with the firm.
• “Ceteris paribus”-interpretation
• Meaning of “linear” regression • It has still to be assumed that unobserved factors do not change if the
• The model has to be linear in the parameters (not in the variables) explanatory variables are changed.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 6 password-protected website or school-approved learning management system for classroom use. 8
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Estimation (8 of 37) Multiple Regression Analysis: Estimation (10 of 37)
• Example: Determinants of college GPA • “Partialling out” interpretation of multiple regression
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 9 password-protected website or school-approved learning management system for classroom use. 11
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Estimation (9 of 37) Multiple Regression Analysis: Estimation (11 of 37)
• Properties of OLS on any sample of data • Goodness-of-Fit
• Fitted values and residuals • Decomposition of total variation
• R squared
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 10 password-protected website or school-approved learning management system for classroom use. 12
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Estimation (12 of 37) Multiple Regression Analysis: Estimation (14 of 37)
• Example: Explaining arrest records • Standard assumptions for the multiple regression model
• Assumption MLR.1 (Linear in parameters)
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Estimation (13 of 37) Multiple Regression Analysis: Estimation (15 of 37)
• Example: Explaining arrest records (cont.) • Standard assumptions for the multiple regression model (cont.)
• An additional explanatory variable is added.
• Assumption MLR.3 (No perfect collinearity)
• In the sample (and therefore in the population), none of the independent
variables is constant and there are no exact linear relationships among the
independent variables.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 14 password-protected website or school-approved learning management system for classroom use. 16
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Estimation (16 of 37) Multiple Regression Analysis: Estimation (18 of 37)
• Example for perfect collinearity: small sample • Including irrelevant variables in a regression model
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 17 password-protected website or school-approved learning management system for classroom use. 19
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Estimation (17 of 37) Multiple Regression Analysis: Estimation (19 of 37)
• Standard assumptions for the multiple regression model (cont.) • Omitted variable bias
• Assumption MLR.4 (Zero conditional mean)
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 18 password-protected website or school-approved learning management system for classroom use. 20
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Estimation (20 of 37) Multiple Regression Analysis: Estimation (22 of 37)
• Example: Omitting ability in a wage equation • Standard assumptions for the multiple regression model (cont.)
• Assumption MLR.5 (Homoskedasticity)
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 21 password-protected website or school-approved learning management system for classroom use. 23
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Estimation (21 of 37) Multiple Regression Analysis: Estimation (23 of 37)
• Omitted variable bias: more general cases • Theorem 3.2 (Sampling variances of the OLS slope estimators)
• Under assumptions MLR.1 – MLR.5:
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 22 password-protected website or school-approved learning management system for classroom use. 24
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Estimation (24 of 37) Multiple Regression Analysis: Estimation (26 of 37)
• Components of OLS Variances: • An example for multicollinearity
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 25 password-protected website or school-approved learning management system for classroom use. 27
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Estimation (25 of 37) Multiple Regression Analysis: Estimation (27 of 37)
• Components of OLS Variances (contd.) • Discussion of the multicollinearity problem
• 3) Linear relationships among the independent variables • In the above example, it would probably be better to lump all expen-
• Regress xj on all other independent variables (including constant) diture categories together because effects cannot be disentangled.
• The R-squared of this regression will be the higher when xj can be better • In other cases, dropping some independent variables may reduce
explained by the other independent variables. multicollinearity (but this may lead to omitted variable bias).
• The sampling variance of the slope estimator for xj will be higher when xj can be
better explained by the other independent variables.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 26 password-protected website or school-approved learning management system for classroom use. 28
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Estimation (28 of 37) Multiple Regression Analysis: Estimation (30 of 37)
• Only the sampling variance of the variables involved in • Variances in misspecified models (cont.)
multicollinearity will be inflated; the estimates of other effects may
be very precise.
• Note that multicollinearity is not a violation of MLR.3 in the strict
sense.
• Multicollinearity may be detected through “variance inflation factors.” • Case 1:
• Case 2:
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 29 password-protected website or school-approved learning management system for classroom use. 31
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Estimation (29 of 37) Multiple Regression Analysis: Estimation (31 of 37)
• Variances in misspecified models • Estimating the error variance
• The choice of whether to include a particular variable in a regression can be
made by analyzing the tradeoff between bias and variance.
• An unbiased estimate of the error variance can be obtained by substracting the number of
estimated regression coefficients from the number of observations. The number of observations
minus the number of estimated parameters is also called the degrees of freedom. The n
estimated squared residuals in the sum are not completely independent but related through the
k+1 equations that define the first order conditions of the minimization problem.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 30 password-protected website or school-approved learning management system for classroom use. 32
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Estimation (32 of 37) Multiple Regression Analysis: Estimation (34 of 37)
• Estimation of the sampling variances of the OLS estimators • Theorem 3.4 (Gauss-Markov Theorem)
• Under assumptions MLR.1 - MLR.5, the OLS estimators are the best
linear unbiased estimators (BLUEs) of the regression coefficients, i.e.
• Note that these formulas are only valid under assumptions MLR.1- • OLS is only the best estimator if MLR.1 – MLR.5 hold; if there is
MLR.5 (in particular, there has to be homoskedasticity) heteroskedasticity for example, there are better estimators.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 33 password-protected website or school-approved learning management system for classroom use. 35
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Estimation (33 of 37) Multiple Regression Analysis: Estimation (35 of 37)
• Efficiency of OLS: The Gauss-Markov Theorem • Several Scenarios for Applying Multiple Regression
• Under assumptions MLR.1 - MLR.5, OLS is unbiased • Prediction
• However, under these assumptions there may be many other estimators that • The best prediction of y will be its conditional expectation
are unbiased.
• Which one is the unbiased estimator with the smallest variance?
• In order to answer this question one usually limits oneself to linear estimators,
i.e. estimators linear in the dependent variable. • Efficient markets
• Efficient markets theory states that a single variable acts as a sufficient statistic
for predicting y . Once we know this sufficient statistic, then additional
information is not useful in predicting y.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 34 password-protected website or school-approved learning management system for classroom use. 36
Introductory Econometrics: A Modern Approach (7e)
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 37
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 38
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Chapter 4
Multiple Regression Analysis:
Inference
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 1 password-protected website or school-approved learning management system for classroom use. 3
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Inference (1 of 37) Multiple Regression Analysis: Inference (3 of 37)
• Statistical inference in the regression model • Discussion of the normality assumption
• Hypothesis tests about population parameters • The error term is the sum of “many” different unobserved factors.
• Construction of confidence intervals • Sums of independent factors are normally distributed (CLT).
• Problems:
• Sampling distributions of the OLS estimators • How many different factors? Number large enough?
• The OLS estimators are random variables • Possibly very heterogenuous distributions of individual factors
• We already know their expected values and their variances • How independent are the different factors?
• However, for hypothesis tests we need to know their distribution
• In order to derive their distribution we need additional assumptions • The normality of the error term is an empirical question.
• Assumption about distribution of errors: normal distribution • At least, the error distribution should be “close” to normal.
• In many cases, normality is questionable or impossible by definition.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 2 password-protected website or school-approved learning management system for classroom use. 4
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Inference (4 of 37) Multiple Regression Analysis: Inference (6 of 37)
• Discussion of the normality assumption (cont.) • Testing hypotheses about a single population parameter
• Examples where normality cannot hold: • Theorem 4.2 (t-distribution for the standardized estimators)
• Wages (nonnegative; also: minimum wage) • Under assumptions MLR.1 – MLR.6
• Number of arrests (takes on a small number of integer values)
• Unemployment (indicator variable, takes on only 1 or 0)
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 5 password-protected website or school-approved learning management system for classroom use. 7
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Inference (5 of 37) Multiple Regression Analysis: Inference (7 of 37)
• Terminology • t-statistic (or t-ratio)
Multiple Regression Analysis: Inference (8 of 37) Multiple Regression Analysis: Inference (10 of 37)
• Testing against one-sided alternatives (greater than zero) • Example: Wage equation (cont.)
• Reject the null hypothesis in favour of the
alternative hypothesis if the estimated
coefficient is “too large” (i.e. larger than a
critical value).
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 9 password-protected website or school-approved learning management system for classroom use. 11
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Inference (9 of 37) Multiple Regression Analysis: Inference (11 of 37)
• Example: Wage equation • Testing against one-sided alternatives (less than zero)
• Test whether, after controlling for education and tenure, higher work
• Reject the null hypothesis in favour of the
experience leads to higher hourly wages. alternative hypothesis if the estimated
coefficient is “too small” (i.e. smaller than a
critical value).
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 10 password-protected website or school-approved learning management system for classroom use. 12
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Inference (12 of 37) Multiple Regression Analysis: Inference (14 of 37)
• Example: Student performance and school size • Example: Student performance and school size (cont.)
• Test whether smaller school size leads to better student performance • Alternative specification of functional form:
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 13 password-protected website or school-approved learning management system for classroom use. 15
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Inference (13 of 37) Multiple Regression Analysis: Inference (15 of 37)
• Example: Student performance and school size (cont.) • Example: Student performance and school size (cont.)
• One cannot reject the hypothesis that there is no effect of school size
on student performance (not even for a lax significance level of 15%).
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 14 password-protected website or school-approved learning management system for classroom use. 16
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Inference (16 of 37) Multiple Regression Analysis: Inference (18 of 37)
• Testing against two-sided alternatives • “Statistically significant” variables in a regression
• If a regression coefficient is different from zero in a two-sided test, the
• Reject the null hypothesis in favour of the
alternative hypothesis if the absolute value of corresponding variable is said to be “statistically significant”.
the estimated coefficient is too large. • If the number of degrees of freedom is large enough so that the normal
approximation applies, the following rules of thumb apply:
• Construct the critical value so that, if the null
hypothesis is true, it is rejected in, for example,
5% of the cases.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 17 password-protected website or school-approved learning management system for classroom use. 19
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Inference (17 of 37) Multiple Regression Analysis: Inference (19 of 37)
• Example: Determinants of college GPA • Testing more general hypotheses about a regression coefficient
• Null hypothesis
• t-statistic
• The test works exactly as before, except that the hypothesized value is
substracted from the estimate when forming the statistic.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 18 password-protected website or school-approved learning management system for classroom use. 20
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Inference (20 of 37) Multiple Regression Analysis: Inference (22 of 37)
• Example: Campus crime and enrollment • How the p-value is computed (here: two-sided test)?
• An interesting hypothesis is whether crime increases by one percent if • The p-value is the significance level at which one is
enrollment is increased by one percent. indifferent between rejecting and not rejecting the null
hypothesis.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 21 password-protected website or school-approved learning management system for classroom use. 23
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Inference (21 of 37) Multiple Regression Analysis: Inference (23 of 37)
• Computing p-values for t-tests • Guidelines for discussing economic and statistical significance
• If the significance level is made smaller and smaller, there will be a point where • If a variable is statistically significant, discuss the magnitude of the coefficient to
the null hypothesis cannot be rejected anymore. get an idea of its economic or practical importance.
• The reason is that, by lowering the significance level, one wants to avoid more • The fact that a coefficient is statistically significant does not necessarily mean it
and more to make the error of rejecting a correct H0. is economically or practically significant!
• The smallest significance level at which the null hypothesis is still rejected, is • If a variable is statistically and economically important but has the “wrong”
called the p-value of the hypothesis test. sign, the regression model might be misspecified.
• A small p-value is evidence against the null hypothesis because one would • If a variable is statistically insignificant at the usual levels (10%, 5%, or 1%), one
reject the null hypothesis even at small significance levels. may think of dropping it from the regression.
• A large p-value is evidence in favor of the null hypothesis. • If the sample size is small, effects might be imprecisely estimated so that the
• P-values are more informative than tests at fixed significance levels. case for dropping insignificant variables is less strong.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 22 password-protected website or school-approved learning management system for classroom use. 24
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Inference (24 of 37) Multiple Regression Analysis: Inference (26 of 37)
• Confidence intervals • Example: Model of firms‘ R&D expenditures
• Simple manipulation of the result in Theorem 4.2 implies that
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 25 password-protected website or school-approved learning management system for classroom use. 27
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Inference (25 of 37) Multiple Regression Analysis: Inference (27 of 37)
• Confidence intervals for typical confidence levels • Testing hypotheses about a linear combination of the parameters
• Example: Return to education at two-year vs. at four-year colleges
• Relationship between confidence intervals and hypotheses tests • A possible test statistic would be:
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 26 password-protected website or school-approved learning management system for classroom use. 28
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Inference (28 of 37) Multiple Regression Analysis: Inference (30 of 37)
• The standard error of the difference in parameters is impossible to • Testing multiple linear restrictions: The F-test
with standard regression output • Testing exclusion restrictions
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 29 password-protected website or school-approved learning management system for classroom use. 31
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Inference (29 of 37) Multiple Regression Analysis: Inference (31 of 37)
• Estimation results • Estimation of the unrestricted model
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 30 password-protected website or school-approved learning management system for classroom use. 32
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Inference (32 of 37) Multiple Regression Analysis: Inference (34 of 37)
• Estimation of the restricted model • Test decision in example
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 33 password-protected website or school-approved learning management system for classroom use. 35
Introductory Econometrics: A Modern Approach (7e) Introductory Econometrics: A Modern Approach (7e)
Multiple Regression Analysis: Inference (33 of 37) Multiple Regression Analysis: Inference (35 of 37)
• Rejection rule • Test of overall significance of a regression
• A F-distributed variable only takes
on positive values. This corresponds
to the fact that the sum of squared
residuals can only increase if one
moves from H1 to H0.
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a © 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 34 password-protected website or school-approved learning management system for classroom use. 36
Introductory Econometrics: A Modern Approach (7e)
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 37
• Restricted regression
• Test statistic
© 2020 Cengage. May not be scanned, copied or duplicated, or posted to a publicly accessible website, in whole or in part, except for use as permitted in a license distributed with a certain product or service or otherwise on a
password-protected website or school-approved learning management system for classroom use. 38