Professional Documents
Culture Documents
GY460 Techniques of Spatial Analysis: Steve Gibbons
GY460 Techniques of Spatial Analysis: Steve Gibbons
Steve Gibbons
Introduction
• Theoretical framework
– Random utility model
• Empirical methods:
– Micro: Probit, logit, multinomial logit
U ij Vij ij
ij is random
– reflects idiosyncratic tastes of i and unobserved attributes of choice j
Random Utility Model
• Define
yi 1 if i chooses bus
yi 0 if i chooses car
• So choose bus if U i1 U i 0
Vi1 i1 Vi 0 i10
i1 i10 Vi1 Vi 0
• So the probability that we observe an individual choosing bus
travel is
Pr ob i1 i 0 Vi1 Vi 0
Pr ob i1 i 0 ti1 ti 0 pi1 pi 0
The linear probability model
Pr ob bus
1
0
Vi
Non-linear probability model
Pr ob bus
1
0
Vi
Probits and logits
• Common assumptions:
– Cumulative normal distribution function – “Probit”
exp Vi
Pr ob i chooses bus
1 exp Vi
Pr ob yi 1 F xi β
Prob yi 0 1 F xi β
i n
ln L yi lnF xi β 1 yi 1 F xi β
i 1
Example
• McFadden, D. (1974) The Measurement of Urban Travel Demand, Journal of Public Economics, 3
Characteristics t
U ij Vij ij
• Choose choice k if utility higher than for all other choices
j J i n
ln L yij lnProb i chooses j
j 1 i 1
Multinomial logit (2)
• Vij tij
Parameters estimated: pij j zij
• For an individual characteristic that is common across choices
(e.g. income, gender): one parameter per choice
– For at least one choice this is zero (base case).
Source: Scott, Coomes et al (note: they also report models which include individual Xs)
The independence of irrelevant alternatives problem (IIA)
and the nested logit model
Multinomial logit and “IIA”
– Train 60%
• IIA assumes that if red bus company shuts down, the market shares become
– Blue bus 20% + 5% = 25%
• Because the ratio of blue bus trips to train trips must stay at 1:3
Independence of Irrelevant Alternatives
• Model assumes that ‘unobserved’ attributes of all alternatives are perceived as equally similar
• But will people unable to travel by red bus really switch to travelling by train?
– Train: 60%
exp Vik
exp Vil
• Value placed on choices available in second stage (3,4) enter into calculation
of choice probabilities in first stage (2)…
I 2 ln exp V3 exp V4
• Estimate logit model for Car (1) versus Public (2) using:
exp V2 I 2
Pr ob Public
exp V2 I 2 exp V1
Example: Transport mode choice
Train Bus
Example: Transport mode choice
Variable Parameter
Cost -0.002
• Residential choice
Aggregate choice models
Micro and aggregated choice models
exp Vk
Pr ob i chooses k
exp V
j
j
ln Pr ob i chooses k Vk ln exp V j
j
ln nk / N xk i
• i.e. if you only have choice characteristics, you could use a choice-level
regression of the proportion of individuals making each choice on the
choice characteristics
exp k k nk
Pr ob number choosing k = nk
nk !
ln k xk
• See Guimaraes et al Restats (2003)
– though this equivalence was known before this ‘discovery’
• Here’s an example…
Data (295 i’s 3 j’s)
id choice d x
1 American 0 18.97627
1 Japan 0 7.542373
1 Europe 1 3.461017
2 American 1 18.97627
2 Japan 0 7.542373
2 Europe 0 3.461017
3 American 1 18.97627
3 Japan 0 7.542373
3 Europe 0 3.461017
4 American 0 18.97627
4 Japan 1 7.542373
4 Europe 0 3.461017
5 American 1 18.97627
5 Japan 0 7.542373
5 Europe 0 3.461017
Conditional logit
------------------------------------------------------------------------------
choice | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
x | .0999331 .0091997 10.86 0.000 .081902 .1179642
------------------------------------------------------------------------------
Simpler data
choice n x p
------------------------------------------------------------------------------
n | Coef. Std. Err. z P>|z| [95% Conf. Interval]
-------------+----------------------------------------------------------------
x | .0999331 .0091997 10.86 0.000 .081902 .1179642
_cons | 3.364614 .1450806 23.19 0.000 3.080262 3.648967
------------------------------------------------------------------------------
OLS
. reg lnp x
------------------------------------------------------------------------------
lnp | Coef. Std. Err. t P>|t| [95% Conf. Interval]
-------------+----------------------------------------------------------------
x | .101293 .0052644 19.24 0.033 .034403 .168183
_cons | -2.339238 .06295 -37.16 0.017 -3.139094 -1.539383
------------------------------------------------------------------------------
Aggregate v micro choice models
• Hence, there’s little point in using conditional logit if you only have
choice-characteristics
ln( n jk ) xjk j k jk
• Typically characteristics of destination and source include some
measure of “attraction” e.g. population mass (or “market potential” in
trade models) wages (endogenous)
• And measure of the cost in moving between place j and d (e.g. log
distance)
ln(n jk ) ln d jk xjk j k jk
• Hence gravity – after Newton
• Even for internet site visits!: see Blum and Goldfarb (2006) Journal of
International Economics
• Trade literature has many examples