Professional Documents
Culture Documents
θ ∂V (i) /∂S S θ
θ ∂V (i) /∂S S θ
as the probit. In such case, Eq. (2.51) or (2.52) can be used to c of the k th variable.
notalculate approximate arc elasticities for the logit model, it is possible compute
point elasticities as follows:
ec i ( S ik ) =
Sik p ( i)
p ( i ) S ik
Sik e
V ( j)
V ( j)
Sik
[ ]
e V (i )
e V ( j)
j
S ik ik [ Ip ( i ) ]
Where
ik =V (i)/Sik
(5.26)
the parameter of the k th variable. Note that the elasticity has the same sign as
S ik ik
to the choice function V(i). also note that the choice elasticity according to
to p(i). this means that the more likely a particular choice is, the less sensitive it is
to change in the value of V (i). using the same principle
ec i ( V (i))=
p(i)/ p(i)
=V (i) [ 1p(i)]
V (i)/V (i)
(5.27)
Gives the elasticity of choice with respect to the whole V(i) function.
To illustrate the choice elasticity, consider the numerical example of the
previous section in which mode choice was given by a logit model with a choice
function V(i) = -0.2ti 0.1 ci. the conditional probabilities of mode choice P(m|j) were
computed as
p ( m| j )=
ec mj ( t m )
as follows:
ec mj ( t m )= t t m [ 1p(mj)]
Where
(5.28)
t m=
20 30 40
25 20 50
These elasticities clearly indicate the larger sensitivity of choice of the modes for
which the choice probabilities are small such as p (2|3) or p (1|2).
U ( i) =V ( i ) +e (i ) ,i=1,2,,n
p (i ) ,i=1,2,,n.
^
V= p ( i ) v ( i) ,for alli
i
Strictly, however, this is not the correct measure of the expected utilities to
be used in predicting choice at the next hierarchy of choice. It is only correct
in the case of deterministic choice, and only an approximation in the case of
stochastic choice. In this latter case, it is expected value of perceived
attributes that is relevant. This can be calculated by finding the expected
value
V^
^ [ maxU i ]
V=E
i
1 2
[ max U i ] f ( U 1 ) , ( U 2 ) ,, ( U n ) dU 1 ,U 2 ,,U n
(5.30)
Replacing U(i) by [V(i) + e(i)] we obtain
e(i)
f ()
V (i ) .
e(i)
V^
=p(i)
V (i)
(5.32)
V^
V^
e
= V (j)
V (i) e
V(i)
(5.33)
eV ( i)
V=
dV (i)
V ( j)
V (i) e
j
ln e
V ( j)
(5.34)
V ()
function, say
Xk
can be obtained as
1
X k = ln e V ( j)
k j
Where
is
V (i)/ X k
(5.35)
Xk
and
in the choice
V^
is that the
V max
not correct for it will reflect the maximum of the expected values, which is
not the same as expected value of the maximum. Using
V^
or
V max
for
t mj
[ ]
t mj=m 20 30
25 40
e0.2t
p ( m| j )= 0.2t
e
mj
rj
{ [
0.73 0.88
p ( m| j )= m=1
m=2 0.27 0.12
we can calculate the expected actual values
Vj
]
or
tj
t^ j
t^ j
as
t^ j=t 1 j p ( 1| j )+ t2 j p(2 j)
( 21.35,31.2 )
These are the expected values of the actual times to each destination after the
choices of mode are made. The expected perceived times
(5.34):
j21
j21
j22
t j
( 18.4,29.4 )
The values of
model than
t^ j
t j
choices are presumably made. It should not be surprising that the values of
t mj
t j
are
stochastic perception process. The difference between the two results will impact
the results of the destination choice is in versely proportional to
0.
p j =
t j
will give
tj
0.
p j =
2 (as in the
tj
72, 0.28).
V^
s for sequencing.
estimate the specific traffic volume involved. In the deterministic case, this is done
simply multiplying N with the proportions of choice. For example, let P (m, j, r) be
the proportion of all users of a particular mode, destination, and route combination
(m, j, r) , then the traffic volume Xmjr can be predicted from
X mjr=Np ( m, j,r )
(5.36)
This traffic will occur during a period over which the N trips are supposed to be
made. In a sequenced choice approach in urban travel demand, when a trip
generation predicts the total number of trips N that are made during, say, an
average weekday, Xmjr would then be daily flows of traffic. Equation (5.36) will then
be a demand function for daily trips with attributes mjr.
When the choice model used is stochastic, then the number of trips will be a
random variable with a multinomial distribution and with an expected value
X
E(mjr)=Np ( m , j,r )
(5.37)
And a variance
(5.38)
Taking the example of the previous section for an illustration, we recall the
three-mode situation where the logit model predicted
P ( m) =[ 0.12,0.875,0.005 ]
. if
there are 1000 trips distributed among the three modes, then the expected values
of traffic by each mode
Xm
will be
[105.6,109.4,4.975 ]
Note that the values of the variances relative to the means are lower for the
higher traffic volumes.
The relation between demand and choice can also be illustrated by reference to
Eq. (5.32):
V^
=p(i)
V (i)
choice with respect to one alternative equals the probability of choice of that
alternative. This is analogous calibration process to definition of demand in
terms of marginal benefits as discussed in Chap. 2. With a population of N
trip to be divided among alternatives l, the expected demand for each
alternative is given bt the expected marginal utility
V
(
)
E [ X i ]=N
=Np(i)
V (i)
(5.39)
V ( )
t a=38
t b=47
and
for the bus and the auto, respectively. There are ties that are measured
and assumed to be true for all 10 respondents. Suppose the mode choice is
modeled by a binary logit of the form
at A
e
p ( A )= at at
e +e
A
at B
e
p ( B ) = at at
e +e
A
p ( A ) e 38 9
= =e
p ( B ) e 47
0.6 9
=e
0.4
=
Had the
V ( )
ln 6/4
=0.045
9
regression might be used to estimate the parameters. Note that in this simple case,
only one parameter could be estimated since there is effectively one piece of
information, namely
p ( A )=0.6
and
p ( B ) =1p(A)
tA
and
tB
In general, the estimation of choice model parameters is done using the maximum
likelihood method. We discuss this method in the following paragraphs for the
general case of multinomial choice, at the disaggregate and aggregate level. We
return after that to the binary case as a special instance and illustrate some of its
features.
constructed for each individual and alternative. This function includes demand and
supply variables represented by a vector X nk.
pnk
of each
alternative for each of the individuals. Assuming without loss of generality that the
choction is linear, we can write V(n,k) = AXnk. Where A is a vector of parameters to
be estimated. In other words
(5.40)
Where
x ink
aik
(5.41)
we note that choice of each individual. At the same time, we observe the values of
X nk .
(5.42)
(5.43)
N k = Y nk
n=1
Nk
sample. We now partition the sample N into K subsets Sk , each of which contains
the.
Nk
Y nk
represent the outcomes of independent Bernoulli trials, i.e., assuming that the
choices of different individuals are independent, then the probability of
Nk
=p ( N 1 , N 2 ,,N k|A )
N!
(p )
N 1 ! N 2 ! N k ! k n S nk
(5.44)
N 1 !N 2 !N k !
) to give
ln p kn
k nS k
(5.45)
=0for allof i
aik
aik
(5.46)
a^ i
can
a ik a jk
^ =
(5.47)
^A
ratio test, as we shall demonstrate with an illustration. The facility in applying Eq.
(5.45) to (5.47) to estimate and evaluate parameter values of a choice depends on
the form of the model itself. The logit model, for example, lends itself to relatively
straightforward estimation. The probit model on the other hand requires
computational procedures that are rather complex and for which computer codes
have only recently become available [see, for example, Daganzo (1979)].
It should be noted that the specification of the model in Eq. (5.40) implies through
the
aik
variable
aik
aik =a i
meaning that the attributes have the same effect on choice regardless of which
alternative they described. The alternative-specific model obviously contains a
larger number of parameters and requires more data for its calibration. It has the
potential advantage of capturing effect of factors that are specific to alternatives. Its
disadvantage is in that it is not as amenable to policy analysis as the attributespecific option because it cannot predict the choice probabilities for novel
alternatives since no parameter values will exist for them. The alternative is to
make the parameters attribute-specific, or equivalently choice abstract. This
permits the prediction of the shares of any alternatives once the values og their
attributes are specified. However, the assumption of abstractiveness does caouse a
loss of explanatory problem. We shall see some of these models in later chapters.
When the choice model used is a multinomial logit, then it is possible to write down
the explicit solutions to the estimation problem uing Eqs. (5.43) to (5.45).
For the MNL model, Eq. (5.44) can be written as
V (n,k)
N!
e
=
N 1 ! N 2 ! N k ! k n S e V (n, k)
(5.48)
(V (n ,k ))
V (n,k)ln e
k nS k
By replacing
V (n,k)
with
5.49)
a ik xink
and maximizing
, we obtain the
equations
V ( n,k)
= x ink xink
=0
V
aik k nN
k n
e
(n ,k)
(5.50)
aik
The binomial (or binary) disaggregate choice is a special case with two alternatives.
In this case, the likelihood function Eq. (5.43) can be simplified:
N!
(p ) ( p )
N 1 ! N 2 ! nS n1 n S n2
1
(5.51)
Which for the logit model can be simplified upon taking logarithms to
nS2
(5.52)
To which Eq. (5.49) can be applied to obtain the estimates of the parameters A.
Note that in the case of disaggregate calibration it is not possible to take the
advantages of the simplifying property of the logit model that is shown in Eq. (5.22).
the relativesodds of alternative 1 and 2 in the disaggregate case are given by
p(1) e V(n,1)
=
p(2) e V(n,2)
(5.53)
The right-hand side is individual specific, and hence it is not possible to obtain
different observation for
p(1)/p(2)
linearization of Eq. (5.52) and the use of regression for the estimation of the
parameters can only be done in the aggregate case.
It is rather cumbersome to illustrate the disaggregate model estimation with a
numerical example, for the number of probabilities one needs to calculate is rather
large and equal to the product of the number of the individuals in the sample by the
number of alternatives. We can, however, illustrate the construction of the likelihood
function, for example, with a small sample size and a very simple choice model.
Suppose that the data given in Table 5.1 give the travel times measured for five
individuals, for each of two modes. Suppose that of these people the first twi were
observed to select the first mode and the other three the seconf mode. Suppose
that a simple choice model is used to explain this choice process as follows:
Table 5.1
S1
S2
xn1
xn2
1
2
5
4
7
6
3
4
5
6
6
4
4
4
5