Professional Documents
Culture Documents
JHE Vol. 42
JHE Vol. 42
JHE Vol. 42
a r t i c l e i n f o a b s t r a c t
Article history: This paper analyzes the interaction of direct and indirect risk selection in health insurance markets. It is
Received 15 April 2014 shown that direct risk selection – using measures unrelated to the benefit package like selective adver-
Received in revised form 6 September 2014 tising or ‘losing’ applications of high risk individuals – nevertheless has an influence on the distortions of
Accepted 9 December 2014
the benefit package caused by indirect risk selection. Direct risk selection (DRS) may either increase or
Available online 17 December 2014
decrease these distortions, depending on the type of equilibrium (pooling or separating), the type of DRS
(positive or negative) and the type of cost for DRS (individual-specific or not). Regulators who succeed in
JEL classification:
reducing DRS by, e.g., banning excessive advertising or implementing fines for ‘losing’ applications, may
I13
I18
therefore (unintendedly) mitigate or exacerbate the distortions of the benefit package caused by indirect
L13 risk selection. It is shown that the interaction of direct and indirect risk selection also alters the formula
for optimal risk adjustment.
Keywords: Risk © 2015 Elsevier B.V. All rights reserved.
selection Risk
adjustment
Health insurance
Discrete choice
Imperfect competition
3
van de Ven and van Vliet (1992) provide an extensive list of measures insur-
∗ Tel.: +49 651 2012624. ers may use for risk selection; for differential treatment of low and high risks’
E-mail address: lorenzn@uni-trier.de applications see Bauhoff (2012).
1 4 See Shen and Ellis (2002).
See van de Ven and Ellis (2000).
2 5
See Breyer et al. (2011). See Frank et al. (2000), Cao and McGuire (2003) and Ellis and McGuire (2007).
http://dx.doi.org/10.1016/j.jhealeco.2014.12.003
0167-6296/© 2015 Elsevier B.V. All rights reserved.
82 N. Lorenz
N. Lorenz
/ Journal
/ Journal
of Health
of Health
Economics
Economics
42 (2015)
42 (2015)
81–89
81–89 2
risk adjustment has been concerned with improving this underly- Table 1
Effect of DRS with individual-specific cost on the distortion of the benefit package.
ing regression by, e.g., including additional variables or altering the
grouping algorithm for diagnoses in morbidity based risk adjust- Type of equilibrium Positive DRS Negative DRS
ment, so that a larger part of the variance of actual expenditures is Pooling equilibrium Distortion decreases Distortion increases
explained. The larger the explained part of the variance, the closer Separating equilibrium Distortion decreases Distortion decreases
transfers are to actual cost, and the lower the incentives for risk
selection should be.
Initiated by the very influential study of Glazer and McGuire high risks is reduced. Negative DRS on the other hand creates a ‘sub-
(2000), there has developed a small literature that departs from this stitution effect’: If insurers are (somewhat) successful in repelling
statistical approach and instead explicitly models insurers’ incen- the high risks by DRS, they can reduce the degree of IRS. For an
tives for risk selection. One study in this literature has shown that overview of these results see, Table 1.
conventional, i.e. regression-based, risk adjustment may decrease A regulator who succeeds in reducing negative DRS (by, e.g.,
welfare if there is imperfect competition, another, that it may even charging a fine for ‘losing’ applications of high risk individuals) will
increase the extent of risk selection.6 These undesirable effects of therefore simultaneously reduce IRS in the pooling equilibrium, but
conventional risk adjustment exemplify the need for what Glazer unintendedly increase the distortion of the benefit package in the
and McGuire (2000) have termed optimal risk adjustment. They separating equilibrium. The distortions caused by IRS will also be
have shown that a regulator can increase the effectiveness of a risk increased if he succeeds in reducing positive DRS (by, e.g., banning
adjustment scheme by distorting the payments as calculated from excessive advertising), in this case for both the pooling and the
a regression: there has to be overpayment for signals which are
separating equilibrium.
correlated with high risks and underpayment for signals which are
In three of the four cases, optimal risk adjustment then becomes
correlated with low risks. If the over- and underpayments are cho-
even more important. We therefore derive the impact of DRS
sen optimally, incentives for IRS can be eliminated completely.7
on the formula for optimal risk adjustment developed by Glazer
Optimal risk adjustment has also been derived for a setting where
and McGuire (2000). We show that the overpayment for a signal
individuals differ in their elasticity to switch insurers or where
insurers are allowed to vary their premium in some dimension, that indicates a high risk has to be increased exactly by insurers’
expenditures on positive DRS; likewise, the underpayment has to
as is the case in the insurance exchanges in the US.8
be reduced by the expenditures on negative DRS. With this modi-
A concern, already raised by Glazer and McGuire (2000) them-
fication, their formula can eliminate the incentives for IRS even in
selves, is that such over- and underpayments create incentives
the presence of DRS.
for DRS regarding the signal, but so far it has not been analyzed
In the literature on optimal risk adjustment, some of the results
whether this has an influence on optimal risk adjustment. In fact,
regarding the distortions caused by IRS have been derived under
we are not aware of any theoretical study that explicitly models
perfect competition, but DRS seems incompatible with such a
the interaction of direct and indirect risk selection, even in the
setting where individuals are perfectly informed about all ben-
absence of risk adjustment.9 In this study we therefore develop
efit packages and premiums and always choose the insurer that
such a model and show that in general (the degree of) DRS has
offers the best benefit package-premium combination. We there-
an influence on the distortions of the benefit package caused by
fore derive our results within a discrete choice model, which can
IRS and that this alters the formula for optimal risk adjustment.
easily capture different levels of competition. To keep the model
DRS may either increase or decrease the distortions caused by
simple, we assume that the benefit package is one-dimensional, but
IRS, depending on whether insurers try to attract the low risks
the model can be extended to a multi-dimensional benefit package.
(positive DRS) or to repel the high risks (negative DRS), whether a
Also, to simplify the notation when deriving the results, we first
pooling or a separating equilibrium emerges, and whether the cost
consider the case of two risk types, but then show that the results
for DRS is individual-specific or not.
also hold for an arbitrary number of risk types.
If insurers’ expenditures for DRS are at least to some degree
The remainder of this paper is organized as follows: In Section
individual-specific (and not just a fixed cost), they affect risk-type-
2, we introduce the basic discrete choice model and show how DRS
specific cost: Positive DRS increases the cost per low risk, negative
can be incorporated in such a model. We analyze the pooling equi-
DRS the cost per high risk. In the first case, the cost difference
librium in Section 3 and the separating equilibrium in Section 4.
between the risk types is reduced, in the second, it is increased.
Section 5 concludes.
In the pooling equilibrium where both risk types pay the same pre-
mium, positive DRS therefore reduces the incentives for IRS, while
negative DRS increases it. 2. The model
In the separating equilibrium, insurers’ expenditures for posi-
tive DRS translate into a higher premium for the contract offered 2.1. Basic model without DRS
for the low risks; this makes this contract less attractive for the high
risks, so the distortion of the benefit package necessary to repel the Individual preferences regarding the benefit-premium bundle
are given by
u = pr v(m) − R, (1)
6
See Lorenz (2013) and Brown et al. (2011), respectively. In the empirical part of
their study, Brown et al. (2011) find such an increase in the extent of risk selection where R denotes the premium and m the level of medical services
for the Medicare Advantage program in the U.S.; however, there has been some (measured in monetary terms). pr is the probability of becoming
disagreement on this finding, see Newhouse et al. (2012). ill, and there are two risk types r = H, L, with pH > pL ; the share of
7 See also Glazer and McGuire (2002) and Jack (2006).
8
the low risks is . The utility of receiving medical treatment, v(m),
For the first setting, see Bijlsma et al. (2011), and for the second, McGuire et al.
(2013) and Shi (2013).
is increasing at a decreasing rate, i.e. v∗ (m) > 0 and v∗∗ (m) < 0. The
9 Eggleston (2000) derives the optimal mix of supply and demand side cost shar- efficient level of medical services is implicitly defined by v∗ (mFB ) =
ing for a setting with a single (semi-altruistic) HMO that can influence the level of 1.
medical services (according to the outcome of a patient–provider bargaining pro- There are n insurers j, each offering a benefit-premium bundle
cess) and can dump a share of the high risks at some cost; however, there is no {mj , Rj }. The individual’s decision of which insurer to choose may,
competition as there is only one provider.
however, not only depend on these benefit-premium bundles, but
82 N. Lorenz
N. Lorenz
/ Journal
/ Journal
of Health
of Health
Economics
Economics
42 (2015)
42 (2015)
81–89
81–89 3
risk adjustment has been concerned with improving this underly- Table 1
also on some other factors, like perceived friendliness of personnel, 2.2. The model with positive and negative DRS
location, or which insurer was recommended by family or friends.
In a discrete choice model, these other factors are captured by aug- We consider positive DRS to be an activity each insurer is
menting the individual’s utility as given in (1) by an individual- and engaged in which generates some cost and increases the proba-
j
insurer-specific utility component εi; the utility of an individual i bility of being chosen by the individual (or group of individuals)
(being of risk type r) when choosing an insurer j therefore is the activity is targeted at. We model this increase in the probability
of being chosen to stem from an increase in the utility the individ-
u i (mj , Rj ) = pr v(mj ) − Rj + εji . (2) ual receives, which may either be real (as, e.g., with a discount for
a fitness club membership) or just perceived (as with advertising).
If εj is assumed to be i.i.d. extreme value, the logit model with its We denote the cost by aj and the increase in utility by g(aj ), where
i g(aj ) is increasing and concave.13 With positive DRS, the (perceived)
analytically tractable choice probabilities arises. Denoting risk type utility of individual i choosing an insurer j therefore is
r’s utility of the benefit-premium bundle offered by insurer j by
u i (mj , Rj ) = pr v(mj ) − Rj + g(aj ) + εji, (4)
Vrj = pr v(mj ) − Rj ,
so that insurer k’s market share among risk type r is given by14
and specifying the variance of ε as Var(ε ) =
j j 2 2 /6, the probabil-
i i
ity of individual i (being of risk type r) choosing a particular insurer
k k
k is10 e(V
r +g(a ))/
Prk = j
. (5)
e(Vr +g(aj ))/
j
Prob(i chooses k) = Prob(Vr k + εk > V l + εl ∀ l =
/ k)
i r i
Two cases regarding the cost aj can be distinguished:
k
eVr / non-individual-specific and individual-specific cost. With non-
= j
. (3) individual-specific cost, total cost for DRS of an insurer j is
n
e Vr /
j=1 independent of the number of individuals choosing this insurer.
The prime example for this case is selective advertising, where
Denote this probability by P kr ; it is also insurer k’s market share cost does not increase if an additional individual chooses insurer
among the individuals of risk type r. P k is increasing in V k : a higher
r r j. With individual-specific cost, total cost for DRS of an insurer
10
share of individuals of risk type r will choose insurer k, if this insurer See Train (2009, p. 40).
11
offers a higher level of medical services or charges a lower premium. See Zweifel et al. (2009), chapter 7, for the Rothschild–Stiglitz equilibrium in
this setting.
The variance of the additional utility component, Var(εji ) = 12 We explain why this occurs in Section 4.1.
2 2 /6, is a measure of the level of competition in this health insur-
ance market. If is small, all the εij are very similar and therefore
only play a minor role in which insurer is chosen: Offering an only
somewhat higher utility level than all the other insurers will, in this
case, already attract a large share of all individuals; this implies a
high level of competition. If, on the other hand, is large, the other
factors besides the benefit level and the premium – captured by
j
large positive and large negative εi – are rather important, so that
insurers, when increasing their premium (or reducing their benefit
level), only lose a small share of their insured; a large level of
therefore corresponds to a low level of competition.
As shown by Lorenz (2013) who has analyzed this basic model
without DRS, the level of competition determines which type of
equilibrium emerges: If the level of competition is low (i.e., is
large), there will be a pooling equilibrium: all insurers offer the
same benefit-premium bundle, so each individual i chooses the
j
insurer j for which his εi is maximal. If the level of competition
is high (i.e., is small), a separating equilibrium similar (but not
identical) to the Rothschild–Stiglitz equilibrium under perfect com-
petition arises.11 Some of the n insurers offer a benefit-premium
bundle designated for the low risks, the remaining insurers a con-
tract designated for the high risks (with a larger benefit package
and a higher premium). All the low risks choose the insurer with
j
the highest εi among the first set of insurers, but only most of the
j
high risks choose the insurer with the highest εi among the second
set. Because a small share of the high risks chooses one of the insur-
ers offering the contract designated for the low risks, the separation
of risk types is not perfect.12
82 N. Lorenz
N. Lorenz
/ Journal
/ Journal
of Health
of Health
Economics
Economics
42 (2015)
42 (2015)
81–89
81–89 4
risk adjustment has been concerned with improving this underly- Table 1
13
Since we are interested in a setting where insurers are engaged in DRS, we
assume lim g ∗ (a) → ∞ to guarantee an interior solution.
a→0+
14
To simplify the notation, we do not introduce different symbols for P kj for the
case of no, positive or negative DRS; we will, however, always make clear to which
case we refer.
15 In Section 5 we argue that the main effects should be similar if negative DRS
does not occur during the application process, but is targeted at individuals who
already hold a contract with the insurer.
84 N. Lorenz
N. Lorenz
/ Journal
/ Journal
of Health
of Health
Economics
Economics
42 (2015)
42 (2015)
81–89
81–89 84
ing the equilibrium without DRS in Section 3.1. We consider the tive insurers’ expenditures on risk selection are, the larger, c.p.,
equilibrium with positive DRS in Section 3.2 and analyze the impact the equilibrium level of ak will be. These expenditures affect the
of positive DRS on risk adjustment in Section 3.3. We then show distortion of the benefit level m, which is now given by
that the results also hold for an arbitrary number of risk types in
Section 3.4. The effects of negative DRS, which are just the opposite 2
Normalizing the mass of individuals to one and assuming profit Because the last term in the brackets [·] is positive, compared with
maximization, the objective of insurer k is
(8), the condition if there is no DRS, v∗ (mk ) has to decrease: mk
= P kL increases, so the distortion is reduced. More generally, the larger
L+ (1 − )P kH
k k k
max H, (6)
mk ,Rk the equilibrium level of ak , the larger mk .
where k = Rk − pr mk denotes insurer k’s profit per individual of
r Result 1. In the pooling equilibrium, the distortion of the benefit
risk type r. The solution to this objective yields the following two level decreases in the level of positive DRS if cost for DRS is individual-
conditions (see Appendix A.1): First, specific.
n
k
+ (1 − ) k
= . (7)
L H n−1 The incentive to distort the benefit level (with or without DRS)
arises because profit per high risk is lower than profit per low
Average profit per insured decreases in n and increases in : a risk, where the degree of the distortion depends on the difference
higher level of competition (a larger number of insurers n or a between these two profits, which, in the case without DRS, is given
smaller level of ) decreases profit. Secondly, the condition deter-
by20 − = (p − p )m .
k k H L k
mining the distortion of the benefit level is given by L H
With positive DRS, profit per low risk decreases because insurers
(1 − )(pH − pL )
2 waste part of this profit on their expenditures for DRS; this reduces
k k
1− n m v∗ (m ) = 1, (8) the difference between net profits (including ak ), and thereby the
p
n−1
incentive to distort the benefit level.
where p = pL + (1 − )pH . Because the fraction is positive, it is This is different for the case of non-individual-specific cost.21
immediately apparent that v∗ (mk ) > 1, so that mk is distorted below Although in this case, expenditures for risk selection decrease total
the efficient level mFB . As is to be expected, the distortion increases profit, they do not specifically decrease profit per low risk, so the
in the difference pH − pL and in the level of competition (captured difference between the risk-type-specific profits remains the same:
by n /(n − 1)). Therefore, positive DRS has no influence on the distortion of the
benefit level if cost is non-individual-specific. In the following, we
3.2. The pooling equilibrium with positive DRS will therefore only consider individual-specific cost.
The solution to this objective yields a positive equilibrium level of insurers’ expenditures on DRS, implicitly defined by
85 N. Lorenz
N. Lorenz
/ Journal
/ Journal
of Health
of Health
Economics
Economics
42 (2015)
42 (2015)
81–89
81–89 85
With positive DRS of the young and individual-specific cost, so that the distortion of the benefit level is eliminated for
insurer k’s objective reads as22 (1 − ) H
RAO = (p − pL )m. (15)
ı
k
= k k
rs P rs r − k k
rY P rYa , (11)
r s r
If there is perfect correlation, the share of the low risks equals the
share of the young, so = ; in addition, the mass of individuals
k contains g(ak ) and g(aj ). The distortion of the benefit in the lower left and the upper right corner in Table 2 has to be
where only PrY
level is now determined by23 zero, which requires ı = (1 − ) = (1 − ). Replacing in (15) shows
that with perfect correlation the efficient benefit level is therefore
(1 − )(pH − pL ) 2 (pH − pL ) implemented for RAO = (pH − pL )m, which is just the cost difference
1− n mk + n ıak v∗ (mk ) = 1. (12)
n−1 p n−1 p
between the two risk types. With less than perfect correlation,
ı < (1 − ), so there is overpayment, and the lower the level of
Because the last term in the brackets is positive, compared with no correlation (i.e., the lower ı), the larger this overpayment has to
DRS, v∗ (mk ) has to decrease, so the distortion is reduced. However, be.
for a given level of ak , the reduction of the distortion is of course not
as large as with DRS against the risk type itself, since ı < (1 − ). 3.3.2. Optimal risk adjustment with positive DRS
If there is less than perfect correlation, expenditures on DRS of the With optimal risk adjustment there is overpayment for the old,
young not only increase the cost of the low risks, but also of the so this is the group positive DRS will be targeted at.25 Insurer k’s
high risks and therefore translate into a smaller reduction of the objective in this case is given by (13) with RAO replaced by RAO − ak
rO
cost difference between the risk types. As is to be expected, for a and with P k containing g(ak ) and g(aj ) as given in (5). The posi-
given level of ak the reduction of the distortion increases in the level
tive equilibrium level of ak then enters condition (14), where again
of correlation (i.e. in ı). RAO has to be replaced by (RAO − ak ).26 The efficient benefit level is
Result 2. In the pooling equilibrium, the distortion of the benefit therefore implemented with
level decreases in the level of positive DRS of a signal that is correlated (1 − ) H L k
with low risk if cost for DRS is individual-specific. A higher level of RAO = (p − p )m + a . (16)
ı
correlation (a higher ı) increases the effect of a given level of DRS on
the distortion of the benefit level. Because part of the overpayment for the old is spent on positive
DRS, the optimal overpayment for the old has to be raised by exactly
these expenditures, so that the net difference in payments (includ-
3.3. Implications of DRS on optimal risk adjustment in the ing ak ) equals the amount necessary to eliminate IRS as given in
pooling equilibrium (15).
We now discuss the implications of the interaction of direct and Result 3. With individual-specific cost, if there is positive DRS
indirect risk selection for optimal risk adjustment. As shown by regarding a signal that is used for risk adjustment and that is correlated
Glazer and McGuire (2000), if a regulator does not observe indi- with high risk, the optimal overpayment of that signal to eliminate IRS
viduals’ risk type, but only a signal that is correlated with risk type has to be increased by the expenditures for DRS.
(like age), it is still feasible to eliminate the distortion of the benefit
This result shows that the formula for optimal risk adjustment
package by overpaying for a signal that indicates a high risk, and
derived by Glazer and McGuire (2000) is not invalidated by DRS:
underpaying for a signal that indicates a low risk. In Section 3.3.1,
there is still overpayment for a signal that is correlated with high
we first show how to implement optimal risk adjustment in our
risk, and underpayment for a signal correlated with low risk. Also,
setting with two risk types and two age groups if there is no DRS;
DRS does not invalidate their claim that optimal risk adjustment
we then determine how the optimal payments have to be modified
can implement the efficient benefit level. However, the formula to
if there is DRS in Section 3.3.2.
implement the efficient benefit level has to be modified and include
insurers’ expenditures on DRS if cost is individual specific. Whether
3.3.1. Optimal risk adjustment without DRS these expenditures are negligible or significant is an empirical mat-
With risk adjustment, each insurer receives a payment of RAO ter, but, e.g., the findings of Starc (2014), who reports that insurers
for each insured that is old; these payments are financed by a risk spend a large part of potential profits on marketing and insurance
adjustment fee RAF which each insurer has to pay for each insured brokers, indicate that these expenditures may be substantial.
(including the old). The balanced budget constraint requires this fee
to be RAF = (1 − )RAO . The insurer’s objective with risk adjustment 3.4. More than two risk types
is then given by
To keep the notation simple when deriving the results, so far
k
= rY P
k
( k
− RAF ) + rO P
k
( k
− RAF + RAO ). (13)
rY r rO r we have considered the case of only two risk types. The results,
r r however, also hold for an arbitrary number of risk types. Let the
i i i
4.1. The separating equilibrium without DRS high risks, this difference will be negative, for others somewhat,
or even considerably, larger than zero. There is therefore no single
In Lorenz (2013), it is shown that the separating equilibrium incentive compatibility constraint (ICC), but an individual-specific
arises for a high level of competition (i.e. a low level of ) and that ICC for each of the high risks depending on this difference.
it is similar, but not identical, to the Rothschild–Stiglitz equilibrium One way to capture these individual-specific ICCs is by a dis-
under perfect competition.28 Both equilibria can be found in Fig. 1. tribution function denoting the share of the high risks for which
Under perfect competition, the separating equilibrium consists the ICC is violated; formulated in this way, this distribution func-
of contract B, chosen by the high risks, and contract A1 , chosen by tion coincides with insurer A’s market share among the high risks
straint is satisfied); in addition, both iso-profit lines pass through (with the darkness of the shaded area as a measure of this density,
the origin and are in this case zero-profit lines.29 The number of which is bell-shaped). Above the shaded area, the ICC is satisfied for
insurers offering the contract for the low risks (insurers of type A) all the high risks, so P A A
30
However, Olivella and Vera-Hernandez (2010) have shown that the smallest high risks.
number of insurers supporting this equilibrium is n = 3, with nB = 1 and nA = 2. Insurer A, when offering A2 , would therefore not be chosen by
any of the high risks. However, he can still increase his profit by
moving along the indifference curve of the low risks to the right,
which would have two effects31 : on the one hand, it increases profit
per low risk because the indifference curve has a larger slope than
pL ; on the other hand, it also increases the number of the high risks
31
We consider a movement along the indifference curve of the low risks because
with more than one insurer of type A, this would leave this insurer’s market share
among the low risks unaffected.
88 N. Lorenz
N. Lorenz
/ Journal
/ Journal
of Health
of Health
Economics
Economics
42 (2015)
42 (2015)
81–89
81–89 88
equilibrium will be a contract like A4 . Since positive DRS reduces by the high risks by f(b A), VHA can be increased by exactly that
the attractiveness of the contract offered by insurers of type A for amount without altering the number of the high risks choosing
the high risks in the premium-dimension, they can increase the insurer A. This allows insurer A to increase his benefit level, see
attractiveness of their contract in the benefit level-dimension. Fig. 2(b). There, AL5 denotes a contract offered by insurer A as
AH
perceived by the low risks, while 5 denotes the same contract as
perceived by the high risks. Compared to AL5, AH5 is shifted upwards
by f(bA ), the utility decrease of negative DRS (measured in mone-
32
See Appendix C.1; for a more detailed derivation of this equilibrium, see Lorenz tary terms). The larger bA , the larger f(bA ), and therefore the larger
(2013).
VA
mA can be without increasing the share of high risks choosing
33 Because contract B is far above the indifference curve I associated with con-
L
of the low risks to choose this contract. Because insurers of type B cannot increase
tract A3 , for the low risks there is a huge difference in utility between A3 and B. Any their market share among the low risks (which remains at zero), they will refrain
moderate increase in the (perceived) utility of contract B due to positive DRS of insur- from positive DRS.
ers of type B will reduce this difference only to a small degree and not induce any
89 N. Lorenz
N. Lorenz
/ Journal
/ Journal
of Health
of Health
Economics
Economics
42 (2015)
42 (2015)
81–89
81–89 89
DRS influences the distortion caused by IRS even if cost is non- Appendix A.
individual-specific.
A.1. The pooling equilibrium without DRS
Result 6. In the separating equilibrium, negative DRS against the
high risks reduces the distortion of the benefit level of the low risks Using the property that the derivative of P k k
The main mechanism in this last case is therefore different from all
the other cases we have considered. Here, IRS is substituted by DRS, ∂ k Pk k k k
L (1 − PL ) PH (1 − PH )
k
+ k k k
while in all the other cases, IRS affects risk type specific profits and = − PL + (1 − ) − + PH
∂ Rk L H
thereby the gains from IRS.
= 0 (17)
∂ k
L L (1 − PL ) ∗ k k
= p v − PL
With optimal risk adjustment, profit per high risk equals ∂mk L
In this paper, the interaction of direct and indirect risk selec- The FOC of (9) with respect to ak is
tion (DRS and IRS) has been analyzed. It has been shown that DRS,
Pk k
on the difference between these profits, positive DRS reduces the (17) for Rk and substituting in (18) then yields (10).
distortion of the benefit level, while negative DRS increases it. In
the separating equilibrium, DRS can act as a substitute for IRS and A.3. Positive DRS of a signal that is correlated with risk type
thereby reduce the distortion of the benefit level. In addition, it
has been shown that the effects of DRS on type- and signal-specific The FOC of (11) with respect to ak defines a positive equilibrium
profits also have an influence on the formula for optimal risk adjust- level of ak . Solving the FOC with respect to Rk for Rk = (n /(n −
ment: The over- and underpayments have to be inflated by insurers’ 1)) + pmk + ak and inserting in the FOC with respect to mk yields
expenditures on positive DRS and reduced by their expenditures on (12).
negative DRS.
We have derived these results for a setting where negative DRS
occurs during the application process, but the main mechanisms A.4. Optimal risk adjustment without DRS
should also hold if it is targeted at the high risks who already hold
a contract with the insurer. If an insurer attracts a larger share of Solving the FOC of (13) with respect to Rk for Rk = (n /(n − 1)) +
91 N. Lorenz
N. Lorenz
/ Journal
/ Journal
of Health
of Health
Economics
Economics
42 (2015)
42 (2015)
81–89
81–89 91
the high risks, this will increase the cost of negative DRS, even if the pmk + RAF − (1 − )RAO and substituting in the FOC with respect to
activity of risk selection and the cost associated with it occur only mk (using the definitions of rs as given in Table 2) yields (14).
later (when the insurer tries to induce the high risks to switch to
another insurer). With negative DRS, high risks are more expensive A.5. Optimal risk adjustment with DRS
than without DRS; anticipating these additional cost, insurers will
therefore not make their contract as attractive for this risk group The derivation of condition (16) is identical to the derivation of
as they would without DRS. Therefore, negative DRS will increase condition (14) described in Appendix A.4; the only difference is that
the distortion in such a setting as well. RAO has to be replaced by (RAO − ak ).
References Lorenz, N., 2014. The interaction of direct and indirect risk selection. Trier University
Working Paper 12/14. http://ideas.repec.org/p/trr/wpaper/201412.html
Bauhoff, S., 2012. Do health plans risk-select? An audit study on Germany’s McGuire, T.G., Glazer, J., Newhouse, J.P., Normand, S.-L., Shi, J., 2013. Integrating risk
Social Health Insurance. Journal of Public Economics 96 (9–10), adjustment and enrollee premiums in health plan payment. Journal of Health
750–759. Economics 32 (6), 1263–1277.
Bijlsma, M., Boone, J., Zwart, G., 2011. Competition leverage: how the demand side Newhouse, J.P., Price, M., Huang, J., McWilliams, J.M., Hsu, J., 2012. Steps to reduce
affects optimal risk adjustment. TILEC Discussion Paper, 2011-039. favorable risk selection in Medicare Advantage largely succeeded, boding well
Breyer, F., Bundorf, M.K., Pauly, M.V., 2011. Health care spending risk, for Health Insurance Exchanges. Health Affairs 31 (12), 2618–2628.
health insurance, and payment to health plans. In: Pauly, M.V., McGuire, Olivella, P., Vera-Hernandez, M., 2010. How complex are the contracts offered by
T.G., Barros, P.P. (Eds.), Handbook of Health Economics, vol. 2. Elsevier, health plans? SERIEs 1 (3), 305–323.
Amsterdam, pp. 691–762. Rothschild, M., Stiglitz, J., 1976. Equilibrium in competitive insurance markets: an
Brown, J., Duggan, M., Kuziemko, I., Woolston, W., 2011. How does risk selection essay on the economics of imperfect information. Quarterly Journal of Eco-
respond to risk adjustment? Evidence from the Medicare Advantage Program. nomics 90 (4), 629–649.
NBER Working Paper 16977. Shen, Y., Ellis, R.P., 2002. How profitable is risk selection? A comparison of four risk
Cao, Z., McGuire, T.G., 2003. Service-level selection by HMOs in Medicare. Journal of adjustment models. Health Economics 11 (2), 165–174.
Health Economics 22 (6), 915–931. Shi, J., 2013. Efficiency in plan choice with risk adjustment and premium discrimi-
Eggleston, K., 2000. Risk selection and optimal health insurance-provider payment nation in Health Insurance Exchanges. Unpublished.
systems. Journal of Risk and Insurance 67 (2), 173–196. Spiegel, 2011. Fragwürdige Beratung: Krankenversicherer wimmelt Senioren ab.
Ellis, R.P., McGuire, T.G., 2007. Predictability and predictiveness of health care spend- Spiegel, May 9, 2011. http://www.spiegel.de/wirtschaft/soziales/fragwuerdige-
ing. Journal of Health Economics 26 (1), 25–48. beratung-krankenversicherer-wimmelt-senioren-ab-a-761384.html
Frank, R.G., Glazer, J., McGuire, T.G., 2000. Measuring adverse selection in managed Starc, A., 2014. Insurer pricing and consumer welfare: evidence from Medigap. RAND
health care. Journal of Health Economics 19 (6), 829–854. Journal of Economics 45 (1), 198–220.
Glazer, J., McGuire, T.G., 2000. Optimal risk adjustment in markets with adverse Train, K.E., 2009. Discrete Choice Methods with Simulation, 2nd ed. Cambridge Uni-
selection: an application to managed care. American Economic Review 90 (4), versity Press, New York.
1055–1071. van de Ven, W.P., Ellis, R., 2000. Risk adjustment in competitive health plan markets.
Glazer, J., McGuire, T.G., 2002. Setting health plan premiums to ensure efficient qual- In: Culyer, A.J., Newhouse, J.P. (Eds.), Handbook of Health Economics. Elsevier,
ity in health care: minimum variance optimal risk adjustment. Journal of Public Amsterdam, pp. 755–845.
Economics 84 (2), 153–173. van de Ven, W.P., van Vliet, R.C., 1992. How can we prevent cream skimming in
Jack, W., 2006. Optimal risk adjustment with adverse selection and spatial compe- a competitive health insurance market? The great challenge for the 90’s. In:
tition. Journal of Health Economics 25 (5), 908–926. Zweifel, P., Frech, H. (Eds.), Healtheconomics Worldwide. Kluwer Academic Pub-
Lorenz, N., 2013. Adverse selection and risk adjustment under imperfect competi- lishers, Dordrecht, Boston, London, pp. 23–46.
tion. Trier University Working Paper 5/13. http://ideas.repec.org/p/trr/wpaper/ Zweifel, P., Breyer, F., Kifmann, M., 2009. Health Economics, 2nd ed. Springer, Berlin,
201305.html Heidelberg.
Journal of Health Economics 42 (2015) 90–103
a r t i c l e i n f o a b s t r a c t
Article history: This study explores the impact of environmental regulations in China on infant mortality. In 1998, the
Received 27 January 2014 Chinese government imposed stringent air pollution regulations, in one of the first large-scale regulatory
Received in revised form 24 February 2015 attempts in a developing country. We find that the infant mortality rate fell by 20 percent in the treatment
Accepted 26 February 2015
cities designated as “Two Control Zones.” The greatest reduction in mortality occurred during the neonatal
Available online 6 March 2015
period, highlighting an important pathophysiologic mechanism, and was largest among infants born
to mothers with low levels of education. The finding is robust to various alternative hypotheses and
JEL classification:
specifications. Further, a falsification test using deaths from causes unrelated to air pollution supports
Q56
I18
these findings.
Q53 © 2015 Elsevier B.V. All rights reserved.
I12
O13
Keywords:
Infant mortality Air
pollution Environmental
regulation China
http://dx.doi.org/10.1016/j.jhealeco.2015.02.004
0167-6296/© 2015 Elsevier B.V. All rights reserved.
S. Tanaka / Journal of Health Economics 42 (2015) 90–103 91
environment, wherein the intensity of exposure to the regulations (2011) examines regulations on air pollution and water pollution
can be defined by the TCZ regulatory status, and we are able to com- in India since 1987. They find these regulations efficacious in reduc-
pare changes in infant mortality rate (IMR) before and after the pol- ing air pollution, but such reductions led to modest and statistically
icy reform, between the cities assigned and not assigned as the TCZ. insignificant reductions in infant mortality. Our study provides
To implement the analysis, we draw IMR data from the Chi- an interesting contrast, finding that infant mortality significantly
nese Disease Surveillance Points (DSP) system that collected birth responded to the environmental regulation. Further, the regulation
and death registrations for 145 nationally representative sites from we focus on targeted coal for energy generation, which is the major
1991 through 2000. IMR, defined as the number of infant deaths contributor to air pollution in many other developing countries as
under age one per 1000 live births in a given year, is available for well, whereas Greenstone and Hanna (2011) focus on vehicular
each DSP site by year level, linked with detailed information on pollution.3 , 4 The findings in this study accordingly present relevant
estimates for the effect of environmental regulations in develop-
birth characteristics and parental attributes. We match this dataset
ing countries implementing similar policies on coal in the power
to the TCZ regulatory status assigned to individual cities, based on
industry.
the governmental report, and thereby estimate the treatment effect
Second, the present study contributes to our understanding
of the regulations.
of the relationship between air pollution and infant mortality at
We find that the air pollution regulations led to significant greater concentration levels. Previous evidence is predominantly
reductions in infant mortality. The difference-in-differences esti- derived from the United States or other developed countries, where
mates suggest that the regulations have led to 3.29 fewer infant pollution is relatively low.5 Since we know little about the shape of
deaths per 1000 live births than would have occurred in the absence the dose–response relationship, it is consequently difficult to pre-
of the regulations. This corresponds to a 20 percent reduction dict the marginal impact of pollution reduction in the presence of
in IMR. 63 percent of the reduction in infant mortality occurred non-linear relationship.6 Air pollution in China is one of the highest
during the neonatal period, highlighting an important pathophysi- in the world. Its total suspended particulates (TSP) level in 1995 was
ologic mechanism, and the greatest reduction of mortality occurred four times higher than the WHO standards, and four times higher
among children born to mothers with low educational levels. than the level in the United States in 1970, when the Clean Air
A major methodological challenge, however, is that the TCZ des- Act was amended, as examined in Chay and Greenstone (2003b).
ignation rule may not be orthogonal to unobserved characteristics Thus, estimates in China provide compelling evidence applicable to
that contribute to reductions in air pollution and infant mortal- the distinctive context of developing countries where air pollution
ity. The present study conducts a number of robustness checks levels are relatively high.
and a falsification test to address this issue. First, we confirm that Third, there is extensive literature showing differential patterns
the TCZ status has little association with changes in observable according to socioeconomic status, yet it is still an open question as
covariates, assuring that there is no systematic difference in con- to whether air pollution also exhibits differential impact on infant
current trends in observable characteristics between the TCZ and mortality (Currie and Hyson, 1999; Case et al., 2002; Jayachandran,
non-TCZ cities. Although this is not a direct test of the exclusion 2009). While infants in poor countries are considered to be the
restriction, since it requires that TCZ status not be correlated with most susceptible to the effect of pollution, not only because of
trends in unobservable factors, this result leads us to believe that high pollution levels but also because families lack the resources
the treatment effect is less likely to be confounded by differential or knowledge necessary to avoid exposure, the impacts may be
trends in unobservable factors as well (Altonji et al., 2005). Second, small if air pollution does not have first-order effect on them.7 Thus,
the regression is also directly adjusted for differential pre-existing the present study helps identify vulnerable population in designing
trends in mortality, yet the estimates are essentially unaffected. policies.
Lastly, the policy had no impact on infant deaths due to acciden- The current research design has several advantages over previ-
tal causes. The absence of a causal mechanism linking air pollution ous studies. First, this study focuses on infants, not only because
to these causes of death serves as falsification evidence, suggest-
ing that differences in access to or quality of medical services and
technologies cannot be the sources of bias. Overall, there is no evi-
system. Second, even if pollution is successfully reduced to some extent, infant mor-
dence that the estimates are driven by inappropriate identification tality may not fall if a concave relationship between mortality and pollution level
assumptions, leading us to believe that the treatment effect based leads to low marginal pollution effect at high concentration levels. Third, magni-
on the TCZ status is indeed not spurious but causal. tudes of impacts in reducing air pollution should be greater if people have limited
access to medical services, initially have lower health status, and/or have limited
This study makes three major contributions to the existing liter-
knowledge in avoiding pollution.
ature. First, by exploiting regulation-induced changes in air quality, 3 Since the most heavily affected industry is the power industry, which was a
it addresses a policy-relevant question: to what extent do envi- driving force behind China’s rapid economic growth, the findings in this study
ronmental regulations in developing countries lead to reductions accordingly highlight the important tradeoffs among economic growth, environ-
mental quality, and human health. See Tanaka et al. (2014) for the impact of the
in infant mortality? Several prior studies have focused on vari-
environmental regulation on industrial performance.
ation in air quality induced by recession (Chay and Greenstone, 4 A relatively smaller-scale air quality regulatory regime, targeting a different
2003a), weekly fluctuations (Currie and Neidell, 2005), wildfires industry, in another developing country, can be found in the Indian transportation
(Jayachandran, 2009), or wind directions (Luechinger, 2014). Chay sector, which was mandated to use compressed national gas vehicles in Delhi during
working hours. Kumar and Foster (2007) show its effect on respiratory health.
and Greenstone (2003b) provide compelling evidence for the link- 5 Examining the pollution effect at low levels, especially levels lower than what
age between the Clean Air Act of 1970 and infant mortality in the is often considered to be the standard, is also an interesting question in itself.
U.S. It remains to be determined, however, whether, and how effec- Currie and Neidell (2005) find that CO has significant impact on infant mortality
tively, environmental regulations can improve human health in in California over the 1990s at relatively low levels.
6 For example, Arceo et al. (2012) show a non-linear relationship between CO
developing countries.2 A recent study by Greenstone and Hanna
and health in Mexico. Evidence in developed countries may understate the impact
of pollution reduction in developing countries if the marginal impact of pollution is
higher at greater concentration levels.
7 For example, people with poor health tend to stay indoor with little exposure
2
Extrapolating evidence in developed countries to developing countries is dif- to pollution. Children from rich households, who tend to have better health, may be
ficult for a number of reasons. First and foremost importantly, we do not know exposed more to pollution if they are more likely to have outside activities.
whether environmental regulations in developing countries had any impact in
reducing pollution due to weaker implemental mechanisms and an enforcement
92 S. Tanaka / Journal of Health Economics 42 (2015) 90–103
they are particularly vulnerable to air pollution due to their weak environmental regulatory policies, the first version of which
respiratory system, but because focusing on infant mortality mit- was enacted in 1987, known as the Air Pollution Prevention and
igates complicating factors associated with adult mortality. For Control Law (APPCL). This original law, however, failed to reduce
example, adult deaths correlate more closely to chronic disease air pollution, mainly because it excluded the power sector, the
conditions than to acute changes in air quality. In addition, adults major contributor of SO2 emissions (Qian and Zhang, 1998). Even
may migrate into less polluted areas. Addressing infant mortality worse, SO2 emissions continued to surge, and areas affected by
circumvents these issues, if not completely, because it is relatively acid rain expanded.
less difficult to identify causes of death during the first year of APPCL was consequently amended in 1995. The major part of
birth, and because migration rates are low for pregnant women and the amendment was to include a section to regulate pollutant emis-
infants. Lastly, China is not only one of the most polluted countries sions and coal combustion, particularly regarding the usage of high
but also one of the first developing countries to regulate air pollu- sulfur-content coal, at power plants (Hao et al., 2007). Although the
tion on such a large-scale. It is evident that China serves as a rare 1995 APPCL still had a weak enforcement mechanism and limited
research environment in which to assess the impact of environ- efficacy, a prominent feature of the amendment was to propose a
mental regulations at greater concentration levels. future regional strategy, which would identify priority regions to
The rest of the paper is organized as follows. Section 2 provides improve air quality and prevent the spread of acid rain.9 , 10
the historical background on air pollution and national air pollution This was officially approved and implemented as the Two Con-
regulations in China. Section 3 describes the data and the descrip- trol Zone (TCZ) policy in January 1998 (State Council, 1998). This
tive statistics. Section 4 presents the econometric framework and legislation designated prefectures exceeding nationally mandated
its validity, and Section 5 presents empirical results. Section 6 con- thresholds as either acid rain control zone or SO2 pollution control
cludes. zone.11 Based on the records in preceding years12 , prefectures were
designated as SO2 pollution control zone if;
2. Background on air pollution and regulations in China
• Average annual ambient SO2 concentrations exceeded the Class
2.1. Brief history II standard,13
• Daily average concentrations exceeded the Class III standard, or
China is infamous for its air pollution, due to emissions from a • High SO2 emissions were recorded.14
power sector that relies heavily on coal to generate electric power.
As the world’s largest coal producer, China possesses abundant Alternatively, prefectures were designated as acid rain control
and relatively cheap coal, which constitutes the country’s primary zone if
energy resource endowment, accounting for 75.5 percent of total
energy production in 1995 (National Bureau of Statistics of China, • Average annual pH values for precipitation were less than or
2006). However, coal generally emits more pollutants than other equal to 4.5,
fossil fuels. As China underwent rapid economic growth, total SO2 • Sulfate deposition was greater than the critical load, or
emissions increased from 18.4 million tons in 1990 to 23.7 million • High SO2 emissions were recorded.
tons in 1995, and the ambient air pollution rose to levels detri-
mental to human health (State Environmental Protection Agency In total, 175 prefectures out of 333 prefectures across 27
[SEPA], 1996). provinces were designated as TCZs. They accounted for 40.6 per-
cent of its population, 62.4 percent of GDP, and 58.9 percent of
Fig. 1 illustrates the world distribution of TSP (Panel A) and SO2
the total SO2 emissions in 1995 (Hao et al., 2001). The SO2 pollu-
(Panel B) concentration levels in 1995. The TSP level in Beijing, the
tion control zone was concentrated in the north due to high SO2
capital city of China, was 377 g/m3 , almost four times higher than
emissions for heating,15 whereas the acid rain control zones were
the WHO guideline of 90 g/m3 , and its SO2 concentration level was
primarily in the south, where heat, humidity, and solar radiation
90 g/m3 , almost double the WHO guideline of 50 g/m3 (WHO,
combine to create high atmospheric acidity. Hence, acid rain in the
2002). SO2 is also an important precursor of acid rain. From the
1980s to the mid-1990s, the area of China experiencing acid rain
the most hazardous because, when inhaled, these particulate matters or gas can gies in a particular region or for a set of time period, attempting to demonstrate their
penetrate deep into the lungs and interfere with internal gas exchange. Further, effectiveness before expanding their implementation to the entire nation.
11 In this sense, the legislation can be considered to be parallel to the attainment
Laden et al. (2000) find that fine particles emitted from combustion sources (i.e.,
motor vehicles or coal combustion) have a stronger association with mortality than and nonattainment county designation by the Clean Air Acts in the United States.
12
those from non-combustion sources. Alternatively, SO2 becomes sulfuric acid when The original document does not specify exactly which years of records they refer
it interacts with water, which is the main component of acid rain that may have a to.
direct or indirect impact on health. Yet, epidemiological evidence of the impact of 13 According to the Chinese National Ambient Air Quality Standards (CNAAQS)
SO2 on mortality in developed countries is somewhat mixed. While Mendelsohn and for SO2 , Class I standard designates an annual average concentration level not
Orcutt (1979) show close associations between the two, SO2 is also considered a less exceeding 20 g/m 3 , Class II ranges 20 g/m3 < SO2 < 60 g/m 3 , and Class III ranges
important determinant of mortality (Schwartz and Marcus, 1990; Nielsen and Ho, 60 g/m3 < SO2 < 100 g/m 3 . Cities should meet Class II, which is considered to be
2007). Hedley et al. (2002) is one of the few intervention studies that investigates less harmful.
14
changes in SO2 caused by an overnight restriction on all power plants and road The original document does not specify the levels of SO2 emissions that are
considered to be “high.”
vehicles in Hong Kong using fuel oil with a sulfur content of more than 0.5 percent. 15
See Almond et al. (2009) for the impact of heating policy, which created a
They found that the intervention resulted in an immediate reduction in ambient
discrepancy in air quality north and south of the Huai River.
SO2 concentrations and a reduction in death rates particularly due to respiratory