Professional Documents
Culture Documents
Coalitions 1721
Coalitions 1721
Demange, G., and R. Guesnerie. 2001. On coalitional stability the Owen set of linear production processes. G a m e s
of anonymous interim mechanisms. E c o n o m i c a n d E c o n o m i c B e h a v i o u r 32: 139-156.
T h e o r y 18: 367-389. Gillies, D.B. 1959. Solutions to general non-zero-sum games.
Diamantoudi, E., and L. Xue. 2003. Farsighted stability in In C o n t r i b u t i o n s t o t h e t h e o r y o f g a m e s ,
hedonic games. S o c i a l C h o i c e a n d W e l f a r e 21: vol. 4, ed. A.W. Tucker and R.D. Luce. Princeton:
39-61. Princeton University Press.
Dreze, J., M. Le Breton, and S. Weber. 2006. Rawlsian pricing Granot, D. 1986. A generalized linear production model: A
of access to public facilities: A unidimensional illustration. unifying model. M a t h e m a t i c a l P r o g r a m m i n g
J o u r n a l o f E c o n o m i c T h e o r y 136: 759-766. 34: 212-222.
Dubey, P., and A. Neyman. 1984. Payoffs in nonatomic Gravel, N., and S. Thoron. 2007. Does endogenous formation
economies: An axiomatic approach. E c o n o m e t r i c a of jurisdictions lead to wealth-stratification? J o u r n a l
52: 1129-1150. o f E c o n o m i c T h e o r y 132: 569-583.
Durlauf, S. 2002. The memberships theoiy of poverty: The Greenberg, J. 1995. Coalition structures. In H a n d b o o k o f
role of group affiliations in determining socioeconomic game theoiy with economic applications,
outcomes. In U n d e r s t a n d i n g Poverty in ed. R.- J. Aumann and S. Hart. Amsterdam: North-
A m e r i c a , ed. S. Danziger and R. Haveman. Holland.
Cambridge, MA: Harvard University Press. Greenberg, J., and S. Weber. 1986. Strong Tiebout equilib-
Dutta, B., A. van den Nouweland, and S. Tijs. 1998. Link rium under restricted preferences domain. J o u r n a l o f
formation in cooperative situations. I n t e r n a t i o n a l E c o n o m i c T h e o r y 38: 101-117.
J o u r n a l o f G a m e T h e o r y 27: 245-256. Greenberg, J., and S. Weber. 1993. Stable coalition structures
Dutta, B., and R. Vohra. 2005. Incomplete information, with a unidimensional set of alternatives. J o u r n a l o f
credibility and the core. M a t h e m a t i c a l S o c i a l E c o n o m i c T h e o r y 60: 62-82.
S c i e n c e s 50: 148-165. Hahn, G., and N. Yannelis. 1997. Efficiency and incentive
Echenique, F., and J. Oviedo. 2004. Core many-to-one compatibility in differential information economies.
matchings by fixed-point methods. J o u r n a l o f E c o - E c o n o m i c T h e o r y 10: 383-411.
n o m i c T h e o r y 115: 358-376. Haimanko, O., M. Le Breton, and S. Weber. 2004. Voluntary
Economides, N.S. 1986. Non-cooperative formation of communities for the provision of public
e q u i l i b r i u m c o a l i t i o n s t r u c t u r e s , University projects. J o u r n a l o f E c o n o m i c T h e o r y 115: 1-
Discussion Paper No. 273, Columbia University. 34.
Einy, E., D. Moreno, and D. Monderer. 1999. On the least core Hammond, P., M. Kaneko, and M. Wooders. 1989. Con-
and the Mas-Colell bargaining set G a m e s a n d tinuum economies with finite coalitions: Core, equilibria,
E c o n o m i c B e h a v i o r 28: 181-188. and widespread externalities. Journal of
Einy, E., D. Moreno, and B. Shitovitz. 2000. On the core of an E c o n o m i c T h e o r y 49: 113-134.
economy with differential information. Journal o f Harsanyi, J. 1974. An equilibrium point interpretation of
E c o n o m i c T h e o r y 94: 262-270. stable sets and a proposed alternative definition. M a n -
Ellickson, B., B. Grodal, S. Scotchmer, and W. Zame. 2001. a g e m e n t S c i e n c e 20: 1472-1495.
Clubs and the market: Large finite economies. J o u r n a l Hart, S., and M. Kurz. 1983. Endogenous formation of
o f E c o n o m i c T h e o r y 101: 40-77. coalitions. Econometrica 51: 1047-1064.
Engelbrecht-Wiggans, R., andD. Granot. 1985. On market Hart, S., and A. Mas-Colell. 1996. Bargaining and value.
prices in linear production games. Mathematical Pro- Econometrica 64: 357-380.
gramming 32: 366-370. Herings, J-J., A. Mauleon, and V. Vannetelbosch. 2006.
Epstein, L., and M. Marinacci. 2001. The core of large Farsightedly stable networks. RM/06/041, University of
differentiable TU games. J o u r n a l o f E c o n o m i c Maastricht.
T h e o i y 100: 235-273. Herings, J.-J., G. van der Laan, and D. Talman. 2007. The
Farrell, J., and S. Scotchmer. 1988. Partnerships. Quarterly socially stable core in structured transferable utility
Journal of Economics 103: 279-297. games. Games and Economic Behavior 59: 85-104.
Forges, F., J.-F. Mertens, and R. Vohra. 2002a. The ex ante Holmstrom, B., and R. Myerson. 1983. Efficient and durable
incentive compatible core in the absence of wealth effects decision rules with incomplete information. Econometrica
.Econometrica 70: 1865-1892. 51: 1799-1819.
Forges, F., E. Minelli, and R. Vohra. 2002b. Incentives and the Ichiishi, T., and A. Idzik. 1996. Bayesian cooperative choice
core of an exchange economy: A survey. Journal of strategies. International Journal of Game Theory 2E. 455-
Mathematical Economics 38: 1-41. 473.
Gale, D., and L.S. Shapley. 1962. College admissions and the Iehle, V. 2006. The core partition of a hedonic game, Working
stability of marriage. A m e r i c a n Mathematical paper, Maison des Sciences Economiques, Universite
Monthly 69: 9-15. Pantheon-Sorbonne.
Garratt, R., and C.-Z. Qin. 1997. On a market for coalitions Izquierdo, J.M., and C. Rafels. 2001. Average monotonic
with indivisible agents and lotteries. Journal of Economic cooperative games. Games and Economic B e h a v i o r 36:
Theory 77: 81-101. 174-192.
Gellekom, J.R.G., J.A.M. Potters, J.H. Reijnierse, M.C. Engel, Jackson, M., and A. van den Nouweland. 2005. Strongly stable
and S.H. Tijs. 2000. Characterization of networks. Games and E c o n o m i c B e h a v i o r 51: 420-
1722 Coalitions
Jackson, A., and A. Watts. 2002. The evolution of social and Kovalenkov, A., and VI. Wooders. 2001b. An exact bound on
economic networks. Journal of Economic Theory 106:265 epsilon for non-emptiness of the epsilon-cores of games.
295. Mathematics of Operations Research 26: 654 678.
Jackson, A., and A. Wolinsky. 1996. A strategic model of Kovalenkov, A., and M. Wooders. 2003. Approximate cores of
social and economic networks. Journal of Economic games and economies with clubs. Journal of Economic
Theory 7 44 74. theory 110: 87-120.
Jordan, J.S. 2006. Pillage and property. Journal of Economic Kovalenkov, A., and M. Wooders. 2005a. Many sided
Theory 131: 26—14. matchings, clubs and market games. In Group formation
Kajii, A. 1992. A generalization of Scarf’s theorem: An '/-core in economics: networks, clubs and
existence theorem without transitivity or completeness. coalitions, ed. G. Demange and M. Wooders. Cambridge:
Journal of Economic Theory 56: 194 205. Cambridge University Press.
Kalai, E., A. Postlewaite, and .1. Roberts. 1979. A group Kovalenkov, A., and VI. Wooders. 2005b. Laws of scarcity for
incentive compatible mechanism yielding core allocations. a finite game - Exact bounds on estimations. Economic
Journal of Economic Theory 20: 13 22. Theory 26: 383-396.
Kalai, E., and D. Schmeidler. 1977. An admissible set Kovalenkov, A., and M. Wooders. 2006. Comparative statics
occurring in various bargaining situations. Journal of and laws of scarcity for games. In Rationality and
Economic Theory 14: 402-411. equilibrium; a symposium in honor of Marcel K. Richter,
Kalai, E., and E. Zemel. 1982. Generalized network problems ed. C.D. Aliprantis et al. Berlin: Springer- Verlag.
yielding totally balanced games. Operations Research 30: Kurz, .VI. 1989. Game theory and public economics. In
998 1008. Handbook of game theory, vol. 2. ed. R.J. Aumann and S.
Kaneko, .VI., and VI. Wooders. 1982. Cores of partitioning Hart. Amsterdam: North-Holland.
games. Mathematical Social Sciences 3: 313-327. Le Breton, .VI., I. Ortuno-Ortin, and S. Weber. 2006. Gamson
Kaneko, VI., and VI. Wooders. 1986. The core of a game with s Law and hedonic games, Working Paper No. 420, IDEI,
a continuum of players and finite coalitions: The model Toulouse.
and some results. Mathematical Social Sciences 12: 105 Le Breton, .VI., G. Owen, and S. Weber. 1992. Strongly
137. balanced cooperative games. International Journal of
Kaneko, M., and VI. Wooders. 1989. The core of a continuum Game Theory 20: 419 427.
economy with widespread externalities and finite Le Breton, .VI., and S. Weber 2001. The art of making
coalitions: from finite to continuum economics. Journal of everybody happy: How to prevent a secession. Working
Economic Theory 49: 135 168. Paper No. 01 176. International Monetary Frmd.
Kaneko, M., and M. Wooders. 1994. Widespread externalities Lee, D., and O. Volij. 2000. The core of economies with
and perfectly competitive markets. In Imperfections and asymmetric information: An axiomatic approach. Journal
behavior in economic organizations, ed. R. Gilles and P. of Mathematical Economics 38: 43-63.
Ruys. Boston: Kluwer Academic Publishers. Legros, P., and A.F. Newman. 2002. Monotone matching in
Kaneko, M., and M. Wooders. 1996. The nonemptiness of perfect and imperfect worlds. Review of Economic
the /core of a game without side payments. International Studies 69: 925-942.
Journal of Game Theory 25: 245 258. Legros, P., and A.F. Newman. 1996. Wealth effects, distri-
Keiding, H. 1976. Cores and equilibria in an infinite economy. bution, and the theory of organization. Journal of Eco-
In (’(imputing equilibrium: How and nomic Theory 70: 312-341.
why, ed. J. I.os and M.W. I.os. Amsterdam: North- Logistics Research Project. 1957. Reports of three informal
Holland. conferences on the theory of games. Department of
Koczy, A., and I.. Lauwers. 2004. The coalition structure core Mathematics, Princeton University. Accession No.
is accessible. Games and Economic Behavior 48: 86-93. (OCoLC)ocm39726894. Conferences held 20-1 March,
Konishi, H., M. I.e Breton, and S. Weber. 1997. Equilibrium 1953; 31 January-1 February 1955, and
in a model with partial rivalry. Journal of Economic 11- 12 March, 1957, Princeton.
Theory 72: 225 237. Lucas, W., andR. Thrall. 1963. n-person games in partition
Konishi, H., M. Le Breton, and S. Weber. 1998. Equilibrium function form. Naval Research Logistics Quarterly 10:
in a finite local public goods economy. Journal of 281-298.
Economic Theory 79: 224 244. Maschler, M., and G. Owen. 1992. The consistent Shapley
Konishi, H., and D. Ray. 2003. Coalition formation as a value for games without side payments. In Rational
dynamic process. Journal of Economic Theory 110: interaction, ed. R. Selten. Berlin: Springer-Verlag.
1- 41. Maschler, M., and B. Peleg. 1966. A characterization, exis-
Kovalenkov, A., and M. Wooders. 2001a. Epsilon cores of tence proof and dimensions bounds for the kernel of a
games with limited side payments; nonemptiness and game. Pacific Journal of Mathematics 18: 289-328.
equal treatment. Games and Economic Behavior 36: 193- Maschler, M., and B. Peleg. 1967. The structure of the- kernel
218. of a cooperative game. SLAM Journal of Applied
Mathematics 15: 569-604.
Coalitions 1723
Maschler, M., B. Peleg, andL.S. Shapley. 1971. The kernel and Papai, S. 2004. Unique stability in simple coalition formation
bargaining set for convex games. International Journal of games. G a m e s a n d E c o n o m i c B e h a v i o r 48:
Game Theory 1: 73-93. 337-354.
Mas-Colell, A. 1989. An equivalence theorem for a bargaining Pauly, M. 1970. Cores and clubs. P u b l i c C h o i c e 9: 53-65.
set. Journal of Mathematical Economics 18: 129-139. Peleg, B. 1985. The axiomatization of the core of cooperative
Mauleon, A., and V.J. Vannetelbosch. 2004. Farsightedness games without side payments. J o u r n a l o f M a t h e -
and cautiousness in coalition fomration games with m a t i c a l E c o n o m i c s 14: 203-214.
positive spillovers. Theory and Decision 56: 291-324. Perry, M., and P.J. Reny. 1994. A noncooperative view of
McLean, R., and A. Postlewaite. 2005. Core convergence with coalition formation and the core. E c o n o m e t r i c a 6 2 :
asymmetric information. Games and Economic Behavior 795-817.
50: 58-78. Predtetchinski, A., and J.-J. Herings. 2004. A necessary and
Milchtaich, I. 1996. Congestion games with player-specific sufficient condition for non-emptiness of the core of a non-
payoff functions. Games and Economic Behavior 13: transferable utility game. J o u r n a l o f E c o n o m i c
111- 124. T h e o r y 116: 84-92.
Milchtaich, I., and E. Winter. 2002. Stability and segregation Qin, C.-Z. 1994. The inner core of an n-person game. G a m e s
in group formation. G a m e s a n d E c o n o m i c a n d E c o n o m i c B e h a v i o r 6: 431-444.
B e l l a v - i o r 38: 318-346. Ray, D., and R. Vohra. 1997. Equilibrium binding agree-
Milleron, J.C. 1972. Theory of value with public goods: A ments. J o u r n a l o f E c o n o m i c T h e o r y 73: 30-78.
survey article. Journal of Economic Theory 5: 419-477. Ray, D., and R. Vohra. 1999. A theory of endogenous coalition
Monderer, D., and L.S. Shapley. 1996. Potential games. structures. G a m e s a n d E c o n o m i c B e h a v i o r
Games and Economic Behavior 14: 124-143. 26: 286-336.
Muench, T. 1972. The core and the Lindahl equlibrium of an Reny, P.J., andM. Wooders. 1996a. The partnered core of a
economy with public goods; an example. Journal of game without side payments. J o u r n a l of
Economic Theory 4: 241-255. E c o n o m i c Theory 70: 298-311.
Myerson, R.B. 1977. Graphs and cooperation in games. Reny, P.J., and M. Wooders. 1996b. Credible threats of
Mathematics of Operations Research 2: 225-229. secession, partnership, and commonwealths. In U n d e r -
Myerson, R.B. 1995. Sustainable matching plans with adverse standing strategic interaction: Essays in
selection. G a m e s a n d E c o n o m i c B e h a v i o r 9: h o n o r o f R e i n h a r d S e l t e n , ed. W. Albers et al.
35-65. Berlin: Springer- Verlag.
Myerson, R.B. 2007. Virtual utility and the core for games Roth, A.E. 1984. Stable coalition formation: Aspects of a
with incomplete information. Journal of dynamic theory. In C o a l i t i o n s a n d c o l l e c t i v e
E c o n o m i c Theory 136: 260-285. a c t i o n , ed. M. Holler. Wiirzberg: Physica-Verlag.
Nash, .1. 1953. Two-person cooperative games. Samet, D., and E. Zemel. 1984. On the core and dual set of
E c o n o m e t r i c a 21: 128-140. linear programming games. M a t h e m a t i c s of
Owen, G. 1975. On the core of linear production games. O p e r a t i o n s R e s e a r c h 9: 309-316.
M a t h e m a t i c a l P r o g r a m m i n g 9: 358-370. Scotchmer, S., and M. Wooders. 1988. M o n o t o n i c i t y i n
Owen, G. 1977. Values of games with a priori unions. In games that exhaust gains to scale,
Essays i n m a t h e m a t i c a l e c o n o m i c s a n d Technical Report No. 525, IMSSS, Stanford University.
g a m e theory, ed. R. Henn and O. Moeschlin. Seidmann, D.J., and E. Winter. 1998. Gradual coalition
Berlin/Heidel- berg/New York: Springer. formation. R e v i e w o f E c o n o m i c S t u d i e s 65:
Page Jr., F.H. 1997. Market games with differential infor- 793-815.
mation and infinite dimensional commodity spaces: The Serrano, R., and R. Vohra. 2006. Information transmission in
core. E c o n o m i c T h e o r y 9: 151-159. coalitional voting games. J o u r n a l o f E c o n o m i c
Page Jr., F.H., and M. Wooders. 1996. The partnered core and T h e o r y (forthcoming).
the partnered competitive equilibrium. E c o n o m i c s Selten, R. 1981. A non-cooperative model of characteristic
Letters 52: 143-152. function bargaining. In E s s a y s i n g a m e t h e o r y
Page, F.H. Jr., and M. Wooders. 2006. S t r a t e g i c b a s i n s and mathematical economics in honor of
of attraction, the path dominance core, O . M o r g e n s t e r n , ed. V. Bohm and H. Nachtkamp.
a n d n e t w o r k f o r m a t i o n g a m e s , Working Mannheim: Bibliographisches Institut.
Paper No. 06-W14 (revised), Department of Economics, Shapley, L.S. 1953. A value for n-person games. In
Vanderbilt University. Contributions to the theory of games II,
Page, F.H. Jr., and M. Wooders. 2007. Networks and clubs. ed. A.W. Tucker and R.D. Luce. Princeton: Princeton
Journal of Economic Behavior & O r g a n i z a t i o n University Press.
(forthcoming). Shapley, L.S. 1967. On balanced sets and cores. N a v e l
Page Jr., F.H., M. Wooders, and S. Karnat. 2005. Networks R e s e a r c h L o g i s t i c s Q u a r t e r l y 9: 45-48.
and farsighted stability. Journal of Economic Theory 120: Shapley, L.S. 1971. Cores of convex games. I n t e r n a t i o n a l
257-269. J o u r n a l o f G a m e T h e o r y 1: 11-26.
Shapley, L.S., and M. Shubik. 1966. Quasi-cores in a
monetary economy with nonconvex preferences.
E c o n o m e t r i c a 34: 805-827.
1724 Coase Theorem
Shapley, L.S., and M. Shubik. 1969a. On the core of an Yohra. R. 1999. Incomplete information, incentive com-
economic system with externalities. A m e r i c a n E c o - patibility and the core. J o u r n a l o f E c o n o m i c
n o m i c R e v i e w 59: 678-684. T h e o r y 86: 123-147.
Shapley, L.S., and M. Shubik. 1969b. On market games. Yolij. O. 2000. Communication, credible improvements and
J o u r n a l o f E c o n o m i c T h e o r y 1: 9 25. the core of an economy with asymmetric information.
Shapley, L.S., and .VI. Shubik. 1972. The assignment game 1: International Journal of Game Theory
The core. I n t e r n a t i o n a l J o u r n a l o f G a m e 29: 63-79.
T h e o r y 1: 11-30. Watts. A. 2001. A dynamic model of network formation.
Shapley, L.S., and VI. Shubik. 1975. Competitive outcomes in G a m e s a n d E c o n o m i c B e h a v i o r 34: 331-341.
the cores of market games. I n t e r n a t i o n a l J o u r n a l Weber. S. 1979. On e-cores of balanced games. I n t e r n a -
o f G a m e T h e o r y 4: 229-237. t i o n a l J o u r n a l o f G a m e T h e o r y 8: 241-250.
Sharkey, WAV'. 1990. Cores of games with fixed costs and Weber. S. 1981. Some results on the weak core of a non-
shared facilities. I n t e r n a t i o n a l Economic sidepayment game with infinitely many players.
R e v i e w 31: 245-262. J o u r n a l o f M a t h e m a t i c a l E c o n o m i c s 8:
Shenoy, P.P. 1979. On coalition formation: A game- theoretic 101-111.
approach. I n t e r n a t i o n a l J o u r n a l o f G a m e Weber. S.. and H. Wiesmeth. 1991. The equivalence of core
T h e o r y 8: 133 164. and cost share equilibria in an economy with a pubhc
Shenoy, P.P. 1980. A dynamic solution concept for abstract good. J o u r n a l o f E c o n o m i c T h e o r y 54: 180-
games. Journal of Optimization Theory and Applications 197.
32: 151 169. Wilson. R. 1978. Information, efficiency and the core of an
Shubik, M. 1959. Edgeworth market games. In C o n t r i b u - economy. E c o n o m e t r i c a 46: 807-816.
tions to the theory of games IV, annals Winter. E.. and M. Wooders. 1994. An axiomatization of the
o f m a t h e m a t i c a l s t u d i e s 4 0 , oil. F.R. Luce and core for finite and continuum games. S o c i a l C h o i c e
A. Tucker. Princeton: Princeton University Press. a n d W e l f a r e 11: 165-175.
Shubik, M. 1671. The bridge game economy: An example of Wooders, VI. 1978. Equilibria, the core, and jurisdiction
indivisibilities. J o u r n a l o f P o l i t i c a l E c o n o m y structures in economies with a local public good. J o u r -
79: 909-912. n a l o f E c o n o m i c T h e o r y 18: 328-348.
Shubik, M., and M. Wooders. 1982. Near markets and market Wooders, VI. 1980. The Ticbout hypothesis: Near optimality
games. Cowles Foundation Discussion Paper No. 657, also in local pubhc good economies. E c o n o m e t r i c a 48:
published as ‘Clubs, near markets and market games’. In 1467-1486.
Topics in mathematical economies and Wooders, VI. 1983. The epsilon core of a large replica game.
game theory: Essays in honor of Robert J o u r n a l o f M a t h e m a t i c a l E c o n o m i c s 11:
J . A u m a n n , ed. M. Wooders. Fields Institute 277-300.
Communication 23. Providence: American Mathematical Wooders, M. 1994a. Equivalence of games and markets.
Society, 2000. E c o n o m e t r i c a 62: 1141-1160.
Shubik, M., and M. Wooders. 1983a. Approximate cores of Wooders, M. 1994b. Large games and economies with
replica games and economies: Part I. Replica games, effective small groups. In G a m e - t h e o r e t i c
externalities, and approximate cores. M a t h e m a t i c a l methods in general equilibrium analysis,
S o c i a l S c i e n c e s 6: 27-48. ed. J.-F. Mertens and S. Sorin. Dordrecht: Kluwer
Shubik, M., and M. Wooders. 1983b. Approximate cores of Academic Publishers.
replica games and economies: Part II. Set-up costs and Wooders, M., and W.R. Zame. 1987. Large games; fair and
firm formation in coalition production economies. stable outcomes. J o u r n a l o f E c o n o m i c T h e o r y
M a t h e m a t i c a l S o c i a l S c i e n c e s 6 : 285 306. 42: 59 93.
Tanman, Y., A. Urbano, and J. Watanabe. 1997. A model of Xue, L. 1998. Coalitional stability under perfect foresight.
multiproduct price competition. J o u r n a l of
E c o n o m i c T h e o r y 77: 377 401.
Ticbout, C. 1956. A pure theory of local expenditures.
J o u r n a l o f P o l i t i c a l E c o n o m y 64: 416 424.
Coase Theorem
Tijs, S. 1681. Bounds for the core and the tau-value. In G a m e
t h e o r y a n d m a t h e m a t i c a l e c o n o m i e s , ed. O.
Moeschlin and D. Pallaschke. Amsterdam: North- Francesco Parisi
Holland.
van den Nouweland, A. 2005. Models of network formation in
cooperative games. In G r o u p f o r m a t i o n i n
economies: Networks, clubs and
coalitions, Abstract
ed. G. Demange and M. Wooders. Cambridge: Cam- The Coase Theorem holds that, regardless of the
bridge University Press.
initial allocation of property rights and choice of
von Neumann, .1., and O. Morgenstem. 1953. T h e o r y o f
g a m e s a n d e c o n o m i c b e h a v i o r . Princeton: remedial protection, the market will
Coase Theorem 1725
determine ultimate allocations of legal entitle- is no further potential for reciprocal profit, which
ments, based on their relative value to different will not be exhausted until each right is in the hands
parties. Coase’s assertion has occasioned of the highest-valuing individual. The Coase
intense debate. This article provides an intel- theorem predicts that, in a competitive market
lectual history of Coase’s fundamental theorem environment without legal or factual impediments
and surveys the legal and economic literature to exchange, the final allocation of rights will be
that has developed around it. It appraises the efficient.
most notable attacks to the Coase Theorem, and This article discusses the pervasive methodo-
examines its methodological implications and logical implications of Ronald Coase’s idea to the
normative and practical significance in legal field of law.
and policy settings.
work of his undergraduate teacher. In his Nobel some of the Chicago economists. George Stigler
memorial lecture, Coase acknowledges the impor- was among Coase’s early critics:
tance of his encounter with Plant as a ‘great stroke Ronald Coase criticized Pigou’s theory rather casu-
ally, in the course of a masterly analysis of the
of luck’ that cultivated his interest in property rights
regulatory philosophy underlying the Federal Com-
theory (Coase , p. 715). For Coase, Plant’s teaching munication Commission’s [FCC] work. Chicago
that ‘[t]he normal economic system works itself economists could not understand how so fine an
(Salter , pp. 16-17) and that prices in a competitive economist as Coase would make so obvious a mis-
take. Since he persisted, we invited Coase (he was
market lead resources to their highest valuing uses
then at the University of Virginia) to come and give a
was a revelation into the dynamic of the economic talk on it. Some twenty economists from Chicago and
system: ‘I was then 21 years of age, and the sun Ronald Coase assembled one evening at the home of
never ceased to shine. I could never have imagined Aaron Director. ... In the course of two hours of
that these ideas would become some 60 years later a argument the vote went from twenty against and one
for Coase to twenty-one for Coase. What an
major justification for the award of a Nobel Prize. exhilarating event! (Stigler , pp. 75-6)
And it is a strange experience to be praised in my
eighties for work / did in my twenties’ (Coase , p. According to Coase, the objections to his FCC
716). paper are at the origin of his later 1960 article on
The experience of the following years at the the problem of social costs. Coase recalls that he
London School of Economics laid the methodo- was urged to omit that section of his FCC article,
logical foundations of what would later become something he refused to do. In retrospect, Coase
Coase’s theorem on the problem of social costs. All believes that had it not been for the Chicago
the ingredients of his revolutionary analysis on the economists’ attacks his full-fledged idea would
debated theme of social cost had been profiled have never been formulated ( , p. 250).
during his LSE years (see Williamson and Winter ,
pp. 34-5). But it is not until the late 1950s that
Coase verbalized such a simple and yet ingenious
The Positive Coase Theorem
idea. He had first expounded the core of his later
The arguments that were refined in the course of
theorem in an article published in 1959. In those such debate were later put together in the form of an
pages, one grasps what would later become the
article for the Journal of Law and Economics in
central theme of Coase’s celebrated argument: 1960, titled ‘The Problem of Social Cost’. This
Whether a newly discovered cave belongs to the man article - later known as the Coase theorem - soon
who discovered it, the man on whose land the became a milestone in legal and economic litera-
entrance to the cave is located, or the man who owns ture. In the course of his austere discussion, Coase
the surface under which the cave is situated is no
doubt dependent on the law of property. But the law
does not reveal any sign of anticipated realization of
merely detemrines the person with whom it is nec- the revolutionary power of his insight. Indeed,
essary to make a contract to obtain the use of the Coase insists that he never intended to convey his
cave. Whether the cave is used for storing bank thoughts in the precise and analytical form of a
records, as a natural gas reservoir, or for growing
mushrooms depends, not on the law of property, but
theorem ( , p. 157).
on whether the bank, the natural gas corporation, or A few years after the publication of ‘The Prob-
the mushroom concern will pay the most in order to lem of Social Cost’, a sizeable number of com-
be able to use the cave. ( , p. 25) mentaries and theoretical elaborations were
The discussion of the rationale of property developed on Coase’s newly presented theme. The
rights under Coase’s highest bidder framework unpretentious style of Coase’s article had thus been
obviously contained an attack on the Pigouvian crowned by a notoriety rarely attained by legal
approach (Pigou ) to the problem. The point was writings of any sort (Shapiro , p. 1540). Part of the
rather self-evident to Coase, but not so for uproar is explained by the fact that the article
challenged an established
Coase Theorem 1727
principle of public finance (see Manne pp. 123-6).novelty of his approach inspired an entire gener-
Before ‘The Problem of Social Cost’, very little ation of scholars - pioneers in this new branch of
attention had been given to the possibility that the
applied economics. Only a few months prior to
problem of externalities could be resolved throughreceiving the Nobel Prize for economics, in occa-
free market exchanges. sion of the First Annual Meeting of the American
Coase boldly attacked the conclusions reachedLaw and Economics Association, Ronald H. Coase
by the Pigouvian tradition by suggesting its influ-
was recognized, together with Guido Calabresi,
ence was in part due to the lack of clarity in itsHenry G. Manne and Richard A. Posner, as a
exposition ( , p. 39). Coase departs from the
founding father of Law and Economics. This
Pigouvian approach by demonstrating that, in the recognition follows many years of challenging
absence of transaction costs, generators and victims
debate. Many of the writings that developed around
of externalities will negotiate an efficient allocation
‘The Problem of Social Cost’ tested the premises of
of resources, independent of the initial assignment
Coase’s model, seeking to undermine the conditions
of rights among them. In confuting the conclusionsof his model and stressing the lack of practical
of the Pigouvian tradition, Coase gave life to a reach of his analysis.
model with the potential for the evaluation of an Further criticisms pertained to three fundamen-
unlimited number of legal and social issues. tal points. One group of critics observed that the
George Stigler was the first scholar to restate
Coase Theorem disregarded the inter-industrial
Coase’s model in the form of a theorem: ‘[U]nder long-term effects of the system (Calabresi ;
perfect competition private and social costs will be
Wellisz ). These critics argued that Coase ignored
equal’ ( , p. 113). Demsetz ( , p. 349)
the possible disequilibria which may occur after the
defined the theorem in the following terms: ‘Therenegotiation and the likely dynamic changes in the
are two striking implications of this process that are
initial equilibrium. In the context of Coase’s well-
true in a world of zero transaction costs. The output
known example, if the right has been assigned to the
mix that results when the exchange of property ranchers, the farmer will have to pay local ranchers
rights is allowed is efficient and the mix is until they all relinquish their right of pasture. The
independent of who is assigned ownership (except entire cost will, thus, burden the farming industry.
that different wealth distributions may result in Farmers will either have to bear the burden of the
different demands)’. Soon thereafter, Guido injury caused by the livestock or agree to pay the
Calabresi stated the same principle more price demanded by the ranchers, whichever is less,
descriptively: ‘Thus, if one assumes rationality, no
on the assumption that negotiation is costless.
transaction costs, and no legal impediments to Under this liability rale, the cost of ranching will
bargaining, all misallocations of resources would be
not reflect the cost imposed on the farmers. The
frilly ciued in the market by bargains’ (Calabresi ,
transfer of rights and liability from one group to
p. 68). another will, therefore, result in a shift in the
The implicit premise of Coase’s analysis draws
relative wealth and costs associated with the two
upon a fundamental postulate of microeconomic industries. The criticism claims that, in the long ran,
theory: the free exchange of goods in the market every shift of wealth will lead to an inter-industrial
moves goods towards their optimal allocation. The disequilibrium.
voluntary transfer of individual rights in the In 1968, Calabresi, one of the initial proponents
marketplace, thus, will ciue a non-optimal alloca-of this criticism, reconsidered it, noting that in the
tion of legal entitlements. presence of determined conditions the conclusions
of Coase remain as true in the long ran as in the
The Coasean Methodological Revolution short term ( , p. 67). Calabresi’s
Coase’s article constitutes, according to many later analysis re-established the authority of the
commentators, the first example of an economic Coase Theorem, at least on this point. It became
analysis of law in North American literature. The clear that Coase did not ignore the long-term
1728 Coase Theorem
effects of his model. Perhaps not explicitly, he had different final allocations, notwithstanding an equal
considered them to their logical extreme. Calabresi level of efficiency. In order for the final allocations
observes: ‘The reason is simply that (on the given to be identical, it is necessary that the utility
assumptions) the same type of transactions which functions of the individuals involved are almost
cured the short mn misallocation would also occur linear. The absence of the income effects implies, in
to cure the long mn ones. ... This process would this sense, that the demand functions for the good
continue until no bargain could improve the are independent of the income level.
allocation of resources’ (1967, pp. 67-8). It should be further observed that the credibility
In 1972, Harold Demsetz joined this debate, of the threat made in the course of strategic
demonstrating with a more systematic analysis that bargaining finds its limits in the market structure in
the conclusions reached by Coase are not corroded which the Coasean negotiation takes place. In
by the long-term effects of a change in the general, the competitive structure of the market
assignment of property rights. Demsetz’s reasoning eliminates much of the advantage that can be
finds its basis in the principle that the process of obtained through strategic behavior in the negoti-
allocation of scarce resources among alternative ation process. Inasmuch as the market of resources
uses is analogous to the process of constrained is competitive, strategic bargaining is not capable of
optimization of the single owner of two conflicting bringing about any abnormal return.
activities. The criticism, however, appears to be on the
An additional critique, formulated by Calabresi ( mark when it argues that, in some marginal situa-
) and Wellisz ( ), suggests tions, the curing role of the free exchange may still
that strategic behaviour in the bargaining process be impeded. For example, consider reversing the
risks compromising Coase’s results. These authors assignment of property rights between the rancher
observe that the change in the rule of law creates the and the farmer. In such a situation, the farmer is
conditions for possible extortion on the part of the likely not to have an equally large number of
right holders against the other individuals who are alternatives. The transfer of a farm from one place
bound by the rule. The argument is that individuals to another is costly, and farming unavoidably
are likely to threaten the use of their own rights in a requires the undertaking of location-specific
measure which exceeds the optimal level, in order investments. Since some capital investment is
to maximize the gain from the release of their own irreversibly locked in that specific location, the
legal entitlements. By introducing the possibility of farmer has less opportunity to relocate than the
strategic behaviour in the negotiation, the result rancher. The rancher, consequently, finds himself in
may differ from the optimal equilibrium. Demsetz ( a position of local monopoly in the sale of his
, p. 21) supplied a convincing answer to property right. Demsetz considers the monopoly
this criticism. According to Demsetz, the possibility that affects this feature of the Coasean exchange
of strategic behaviour in the negotiations does not identical to the standard monopoly of microeco-
alter the efficiency in the final allocation of nomic analysis ( , p. 24). According to
resources between the two activities. Strategies will Demsetz, the concerns for possible monopolistic
be capable of altering the internal distribution of the structures in the market of rights considered by
contractual surplus between the parties, but not the Coase must not, however, be used to raise again the
final outcome of the negotiation. already resolved problem of the initial allocation of
It should be noted, however, that the entire rights, since reversing the rale of liability would
analysis presupposes that the so-called income simply result in the farmer now having monopoly
effect can be ignored. In general, a different allo- power ( , pp. 24-5).
cation of property rights implies a different distri- A second group of critics concentrated on the
bution of wealth between the individuals involved. distributive effects of the model (Regan ; Nutter ).
Different initial endowments generate They argued that a final efficient allocation of
resources requires transfers of wealth
Coase Theorem 1729
induced by the changed legal rule. Further, these assignment of rights and choice of remedies will
critics observed that, even if one disregards the affect final allocations.
distributive effects of the rule, a different assign-
ment of the right could in some cases create the The Relevance of Transaction Costs and the
conditions for strategic behaviour in negotiation Simple Normative Coase Theorem
capable of disturbing the efficiency of the final The notion of transaction costs has acquired par-
allocation. ticular importance in law and economics as the
A third group of authors focused on the scarce absence of transaction costs represents a funda-
realism of the no-transaction-cost assumption (see mental condition for the applicability of the positive
Cooter , p. 457). According to this criticism, the Coase theorem. Although at first impression
true Achilles’ heel of Coase’s analysis was in the transaction costs play a role analogous to trans-
unrealistic assumption of absence of costs in the portation costs in international trade or, more gen-
process of negotiation and transfer of the right. erally, to the contracting costs in the economics of
These authors observed that the idea of a transac- exchange (Demsetz , p. 20), in Coase’s world the
tion without cost is a logical fiction cloaking a mere role of transaction costs has much greater normative
tautology. implications.
For purposes of the theorem, the notion of
transactions costs should include not only
The Normative Coase Theorem bargaining costs associated with the negotiation and
conclusion of the contract but also all costs
The utility of models predicting behaviour in a zero associated with the strategic behaviour of the parties
transaction-cost world is that they guide the law - and the execution and enforcement of the
whose object is to develop rules which approximate transaction. The notion of transaction costs should
the zero transaction-cost world as closely as thus include ex ante costs due to asymmetric
possible - in responding to legal problems arising in information, adverse selection, free riding, and
a positive transaction-cost environment (Epstein ). hold-up strategies, as well as ex post costs
The vast literature that developed around Coase’s associated with monitoring and enforcing the
theorem formulated important normative corollaries contracts.
of it, based on the evaluation of the relative costs of Strategic behaviour may be an important source
alternative assignments of rights. of transaction costs in a Coasean setting. In Coase’s
According to the positive Coase theorem, absent various examples, the property rights which are
transaction costs, the final allocation of scarce exchanged are private goods, characterized by their
resources would coincide with the use that an excludability. Difficulties arise when the object of
individual who is the single owner of different the Coasean bargaining is an entitlement which has
activities would make of his endowments, the nature of a public good (see Cheung , pp. 49-
regardless of the initial assignment of rights and 70). Due to the well-known problems associated
choice of remedial protection. When transaction with the supply of public goods, the Coasean
costs are present, however, an exchange will be bargaining solution may fail to cure a non-optimal
pursued only to the point at which its marginal allocation of rights that falls within this category.
benefit equals the marginal cost of the transaction. Consider a scenario in which the object of the
If transaction costs exceed the benefits of a contract, Coasean negotiation consists of a non-excludable
no exchange will take place in the market. For a right (for example, the right to enjoy pollution-free
right to be exchanged it is necessary that transaction air in a residential environment). As well known,
costs be less than the difference between the individuals will not reveal their own preferences for
demand and supply prices. If this condition is not public goods through the price system, placing
met, the Coasean bargaining will not be carried out, public goods among those cases that are most
and both initial resistant to the Coasean antidote.
1730 Coase Theorem
A first simple normative reformulation of the receive the entitlement. Once the entitlement deci-
Coase theorem focuses on transaction costs and the sion is made, the law must decide how the entitle-
role that legal systems may play in reducing these ment is to be protected and whether it may be
impediments to voluntary bargaining. Legal rules transferred. Articulating a concept of entitlements
can lower obstacles to private bargaining, such as protected by property, liability or inalienability
by reducing transaction costs and minimizing other rales, Calabresi and Melamed ( ) develop a
costs associated with transfer (strategic, legal, and framework that integrates the approaches of prop-
so on). For this reason, transactional cost erty and tort. Entitlements can be protected by
considerations should be fundamental to any property rules (transfer of the entitlement involves a
analysis of legal regimes and the design of voluntary sale by its holder), liability rales (the
contracting processes, governance mechanisms and entitlement may be destroyed by another party if he
institutions. is willing to pay an objectively determined value for
it), or rales of inalienability (transfer of the
The Complex Normative Coase Theorem entitlement is not permitted, even between a willing
The first original formulation of Coase’s proposi- seller and a willing buyer). Calabresi and Melamed
tion can be restated as a normative theorem: in the allow for a wide range of concerns to be balanced
presence of positive transaction costs, the efficiency through the assignment of a particular entitlement.
of the final allocation is not independent of the Calabresi and Melamed outline how, given the
choice of the legal rale, and that the preferable reality of transaction costs, an economic efficiency
initial assignment of rights is that which minimizes approach selects one allocation of entitlements over
the effects of such transaction costs. The various another. Entitlements cannot be enforced solely
normative restatements of the Coase Theorem aim through property rules because, even if the transfer
at identifying legal rules and remedies that replicate would benefit all parties, high transaction costs
the outcomes of a hypothetical Coasean bargaining (especially the hold-up problem) may prevent an
or to mimic the solution that would be chosen by efficient reallocation. Calabresi and Melamed
the single owner of interfering resources. demonstrate how liability rales often achieve a
Important normative reformulations of the Coase combination of efficiency and distributive results
Theorem focus on two important elements: that would be difficult to achieve under a property
relevance of initial assignment of rights and rele- rale. Calabresi points out that Coase’s analysis
vance of remedial protection. Demsetz ( ) and offers invaluable instruments for the identification
Calabresi and Melamed ( ) were among the of the areas in which public intervention becomes
first to discuss systematically the problems resulting desirable (Calabresi , pp. 72-3). In its normative
from lifting the assumption of zero transaction version, the theorem indicates that legal rules that
costs. Articulating the normative core of the Coase minimize the effects of such costs are to be
theorem, Demsetz observes that the introduction of preferred for being relatively more efficient
significant transaction costs into the choice of (Polinksy , p. 14). In its more complex formulation,
liability rule analysis does affect resource the Coase theorem provides, indeed, a guide for
allocation. One liability rule may be superior to such a choice.
another because the difficulty of avoiding costly The following is a classic illustration (Polinsky ,
interactions is usually different for the interacting pp. 11-14). The smoke of a factory soils laundry
parties. Accordingly, the normative predicament which is line drying on five neighbouring
indicates that the rale of liability should be based on properties. The losses amount to $150 for each
which party can avoid the costly interaction at the neighbour, for a total of $750. The damage could be
lowest cost. eliminated through the installation of a purifying
When two or more parties have conflicting filter on the industrial smokestack or through the
interests in the same resource, the law must decide acquisition of electric dryers on the part of each one
which party shall prevail, that is, which party shall of the neighbouring owners. The cost
Coase Theorem 1731
of the filter would amount to $300, while the dryers problems. Notwithstanding the obvious measure-
would impose a cost of $100 per household, for a ment and information problems, Epstein ( )
total of $500. The first solution is obviously more stresses the importance of the ‘single owner test’:
efficient, since the acquisition of five dryers would where resources are under the command of two or
require a greater expenditure than the single filter. more persons, the legal arrangement should attempt
The Coase theorem predicts that in the absence of to induce all the parties to behave in the same way
transaction costs the efficient solution will be that a single owner would. Epstein concludes that,
chosen independently of the initial assignment of where the single owner test yields a unique result,
property rights. Even if we assume an initial that result should be adopted as the legal rule.
allocation of polluting right to the industry (that is, Where the single owner test does not yield clear
fully legalizing industrial emissions), the results, however, no corollary principle will provide
landowners would jointly offer to buy the industrial a decisive answer to the particular problem.
filter at their expense. Sharing the cost of the filter Further exploring the choice between property
in equal parts, each owner would face a cost of only and liability rules suggested by Calabresi and
$60, with a relative saving of $40 compared with Melamed, Kaplow and Shavell ( ) address
the otherwise necessary acquisition of a personal several factors casting doubt on the equivalence of
dryer. these alternatives in low transaction-cost envi-
If we relax the initial assumption of no trans- ronments. Their analysis considers several objec-
action costs, the initial allocation of property rights tions to Coasean costless bargaining, including the
no longer is immaterial. Imagine that each owner inability of a party to ascertain what the other is
has to face a cost of $120 in order to negotiate the willing to pay or accept, victims’ ability to mitigate
contract with his neighbours and with the owner of harm, the problem posed by one party being
the industrial plant. If the right is assigned to the judgment proof, and administrative costs. Kaplow
industry, each landowner will have to choose and Shavell find a presumption in favour of liability
whether to bear the loss of his soiled laundry for rales over property rules in the context of harmful
$150, to acquire the electric dryer for $100, or, externalities, but that this may be overcome as a
finally, to undertake the negotiation process for a result of one of more of the factors they describe.
total pro-rata cost of $180. Considering these After considering some of the proffered
alternatives, each rational land- owner would justifications for the use of property rales to protect
choose to acquire his own dryer, generating a possessory interests, the authors find a strong
socially non-optimal outcome. However, the theoretical case for the protection of these interests
assignment of property rights to the neighbouring using property rales. The normative Coase theorem
residents rather than to the polluting industry would thus underlies the choice of the optimal system to
minimize the effect of positive transaction costs, ensure the protection of various types of property
since the industry would have incentives to install rights.
the filter, without any need for Coasean bargaining Also bridging the gap between Coase, where
with the neighbours. liability rales and property rales are equally effi-
Two impediments to bargaining (that is, sources cient, and Calabresi and Melamed, where high
of transaction costs) take the form of externalities transaction costs lead to a preference for liability
and hold-up, which Epstein ( ) rales, is the work by Ayres and Talley ( ) on
shows stand in inverse relationship to each other. private information as a transaction cost. The
He defines the optimal legal rule as that which inefficiency occurs when parties misrepresent their
minimizes the sum of these externality and holdout own valuations to gain strategic advantage in the
costs in any particular institutional setting. Epstein bargaining process. Focusing on the effect of
demonstrates, through examples in property, splitting an entitlement between two rivalrous users
restitution and tort, how Coase’s transaction costs rather than among buyers or among sellers, these
model plays the central organizing role in authors find that, when two parties have
developing legal responses to many private law
1732 Coase Theorem
private information about how much they value an the implications of his theorem in their areas of
entitlement, endowing each party with a partial research. Coase’s invitation was taken up by a
claim to the entitlement can reduce the incentive to number of economists and lawyers who
behave strategically during bargaining by inducing experimented with the unparalleled analytical
greater disclosure. A bargainer has two Coasean potential of Coase’s theorem in their research.
alternatives: buy the other party’s claim or sell According to Coase, economists in the Pigouvian
one’s own claim. The normative formulation of tradition fail to consider the possible reciprocity of
Ayres and Talley is that a liability rule regime is the effects of individual choices. By labelling one
preferable because it allows a party’s decision to agent as injurer and the other as victim, the
pursue one of these alternative transactions to Pigouvian tradition presumes an initial allocation of
function as a credible signal of a low or high rights (Comes and Sandler , p. 59). In such a
valuation, thereby encouraging more efficient trade. manner this approach falls into a serious
Building upon the literature on property frag- methodological error, notwithstanding empirical
mentation (Heller ; Buchanan and Yoon ), Parisi ( ) psychological studies suggesting otherwise (see
and Schulz et al. ( ) suggest that property is Kahneman et al.
subject to a fundamental law of entropy. In the , pp. 1325^18). By taxing the generator of the
property context, entropy induces a one-directional externality in a measure corresponding to the
bias. This bias is driven by asymmetric transaction difference between the private cost and the social
costs - it is often harder to reunite separated cost of his own activity, the followers of Pigou fail
property bundles than to break them apart. Parisi to consider the effects of potential victims’
hypothesizes that courts and legislators account for behaviour. If the social cost of the industrial
the presence of asymmetric transaction costs and emissions is calculated by aggregating the economic
correct for problem through the selective use of disadvantages of the residents who are negatively
remedies and by selecting default rules designed to affected by the smoke, the figure will vary with the
minimize the total deadweight losses of property number of individuals who fix their residence in
fragmentation. Parisi ( ) that area. If the Pigouvian tax is imposed on the
offers a industrial activity only, there will be less incentive
reformulation of the normative Coase theorem in for each resident to consider moving into a different
situations characterized by asymmetric transaction neighbourhood. New individuals may actually
and strategic costs, such as when complementary locate their residence in that area, without
fragments of property are attributed to different considering the potential increase in the costs
owners. The asymmetry arises from the fact that it imposed on the industrial activity.
is often harder to reunite separated property bundles Through these arguments, Coase’s analysis
than to break them apart. This variant of the Coase demonstrates the incapacity of the Pigouvian
theorem turns on (a) an initial allocation of approach to consider the interdependence of the
entitlements that minimizes the effects of the harmful effects generated by individual choices.
positive transaction costs, and (b) the selection of Coase’s analysis occasioned a paradigmatic shift in
legal rules that reduce social welfare losses by legal and economic analysis, and, as Henry Manne
facilitating optimal levels of reunification. once observed, ‘it is hard to imagine law ever again
being free of the influence of the techniques and
findings of objective economic analysis’ ( , p.
4). His theorem, short of
providing a simplistic formula for the social cost
problem, suggests an alternative approach based on
The Coase Theorem and Its Legacy in the evaluation of the relative costs of alternative
Law and Economics assignments of rights and legal protection.
In 1960 Coase entrusted legal and economic
scholars with the challenging task of deriving
Coase, Ronald Harry (Born 1910) 1733
role of transaction costs in economic activity 1951. His time at the LSE was interrupted by the
and the influence of alternative institutional Second World War, during which he served as a
structures on economic performance are hall- statistician at the Forestry Commission (1940-41)
marks of Coase’s scholarship, and both the and in the Central Statistical Office, Offices of the
economic analysis of law and the new institu- War Cabinet (1941—46). Coase left the LSI' for the
tional economics are outgrowths of Ins work. US and the University of Buffalo in 1951,
Coase occupies a significant although somewhat remaining there until 1958. Following a year spent
controversial place in the history of the Chicago at the Center for Advanced Study in the Behavioral
School of economics. Sciences at Stanford, he accepted an appointment at
the University of Virginia in 1959.
Keywords Although Coase is most closely associated with
Accounting; Average cost pricing; Bargaining; the Chicago School, his two most influential works
Chicago School; Coase conjecture; Coase the- - ‘The Nature of the Firm’ (1937) and ‘The Problem
orem; Coase, R. H.; Cobweb theorem; Con- of Social Cost’ (I960) - were written before he
sumer theory; Externalities; Firm, organization arrived at Chicago, in 1964, to teach at the Law
of; Government failure; Imperfect competition; School and to join Aaron Director in editing the
Knight, F.; Law and economics; Lemer, A.; Journal of Law and Economics. Coase retired from
Marginal cost pricing; Market failure; the University of Chicago in 1981 and was awarded
Mathematics and economics; Monopoly; Multi- the Nobel Prize in Economics in 1991.
part pricing; New institutional economies;
Opportunity cost; Posner, R.; Public goods;
Scholarly Work
Public utilities; Public utility pricing; Rational
expectations; Social cost theory; Special While most economists identify Coase with his two
interests; Transaction costs; Viner, J. classic articles on the film and social costs, his
published output is very extensive and ranges across
topics such as accounting, advertising, public
JEL Classifications goods, consumer surplus, public utility pricing,
B31 monopoly theory, blackmail, the economic role of
government, and the history of economic thought.
Ronald Harry Coase was born on 29 December
Several themes appear throughout Coase’s work:
1910 in the London suburb of Willesden. He
the important role played by institutions - in
received the BSc in Commerce from the London
particular the film, the market and the law - in
School of Economics in 1932 and while there was
determining economic structure and performance,
greatly influenced by Arnold Plant, who, as Coase
the role of transaction costs in economic activity,
has said, taught him many of the lessons that later
the need for a comparative institutional approach to
came to be associated with the Chicago School.
economic policy, and a distaste for abstract
Interestingly, Coase did not take a single economics
theorizing. These themes come through
course while he was at the LSE, which he suggests
unmistakably in The Finn, the Market and the Law
gave him 'a freedom in thinking about economic
(1988) and Essays on Economics and Economists
problems which [he] might not otherwise have had'
(1994), which, together, collect many of Coase’s
(1990, p. 3).
most significant writings.
Upon completing his sUidies at the LSE, Coase
The lion’s share of Coase's work during the first
took up a position at the Dundee School of Eco-
part of his career dealt, in one way or another, with
nomics and Commerce, where he taught with his
film behaviour and organization. His earliest
friend and public choice pioneer Duncan Black
contributions analysed the formation of
from 1932 to 34. Coase moved on to the University'
of Liverpool in 1934-35 before returning to the
LSE, where he remained from 1935 until
Coase, Ronald Harry (Born 1910) 1735
producers’ expectations (for example, Coase and focus broadly, rather than narrowly, on benefits,
Fowler 1935), using the pig cycle as the case study. costs, and incentives.
The conventional cobweb theorem explanation for Coase’s work on public utilities also has an
these cycles was that producers expected current historical strand. Articles on the British Post Office
prices and costs to continue into the future. The discuss the rise of the penny postage in Great
adjustments in supply that resulted then gave rise to Britain under Rowland Hill and the attempts by the
disequilibrium cycles. Coase and Fowler found that Post Office to enforce its monopoly against
this explanation was incorrect - that producers did in inclusions by private entrepreneurs, including the
fact adjust their expectations of prices and costs messenger companies (for example, 1955). His
very quickly, and that the prediction errors arose study of British broadcasting analyses the devel-
from the difficulty of predicting variations in opment of wireless and wire radio broadcasting, as
demand and in foreign supply. This work was later well as of television broadcasting and the rise of the
cited by J.F. Muth ( , p. 21) in one of his BBC as the monopoly supplier of all of the above
classic papers on rational (1950, 1954). His interest in the government’s role
expectations. Coase also collaborated with Fowler in broadcasting carried over to the United States and
and Ronald Edwards on a series of pieces dealing an analysis of the role of the Federal
with the interrelations between accounting and Communications Commission (1959, 1966) in the
economics (for example, Coase 1938; Coase et al. allocation of broadcast frequencies. In fact, it was
1938). These writings, which were very much in the from this study that ‘The Problem of Social Cost’
LSE cost tradition, demonstrated that traditional came to be written.
accounting practices do not adequately capture the While the foregoing gives a sense for the
true (opportunity) nature of costs and also pointed to breadth of Coase’s contributions, it is unquestion-
the problematic nature of designing workable able that his most influential work is contained in
accounting methods to do so. two papers - ‘The Nature of the Firm’ (1937) and
Coase also wrote a number of articles dealing ‘The Problem of Social Cost’ (1960), the two works
with monopoly and imperfect competition, a few of cited by the Royal Swedish Academy in awarding
which bear mention of here. Two of his theoretical Coase the Nobel Prize. In the former, Coase set out
pieces are of particular import. ‘Durability and to explain why firms exist and what determines the
Monopoly’ (1972) demonstrated that a monopoly extent of a firm’s activities. He found the answer in
firm which produces a good that is infinitely durable a concept to which most economists had until
will be forced to sell the good at the competitive recently paid scant attention - transaction costs.
price, unless it can decrease the durability of the Coase suggested that we tend to see firms emerge
good or make contractual arrangements through when the cost of internal organization is lower than
which it promises to limit its production - a result the cost of transacting in the market, and that the
which has come to be known as ‘the Coase con- limit of a firm’s activities (or, the extent of internal
jecture’. ‘The Marginal Cost Controversy’ (1946) is organization) comes at the point where the cost of
Coase’s most significant work on monopoly and organizing another transaction internally exceeds
deals with public utility pricing and regulation. the cost of transacting through the market. Although
Abba Lemer and others had claimed that marginal published in 1937, ‘The Nature of the Firm’
cost pricing accompanied by a government subsidy attracted little attention until the early 1970s, when
is the efficient pricing policy for public utilities. Oliver Williamson, Armen Alchian, Harold
Against this, Coase argued that marginal cost Demsetz and others began to build on or take off
pricing is inferior to a system of multi-part pricing from Coase’s contribution to bring transaction
and may in fact be inferior to average cost pricing. costs, the contracting process, and firm organization
This paper, and three related papers that followed it, to the fore in economic analysis.
are illustrative of one of the central themes in ‘The Problem of Social Cost’ took the
Coase’s work - that, in assessing the efficiency of transaction-cost paradigm in a different direction -
economic outcomes, one must
1736 Coase, Ronald Harry (Born 1910)
the legal-economic arena and situations of conflicts Britain (1974) are excellent examples of Coase’s
over rights. Although ‘The Problem of Social Cost’ position here. When Coase looks at government, he
is one of the most cited articles in all of the sees agencies captured by special interests, making
economics and legal literatures, it has also been policies that usually make matters worse rather than
widely misunderstood. From this paper comes the better, and operating in virtual ignorance of the
now-famous Coase theorem - actually codified as virtues of the market. Yet a careful reading of Coase
such by George Stigler ( ) - which suggests that he is not ‘anti- government’ but,
says that when transaction costs are zero and rights rather, an advocate for economic theorizing and
are folly specified, parties to a dispute will bargain policymaking which recognizes that policy choices
to an efficient outcome, regardless of the initial are always among imperfect alternatives.
assignment of rights. But Coase recognized that the These criticisms are part of Coase’s more gen-
transaction costs are pervasive and will generally eral concern about the way that economists practice
preclude the working of this bargaining mechanism. their trade (1994). He is suspicious of consumer
Coase thus concludes that legal decision-makers theory as a whole and of the way in which
should assign rights so as to maximize the value of mathematical and quantitative techniques have been
output in society - a concept that lies at the heart of used in modem economics. His own writings
the modem law and economics movement (Medema evidence some graphs and some technical intuitive
; Medema and Zerbe ). analysis, but, reflecting Coase’s lifelong distaste for
The crux of ‘The Problem of Social Cost,’ using mathematics in his work, there is not an
however, is Coase’s attempt to demolish the equation to be found. Coase believes that
Pigovian tradition of social cost theory (Pigou ). economists are obsessed with what he calls ‘
The analysis that came to be known as the Coase blackboard economics’, an economics where curves
theorem was used to demonstrate that, under are shifted and equations are manipulated on the
standard neoclassical assumptions, Pigovian blackboard, with little attention to the
remedies for externalities are unnecessary: correspondence (or lack thereof) between these
costlessly functioning markets, like the costlessly models and the real-world economic system. This,
functioning governments of Pigovian welfare he says, has manifested itself in economists’
theory, will generate efficient outcomes. The ignorance of the role played by transaction costs
problem, as Coase pointed out, is that neither and economic institutions generally, and in an
markets nor governments function costlessly, and approach to public policy that fails to examine in
thus neither will generate optimal solutions. This any kind of depth the consequences of alternative
leaves policymakers with a choice among imperfect policy actions.
alternatives, and Coase advocates a close
examination of the benefits and costs associated
with the alternative policy options, in order to Coase and Chicago
facilitate the adoption of policies (including doing
nothing at all) which maximize the value of output. Coase’s critical attitude toward the practice of
That government failure is at least as pervasive economics does not stop at the doors of the Uni-
as market failure, and that economists are too quick versity of Chicago. Indeed, his close association
to advocate tax, subsidy, and regulatory solutions with the Chicago School belies a degree of tension
without a careful examination of the situation, are in the relationship and highlights the risks involved
recurring themes in Coase’s work. His analyses of in thinking in terms of a homogeneous Chicago
social cost issues, public utility pricing, and his school. In spite of his position as a founding father
classic article on role of the lighthouse in public of law and economics and, by extension, the
goods theory as against the actual history of private expansion of the boundaries of economics so
lighthouse provision in Great closely associated with Chicago, Coase has been
critical of economic imperialism
Coase, Ronald Harry (Born 1910) 1737
generally and of the economic analysis of law in 1946. The marginal cost controversy. Economica,
particular (Coase 1977, 1993). Coase’s interest is New Series 13: 169-182.
not the economic analysis of law, but rather the 1 95 0. British broadcasting: A study in monopoly.
study of how the legal system impacts the economic L on do n: L on gm an s , G r e e n an d C o.
system - old-style Chicago law and economics of 1954. The development of the British television
the sort being published in the Journal of Law and service. Land Economics 30: 207-222.
Economics in the 1960s and 1970s. As such, his 1955. The postal monopoly in Great Britain: An
interest and intellectual commonalities lie much historical survey. In Economic essays in com-
more with the older Chicago school of Frank memoration of the Dundee School of Economics
Knight and Jacob Viner than with the Becker- 1931-1955, ed. J.K. Eastham. London: William
Stigler-Posner generation, and he has a much Culcross and Sons.
greater interest in the new institutional economics1959. The Federal Communications Commission.
(of which he is also regarded as a founding father) Journal of Law and Economics 2: 1-40.
than in the modem economic analysis of law 1960. The problem of social cost. Journal of Law
movement a la Richard Posner. Coase has been and Economics 3: 1-44.
chastised by Posner ( ) on 1966. The economics of broadcasting and govern-
this and other counts, but he remains unapologetic. ment policy. American Economic Review 56:
That Coase has a place within the Chicago tradition 440-447.
goes without saying, but he has also remained his 1972. Durability and monopoly. Journal of Law and
own man - dissenting from the received doctrine Economics 15: 143-149.
when it did not fit with his views. 1974. The lighthouse in economics. Journal of Law
and Economics 17: 357—376.
1977. Economics and contiguous disciplines. In
The organization and retrieval of economic
knowledge, ed. M. Perlman. Boulder: Westview
See Also Press.
1988. The firm, the market, and the law. Chicago:
► Ciicagc
University of Chicago Press.
► Chicago ■
1990. Accounting and the theory of the firm.
► Coas° Th eorem
Journal of Accounting and Economics 12: 3-13.
1992. The institutional structure of production.
American Economic Review 8 2: 7 13 - 71 9.
1993. Law and economics at Chicago. Journal of
Selected Works Law and Economics 36: 239-254.
1994. Essays on economics and economists.
1935. (With R.F. Fowler.) Bacon production and Chicago: University of Chicago Press.
the pig-cycle in Great Britain. Economica, New
Series 2: 142-167. Bibliography
1937. The nature of the firm. Economica, New
Series 4: 386^105. Medema, S.G. 1994. R o n a l d H . C o a s e . London/New
1938. Business organisation and the accountant. In York: Macmillan/St. Martin’s Press.
Medema, S.G. 1999. Legal fiction: The place of the Coase
Studies in costing, ed. D. Solomons. London: theorem in law and economics. E c o n o m i c s a n d
Sweet and Maxwell, 1952. P h i l o s o p h y 15: 209-233.
1938. (With R.S. Edwards and R.F. Fowler.) Medema, S.G., and R.O. Zerbe Jr. 2000. The Coase theorem.
Published balance sheets as an aid to economic In The encyclopedia of law and
e c o n o m i c s , ed. B. Bouckaert and G. De Geest
investigation'. Some difficulties. London: Aldershot: Edward Elgar Publishing.
Accounting Research Association Publication
No. 3.
1738 Cobb-Douglas Functions
Muth, J.F. 1961. Rational expectations and the theory of price manipulated and to the fact that it possesses the
movements. E c o n o m e t r i c a 29: 315-335. minimal properties that economists consider
Pigou, A.C. 1932. T h e e c o n o m i c s o f w e l f a r e . 4th
ed. London: Macmillan.
desirable. It appeared early (at least by 1916; see
Posner, R.A. 1993. Ronald Coase and methodology. J o u r - Wicksell , p. 133), notably in the theory of
n a l o f E c o n o m i c P e r s p e c t i v e s 7 (4): 195-210. distribution where it was used to prove the adding-
Stigler, G.J. 1966. T h e t h e o r y o f p r i c e . 3rd ed. New up theorem of factor shares when the production
York: Macmillan.
Williamson, O.E. 1975. Markets and hierarchies: Analysis and
elasticities sum to unity. It is the first form that
antitrust implications. New York: Free Press. many embryonic mathematical economists squeeze
and buffet to obtain nice expressions for marginal
products and utilities. It has been applied
econometrically countless times, still surprising
Cobb-Douglas Functions people that it can explain the data so well
(Mairesse ). It forces itself into relatively new areas
Murray Brown such as frontier production functions (see Forsund
et al. ). And it has been used both as a utility and
production function in analyses of growth,
development, macroeconomics, public finance,
labour and just about any other applied area in
Abstract
economics. Yet it possesses restrictive properties
Perhaps the most common form of production
and perhaps for that reason it has become for some
function in economics, the Cobb-Douglas
an object of disdain, often regarded as a child’s toy
function has a range of attractive properties. The
in the world of real economics. But for others, the
input demand and supply of output functions
Cobb-Douglas is at least a venerable form and,
have the property of continuous differentiability
effectively, it and its putative inventor are regarded
everywhere on their respective domains; and the
fondly.
form has a function coefficient that is identical
In its unrestricted form, the Cobb-Douglas can
to its degree of homogeneity, calculated by
be written as f(x) = i=lxf, where A is an efficiency
summing the factor production elasticities. Its
parameter, a, is the elasticity of/(x) with respect to
restrictions have made it an object of disdain for
x,, and x is confined to /?"++. Defining the x, as goods
some. But the Cobb-Douglas form is remarkably
consumed, it has been used as a utility function;
robust in a vast variety of applications and is
defining them as inputs in the production process, it
therefore very likely to endure.
is a production function; as normalized prices, it is
an indirect utility function; and so on. We focus
here on its use as a production function for a single
Keywords output.
Aggregation (production); CES production A large part of the appeal of the form stems
function; Cobb, C.; Cobb-Douglas functions; basically from the fact that if 0 < a, < l,./(x) is
Douglas, P. H.; Elasticity of substitution; Factor strongly pseudo-concave on its domain. That entails
substitution; Frontier production functions; that if the firm is a profit maximizer and factor
Production functions; Technical change; Walras, supply and product demand functions are
L.; Wicksell, J. G. K.; Wicksteed, P. H. continuously differentiable on their domains, then
the input demand and supply of output functions
have the immensely useful property of continuous
JEL Classifications differentiability everywhere on their respective
E23 domains. Also, if Y.,a, < 1 and if factor supply and
product demand functions are well-behaved, the
The Cobb-Douglas function is perhaps the most
input demand functions are downward sloping with
ubiquitous form in economics, owing its popularity
respect to own price and the output supply
to the exceptional ease with which it can be
Cobb-Douglas Functions 1739
function does not slope downward with regard to the extremely restrictive conditions that the
product price. What could be better and, moreover, expansion paths for each firm are parallel (i.e. if cit.
it is all so simple to demonstrate. = cit = Cj for each i and for all r and /), and that the
Another attractive property of the form is that it first order conditions are satisfied, the R functions
has a function coefficient that is identical to its consistently aggregate to
degree of homogeneity, calculated by summing the
factor production elasticities. Thus, 52,-a,- 1
for all i easily and succinctly characterizes
decreasing, constant and increasing returns to scale, a nicely behaved aggregate production function.
respectively. This characteristic also has important There is another way to look at the aggregation-
implications for the cost, profit and revenue duals of across-firms problem that involves the Cobb-
the production function. For example, the cost Douglas function. Suppose that factors in each firm
function of a price-taking firm which has a Cobb- are used in fixed proportions with the Leontief
Douglas technology decomposes into two parts, one coefficients being distributed across all firms
a linear homogeneous function of factor prices and according to a Pareto distribution. Then a surprising
the other a function of output cj, that is C(q, w) = B\ result by Houthakker (see Sato ) i s that the
{ i=iwfqc0, where if is a positive constant, w is a aggregate production function of the industry is a
(positive) price vector of the inputs, ct = cq/yZ^i and Cobb-Douglas form.
CO = 1 fEjClj. Of course, there is a price for these desirable
The list of attractive properties extends to the implications and most of it is owing to the fact that
aggregation problem since the Cobb-Douglas is the Cobb-Douglas technology entails that the elas-
homogeneous and weakly separable. First consider ticity of substitution takes on the knife-edge value
the question of aggregation across inputs. Suppose of unity. If there is no technological change, a unit
one can write a generalized Cobb-Douglas function substitution elasticity implies that the income shares
as follows: of all factors of production remain constant in the
face of changes in things that are deemed germane
such as saving, the rate of growth of the economy
«= n( nv ? ) r ' and relative factor supplies. Only the state of the
technology matters in this instance, a highly
where by = ciy/ZZ/i.y, Ys = 'iZjasj , Js is the number disputable outcome. When technological change is
of factors in the sth group, S is the number of allowed to proceed in a Cobb-Douglas world, it is a
groups, 5=1, 2,..., S, and j = 1, 2,..., Js. Notice that fact that Hicks-, Solow- and Harrod- neulral
Yjjbsj = 1. Since each expression in the parentheses technological change are equivalent, thus blurring
is homogeneous of degree one for each s, the profit these distinctions. Another implication of the unit
maximization procedure can be decomposed into substitution elasticity of the (linear homogeneous)
two stages and there exist quantity and price Cobb-Douglas form is that, used in growth models,
indexes (call them xs and Ws, respectively) such that it guarantees the existence and stability of
the expenditure on the sth group is WsXs for s = equilibrium growth, again obscuring an important
1,2,..., S. problem in economics.
With respect to aggregation across firms, sup- Furthermore, it is a fact that the Cobb-Douglas
pose the rth firm’s production function were form requires that each factor of production be
essential in the sense that no factor may be
q, = Arx\l;xcz;, completely substituted for another. Hence the
domain of the function must be confined to the set
where = 1 and i = 1, 2, ..., R. It is evident that the of strictly positive real numbers. This is not
expansion paths for all firms are straight lines particularly disturbing for situations in which the
through their respective origins. Then under factors can be taken to be large aggregates but it
does limit the analysis in other contexts.
1740 Cobb-Douglas Functions
Technological change is represented in the specifies each variable as z(A) = (zA - 1 )/A for A / 0
Cobb-Douglas by changes in the efficiency and z(A) In z for A 0. Then, applying this
parameter A which are Hicks neutral, by changes in transformation to the production function, we would
the scale of the factor inputs which are factor have z0 = q and z;- = x, for all i. Thus, if the z/£ (k =
augmenting and also Hicks neutral, and by changes 0, 1, ..., n) are related linearly, the transformation
in the elasticities of production which may be Hicks turns out to be a useful procedure in econometrics to
non-neutral. However, the unit elasticity of treat the general problem of functional form, an
substitution is restrictive in still another way: it important special case of which is the Cobb-
cannot represent a technological advance that Douglas.
results in a change in the ease of substitution among In sum, though it is restrictive and sometimes
factors of production. regarded as an economic toy, the Cobb-Douglas
What is the form’s provenance? It is generally form is remarkably robust in a vast variety of
attributed to Paul Douglas and although he grace- applications and that it will endure is hardly in
fully acknowledged (Douglas ) that question.
Wicksteed and Walras were cognizant of it, he
neglected to add Wicksell’s name to the list. Be that
as it may, Douglas relates in his gentle comments
See Also
that in 1927 he asked a professor of mathematics,
Charles Cobb, to devise a formula that could be
used to measure the comparative effect of each of
two factors of production upon the total product to
satisfy a linear log-log relationship in his input and
output data. His work encountered a host of
theoretical concerns (see Brown for a discussion)
aside from the capital, output and labour measures Bibliography
for which he was faulted. But the production form
remained in spite (or perhaps because) of its Brown, M. 1966. On the theory and measurement of tech-
nological change. Cambridge: Cambridge University Press.
restrictive properties.
Brown, M., and J. De Cani. 1963. Technological change and
Subsequent work has demonstrated that the the distribution of income. International
Cobb-Douglas is a special case of a variety of forms E c o n o m i c R e v i e w 4 : 289-309.
and approaches. The constant elasticity of Christensen, L.R., D.W. Jorgenson, and L.J. Lau. 1973.
substitution (CES) production function is perhaps Transcendental logarithmic production frontiers.
R e v i e w o f E c o n o m i c s a n d S t a t i s t i c s 55: 28-
the most well known of the forms that yield the 45.
Cobb-Douglas as a special case, either by using Douglas, P.H. 1948. Are there laws of production?
L’Hopital’s rule when the elasticity of substitution A m e r i c a n E c o n o m i c R e v i e w 38: 1-41.
goes to unity or it can be derived from certain Douglas, P.H. 1967. Comments on the Cobb-Douglas
production function. In T h e theoiy and
expressions used in deriving the CES function (see e m p i r i c a l a n a l y s i s o f p r o d u c t i o n , edited by.
Brown and De Cani ). Parenthetically, the CES, M. Brown, National Bureau of Economic Research,
itself, is known to mathematicians as a Studies in Income and Wealth No. 31. New York:
mean of order t [i.e. (XI \ I 'I) 'for TF=- 0] so that, if
A X Columbia University Press.
Forsund, F.R., C.A.K. Lovell, and P. Schmidt. 1980. A survey
one takes the limit as > 0, of course, the Cobb- of frontier production functions and of their relationship
Douglas emerges. Also, it can be derived from the to efficiency measurement. Journal of
translog production form (Christensen et al. ) and E c o n o m e t r i c s 13: 5-25.
Mairesse, J. 1974. C o m p a r i s o n o f p r o d u c t i o n
many others, besides, by judiciously restricting
f u n c t i o n e s t i m a t e s . Paris: Institut National de la
certain parameters. A different approach to the Statistique et des Etudes Economiques.
derivation of the Cobb-Douglas form has been Sato, K. 1975. P r o d u c t i o n functions and
taken by P. Zarembka ( ), who a g g r e g a t i o n . Amsterdam: North-Holland.
Wicksell, K. 1958. S e l e c t e d p a p e r s o n e c o n o m i c
t h e o r y , ed. Erik Lindahl. Cambridge, MA: Harvard.
Cobbett, William (1763-1835) 1741
Zarembka, P. 1987. Transformation of variables in econo- itself as a means of appropriating the product of
metrics. In T h e n e w P a l g r a v e : A d i c t i o n a r y ’ labour but most importantly, with the return to cash
o f e c o n o m i c s , ed. J. Eatwell, M. Milgate, and P.
Newman, vol. 4. London: Macmillan.
payments in 1819, it created a situation where the
nation was compelled to pay in an appreciated
medium of exchange debts contracted in a depre-
ciated paper currency. The industrious were there-
fore forced to pay in gold what had been contracted
Cobbett, William (1763-1835) in paper. Thus both the inflation of the Revolution-
ary and Napoleonic wars and the deflation that
N. W. Thompson followed were seen by Cobbett as redistributing
wealth in favour of the idle generally and the
fundholders and bankers in particular. Justice
demanded, therefore, that there should be an equi-
William Cobbett was bom, appropriately, at the table adjustment in the level of taxation to take
‘Jolly Farmer’ in Famham, Surrey, in 1763. He was account of the appreciation in the value of money.
by turns soldier, clerk, teacher, joumahst and For this, parliamentary reform and an end to polit-
pohtical agitator but whether in his early literary ical corruption were necessary prerequisites.
career as an anti-Jacobin pamphleteer or later in his Under the undifferentiated heading of ‘The
role as combative radical his voice was that of the Thing’, Cobbett sought to attack all those who
small farmer threatened by the forces of economic jeopardized his vision of a stable, hierarchically
and social change which characterized the early structured, rural economy and society, dominated by
phases of Britain’s industrial revolution. the independent yeoman farmer and free from ‘the
Cobbett’s acerbic castigation of its major the- all-devouring Jew and tax-eater’. Specifically, he
orists shows he had little time for political economy condemned ‘the Funding and Manufacturing and
and he would certainly have greeted with a wry Commercial and Taxing System’ for their tendency
guffaw his inclusion in a dictionary of economic to draw ‘wealth into great masses ... [and] man also
thought. Nevertheless it was the case that much of into great masses’, like ‘the Great Wen’ [London],
his writing in the Twopenny Register (1816-20) and which both corrupted its inhabitants and
works such as Paper against Gold (1815) was given impoverished the country as a whole.
over to a discussion of economic questions and in Cobbett’s was an anti-industrial, anti-
particular the economic difficulties that confronted commercial, anti-urban and anti-City pohtical
Britain in the post-Napoleonic war period, and even economy with its ideological roots in the pohtical
in his Rural Rides (1830) few opportunities were radicalism of the eighteenth century and with
lost of discoursing on matters economic. Paine’s Decline and Fall of the English System of
For Cobbett material impoverishment was a Finance (1797) as its basic text. Yet if, with the
consequence of the pohtical decisions affecting growth of industrial capitalism, this ‘old corruption’
taxation, the Debt, the convertibility of the currency critique became increasingly irrelevant and
etc., which emanated from a Parliament corrupted inadequate; if by the 1830s it was being supplanted
by the influence of tax-gatherers, ‘Change Alley in the working-class press by anticapitalist and
men, sinecurists, placemen, Jews and Borough- socialist analysis of contemporary ills which
tyrants. Crucial here was the passage of the Bank deployed to critical effect the tools and concepts of
Restriction Act of 1797, which in suspending cash pohtical economy, this should not obscure the
payments by the Bank of England had created a remarkable longevity of Cobbettian ideas. Thus
‘Paper money system’ that robbed the ‘industrious’ even a superficial survey of Chartist literature will
of their ‘earnings’ producing ‘that monster in civil show just how indelible was the mark left by the
society, starvation in the midst of abundance’. This Twopenny Register upon a generation of pohtical
paper money system was seen in radicals. In a sense too Cobbett’s vision of a pastoral
England peopled
1742 Cobclen, Richard (1804-1865)
by free-born, hearty independent cultivators will Manchester School; McCulloch, J. R.; New
remain indestructible, drawing sustenance as it does Poor Law; Pacifism; Tooke, T.
from an ineradicable human craving for the
permanence, the certainty, the order and the sta-
bility which rural self-sufficiency seems to offer. JEL Classifications
B31
Selected Works Cobden led the campaign that repealed the Com
Laws in 1846, after which there was free trade in
1802-36. Cobbett spolitical register. The paper was grain. The son of a Middlesex farmer, he sought his
published throughout this period but the cheap, fortune in Manchester, became an owner of a mill
popular post-Napoleonic war edition of the that employed 2,000 workers and was noted for
paper referred to as Cobbett’s Twopenny excellence of its calicoes. At 35, he was a rich man.
Register ran from 12 October 1816 until 20 July His calling, however, was politics. After taking
1820. part in the successful effort to incorporate Man-
18 15 . Paper against Gold and Glory against chester, he entered the movement against the Com
prosperity. L on do n. Laws in 1838. Until then it had been conducted by
18 30 . Rural rides in the counties of Surrey, Kent, middle class radicals and various business interests,
Sussex___Lo nd on . among them the Manchester Chamber of
Commerce. Cobden, John Bright, and others like
References them wanted to enlarge the movement, make it bold
and uncompromising. They were exasperated by the
Beer, M. 1953. .1 h i s t o t y o f B r i t i s h s o c i a l i s m , vol. businessmen who so wanted to look respectable that
2. London: Allen & Unwin.
they could not see where their interest lay. Thomas
Cole, G.D.H. 1947. T h e l i f e o f W i l l i a m C o h h e t t ,
3rd ed. London: Home & van Thai. Tooke had said the same about the London
Cole, G.D.H. 1977. .1 history of socialist thought. 5 vols. Vol. 1, merchants, when on their behalf he drafted the
Socialist thought: the Forerunners, 1789-1850. London: celebrated petition of 1820 for free trade and they
Macmillan.
were reluctant to sign it.
Green, D. 1983. Great ('obbett: The Noblest agitator.
Oxford: Oxford University Press. The militants of Manchester formed the
Sambrook, J. 1973. W i l l i a m C o b b e t t . London: National Anti-Corn Law League and agitated for
Routledge & Regan Paul. free trade up and down the country. They become
Spater, G. 1982. William Cobbett: The poor man's friend. known as the Manchester School of Economics and
vol. 2. Cambridge: Cambridge University Press.
Thompson, N.W. 1984. T h e p e o p l e ' s s c i e n c e : T h e
were celebrated as arch advocates of laissez- faire.
popular political economy of exploitation AcUially they were a coalition of diverse interests
a n d c r i s i s . Cambridge: Cambridge University Press. that agreed on only one issue - repeal of the Com
Laws - and each did so for its particular reasons.
Cobden's reason was peace. He believed free
trade would break down national barriers and give
everyone a material interest in avoiding war. This
Cobden, Richard (1804-1865) was not an argument gotten up for the occasion but
the expression of a view he had long held. When
William D. Grampp young he wrote two long tracts on foreign policy
which denounced alliances among nations and
political engagements of all kinds, decried the idea
of a balance of power, was especially disapproving
Keywords of colonics, then went on to extol
Anti-Corn Law League (UK); Hank of England;
Bright, J.; Cobden, R.; Colonialism; Corns
Laws; Factory Acts; Free trade;
Cobweb Theorem 1743
free trade as the way to peace and its guarantor. Selected Works
Years later, after he and Bright had brought down
the Com Laws, he told him, ‘I have always had an 1835. England, Ireland and America, by a Man-
instinctive monomania against the system of foreign chester Manufacturer. L on do n.
interference, protocolling, diplomatising, etc.’ 1836. Russia, by a Manchester Manufacturer.
That scarcely expressed the horror he had of E d i nb u rg h.
violent action, even the suggestion of it. When the 1 84 9. Speeches on Peace, Financial Reform and
southern states of America seceded, he thought Other Subjects. L on do n.
Lincoln was wrong in bringing the issue to battle 1867. Political Writings. London: Ridgeway.
although he had no sympathy with them (except 1868. Speeches on Questions of Public Policy, 2
their fondness for free trade). He was shocked by vols, ed. J. Bright and J.E. Thorold Rogers.
the massacres in India and was opposed to wars of Oxford: Oxford University Press.
independence and to revolution. He thought duel-
ling was barbarous, was against capital punishment, Bibliography
objected to boxing, couldn’t stand brass bands, and
asked the Pope to prohibit bull fighting in Spain. He Ashworth, H. 1877. Recollections of Richard Cobden, M.P., and
favoured free trade so long as its effect was the Anti-Corn Law League. London: Cassell. Morley, J. 1881.
The Lije of Richard Cobden. London: Fisher Unwin.
peaceful, as he believed it usually was, but when he Ritchie, J.E. 1865. T h e L i j ' e o f R i c h a r d C o b d e n .
believed it was not he quickly put it aside. He London: Ward and Lock.
opposed the sale of foreign bonds in the London
market if the proceeds were to be used to buy arms.
‘No free trade in cutting throats’, he said.
Pacifism, not laissez-faire, was Cobden’s
guiding principle; and he applied laissez-faire less
to domestic than to foreign markets. He did not care
for the Factory Acts but only spoke, never voted,
Cobweb Theorem
against them. He approved of increasing the
monopoly powers of the Bank of England and of B. Peter Pashigian
regulating aspects of railway construction. He had
no use for the New Poor Law, of which most
economists of the day approved, and spoke
derisively of McCulloch’s ‘usual dogmatism’. But
Abstract
he carefully read the latter’s edition of the Wealth of
The cobweb theorem purports to explain
Nations and wrote in the margins of the chapters
persistent fluctuations of prices in selected
that moved him. His notes are especially lively
agricultural markets. It was first developed in
where Smith condemns the colonial policy of Great
the 1930s under static price expectations where
Britain. However, where he describes the operation
the predicted price equalled actual price in the
of the invisible hand, the margins are quite
last period. Muth’s rational expectations
untouched.
hypothesis posited that forecast errors will not
be serially correlated and the pattern of past
forecast errors cannot be used to improve the
See Also
accuracy of the forecasts. The fundamental
► Nv. T : ;>ni question of whether observed price cycles are
► J.GI v better explained by systematic errors in price
forecasts or by the cumulative impact of
unpredictable shocks has not as yet been
definitively addressed.
1744 Cobweb Theorem
The length of the cobweb price cycle is deter- dubious assumption of a continued supply of
mined by the length of the production process. If it entrepreneurs standing ready to dissipate their
takes one year to bring a fattened hug to the market, capital. The critics were also disturbed by the
then the complete price cycle should take two years. ambiguity of whether the supply curve is of the
At first, little attention and superficial explanations short-or long-run variety, and the failure to clarify
were given to explain why the predicted length is how the adjustment from the short-run to the long-
often shorter than the actual length of the price run supply curve is made. These early criticisms
cycle. It was left to the critics to point out these and ambiguities aside, references to the cobweb
discrepancies. theory continued to appear in textbooks.
The critics are responsible for the other strand of Nerlove’s paper ( ) briefly rekindled the
the literature. They appeared early but were not controversy. His purpose was to resurrect the theory
very influential at first although their criticisms and show that it could explain price behaviour if
were ultimately given more weight. The critics adaptive expectations were employed. Mills ( )
questioned the rationality of using an arbitrary criticized the use of adaptive and other
expectations mechanism by otherwise profit- autoregressive expectations mechanisms in the
maximizing agents, and pointed out that the theory deterministic model because they implied a simple
implies that producers would expect to lose wealth pattern of forecast errors that producers could
if they entered and remained in an industry with a detect, incorporate into their forecasts and thereby
cobweb price cycle. In a perceptive article on the improve the accuracy of their price forecasts. While
pig cycle in England, Coase and Fowler ( ) Nerlove’s suggestion did rectify one limitation of
questioned the realism of static expecta the cobweb theory, it did not address the critical
tions. They showed that the price of a bacon issue of why producers relied on any particular
(mature) pig less the cost of feeding for the next forecasting mechanism. Muth ( )
five months and less the cost of a feeder (young) developed the implications of rational expectations
pig, which would be stable in a competitive market for cobweb theory in his now famous paper. Muth
if farmers had static expectations, fluctuated over postulated that expectations were the predictions of
time. Hence the empirical evidence contradicted the the economic structure of the market and
assumption of static expectations. They presented incorporated all available information. Under
evidence that pig breeders reacted quickly to a certain conditions the predicted price equals the
change in expected profits, and this implied that the conditional expectation of price, given currently
pig price cycle should be only two years instead of available information. Adaptive expectations can be
the observed four-year period. The fluctuation in rational only under special conditions, and the
the profits per pig was attributed to the difficulty of coefficient of adaptation is determined by the values
predicting both demand and foreign imports. The of the slopes of the demand and supply curves.
Coase-Fowler paper advanced, if only in faint The rational expectations formulation has
outline, the essence of the rational expectations powerful implications for cobweb theory. If the
hypothesis which was to blossom some 35 years price forecasts incorporate all available information
later. They hinted that anticipated prices would not and are on average correct, then forecast errors will
be formed in a mechanistic way because profits not be serially correlated and the pattern of past
would be higher the more accurate are the forecasts. forecast errors cannot be used to improve the
Prediction errors were due to the difficulty of accuracy of the forecasts. Moreover, what is then
predicting shifts in demand and in foreign supply. left of the supposed ability of the cobweb theory to
Buchanan’s paper ( ) criticized the cobweb explain the cyclical behaviour of prices? Price
model because it implied that producers suffer fluctuations would have to be explained either by
aggregate losses over the price cycle when output is the cyclical pattern of exogenous variables or by the
determined by the long-run supply curve. He summation of random shocks (Slutsky ). Muth’s
pointed out that the theory was based on the paper represents a
1746 Coddington, Alan (1941-1982)
frontal attack on the traditional cobweb model. He Ezekiel, M. 1938. The cobweb theorem. Q u a r t e r l y
notes that the traditional model tends to predict a J o u r n a l o f E c o n o m i c s 52: 225-280.
Freeman, R.H. 1971. T h e m a r k e t f o r C o l l e g e -
shorter price cycle than is observed and indicates t r a i n e d m a n p o w e r . Cambridge, MA: Harvard
that the rational expectations version predicts a University Press.
longer price cycle. Hoffman, D.L., and P. Schmidt 1981. Testing the restrictions
implied by the rational expectations hypothesis.
Interest in the cobweb model has ebbed in
J o u r n a l o f E c o n o m e t r i c s 15: 265-287.
recent years and few articles on it have appeared in Kaldor, N. 1934. A classificatory note on the determinateness
the major journals. Economists have found it more of equilibrium. R e v i e w o f E c o n o m i c S t u d i e s 1:
rewarding to apply the rational expectations 122-136.
Mills, E.S. 1961. The use of adaptive expectations in stability
hypothesis to areas like monetary or business- cycle analysis: Comment Q u a r t e r l y J o u r n a l o f E c o -
theory than to the study of particular markets, even n o m i c s 75: 330-335.
though the analysis of markets with inventories Muth, J.F. 1961. Rational expectations and the theory of price
raises issues that are just as difficult and subtle. The movements. E c o n o m e t r i c s 29: 315-335.
Nerlove, M. 1958. Adaptive expectations and cobweb
question of whether the cobweb does or does not phenomena. Quarterly Journal of
explain price cycles has not really been resolved. E c o n o m i c s 72: 227-240.
Freeman ( ) has suggested that Pashigian, B.P. 1970. Rational expectations and the cobweb
theory. J o u r n a l o f P o l i t i c a l E c o n o m y 78:338-
the traditional cobweb model explains cycles in the
352.
markets for lawyers, physicists and engineers. Tests Slutzky, E.E. 1937. The summation of random causes as the
of the rational expectations hypothesis have been source of cyclical processes. E c o n o m e t r i c a 5: 105-
suggested by Pashigian ( ) when expec 146.
Waugh, F.V. 1964. Cobweb models. J o u r n a l o f F a r m
tations data are available and by Hoffman and
Schmidt ( ) when expectations data are
unavailable. So the methodology exists for
distinguishing between the competing hypotheses.
Few econometric tests have been made of the Coddington, Alan (1941-1982)
rational expectations hypothesis in markets where
the assumptions of the cobweb model apply. The Victoria Chick
fundamental question of whether observed price
cycles are better explained by systematic errors in
price forecasts or by the cumulative impact of
unpredictable shocks has not as yet been Bom in Doncaster, Yorkshire, Coddington began
definitively addressed. his academic career with a degree in physics at
Leeds University. After a year teaching mathemat-
ics in a school in York, he returned to university in
that city for his D.Phil. in Economics. On taking his
See Also degree in 1966 he was appointed Assistant Lecturer
in Economics at Queen Mary College, London,
► Rational Expectations where he rose steadily to become Professor in 1980.
His theoretical work has three main strands: an
early interest in the theory of bargaining, resulting
Bibliography in several articles and a much-respected book
Buchanan, N. 1939. A reconsideration of the cobweb theorem. (1968); various aspects of environmental eco-
J o u r n a l o f P o l i t i c a l E c o n o m y 47: 67-81. nomics; and, continuing throughout his career,
Coase, R.H., and R.F. Fowler. 1935. Bacon production and the methodology and the history of 20th-century eco-
pig-cycle in Great Britain. E c o n o m i c a 2: 142-167.
nomic thought.
Codetermination and Profit-Sharing 1747
choice, in the pursuit of policy targets such as management, when workers receive detailed
distributive fairness, employment, price stability, and credible information and participate in
efficiency and growth. decision making, identifying themselves
The same combination of private interest and partly with the enterprise and above all
government policy determines the degree of lengthening their time horizon in view of
workers’ participation in decision-making processes continued participation in decision-making
(codetermination) and in the performance (profit- (Aoki ; Cable ; Fitzroy and Mueller ). Of
sharing) of enterprises (for a bibliographical survey, course conflicts within the firm are made
see Bartlett and Uvalic ). more tractable by the introduction of
codetermination but afterwards are bound to
reappear over time (Furobotn ); also there
Codetermination remains a basic conflict between employed
and unemployed workers which may even be
Employee participation in enterprise decision- exacerbated by the employment protection
making in cooperatives amounts to full entrepre- policies conceivably encouraged by those
neurship through participation in assemblies, the already employed in their exercise of
election of representative organs and involvement in codetermination.
the appointment of managers. In other enterprises it (iii) The greater correspondence between workers’
takes the form of access to information and right to powers and responsibilities, code- termination
consultation, participation in decisions on being the counterpart of workers’ exposure to
conditions and organization of work and on internal enterprise risks. The very fact that workers,
social questions, through a workers’ council or unlike capitalists, cannot diversify between
similar organ; right up to the minority (or even different enterprises when selling their
parity) participation and vote in the board of services exposes them to an employment and
directors of a joint-stock company (as in German income risk which induces them to make a
Mitbestimmung; see Nutzinger ) with a possibility claim to control; a claim which up to a point
of influencing decisions about employment, the the employer may prefer to accept instead of
level and structure of investment and other crucial granting higher wages or longer tenure.
factors were the other board members to be
sufficiently divided.
The effects of codetermination are three-fold: Profit-Sharing
(i) The reduction in labour disutility obtainable In pre-capitalist systems workers’ participation in
when workers have a say in the division of the results of their enterprise took the forms - now
labour and work organization, since enter- little used - of sharecropping in agriculture and of
prises may neglect workers’ preferences about sliding scales (indexing wage rates to the price of
the specific uses to which their labour is put or the product), for instance, in English coalmines. In
at any rate respond to the needs of a modem capitalism such participation - for which
hypothetical average worker: if the number of ‘profit-sharing’ is a shorthand label - takes the form
enterprises is not large enough, workers’ of cooperatives’ net revenue sharing, production
control is necessary to reduce disutility and prizes based on group or overall performance,
alienation. The effect of workers’ control on participation in gross/net revenue/profit, share
productivity has an indeterminate sign options, participation in investment funds and pay
(Pagano ). increases graded according to productivity growth.
(ii) The reduction of the number and intensity of The effects of an element of profit-sharing in
conflicts in the workplace in general and, in labour earnings are three-fold:
particular, the more likely acceptance by
workers of unpopular decisions by
Codetermination and Profit-Sharing 1749
(i) An expected increase in labour productivity. firms. Vanek finds that higher employment
This is not due to workers gaining from the will be associated with higher aggregate
product of individual extra-effort (as in the income, lower prices (because of higher out-
case of piece-rates) since each of n workers put), higher export volume and domestic
employed will only get at most 1 hi of the import substitution (with undetermined effects
product of his own extra-effort (Samuelson on the balance of payments depending on
) and on the contrary may reduce effort price and income elasticities), lower after-tax
if he can, being exposed to at most only 1/;? of and after-labour-share profits and higher
the output loss from his own lower effort. The labour-share in national income (Vanek ).
productivity gain can be expected from
workers, costlessly to themselves, gaining Rediscovering Vanek’s macroeconomic benefits
from intelligent and effective use of any given from profit-sharing (though not its impact on net
individual level of effort, from cooperating profits and relative income shares), Weitzman
with other workers and management and from claims that these benefits are neglected by indi-
monitoring and supervising each other’s vidual firms, as in other instances of ‘public goods’,
effort, efficiency and cooperation (Reich and ‘externalities’ and ‘market failures’, therefore
Devine ; Fitzroy and Kraft ). necessitating public policy measures (Weitzman ,
(ii) Cyclical flexibility of labour earnings and ). However, there is no
therefore greater stability of profit levels and reason why a firm should object to granting a given
rates. Employment will not be stabilized increase in earnings under the guise of a profit-
during the cycle by labour earnings flexibility share instead of an equivalent fixed amount unless
obtained through profit-sharing because the that represents forced insurance against profit
marginal cost of labour to firms - i.e. the fixed variability; and why workers - at least at the level of
component of pay - does not vary nation-wide collective bargaining - should not take
automatically. Workers, who are normally into account the potential employment and price
risk-averse, will prefer a fixed sum of money stability benefits of this formula and offset them
to a profit-sharing formula of equivalent against the greater variability of their earnings in
amount while employers, who are normally between negotiations, due to both cyclical factors
risk-lovers, may or may not prefer greater and random factors affecting their firm’s
stability of profit rates (according to their performance.
actual attitude to risk and the alternative cost Contrary to Weitzman’s belief, in fact, profit-
of reducing risk through diversification) to the sharing is not absolutely superior to wage contracts.
point of granting higher average earnings on a For workers, profit-sharing transforms the
profit-sharing formula than a fixed wage to probability distribution of uncertain employment at
mutual advantage. Therefore profit- sharing is a fixed and certain income into a probability
favoured primarily in risky ventures; distribution of employment with a higher mean
otherwise on this ground alone profit- sharing (because of lower marginal cost of labour) but no
would be favoured by firms only in a less variable over the cycle, at a more variable
recession (when workers would only accept it income (both over the cycle and for other factors
as an alternative to a permanent wage cut) and affecting dispersion of enterprise performance) and
by workers only during a boom (when firms at a higher (real) mean. For firms it transforms a
would only accept it as an alternative to a more into a less variable probability distribution of
permanent wage increase). money profit rates around the same mean (or a
(iii) Higher level of labour employment, for a lower mean if workers are protected from actual
given level of labour earnings with respect to losses; the effect on real profit rates depending on
a fixed wage regime, due to the lower accounting conventions and choice of numeraire).
marginal cost of labour to profit-sharing In the pursuit of greater employment and price
1750 Codetermination and Profit-Sharing
stability of course a government may grant tax relief and labour; minor forms of codetermination (or
to shared profits, just as effectively and with just as conversely of profit-sharing) tend to go hand in
much reason as it may subsidize the marginal cost hand with minor forms of profit-sharing (or of
of labour to firms under a wage regime. Otherwise codetermination); a high degree of one without the
there is no reason why profit- sharing should be other is virtually unknown.
forced upon unwilling workers and firms by well- The combination of 100 per cent codetermina-
meaning reformers, beyond the extent they are tion (= self-determination) and 100 per cent profit-
prepared to consider in their market transactions. sharing ( net revenue sharing) obtained in
These propositions are developed further below (see cooperative firms, according to conventional
also Nuti , ). literature, is subject to economic stimuli of a
somewhat ‘perverse’ kind. These are primarily:
restrictive employment ( membership) policies;
Interdependence Between destabilizing and Pareto-inefficient reactions (or at
Codetermination and Profit-Sharing best inelasticities) to price changes and technical
progress; a low propensity towards self-financed
The respective effects of codetermination and of investment (Ward ; Vanek ). In empirical studies of
profit-sharing are not independent. The productivity cooperative firms there is no incontrovertible
increase expected from profit-sharing can be raised evidence of these phenomena, which are probably
by workers having collective discretion over the partly offset by other economic (job security,
organization of labour; or the productivity fall growth-mindedness, etc.) andnon-economic stimuli;
which might derive from workers’ control over but there is a presumption that - albeit in a weak
labour organization might be tempered by profit- form - the same tendencies and, in particular,
sharing. Greater variability of earnings - during the employment restrictive policies might be associated
cycle and across firms - strengthens under profit- with codetermination. We can also presume that
sharing the case for codetermination already present workers’ eagerness to press and ability to assert
in workers’ exposure to employment risk in the demands for codetermination, as in the case of other
wage regime. The income premium required by demands, increase as unemployment diminishes.
risk-averse workers to replace some of their fixed Hence the employment-generating benefits of
wage with a variable profit-share can be reduced by profit-sharing can be at least partly offset by the
their involvement in the decisions which expose restrictive employment policies possibly associated
them to income variability in the first place. The with codetermination brought about by profit-
reduction in conflict frequency and intensity sharing and by greater proximity to full
expected from codetermination is enhanced by employment. Recent empirical studies suggest
profit- sharing because for each worker it partly modest but sizeable improvements in economic
internalizes the conflict between ‘us’ and ‘them’ performance from codetermination and profit-
otherwise manifested and enacted externally; in any sharing (Cable and Fitzroy ; Estrin et al. ) when and
case it is a requirement of any effective incentive where they occur but there may have been costs that
system that power and responsibility should not be remained unobserved and, in any case, the
separated. improvements cannot be generalized.
The quantification of degrees of ‘codetermina-
tion’ and to a lesser extent of ‘profit-sharing’ raises
conceptual and practical difficulties (though see Markets and Policy
Cable 1985). By and large we can observe a certain
correlation between the two: both codetermination Degrees of codetermination and profit-sharing may
and profit-sharing are zero in the pure capitalist well be regarded as desirable on ‘political’ (as
enterprises and unity in cooperatives and other opposed to ‘purely technical’) grounds such as
forms of partnerships of capital
Codetermination and Profit-Sharing 1751
equity and social peace. They may also be the best be justified in forcing profit-sharing on to a yet
policy instruments in the pursuit of public objectives unconverted and disbelieving public, thus achieving
such as stabihty, employment and growth, in the full employment, price stability and growth at a
sense of having the least cost in terms of public stroke. Unfortunately miracles exist only for the
funds or offering the most attractive trade-offs uninformed and the faithful, but do not bear the
between alternative targets. Otherwise, as Jensen weight of sober scrutiny. First, excess demand for
and Meckling argue for codetermination and one labour at full employment cannot be sustained and
can also argue for profit-sharing, if it is truly can only be a temporary disequilibrium. Second,
beneficial to both stockholders and labour no laws permanent excess demand for labour is inconsistent
would be needed to force firms to undertake with lack of codetermination, and when this is
reorganization ( , p. 474). Yet renewed and introduced restrictive employment policies will alter
insistent calls for public intervention in favour of the picture. Third, and most important, there is no
profit-sharing without codetermination have been guarantee that full employment can necessarily be
put forward by M.L. Weitzman in recent writings ( achieved. Without these benefits the alleged ‘public
, , ,, ). The proposal has good’ merits of the sharing contract disappear.
been enthusiastically received in certain academic
and political circles and hailed as a breakthrough in
the specialist press. Excess Demand for Labour at Full
Weitzman’s novelty, the foundation for this Employment
renewed fascination with profit-sharing, is the rash
assertion of two propositions. First, that long-run Suppose that the share economy reaches a state of
full employment equilibrium under profitsharing is full employment. Weitzman maintains the presence
associated with permanent but non-inflationary and persistence of excess demand for labour in
excess demand for labour, which cushions off the long-run equilibrium on the basis of the following
economy from contractionary shocks and gives new argument:
dignity and status to labour. In adman’s language we
are told, for instance: labour total pay = marginal revenue value of
A share system has the hard-boiled property of
excess demand for labour, which turns into a tena-
labour productivity at full
cious natural enemy of stagnation and inflation. The employment
share economy possesses a built-in, three-pronged
assault on unemployment, stagnant output, and the (1)
tendency of prices to rise. This is a hard combination
to beat. (Weitzman , p. 144. because long-run equilibrium must be full-
Second, that even in the short run the share employment equilibrium and because of the under-
economy can achieve and maintain frill employ- lying homomorphism of profit-sharing and wage
ment. For instance: contracts in long-run equilibrium (Weitzman ). By
The share system, . . . , has a strong built-in mecha- definition of profit-sharing
nism that automatically stabilizes the economy at full
employment, even before the long-run tendencies
have had the chance to assert their dominance. labour total pay = fixed pay
... a share economy has the direct ‘strong force’ of + share of net profits
positive excess demand for labor ... pulling it towards (2)
full employment. ... the strong force of the share
system will maintain full employment. (Weitzman , p.
97)
where fixed pay is greater than or equal to zero, and
the share of net profits is greater than zero. It
Were these claims well founded an enlightened
follows from ( ) and ( ) that:
government possessing these troths would
1752 Codetermination and Profit-Sharing
marginal revenue value of share of net profit is zero: with the sharing com-
labour productivity at full ponent of earnings the ‘share economy’ also van-
ishes and reverts to the fixed wage economy
employment > fixed pay = marginal cost of
without any excess demand for labour.
labour to firms Third, workers perceiving excess demand for
(3 ) labour are likely to reduce their supply of effort
and/or increase turnover - as they do in the only
i.e. firms will wish to employ more workers than
known instances of permanent excess demand for
are available. A permanent state of excess demand
labour, i.e. Soviet-type economies (see Lane ) - if
for labour will exist, which will protect full
not right down to the point where their marginal
employment from contractionary shocks, as long as
product equals fixed pay at least as close to that
shocks do not reduce the marginal revenue value of
level as they are allowed to get by monitoring and
labour productivity at full employment below the
supervising arrangements. This is another
fixed element of pay, in which case the maintenance
mechanism which can reduce and eliminate excess
of over-full employment requires a reduction of the
demand for labour if it occurred.
fixed element without cutting earnings as much as
necessary in the wage regime.
There are three grounds for refuting this syllo-
Codetermination and Employment
gism. First, firms should be well aware that, what-
ever their pay formula, they can only attract The lack of codetermination is an explicit precon-
workers by offering the going rate for labour total dition of Weitzman’s claims (though not of
pay and should regard this, and not the fixed Vanek’s, who does not claim full and over-full
element of pay, as marginal cost of labour. If firms employment of labour and does not need this
behave as they should, excess demand for labour restriction). (In the earlier version of his analysis
disappears. Weitzman takes a sanguine view of the possibility
Second, if firms regard the fixed element of pay of keeping codetermination in check: ‘ . . . the
as the marginal cost of labour they should find its bargaining power of labor unions is not a natural
being lower than the marginal revenue value of right . . . ’ (1984a, p. 109); ‘ . . . the decisions on
labour productivity disquieting enough to experi- output, employment and pricing are essentially
ment with alternative combinations of pay param- made by capitalists’ in his model (p. 132); ‘I can see
eters without raising total pay above labour no compelling reason why a capitalist firm should
productivity. Risk-averse workers preferring fixed be more prone to allow increased worker
pay to potentially variable earnings of identical participation in company decision making under
mean, risk-neutral or risk-loving employers will one contract form than under another’ (p. 133,
reduce their labour cost by raising the fixed element emphasis added). His latest version is more open-
of pay at the expense of workers’ profit share; even minded: workers’ participation in decisionmaking
without taking into account attitude to risk it is becomes not only possible but desirable as ‘a
plausible to expect managers to experiment with question of justice and practical politics’ as long as
alternative pay parameters and not to rest until they it excludes employment decisions ( ).
have equalized their marginal cost and marginal It is extremely hard to imagine any major decision,
value of labour, i.e. in which workers might have a voice, that would
not directly or indirectly also affect employment.
marginal revenue value of labour productivity at Either this limitation or workers’ participation
full employment = fixed pay would have to give way.) We know that it is
(3') possible to exclude workers from code- termination
in the presence of persistent unemployment; such
which can only be reconciled with the definition ( ) exclusion might be difficult at full employment, and
of a profit-sharing contract if the workers’ it would certainly be very
Codetermination and Profit-Sharing 1753
difficult with excess demand for labour, but the (iii) The fixed element of pay in share contracts
persistent state of excess demand for labour pos- must be flexible enough to fall down to the
tulated by Weitzman should make the exclusion of level of the marginal revenue product of
codetermination, whether or not employment labour at full employment, positive as it may
questions are directly involved, impossible without be.
an authoritarian or military regime. This is not a
moral, or legal, or legalistic proposition; it is a The first condition rales out the possibility of
question of ‘practical politics’. classical unemployment, i.e. due to lack of equip-
Once workers have a say on output, employment ment, land or other resources in the quantities
and pricing and related questions (investment, necessary to employ all workers efficiently. Yet,
innovation, etc.) they will try and resist the very after over a decade of deep and protracted reces-
possibility of dilution of their own shares just as sion, deindustrialization and decapitalization, even
shareholders usually resist the dilution of share advanced industrialized countries such as Britain of
capital; for better or worse they are likely to adopt, France today cannot be expected to be able to
or are tempted to adopt, other things being equal, satisfy this condition as a matter of course, not to
restrictive employment policies in the possibly speak of Italy or, say, Spain, or of less developed
misguided and self-defeating purpose of raising or countries. In his formal model Weitzman ( )
maintaining individual earnings. This is not a case postulates constant physical
against profit-sharing, but an argument for not productivity of labour; this is a plausible assump-
expecting that overfull employment, if achievable, tion up to near-full capacity but Weitzman gives no
can be sustained necessarily, i.e. an argument reason why the capacity should be constrained by
against the plausibility of Weitzman’s model (see labour instead of other resources.
Nuti ). The second condition rales out the possibility of
Keynesian unemployment, i.e. aggregate demand
constraints making the marginal product of labour
Profit-Sharing and Full Employment valueless before full employment is reached. Even
if the first condition was satisfied, imperfect
The foundation of Weitzman’s claims on behalf of competition - which in all of Weitzman’s work
profit-sharing is the assertion that, even in the short provides the environment in which the share
run, the share economy ‘delivers’ frill employment contract is to operate - provides an excellent reason
of labour. (‘Resources are always frilly utilised in a why firms might not give to additional physical
share system’ (Weitzman , p. 949); real world products a positive value. Weitzman can assert that
frictions, inertias and imperfections are mentioned ‘... a “pure” sharing system not having any base
only to be exorcised, and to reassert the full wage would possess an infinite demand for labor’ (
employment claim at least as a ‘natural tendency’ , p. 944),
(p. 949, p. 952) of the share economy which, we are which implies positive marginal revenue for any
told, ‘delivers full employment’ ( ); see also level of output, because of the very special
Weitzman , assumption that the elasticity of demand is greater
p. 97.) than unity (p. 938), which makes demand curves
For a share economy to ‘deliver’ full employ- absurdly and indefinitely elastic even for imper-
ment three necessary conditions must be satisfied fectly competitive firms. The proposition cannot
simultaneously: have any claim to general validity.
Even if demand for labour were to be infinite in
(i) The physical marginal productivity of labour
the pure share economy, i.e. with a zero fixed
at full employment must be positive;
element of pay, it would not necessarily be infinite,
(ii) The marginal revenue obtained by firms from
or even large enough to reach full employment, for
that physical marginal product of labour must
a positive fixed element of pay. Weitzman neglects
also be positive;
the determination of the
1754 Codetermination and Profit-Sharing
relative weight of the fixed and variable compo- effects on aggregate demand; otherwise there is
nents of the share contract but recognizes the little to choose between the two, except for the
impossibility of total dependence of pay on profit; lower degree of public resistance that can be
yet he takes for granted, for no good reason, that the expected for share contracts with respect to wage
fixed element of pay can be compressed down to cuts.
whatever is the full employment marginal revenue In fact if the share contract could really deliver
product of labour, which we do not even know for and maintain full employment, while a wage
sure is positive. economy could not, the greater variability of
It is a non-controversial feature of the sharing workers’ earnings associated with profit-sharing
contract, known from Vanek ( ), that the over the cycle would disappear and, between firms,
replacement of part of the wage by a profit-share of could be eliminated by labour freely redeploying
identical average cost to firms will lead to greater itself at will across labour-hungry firms; the
employment, higher output and lower prices - in the variability of employment would also disappear;
absence of large enough adverse feedback on workers would have de facto free access to a job in
investment (which Weitzman recognizes as a any firm of their choice, as in forgotten utopias
possible short run effect of the introduction of (Hertzka ; Chilosi ). Thus it could be said that ‘... a
sharing) and in the absence of large enough move towards profit sharing represents an
feedbacks of accompanying codetermination on unambiguous improvement for the working class’
firms’ employment policy. But there is a world of (Weitzman , p. 954). But we have seen that profit-
difference between higher employment and full sharing cannot guarantee full (let alone over-full)
employment and another world of difference employment. Without full employment, the higher
between full employment and persistent over-full variability of earnings associated with profit-sharing
employment; no serious work can afford to switch remains and it may or may not be compensated by
indifferently and cavalierly from one to the other. the higher mean value of employment probability
and perhaps real earnings. Outside over-full
employment, in fact, the share economy is just as
Share Contracts and Public Good vulnerable to contractionary shocks as the wage
economy because, in spite of flexibility of labour
If the share economy could really guarantee, as earnings in the share regime, the marginal cost of
general and necessary consequences of its estab- labour to firms (which is the fixed component of
lishment, the achievement and stability of full workers’ pay) remains constant just as does the
employment without adverse drawbacks there wage. Thus the higher stability of employment to be
would be a case for public policy treating the share found in Japan simply cannot be the result of profit-
contract as ‘public good’ to be pressed on an sharing, as Weitzman firmly believes, seeing that
unenlightened public still largely unaware of Japan has never known a state of over-full
potential benefits, as in the case of safe vaccination employment; higher employment stability would
against infectious disease. The case for the share require workers’ shares in GNP instead of their
economy would not be much greater than that for enterprise’s profits.
enforced wage flexibility, which would also The fact that the adoption of a share contract,
guarantee full employment and stability under the without the guarantee of stable frill employment,
same circumstances. A downward flexible wage has a cost for workers, eliminates the necessity, but
would not deliver excess demand for labour but this not the possibility, of the share contract having
is a questionable achievement and would not be ‘public good’ features. A vaccine may be somewhat
necessary to absorb contractionary shocks if wages unsafe, its degree of unsafety acceptable to all if
were flexible; downward flexible wages would also vaccination is universal and all benefit from
require a greater fall of money earnings to achieve reduced exposure to infection, yet individuals
full employment in the short run and may be more benefit from free-riding strategies and the
likely to bring about adverse
Coghlan, Timothy (1855-1926) 1755
enforcement of universal vaccination as ‘public Jensen, M.C., and W.H. Meckling. 1979. Rights and pro-
good’ can still be beneficial to all. If labour con- duction functions: An application to labor-managed firms
and codetermination. J o u r n a l o f B u s i n e s s 52:
tracts were negotiated exclusively at the level of 469-506.
individuals or firms the external beneficial effects Lane, D. (ed.). 1985. E m p l o y m e n t a n d l a b o u r i n
of the share contract might be lost from sight; but t h e U S S R . London: Harvester Press.
these external benefits - unlike the case of genuine Nuti, D.M. 1985. The share economy:
Plausibility and viability of Weitzman’s
‘public goods’ - are completely internalized in m o d e l . European University Institute Working Paper,
nationwide negotiations between associations of No. 85/194, Florence; Italian translation in P o l i t i c a
employers and employees. Admittedly the benefits, e d E c o n o m i c 1, January 1986.
such as they are, of profit-sharing may be still Nuti, D.M. 1986. A rejoinder to Weitzman. (In Italian.)
P o l i t i c a e d E c o n o m i a 4, April.
unknown to the public at large and deserve wider
Nutzinger, H.G. 1983. Empirical research into German
publicity. But it is counterproductive to foist a good codetermination: Problems and perspectives. E c o -
medicine on a sceptical public by claiming that it nomic Analysis and Workers'
can guarantee longevity or immortality. At the first M a n a g e m e n t 17(4): 361-382.
Pagano, U. 1984. W e l f a r e , p r o d u c t i v i t y a n d s e l f -
signs that such excessive claims are unfounded it
management. European University Institute
may be thrown away despite its real lesser benefits. Working Paper, No. 84/128, Florence.
Reich, M., and J. Devine. 1981. The microeconomics of
conflict and hierarchy in capitalist production. R e v i e w
Bibliography o f R a d i c a l P o l i t i c a l E c o n o m i c s 12(4): 27-45.
Samuelson, P.A. 1977'. Thoughts on profit-sharing.
Aoki, M. 1984. The co-operative game theory of the firm. Zeitschriftfur die Gesamte
Oxford: Oxford University Press. S t a a t s w i s s e n s c h a f t (special issue on profit-
Bartlett, W., and M. Uvalic. 1985. B i b l i o g r a p h y o n sharing).
labour- managed firms and employee Vanek, J. 1965. Workers’ profit participation, unemployment
p a r t i c i p a t i o n . European University Institute Working and the Keynesian equilibrium.
Paper, No. 85/198, Florence. Cable, J.R 1984. E m p l o y e e W e l t w i r t s c h a f t l i c h e s A r c h i v 94(2): 206-214.
participation and firm performance: A Vanek, J. 1970. The general theory of labor-managed market
p r i s o n e r s ' d i l e m m a f r a m e w o r k . European Uni- economies. Ithaca: Cornell University Press.
versity Institute Working Paper, No. 84/126, Florence. Cable, Ward, B.M. 1958. The firm in Illyria: Market syndicalism.
J.R., and F.R. Fitzroy. 1980. Productive efficiency, incentives A m e r i c a n E c o n o m i c R e v i e w 48(4): 566-589.
and employee participation: Some preliminary results for Weitzman, M.L. 1983. Some macroeconomic implications of
west Germany. K y k l o s 33(2): 100-121. Chilosi, A. 1986. alternative compensation systems. E c o n o m i c J o u r -
The right to employment principle and self- n a l 93(4): 763-783.
managed market socialism: A historical Weitzman, M.L. 1984. T h e share economy.
account and an analytical appraisal of Cambridge, MA: Harvard University Press.
s o m e o l d i d e a s . European University Institute Working Weitzman, M.L. 1985a. Profit sharing as macroeconomic
Paper, No. 86/214, Florence. policy. A m e r i c a n E c o n o m i c R e v i e w : P a p e r s
Estrin, S., D.C. Jones, and J. Svejnar. 1984. The varying a n d P r o c e e d i n g s 75(2): 41-45.
nature, importance and productivity effects of worker Weitzman, M.L. 1985b. The simple macroeconomics of profit
participation: Evidence for contemporary producer cooperatives sharing. A m e r i c a n E c o n o m i c R e v i e w 75(5):
in industrialised Western societies. CIRIEC Working Paper, No. 937-953.
84/04, University of Liege. Fitzroy, F.R. and K. Kraft. 1985. Weitzman, M.L. 1986. Reply to Nuti. (In Italian.) P o l i t i c a
Profitability and profilsharing. Discussion Papers of the
International Institute of Management, WZB, Berlin, IIM/IP 85
41, December.
Fitzroy, F.R., and D.C. Mueller. 1984. Cooperation and Coghlan, Timothy (1855-1926)
conflict in contractual organisations. Q u a r t e r l y
R e v i e w o f E c o n o m i c s a n d B u s i n e s s 24(4): Colin G. Clark
24-49.
Furobotn, E.G. 1985. Codetermination, productivity gains
and the economics of the firm. O x f o r d E c o n o m i c
P a p e r s 37: 22-39.
Hertzka, T. 1980. F r e i l a n d . Ein soziales
Z u k u n f t s b i l d . Dresden: Pierson. English translation, Coghlan was the son of a poor family in Sydney,
London: Chatto & Windus, 1891. and obtained his education through scholarships.
Like several other distinguished economists, his
1756 Coghlan, Timothy (1855-1926)
original career was in engineering, in this case civil thought at the time New Zealand might enter the
engineering, which led to his taking an interest in Australian federation).
statistics. New South Wales had been established as The free trade case was lost in 1908, when
a penal colony under military rule, and therefore Victoria, with its allies in some other states, per-
right from the colony's beginning people and suaded the new federal government to impose
governments were accustomed to statistical highly protective tariffs.
enumeration much more thorough than in the rest of Coghlan was posted to London, where he
the world. Coghlan was appointed statistician by the performed most of the duties (but without the title,
colonial government, and produced over a long which went to the political head) of Agent General
period of years a large output of excellent statistical for New South Wales. The duties were varied and
papers. responsible, including the raising of large sums in
Before Federation in 1901, there were six loans.
separate colonial governments in Australia, often In partial retirement, Coghlan embarked upon a
pursuing different objectives. This was particularly magnum opus. Labour and Industry in Australia,
the case in the highly controversial matter of free which was published in England in 1918, giving a
trade or protection. The two most populous colonies most frill and detailed account of its subject, but
were Victoria and New South Wales. Victoria had illuminated by stories and anecdotes which make it
attracted a large population through the abundance readable.
of alluvial gold in its riverbeds. This was soon
exhausted; and deep mining fell far short of taking
its place. The Victorian Government considered that Selected Works
the only way to provide employment to make up for
the decline in mining was to establish 1 89 3. Sheep and wool in New South Wales, with
manufacturing industries under high tariff history and growth of the pastoral industry of
protection. New South Wales (which admittedly had the colony as regards both these items of pro-
not had such a destabilizing gold rush) was duction. S yd n e y : P o t t e r .
determined to adhere to free trade, with the object 1 89 8a . Notes on the financial aspect of Australian
of promoting agricultural and pastoral production. Federation, n . t. p . S y dn e y : W . A . G u l l i c k .
Some of Coghlan’s writings were polemic, strongly 1 89 8b . Notes on the financial aspect of Australian
stating the free trade case. In the Wealth and Pro- Federation. The Position of Tasmania and
gress of New South Wales, in succeeding editions, Western Australia. Sy dn e y : W . A . G u l l i c k .
the available statistics of the colony were 1 89 8c . Notes on the financial aspect of Australian
assembled, pointing to a superior rate of growth to Federation. The incidence of the federal tariff.
that of Victoria. S yd n e y : W . A. G u l l i c k .
Coghlan was fifty years ahead of his time in 1 89 8d . Tables to accompany notes on financial
making national product estimations for New South aspects of federation. Sy dn e y : W . A .
Wales. Without any experience in any other country Gullick.
to guide him, he devised methods which would pass 1 89 8e . A statistical account of the seven colonies
muster today. His work however was not followed of Australasia 1861-1897. S yd n e y : W . A .
up. There was no further estimate of Australian Gullick.
national product until the study by Benham in the 1902. The progress of Australasia in the nineteenth
1920s, using less precise methods, and then another century. London, Philadelphia: The Linscott
gap until Sir John Crawford and I prepared a study Publishing Company.
in 1937. 1903a. (With T.T. Ewing.) The progress of Aus-
Coghlan prepared a series of publications on a tralia in the century. London Edinburgh: W. &
more extensive basis. The Seven Colonies of Aus- R. Chambers Ltd.; Philadelphia,Detroit: The
tralasia (i.c. including New Zealand; it was Bradley-Garretson Co., Ltd.
1 90 3b . The decline in the birth-rate of New South
Wales and other phenomena of childbirth: An
Cognitive Ability 1757
19 18 . Labour and industry> in Australia, from the Originally called IQ tests (for Intelligence Quotient
first settlement in 1788 to the establishment of because the measiues were constructed as the ratio
the commonwealth in 1901. Lo nd on / N e w of mental age to chronological age multiplied by
Yo r k : Ox f o r d Un i v e r s i t y P r e s s . 100), that name has fallen out of favoiu. Instead,
such tests are now often referred to as tests of
cognitive ability. Although the term IQ is still
sometimes used to refer to what such tests measiue,
none constructs a ratio.
Cognitive Ability
the notion that people had a number of independent and Murray ). Heckman ( ), in a review
primary mental abilities rather than a single general of that book, argues that even though g has sig-
mental ability. Both Spearman and Thurstone made nificant explanatory power, many other factors,
contributions to the development of factor analysis both cognitive and non-cognitive, matter as well.
as a way to identify the presence of unobserved Finally, test scores are correlated with a number
variables (abilities) that affect a number of of social behaviours including unwed motherhood,
observable variables (sub-test or item scores). criminal activity, and welfare receipt (Jensen , ch.
Today, the Spearman-Thurstone debate has been 9). While these correlations are substantial, and
resolved with a compromise. The most common cognitive test scores are typically better predictors
view among psychometricians who study cognitive of most of these outcomes than any other single
ability is that there are a number of different personal attribute, they still explain less than half
abilities. Some people are better at solving problems the variance.
verbally while others are good at solving problems Individuals’ scores on tests of cognitive ability
that involve visuahzation. Some people who are also tend to be strongly correlated over time - much
good at both of these things may be only average at more so for adults than for children. A study of
tasks that rely heavily on memory. However, there older adults found their full-scale IQ scores to be
is a tendency for people who perform well in any of correlated .92 when tested at two points in time
these broad areas to perform well in all others as three years apart (Plomin et al. ). In contrast, a
well (Carroll ). Most modem tests of cognitive study of children tested at two points in time
ability provide both a full-scale score that is most roughly two years apart found correlations of only .
reflective of general intelligence, and a number of 46 for those who were less than one year old at first
special- ability specific sub-scores as well. testing and .76 for those who were one year old at
first testing (Johnson and Bradley-Johnson ).
It is common to draw a distinction between tests
Validity of achievement and tests of ability. Achievement
tests measure how much knowledge the test taker
Binet’s is considered the first successful test of has accumulated in a particular area while ability
cognitive ability in that it was able to accurately tests endeavour to measure how quickly a person
predict teachers’ assessments of their students on can solve unfamiliar problems. Typically, scores on
the basis of a relatively short verbally administered the two types of tests are highly correlated. In fact,
test. Scores on tests of cognitive ability correlate all tests of ability are, to some degree, tests of
well with common perceptions of how bright or achievement as it is impossible to measiue ability
smart someone is. They are also strongly correlated without also measuring the test taker’s reading or
with measures of academic achievement such as verbal comprehension at least. Further, to the extent
achievement test scores, grades and ultimate that the task being tested relies on knowledge of
educational attainment (typically .5 or better). They geometry, arithmetic, general knowledge, and so
are less highly correlated (.5 or less ) with many on, the rolls of the achievement test and ability test
important life outcomes including reported annual are confounded.
income and job status. Performance on a wide range Cultural bias has been a concern with
of jobs and work tasks is positively related to knowledge-based tests. Some knowledge is more
cognitive test scores with performance on more accessible to some people than others. For example,
demanding jobs having higher correlations. Some we would expect that a child growing up with upper
have claimed that general cognitive ability is middle-class parents in New York or Paris to find it
responsible for most of this explanatory power (Ree easier to learn the distance between the two cities (a
and Earles ; Ree et al. ). This was a major theme of general knowledge question that was once on one
the controversial best-seller The Bell Curve of the popular IQ tests) than someone from the
(Hermstein slums of St. Louis or a tribesman from
Cognitive Ability 1759
the bush in Africa. For this reason a number of tests abstract or symbolic problem-solving. Differences
have been constructed that require a minimal tend to be smaller on seemingly culturally rich
amount of prior knowledge, such as CatteH’s cul- items such as general knowledge. Hermstein and
ture fair test (Cattell ) or Raven’s progressive Murray ( , Appendix 5) provide a review
matrices (Raven ). of the evidence on bias.
The best evidence that tests can be biased in at
least some circumstances comes from studies of a
Group Differences phenomenon called stereotype threat. It has been
shown that reminding peopleof their group identity
No matter what test is administered, men and
can cause them to perform in ways more consistent
women of the same background tend to have very
with stereotypes of the group’s abilities. For exam-
similar average scores on tests of cognitive ability,
ple, blacks have been found to perform worse on
though they differ slightly in their performance on
some particularly difficult vocabulary items when
some sub-tests (Jensen pp. 531—6). However, there
given a questionnaire that asked them to state their
are large differences across ethnic groups and
race before taking the test or when the test was
geographic areas. The difference that has generated
represented as a test of intelligence as opposed to a
the most controversy is the difference in average
test of vocabulary. Women who were told that the
scores of US blacks and whites, which is typically
difficult math test they were taking generally
reported to be about one white standard deviation,
showed gender differences performed worse than
though this gap has declined some in recent years
those taking the same test who were told the test
(Dickens and Flynn ). Do these represent real
showed no differences. Men showed the opposite
differences in cognitive ability or do they reflect
effect and performed better when told the test
cultural bias in the tests?
showed a gender difference (Steele ). However, it
Defenders of the tests offer several pieces of
has not been demonstrated that stereotype threat
evidence suggesting that they are unbiased. Fore-
produces substantial bias on standard tests in
most is the evidence of ‘external validity’ - that the
standard test-taking circumstances.
same regression equation that predicts outcomes
While most evidence is consistent with the view
such as job performance, grades, or educational
that tests provide a fair measure of the underlying
attainment for one group will typically do a
concept of cognitive ability across ethnic groups, it
similarly good job for any other group. Also,
is not conclusive. For example, since tests rarely
different groups find the same questions more or
explain as much as half the variance in the
less difficult. Members of different groups with
outcomes in studies of external validity, there is
similar scores will have similar patterns of right and
always the possibility that the tests underestimate
wrong answers. If some questions are more
black cognitive ability but that other disadvantages
culturally biased than others, the disadvantaged
pull down black performance. If true, the validity of
group should find those items more difficult than
the tests as predictors of practical outcomes is an
the mainstream group does. But researchers looking
artifact of offsetting biases. This could explain why
for such cultural bias have found no evidence of it
it is that when regressions of white performance on
(an exception occurs when one of the groups being
white test scores fail to predict black performance
compared is made up of non-native speakers of the
they tend to predict better performance than is
language in which the test was administered, in
observed. Further, common- sense notions that
which case scores on questions requiring a better
people from different cultural backgrounds
knowledge of the language will be lower).
probably have less opportunity to acquire certain
Surprisingly, to the extent that there are black-white
types of information or practise certain skills should
differences across test items, blacks do worse on
be given some weight. If studies find that blacks do
what seem to be some of the least culturally
no worse than similarly scoring whites on highly
dependent items - those involving
culturally loaded items, that could indicate that the
poorscoring whites
1760 Cognitive Ability
Finally, the most profound changes in measured reinforcement from teachers and be more likely to
cognitive ability have taken place over time. James be tracked into more demanding classes and to
Flynn has documented huge gains in cognitive develop friendships with other similarly disposed
ability - as much as a standard deviation or more a children. Such a child will have much more
generation - in more than 14 countries. Numerous opportunity to practise intellectual work and receive
other authors have found gains on other tests and in more ‘coaching’ in intellectual pursuits. A small
other countries (Flynn initial physiological difference could mushroom
, ). This phenomenon of large and per into a large difference in ability through a process
vasive gains has been dubbed ‘the Flynn Effect’. whereby the advantage leads to a better
How is it that large gains are possible in the face environment which improves ability and gives
of high heritability estimates? The chief flaw in the access to even better environments.
argument that high heritability implies a limited If such reciprocal causation is at work in the
role for environment is that it misunderstands what development of cognitive ability, then small persis-
heritability is measuring. It ignores the possibility tent exogenous differences in environment could
that genetic and environmental influences might be produce large differences in cognitive ability. Dick-
correlated. In particular, it ignores the possibility ens and Flynn ( ) lay out a formal model of
that genetic influences on ability are largely the such a process. If in a cross section of people in the
work of environmental advantages that come about same ethnic group most exogenous environmental
due to modest physiological advantages. differences are transient, then they will not accu-
Consider a sports analogy. Identical twins raised mulate through reciprocal causation and will not
apart have a shared genetic endowment that tends to explain much variance across individuals. However,
make them notably taller than their peers. As such small persistent differences between groups or
they are both better basketball players. Even though generations could cause laige differences if they
they are raised apart, both are likely to spend more drive the engine of reciprocal causation. Similarly,
time playing basketball than other children their preschool programmes which enrich children’s
age. They are good at it and thus enjoy it more than cognitive environment can have large effects, but
other activities in which they do not naturally excel. once the children are removed from the programme
Consequently they both get more practice at the process can work in reverse and unravel the
basketball than their peers, and that makes them gains. The exogenous decline in the quality of the
better at the game. Being better players than their environment from the removal of the programme’s
peers, they are more likely to be picked by coaches stimulation sets off a downward spiral of poorer
for high-school teams and more likely to receive yet performance leading the child into poorer environ-
more practice and more intensive coaching. If this ments, yet poorer performance and so on.
leads to them playing in college they will both be
enormously better players than the average person.
A small physiological difference, which would Conclusion
make only a very modest difference in their
Modem psychology views cognitive ability as
performance on the court if they were untrained and
having a number of dimensions, all of which seem
inexperienced, has mushroomed into a huge
to be correlated with one another. Many interpret
difference in performance because it has been
this correlation as reflecting an underlying general
reinforced by the environmental influences of
cognitive ability, or g, that is measured by the full-
practice and coaching.
scale scores on the major tests of cognitive ability or
It is not hard to imagine the same thing hap-
IQ. General cognitive ability is an important
pening with cognitive ability. Someone who is
predictor of a wide range of economic and life
slightly quicker or has an emotional disposition
outcomes, with similar predictive validity across
amenable to thought and contemplation will be
groups with different average levels of ability. Still,
more likely to spend more time in intellectual
cognitive test scores typically explain
pursuits. Such a person will likely receive positive
1762 Cognitive Ability
far less than half the variance in life outcomes, so r e l a t e d m e a s u r e s , ed. U. Neisser. Washington, DC:
cognitive ability is only one important factor among American Psychological Association.
Flynn, J. 2006. Efeito Flynn: repensando a inteligencia e seus
many that explain success. efeitos [The Flynn effect: rethinking intelligence and what
Adult differences in cognitive ability within affects it]. In I n t r o d u q d o a P s i c o l o g i a d a s
representative samples of ethnic groups raised in Diferenqas Individuals [Introduction to
similar circumstances are subject to substantial the psychology of individual
d i f f e r e n c e s ] , ed. C. Flores-Mendoza and R. Colorn.
genetic influence, but this does not mean that group Porto Alegre: ArtMed.
differences are genetic in origin. Despite the large Heckman, J. 1995. Lessons from the bell curve. J o u r n a l o f
role played by genetic differences in explaining P o l i t i c a l E c o n o m y 103: 1091-1120.
adult variance in cognitive ability, there is Hermstein, R., andC. Murray. 1994. T h e B e l l C u r v e .
New York: Free Press.
considerable evidence that intelligence is highly Jensen, A. 1973. Educability and group
malleable and the life outcomes influenced by d i f f e r e n c e s . New York: Harper and Row.
intelligence even more so. Jensen, A. 1998. The g factor: The science of Mental Ability.
Westport: Praeger.
Johnson, C., and S. Bradley-Johnson. 2002. Construct
stability of the cognitive abilities scale-second edition for
See Also infants and toddlers. Journal of
Psychoeducational
A s s e s s m e n t 20: 144-151.
► Behavio Lazar, I., and R. Darlington. 1982. Lasting effects of early
education: A report from the consortium for longitudinal
studies. In M o n o g r a p h s o f t h e S o c i e t y f o r
Bibliography R e s e a r c h i n C h i l d D e v e l o p m e n t , vol. 47.
Chicago: Chicago University Press.
Asbury, K., T. Wachs, andR. Plomin. 2005. Environmental Neisser, U., G. Boodoo, T. Bouchard Jr., A. Boykin, N. Brody,
moderators of genetic influence on verbal and nonverbal S. Ceci, D. Halpem, J. Loehlin, R. Perloff, R. Sternberg,
abilities in early childhood. I n t e l l i g e n c e 33: 643-661. and S. Urbina 1996. Intelligence: Knowns and unknowns.
Binet, A. 1905. New methods for the diagnosis of the A m e r i c a n P s y c h o l o g i s t 51: 77-101.
intellectual level of subnomrals. L'Annee Plomin, R., J. Defries, G. McCleam, and P. McGuffin. 2001.
P s y c h o l o g i q u e 12, 191-244. English edition: 1916. B e h a v i o r a l g e n e t i c s . 4th edn. New York: Worth
The Development of Intelligence in Publishers.
C h i l d r e n (trans: Elizabeth S. Kite). Vineland: Plomin, R., N. Pedersen, P. Lichtenstein, and G. McCleam.
Publications of the Training School at Vineland. 1994. Variability and stability in cognitive abilities are
Carroll, J. 1993. Human cognitive abilities: A sun’ey of factor- largely genetic later in life. B e h a v i o r G e n e t i c s 24:
analytic studies. Cambridge, MA: Cambridge University 207-215.
Press. Raven, J. 1941. Standardization of progressive matrices.
Cattell, R. 1960. M e a s u r i n g i n t e l l i g e n c e w i t h British Journal of Medical Psychology
c u l t u r e f a i r t e s t s . Champaign: Institute for 19: 137-150.
Personality and Ability Testing. Ree, M., and J. Earles. 1992. Intelligence is the best predictor
Ceci, S. 1991. How much does schooling influence general of job performance. C u r r e n t D i r e c t i o n s i n
intelligence and its cognitive components? A reassessment P s y c h o l o g i c a l S c i e n c e 1: 86-89.
of the evidence. D e v e l o p m e n t a l P s y c h o l o g y ’ 27: Ree, M., J. Earles, and M. Teachout. 1994. Predicting job
703-722. performance: Not much more than g. J o u r n a l o f
Dickens, W. 2005. Genetic differences and school readiness. A p p l i e d P s y c h o l o g y ’ 79: 518-524.
T h e F u t u r e o f C h i l d r e n 15: 55-69. Rushton, J., and A. Jensen. 2005. Thirty years of research on
Dickens, W., and J. Flynn. 2001. Heritability estimates versus race differences incognitive ability. P s y c h o l o g y ,
large environmental effects: The IQ paradox resolved. P u b l i c P o l i c y , a n d L a w 11: 235-294.
P s y c h o l o g i c a l R e v i e w 108: 346-369. Speamran, C. 1904. General intelligence, objectively
Dickens, W., and J. Flynn. 2006. Black Americans reduce the detemrined and measured. A m e r i c a n J o u r n a l o f
racial IQ gap: Evidence from standardization samples. P s y c h o l o g y 15: 201-293.
P s y c h o l o g i c a l S c i e n c e 17: 913-920. Steele, C. 1997. A threat in the air: how stereotypes shape
Flynn, J. 1987. Massive gains in 14 nations: What IQ tests intellectual identity and performance. A m e r i c a n
really measure. P s y c h o l o g i c a l B u l l e t i n 101: 171- P s y c h o l o g i s t 52: 613-629.
191. Thurstone, L. 1935. T h e v e c t o r s o f t h e m i n d .
Flynn, J. 1998. IQ gains over time: toward finding the causes. Chicago: University of Chicago Press.
In T h e r i s i n g c u r v e l o n g - t e r m g a i n s i n I Q Turkheimer, E., A. Haley, M. Waldron, B. D’Onofrio, and I.
and Gottesman. 2003. Socioeconomic status modifies
Cohen, Ruth Louisa (Born 1906) 1763
See Also
Cohen Stuart, Arnold Jacob (1855-1921)
►
Arnold Heertje
Selected Works
JEL classifications
Selected Works C32
Abstract
Mathematical Structure of I
This article summarizes the mathematical
(1) Cointegrated Models
structure of cointegrated time series models and
discusses econometric procedures commonly Let X, denote a scalar 1(1) stochastic process, with
used to analyse cointegrated time series. This moving average representation A, c(L)st> where E t
discussion is carried out in the context of
is a scalar white noise process, and •'W = £“ QCiL‘
stochastic trends that follow driftless 1(1) or
is a polynomial in the lag operator L, and where the
‘unit root’ processes. The article concludes with
moving average coefficients, Cj decay sufficiently
a brief discussion of cointegration in the context
rapidly so that < 00■ The Beveridge-Nelson
of more general stochastic trends.
decomposition (see trend/cycle decomposition)
implies that X, can be represented as Xt xt + at,
Keywords
where 7/ is a random walk, so that xt xt. i + et, where
Beveridge-Nelson decomposition; Cointegration;
Dickey-Fuller unit root tests; Error-correction et is white noise and at has a moving average
terms; Granger, C.W.J.; Heteroskedasticity and representation at = d{L)et, where
autocorrelation corrections; Maximum
1766 Cointegration
£>i < oo. Thus, X, can be expressed as the sum of a stochastic trends. When X, is not cointegrated, the
stochastic trend, xt, and an 1(0) process, at. VAR for X, can be written in terms of AX, and has
When X, is an n x 1 vector of 1(1) processes, a the form d>(L)X, = e„ where O(L) is a stable lag
similar result implies that A) = Axt + at, where A is a polynomial (so its roots are outside the unit circle)
matrix of constants, xt is a vector of random- walk and s, is white noise. When X, is cointegrated, the
stochastic trends, and a, is a vector of I (0) VAR has the form O (L) X, = fjcXX,_i + £,, where a
processes. Because X, contains n elements, the is an n x r matrix with columns that are the linearly
vector xt will generally contain n stochastic trends. independent cointegrating vectors. Thus, the
However, when xt contains only k < n stochastic cointegrated VAR expresses the elements of AX, as
trends, A is n x k, so that a!A = 0, for any vector a in functions of its own lags, but also includes the r
the null space of the column space of A. This means regressors XX, | in each of the VAR’s n equations.
that XX, = a'a t, so that the linear combination XX, The variables aX,_, are called ‘error-correction
does not depend on the stochastic trends. In this terms’ and the cointegrated VAR is called a ‘vector
case, the time series making up Xt are said to be error correction model’ (VECM).
cointegrated. Any non-zero vector a that satisfies a! Watson ( ) provides a summary of the
A 0 will annihilate the stochas algebra linking these various representations of the
tic trend in XX,, and vectors with this property are cointegrated model.
called cointegrating vectors. When A has full col-
umn rank, the number of linearly independent
cointegrating vectors is r = n — k, which is called
the cointegrating rank of the process. Testing for Cointegration
For example, suppose that Xt contains n = 3
series representing interest rates on one-month, The time series making up X, are cointegrated if the
three-month and six-month US treasury bills. linear combinations XX, are 1(0) random variables.
Suppose tliat X„ = xt + a„, for / 1,2, 3, where If X, is not cointegrated, then XX, will be I (1) for
xt is a common stochastic trend shared by the three any non-zero vector a. Tests of cointegration ask
interest rates. Then Xt Ax, + a,, where k 1 (there is a whether XX, is 1(1) or 1(0).
single stochastic trend), A = (11 1)' (the trend has an Consider the simple case in which there is only
equal effect on each of the interest rates) and a.\ = one potential cointegrating vector, so that XX, is a
(1 0 1)' and a2 = (1 0 —1)' are scalar. Cointegration can then be tested using a unit
two linearly independent cointegrating vectors, so root test applied to XX,. The straightforward
that r = 2 and XxX, and X2X, denote the interest rate application of a unit root test requires that a is
term spreads. known, so that the scalar variable XX, can be
Vector moving average models (VMAs) and calculated directly from the data. This is possible in
vector autoregressions (VARs) are often used to many empirical applications (such as the interest
represent the linear properties of vector stochastic
rate example described above) where the value of a
processes. The Granger representation theorem (see
can be pre-specified.
Engle and Granger ) shows that VMAs and VARs
Thus, suppose that a is known, and consider the
for cointegrated processes have special structures.
competing hypotheses HI(1): XX, is 1(1) and HI(0l:
In general, the VMA for an 1(1) vector process is
XX, is 1(0). The hypothesis Hi(i) means that the
"X, = /.XL);:,, where is white noise with full rank
elements ofX, are not cointegrated and the hypoth-
covariance matrix. When Xt is not cointegrated, the
esis HI(0) means that the elements are cointegrated.
n x n matrix D( I), which contains the sum of the
Under HI(1j the autoregressive model for XX,
moving average coefficients, has rank n. But, when
contains a unit root, while under HI(0), the auto-
X, is cointegrated, D( I) has rank k < n, where k
regressive model for XX, is stable.
denotes the number of
The null Hi(1) can be tested against the alterna-
tive HI(0) using an augmented Dickey-Fuller
Cointegration 1767
(ADF) unit root test or the modified ADF test cointegrating vector, and so forth. As discussed in
developed in Elliott, Rothenberg and Stock ( ). Johansen ( ), these tests are easily formulated
The null HI(0) can be tested against HI(1) and carried out using the VECM model. Recall that
using the best local test proposed by Nyblom ( ), the VECM model has the form 0(L) Xt = [>otXt | +
modified for serial correlation as st. Consider the null and alternative hypotheses
described in Kwiatkowski et al. ( ), or a H0 :r=r0 v.v. Ha:r= ra where ra>rm and write the
point-optimal test as discussed in Jansson ( ). VECM as <J>(L)A, = fj0 u.'()Xt i + /i a'X, | + s:h
(There are important practical consider where aQ contains the rQ cointegrating vectors under
ations associated with the choice of the long-run- the null and a contains the additional cointegrating
variance estimator (see heteroskedasticity and vectors under the alternative. Under the null
autocorrelation corrections) used in tests for the hypothesis, the variables a'X, \ do not enter the
H^o) null hypothesis because of the high degree of VECM, while under the alternative these variables
serial correlation under the alternative. See Muller ( enter the VECM. Thus, the null and alternative can
) for discussion. ) be written as//„ : /) = 0versus IIa : [1 / 0. As in the
When a is not known, the unit root tests case of r = 1, the tests depend on whether the
described in the last paragraph use a'Xt in place of cointegrating vectors are known or unknown. When
a!Xt, where a is an estimator of a. For example, the cointegrating vectors are known, the regressors
Engle and Granger ( ) suggest estimating a by c/^X, , and a.'X, t can be constructed from the data,
regressing the first element of Xt onto the other and the Wald test for [1 0 can be constructed
elements of Xt using OLS, and carrying out an ADF using the usual
test using the residuals from this regression.
regression formula. When the cointegrating vectors
Estimation of a changes the distribution of the ADF
are unknown, the testing problem is more difficult,
test statistic from what it is when a is known, so that
but Johansen ( ) provides an simple
critical values for the Engle-Granger test are
formula for the likelihood ratio test statistic. In
different than the standard ADF critical values. As
either case, the critical values for the test are ‘non-
described in Phillips and Ouliaris ( ) and Hansen (
standard’, that is, they are not based on the y2
) the correct
or /'’distributions. Critical values for the tests
critical values depend on the number of elements in
depend on the values of ra — r0, the number of
X and on the properties of the deterministic trends
cointegrating vectors that are known and unknown,
in the model. Stock and Watson ( )
and the presence or absence of constants and time
tabulate choices of critical values from the Phillips
trends in the model. The various critical values are
and Ouliaris ( ) and Hansen ( ) papers
that are appropriate for data that follow 1(1) pro- tabulated in Horvath and Watson ( ).
cesses that may or may not contain drift, and thus
serve as conservative critical values. Modifications
for tests of the Hi(0) null versus the Hi(i) alternative
are discussed in Shin ( ) and Estimating Unknown Cointegrating
Jansson ( ). Coefficients
The tests outlined above are useful for testing
whether a single series a.Xt is I (0) or 1(1), but in Unknown coefficients in cointegrating vectors are
many applications there may be more than one typically estimated using least squares and Gaussian
potential cointegrating relation (r> 1) so that it is maximum likelihood estimators (MLEs). The
useful to have tests for hypothesis that postulate properties of these estimators can be understood by
different values of r. That is, it is useful to entertain considering a simple bivariate model
hypotheses of the form Hj : r = j, for j = 0, 1, ... n.
The hypothesis r 0 means that there is no Ai, = 6X21 + i] 1,
cointegration, r= 1 means that there is a single X21 = A2,-1 + 1)2/
1768 Cointegration
where i], = [i]ui]2l]'~ iid N (0, E). In this example, The density of {X2t}Tt=l does not depend on 6 , and
there is one common trend that coincides with X2t, the density of X\t\{X2t)lt , is characterized by the
the cointegrating vector is a = (1 — 9)' where 9 is an Gaussian linear regression Xlt = 0X2l + [jX2, + vt,
unknown parameter, the error correction term is a! where [1 is the regression coefficient from the
X, = i]u which is potentially correlated with the
regression of rju onto fj2t (=AX2t), vt is the error in
innovation in the common trend, i]2„ and the
this regression, and vt\{X2t)t , ~ iid N(0, a2). Simple
assumption of normality is used to motivate the
calculations (Phillips ) can then be used to show
Gaussian MLE of 9.
The OLS estimator of 9 has several interesting that 9 —9 ~ Op(T ')
~MLE T
properties (Stock ). Even though X2t and r]U arc and that 0 \{X2t)t , ~ Njd, L), where
correlated, the OLS estimator is consistent; indeed
~ OLS V depends on (X2t)Tt=l. Thus, 0 is consistent, is
it is ‘super-consistent’ in the sense that 0 —6 conditionally normally distributed and unbiased,
/ _______i \ xOLS 1 'xOLS _
Alternative Models for the Common constructed using large-sample normal approxi-
Trends mation applied to Gaussian MLEs.
Several papers have sought to address the Elliott
The concept of cointegration involves variables that critique by developing methods with good
share common persistent ‘trend’ components. The performance for a range of autoregressive roots
statistical analysis outlined above utilized a close to, but not exactly equal to 1.0. For example,
particular model of the trend component, namely, Wright ( ) argues that if 90 is the true value of
the driftless unit root process xt = tt \ + et. Analysis a cointegrating coefficient, then X\, — 60 X21 will be
of this model highlights many of the key features of 1(0), but if 90 is not the true value then Xlt — 60 X21
cointegrated processes, but more general models are will be highly persistent. He suggests testing that 0
often needed for empirical analysis. For example, = 90 by testing the HI(0) null for the series Xlt —
constant terms are often added to the model to 9oX2t. Alternative testing procedures in this context
capture non-zero means of error correction terms or are proposed in Stock and Watson ( ) and
drifts in the trend process. These constant terms Jansson and Moreira ( ).
change the distribution of test statistics for
cointegration in ways familiar from the effect of
constants and time trends in Dickey-Fuller unit root See Also
tests (see Flamilton ). Flansen ( ) and
Johansen ( ) con
tain useful discussion of the key issues. Higher-
order integrated processes (for example, 1(2) pro-
cesses) are discussed in Johansen ( ), Granger ►
and Lee ( ), and Stock and Watson ( ).
Hylleberg et al. ( ) discuss cointegration at
seasonal frequencies. Robinson and Hualde ( ) and Bibliography
the references cited therein discuss Elliott, G. 1998. The robustness of cointegration methods
cointegration in fractionally integrated models. when regressors almost have unit roots.
Elliott ( ) discusses cointegrated models in E c o n o m e t r i c a 66: 149-158.
which the trend follows a ‘nearunit-root’ process - Elliott, G., T.J. Rothenberg, and J.H. Stock. 1996. Efficient
tests for an autoregressive unit root. E c o n o m e t r i c a
an AR process with largest autoregressive root very 64: 813-836.
close to 1.0. (Formally, the asymptotics use a local- Engle, R.F., and C.W.J. Granger. 1987. Co-integration and
to-unity nesting with largest root AR root equal to 1 error correction: Representation, estimation, and testing.
E c o n o m e t r i c a 55: 251-276.
c/1] where c is a constant.) Elliott shows that, while
Granger, C.W.J. 1981. Some properties of time series data and
the basic cointegrated model remains unchanged in their use in econometric specification. J o u r n a l o f
this case, the properties of Gaussian maximum E c o n o m e t r i c s 16: 121-130.
likelihood estimators of unknown cointegrating Granger, C.W.J. 1986. Developments in the study of co-
integrated economic variables. O x f o r d B u l l e t i n o f
coefficients change in important ways. In particular,
E c o n o m i c s a n d S t a t i s t i c s 48: 213-228.
the Gaussian MLEs are no longer conditionally Granger, C.W.J., and T.H. Lee. 1990. Multicointegration.
unbiased, and confidence intervals constructed A d v a n c e s i n E c o n o m e t r i c s 8: 71-84.
using Gaussian approximations (for example, 9 ± Granger, C.W.J., and A.A. Weiss. 1983. Time series analysis of
error-correction models. In S t u d i e s i n e c o n o m e t -
1.96SE ( ll j j can be very misleading. Elliott’s rics, time series, and multivariate
s t a t i s t i c s , i n h o n o r o f T . W . A n d e r s o n , ed.
critique is important because small deviations from
S. Karlin, T. Amemiya, and L.A. Goodman. San Diego:
exact unit roots cannot be detected with high Academic.
probability, and yet small deviations may Hamilton, J.D. 1994. T i m e s e r i e s a n a l y s i s . Princeton:
undermine the validity of statistical inferences Princeton University Press.
Hansen, B. 1992. Efficient estimation and testing of
cointegrating vectors in the presence of deterministic
trends. J o u r n a l o f E c o n o m e t r i c s 53: 86-121.
1770 Colbert, Jean-Baptiste (1619-1683)
Horvath, M.T.K., and M.W. Watson. 1995. Testing for Manuscript, Department of Economics, Princeton
cointegration when some of the cointegrating vectors are University.
prespecified. E c o n o m e t r i c T h e o r y 11: 952 984. Stock, J.H., and M.W. Watson. 2007. I n t r o d u c t i o n t o
Ilylleberg, S., R.F. Engle, C.W.J. Granger, and B.S. Yoo. e c o n o m e t r i c s . 2nd ed. Boston: Pearson-Addison
1990. Seasonal integration and cointegration. J o u r n a l Wesley.
o f E c o n o m e t r i c s 44: 215 238. Watson, M.W. 1994. Vector autoregression and cointegration.
Jansson, M. 2004. Stationary testing with covariates. b i H a n d b o o k o f e c o n o m i c s , e d . R.F. Engle and
E c o n o m e t r i c T h e o r y 20: 56 94. D.L. McFadden, vol. 4. Amsterdam: North- Holland.
Jansson, M. 2005. Point optimal tests ofthe null of hypothesis Wright, J.H. 2000. Confidence sols for cointegrating coef-
of cointegration. J o u r n a l o f E c o n o m e t r i c s 124: ficients based on stationarity tests. J o u r n a l o f
187-201. B u s i n e s s a n d E c o n o m i c S t a t i s t i c s 18:211
Jansson, M., and M. Moreira. 2006. Optimal inference in 222.
regression models with integrated regressors.
E c o n o m e t r i c a 74: 681 714.
Johansen, S. 1988. Statistical analysis of cointegrating vectors.
Journal of Economic Dynamics and
C o n t r o l 12:231 254.
Johansen, S. 1994. The role ofthe constant and linear terms in Colbert, Jean-Baptiste (1619-1683)
cointegration analysis of non-stationary variables.
E c o n o m e t r i c R e v i e w s 13: 205 229. D. C. Coleman
Johansen, S. 1995. A statistical analysis of cointegration for
1(2) variables. E c o n o m e t r i c T h e o r y 11: 25-59.
Kwiatkowski, D., P.C.B. Phillips, P. Schmidt, and Y. Shin.
1992. Testing the null hypothesis of stationarity against
the alternative of a unit root. J o u r n a l o f
E c o n o m e t r i c s 54: 159-178. Keywords
Muller, U.K. 2005. Size and power of tests for stationarity in Colbert, J.-B.; Colbertism; Mercantile system;
highly autocorrelated time series. J o u r n a l o f Mercantilism
E c o n o m e t r i c s 128: 195-213.
Nyblom, J. 1989. Testing for the constancy of parameters over
time. J o u r n a l o f t h e A m e r i c a n S t a t i s t i c a l
A s s o c i a t i o n 84: 223 230. JEL Classifications
Park, J.Y. 1992. Canonical cointergrating regressions. B31
E c o n o m e t r i c a ( A Y . 119 143.
Phillips, P.C.B. 1991. Optimal inference in cointegrated Colbert was born at Reims on 29 August 1619 and
systems. E c o n o m e t r i c a 59: 283-306.
died on 6 September 1683. In no way at all could he
Phillips, P.C.B., and B.li. Hansen. 1990. Statistical inference
on instrumental variables regression with 1(1) processes. be called an economist. He was, however, one of
R e v i e w o f E c o n o m i c S t u d i e s 57: 99-124. the most powerful administrators, known to history,
Phillips, P.C.B., and S. Ouliaris. 1990. Asymptotic properties of measures affecting the economic life of a nation,
of residual based test for cointegration.
to such an extent and with such lasting influence
E c o n o m e t r i c a 58: 165-193.
Robinson, P.M., and J. Hualde. 2003. Cointegration in that his name is preserved in the notion of
fractional systems of unknown orders. E c o n o m e t r i c a Colbertism.
71: 1727-1766. He came of a mercantile family which had
Saikkonen, P. 1991. Asymptotically efficient estimation of acquired some public offices. He learned his job as
cointegrating regressions. E c o n o m e t r i c T h e o r y 7:
1-21." economic administrator by entering the service, in
Shin, Y. 1994. A residual-based losl of the null of 1651, of a man he was effectively to succeed,
cointegration against the alternative of no cointegration. Cardinal Mazarin. Once successfully installed in
E c o n o m e t r i c T h e o r y 10: 91 115. the service of Louis XIV, after Mazarin’s death in
Stock, J.H. 1987. Asymptotic properties of least squares
estimates of cointegrating vectors. E c o n o m e t r i c a 55:
1661 his climb to power was rapid. He soon came
1035-1056. to hold numerous offices of state: finance, com-
Stock, J.H., and M.W. Watson. 1993. A simple estimator of merce, buildings, the navy, and more besides. His
cointegrated vectors in higher-order integrated systems. achievements rested in part upon his exercising
E c o n o m e t r i c a 61: 783-820.
Stock, J.H., and M.W. Watson. 1996. Confidence sets in
virtually undisputed power for 22 years as the
regression with highly serially correlated regressors. dominant minister of the grandest of absolute
Colbert, Jean-Baptiste (1619-1683) 1771
monarchs, and in part upon his own qualities of were therefore to be especially favoured.
character which he brought to bear upon the eco- Manufacturing industry deserved encouragement
nomic problems of France as he perceived them. because it lessened French dependence on imports,
Those qualities included energy, tenacity, shrewd- because it was the basis of an export trade which
ness, honesty, a notable ability to deploy the tech- brought in wealth, and because it employed the idle
niques of the courtier, and a wholly remarkable (the Catholic Colbert had the zeal for work and the
capacity for hard work. His hand was felt in every disapproval of idleness normally thought of as
aspect of French economic life; and everywhere he peculiar to Puritanism). In the interest of the
exercised that passion for order which is so often economic unification of France, internal trade and
the hallmark of the bureaucrat. Adam Smith sniffed transport needed improvement by the removal of
at him as a ‘laborious and plodding man of business tolls and the repair of roads and bridges. Royal
... accustomed to regulate the different departments support was needed, and was secured, for the
of public offices’ (Smith , p. 627). But he was a lot construction of canals - of which the most
more than that. Cold, humourless, and devoted, he spectacular achievement was the opening in 1681 of
was the superservant of a super-king. the Canal des Deux Mers, providing a waterway
Those qualities did not, on the other hand, between the Atlantic and the Mediterranean.
include any original economic ideas whatever. He Colbert shared the pervasive belief in a fixed
had absorbed, with characteristic thoroughness, all cake of trade, so that, as he patiently explained to
the assumptions, maxims, dogmas, and assorted Louis in March 1669, the whole trade of Europe
notions about economic matters which circulated in was carried in a fixed number of vessels and
16th- and 17th-century Europe, and to which the therefore ‘le commerce cause un combat perpetuel
label of mercantilism has become attached. en paix et en guerre entre les nations de l’Europe, a
Consequently, by dint of his position and activities, qui on emportera la meilleure partie’. The Dutch,
and because a very large volume of his papers have the English and the French were the ‘acteurs de ce
survived for the historian, he has come down to combat’ (Lettres VI, p. 266). France’s gain was to
posterity as the embodiment of conventional be secured by Holland’s and/or England’s loss. It
mercantilism in practice. Nonexistent as a followed that shipbuilding should be encouraged
theoretical entity, mercantilism has acquired the and the French navy and mercantile marine greatly
appearance of a coherent economic policy probably enlarged. France should move in on trades hitherto
more from Colbert’s activities than from any other dominated by her rivals. Hence his setting up in the
single historical source. And because it appeared, 1660s of privileged trading companies: a French
and was continued after his death, in the grandeur East India Company, a French West India Company
which was France, it was copied or adapted in other to improve and exploit French colonies, and the
aspiring monarchies. French mercantilism or Company of the North to tap the Baltic trade. Such
Colbertism thus became a recognizable reality in a views also provided an economic justification for
way that the English ‘mercantile system’ did not. the war which Louis launched against Holland in
The nature of his economic ideas can often be 1672. Colbert had to find the revenue for these and
gathered from the explanatory memoranda which he others of his master’s military activities.
addressed to Louis XIV (who was not always as Consequently, he devoted much time to trying to
interested in such matters as Colbert thought he reform the royal finances. Many of his measures -
should be). They have a familiar ring. He wanted for example, to improve the collection of taxes or to
money circulating in the kingdom, not because he unify the customs system - were thus again part of a
identified money with wealth, but because it policy designed to improve the performance of the
facilitated the payment of taxes and helped to economy so that it could in turn yield more wealth
stimulate economic activity; those branches of to the greater glory of le roi soleil.
overseas trade which brought in precious metals
1772 Colbertism
How much success attended Colbert’s policies confusingly, as a synonym for mercantilist policies
has been a matter of debate. Laissez-faire econo- in general.
mists and economic historians of similar views have In the course of his account, and denunciation,
inevitably disparaged them and stressed the of the mercantile system, Adam Smith presented it
rigidities which were built into the French economy as something foisted upon governments by con-
in the 18th century. His efforts to unify the chaotic spiring businessmen. Extending this view from
diversity of French fiscal and customs England to France, he said of Colbert that he had
administration were only very partially successful; been ‘imposed upon by the sophistry of merchants
his overseas trading companies were inadequately and manufacturers’ (Smith , p. 434). Whatever
financed and generally unprofitable; his compara- degree of truth there may be in his account so far as
tive neglect of agriculture left the basis of the it related to England - and there is some - it wholly
economy in a poor state. But his work did greatly misrepresents the mind of Colbert and the nature of
improve the size and efficiency of the French navy Colbertism. Distrusting the self-interest of
and mercantile marine; stimulate - albeit at a high businessmen as a power for the greater good of
cost - certain areas of French manufacturing indus- society, Colbert believed profoundly that, although
try; and encourage French merchant enterprise in their pursuit of profits should be encouraged, the
branches of trade hitherto the preserve of others. way to ensure that such activities redounded to the
Not all of this was evident in his own lifetime. But greater wealth, and hence power and glory, of
one thing was: Colbert died a very rich man, enno- France was by regulation and order. So Colbertism
bled as Marquis de Seignelay, his brothers and was essentially a systematic treatment of economic
sisters and cousins amply provided with lucrative activities imposed from above by the King through
sinecures, his sons as ministers or army officers, and his servant. It could be described as a version of the
his three daughters married off to dukes. Such were mercantile system appropriate to an absolutist state.
the 17th-century rewards of administering an It owed little or nothing to mercantile or
economy. manufacturing pressures brought to bear on
governments. Although there were some similarities
between Colbertism and English mercantilism, both
in the ideas which lay behind it and in its outward
Bibliography forms as it affected overseas trade, the creation of
Colbertian policies did not in the least resemble the
Clement, P. (ed.). 1861-2. Lettres, Instructions et Memoires de
Colbert, 8 vols. Paris. process of bargaining and compromise between
Cole, C.W. 1939. C o l b e r t a n d a c e n t u i y o f Crown and Parliament by which English
F r e n c h m e r c a n t i l i s m , 2 vols. New York: mercantilism was muddled into existence. For this
Columbia University Press. reason alone the term ‘Parliamentary Colbertism’,
Smith, A. 1776. In A n i n q u i r y i n t o t h e n a t u r e a n d
c a u s e s o f t h e w e a l t h o f n a t i o n s , ed. E.
coined by Cunningham and used by him to describe
Carman. New York: Modem Library. 1937. English economic policy, 1689-1776
(Cunningham , II, pp. 403-68), was singularly
inappropriate. It was also inapt for the different
reason that Colbertism was distinguished by a
concern for the direct control of production which
Colbertism was wholly absent from the English version of
mercantilist policies.
D. C. Coleman
The quintessence of Colbertism is strikingly
illustrated in Colbert’s approach to manufactures.
Observing that France had great industrial potential,
with many and scattered crafts and substantial
Colbertism is a term used to describe the economic manpower, he set about the country’s industrial
policies associated with the French statesman, Jean-
Baptiste Colbert; and sometimes,
Colbertism 1773
rehabilitation. He used a variety of weapons: sub- because of inadequate mercantile enthusiasm for
sidies, special tax reductions or exemptions, pro- them; and in the degree of personal control which
tection against foreign imports, the encouragement Colbert himself exercised, especially over the
of early marriage and large families, grants of French East India Company. So far from being a
special privileges, and the establishment of product of mercantile pressures Colbertism ran foul
manufactures royales. Disapproving, for example, of merchants on more than one occasion. Colbert
of the way in which his countrymen imported and made himself very unpopular with those of
wore the woollen cloth or serges of Holland and Marseilles, for example, when, obsessed by the
England, he set up manufactures royales to need to keep money circulating so that taxes could
stimulate their production in France; and in 1667 be paid, he tried to prevent them from exporting
very sharply increased import duties against the coin in order to conduct their trade with the Levant.
offending English and Dutch imports. Similar And the highly protective anti-Dutch tariff of 1667
techniques were used to promote the making of attracted internal opposition because it so obviously
lace, silk stockings, tapestries, carpets, glassware, invited retaliation.
tinplate, soap, naval supplies, and cannon. Luxury The vast regulative apparatus built up by Col-
items and textiles received particular attention. It bert and his successors showed more contempt than
has been said that ‘the greatest industry in France understanding of the role of businessmen. French
was supplying the wants of the King and his court’ commercial and industrial advance during the 18th
(Cole , II, p. 303). In quantitative terms this was century, though owing something to Colbert’s
probably untrue but its significance was very real; initiating stimuli, continued despite, rather than
and such a statement could not possibly be made because of, the perpetuation of Colbertism. Indeed,
about English industry. Stimulation demanded one of the reasons for the final reaction against it
regulation. So Colbert established a Code of was the extent to which the bureaucratic machine
Commerce, promulgated for textiles elaborate had become both corrupt in its operation and
controls covering precise lengths, widths and other irrelevant to the needs of the French economy. It
details of all types of textiles; established an helped the proliferation in 18th-century France of a
apparatus of industrial inspection; and insisted upon congerie of fiscal officeholders and a concomitant
all labour being organized within the guild trade in offices and privileges functioning in and
structure. around an overblown court. Such practices certainly
Three points need to be stressed about these existed before Colbert’s day; but just as Colbert
measures. First, Colbertism was here a continuation brought a new administrative zeal to old economic
and codification, a new ordering of old practices; it ideas, so Colbertism came to provide a still more
was part of an etatisme with medieval roots. fertile soil for the growth of ancient corruptions.
Second, at the time that Colbert was imposing these Meanwhile, however, it appealed to other states -
measures on the French economy, their English Pmssia and the German principalities, Russia,
counterparts were withering away; the last legisla- Austria, Spain - intent on building up or repairing
tive attempt at general regulation of the English economic bases for the support of absolutist courts,
cloth industry failed in 1678. Third, Colbert’s reg- territorial ambitions, or the urge for military glory.
ulative achievements were continued after his The sorts of mercantilism which they adopted all
death: Colbertism brought many more detailed varied a good deal, despite the common name and
regulations in the seventy years after 1683. some common economic ideas. But those of central,
Colbert’s founding of privileged monopolistic eastern and southern Europe were often much
trading companies shows a certain resemblance to nearer in spirit to Colbertism than to the mercantile
the prior establishment of their counterparts in system which Smith discerned in England or to the
Holland and England. Again, however, the special particular variety which the Dutch had erected in
nature of Colbertian mercantilism is evident both in Holland. Colbertism was in this sense sui generis.
the preponderance of royal and government finance
in the early years of these companies
1774 Cole, George Douglas Howard (1889-1959)
as comparable, since Ricardo’s Principles'. Above of collective action are a public good: benefi-
all, it provided the theoretical credentials for his ciaries will gain whether or not they participate
own dissenting economics. in promoting them, while their individual efforts
However, if Keynes had to be absorbed by the cannot secure them. Small groups can use
Left, and mobilized for a recovery programme, selective incentives to ensure that their members
Cole also took the view that it was necessary to contribute to promoting their common interests.
look beyond the conditions of short-term stabili- This typically results in the paradoxical
zation and towards the development of a ‘new’ ‘exploitation of the great by the small’. The
economics of socialism. He therefore emerged as a logic of collective action helps explain many
leading advocate of socialist economic planning in notable examples of economic growth and
the 1930s, but for the rest of his life (and after the stagnation since the Middle Ages.
war from the vantage point of the Chichele Chair of
Social Political Theory at Oxford) he continued to Keywords
search for a form of socialist economy consistent Anarchy; Bargaining; Cartels; Class conflict;
with his prior commitment to a form of non- Collective action; Collective bargaining; Col-
bureaucratic socialist democracy. lusion; Common interests; Countervailing
power; Encompassing organizations; Excess
burden; Exploitation; Galbraith, K.; Group the-
Selected Works ory; Industrial revolution; Invisible hand; Latent
groups; Lobbying; Mercantilism; Non-
19 29 . The next ten years in British social and
excludability; Olson, M.; Patronage dividends;
economic policy. L on do n: Ma c m i l l a n .
Public choice; Public goods; Revelation of
19 32 . Economic tracts for the times. Lo nd on :
preferences; Samuelson, P.; Selective incentives;
Ma c m i l l a n .
Smith, A.; Strategic behaviour; Technical
19 35 . Principles of economic planning. Lo nd on :
progress; Trade unions; Wicksell, J.
Ma c m i l l a n .
1950. Socialist economics. London: Gollancz.
JEL Classifications
References D71
Cole, M. 1971. T h e l i f e o f G . D . H . C o l e . London:
For a long while, economists, like specialists in
Macmillan.
other fields, often took it for granted that groups of
Wright, A. W. 1979. G . D . H . C o l e a n d s o c i a l i s t
d e m o c r a c y . Oxford: Clarendon. individuals with common interests tended to act to
further those common interests, much as individuals
might be expected to further their own interests. If a
group of rational and self-interested individuals
realized that they would gain from political action
of a particular kind, they could be expected to
Collective Action engage in such action; if a group of workers would
gain from collective bargaining, they could be
Mancur Olson
expected to organize a trade union; if a group of
firms in an industry would profit by colluding to
achieve a monopoly price, they would tend to do so;
if the middle class or any other class in a country
Abstract had the power to dominate, that class would strive
The logic of collective action undermines the to control the government and run the country in its
assumption that common interests are always own interest. The idea that there was some tendency
promoted by their beneficiaries. Where the for groups to
number of beneficiaries is large, the benefits
1776 Collective Action
act in their common interests was often merely common interest are a public or collective good to
taken for granted, but in some cases it played a that group; they are like the public goods of law and
central conceptual role, as in some early American order, defence, and pollution abatement in that
theories of labour unions, in the ‘group theory’ of voluntary and spontaneous market mechanisms will
the ‘pluralists’ in political science, in J.K. not provide them. The fundamental reality that
Galbraith’s concept of ‘countervailing power’, and unifies the theory of public goods with the more
in the Marxian theory of class conflict. general logic of collective action is that ordinary
More recently, the explicit analysis of the logic market or voluntary action fails to obtain the
of individual optimization in groups with common objective in question. It fails because the benefits of
interests has led to a dramatically different view of collective or public goods, whether provided by
collective action. If the individuals in some group governments or non-governmental associations, are
really do share a common interest, the furtherance not subject to exclusion; if they are received by one
of that common interest will automatically benefit individual in some group, they automatically also
each individual in the group, whether or not he has go to the others in that group (Olson ).
borne any of the costs of collective action to further Since many groups with common interests
the common interest. Thus the existence of a obviously do not have the power to tax or any
common interest need not provide any incentive for comparable resource, the foregoing logic leads to
individual action in the group interest. If the farmers the prediction that many groups that would gain
who grow a given crop have a common interest in a from collective action will not in fact be organized
tariff that limits the imports and raises the price of to act in their common interests. This prediction is
that commodity, it does not follow that it is rational widely supported. Consumers have a common
for an individual farmer to pay dues to a farm interest in opposing the legislation that gives var-
organization working for such a tariff, for the ious producer groups supra-competitive prices, and
farmer would get the benefit of such a tariff whether they would sometimes also have a common interest
he had paid dues to the farm organization or not, in buyers’ coalitions that would countervail
and his dues alone would be most unlikely to producer monopolies, but there is no major country
determine whether or not the tariff passed. The where most consumers are members of any
higher price or wage that results from collective organization that works predominantly in the
action to restrict the supply in a market is similarly interest of consumers. The unemployed similarly
available to any firm or worker that remains in that share a common interest, but they are nowhere
market, whether or not that firm or worker partic- organized for collective action. Neither do most
ipated in the output restriction or other sacrifices taxpayers, nor most of the poor, belong to organi-
that obtained the higher price or wage. Similarly, zations that act in their common interest (Austen-
any gains to the capitalist class or to the working Smith ; Brock and Magee ; Chubb ; Hardin ; Moe ;
class from a government that runs a country in the Olson ).
interests of that class, will accrue to an individual in Though some groups can never act collectively
the class in question whether or not that individual in their common interest, certain other groups can,
has borne the costs of any collective action. This, in if they have ingenious leadership, overcome the
combination with the extreme improbability that a difficulties of collective action, though this usually
given individual’s actions will determine whether takes quite some time. There are two conditions
his group or class wins or loses, entails that a either of which is ultimately sufficient to make
typical individual, if rational and self-interested, collective action possible. One condition is that the
would not engage in collective action in the interest number of individuals or firms that would need to
of any large group or class. act collectively to further the common interest is
Analytically speaking, the benefits of collective sufficiently small; the other is that the groups
action in the interest of a group with a should have access to ‘selective incentives’.
Collective Action 1777
The way that small numbers can make collective When the parties that would profit from collec-
action possible at times is most easily evident on tive action have very different demand curves, the
the assumption that the individuals in a group with party with the highest absolute demand for collec-
a common interest are identical. Suppose there are tive action will have an incentive to engage in some
only two large firms in an industry and that each of amount of collective action when no other member
these firms will gain equally from any government of the group has such an interest. This leads to a
subsidy or tax loophole for the industry, or from paradoxical ‘exploitation of the great by the small’.
any supra-competitive price for its output. Clearly This is true to a greater degree and is evident much
each firm will tend to get the benefit of any more simply if income effects are ignored, as in the
lobbying it does on behalf of the industry, and this demand curves for a collective good depicted in the
can provide an incentive for some unilateral action figure below. When the party with the highest
on behalf of the industry. Since each firm’s action demand curve for the collective good, /J h, has
will have an obvious impact on the profits of the obtained the amount of the collective good, Qi, that
other, the firms will have an incentive to interact is in its interest unilaterally to provide, any and all
strategically with and bargain with one another. parties with a lower demand curve, such as /J s, will
There would be an incentive to continue this automatically receive this same amount, and thus
strategic interaction or bargaining until a joint have no incentive to provide any amount at all!
maximization or ‘group optimal’ outcome had been (Olson ). When income effects and certain ‘private
achieved. This same logic obviously also applies to good’ aspects of some collective goods are taken
collective action in the form of collusion to obtain a into account the results are less extreme, but a
supra- competitive price, and thus we obtain the distribution of burdens disproportionality
well- known incentive for oligopolistic collusion in unfavourable to the parties with the absolutely
concentrated industries whenever there are signif- larger demands tends to remain. This disproportion
icant obstacles to or costs of entry. As the number has been evident, for example, in various military
in a group increases, however, the incentive to act affiances and international organizations, in cartels,
collectively diminishes; if there are ten identical and in metropolitan areas in which metropolis-wide
members of a group with a common interest, each collective goods are provided by independent
gets a tenth of the benefit of unilateral action in the municipalities of greatly different size (Olson and
common interest of the group, and if there are a Zeckhauser ; Sandler ) Fig. .
million, each gets one millionth. In this last case, The other condition, besides small numbers, that
even if there were some incentive to act in the can make collective action possible, is ‘selective
common interest, that incentive would cease long incentives’. Those large groups that have been
before a group-optimal amount of collective action organized for collective action for any substantial
had taken place. Strategic interaction or voluntary period of time are regularly found to have worked
bargaining will not occur since no two individuals out special devices, or selective incentives, that are
have an incentive to interact strategically or to functionally equivalent to the taxes that enable
bargain with one another. This is because the failure governments to provide public goods (Olson ;
of one individual to support collective action will Hardin ). These selective incentives either punish or
not them have any perceptible effect on the reward individuals depending on whether or not
incentive any other individual faces so there is no they have borne a share of the costs of collective
incentive for strategic interaction or rational action, and thus give the individual an incentive to
bargaining. Thus we obtain the result that, in time, contribute to collective action that no good that is or
sufficiently small groups can act collectively, but would be available to all could provide. The most
that this incentive for collective action decreases obvious devices of this kind are the ‘closed shop’
monotonically as the group gets larger and and picket line arrangements of labour unions,
disappears entirely in sufficiently large or ‘latent’ which often make union membership a condition of
groups.
1778 Collective Action
employment and control the supply of labour during workplaces where picket lines may be established.
strikes (see, for example, McDonald ; Gamson ). Those who would profit from entering a cartelized
Upon investigation it becomes clear that labour industry or profession are similarly almost always
unions are not in this respect fundamentally without selective incentives. Experience in a variety
different from other large organizations for of countries also confirms that those with higher
collective action, which regularly have selective levels of education and skill have better access to
incentives that, though usually less conspicuous selective incentives than lower income workers;
than the closed shop or the picket line, serve the highly trained professionals such as physicians and
same function. attorneys usually come to be well organized before
Farm organizations in several countries, and labour unions emerge, and the unions of skilled
quite notably in the United States, obtain most of workers normally emerge before unions
their membership by deducting the dues in farm representing less skilled workers. The correlation
organizations from the ‘patronage dividends’ or between income and established status and access
rebates of farm cooperatives and insurance com- to selective incentives works in the same direction
panies that are associated with the farm organiza- as the lesser difficulty of collective action of small
tions. The professional associations representing groups of large firms in relatively concentrated
such groups as physicians and lawyers character- industries explained above. Together these two
istically have either relatively discreet forms of factors generate a tendency for collective action to
compulsion (such as the ‘closed bar’) or subtle have, in the aggregate though not in all cases, a
individual rewards to association members, such as strong anti-egalitarian and pro-establishment impact
access to professional publications, certification, (Olson ).
referrals, and insurance. In small groups, and The study of collective action goes back to the
sometimes in large ‘federal’ groups that are beginnings of economics, but then came to be
composed of many small groups, social pressure strangely neglected during most of the rest of the
and social rewards are also important sources of history of the subject. Though this is not generally
selective incentives. realized, the study of collective action, admittedly
The selective incentives that are needed if large only in an inductive and intuitive way, was a crucial
groups are to organize for collective action are less part of Adam Smith’s analysis of the inefficiencies
often available to potential entrants or those at the and inequities in the economies he observed
lower levels of the social order than to established (Smith ). Smith even noted that the main
and well-placed groups. The unemployed, for beneficiaries of collective action in his time were
example, obviously do not have the option of by no means the poor or those of average means.
making membership of an organization working in He also emphasized the tendency for urban interests
their interest a condition of employment, nor do to profit from collective action at the expense of
they naturally congregate as the employed do at rural people, because the
Collective Action 1779
geographical dispersion of agricultural interests handicap that nual interests face in organizing for
areas made it more difficult for them to combine to collective action is far less in developed countries
exert political influence or to fix prices; this today than it was in Smith’s time, even this part of
emphasis presumably owed something to the poor his argument still generally holds true in the
transportation and communication systems in his developing countries, where transportation and
day, which presumably obstructed the organization communication in the nual areas are poor, peasants
of rural interests more in his time than it does in are generally unrepresented, and agricultural
developed countries now. commodities normally underpriced (Anderson and
The label that Adam Smith gave to the set of Hayami ; Schultz ; Olson ).
public policies, monopolistic combinations, and Adam Smith’s insights into collective action and
ideas that he attacked was, after all, ‘mercantilism’, its consequences were ignored until recent times.
because the single most important source of the Presumably one reason is that most economists in
evils was the collective action of merchants, or the 19th and early 20th centimes were mainly
merchants and ‘masters’, especially those organized interested in the logic of the case for competitive
into guilds or ‘corporations’. In his discussions of markets. The logic of collective action, by contrast,
the ‘Inequalities Occasioned by the Policy of is really a general statement of the logic of market
Europe’ and of ‘The Rent of Land’ (Bk. I, ch. 10, failure; it embodies the central insight of the
pt. iiandch. IT), Smith emphasized that ‘whenever theories of public goods and externalities, that
the legislature attempts to regulate the differences markets and voluntary market-type arrangements do
between masters and their workmen, its counsellors not generally work in those cases where the
are always the Masters’. Similarly, beneficiaries of any collective good or benefit
cannot be excluded because they have not paid any
it is everywhere much easier for a rich merchant to
obtain the privilege of trading in a town corporate,
purchase price or dues (Baumol ). It was not until
than for a poor artificer to obtain that of working in Knut Wicksell’s New Principle of Just Taxation’
i t . . . Though the interest of the labourer is strictly was published in German in 1896 (Musgrave and
connected with that of the society ... his voice is little Peacock ) that any economist revealed a clear
heard and less regarded.
understanding of the nature of public goods, and
The rural interests are similarly at a disadvan- only with the publication of Samuelson’s articles in
tage, according to Smith, especially as compared the 1950s (Samuelson ) that this idea came to be
with those in ‘trade and manufacturers’: generally understood in the English-speaking world.
The inhabitants of a town, being collected into one A second obstacle to the development of the
place, can easily combine together. The most insig- logic of collective action was that collective action
nificant trades carried on in towns have accordingly,
in some place or another, been incorporated ...
by governments was normally taken for granted.
voluntary associations and agreements prevent that Notwithstanding the difficulties of collective action,
free competition which they cannot prohibit... .The anarchy is relatively rare because a government that
trades which employ but a small number of hands provides some sort of law and order quickly takes
run most easily into such combinations. ... People of
over. This in turn is due to conquerors and the gains
the same trade seldom meet together, even for
merriment and diversion, but the conversation ends they obtain in increased tax revenues from
in a conspiracy against the public, or in some con- establishing some system of law and order and
trivance to raise prices. property rights. In the absence of the provision of
By contrast, ‘the inhabitants of the country, these most elemental collective goods, there is not
dispersed in distant places, cannot easily combine much for a conqueror to take, so the historic first
together’. movement of the invisible hand is evident in the
These passages, though not in the order they incentive conquerors have to establish law and
appear in Smith, nonetheless correctly convey his order. Those who lead the governments that succeed
alertness to collective action. Though the conquerors obviously must
1780 Collective Action
maintain a system of law and order if they are to incentives will usually have great difficulty in
continue collecting significant tax revenues. Since working out these (often subtle) devices, and will
governments providing basic collective goods have normally succeed in overcoming the great
been ubiquitous, the classic writers on public goods difficulties of collective action only when they have
like Wicksell and Samuelson did not even ask how relatively ingenious leadership and favourable
collective goods emerged in the first place. They circumstances.
focused instead on how to determine what was an If follows that it is only in long-stable societies
appropriate sharing of the tax burdens and on the that many extra-governmental organizations for
difficulty of determining what level of provision of collective action will exist. In societies where
public goods was Pareto- optimal. This in turn totalitarian repression, revolutionary upheavals, or
naturally led to Wicksell’s recommendation that unconditional defeat have lately destroyed
only those public expenditures that could, with an organizations for collective action, few groups will
approximate allocation of the tax burdens, have been able in the time available to have
command approximate unanimity, should normally overcome the formidable difficulties of collective
be permitted, and to Samuelson’s and Musgrave’s ( action. It has been shown elsewhere (Mueller ;
) concern for Olson ), that (unless they are very ‘encompassing’)
the non-revelation of preferences for public goods. organizations for collective action have
The difficulties of collective action and public good extraordinarily anti-social incentives; they engage
provision on a voluntary basis therefore naturally in distributional struggles, even when the excess
did not gain any theoretical attention. burden of such struggles is very great, rather than in
When, as in the new political economy or public production. They also will tend to make decisions
choice, the focus is also on the efforts of extra- slowly and thereby retard technological advance
governmental groups to obtain the gains from and adaptations to macroeconomic and monetary
lobbying, cartelization, and collusion, and on shocks. It follows that societies that have been
private action to obtain collective benefits of other through catastrophes that have destroyed
kinds, a more general conception becomes natural organizations for collective action, such as
(Barry and Hardin ; Olson ; Taylor ). It then Germany, Japan, and Italy, can be expected to enjoy
becomes clear that the likelihood of voluntary ‘economic miracles’. An understanding of
collective action depends dramatically on the size of collective action also makes it possible to
the group that would gain from collective action. understand how Great Britain, the country that with
When a group is sufficiently small and there is time industrial revolution discovered modem economic
for the needed bargaining, the desired collective growth and had for nearly a century the world’s
goods will normally be obtained through voluntary fastest rate of economic growth, could by now have
cooperation (Frohlich et al. ). If there are substantial fallen victim to the ‘British disease’. The logic of
differences in the demands for the collective good at collective action, in combination with other
issue, there will be the aforementioned paradoxical theories, also makes it possible to understand many
‘exploitation of the great by the small’. When the of the other most notable examples of economic
number of beneficiaries of collective action is very growth and stagnation since the Middle Ages, and
large, voluntary and straightforward collective also certain features of macroeconomic experience
action is out of the question, and taxes or other that contradict Keynesian, monetarist, and new
selective incentives are indispensable. Selective classical macroeconomic theories (Balassa and
incentives are available only to a subset of those Giersch ).
extra-governmental groups that would gain from
collective action. Even those extra-governmental
groups that do have the potential of organizing See Also
through selective
►
Collective Action (New Perspectives) 1781
►:: v Smith, A. 1776. An inquiiy into the nature and causes of the
wealth of nations. London: J.M. Dent. 1910.
Taylor, M. 1976. A n a r c h v a n d c o o p e r a t i o n .
► : J c C : oice London: John Wiley.
Wicksell, K. 1896. A new principle of just taxation. Trans,
from the German by J.A. Buchanan, in Musgrave and
Bibliography Peacock (1967).
Anderson, K., and Y. Hayami. 1986. T h e p o l i t i c a l
e c o n o m y o f p r o t e c t i o n . Sydney: George Allen &
Unwin.
Austen-Snrith, D. 1981. Voluntary pressure groups.
E c o n o m i c a 48: 143-153.
Balassa, B., and H. Giersch (eds.). 1986. E c o n o m i c
Collective Action (New Perspectives)
i n c e n t i v e s . Proceedings of the international economic
association, London: Macmillan. David P. Myatt
Barry, B., and R. Hardin (eds.). 1982. R a t i o n a l m a n
a n d i r r a t i o n a l s o c i e t y . Beverly Hills: Sage.
Baumol, W.J. 1952. W e l f a r e e c o n o m i c s a n d t h e
t h e o r y o f t h e s t a t e . Cambridge, MA: Harvard
University Press. Abstract
Brock, W., and S. Magee. 1978. The economics of special
Olson’s logic of collective action predicts that
interest groups: The case of the tariff. A m e r i c a n E c o -
n o m i c R e v i e w 68: 246-250. public-good provision is most likely to fail when
Chubb, J. 1983. Interest groups and the size of the consumer group is large; his
b u r e a u c r a c y . Stanford: Stanford University Press. public goods are partially rival, and so the
Frohlich, N., J. Oppenheimer, and O. Young. 1971. P o l i t -
private cost of provision is relatively high. With
ical leadership and collective boards.
Princeton: Princeton University Press. a pure public good, this logic no longer applies,
Gamson, W.A. 1975. T h e s t r a t e g y o f s o c i a l and so attention turns to producer groups. When
p r o t e s t . Home- wood: Dorsey. provision involves teamwork (so that the
Hardin, R. 1982. C o l l e c t i v e a c t i o n . Baltimore: Johns
collective action succeeds when everyone works
Hopkins University Press for Resources for the Future.
McDonald, D.J. 1969. U n i o n m a n . New York: Dutton. together) then coordination problems arise.
Moe, T.M. 1980. T h e o r g a n i z a t i o n o f i n t e r e s t s . Modem techniques suggest that ‘good’ equilibria
Chicago: University of Chicago Press. in which provision is successful are robust only
Mueller, D.C., (ed.). 1983. T h e p o l i t i c a l e c o n o m y o f
when the costs of provision fall below private
g r o w t h . New Haven: Yale University Press.
Musgrave, R.A. 1959. T h e t h e o r y o f p u b l i c rather than social benefits.
f i n a n c e . New York: McGraw-Hill.
Musgrave, R.A., and A.T. Peacock (eds.). 1967. C l a s s i c s
i n t h e t h e o r y o f p u b l i c f i n a n c e , 2nd ed. New
York: McGraw-Hill. Keywords
Olson, M.L. 1965. T h e l o g i c o f c o l l e c t i v e a c t i o n . Chicken games; Collective action; Cournot
Cambridge, MA: Harvard University Press.
contributions games; Critical mass theory;
Olson, M.L. 1982. T h e r i s e a n d d e c l i n e o f
n a t i o n s . New Haven: Yale University Press. Equilibrium-selection problem; Externalities;
Olson, M.L. 1984. Ideology and growth. In T h e l e g a c y o f Global games; Interdependent consumption;
r e a g a n o m i c s , ed. C.R. Hulten and I.V. Sawhill. Interdependent production; Market failure;
Washington, DC: Urban Institute Press.
Multiple equilibria; Olson, M.; Provision games;
Olson, M.L. 1985. Space, organization, and agriculture.
American Journal of Agricultural Public goods; Risk dominance; Schel- ling, T.;
E c o n o m i c s 67: 928-937. Selective incentives; Strategic voting; Strategy
Olson, M.L., and R. Zeckhauser. 1966. An economic theory of revision; Teamwork dilemma; Volunteer’s
alliances. T h e R e v i e w o f E c o n o m i c s a n d
dilemma
S t a t i s t i c s 48: 266-279.
Samuelson, P.A. 1955. Diagrammatic exposition of a theory of
public expenditure. T h e R e v i e w o f E c o n o m i c s
a n d S t a t i s t i c s 37: 350-356.
Sandler, T. (ed.). 1980. The theory and structure of inter- JEL classifications
national political economy. Boulder: Westview. D71
Schultz, T.W. 1978. D i s t o r t i o n o f a g r i c u l t u r a l
1782 Collective Action (New Perspectives)
In a review conducted on behalf of the UK Gov- to further those common interests, much as indi-
ernment, Stem ( ) concluded that ‘climate viduals might be expected to further their own
change is a serious global threat, and demands an interests’. He persuasively argued that ‘the exis-
urgent global response ... the benefits of strong and tence of a common interest need not provide any
early action far outweigh the economic costs of not incentive for an individual action in the group
acting’. The cuts in emissions that he suggested interest’. Hence consumers may fail to campaign for
could generate global benefits. However, the costs their collective protection, unions may fail to
would be borne individually by those making protect all their members, oligopolists may fail to
significant cuts (developed nations) or by those maintain collusive prices, and nations of the world
sacrificing future opportunities (rapidly developing may fail to prevent further climate change.
nations). Olson’s point was simple and is now familiar:
A shared desire to cut greenhouse-gas emissions when contemplating choice, individuals consider
generates a classic problem of collective action: a only the private impact of their actions. For the
group with common interests must rely on volun- classic case of a public good, an individual faces the
tary individual optimization for the pursuit of those full marginal cost of provision but fails to account
interests. Stem’s ‘urgent global response’ to a ‘seri- for the benefit spilling over to others; the presence
ous global threat’ requires nations to act. Such of positive externalities leads to under-provision. If
sovereign states need respond only to their own an individual could internalize these externalities,
incentives; any participation is voluntary. Within perhaps by excluding the consumption of others and
each state, the pursuit of national objectives is not charging them for it, then efficiency could be
automatic; environmental effects stem from the restored. Alas, pure public goods are non-
decisions of individual agents. Even if it were in a excludable, and hence this route to efficiency is
state’s collective interest to support a collective blocked.
action against climate change, it cannot be assumed Nevertheless, as long as individuals enjoy some
that constituents of that state would individually private benefit from voluntary action then we can
offer their backing. expect some, albeit too little, provision of public
To economists, the collective-action problem goods. The extent of any inefficiency depends upon
boils down to the private provision of a public good the nature of the collective-action problem, the
or the private exploitation of a common resource. availability ofmechanisms to restore efficiency, and
Law and order, military defence and pollution die size and nature of the relevant group. Olson (
control are classic textbook examples of public ) concluded that ‘unless the number of indi
goods: the benefits of provision are non-excludable, viduals in a group is quite small, or unless there is
and so private providers fail to capture the full coercion or some other special device to make
impact of their contributions. This market failure individuals act in their common interest, rational,
leads to inefficient under-provision. On the other self-interested individuals will not act to achieve
hand, the commons exploitation of traffic their common or group interests’. In the context of
congestion and commercial fishing yield negative small groups, when partial provision is deemed
externalities: market failure yields to inefficient possible, he identified ‘a surprising tendency for the
overindulgence in these activities. In both cases, ‘exploitation’ of the great by the small’. These
individuals fail to pursue efficiently their collective claims led to his theory of groups: (a) collective
objectives. actions fail when the groups are large; (b) laiger
The idea that group members will not always factions bear a disproportionate share of any pro-
pursue their common interests was once not vision; and (c) selective incentives are necessary if
accepted widely. In his article in the first edition of groups are to succeed. These three claims are con-
the New Palgrave, Mancur Olson ( ) sidered in turn, before attention turns to a rather
observed that ‘economists, like specialists in other different perspective on collective action.
fields, often took it for granted that groups of i The first claim is Olson’s ‘group size’ hypoth-
ndividuals with common interests tended to act esis: private provision should fall as a group
Collective Action (New Perspectives) 1783
grows larger. Olson ( ) painted a picture of a good, and so face a relatively large private benefit.
meeting at which too few people make careful Once again, this builds upon the assumption that the
contributions: ‘When the number of participants is collective output is rival; for a pure public good, the
large, the typical participant will know that his own same logic would predict that those who care most
efforts will probably not make much difference to contribute most, and such contributors need not be
the outcome, and that he will be affected by the large in a conventional sense.
meeting’s decision in much the same way no matter Olson’s third claim concerned the possible
how much or how little effort he puts into studying response to the problem of collective action. Such a
the issues.’ More directly, the claim is that the response requires, according to this claim, selective
private benefit of any voluntary contribution falls incentives that are ‘functionally equivalent to the
with the group’s size; equivalently, the private cost taxes that enable governments to provide public
for any particular level of public provision rises goods ... [they] either punish or reward individuals
with the group size. This claim leans on two depending on whether or not they have borne a
implicit assumptions. First, an increase in the share of the costs of collective action, and thus give
number consuming the good leads to an increase in the individual an incentive to contribute ...’
the provision cost, and hence the good is (at least (Olson ). The classic example of selective
partially) rival; it is an impure public good. Second, incentives is the ‘closed shop’ of labour unions; to
the group size corresponds to the number of enjoy the benefits of collective union bargaining
consumers, and not to the size of the contributor power each worker must be a member, and hence
pool. pay the costs of any strike action. Interestingly,
These two implicit assumptions that underpin when the selective incentive is based on preventing
the group-size hypothesis are often valid. For a group member from enjoying the collective output
instance, the global climate change that worried then the implicit assumption is that the public good
Stem ( ) corresponds to a Targe group’ global is at least partially excludable.
collective-action problem (Sandler ). Nevertheless, In summary, Olson ( , ) forcefully
the assumptions often exclude interesting clarified the inescapable logic of collective action:
collective-action problems. The first assumption any theory of group behaviour must rely upon the
rules out pure public goods. Consider, for instance, incentives faced by individuals, and not simply
the contemporary voluntary provision of open- assume that groups pursue their common interests.
source software (Raymond ; Johnson ; Lemer and His theory of groups remains relevant for many
Tirole ). The typical license under which such contemporary problems. However, it steps outside
software is distributed ‘requires that the source code the world of pure public goods by assuming the
... be made available to everyone, and that the interdependent consumption of an impure public
modifications made by its users also be turned back good, and does not allow for interdependence of
to the community’ (Lemer and Tirole ). This a production. Put more succinctly, Olson’s groups
modem instance of the ‘collective invention’ consist of public good consumers rather than public
documented by Allen ( ). Open-source software good providers.
is automatically Attention now refocuses on collective-action
non-excludable. Of course, software is a classic problems in which economic players non-
instance of a non-rival good: consumption by one cooperatively choose whether to participate in the
individual does not hamper the consumption private production of a pure public good. Crucially,
opportunities of others. Hence, an increase in the there can be interdependence of production: the
size of the group consuming the good, while fixing incentive to participate in a collective action
the size of the group able to provide it, has no direct depends on the expected participation of others.
impact on incentives. Decisions become genuinely strategic, and this
Olson’s second claim was that provision costs changes the nature of the collective-action problem.
fall on larger members of a group. The idea is that
such members consume large shares of the public
1784 Collective Action (New Perspectives)
A little notation proves helpful. Amongst n the successful provision of a public good) turns
players, write x, for the action of player i, and upon either the participation of a critical mass of
collect the actions of everyone together into a vector players, or contributions that exceed a particular
x. Payoffs satisfying threshold. Returning once more to the economics of
climate change, a plausible scenario is one in which
Ui{x) = G( x ) - c^xi) the ice caps melt unless carbon emissions are
pushed down below a critical level. Whereas in a
( 1) Cournot contributions game the incentive to
contribute decreases with the participation of others,
comprise the value G(x) of public good and the
here it may increase: a nation may find it
private cost e/x,) that player i incurs when con-
worthwhile to chase environmental targets if and
tributing to it; the externality imposed on others is only if it expects others to play their part in inter-
captured by (n 1)G (x). The nature of the strategic national agreements.
interaction amongst players depends upon the form A central feature of threshold-based scenarios is
taken by G(x). A simple specification is when x, is a that an individual’s decision depends on aggregate
positive real number and G(x) = x,-. ^ Player’s participation. This is easiest to explore in a binary-
decision is strategi action game where x, e {0,1} for each player i;
cally independent of others’ actions: he simply hence x, = 1 can be interpreted as individual
equates the private marginal benefit of the public participation in a collective action. In many such
good to its private marginal cost via the first-order situations, the incentive to participate depends on
condition = cJ(x[), yielding the usual underprovision the number of others who do so. Hence, writing A«,
problem (Comes and Sandler ). (x) for this incentive,
A second natural specification to consider is
A M,(x) = P{m) where m = x^. (2)
where G(x) = F(El,x‘) for some nicely
behaved concave production function F( ■ ). This When P(m) < 0 for all m, no players participate;
falls within the class of Cournot contributions this is an n-player Prisoner’s Dilemma. If P(m)
games (Chamberlin ; McGuire ; Young ; Comes and decreases with m, then the unique equilibrium
Sandler ; Bergstrom et al. ; Bemheim ). Here, entails the participation of m players, where P(m* -
strategic interaction is non-trivial since the marginal 1) > 0 > P(m*); for the Cournot games considered
benefit of increased public good provision depends above the participation m might be socially
on the total contributions of all players. suboptimal. If P(m) increases with m, so that there
Nevertheless, a unique Nash equilibrium involves is a threshold m satisfying P(m — 1) < 0 < P(m ),
under-provision. The associated literature concerned then there are two pure- strategy Nash equilibria,
itself with the comparative-static properties of such one in which everyone participates, and one in
models, including the response of public good which the collective action fails. This means that the
output and the burden of provision to the problem of collective action becomes one of
redistribution of income (Warr ; Kemp ). coordination.
These first two examples of eq. ( ) simply flesh Games satisfying eq. ( ) drew the insightful
out the implicit model of Olson ( ). The nature attention of Schelling ( , ). He opened
of the collective action problem changes signifi- his analysis by describing the use of protective
cantly when G(x) takes on more interesting and yet helmets in ice hockey: players were willing to wear
plausible shapes. For instance, G( • ) might take a helmets only if others did so too. Other sociological
weakest link (G(x) = min{x,}) or best shot (G(x)
examples are easy to find: members of a crowd will
min{x,}) form (Hirshleifer , );
join a protest only if others do so (Berk ;
these are special cases of symmetric but non-
Granovetter ) and successful consumer boycotts
additive specifications (Comes ).
require a critical mass (Innes )•
Here, however, attention turns to situations in
which the success of a collective action (that is,
Collective Action (New Perspectives) 1785
Political situations can also fit eq. ( ). Consider a player A participates so long as player B does not. If
plurality rule election in which a group wishes to v > cB, then there are two pure-strategy Nash
prevent the success of a disliked incumbent can- equilibria in which a single player provides the
didate. They can do so if and only if a critical public good. But who provides?
number m abandon their first-preference candidate One possibility is to use risk dominance
and vote for their second choice. Setting P(m — 1) (Harsanyi and Selten ) as a selection criterion. The
> 0 and P(m) < 0 otherwise yields a strategic-voting risk-dominant equilibrium is that which maximizes
model (Palfrey ; Myerson and Weber ; Cox , ; the product of players’ incentives to remain at the
Myatt ). equilibrium. So, in the volunteer’s dilemma, the
In sociology, collective-action games with equilibrium in which A provides is risk-dominant if
threshold properties fall under the umbrella of the (v —cA)cB > (v — cB)cA, which holds if and only if
theory of critical mass (Oliver et al. ; Oliver and cA < cB: the most efficient provider volunteers.
Marwell ; Marwell et al. ; Following Olson ( ), the
Marwell and Oliver ). Alas, these sociolo strong (low-cost providers) bear the cost of provi-
gists had no theoretical machinery for selecting sion to the benefit of the weak.
between multiple equilibria. In economics, multiple A coordination problem also arises in the
equilibria arise in threshold-driven step-level public ‘teamwork dilemma’ (Fig. ) where both players are
goods games (Palfrey and Rosenthal ). Once again, needed for the collective action to succeed. This is
the problem of coordination boils down to a need to an assurance or ‘stag hunt’ game: as long as v > cA
choose amongst multiple equilibria. Fortunately, and v > cB there is a pure-strategy equilibrium in
recent contributions to economics allow some which both players participate, and a second with
progress to be made on the equilibrium-selection no participation in which the collective action fails.
problem. The former equilibrium is risk dominant if and only
To explore further, it is instructive to consider a if (v ~ca)(v —cB) > cAcB, which boils down to v >
simple world: two individuals (A and B) either cA+cB, this requires a single private (not social)
participate (Y) or not (N) in a collective action. benefit from the public good to exceed the total
Participationinvolves a private cost (either cA or private cost of provision. If 2v > cA+cB > v, then the
c/i), but may provide a public good to be enjoyed by collective action fails even though it would be
both players. A natural representation is via a socially optimal for it to succeed. Once again, this is
simple 2x2 strategic form game (Fig. ). a return to Olson ( ): success of the collec
In the ‘provision game’ a participant produces a tive action relies on private incentives.
public good worth v to everyone. A player’s All well and good, but can the criterion of risk
marginal product is strategically independent: the dominance be justified? In the recent literature two
incentive for player A to participate is always v — c contrasting approaches lead to the same answer.
i, and hence he does so if and only if v > cA. The theory of global games (Carlsson and Van
However, this generates a spillover of v for player Damme ; Morris and Shin ) supposes that players
B, and hence the social gain is 2v — c ,. The do not share common knowledge of the payoffs of
parameter configuration 2v > cA > v yields the games. Instead, players must rely upon privately
classic under-provision of a public good. observed signals of the game being played. For
But what if there is strategic interdependence? instance, players may be unsure of the true value v
Suppose that only one player need provide, so that a of the public good, and see an estimate of it.
second participant generates a cost but no additional Crucially, this estimate allows them to infer not
benefit. This ‘volunteer’s dilemma’ (Diekmann ) is only this value but also the probable signals
a textbook game of ‘chicken’ (Lipnowski and received by others, and hence their opponents’
Maital ). If 2v > c , > v and 2v > cB > v then neither likely behaviour. When signals become very precise
player is willing to participate even though it is then the play of a simple 2x2 game almost
socially optimal for someone to do so. However, if
v > c , then
Y V Y V
~ C B v- V Y v- cB
2 v-cB 2v -
o
v- cA CA v- C A
cA v- CA
i
C?
v- cB 0 V- cB 0 0
1
N N N
V 0 V 0 0
o
Provision game Volunteer's dilemma Teamwork dilemma
always coincides with the risk-dominant Nash and Palfrey ): the log odds of choosing one action
equilibrium (Carlsson and Van Damme ). rather than another is determined by the difference
Others have selected equilibria by studying the in their payoffs, and so he is more likely to choose
evolving play. Players (or populations from which better performing strategies. An inspection of eq. ( )
players are drawn) may adjust their play over time reveals that the difference in a player’s payoffs is
in the direction of myopic best-reply, but occa- equal to the difference in potential; the potential
sionally ‘mutate’ to a different strategy (Kandori et function captures the essential strategic interaction
al. ; Young , ). As the probability of the game.
of mutations vanishes, play in the long ran focuses Allowing play to evolve, the strategy-revision
on a single stochastically stable equilibrium (Foster process is drawn towards the states-of-play with the
and Young ). In a symmetric teamwork dilemma, it highest potential. In the long run, when quantal
picks out the risk dominant equilibrium. responses approximate best replies, the process
Can modem literature say anything about the spends almost all time in the state that maximizes
general case of eq. ( )? Players act as though they p(x): evolution leads players to maximize the dif-
are attempting to maximize jointly the single real- ference between a single private benefit and total
valued function private costs rather than social welfare which would
incorporate the frill social benefit of nG(x).
This approach can be applied to the teamwork
p(x) = G(x) - , C(v)- (3)
dilemma: the potential of the state-of-play in which
neither player participates is zero, and the potential
This is a potential function, and yields a poten-
of the equilibrium in which the collective action
tial game (Monderer and Shapley ). This function
succeeds is v — (cA + cB). The latter equilibrium has
has a natural interpretation: the private benefit that a
positive potential if and only ifv > cA + cB. only if a
single individual derives from a public good, minus
private individual would be willing to step forward
the total private costs involved in its provision.
and pay the frill cost of provision himself will the
Clean results emerge when play of a potential
collective action succeed. So, whereas it may at first
game evolves via a payoff-responsive stochastic
appear that the success of a collective action (the
strategy-revision process (Blume , ,
coordinated play of fY,Yg in the teamwork dilemma)
; Brock and Durlauf ; Blume and Durlauf ,
can follow from the interdependence of team
). Over time, players occasionally revise
members, evolving play results in failure (the play
their strategies. When a player does so, his decision
of fiV,/Vg in the teamwork dilemma) unless a
is not a myopic best reply to the current play of
private individual would be willing to fund the
others, but rather a quantal response (McKelvey
collective action himself.
On reflection, this should be unsurprising. Each
step of evolving play (or each step of
Collective Action (New Perspectives) 1787
reasoning in a global-game argument) is driven by Comes, R., and T. Sandler. 1985. The simple analytics of pure
reference to private incentives. So what lesson public good provision. E c o n o m i c a 52: 103-116.
Comes, R., and T. Sandler. 1996. T h e t h e o r y o f
should be taken away? Even when a group’s externalities, public goods and club
problem is one of coordination, its members cannot g o o d s . 2nd ed. London: Cambridge University Press.
escape Olson’s ( , ) fundamental Cox, G.W. 1994. Strategic voting equilibria under the single
nontransferable vote. American Political
logic of collective action.
S c i e n c e R e v i e w 88: 608-621.
Cox, G.W. 1997. M a k i n g v o t e s c o u n t . Cambridge,
UK: Cambridge University Press.
Diekmann, A. 1985. Volunteer’s dilemma. J o u r n a l o f
See Also C o n f l i c t R e s o l u t i o n 29: 605-610.
Foster, D., and H.P. Young. 1990. Stochastic evolutionary
► Collective /..v't.c:: game dynamics. Theoretical Population
► .xtcm; B i o l o g y 38: 219-232.
Granovetter, M. 1978. Threshold models of collective
► ’lib 11 Gocvc behavior. A m e r i c a n J o u r n a l o f S o c i o l o g y 83:
1420-1443.
Bibliography Harsanyi, J.C., and R. Selten. 1988. A g e n e r a l t h e o r y
of equilibrium selection in games.
Cambridge, MA: MIT Press Classics.
Allen, R.C. 1983. Collective invention. Journal of Economic
Hirshleifer, J. 1983. From weakest-link to best-shot: The
Behavior and Organization 4: 1-24.
voluntary provision of public goods. P u b l i c C h o i c e
Bergstrom, T.C., L. Blume, and H.R. Varian. 1986. On the
41: 371-386.
private provision of public goods. J o u r n a l o f
Hirshleifer, J. 1985. From weakest-link to best-shot:
P u b l i c E c o n o m i c s 2 9 : 25-49.
Correction. P u b l i c C h o i c e 46: 21-23.
Berk, R. 1974. A gaming approach to crowd behavior.
Innes, R. 2006. A theory of consumer boycotts under
A m e r i c a n S o c i o l o g i c a l R e v i e w 39: 355-373.
symmetric information and imperfect competition.
Bemheim, B.D. 1986. On the voluntary and involuntary
E c o n o m i c J o u r n a l 116: 355-381.
provision of public goods. A m e r i c a n E c o n o m i c
Johnson, J.P. 2002. Open source software: Private provision
R e v i e w 76: 789-793.
of a public good. J o u r n a l o f E c o n o m i c s a n d
Blume, L.E. 1993. The statistical mechanics of strategic
M a n a g e m e n t S t r a t e g y ’ 11: 637-662.
interaction. G a m e s a n d E c o n o m i c B e h a v i o r
Kandori, M., G.J. Mailath, and R. Rob. 1993. Learning,
5:387-424.
mutation and long-run equilibria in games.
Blume, L.E. 1995. The statistical mechanics of best- response
E c o n o m e t r i c a 61: 29-56.
strategy revision. G a m e s and Economic
Kemp, M. 1984. A note on the theory of international
B e h a v i o r 11: 111-145.
transfers. E c o n o m i c s L e t t e r s 14: 259-262.
Blume, L.E. 1997. Population games. In T h e e c o n o m y
Lemer, J., and J. Tirole. 2001. The open source movement:
a s a n e v o l v i n g c o m p l e x s y s t e m 1 1 , ed. W.B.
Key research questions. E u r o p e a n E c o n o m i c
Arthur, S.N. Durlauf, and D.A. Lane. Boulder: Westview.
R e v i e w 45: 819-826.
Blume, L.E., and S.N. Durlauf. 2001. The interactions- based
Lemer, J., and J. Tirole. 2002. Some simple economics of open
approach to socioeconomic behaviour. In S o c i a l
source. J o u r n a l o f I n d u s t r i a l E c o n o m i c s 50:
d y n a m i c s , ed. S.N. Durlauf and H.P. Young.
197-234.
Cambridge, MA: MIT Press.
Lipnowski, I., and S. Maital. 1983. Voluntary provision of a
Blume, L.E., and S.N. Durlauf. 2003. Equilibrium concepts
pure public good as the game of chicken. J o u r n a l o f
for social interaction models. I n t e r n a t i o n a l G a m e
P u b l i c E c o n o m i c s 20: 381-386.
T h e o r y R e v i e w 5: 193-209.
Marwell, G., and PE. Oliver. 1993. T h e c r i t i c a l m a s s
Brock, W.A., and S.N. Durlauf. 2001. Discrete choice with
in collective action: A micro-social
social interactions. R e v i e w of Economic
t h e o r y . Cambridge and New York: Cambridge
S t u d i e s 68: 235-260.
University Press.
Carlsson, H., and E. Van Damme. 1993. Global games and
Marwell, G., PE. Oliver, and R. Prahl. 1988. Social networks
equilibrium selection. E c o n o m e t r i c a 61: 989-1018.
and collective action: A theory of the critical mass. III.
Chamberlin, J. 1974. Provision of collective goods as a
A m e r i c a n J o u r n a l o f S o c i o l o g y 94: 502-534.
function of group size. A m e r i c a n P o l i t i c a l
McGuire, M. 1974. Group size, group homogeneity and
S c i e n c e R e v i e w 68: 707-716.
aggregate provision of a pure public good under coumot
Comes, R. 1993. Dyke maintenance and other stories: Some
behavior. P u b l i c C h o i c e 18: 107-126.
neglected types of public goods. Q u a r t e r l y J o u r n a l
o f E c o n o m i c s 108: 259-271.
1788 Collective Agriculture
agriculture, including the various programmers in processing, building and maintaining irrigation
the Soviet Union associated with Lysenko facilities. However, many modem farm inputs are
(discussed in Volin ) and the ill-fated introduction divisible. Provided they are appropriately priced,
of the double-wheeled, double-share plough, in credit is available and they have access to lumpy
China (Kuo , ch. 12). complementary inputs, all farm strata modernizing
Third, party-led collectives were viewed as a areas tend to acquire them (Hayami and Kikuchi ).
means to attain high farm marketing rates and an Moreover, in large agricultural units labour
outflow of farm sector savings to finance non-farm supervision is a major problem (Bradley and Clark
investment: 1971). If a collective’s members trust each other
By transferring the disposal of agricultural out- and are motivated to work hard for the group
put from individual peasants to government- irrespective of relative income then labour
supervised collective farm managements, collec- supervision is not an issue. However, this is rarely
tivization destroys the basis for the peasants’ the case (Morawetz ) and collective farm managers
resistance to the ‘siphoning-off of the economic have had to devise payment systems to motivate
surplus (Baran , p. 268). farm workers. In certain farm tasks (notably
However, without, for example, adequate sup- harvesting) it is easy to pay labour according to its
plies of appropriately priced industrial commodi- product, but for most farm tasks it is more difficult
ties, forcibly raising the rate of farm sector than in industry to devise payment systems that
marketings can reduce the growth rate of farm strongly motivate from the work often requires a
output and the future volume of farm marketings. flexible response from the worker which is difficult
Moreover, it has proved difficult to achieve a net to anticipate in the payment system; the final
farm savings outflow due, for example, to agricul- produce takes a long time to produce, with different
ture’s need for industrial incentive goods and farm workers’ contributions difficult to isolate; work is
inputs (increased, insofar as inputs are inefficiently physically dispersed and production conditions vary
used and collectivization adversely affects livestock greatly from one part of the production unit to
holdings, motive power and fertilizer supplies), and another; the main task specializations are seasonal,
the state’s inability to control private market prices and permanent minute sub-division of work into
(Ellman ; Ishikawa ). easily measurable segments is not generally
Fourth, it was considered that collectives would possible. These problems have meant that under
prevent ‘capitalist’ polarization alongside farm private agriculture, if labour is relatively abundant
modernization, with the majority of peasants and capital relatively expensive, the normal
becoming wage labourers (Stalin ; Mao ). Evidence outcome is for land to be rented out beyond a
from other developing countries contradicts Stalin certain farm size, so that a relatively high output per
and Mao’s erode vision of rural class polarization acre can be attained through self-operating, self-
(see, especially, Hayami and Kikuchi ). It indicates motivated, rent-paying farmers, rather than culti-
too that appropriate state policies (e.g. land reform, vated with large numbers of hired workers. In
provision of education and credit, infrastructure collective farms, the attempt to supervise large
construction, progressive taxation) can mitigate numbers of farm workers has resulted in powerful
rural class inequalities. Class polarization is not the managerial diseconomies of scale and reduced farm
inevitable accompaniment of rural modernization, efficiency.
nor is collectivization the only way to resolve Collective agriculture has not performed well.
problems of rural class inequality (e.g. Hayami and Collective farms in the USSR in 1929-31 and in
Kikuchi ). China in 1959-61 experienced massive institu-
Fifth, Lenin, Stalin and Mao all believed that tionally caused declines in farm output, accompa-
agriculture was characterized by lumpiness and nied by demographic disasters (on the Soviet Union,
economies of scale (Lenin ; Stalin ; Mao ). In many see Volin , ch. 10; on China, see Ashton et al. ). It is
farm tasks, large scale is indeed an advantage, for indeed, a terrible indictment of collective farming,
example in research, that the worst famines
1790 Collective Bargaining
of the 20th century have occurred under that sys- General Office of the Central Committee of the Chinese
tem. The USSRs long-term growth of farm output Communist Party. 1956. S o c i a l i s t h i g h t i d e i n
China 's villages (Zhongguo nongcun de
has required colossal capital outlays so that by the s h e h u t h u y i g a o c h a o ) , vol 3 vols. Peking:
1970s, the agricultural sector was absorbing over People’s Publishing House.
one quarter of Soviet new fixed investment Hayami, Y., and M. Kikuchi. 1981. A s i a n v i l l a g e
(Carey ). From the mid-1950s to the later 1970s e c o n o m y a t t h e c r o s s r o a d s . Tokyo: University
of Tokyo Press.
Chinese farm output per caput was stagnant: ‘de- Ishikawa, S. 1967. Resource flow between agriculture and
collectivization’ of agriculture in the early 1980s industry. T h e D e v e l o p i n g E c o n o m i e s 5(1): 3—
was accompanied by a huge in farm output (Nolan 49.
and Paine ). Kuo, L.T.C. 1972. The technical transformation of agriculture
in communist China. London: Praeger.
The ‘socialist’ countries’ poor agricultural per- Lenin, V.I. 1899. T h e d e v e l o p m e n t o f c a p i t a l i s m
formances is in part attributable to shortcomings in i n R u s s i a , 1964. Moscow: Progress Publishers.
the supply of industrial goods (Smith ). Part is also Mao, Tsetung. 1955. On the co-operative transformation of
due to extensive state intervention in collective agriculture. M a o 1977.
.- - -1977. Selected works of Mao Tsetung,
farms. However, there are fundamental problems in vol V. Peking: Foreign Languages Press.
principle even with relatively independent Morawetz, D. 1983. The kibbutz as a model for developing
collective farms. Large units (whether state, countries. S t e w a r t 1983.
collective or private) are necessary to undertake Nolan, P., and S. Paine. 1986. Towards an appraisal of the
impact of rural reform in China, 1978-85. C a m b r i d g e
activities exhibiting lumpiness or economies of
J o u r n a l o f E c o n o m i c s 10(1): 83-99.
scale. However, for many farm tasks powerful Selden, M. 1982. Co-operation and conflict: co-operative and
managerial diseconomies of scale exist, and even collective formation in China’s countryside. S e l d e n
given favourable policies in other respects, in most a n d L i p p i t 1982.
Selden, M., and V. Lippit (ed). 1982. T h e t r a n s i t i o n t o
circumstances this would prove a barrier to good
s o c i a l i s m i n C h i n a . New York: M.E. Sharpe.
performance of collective farms. Smith, G.A.E. 1981. The industrial problems of Soviet
agriculture. C r i t i q u e 14: 41-65.
Stalin, J. 1929. Concerning questions of agrarian policy. In
See Also P r o b l e m s o f L e n i n i s m , ed. J. Stalin. Peking:
Foreign Languages Press n.d.
► Agricultural Growth and Population Change Stewart, F. (ed). 1983. W o r k , income and
►:: i n e q u a l i t y . London: Macmillan.
Unger, J. 1984. C h e n v i l l a g e . Berkeley: University of
California Press.
Bibliography US Congress, Joint Economic Committee (USCJEC). 1976.
Soviet economy in a new perspective.
Ashton, B., et al. 1984. Famine in China, 1958-61. P o p u - Washington, DC: US Government Printing Office.
l a t i o n a n d D e v e l o p m e n t R e v i e w 10(4): 613-645.
Baran, P. 1957. T h e p o l i t i c a l e c o n o m y o f
g r o w t h . New York: Monthly Review Press.
Bradley, M.E., and M.G. Clark. 1972. Supervision and
efficiency in socialized agriculture. S o v i e t S t u d i e s Collective Bargaining
23(3): 465-473.
Carey, D.W. 1976. Soviet agriculture: Recent performance
William Brown
and future plans. I n U S C J E C 1976.
Cohen, S.F. 1974. B u k h a r i n a n d t h e B o l s h e v i k
r e v o l u t i o n . London: Wildwood House.
Davies, R.W. 1980. The socialist offensive: the collective action
of Soviet agriculture, 1929-1930. London: Macmillan.
Ellman, M. 1975. Did the agricultural surplus provide the Keywords
resources for the increase in investment in the USSR Arbitration; Bargaining; Collective bargaining;
during the First Five Year Plan? E c o n o m i c Collusion; Consultation; Employment contracts;
J o u r n a l 85(4): 844-863.
Industrial democracy; Industrial relations;
Negotiation; Trade unions; Wage determination
Collective Bargaining 1791
and Kochan ( ) in the United States, but prob evidence that Flanders had submitted. Finally, by
ably the most influential discussion has been Flan- conceptualizing wages as part of a broader package
ders’ attempt of 1968 to create a comprehensive of regulations and as embodying strongly normative
theoretical analysis. He argued that the economic principles, the theory opened the way to a more
associations of the term ‘collective bargaining’ are fruitful understanding of wage determination than is
misleading. The collective agreement commits no offered simply by the market models of orthodox
one to either buy or sell labour, but rather ensures theory.
that, when labour is bought or sold, the terms of the Two crucial features of the employment rela-
transaction will accord with the provisions of the tionship ensure that the process of collective
agreement. Above all else, collective bargaining is a bargaining is fundamentally unlike that of non-
rule-making process covering many aspects of the labour commercial bargains. They are its open-
employment relationship besides pay and conditions endedness and its continuity. The labour contract is
of work. The second characteristic feature of open-ended because the recruitment of an employee
collective bargaining that Flanders stressed is that does not ensure the performance of work; the
of the power relationship between the protagonists employee has to be motivated, by whatever means,
whose negotiations (‘the diplomatic use of power’ ) to perform to the required standard. In all but highly
create the rules. Thus, while there are also technical oppressive societies such motivational techniques
rales and legal rales regulating work, what tend to be varied and complex, differing not least in
distinguishes the legitimacy of those that result the extent to which they place emphasis upon levels
from collective bargaining is their authorship. They of pay and upon employee participation. Since
are jointly determined by the accepted social comparisons (and especially very local ones)
representatives of both employers and employees play an important part in the motivation and
who consequently share responsibility for both the demotivation of labour, the bureaucratic
rales’ contents and their observance. standardization of terms of employment, which is
Flanders’ analysis has proved fertile in several generally a characteristic of collective agreements,
respects. It has drawn attention to the extent to often fits in well with management’s preferred
which collective bargaining is a positive manage- personnel techniques. In this way, properly
ment technique rather than just an impediment to conducted collective bargaining can provide a
effective management imposed by trade unions. As socially stable working environment which
a result of this shift in emphasis, a major part of facilitates the employer’s prime aim of eliciting
academic research into collective bargaining in the labour productivity. In short, the conduct of the
1980s has explored managerial, as opposed to trade bargain affects the quality of the labour bargained
union, strategies, and has exposed the extent to over.
which union behaviour is shaped by these manage- The second distinctive feature of the employ-
ment strategies. In addition, what could be seen as ment relationship is its continuity. Employer and
the Weberian undercurrent in Flanders’ analysis has employees are bound together, for better or worse,
focused policymakers’ attention upon the for an indeterminate duration. Additions to and
importance of procedural clarity in conflict resolu- departures from the workforce generally occur in a
tion, and thereby upon the dangers of ambiguity in piecemeal way. A host of potentially contentious
the legitimation of agreements. The most obvious issues feature in the relationship, only a small
example is provided by the influential central rec- minority in contention at any one time, and many
ommendation of the British Royal Commission on affecting only a minority of the workforce. Thus a
Trade Unions and Employers’ Associations of bargain over a particular issue, such as a pay
1968. The emphasis it placed upon employer initi- grievance, cannot be evaluated in isolation, but as
ated procedural reform, rather than legislative con- one fibre in a thick rope of regulations, with many
straints on trade unions, owed much to the largely implicit trade-offs with respect to other
issues, past, present and future.
Collective Bargaining 1793
are other countries where employer preference, or which associations of employers conclude
legal opportunity, makes it unusual for the industrywide agreements, remains the most
bargaining opponents to use legal sanctions against important form in most of Continental Europe. In
each other. In these circumstances the great bulk of practice there is often some employer collusion in
substantive regulation may be unwritten and in the industries where single-employer bargaining
form of verbal agreements, custom, and tacit dominates, and there is usually room for individual
understandings. Because of this a greater emphasis employer discretion in industries with strong
is placed upon the rectitude of the procedural employers’ associations, but the distinction remains
agreements (which may still be very informal) one of fundamental economic, political, and
whereby this amorphous body of substantive rales is managerial significance.
interpreted and altered, not through comprehensive Two other defining characteristics of bargaining
periodic negotiations, but by a constant incremental structure are its ‘form’ and ‘scope’. The first refers
process of piecemeal adjustment. Although the to the extent to which proceedings and agreements
United States might be described as exemplifying are formalized and codified. As already mentioned,
the legalistic extreme, and Great Britain the this depends in part upon the labour legislation of
‘voluntaristic’, most bargaining arrangements have the country. The second matter, scope, refers to the
elements of each, with the degree of legalism and range of issues covered by collective bargaining. At
formality varying by issue and industry, as well as its narrowest it may include no more than pay and
by country. hours, while elsewhere it may take in issues as
diverse as training policy, investment decisions and
child-care facilities.
Bargaining Structure The most comprehensive theory seeking to
explain industrial and national differences in
The structure of bargaining in a country, industry, bargaining structure is to be found in Clegg’s Trade
or enterprise, refers to several different characteris- Unionism under Collective Bargaining ( ).
tics of collective bargaining. The two most impor- This sees the strategy adopted by employers
tant are the ‘bargaining units’ and ‘bargaining as the main determinant of bargaining structure,
levels’ employed. A bargaining unit is a group of although changes in strategy may be slow to take
employees covered by a particular agreement. effect. The legislative framework of a country is
Within this basic territory of industrial government also of crucial importance. It defines the limits of
there is a coherence of terms of employment, pro- rights to strike, the status of the employment con-
cedures, and trade union representation that is not tract, any guarantees of security for trade unions,
necessarily to be found between different and the legally responsible agents on each side.
bargaining units. The level of bargaining refers to Most countries acquired their principal labour
the role played by the principal negotiators within legislation at some historic period of crisis - war,
their organizations; whether, for example, the defeat, depression, or extreme industrial unrest - and
employer representative responsible is a factory the institutional arrangements that developed from
manager, a company director, or an employers’ that have become consolidated in subsequent, more
association representative. peaceful times. This helps to account for the very
These two characteristics are involved in the great variations in collective bargaining practice to
single most important decision in the shaping of any be found in different countries; they often owe their
bargaining structure which is whether the employers origin to a distant panic measure based upon a
confront the unions singly or in alliance. Single- fashionable idea (such as, for example, compulsory
employer bargaining, resulting in agreements at arbitration in Australia or compulsory conciliation
company-level or lower, is the majority practice in in Canada) to which employers and unions have
the United States and Japan and now in Britain. adjusted so firmly that radical reformation is all but
Multi-employer bargaining, in impossible. A recurring experience around the
world is of
Collective Choice Experiments 1795
legislatures finding extreme difficulty in reforming regulation of aspects of employment which would
collective bargaining, other than in times of extreme otherwise generate greater disharmony and division.
crisis, because of the essential privacy of the
bargaining relationship between employers and
union. See Also
Most industrialized countries publicly assert a
commitment to collective bargaining as a necessary
part of a democratic society, and for most it is the
normal means of conducting industrial relations in
the public sector. Convention 84 (1947) of the
International Labour Organization asserts that ‘all Bibliography
practical measures shall be taken to assure to trade Chamberlain, N.W. 1951. C o l l e c t i v e b a r g a i n i n g .
unions which are representative of the workers New York: McGraw-Hill.
concerned die right to conclude collective agree- Clegg, H.A. 1960. A new approach to industrial democracy.
ments with employers and employers’ associations’. Oxford: Blackwell.
Clegg, H.A. 1976. Trade unionism under collective bargaining.
In practice the freedom of collective bargaining in Oxford: Blackwell.
both public and private sectors varies substantially Dunlop, J.T. 1967. The social utility of collective bargaining.
between countries and over time. In C h a l l e n g e s t o c o l l e c t i v e b a r g a i n i n g ,
No discussion of collective baigaining would be ed. L. Ulman. New York: Prentice-Hall.
Flanders, A. 1968. Collective bargaining: A theoretical
complete without a mention of the debate analysis. B r i t i s h J o u r n a l o f I n d u s t r i a l
concerning its relationship with industrial R e l a t i o n s 6 (1): 1-26. Reprinted in Flanders, A. 1975.
democracy. M a n a g e m e n t a n d u n i o n s . London: Faber &
One view is that, because collective bargaining Faber.
Harbison, F.H. 1951. Goals and strategies in collective
is essentially concerned with compromise, trade
bargaining. New York: Harper.
unions are sucked into collaborating with capitalism Kochan, T.A. 1980. Collective bargaining and industrial
and thereby denied the opportunity of uniting the relations. Homewood: Irwin.
working class in overthrowing existing employers Leiserson, W.M. 1922. Constitutional government in
American industries. A m e r i c a n Economic
and then instituting true industrial democracy
R e v i e w 12 (Suppl): 56-79.
through workers’ control. Opposing this is a view Marsh, A. 1979. Concise encyclopedia of industrial relations.
that deplores the fact that collective bargaining Famborough: Gower.
institutionalizes the opposition of capital and Walton, R.E., and R.B. McKersie. 1965. A b e h a v i o r a l
t h e o r y o f l a b o r n e g o t i a t i o n s . New York:
labour: them and us. It considers that the best form
McGraw-Hill.
of industrial democracy is to be found where Webb, S., and B. Webb. 1897. I n d u s t r i a l d e m o c r a c y .
workers are brought to perceive an u Iti mate
identity of interest with employers. Between these
positions is that most clearly expressed by Clegg in
A New Approach to Industrial Democracy ( ).
This argues that there can never be com Collective Choice Experiments
plete identity of interest between employer and
Rick K. Wilson
employee, and also that if employee representatives
are given managerial responsibilities they will be
forced to behave very similarly to the employers
they have replaced. Consequently the role of the
trade union is best seen as one of constant Abstract
opposition, acting to modify management actions in Collective choice experiments examine voting
the light of members’ interests insofar as their mechanisms that aggregate individual prefer-
organized power permits. Far from undermining the ences. Two general topics have received the
common interests of capital and labour, collective most attention. The first pertains to agents
bargaining permits the joint deciding on a single collective outcome or
1796 Collective Choice Experiments
policy. The second topic covers election mech- fixed preferences over the dimensions, and policies
anisms that govern candidates and voters. represented as points in the space. Using rules that
mimic many parliamentary systems, these theoreti-
Keywords cal papers demonstrate that a Condorcet winner (a
Agenda control; Arrows; Theorem; Collective policy that can defeat all others under pairwise
choice experiments; Electoral mechanisms; voting) exists only under rare distributions of
Median voter; Social choice; Spatial committee voters’ preferences. Plott ( ) establishes the con
experiments ditions under which a Condorcet winner will exist
and he makes the connection between this and a
Nash equihbrium of a spatial committee game. Like
JEL Classifications others, he concludes that an equihbrium is rare in
C9 multidimensional spatial committee games.
Early spatial committee experiments by Berl et
Duncan Black ( ) and Kenneth Arrow ( ) al. ( ) and Fiorina and Plott ( ) provide
raised the key question of collective choice: if evidence that when a Condorcet winner exists, sub-
people have different preferences for policy out- jects choose it or outcomes that are close to it. In
comes are there general mechanisms that can games where there is no such equihbrium (which is
(always) aggregate those preferences in consistent the most common case), subjects select outcomes
and coherent ways? The answer is ‘no’. Starting that scatter in the policy space. These initial empir-
from simple premises involving individual transi- ical findings, coupled with experiments by Laing
tivity, aggregate Pareto optimality and non- and Olmsted ( ) and McKelvey et al. ( ),
dictatorship there is no collective choice mechanism defined the standard for conducting spatial commit-
that yields a socially transitive outcome. Such a tee experiments. Subsequent experiments have
finding is startling given the confidence placed in adopted almost identical procedures.
democratic institutions that rely on voting The standard experimental design introduces a
mechanisms to choose a single outcome from many two-dimensional policy space. The orthogonal
possible outcomes. dimensions are arbitrary (X and Y in most settings)
Experimentalists have thoroughly explored dif- and typically range from zero to 200 or more units.
ferent institutions that can be used to aggregate Every point in the space characterizes a policy.
preferences. Political economists who straddle both Preferences over outcomes are induced by assigning
economics and political science have carried out each subject a payoff function mapping earnings in
much of this work. Their concern is with situations dollars to each point in the space. While many
where actors who have opposed interests have to payoff functions have been tested, most
settle on a single outcome and with the properties of experimenters have settled on a quadratic loss
the institution used to produce an outcome. This function, with monetary payoffs decreasing as a
article first turns to the institutional mechanisms by function of distance from a subject’s ideal point.
which individuals settle on a collective outcome. Usually five subjects are assigned different ideal
The second topic turns to electoral mechanisms points in the space, and it is the arrangement of
used in representative democracies. these ideal points that allows the experimenter to
manipulate, whether a Condorcet winner exists or
not. Subjects are given an initial status quo and then
Spatial Committee Experiments
allowed to introduce amendments. Voting takes
In the late 1960s theoretical papers by Davis and place following an amendment, with the winner
Hinich ( ) and Plott ( ) described a social becoming (or remaining) the new status quo.
choice environment for spatial committees. Those Amending takes place in between votes. A motion
committees consist of a well-defined multi- to adjourn, passed under a voting rule, constitutes
dimensional policy space, with actors holding the stopping rule for the committee decision. This
serves as the standard institution
Collective Choice Experiments 1797
for subsequent spatial committee experiments. Experiments on spatial committees have added
Changing these basic institutional rules became the to a clearer understanding of institutional mecha-
way to test theories of collective choice. nisms. Experimental results demonstrate that
Experimental results in the absence of equilib- changing who has the power to set the agenda, how
rium are both frustrating and profitable. Frustration the agenda is built, how many votes are needed and
arises over the fact that committee choices tend to whether players enjoy veto powers, matters.
be clustered in similar regions of the policy space.
While there appears to be some pattern to the out-
comes, the process by which these outcomes arise Electoral Mechanisms
has not been frilly characterized (but see the attempt
by Bianco et al. ). Profitably, these empirical results A second area of interest for collective choice
led theorists and experimenters to add agenda experimentalists is with electoral mechanisms.
control to the structure of the game. This led to a Three broad directions have been taken that treat
distinction between preference-induced and different aspects of representative democracies. The
structure-induced equilibrium. For example, Plott first is concerned with candidate behaviour. At the
and Levine ( ) showed the effectiveness heart of this research is the question of whether
of agenda control both in the laboratory and in a candidate positions will converge to equilibrium
natural setting. Awarding agenda power created a when it exists. The second direction is concerned
structure-induced equilibrium and laboratory sub- with voter behaviour, particularly how voters
jects converged to it Recent experimental work by behave when they have little information about
Frechette et al. ( ) illustrates that the equilib candidate positions. The final direction deals with
rium favours agenda setters. the way in which electoral rules determine the
Theoretical work by Buchanan and Tullock ( likelihood that ‘types’ of candidates are elected,
) led experimentalists to examine whether where types usually refer to racial and ethnic
changing the proportion of actors needed to pass a minority candidates.
policy had any effect. Experiments by Laing and The initial experimental work on candidate
Slotznick ( ) showed that moving from simple behaviour focused on candidates who cared only
majority rule (50 per cent plus 1) to supermajority about winning and varied the information condi-
majority rule (67 per cent) resulted in many equi- tions that the candidates have about the preferences
libria and that subjects chose them. Schofield ( ), of voters. Most experiments use a unidimensional
among others, provided the theoretical policy space that guarantees an equilibrium. This
basis for when an equilibrium exists as a function of equilibrium is defined by the policy preference of
the dimensionality of the policy space, the voting the median voter. In the experiments elections are
rule and the distribution of voters’ preferences. sequential, with two candidates announcing posi-
These theoretical findings spurred experimentalists tions in the policy space and voters choosing
to examine other changes to the standard committee between the candidates. Voters are assigned ideal
experiment. For example Wilson and Herzberg ( points in the policy space, the winning candidate is
) theoretically pre required to implement the announced policy and
dicted and experimentally demonstrated that when a voters are paid an amount that decreases with the
single player holds veto power, that player’s ideal distance of the winning position from their ideal
point is the equilibrium. Haney et al. ( ) point. Candidates are paid only if they win. Once
empirically show committee choices the election is over another election is held with
converging to equilibrium when a weighted voting candidates free to change their previously
rule is used. Such a rule requires that a single player announced policy. Not surprisingly, all candidates
always be included in a coalition. These results are quickly adopt the position of the median voter when
representative of the kind of work that has they are fully informed about voter preferences.
dominated the experimental spatial committee Under incomplete information about voters.
agenda.
1798 Collective Choice Experiments
candidates also converge to the median voter’s altering such games, thereby producing an equi-
position, by responding to feedback about the vote librium that subjects choose. Often those institu-
share accruing to different policy positions, as in tional changes benefit one actor (for example, by
McKelvey and Ordeshook ( ). If candi assigning agenda control to a particular player). A
dates have policy preferences whereby their earn- third finding is that incomplete information does
ings depend not only on winning but also on not prevent convergence to equilibrium for either
implementing a policy close to their own preferred candidate platform choice or voter behaviour. The
position, then the median voter result no longer fourth finding returns to Arrow’s original insight:
holds (see the experimental results by Morton )• voting mechanisms can be
When voters are uninformed about candidate manipulated to achieve predictable, but very dif-
positions, are they able to cast accurate ballots? ferent, outcomes. It all depends on the mechanism
With minimal information, such as biased that is implemented.
endorsements or polls, subjects do very well at
inferring candidate positions. Lupia and McCubbins
( ) and Morton and Williams See Also
( ) consider various aspects of voter informa
tion and show that voters are able to quickly
determine the positions of candidates and cast their
vote accordingly.
Finally, several experiments have focused on ►
differing electoral mechanisms and what they mean ►
for the type of candidates that gain election. For ►
example Gerber et al. ( ) compare two
voting mechanisms in an experiment to test whether
one or the other disadvantages a racial or ethnic Bibliography
minority candidate. A form of cumulative voting (in
A r r o w , K . J . 1 9 6 3 . Social choice and individual values.
which voters can cast more than a single vote) leads New Haven: Yale University Press.
to more minority candidates being elected. This Berl, J.E., R.D. McKelvey, PC. Ordeshook, and M. D. Winer.
should be no surprise to collective choice theorists 1976. An experimental test of the core in a simple n-person
who have long noted that different electoral cooperative nonsidepayment game. Journal of Conflict
Resolution 20: 453-476. Bianco, W.T., M.S. Lynch, G.J.
mechanisms lead to predictable variation in Miller, and I. Sened. 2006. ‘A theory waiting to be discovered
outcomes. Cox ( ) offers an and used’: A reanalysis of canonical experiments on majority
extended discussion of such mechanisms. rule decision making. Journal of Politics 68: 837-850. Black,
D. 1948. On the rationale of group decision making.
Journal of Political Economy 5 6 : 2 2 - 3 4 .
What We Know Buchanan, J., and G. Tullock. 1962. Calculus of consent.
Ann Arbor: University of Michigan Press.
C o x , G . W . 1 9 9 7 . Making votes count: Strategic coordina-
Collective choice experiments provide several tion in the world s electoral systems. N e w Y o r k : C a m -
insights. First, when a Nash equilibrium of the bridge University Press.
underlying game exists it is a strong predictor of the Davis, O.A., and M.J. Hinich. 1966. A mathematical model of
policy formation in a democratic society. In Mathema tical
outcome of the experiment. The second finding is
applications in political science, ed. J.L. Bemd. Dallas, TX:
that when there is no Nash equilibrium for the Southern Methodist University. Fiorina, M.P., and C.R. Plott.
underlying game, subjects choose outcomes that 1978. Committee decisions under majority rule: an
cluster in predictable areas of the policy space, but experimental study. American Political Science Review 72:
575-598.
the process by which that occurs is not settled. At
Frechette, G., J.H. Kagel, and J.H. Lehrer. 2003. Bargaining
the same time, experimentalists have implemented in legislatures: An experimental investigation of open
institutional mechanisms versus closed amendment rules. American Political
Science Review 97: 221-232.
Collective Models of the Household 1799
Gerber, E.R., R.B. Morton, andT.A. Rietz. 1998. Minority and (b) the decision process results in Pareto-
representation in multimember districts. A m e r i c a n
efficient outcomes. The main results of the
P o l i t i c a l S c i e n c e R e v i e w 92: 127-144.
Haney, R, R. Herzberg, and R.K. Wilson. 1992. Advice and theory of collective models then refer to the
consent: Unitary actors, advisory models and empirical issue of deriving testable restrictions
experimental tests. J o u r n a l of Conflict on household behaviour and recovering from
R e s o l u t i o n 36: 603-633.
this some information on the structural model
Laing, J.D., and S. Olmsted. 1978. An experimental and game
theoretic study of committees. In G a m e T h e o r y a n d that can be used to carry out welfare compari-
P o l i t i c a l S c i e n c e , ed. P.C. Ordeshook. New York: sons at the individual level.
New York University Press.
Laing, J.D., andB. Slotznick. 1983. Winners, blockers, and the
status quo: Simple collective decision games and the core. Keywords
P u b l i c C h o i c e 40: 263-279. Collective models of the household; Exclusive
Lupia, A., and M.D. McCubbins. 1998. T h e d e m o c r a t i c goods; Household behaviour; Indirect utility
dilemma: Can citizens learn what they function; Pareto efficiency
n e e d t o h t o w ? Cambridge: Cambridge University
Press.
McKelvey, R.D., and PC. Ordeshook. 1985. Sequential
elections with limited information. A m e r i c a n JEL Classifications
J o u r n a l o f P o l i t i c a l S c i e n c e 29: 480-512. Dll
McKelvey, R.D., PC. Ordeshook, and M.D. Winer. 1978. The
competitive solution for n-person games withoutUntil recently ‘unitary’ models, which assume that
transferable utility with an application to competitive
household members act as if they maximize a
games. A m e r i c a n P o l i t i c a l S c i e n c e R e v i e w
72: 599-615.
unique utility function under a budget constraint,
were largely predominant in the literature on
Morton, R.B. 1993. Incomplete information and ideological
household behaviour. There is increasing agree-
explanations of platfomr divergence. A m e r i c a n
P o l i t i c a l S c i e n c e R e v i e w 87: 382-392.
ment, however, that economists cannot ignore the
Morton, R.B., and K.C. Williams. 2001. Learning by voting:
fact that most households are composed of several
Sequential choices in presidential primaries and other
individuals who take part in the decision process.
elections. Ann Arbor, MI: University of Michigan Press.
Consequently, the ‘collective’ models, which pos-
Plott, C.R. 1967. A notion of equilibrium and its possibility
under majority rule. A m e r i c a n Economic
tulate that (a ) each household member has specific,
R e v i e w 57: 787-806.
generally different preferences and (b) the decision
Plott, C.R., and M.E. Levine. 1978. A model of agenda
process results in Pareto-efficient outcomes, have
influence on committee decisions. A m e r i c a n
E c o n o m i c R e v i e w 68: 146-160. attracted considerable attention from the profession
Schofield, N. 1985. S o c i a l C h o i c e a n d D e m o c r a c y .
during recent years.
Heidelberg: Springer.
To examine the properties of collective models,
Wilson, R.K., andR.Q. Herzberg. 1987. Negative decision
let us consider a household consisting of two
powers and institutional equilibrium: Experiments on
persons, A and B, who make decisions about
blocking coalitions. T h e W e s t e r n P o l i t i c a l
Q u a r t e r l y 40: 593-609. consumption. These persons are characterized by
well-behaved utility functions of the form: M,(X^,
\B, X), where x, denotes a vector of private goods
consumed by member i and X a vector of public
Collective Models of the Household goods (i = A, B). This specification of preferences is
very general; it allows for altruism but also for
Olivier Donni externalities or any other preference interaction. We
denote the vector of prices for private goods by p,
the vector of prices for public goods by P and the
household total expenditure by y. Finally, we
Abstract suppose that there exists a vector of distribution
Collective models of the household are based on factors, that is, a set of exogenous variables which
two fundamental assumptions: (a) each agent is influence the intra-household
characterized by specific preferences
1800 Collective Models of the Household
allocation of resources without affecting prefer- There exist at least three different sets of
ences or the budget constraint. Examples are given testable restrictions that characterize household
by the respective contribution of each member to behaviour.
the exogenous household income, the state of the
marriage market or divorce legislation. These SRI Condition
variables, which are often assigned a crucial role in Browning and Chiappori ( ) and Chiappori
the derivation of the results, are denoted by s. and Ekeland ( ) show that household
To simplify notation, let n' = (p', P') be the demands compatible with Eq. ( ) have to satisfy the
vector of prices. Then, efficiency essentially means following condition:
that household behaviour can be described by the
maximization of a utilitarian social welfare s = E +Ri,
function, that is,
where E is a symmetric, semi-definite matrix
max f i ( n , y , s ) u A ( x A , x B , X ) and R! is a rank one matrix. The interpretation is the
xA,XB,X
following. For any given pair of utility functions,
+ (1 - f j . ( n , y , s ) ) u B ( x A , x B , X ) (1)
(a) the budget constraint determines the Pareto
frontier as a function of n and y and (b) the value of
subject to p' (X | + xB) + P'X= y. In this pro-
/i determines the location of the household
gramme, the function ji determines the location of
equilibrium on this frontier. Consequently, a change
the household equilibrium along the Pareto frontier.
in n implies a shift of the Pareto frontier. The latter
If q = 1, then the household behaves as though
entails the modification of household demands
member A always gets her way whereas, if /< = 0, it
described by X, However, the value of /i varies as
is as if member B is the effective dictator. We
well, hence the location of the equilibrium moves
denote the solutions to Eq. ( ) by x, (n, y s), xB (n, y s)
along the Pareto frontier. Since the frontier is of
and X(TT, y s).
dimension one, this effect is very restricted and
defined by Ri.
Proportionality Condition
Characterization
The particular structure of Eq. ( ) leads to further
The first objective of the theory of collective restrictions on behaviour. To make things simple,
models is to investigate the properties of the house- let us suppose that the vector of distribution factors
hold demands derived from Eq. ( ). These properties is twodimensional: s = (si, s2). Then, Bourguignon
can either be tested statistically or be imposed a et al. ( ) demonstrate the following result:
priori for simplifying the estimation task. From this
perspective, one crucial point is that individual =e
ds\ dsx ’
demands for private goods, xA and xB, are generally
unobservable by the outside econometrician; where 6 is a scalar. Thus, the response to differ-
demands for these goods are observed only at the ent distribution factors is co-linear. The interpreta-
household level, x xA + xB To be usefiil, the tion is that distribution factors can only change the
restrictions derived from the collective setting have location of the outcome on the frontier (through
thus to characterize household demands, x or X, function ji), and the latter is of dimension one.
instead of individual demands, xA and xB.
Let = (x', X') be the vector of household Specific Conditions
demands. We define the Pseudo-Slutsky matrix as The econometrician is often inclined to put more
follows: structure on preferences. For example, let us sup-
pose that agents have utility functions of the form:
«,(x„ X). In that case, we say that agents are
Collective Models of the Household 1801
‘egoistic’ in the sense that the utility does not This expression describes the level of welfare
depend on the partner’s consumption. This that member i attains in the household when he or
assumption implies, in particular, that the decision she faces the price-income bundled, v) and a set of
process can be decentralized. In a first step, house- distribution factors s. This representation of utility
hold members agree on the level of public goods as differs from the ‘unitary’ indirect utility function in
well as on a particular distribution of the residual that it implicitly includes the sharing function, and
expenditure between them. In a second step, they hence an outcome of the collective decision process.
maximize their utility function, taking into account However, the knowledge of Eq. ( ) is usually
the level of public goods and their own budget sufficient to evaluate the impact of economic
constraint. It means, formally, that there exists a policies on individual welfare.
pair of functions (pjp. X, y , s), pB(p, X, /, s)), In general, if agents are egoistic, the collective
satisfying pA + pB = y where y = y - P'X, such that indirect utility functions can be retrieved. None-
the demand for private goods by member i is the theless, the econometrician must observe the
solution to demand for some specific goods, referred to as
‘exclusive’, which benefit only one person in the
maxM;(x/, X) subject to p'x, = pr household. More precisely, we say that good X (x)
is exclusively consumed by member i if du/d X =
Hence, household demands for private goods, 0(iduj/dxj = 0) for j / i. The intuition is that the
conditionally on the demands for public goods, can household demand for ‘exclusive’ goods can be
be written as: used as an indicator of the distribution of bargaining
power within the household. Donni ( ) considers the
x = xA(p,X,p(p,X,y*,s)) case of purely private con
+ xB(p,X,y* - p(p, X , y* , s) ) , sumption (X = 0) and shows that, if there is a single
exclusive good, the collective indirect utility
where p = pA and y — p = pB. This structure functions can be identified up to composition by an
generates strong testable restrictions because the increasing transformation. Similarly, Chiappori and
same liinction p(p, X, y , s) enters each demand for Ekeland ( ) consider the oppo
private goods. Bourguignon, Browning and site case of purely public consumption (x = 0) and
Chiappori ( ) exphcitly derive these restrictions show that, if there are two exclusive goods (one for
under the form of partial differential equations, each member), the identification is still possible.
whereas Donni ( ) shows that the demands for However, the general case with both private and
public goods have a particular but different struc- public consumption has not been completely treated
ture, which implies testable restrictions as well. until now; see Blundell, Chiappori and Meghir (
) for a first investigation.
Welfare Analyses - Identification
Bibliographical Note
One of the main sources of interest in collective
models is to provide the theoretical background for The main idea of collective models can be traced
performing welfare comparisons at the individual back to Leuthold ( ), who estimates a model
level. The key concept in that case is what of household labour supply based on non-
Chiappori ( ) calls the ‘collective’ indirect cooperative game theory, where the individual is the
utility function. Let us suppose again that agents are basic decision-maker. However, this model differs
egoistic. If so, the collective indirect utility function from collective models in that the underlying
is defined as follows: decision process does not result in efficient
outcomes. It actually belongs to the family of
Vi(n,y, s) = Ui(xi(n,y,s),X(n,y,s)). ‘strategic’ models (which are sometimes
(2 )
1802 Collective Models of the Household
referred to as ‘collective’ models in a broad sense). Blundell, R., P.-A. Chiappori, and C. Meghir. 2005. Collective
labor supply with children. J o u r n a l o f P o l i t i c a l
Nevertheless, a significant advance towards the
E c o n o m y 113: 1277-1306.
development of collective models is made by Bourguignon, R, Browning, M., and P.-A. Chiappori.
Manser and Brown ( ) and McElroy 1995. The collective approach to household behaviour.
and Homey ( ) at the beginning of the 1980s. Working Paper. Paris: DELTA.
These authors study the properties of models based Bourguignon, R, M. Browning, P.-A. Chiappori, and V.
Lechene. 1993. Intrahousehold allocation of consumption:
on bargaining theory, which implies Pareto- A model and some evidence from French data. A n n u l e s
efficiency. In that case, the location along the d ' e c o n o m i c e t d e s t a t i s t i q u e 29: 137-156.
Pareto frontier is determined by the Nash (or Kalai- Browning, M., and P.-A. Chiappori. 1998. Efficient
Smorodinsky) solution. However, the first formal intrahousehold allocations: A general characterization
and empirical tests. E c o n o m e t r i c a 66: 1241-1278.
investigation of a model based on the sole Chiappori, P.-A. 1988. Rational household labor supply.
efficiency assumption is due to Chiappori ( , E c o n o m e t r i c a 56: 63-89.
) in the context of labour supply deci Chiappori, P.-A. 1992. Collective labor supply and welfare.
sions. This model is not explicitly examined in this J o u r n a l o f P o l i t i c a l E c o n o m v 100: 437-467.
Chiappori, P.-A. 1997. Introducing household production in
article because it can be seen as a particular case of collective models of labor supply. J o u r n a l o f P o l i t -
the model of consumption. Note, however, that i c a l E c o n o m y 105: 191-209.
Apps and Rees ( ), Chiappori ( ), Chiappori, P.-A., and I. Ekeland. 2003. The micro economics
Donni ( ), and Fong and Zhang ( ) pre of group behavior: Identification. Working paper.
Chicago: University of Chicago.
sent theoretical extensions of Chiappori’s initial Chiappori, P.-A., and I. Ekeland 2006. Characterizing group
model, whereas Chiappori, Fortin and Lacroix ( behavior. J o u r n a l of Economic Tlieoiy
) exhibit empirical results. Finally, we must (forthcoming).
mention that several authors have generalized col- Chiappori, P.-A., B. Fortin, and G. Lacroix. 2002. Marriage
market, divorce legislation, and household labor supply.
lective models to inter-temporal decisions and
J o u r n a l o f P o l i t i c a l E c o n o m y 110: 37-72.
uncertain environment. One of the most represen- Donni, O. 2003. Collective household labor supply: Non-
tative examples of these studies is given by participation and income taxation. J o u r n a l o f
Mazzocco ( ). P u b l i c E c o n o m i c s 87: 1179-1198.
Donni, O. 2004. The intrahousehold allocation of private and
public consumption:Theory and some evidence from U.S.
data. Working paper. Cergy-Pontoise: University of
See Also Cergy-Pontoise.
Donni, O. 2006. Collective consumption and welfare.
C a n a d i a n J o u r n a l o f E c o n o m i c s 39: 124-144.
► 'amity Decision Making
Fong, Y., and J. Zhang. 2001. The identification of
► Gender Roles an: E /ision c I >our unobservable independent and spousal leisure. J o u r n a l
► louseho o f P o l i t i c a l E c o n o m y 109: 191-202.
► lousehold Survf : Leuthold, J. 1968. An empirical study of formula transfers
and the work decision of the poor. J o u r n a l o f
► ndividualism
H u m a n R e s o u r c e s 1: 312-323.
► Integrability of Demand Manser, M., and M. Brown. 1980. Marriage and household
► Intrahousehold Welfare decision making: A bargaining analysis.
► Labe pply I n t e r n a t i o n a l E c o n o m i c R e v i e w 21:31-44.
Mazzocco, M. 2005. Household intertemporal behavior: a
► Rotten Kid Theorem
collective characterization and empirical tests. Working
paper. Madison: University of Wisconsin.
McElroy, M., and M. Homey. 1981. Nash bargained
Bibliography household decisions. I n t e r n a t i o n a l E c o n o m i c
R e v i e w 22: 333-349.
Apps, P., and R. Rees. 1997. Collective labor supply and
household production. J o u r n a l o f P o l i t i c a l
E c o n o m y 105: 178-190.
Collective Rationality 1803
Abstract
This article reviews the concepts of individual Collective Rationality and Social Choice
rationality and collective rationality as they
Of particular interest to the idea of collective
appear in the economics literature. In particular,
rationality is the study of social choice.
the existing literature on social choice and
In a seminal work, Arrow ( ) connects
aggregate demand points to a fundamental dis-
collective rationality to social choice through the
connect between these two notions of rational-
idea of the existence of a social welfare function.
ity. A possible reconciliation of this disconnect
Formally, consider a large set of conceivable alter-
is suggested.
natives, X, that a society faces. A preference
ordering R (weakly preferred) on X is a binary
Keywords relation on X that is both complete and transitive. Its
Aggregate demand; Arrow’s impossibility the- asymmetric and symmetric parts are denoted by P
orem; Collective choice; Collective rationality; (strictly preferred) and /(indifferent) respectively.
Debreu-Mantel-Sonnenschein theorem; There are n number of individuals in the society.
Gibbard-Satterthwaite impossibility theorem; Each individual i has a preference ordering Rj on the
Individual rationality; Prisoner’s dilemma; set X. A social welfare function (SWF), F, maps a
Rational choice; Sen, A.; Social choice; Social profile of individual preference orderings (R lv. .,/?„)
welfare function; Strategic behaviour to a preference ordering on X. The preference
ordering F{RX.. ,,R„) is then interpreted as the
society’s preference on X for the society consisting
JEL Classifications of individuals with preference orderings (Ru.. .,R„).
D02; D71; D72; D81; D82 If such an SWF exists, then the social choice to be
made from any set of feasible alternatives can be
Since ancient times, men have argued that choice
determined by comparing any pair of feasible
should be governed by ‘desire and reasoning
alternatives according to the society’s preference.
directed to some end’ (Sen ). Much modem
The social choice thus made is guided by a
economic theory is based on this rational choice
preference ordering to reach the best among feasible
principle paradigm. In an individual choice prob-
alternatives - collective rationality is achieved. In
lem, the individual is assumed to have a preference
other words, such an SWF, if it exists, is a
ordering on the set of alternatives. The individual
preference aggregation procedure aggregating
choice is rational if, for any given decision
individual preference orderings into a society’s
situation, the choice made is always the best among
preference ordering according to which a rational
all feasible alternatives according to the preference
choice can be made.
ordering. In a collective choice problem, be it that
In isolation, collective rationality is trivial to
of a society or a committee, the definition of this
reach because an SWF always exists. For example,
rational choice principle becomes problematic. As
take any preference ordering R on X.; the constant
there is presumably a huge disparity among the
function that maps every possible profile of
desires and ends of the individuals within the
collective, by whose desire
1804 Collective Rationality
individual preference orderings to R is an SWF. The second strand of literature directly exam-
Obviously, this SWF is not meaningful since no ines the formulation of collective rationality in the
information about individual preferences is definition of Arrow’s social welfare function.
reflected by society’s preferences. For an SWF to Arrow’s SWF requires society’s preferences to be
reasonably aggregate individual preferences, some orderings, that is, binary relations that are complete
minimal set of conditions should be imposed. and transitive. Suppose that we weaken collective
Arrow ( ) considers four conditions: U - rationality to requiring only that society’s
(universal domain: an SWF’s domain contains all preferences be, say, acyclic as opposed to fully
possible individual preference orderings), P (Pareto transitive. Can impossibility then be avoided? More
principle: if all individuals strictly prefer one alter- generally, is the strong collective rationality
native to another, then the society strictly prefers formulated by requiring society’s preferences to be
the first alternative to the second), I (independence orderings to blame for the impossibility? This line
of irrelevant alternatives: the way the society ranks of research concludes that, even with a weakened
a pair of alternatives should depend only on the way notion of collective rationality, the impossibility
individuals rank the same pair, not on how they remains (Sen ). Therefore, social choice cannot be
rank any other alternatives), and D - (non- expected to be collectively rational, even weakly.
dictatorship: no single individual always gets to
determine the society’s preference). He shows the Collective Rationality and Strategic
famous Arrows Impossibility Theorem'. It is Behaviour
impossible to have a social welfare function
satisfying U, P, I and D simultaneously. In other The aforementioned work implicitly assumes that
words, collective rationality is impossible to truthful individual preferences are aggregated. If,
achieve universally if society is to take into account instead, strategic behaviour is allowed, then even if
all individuals in a minimally reasonable way. we require a social choice function to be only non-
Arrow’s Impossibility Theorem jump-started the dictatorial (a social choice function maps a profile
modem day study of social choice. In the huge of individual preferences into an alternative - a
literature of social choice theory, two strands choice of the society), every such social choice
directly relate to collective rationality formulated in function can be manipulated. This is the Gibbard-
tire context of Arrow’s Impossibility Theorem. One Satterthwaite impossibility theorem (Gibbard ;
strand focuses on identifying domain restrictions so Satterthwaite ). That is, even if the collective makes
that social welfare functions satisfying Arrow’s up its mind about what is good for the society in a
three other conditions exist. For example, the SWF given circumstance, as long as individuals are free
which derives society’s preference from majority to report their preferences and the collective does
voting on each pair of alternatives (majority rale) not always choose the top alternative of a given
with universal domain will lead to many cycles in agent’s reported preference, then the collective’s
society’s preference, violating the transitivity goal cannot be achieved.
requirement of a preference ordering. However, if
individual preferences are restricted to those that are
Collective Rationality and Aggregate
single-peaked when alternatives can be represented
Demand
in one dimension, then majority rule will not
generate cycles and satisfies all other requirements The disconnection between collective rationality
of Arrow’s Theorem. In general, this strand of and individual rationality exists in other areas of
literature proves that collective rationality can be economics. In consumer demand theory, the
meaninglully restored for some restricted domains Debreu-Mantel-Sonnenschein theorem (Debreu
(Gaertner ). However, domain restrictions are
severe, and outside of them the problem of society’s
preference cycles is global (McKelvey )•
Collective Rationality 1805
; Mantel ; Sonnenschein ) states that generally The philosophical bases of individualism have
aggregate demand functions do not exhibit any many followers in economics. Binmore ( ,
regularity (such as being downward sloping p. 142) wrote: ‘Game theorists of the strict school
regarding price) even when all individual demand believe that their prescriptions for rational play in
functions are derived from rational decisions in the games can be deduced, in principle, from one-
sense of preference maximization under budget person rationality considerations without the need to
constraints. More specifically, for any given shape invent collective rationality criteria provided that
of the aggregate demand function (not necessarily sufficient information is assumed to be common
downward sloping), there exists a preference knowledge.’ Under the standard assumptions of
profile, one preference for each consumer, such that game theory accounting for individual interests,
the aggregate demand function is generated by the these game theorists will prescribe that players
individual demand functions derived from that defect in the Prisoner’s Dilemma game. Such play
preference profile. On the other hand, empirical leads to a Pareto-inferior outcome and thus is in
evidence suggests that aggregate demand functions conflict with the collective interest. This is not
often exhibit some regularity even when individual problematic if game theory is a normative theory
demand functions do not exhibit regular properties which prescribes what people should do rationally.
from preference maximization under budget However, as a predictive theory it fails to match
constraints (Kirman ). what people actually play in the Prisoner’s Dilemma
game. Experimental evidence shows rampant
cooperation among players of the Prisoner’s
Possible Reconciliation of Individual Dilemma game (Rapoport and Chammah ;
Rationality and Collective Rationality Ledyard ).
If we make the organic philosophical
The findings in social choice theory and demand assumption that a collective is an independent
theory suggest a fundamental separation between entity, then do we arbitrarily assume a criterion of
collective and individual rationality. On the one collective rationality? A more reasonable way of
hand, if individuals in a collective are rational, the thinking about a collective being as organic is,
collective choice is responsive to individuals, and perhaps, to consider that, in a collective, individuals
the collective power does not he in some proper become social creatures, not mere individuals, and
subset of the collective (democratic), then the col- as such their choices have social consequences that
lective choice cannot be ‘collectively rationalized’. they take into account. This can be modelled as
On the other hand, in some situations, collective individuals’ preferences over a given set of
choices can be ‘rationalized’ even when individuals alternatives changing depending on whether they
in the collective do not act as rational individuals. are individuals or members of a collective. How
This separation between collective and individual preferences are specifically influenced may reflect
rationality is not unlike Buchanan’s critique of culture, social convention or custom, so that they
Arrow’s formulation of collective rationality: are context- dependent. But whatever the cause, this
may create sufficient restrictions on the preference
We may adopt the philosophical bases of individu-
alism in which the individual is the only entity domain that collective rationality results as a
possessing ends or values. In this case no question of consequence of some aggregation procedure that is
social or collective rationality may be raised. democratic.
A social value scale as such simply does not exist.
Alternatively, we may adopt some variant of the
organic philosophical assumptions in which the
collectivity is an independent entity possessing its See Also
own value ordering. It is legitimate to test the ratio-
nality or irrationality of this entity only against this ► Arrow’s Theorem
value ordering. (Buchanan , p. 116) ► Ratic
► Ratic
► iocial Choice
1806 Collet, Clara Elizabeth (1860-1948)
1894c. Females and children, earnings of. In likely to make possible benefits to everybody as
Palgrave’s Dictionary of Political Economy, compared with a situation where firms and con-
London: Macmillan & Co., Vol. II. sumers are separated by an anonymous market. This
1898. The collection and utilization of official cannot be the case for the economy as a whole if it
statistics bearing on the extent and effects of the is Pareto efficient (e.g. the standard perfectly
industrial employment of women. Journal of the competitive economy without externalities) but it
Royal Statistical Society 61: 219-260. will still be the case that cooperative action by a
1899. Money wages of in-door domestic servants. group of agents within the economy will allow that
Miss Collet. Report. Parliamentary Papers 1899 group to gain at the expense of the rest of the
[C.9346] xcii. economy. The study of the problems faced by
1931. Domestic service. Chapter VIII in New colluding firms should hope to throw light on these
Survey of London Life and Labour, vol. II, ed. more general issues.
H. Llewellyn Smith. London: R S. King.
19 33 . Ap p en d i x t o The Private Letter Books of 1. Consider a well defined group of firms produc-
Joseph Collet, sometime Governor of Fort St. ing identical or similar products, the implication
George Madras, ed . H . H. D od w e l l . being that demand for the products is interre-
Lo nd on : Lo ng ma ns . lated. As a first step, assume that cost and
demand functions faced by each firm are com-
mon knowledge (every other firm in the group
knows these functions, every firm knows this,
and so on). If the firms meet together to collude
Collusion then some joint action will emerge. Cooperative
game theory concerns itself with this solution
Kevin Roberts but, for the present exercise, it is sufficient to
note two of the main determinants of the even-
tual solution. First, there are the outcomes made
possible by cooperative action. Assume profits
can be redistributed within the group-side-
Although collusive practices are not restricted to the payments can be made. Then if cooperative
economic relationships of a well-defined sub-group action fails to maximize joint profits all firms
in society, it is common to use the term collusion in can be made better off. Given the common
the context of cooperative activity between different knowledge assumption so that there is no
firms. With regard to the study of collusion, argument for inefficiency based upon the mis-
research has centred on the conditions most perceptions of firms, joint profit maximization,
conducive to collusion and, in both theoretical and with the group of firms acting like a multi-
empirical work on the operation of collusive product monopolist, should emerge.
arrangements (see Scherer , Chaps. 6 and 7). Side-payments may not always be possible.
If economic agents are self-interested maxi- For instance, collusive behaviour is outlawed in
mizers then, given that cooperation between a group many countries and, though it may be difficult to
of firms will almost certai nly make possible higher detect whether actions by firms are part of some
profits for each member of the group than is collusive arrangement, the transfer of money
possible without cooperation, there is a presumption between firms is much more likely to be capable
that collusive arrangements will be widespread. of detection. But without side- payments,
Thus it is important to understand why collusion actions which influence the size of joint profits
should fail to be universal. In fact, a more general also influence the distribution of those profits;
issue is also raised by this. Taking the argument one the consequence of this being that distributional
stage further, direct cooperation between a group of considerations will influence the actions chosen
firms and its consumers is by firms. For instance, the
1808 Collusion
cooperative Nash bargaining solution (Nash ) firms are better-off than the best they could
leads to actions which maximize the product of achieve under maximum retaliation from other
individual gains above some status quo position - firms. Here it should be noted that as, by
compared with joint-profit maximization, there is definition, a firm will be happy to choose its
movement towards the equalization of gains Nash strategy if all other firms do the same, the
above the status quo. This may be viewed as a Nash equilibrium is a feasible punishment
compromise between ignoring distributional solution. Thus there will always be an effective
considerations and the other extreme where only deterrent if all firms are better-off under collu-
distributional considerations count. sion than in the Nash equilibrium (Friedman )•
The second major determinant of the solution 2. Thus far, the story has assumed that there are no
reached will be what each firm can expect to information problems for the (potentially
achieve if it refuses to accept a particular collusive) group of firms. Exogenous uncer-
proposed collusive action for the group. The tainty faced by all firms is not a particular
status quo of the Nash bargaining solution may problem but when there are informational
be interpreted in this way. This is the most asymmetries between firms, the study of collu-
obvious component of ‘bargaining power’ for an sion is much richer. In fact, almost all the
agent. The requirement that firms must prefer the theoretical literature on collusion takes some
collusive action to what can be achieved by informational asymmetry as a starting point and
reneging, places restrictions on the collusive it is useful to survey some of this literature from
solution. Under the assumption that firms are the viewpoint of the informational asymmetry
interrelated only through the demand structure which is being postulated.
and that this interrelationship implies that the It is convenient to distinguish between two
goods produced by the collusive firms are forms of asymmetry - adverse selection where,
substitutes, then the worst that can happen to a for instance, some firms do not know the cost
firm if it refuses to accept a collusive arrange- and demand conditions of other firms, and moral
ment is that all other firms maximize their hazard where it is the behaviour of other firms
production and the reneging firm chooses pro- which cannot be observed. In the former case, it
duction to maximize profits in this hostile envi- is preferences that cannot be observed, in the
ronment. Given this scenario, there will usually latter case it is actions. Both forms of
be a large range of collusive actions which offer asymmetry can have important effects upon the
more to firms than can be achieved by reneging. structure of collusive arrangements that could be
However, it is not enough for firms to say that expected to emerge.
they will ‘punish’ a reneging firm in this way, 3. Taking the adverse selection case first, it is
there must be grounds on the part of the reneging fairly reasonable to assume that a firm will have
firm for believing that punishments will be a better knowledge of the demand and cost
carried out - it must be a credible threat in the conditions that it faces than other firms.
sense that if the firm reneges, other firms have an Although firms may be attracted by the sim-
incentive to punish the firm. The credibility plicity of adopting a collusive arrangement
restriction in this environment is captured by the based upon solely common information, this
so-called ‘perfect folk theorem’ of repeated solution ignores the efficiency gains that may be
games (see Rubinstein , for a published version achieved from making the collusive outcome
of the theorem) - firms called upon to punish will sensitive to the privately held information of
have the strongest incentive to punish if all other firms. Clearly, the main problem that arises is
firms punish that firm for not punishing in the that each individual firm must have the incentive
first place. This argument leads to the conclusion to reveal this private information. Consider a
that any collusive outcome can be maintained as simple model with just two firms, 1 and 2.
long as all Assume that the revenues they receive
Collusion 1809
when they produce outputs q\ and q2 are given by exact outcome chosen will be the result of
R\(q\, q2) and R2(q\, q2)- Each firm has a constant bargaining between the firms concerned. This
marginal cost of production and p2 bargaining process may involve the transfer of
and assume that this is private information, firm 1 information prior to collusive agreements being
knowing the true value of [1\, firm 2 knowing the reached (Roberts ).
hue value of [12. Assume, for simplicity, that [1 Recognition of the adverse selection problem
can take on only two values, [1 and [1, P < P- gives theoretical insight into the form of
Collusion will result in the adoption of output collusive practices that are discussed in the more
levels for each firm and the levels may be applied literature. In general discussions a
sensitive to the private information p2 and /?2 - distinction is drawn between implicit and
qdPi, Pi) and q2(pu Pi)- However, this can only be explicit collusion. However, in models without
implemented if a firm does not pretend to be a information problems, each firm is aware of the
high cost firm (p) when it is low cost (yij and vice agreement that would be decided upon if it met
versa; for firm 1, profit maximization implies with other firms and is aware how reneging
firms would be dealt with. In this case, there is
no need for firms to collude explicitly and the
distinction between implicit and explicit collu-
R\ \q\ (jf Pi), q 2 (p- Pi) ~ Pfh sion is not useful. But when adverse selection
exists, it is clear that a rule which makes other
( p - P i ) =R\[q\ (P,P2)- firms’ behaviour depend upon the private
<h(TPp2)] - Pq\ (P’Pi) information of some firm will require informa-
and
tion transmission between firms. The idea of
implicit collusion can be rationalized as a situ-
Ri[qi(PJi),q 2 (P,Pi)\ ~Pq\{P,Pi) ation where either no information is transmitted
^R i <7l (P’ Pi) ^2 (/?> - Pi)
Pqi (; P - Pi) or, to take a less extreme case, where
information is transmitted through aggregate
market-wide indices, e.g. the equilibrium price
This places restrictions on the class of solu-
in the industry.
tions that can be implemented. Combining the
The existence of adverse selection can also
inequalities gives provide a theoretical rationale for mark-up
pricing as a collusive outcome (Roberts ). When
(/-£) [<7I(/^2) — (P’ Pi) — 0 firms are selling in the same market, cost
conditions facing firms are more likely to be the
source of private information than demand
which implies that the lower the marginal cost,
conditions. While it may be impossible for other
the more a firm is allowed to produce. Given that
firms to observe the cost function of a firm, it
the collusive solution will usually entail a
may be possible to observe the level of costs at
restriction in output as compared with what a firm
the output being produced. As this observation
would like to choose (the other firm’s output can be used as a proxy for the private
remaining constant), the firm must be provided information of the individual firm, a collusive
with a disincentive from pretending to be a low- agreement will involve the output and price
cost firm when it is high-cost and this will come levels of firms being dependent upon cost levels
from variations in q2 - to provide the right - this provides a rationale for why the firms’
incentives to firm 1, the output of firm 2 should behaviour may be sensitive to average, rather
be negatively correlated with the marginal cost of than the conventional marginal, costs.
firm 1. 4. Over the last 20 years, most of the theoretical
The foregoing demonstrates that the literature on collusion has considered
implementability requirement gives some
structure to the collusive solution. But the
1810 Collusion
situations where the actions of firms fail to be Porter take the simple case of a homogeneous
perfectly observed by other firms - a situation of product being produced by similar firms. There
moral hazard. If the preferences of firms are is some uncertainty in demand so that a ‘low’
common knowledge then the problem is not one market price may be a result of this uncertainty
of deciding upon the collusive solution but, or of ‘cheating’ on the collusive agreement by
instead, of policing that solution. some firms. Green and Porter consider ‘trigger-
The simplest example of imperfect monitor- price’ punishment strategies which take the form
ing arises when there is a delay before the action of the group moving to the Cournot equilibrium
of a firm is observed by others. If time units are for a fixed time T if the market price drops
set equal to the delay time then a firm will gain below some level p. A feature of this
from reneging on a collusive agreement if informational set-up is that all firms suffer from
this punishment. The trigger-price strategies are
R p c
(i - p)n + pn > n set so that cheating does not occur though,
because of the uncertainty, a proportion of time
where ] |' is the profit per unit time under the
is spent in the punishment regime. There is a
collusive agreement, H7' is the maximum profit
direct trade-off between the gains from a col-
that the firm can achieve by reneging given that
lusive agreement that severely restricts output
other firms keep to the collusive agreement f| 7 is
and the costs of punishment that will be suffered
the profit the firm can achieve when it is being
in the maintenance of this agreement. Notice
punished by the other firms and [i is the discount
77
that as there is no adverse selection, there are no
factor for the firm. Obviously, If ^ II' so that
requirements for the firms to meet together to
the longer the delay before other firms perceive
decide upon collusive behaviour - it may be far-
reneging (the smaller the /?), the more incentive
fetched but there is nothing to rule out a system
a firm has to renege. As the degree of
of implicit collusion with trigger-price
punishment that can be inflicted is restricted by
strategies. For this reason, these moral hazard
the requirement that other firms have to punish,
models are often described as non-cooperative
and this incentive itself is diminished when there
models of collusion.
is a delay in observing behaviour, the smallest
5. The theory and practice of collusion are much
possible ] |r will rise as the delay time increases
discussed in texts on industrial organization. The
so reinforcing the incentive to renege. For a
foregoing has tried to make clear that the
detailed analysis of this problem, see Abreu ( ).
structure of information asymmetry in the mar-
The other main form of imperfect monitoring
ket is crucial for understanding the operation of
that has been considered deals with the case
collusive agreements. With a particular infor-
where individual firm behaviour is never
mational set-up, the structure of the set of col-
observable but market-wide aggregates can be
lusive agreements which will not entail reneging
observed by all firms. This was the situation
is now quite well understood. Despite much
which was studied by Stigler ( ) in his
work, there is rather less understanding of the
seminal paper on collusion and by many authors
exact agreement that will be settled upon.
since (a recent analysis is to be found in the
work of Porter , and Green and Porter ). In the
case studied by Stigler, firms observe the See Also
demand conditions for their own output and this
gives an indication of the prices that other firms ► A:."-...:
are charging. Green and ► Cooperative Equilibrium
► ...>r." e v
Colonialism 1811
The century between the 1820s and the outbreak formative process, a unique process in the history of
of the World War I saw the establishment of the mankind (Frankel ). Some other writers justified
modem colonial order. For during that period colonial mle on the ground that ‘Colonialism was a
European countries had achieved complete dom- necessary instrument of “modernization” which
inance over world trade, finance and shipping. On would help other peoples to do what they could not
the other hand, the political and military authority have done, or have done as well, by themselves’
of the European conquerors was backed by supe- (Fieldhouse , p. 43). At the other end of the
riority in technology, applied science, organization spectrum, radical theorists, notable among them
and information systems (Bagchi ). Walter Rodney, claim that under colonialism ‘the
Between the late 1870s and World War I (1914- only things that developed were dependency and
18), the colonial powers added to their possessions underdevelopment’ (Rodney , p. 256).
an average of about 240,000 square miles (620,000 One of the most articulate arguments put
sq.km.) a year, while during the first 75 years of the forward in defence of the colonial mle in under-
19th century the rate of increase in new territories developed areas is that of Lord Bauer, who contends
acquired by colonial powers averaged about 83,000 that:
square miles (215,000 sq.km.) a year. By the year
The colonial governments established law and order,
1914, the colonies extended over approximately 85 safeguarded private property and contractual
per cent of the surface of the globe. relations, organized basic transport and health ser-
Against this historical background, John Hicks vices, and introduced some modem financial and
legal institutions. This environment also promoted
establishes a useful distinction between two types of
the establishment or extension of external contracts,
colony: colonies of settlement and trading-post which in turn encouraged the inflow of external
colonies (Hicks , p. 51). A third type of colony was resources, notably administrative, commercial and
identified as ‘the plantation colonies'. In such case, technical skills as well as capital.
the colony which started as a colony of settlement These contacts also acquainted the population
with new wants, crops, commodities and methods of
was gradually transformed into a trading colony cultivation and opened new sources of supply of a
(ibid., p. 53). wide range of commodities. These changes
engendered a new outlook on material advance and
on the means of securing it: for good or evil these
The Colonization Debate contacts promoted the erosion of the traditional
values, objectives, attitudes and customs obstructing
material advance. (Bauer , p. 149)
While there is a strong connection between mer-
cantile expansion and colonization, it would be a This argument only confirms the deep-seated
mistake to emphasize the crude economic inter- Western biased view, claiming that material pro-
pretation of colonialism by narrowing down colo- gress and advance can only be achieved by eroding
nialism to the process of control of supplies of raw the traditional values, customs and production
materials, mineral resources and markets in under- structures of pre-capitalist and primitive societies.
developed and precapitalist regions. In fact, such a Rosa Luxemburg ( ) would see in Lord
narrow economistic approach eliminates a vital Bauer’s view an eloquent proof of her radical
aspect of colonialism relating to political activity contentions about colonialism and territorial
and the drive for dominance over the daily fives of expansion, by the emerging capitalist nations.
the people of the colonized regions (e.g. French But what is at issue is not the possibility or not
colonialism). of achieving material progress or advancement, but
Nonetheless, colonialism must be viewed, dia- the terms on which these transformations in the
lectically, as a complex phenomenon of capitalist material and socioeconomic structures were
expansion, operating in terms of time and space. To operated. From our viewpoint, what needs to be
illustrate this point, S.H. Frankel described such a stressed is the loss of sovereignty which the process
process as a disintegrating but also a of European colonization entailed for
Colonialism 1813
practically all colonized peoples. In Africa, for Hence, against all claims of the ‘Free-Trade’
instance, European colonizers often crushed, school, the British cotton industry did not rely on its
suppressed or amalgamated states at will. In most competitive superiority, but relied heavily on the
instances, the direct colonial rule was designed to monopolistic practices embodied in colonial trade-
direct and reorder the day-to-day lives of the regulations, and enforced by the British commercial
African peoples (Ajayi ). and naval supremacy (Hobsbawm , p. 58). On the
Seen in a radically different light, thinkers such other hand, the terms of trade between the colonized
as Albert Memmi, Jean-Paul Sartre and Franz areas and the metropolitan countries had a tendency
Fanon placed greater emphasis on the ideological to deteriorate steadily over time, so that the primary
implications and the sociopsychological conse- producers in colonized areas tended to obtain
quences of the process of colonization. According proportionally less with their labour than they could
to Fanon, colonialism tended not only to deprive a have done had they concentrated on producing food
society of its freedom and its wealth, but of its very or other subsistence crops for their own use or for
character, leaving its people intellectually and the home market (Fieldhouse , p. 78). This may be
morally disoriented (Fanon, English edition, )• characterized, in modem terminology, as ‘unequal
exchange’, which emphasizes once again the
exploitative nature of colonial trade.
Patterns of Colonial Trade
Historians tend to agree that the conquest of col- The Internal Control of the Colonial
onies was designed to the economic advantage of Economy
the European conquerors. Some historians (e.g. E.J.
Hobsbawm) would go as far as to claim that the The key to understanding colonialism as a historical
Industrial Revolution in England would not have phenomenon lies in analysing the mechanism of the
been accomplished without the conquest and internal control of the colonial economy. In this
penetration of ‘underdeveloped’ markets overseas connection, one has to answer two fundamental
(Hobsbawm , p. 54). questions. Why did the Western countries spend so
In fact, the primary aim of all European states much energy, blood and money in seeking to
was to use commercial regulations to maximize procure colonial possessions? What are the direct
their share of colonial trade in both directions and and indirect economic benefits of colonialism?
the profits they made from it. The English Navi- In the colonial economy, top priority was given
gation Acts, dating from the 1650s, may be taken as to infrastructure investment: railways, harbours,
typical in this respect. According to these Acts, all telegraphs, rivers and roads, since it was believed
colonial trade must be carried in British-owned and that these constitute the prerequisites of a modem
registered ships. All goods imported to the colonies economy, making it possible to link internal areas
must either be the product of Britain or be of production to the world commodity markets. In
transhipped and pay duty there. Any colonial the agricultural sector, it is still an open question
exports so ‘enumerated’ must be carried direct to a whether plantations, owned and run by foreigners,
British port in the first instance (Deane p. 204 ). The made any significant contribution to the develop-
aim of such rales and regulations was to give British ment of the colonial economy. On the positive side,
shipowners, merchants and manufacturers an they served as the main vehicle of introducing new
assured benefit from colonial commerce and to crops, attracting foreign capital, expanding the base
enable the government to tax colonial trade. This of the cash economy and the wage-labour force, and
clearly indicates the close association between the increasing agricultural productivity. On the negative
process of colonization and the rise of various side, the crops of such plantations were subject to
foreign-trade monopolies held by the charted severe
colonial companies.
1814 Colonialism
subordinate place within the imperialist system. Frankel, S.H. 1953. The economic impact of colonialism on
This is maintained and sustained by means of under-developed societies. Oxford: Basil Blackwell.
Furtado, C. 1970. E c o n o m i c d e v e l o p m e n t o f
chronic and structural balance of payments diffi- L a t i n A m e r i c a . London: Cambridge University
culties, arising from the trade, aid and investment Press.
relationships with their former or new metropolitan Hicks, J. 1969. A t h e o r y o f e c o n o m i c h i s t o r y .
countries (Warren , p. 35). Oxford: Clarendon.
Hobsbawm, E. 1968. I n d u s t r y a n d e m p i r e . London:
In sum colonialism may be seen in a historical Weidenfeld & Nicolson.
perspective as one decisive and dramatic stage in Luxemburg, R. 1913. T h e a c c u m u l a t i o n o f
the evolution of international economic relation- c a p i t a l . London: Routledge & Kegan Paul. 1951.
ships. The establishment of colonial rule constituted Magdoff, H. 1970. Is imperialism really necessary?
M o n t h l y R e v i e w , Part I, 22(5): 1-14; Part II, 22(6):
an arbitrary break in the normal course of history, 1-11.
splitting up regions and creating new artificial Rodney, W. 1972. How Europe underdeveloped Africa.
entities, transplanting new alien values and London: Bogle.
institutions into colonized societies. One may Warren, B. 1973. Imperialism and capitalist industrialization.
N e w L e f t R e v i e w (81):3—44.
finally wonder whether it would not have been
Warren, B. 1980. I m p e r i a l i s m : P i o n e e r o f
better for the people of the colonized regions to
remain autonomous until certain indigenous forces
could gain momentum and generate new conditions
for socioeconomic development and material Colonies
progress.
Donald Winch
See Also
The economic advantages and disadvantages of
colonies, the best means of establishing them and
ensuring their development, and the principles that
should govern trade and other relations with the
Bibliography mother country, have persistently served as fertile
Ajayi, J.F. 1969. Colonialism: An episode in African history.
topics for policy and theoretical debate in the
In Colonialism in Africa 1870-1960, vol. 1, ed. L.H. Gann history of political economy. The treatment given
and P. Duignan. London: Cambridge University Press. here will be confined to the British debate on
Alavi, H. 1975. India and the colonial mode of production. In colonies from the late eighteenth to the first decades
The socialist register 1975, ed. R. Miliband and J. Saville.
London: Merlin Press.
of the twentieth century.
Bagchi, A.K. 1982. The political economy of underdevel- The British empire was composed of colonies
opment. London: Cambridge University Press. and ex-colonies which had differing histories of
Baran, P. 1957. The political economy of growth. New York: acquisition and varying political and economic
Monthly Review Press.
relationships with the mother country. It follows
Bauer, P.T. 1976. Dissent on development. London:
Weidenfeld & Nicolson. that the problems which they posed were equally
Booth, D. 1985. Marxism and development sociology: diverse, as illustrated by the differences between the
Interpreting the impasse. World Development 13(7): 761- economies of the British West Indies before and
787. "
after slavery was abolished, the question of public
Deane, P. 1965. The first industrial revolution. London:
Cambridge University Press. land disposal and emigration to Canada, Australia
Encyclopaedia Britannica. 1768. 5th ed., vol. 4, 1977. and New Zealand, and the tasks of administering an
Fanon, F. 1966. The wretched of the earth. New York: Groove Indian sub-continent with a largely peasant
Press.
population living close to minimum subsistence
Fieldhouse, D.K. 1981. Colonialism, 1870-1945: An
introduction. London: Weidenfeld & Nicolson. levels. To this list can be added the problems of
integrating Scotland and Ireland
1816 Colonies
into the English economy and polity after the as a civil servant was within the India Office, and
respective Acts of Union in 1707 and 1808, where whose first major economic work was a treatise on
‘colonial’ issues - in a technical rather than emotive Indian Currency and Finance (1913) - a work which
sense - were often at stake, even if the teim was not attempted to do for its day what Sir James Steuart
used to describe them. Indeed, Ireland and India as had done when he wrote The Principles of Money
subsistence farming economies posed similar Applied to the Present State ofthe Coin of Bengal in
problems to British administrators, despite major 1772.
differences in their cultural backgrounds and A treatment based on chronology and recurring
political staUis within the empire. For that matter, themes seems the best way of dealing with the
even the United States after independence could for diversity of Colonies as a topic of economic interest,
some purposes be treated as having a 'colonial' though it should be remembered that colonies and
relationship with Britain, largely because it empire was never treated solely as economic
remained a major outlet for British capital and problems, even after the inauguration, largely under
labour. Marshall’s auspices, of a measure of professional
The sheer magnftude of the problems of empire distance in these matters.
and their changing nature over more than two The initial and longest period in the history of
cenUtries would guarantee that they bulked large in colonial policy began in England during Elizabethan
the minds of British economists. A few strategic times and effectively ended with the dismantling of
examples will show that there has always been a what had become known as the ‘old colonial
fairly intimate relationship between economics, system’ in the 1820s. This system was loosely based
economists, and empire. Adam Smith may well on the amorphous doctrines which Adam Smith
have advised the imposition of the Townshend subjected to attack in the Wealth of Nations as a
duties on North America in 1763, and he was central part of his condemnation of the ‘mercantile
certainly involved in advising the British govern- system’ (later known as Mercantilism). During this
ment on the consequences of American breakaway mercantile period colonies generated a large body of
when these earlier attempts to exert fiscal control literature which reflected the overwhelming concern
led to successful revolt. Malthus held the first Chair with national power and economic self-sufficiency
of political economy in Britain at an educational as the prime objectives of state intervention. Thus
institution at Haileybury established to train the colonies not only served direct strategic purposes as
servants of the East Indian Company; and James naval or military bases, they were also treated as
and John Stuart Mill together devoted nearly 40 sources of precious metals, and of strategic and
years to the service ofthe Company. The younger other raw materials necessary to Britain’s early
Mill was also a consistent supporter of schemes manufacturing industries - of particular value when
involving the ‘systematic colonization’ of Australia carried in British ships and bought through
and New Zealand, as well as taking a major interest, monopolistic arrangements at prices lower than
along with most of his classical predecessors and could be obtained on the world market. Colonies
successors, in the problems of the Irish economy. were variously regarded as protected export
The controversy over imperial preference at the turn markets, outlets for surplus population, and sources
of the twentieth century underlined the gulf that of tribute, in addition to serving occasionally as
existed between historical or institutionalist prison settlements. Although the development of
economists and their more orthodox opponents, led free trade doctrines, together with associated
Alfred Marshall, who believed that the increasing monetary ideas on specie-flow mechanisms,
challenge to Britain’s industrial hegemony did not eventually succeeded in undermining many of the
justify abandonment of deductive methods of arguments in favour of colonial possessions and
economic reasoning or those cosmopolitan free regulations, economic nationalism and neo-
trade principles which had spurred British mercantilistic ideas and policies have always
prosperity earlier. Finally, of course, there is exerted a powerful attraction, especially in countries
Keynes, whose first employment that were
Colonies 1817
industrial later-comers or anxious to overcome the post-colonial underdevelopment in the 1960s and
problems of underdevelopment by means of import- 1970s.
substitution and/or export promotion. Smith’s extensive treatment of colonies in the
During the eighteenth century, the established Wealth of Nations (especially Book IV, chapter 7)
wisdom on the subject of colonies came under became the locus classicus of the anti-mercantile
question largely because the supposed benefits of position, where much of his discussion was inter-
trade controls to the mother country were connected woven with an account of the founding of the
with the growing cost to Britain of defending and European colonies which brought matters up to the
governing her North American colonies. Among the present, namely to the issues underlying Britain’s
earliest critics of the colonial system from this point dispute with its American colonies. Whereas
of view was Josiah Tucker, who argued that Edmund Burke had advocated the relaxation of
existing benefits in the form of export markets and trade controls and taxation as a means of preserving
imported goods would accrue to Britain under free the political status quo, Smith maintained that
trade, and without the attendant military and Britain’s pretensions to empire would remain those
economic burdens of empire. Provoked by David of a shortsighted shopkeeper unless some system
Hume’s essays on commerce and money, Tucker could be found whereby the debts and current
also engaged in an important dispute with Hume on burdens of empire could be shared by the colonies
the question of whether trade relations between rich themselves; and he emphasized the point by closing
and poor countries could be considered as the Wealth of Nations with a warning about the
equalizing or not, and if so, by what process - potential long-term effects on British growth
perhaps through rising labour costs and prices in the prospects of existing arrangements. Hence the
richer trading partner, or through some other elaborate scheme he advanced for an imperial
mechanisms of stimulus and emulation in the poorer (Anglo-American) free trade zone, with provision
country. The dispute was of relevance to free-trade for complete fiscal harmonization and legislative
relations between England and Scotland after union. However, since he regarded this proposal as
Union, and of potential relevance to Britain’s utopian, its purpose was chiefly to underline the
relations with Ireland and other ‘colonies’, whether precise conditions under which the burdens of
acknowledged as such or not. Indeed the Hume- empire could be made acceptable. Much the same
Tucker debate was an early example of recognition result could be achieved through free trade and a
of the essential similarities between international treaty of friendship, without provision for imperial
and interregional trade in a world in which government.
currencies were linked through the gold standard. It Smith’s close dissection of the various gains and
also concerned relative rates of growth and the losses involved in maintaining the monopoly of
respective merits of agriculture and manufacturing colonial trade employed a quasi-mercantilist idea of
as the basis for a nation’s wealth and prospects for ‘vent for surplus’, as well as other arguments about
economic development. Would those who had the the effects on profits of colonial markets which
advantages of an early start acquire world dominion could not be squared with later Ricardian orthodoxy
and monopoly; and hence would poorer and later on the doctrine of comparative costs, capital
starters in the race be forced to employ protective accumulation, Say’s Law, and the permanent causes
measures to establish and maintain their infant of declining profits. Ricardo also pointed out
industries? If noticed, the debate would have circumstances in which it was possible for the
foreshadowed later issues raised by Friedrich List mother country to so regulate the trade of a colony
and Henry Carey with Germany and the United as to make it less beneficial to the colony, and more
States in mind, as well as other questions such as advantageous to the mother country than free trade -
‘free trade imperialism’, the ‘permanence’ of the an early version of the terms-of-trade argument for
dollar problem after World War II, and the debate tariffs (not meant for use) which in the hands of
on ‘dependency’ and the development of Robert Torrens was to
1818 Colonies
blossom into a case for an imperial Zollverein a few restricting access to land. This could be achieved by
years later. setting a price on land sufficient to delay dispersal
With the support of most political economists, of the wage-labour force, and by using the proceeds
however, the system of colonial preferences was of land sale for the purpose of bringing in new
gradually and unilaterally dismantled, beginning immigrants. This policy meant that public land
with the efforts of Huskisson and Robinson in the disposal and immigration had to remain an imperial
1820s. The views of special interest groups, espe- rather than piuely local concern, thereby creating
cially those connected with shipping and the West scope for conflict when colonies achieved self-
Indies, which Smith had expected to prevail, were government. The Wakefield policy came to be seen
outflanked by an uncertain combination of intel- as a symbol of imperial oppression, an attempt to
lectual argument, political opportunism, and a place colonial development within a straitjacket
general realization that Britain’s industrial domi- designed with European conditions in mind. It also
nance meant that mutual restrictions merely entailed loss of freedom in disposing of one of the
restricted the dominant partner without conferring main sources of revenue available to self-governing
equivalent benefit. A similar combination involving colonies. The colonial reformers’ hopes of
humanitarian arguments prevailed on the related establishing an empire in which free trade ruled
matter of West Indian slavery and the slave trade. were another casualty of self-government when it
The keystone of Britain’s rather isolated status as a led to tariffs being raised by Canada and Australia
free-trading nation was installed with the abolition against British and other goods.
of the Com Laws in 1846, a policy that had What now seems remarkable is the rapidity' and
considerable long-term significance for British extent of influence exerted over British colonial
agriculture and Britain’s relationship to colonial policy by Wakefield's untried theories, though
suppliers of food and raw materials, including the modern development economics may yield com-
United States as well as colonies of recent parable examples. He was also highly successful in
settlement. convincing a number of leading political econ-
During the 1830s and 1840s public debate on omists, not least John Stuart Mill, that his ideas
colonies and colonization was dominated by the deserved to form the basis for firture policy. Mill
activities of Edward Gibbon Wakefield and the gave prominence to Wakefield's ideas in his Prin-
colonial reformers, a group of radicals dedicated to ciples of Political Economy and other writings by
the revival of‘the lost art of colonization’. Their consistently championing 'systematic colonization’
programme entailed the creation of self-governing as a solution to Britain's population difficulties; and
colonies as outlets for Britain's surplus capital and by treating its application to new countries as a
labour, avoiding the evils of simply ‘shovelling out valid exception to the general principle of laissez
paupers’, abolishing penal settlements, and creating faire, namely as a case where the self-interest
‘civilized' communities enjoying the benefits of free principle acting under competitive conditions would
trade and high rates of growth. Wakefield’s lead to a sub-optimal result as far as the community
diagnosis of the simultaneous existence in Britain of was concerned. As part of his general modification
surplus capital and labour, and the consequent need of Ricardo’s assumptions concerning capital
for new fields of employment abroad, was scarcity and the distant prospect of the stationaiy
developed in opposition to Ricardian orthodoxy on state. Mill also endorsed the conclusions of
the wage fund and Say’s Law. His ideas on the Wakefield’s heterodox diagnosis of Britain's
optimal economic development of colonies also economic condition, while denying that it was in
conflicted with Smith's view that countries of conflict with Say’s Law and other received
European settlement enjoying an abundance of land Ricardian doctrines. By acknowledging the
were likely to make rapid economic progress. The importance to Britain of the export of capital and
key to high rates of growth lay in achieving the labour to colonies, Mill not only removed an
correct balance between capital, labour, and their obstacle to support for colonization, and hence to
'field of employment’ by
Colonies 1819
the extension of empire, he opened up a major much of it financed by exports of textiles that
exception to the comparative cost doctrine as an competed with domestic industry: Indian commerce
interpretation of Britain’s trading pattern: the trade came in conflict with British manufacturing
with colonies now became akin to interregional interests, which were placated by the imposition of
trade. It should also be noted that Mill, confirming duties on Indian imports. But with the
the tradition that the only new economic arguments reorganization of the Company, especially after
for protection have been advanced by those who 1813, came new priorities and opportunities for
favour free trade as a general rale, gave a cautious those with ambitions to bring the light of post-
endorsement to the infant industry case for tariffs. Smithian, and more especially, Ricardian political
India was never a colony in the same sense that economy to bear on the problems of Indian
North America, Australia and New Zealand were administration. Such a task proved highly congenial
British colonies. Attempts were often made to to James Mill, a critic of the Company’s monopoly
prevent or discourage European colonization before powers who was appointed by the Company in 1819
1830, and until 1858 India was governed by the and rose to the rank of Chief Examiner in charge of
East India Company acting on a renewable Charter political, judicial, and fiscal correspondence with
granted by Parliament. The Company had been India. It was largely through Mill’s efforts, later
subjected to closer government control in the late endorsed by his son, John Stuart Mill, that the
eighteenth century, deprived of its commercial Ricardian rent doctrine came to play such a large
monopoly in 1813, and finally ceased trading part in the conduct of Indian affairs. It provided the
altogether in 1833 when it lost its exclusive priv- basis for the ryotwari system of land tenure,
ileges over the China trade. These developments whereby the state became the sole landlord and met
represented another victory for the forces of free its revenue needs by levying ryots or peasant-
trade, and they blunted the force of Smith’s criti- farmers according to Ricardian principles of pure
cisms both of monopolies and government by rent. By confining the state’s exactions to rent it
trading companies. As we have seen in the case of was thought that the peasant farmer would enjoy
Tucker, free trade ideas could be associated with a normal profits and wages, and the state would elim-
case for complete ‘emancipation’ (Bentham’s term) inate an intermediary or landowning class of
of all colonies. (It was an association of free trade zemindars or rent-receivers. The system embodied
with anti-imperialism which was an invitation for action according to a clear analytical proposition,
revisionist historians to counter with a neat, perhaps antagonism to rent as a form of private income, and
over-neat, inversion by drawing attention to ‘free- a view of the prospects for Indian economic
trade imperialism’.) But there were fewer development that treated the peasant proprietor as a
spokesman for such ideas in relation to India, where capitalistic entrepreneur, freed from the arbitrary
other notions of European superiority and exactions of landowners and responding to market
responsibility held sway, along with more mundane incentives - which is not to say that the application
considerations connected with the retention of of these Western economic ideas to Indian
investment, employment, and trading opportunities. conditions was any more successfiil than the
India had provided Smith with a prime example zemindari alternative.
of a stationary or declining state, something that In Ireland, where similar economic conditions of
could either be attributed to the deficiencies of its a growing population dependent on a backward
system of government and taxation, or to those of a agriculture obtained, it proved more difficult to
backward people whose culture constituted a barrier bypass the Irish landowner in order to grant the kind
to economic progress, though usually to a of security of tenure to the peasant proprietor that
combination of both. In addition, criticism from either existed or was the aim of administrators in
divergent quarters was made of the flow of tribute India. Indeed, the initial view of economists during
leaving India for Britain, the early classical period was unsympathetic to the
preservation of peasant
1820 Colonies
proprietorship in Ireland. The favoured solution was materials in return for similar preferences in the
consolidation of tenant-holdings as a preliminary to markets of her ex-colonies. There had already been
the creation of a capitalistic form of farming a revival of interest in imperial federation, which
employing agricultural wage-labour along English could be portrayed, as it was by John Shield
lines, together with emigration or absorption of the Nicholson, as a return to Adam Smith’s project of
displaced population into alternative employment. empire. While much of this belongs to the larger
Before, but especially after the Great Famine of subject of imperialism, a term which has always
1846, emigration, largely unplanned, was the only carried more nationalistic and ideological oxygen,
part of this programme that operated. Under the the episode is chiefly of interest here because it was
leadership of John Stuart Mill, J.E. Caimes, W.T. the occasion for a major challenge to economic
Thornton and Henry Fawcett, the earlier diagnoses orthodoxy. Chamberlain’s use of arguments
and remedies were entirely recast; a more positive supplied by such economists as W.A.S. Hewins
evaluation of the possibilities of transforming brought Marshall into the professional and political
cottier tenants into peasant proprietors enjoying fray with his Memorandum on Fiscal Policy of
security of tenure was registered and advocated as International Trade (1908), a work which is still
the basis for a solution to Irish problems. It was a perhaps the best brief restatement of the free trade
position that ran directly counter to English position based on a combination of neoclassical
property ideas and involved recognition of the role trade theory and an empirical analysis of
of custom as opposed to contract in designing contemporary conditions in the colonies as well as
policies and institutions for societies that did not in Britain.
conform to the English model. Marshall’s pupil, John Maynard Keynes, was
Mill’s deployment of a more relativist approach not as impressed by his master's memorandum when
in policy matters was later to be seen as a welcome, he looked back on it from the vantage point of 1930.
though incomplete concession to a succeeding At this time Keynes had decided that a revenue
generation of more full-blooded historical and tariff was an acceptable policy for Britain to follow,
institutionalist critics of deductive economic theory, though not for reasons connected with imperial
with its built-in bias in favour of rational economic solidarity. Thus when Neville Chamberlain
man - a creature originally invoked by Mill to achieved his father’s goal with the passage of the
underline the contrast with societies where custom Import Duties Act in 1932, followed by the Ottawa
prevailed. Such critics were more numerous and Agreements which began the period of imperial
vocal after 1870, and the resulting split within the preference, Keynes withdrew his support for tariffs.
economists’ ranks coincided with a campaign for Nevertheless, any account of the revolution
‘fair trade’ in the 1880s which blossomed into associated with Keynes’s name is likely to be
Joseph Chamberlain’s scheme of tariff reform along incomplete without some reference to the ending of
imperial preference lines - an unwitting return to free trade in Britain, coupled as it was with the inau-
Torrens’s imperial Zollverein. The historical guration of an era in which external monetary
economists, led by William Cunningham and W.J. constraints on British domestic policy were weak-
Ashley, had already followed Schmoller and other ened. Of more long-term significance to imperial
German exemplars in according a more positive policy in the interwar period, however, was the
valuation to Mercantilism, and this presaged their revival of interest in state-assisted settlement in the
endorsement of tariff reform as an imperial remedy white dominions and the new emphasis on colonial
for Britain’s declining competitiveness. At the price development, with Africa as well as India now
of forsaking free trade, Britain could offer assuming a larger role in official, if not
preferential treatment to imperial food and raw professional, economic thinking. At this point
colonies and colonial policy become
Colquhoun, Patrick (1745-1820) 1821
something else, the beginnings of modem devel- Colquhoun was bom in Dundee. A successful early
opment economics. career in business led to the position of Lord
Provost of Glasgow in 1782 and 1783. In 1789
See Also Colquhoun moved to London and became active as
a magistrate. He worked on the provision of poor
relief and put forward plans for the reform of
London’s police. He died in London in 1820.
Colquhoun’s interest in poor relief led to his
New and Appropriate System of Education for the
Labouring People (1806), a pamphlet based on his
own experience of running a school in Westminster.
Bibliography Like Thomas Chalmers later, he argued for the
necessity of education to raise the standards and
Ambirajan, S. 1978. C l a s s i c a l p o l i t i c a l e c o n o m y aspirations of the poor, though primarily in order to
a n d B r i t i s h p o l i c y i n I n d i a . Cambridge: curb vice rather than population. This, he believed,
Cambridge University Press.
Barber, W.J. 1975. British economic thought and India, 1600-
was the most cost-effective way of tackling poverty.
1858. Oxford: Clarendon. His Wealth, Power and Resources of the British
Black, R.D.C. 1960. Economic thought and the Irish question, Empire (1814), his last important work, is the one
1817-1870. Cambridge: Cambridge University for which he is best known. This contained detailed
Press.
Black, R.D.C. 1968. Economic policy in Ireland and India in
figures on incomes and occupations and the relative
the time of J.S. Mill. E c o n o m i c H i s t o i y R e v i e w importance of agriculture and manufacturing in
21: 321-336. Great Britain and Ireland. He also included a history
Drummond, I.M. 1974. I m p e r i a l e c o n o m i c p o l i c y , of the public revenue, and descriptive material on
1 9 1 7 - 1 9 3 8 . London: Allen & Unwin.
the colonies.
Rnorr, K. 1944. British colonial theories, 1570-1850.
Toronto: Toronto University Press. The work was not very securely based;
Semmel, B. 1970. The rise of free trade imperialism. McCulloch, who had first-hand experience of trying
Cambridge: Cambridge University Press. to construct large-scale statistical data for his
Stokes, E. 1959. The English Utilitarians and India, Oxford:
Commercial Dictionary, was severely critical of it
Clarendon.
Winch, D. 1965. Classical political economy and colonies. in the Edinburgh Review and in Brande s Dictio-
London: Bell & Sons. nary. But it was followed by later writers and
Wood, J.C. 1983. B r i t i s h e c o n o m i s t s a n d t h e Colquhoun’s estimate that unproductive labour, one
e m p i r e . London: Groom Helm.
fifth of the total, received one third of output was
widely quoted.
Selected Works
Combined labour action in the 18th century was industries was free from industrial conflict (Rule
widespread although by no means was it all directed , ch. 6).
by or channelled through formal organizations. The The London journeymen tailors were the most
Webbs ( , eh. T) were in error in effective combination in Britain during the 18th
insisting that only those associations that were century. Their organization seems to have been
formally constituted and in continuous existence established around 1700 as the result of the coming
should properly be counted among the early trade together of five ‘box’ clubs: box clubs being a
unions. In the 18th century, especially, it was the version of the friendly society. In order to function
collective presence of workers that was the crucial effectively the clubs were associations in contin-
determinant of industrial response. Formal orga- uous existence, and their rules and regulations later
nization was not a necessary condition of industrial provided the basis for the discipline of a trade
militancy, and ‘collective bargaining by riot’ is a union. The meeting places of the box clubs were
well-documented phenomenon (Hobsbawm , ch. 2). public houses recognized by the trade and known as
In 1718 and 1724 West of England clothiers the ‘house of call’. The box club and the house of
complained to Parliament that weavers had call were the type of organization common to all the
‘threatened to pull down their houses and bum their trades that succeeded in establishing more or less
work unless they would agree to their terms’. A continuous associations. The tailors’ combination
study of Lancashire textile workers (Turner ) first came into prominence as the result of a petition
emphasized that the essence of 18th-century presented to the House of Commons in 1721, which
unionism was the persistence of collective pressures asserted that 15,000 London journeymen had
which in given circumstances encouraged collective entered into a combination and engaged upon a
action. Associations among the Lancashire textile strike. The report of a House of Commons
workers developed informally out of the committee was followed by an Act (7 Geo 1, c. 13)
occupational life-style within the community. A which fixed wages by the day for summer and
settled group would encourage habits of association winter, and which also prohibited combinations.
and common action, and skill was always a While the Act was going through Parliament the
consolidating factor, but there was also much journeymen briefed counsel, at a reported cost of
activity among the unskilled and migrant groups. £700.
An incomplete listing of recorded industrial Legislation against combinations in specified
disputes gives a total of 383 for the whole of Britain trades was common during the next hundred years;
between 1717 and 1800 (Dobson , ch. 1). Most of examples affect the wool trade, (12 Geo, I, c. 34),
these were in England, and of the English figure of hatters (22 Geo II, c. 27), silk weavers (17 Geo III,
333 disputes just over a third (120) occurred in c. 55), and the paper trade (36 Geo III, c. 111). At
London, the main centre of the artisan trades in the the end of the century, in 1799, there was passed a
18th century. The occupational breakdown for the general Act (to be repealed and replaced the next
whole country shows 64 incidents from among the year by 40 Geo HI, c. 60) which made illegal all
textile workers, mostly in the wool industry; only combinations whose purpose was obtaining an
seamen and ships-carpenters (each with 37) and advance in wages or the lessening in the number of
tailors (with 22) exceed a total of twenty. Those hours worked; and for the next quarter of a century
between ten and twenty include coalminers, the Combination Laws were in force, although in
workers in the shipbuilding industries, textile respect of certain trades they were not always
workers other than those in wool or silk, rigorously applied (Webb and Webb , ch. 2). In
shoemakers, and most trades in building. The range 1824 all previous statutes in respect of
of additional trades for which some disputes were combinations were repealed but this Act was in turn
listed is considerable, and hardly any occupation repealed and replaced the following year by 6 Geo
except the service IV, c. 129, which permitted combinations
Combination 1825
on strictly defined terms and listed punishments for his views in an article on ‘Combination’ in the 8th
the use or the threatened use of intimidation, edition of the Encyclopaedia Britannica of 1854.
molestation or violence in the pursuit of the The character of trade unionism changed in
declared objects of the industrial action. certain respects during the second quarter of the
The ending of the complete legal prohibition of 19th century as a result of the influence of Owenism
combinations was largely due to a growing appre- and the ideas of the anti-capitalist theorists such as
ciation of the requirements of a labour market in the Hodgskin, Gray and Thompson (wrongly
period of early industrialization. It was also in part designated as the ‘Ricardian socialists’). It is
the result of adroit political pressure by Francis important not to exaggerate the nature of the change
Place and, inside Westminster, Joseph Hume. The since unions remained what they had always been:
most weighty supporter of those who accepted the defensive-offensive bodies
demand that workingmen’s combinations should be concerned with the betterment of their members.
free of legal restraint was J.R. McCulloch, who But in the decade after the legalization of combi-
spoke in the name of orthodox political economy. nations for peaceful agitation there was a notable
McCulloch argued that the Combination Laws were growth of cooperative organizations and some
unjust in that employers and their workmen were support for Owen’s communitarian ideas. Radical
not put on the same level; they were dangerous political economy worked in the same direction.
because they engendered contempt for the law and The distortion of exchange values, so it was argued,
encouraged class hatred; and they were futile meant that labour exchanged below and
because no action could permanently drive up the commodities above their natural values, and under-
level of wages above the natiual rate. McCulloch consumption and the usual accompaniments of
believed in peaceful combination: ‘There is no good economic crisis were the result. Hence a concern
reason why workmen should not, like the possessors with equitable exchange relations with an emphasis
of every other valuable and desirable article, be upon the Labour Exchanges of the early 1830s and
allowed to set whatever price they please upon the a longer term preoccupation with money and the
labour they have to dispose of’ - but he was clear banking system (N.W. Thompson ). Owenism, with
above all that no artificial levels of wages could its ideas of cooperation was a central strand in the
possibly maintain themselves in a competitive organization of the Grand National Consolidated
market. He saw the usefulness of combinations in a Trades’ Union in 1834-5 from which, it should be
strictly narrow context. He warned against strikes, noted, a number of skilled societies held aloof; and
as normally benefitting those employers outside the Owenism, which was critical of policies such as
strike action, and underlined the danger of industry- mutual support funds for unemployment, sickness
wide strikes in reducing the competitiveness of and death, and remained only a partial influence
home industry compared with foreign production upon union attitudes. After the failure of the Grand
(O’Brien , pp. 366-70). He was notably adamant National in the mid-1830s the combinations which
against any use of force or intimidation in the day- survived - almost entirely made up of skilled
to-day activities of the combinations and was workers and craftsmen - continued with their
particularly opposed to attempts to compel traditional sectional approaches to industrial
workmen to join combinations or participate in problems. There remained a residue of Owenite
strikes or other action; and he commended the ideas - in cooperative production for example which
clauses in the 1825 Act which imposed prison the Christian Socialists of 1848-1854 were able to
sentences for those convicted of such intimidation. draw upon (Raven ; Saville ) - but these were
McCulloch elaborated his ideas in the Essay on attitudes which became progressively weaker in the
Wages of 1826, which he published in a revised decades after the half century.
version in the Treatise on Wages of the early 1850s,
and he summarized
1826 Combinatorics
‘marginal revolution’ there has been clear emphasis The development of mathematical program-
on the use of differential methods and an implicit ming, in particular of linear programming, has
acceptance of the perfect divisibility of goods. relied particularly on combinatorics. The algorithms
Indeed if we consider the most elaborate extension developed to solve such problems are essentially
of the basic Arrow-Debreu model it is to one in combinatoric. Many economic models have been
which there is a continuum of agents and a built on the basis of the ‘activity analysis’ or fixed
continuum of goods. This is just the sort of context coefficient production approach.
in which the combinatoric approach is of little use. Game theory has made extensive use of com-
Yet the economist may well feel that discrete binatorics and provides an interesting example of
problems are of importance and that indeed the how the combinatoric approach can be confronted
world is best represented as one where goods are with one using continuous functions or compact
not infinitely divisible and where the number of sets.
agents is finite. Now given the standard assump- Combinatoric arguments are used to show that
tions of convexity, at least in production, the per- the core of a balanced game is non-empty. A simple
fectly competitive model may indeed be regarded as argument shows that a market game is balanced and
a satisfactory ideal or limiting case and the further it can be shown that the only allocations
finiteness of a real economy is just an inconve- remaining if we replicate a given economy are
nience. In this case it would seem that combina- competitive. This leads us to the conclusion that a
torics has little role to play. competitive or Walrasian allocation exists. Now the
However although we may consider taking latter statement is known to be equivalent to the
divisibility as a reasonable idealization in a large existence of a fixed point for a continuous mapping.
economy (even this may be questioned - see ►) as Thus we arrive by combinatorical methods at the
soon as same result as that obtained by an apparently very
convexity is dropped as an assumption for pro- different tool.
duction the problem of finiteness cannot be avoided. This argument is reinforced by the fact that the
If there are increasing returns to scale there will be a algorithms developed for finding approximate
minimum profitable plant size and there will be a competitive equilibria are essentially combinatoric
fundamental indivisibility. At this point in nature. They consist in systematically examining
combinatoric analysis comes back into its own, and points in the price space, of evaluating the total
no argument can be made for using infinitely excess demand of an economy at these points and
divisible goods as a reasonable approximation. Thus finding a path which leads to a reduction in the
the existence of non-convexities will lead us back excess demand until it is close to zero. Again, the
into a situation which may, for example, be game-1 approximation of fixed points of countinuous
ike and in which we will be looking for a solution functions is obtained by a combinatoric approach.
with a finite number of large plants. There are many other examples of ways in
This is to suggest that realism may lead us away which combinatoric analysis has proved useful.
from the smooth differential world to which Arrow’s theorem on the impossibility of a social
economists are accustomed towards a discrete one welfare function is typically proved in the case of a
in which combinatorics plays an important role. finite number of individuals faced with a finite
Nevertheless, till now, finiteness and the com- number of social alternatives. The problem is to
binatoric approach have not occupied a significant find a way of aggregating individual preference
place in economics. However, some examples will orders on these alternatives into a social order in a
show that certain branches, although not central, way which satisfies certain simple axioms. The
have made extensive use of such an approach. usual line of proof consists of finding certain
configurations of individual preferences which
1828 Command Economy
Keywords
Active vs passive money; Aggregation; Balance;
See Also Bounded rationality; Bureaucracy; Central
planning; Centralization; Command economy;
► A: v:
Command mechanism; Command principle;
► Fixed Point Theorems
Communism; Complex social economy;
► 2'e::.:..e Tiec::/
Corruption; Decentralization; Discretion;
► ' KVI
Gorbachev, M. S.; Gross output; Inca production
system; Incentive provision; Industrialization;
► :: _;:msey, Frank Plumpton (1333-1930) Inequality of income; Information; khozraschet;
Labour discipline; Market mechanism; Market
References versus plan; Microbalance; Moneyness; Mormon
economic system; Neurath, O.; perestroika;
There elegant and elementary introductions to combinatorics
are Berge (1971), Polya, Tarjan and Woods (1983) and
Price control; Principal and agent; Rationing;
Ryser (1963). A more advanced text is Aigner (1979). Resource allocation; Second economy; Sellers’
Aigner, M. 1979. C o m b i n a t o r i a l t h e o r y . Berlin: market; Shadow economy; Soft-budget
Springer. constraint; Soviet economic reform; Soviet
Berge, C. 1971. P r i n c i p l e s o f c o m b i n a t o r i c s . New
York: Academic.
Union; Stalin, J.V.; Suboptimization; Unit of
Polya, G., R.E. Taijan, and D.R. Woods. 1983. N o t e s o n measure; Vested interests; War and economics;
introductory combinatorics. Boston: War communism
Birkhauser.
Ryser, H.J. 1963. C o m b i n a t o r i a l m a t h e m a t i c s ,
Mathematical Association of America. New York: Wiley.
Command Economy 1829
Nature of the Command Economy occasioned by inflationary pressure arising from (a)
and/or (b).
The seminal analysis of the nature, characteristics, The command economy therefore requires a
and problems of a command economy is Grossman formal, centralized, administrative hierarchy staffed
( ). by a bureaucracy, and it also needs to be embedded
Any complex social economy must, for its very in (at least) an authoritarian, highly centralized
survival, maintain at least a ‘tolerable’ micro- polity if it is not to dissolve or degenerate into
balance, ‘that minimal degree of coordination of the something else. And that bureaucracy, if it is to
activities of the separate units (firms) which assures effectively implement the command principle, must
a tolerably good correspondence between the supply exercise full control and discretion, if not
of individual producer and consumer goods and the necessarily formal ownership, with respect to the
effective demand for them’ (Grossman , p. creation, use and disposal of all productive property
101). In such an economy, and assets. At the same time, each office or firm and
appropriate balance can be achieved through every economic actor within the command structure
decentralized, market-based (monetized, price- holds interests which, if only in part, do not
mediated) interaction of autonomous units, or by coincide with those of superiors or of the overall
virtue of explicit specific coordinating directives leadership. This generates important problems of
(commands, targets) from some higher authorities. vested interests, principal-agent interaction,
While the former is characteristic of a market incentive provision, and general enforcement of the
economic system, the latter is defining of a ‘com- leadership’s will, and calls for a variety of
mand economy’. In the latter operational-level units monitoring organizations (party, police, banks, and
(for example, firms) must merely ‘implement’ so on). The term ‘command’ must not be taken to
commands; they become ‘executants’ of plans and preclude self-serving behaviour, bureaucratic
directives from above, plans which must insure politics, bargaining between superiors and
balance through the coherence and consistency of subordinates, corruption, peculation and
the instructions they give. Thus the command (dis)simulation. On the contrary, such behaviour
mechanism requires relative centralization and tends to be widespread in a command economy; yet
severe restriction on the autonomy of subordinate the concept of a ‘command economy’ remains valid
operational units. It derives from the overwhelming so long as, in the main, authority relations and not a
priority of social goals, and requires the severe market mechanism govern the allocation of
limitation, if not total destruction, of autonomous resources.
social and economic powers and the enforcement of When not externally imposed, command econ-
strict obedience to directives. omies typically arise from a millennialist elite, with
A command economy is hence a creature of unique access to ‘the truth’, achieving the political
state authority, whose marks it bears and by whose power to impose its will, while facing a crisis of
hand it evolves, exists and survives. Command apparently overwhelming proportions. The
economies are imposed, whether through external perception of a life-threatening crisis, driving the
duress or imitation, or indigenously in order to need for massive mobilization of all social
achieve specific purposes such as (a) maximum resources and rendering potentially disastrous any
resource mobilization towards urgent and overrid- hesitation or dissent, any questioning of ways and
ing national objectives, such as rapid industriali- means, naturally leads, pushed by the ‘logic of
zation or the prosecution of war; (b) radical events’, to the usurpation of all power of discretion,
transformation of the socio-economic system in a all legitimate authority, by the ‘knowing’ elite,
collectivist direction based on ideological tenets and which then becomes responsible for all that is done
power-political imperatives; and (c), not least, or not done in the society and the economy. The
curing the disorganization of a market economy crisis may be artificial or real (‘hostile
brought about by price control, possibly encirclement’), externally or internally imposed (the
need to industrialize, to ‘catch up’),
Command Economy 1831
but it requires moving resources rapidly and mas- Soviet Union does little by way of consciously
sively, forcing new activities and interactions in the steering the economy’s development or finding effi-
cient patterns of resource allocation. Its overwhelm-
face of severe scarcities, of shortage of competent ing concern is simply to equate both sides of each
personnel, of massive uncertainties, and of strongly ‘material balance’ by whatever procedure seems to
held, stark priorities. Indeed, a sense of be most expeditious (Grossman , p. 108).
overwhelming urgency and need for haste drove the
elite of the Soviet Union in the 1930s to test and A major problem is that detailed planning and
establish, through trial and error over several the corresponding directives are often late, are
decades, the institutional structure of a ‘command insufficiently detailed, may lack the requisite
economy’, albeit less than absolute from both information, hence often cannot be effectively
necessity and choice (for example, due to the coordinated, and owing to their rigidity are pecu-
‘lessons’ of ‘War Communism’) (Grossman ; liarly vulnerable to uncertainty (Ericson ).
Zaleski ). Information in the command sector, by the logic of
the system, tends to flow vertically up and down the
administrative hierarchy rather than horizontally
Consequences of Command between buyer and seller, adding to difficulties of
demand-supply coordination by informationally
Rational application of the command principle calls isolating operational units from their suppliers and
for planning, which is basically of two types. users. In addition, problems of motivation,
Longer-term, developmental planning expresses the accountability (down as well as up), inappropriate
leadership’s politico-economic strategy (for decision-making parameters, and divergent interests
example, five-year and ‘perspective’ plans); complicate the procedure. Even at best, this manner
shorter-term, coordinative planning (annual, quar- of resource allocation can hope to attain only
terly, monthly, ten-day) ideally translates the strat- internal consistency (in the sense of effectively
egy into resource allocation while aiming to match matching partially disaggregated requirements and
resource requirements and availabilities for indi- availabilities) but not a higher order of economic
vidual inputs, goods, and so on, in a sufficiently efficiency. Economic calculation in pursuit of
disaggregated way for given time periods and loca- efficiency enters, if at all, at the project-planning
tions. The task of elemental coordination, of micro - stage, and not short-term resource allocation and
balance, so effortlessly accomplished by any func- use.
tioning (however poorly) market system, is over- These problems are aggravated by the logic of
whelmingly large, and grows rapidly with haste that drove imposition of the command
industrialization and economic development, both economy - ‘the pressing contrast between urgent
of which lead to exponential growth of the com- political goals and available resources’ (Grossman ,
plexity of die economy, and hence of the planning p. 108). The necessary attention to the growing
problem. With centralization and the abandonment problem of balance further militates against any
of markets comes the need for massive, detailed effort to consider developmental objectives or effi-
coordinative planning, for ‘making ends meet’ in ciency in making allocative decisions, so that a
the expanding web of interconnections that must be further bias against allocative efficiency is built into
maintained for economic life to continue. Coordi- the command economy. Coupled with limited
native planning serves, therefore, as the basis for ability to gather, filter, process, and communicate
specific operational directives to producers and information, and to compute solutions to planning
users, thereby implementing the command principle problems, this creates a fundamental and growing
to achieve the prime imperative of a social economy inability to acceptably solve the underlying coor-
- ‘balance’. dination problem, and hence further undermines any
It is t h i s task that in fact consumes the largest part of consideration of efficiency.
the so-called planning in the command economy...
The logic of ‘command’ has a number of other
Coordinative planning as it is conducted in the
consequences reflected in the institutions of such
1832 Command Economy
or on the nature and direction of economic/indus- state sectors and as the interface between them and
trial development; all real activity is preordained in state economic institutions/sectors.
the plan and its subsequent implementing com-
mands. The role of money is then to facilitate
monitoring of commanded performance through the Inherent Challenges to the Command
financial flows it generates. Thus monetary prices Economy
do not, and indeed should not, reflect to a
substantial degree social goals and priorities; they As Grossman notes in his seminal article ( ,
merely reveal and measure the flow of commanded p. 107), ‘The chief persistent systemic problem of a
activity. Producer prices (and most retail prices), command economy is the finding of the optimal
wages, prices of foreign currencies, and so on are degree of centralization (or decentralization) under
generally centrally set and controlled, often given conditions and with reference to given social
remaining fixed for long periods of time. Micro- goals’. The fundamental dilemma is that full
disequilibria naturally abound, while the widely centralization poses an insoluble problem, while
perceived dubious meaningfulness of such prices decentralization abandons the ability to direct, to
and the administrative allocation of most producer control development, and to ensure the pursuit of
goods in physical terms combine to sustain the social goals and priorities. With regard to the pine
system of detailed production plans and directives in planning problem, a large body of theoretical
terms of physical indicators - yet another bar to literature arose in the late 1960s, and continued into
more efficient planning and management. the 1980s, on the problem of decentralizing the
Finally, an absolutely essential, indeed defining, planning process to make its informational and
institution of the command economy is the physical computational burden manageable (Eckstein ;
rationing of resources and producers’ goods. This is Bomstein ). But the problem is far greater, and less
where the market is most fully and directly replaced, studied, with respect to implementation; rational
and where the central authorities have the ability to planning is swamped by the struggle to maintain
most directly influence and control the behaviour of elemental coordination.
subordinate operational units. It implements the
centralized mobilization of resources to priorities,
the most direct response to crises and challenges. Decentralization Versus Priority
And it most directly denies to subordinates the
Looked at through the prism of relative advantages,
capability to produce, to develop, in ways outside
operational decentralization shortens ‘lines of com-
those authorized in the plan. This makes the
munication’, increasing flexibility, adaptation and
coexistence between the command principle and the
responsiveness to a changing environment through
market mechanism a source of continual conflict, as
local initiative and innovation, and vastly simpli-
the market opens unauthorized opportunities to sub-
fying the decision problem of economic agents. But
ordinates. In the Soviet Union the command prin-
it does so at the cost of weakening or losing the
ciple, aided by the club of materials rationing,
‘advantages of centralization’, including
repeatedly pushed back and eliminated the market
enforcement of regime values, capability for large-
mechanism when (timidly) introduced in reforms,
scale resource mobilization, concentration of scarce
until the system collapsed in chaos, and the intro-
talent in central decision-making organs, and the
duction of a full-fledged market economy was
maintenance of macro-balance. In particular,
begun in 1992 (Schroeder ; Aslund ). Thus the
decentralization compromises the ability of the
nature of the command system makes it
centre to directly manage the development and
fundamentally incompatible with real markets,
structure of the economy and to force the achieve-
although some market institutions can, and indeed
ment of critical priorities regardless of cost Fur-
must, be allowed to function both within the non
thermore, decentralization requires the introduction
1834 Command Economy
of the alternative coordination mechanism to insure for decision-making is necessarily limited and
tolerable micro-balance - the market - as decen- deliberately subordinated to the planners’ will and
tralization undercuts the ability to directly coordi- the administrators’ power. Banks and the treasury
nate, to balance from above. Thus, to prevent accommodate the money needs of production,
catastrophic imbalance, a more active money with ensuring a soft budget constraint for the individual
economically flexible market prices must be allo- firm. At the same time, the ‘moneyness’ of money
wed to function in a decentralized system. at the firm level is low, hemmed in as it is by
The impossibility of planning and commanding administrative constraints and impediments,
the performance of all economic agents in full including the rationing of nearly all producer goods,
operational detail, however, forces some decen- and by the widespread ‘seller’s market’ (shortages
tralization. This creates a chronic threat to balance of goods and absolute lack of buyers’ alternatives).
which is thus a continuous argument for (re) This monetary ease, together with the sellers’
centralization of planning and materials allocation. market, plays an important role in ensuring
Furthermore, a partial decentralization of planning individual workers’ job security at the firm level
and management in a command economy may do and full employment in the large, while keeping the
more harm than good; it may impair balance firm largely insensitive to money costs and/or
without yielding sufficient benefit. Yet a complete benefits.
decentralization, in the sense of a virtually full Within the command sector, money and prices
devolution of the major production decisions to the have a necessary role in determining terms of
firm level, would be disastrous from the standpoint alternate resource uses only within planned/
of balance, unless the price structure were properly commanded categories, and money has the role of
altered to provide proper signals to firms and limiting total claims to resources in areas, or at a
suitable behavioural rules were prescribed, that is, level of detail, beyond the reach of plan directives.
unless a market mechanism were introduced. Thus This requires ‘businesslike management’ within the
the logic of command predicts a ‘treadmill firm - khozraschet, which is a ‘set of behavioral
ofreforms’ (Schroeder ), an array of countervailing rules that is supposed to govern the actions of
strengthenings of the oversight and control organs Soviet managers beyond their primary responsi-
(in particular, the Party), and enhancements of their bility, the fulfillment of output targets’. It pushes
role in the economy, accompanying moves towards the firm toward ‘technical efficiency’ and limitation
decentralization in the state sector. It also explains of ‘claims on society’s resources for productive use.
the Soviet institutional arrangement of inter-firm ... khozraschet is a system that is well devised to
contracts as a decentralized implementation device. control the behavior of managers in a command
These are required to specify details of interaction economy where a certain amount of devolution of
within planned categories, and establish observable, power to them is inevitable, and where, further,
and hence legally enforceable, commitments to managers’ goals and values do not necessarily
planned implementation, constraining the autonomy coincide with the official ones’ (Grossman , p. 117).
necessarily granted through the minimal Thus money also has the role of facilitating the
decentralization. And it explains the logic of the monitoring of performance in the command sector.
continuing restraints on the use of money and the While administrative orders are the mle in a
continuing efforts at effective price control to keep command economy, backed up by greater or lesser
the autonomy of agents restricted to the minimum degree of state coercion (depending on country and
necessary for the continued functioning of the less- period), any decentralization of implementation
than-absolute command economy. naturally relies on monetary (‘material’) incentives
Even limited decentralization requires that to elicit desired individual compliance and perfor-
money be used in the command sector (as well as in mance. Compounding the incentive problems aris-
the household sector), but its role as a bearer of ing from differences in information and interests
options and as the means of pecuniary calculation between central authorities and implementing
Command Economy 1835
agents is the fact that the physical and other indi- fail to replicate the decisions that their superiors
cators to which the material incentives are linked would have made had they been in a position to
may often be poor measures of social benefit (as make them. This problem is aggravated by the
seen by the leadership). Furthermore, resort to such inevitable ambiguity, incompleteness and inconsis-
rewards widens the distribution of official earnings tency of those rales, incentives and the information
and raises questions of permissible limits of income available on the spot. Only binding physical con-
inequality. Yet there may be little choice in that the straints and observable outcomes can be systemat-
state must in effect compete with the much higher ically enforced, making ‘centralized materials
incomes from the second economy. Indeed, the allocation the most powerful weapon at the disposal
Soviet Union during War Communism, Cuba in the of the central authorities’ (Grossman , p. 118).
1960s, and the People’s Republic of China during Thus, where material inputs are less determinate of
some periods before Mao’s death in 1976 tended to a unit’s activities, this information and incentive
downgrade material incentives in favour of problem is greater, and the defiance of central will
normative controls, but never did quite abolish relatively more widespread and successful. This
them. observation explains the non-viability of any reform
The behaviour of the Soviet-type firm has been that fails to fundamentally alter the materials
much studied (Granick ; Berliner ; Nove allocation system.
; Freris ). Because its directives and the
corresponding managerial incentives stress physical
output, produced or shipped, and thanks to its low Under-Planned, Ill-Commanded Sectors
sensitivity to cost and the ambient sellers’ market,
the firm often sacrifices product cost, quality, A major challenge to the command economy also
variety, innovation and ancillary services to its arises from the existence of sectors outside, or only
customers to sheer product quantity. By the logic of partially affected by, the command principle. In the
command and the requirements of plan Soviet Union these included most of agriculture,
manageability, firms operate in an environment with much of housing, the household sector and some
sole suppliers and assigned users, reducing consumer goods and services. ‘Markets’ were
complexity by eliminating ‘wasteful’ redundancy in allowed to function for the distribution of final
production and distribution. Thus firms in a consumer goods and services, including agricultural
command economy are largely insulated from any produce, for much of the activity of the ‘collective’
product competition, both from the outside world sector in agriculture and for household labour
and from other domestic firms, thanks to the climate supply. For transactions with ‘personal property’
of administrative controls and the prevalent excess within the household and collective sectors, money
demand for their output. Difficulties with supply, was active and agents responded to market prices,
frequent revision of its plans, interference by Party while in the quasi-markets interfacing with the state
and other authorities, and other systemic problems sector - for example, labour and consumer goods -
also stand in the way of its more efficient and money was relatively active but prices remained
effective operation. Indeed, to function at all, the largely controlled and non- market. These are
firm’s management is frequently forced to break sectors where information on needs/preferences and
rales and even resort to criminally punishable acts. capabilities proved too difficult to acquire reliably
This compounds a further critical challenge in real time for acceptable allocation and balance to
posed by necessary decentralization - the conflict be commanded, and so at least one side of a market
between the will, purposes, incentives and priorities was allowed to function with an active money.
of the higher authorities and those at lower levels, Here, the command mechanism proves too crude
particularly of the firms and their managements. and clumsy, and hence politically
Even the best-motivated managers, following all counterproductive, to be used outside of pressing
official rales and incentives, will sometimes emergencies. Indeed, this might be considered a
lesson of War
1836 Command Economy
Communism, the first experience with a command than-absolute command economy. As the complete
economy in Soviet Russia, 1918-21. centralization of decisions in the production sector
(let alone in the household sector) is an
In view of the theoretical incompatibility of
command and market, how could these ‘market’ impossibility, something must be left to local ini-
tiative and dispersed decision making. Thus
sectors be successfully grafted on to the command
mechanism? An explanation (Grossman ) rests on khozraschet is a logical necessity, ‘... an unfriendly
the trade-offs between the authorities’ limitedbridgehead that threatens to seize ground whenever
the planner fails or defaults’ (Grossman , p. 228).
capabilities, the complexity of those sectors, and
With the inevitable devolution of some decision
their centrality to regime priorities. A sector which
making to firms and households, money acquires a
provides significant inputs to physical planning and
necessary and critical role in the command
plan fulfilment, where the unpredictability in the
economy, going well beyond that consistent with
flow of goods is unacceptable, cannot be left to the
market without seriously undermining command. the logic of command. That role arises from the
need to economize in making decentralized
However, if a ‘market’ sector can be treated as a
residual for purposes of materials planning anddecisions, and as a medium of exchange and store
of value in the decentralized interactions that relate
allocation, a buffer for planning, then its coexis-
to all decisions. In acquiring this role, this
tence is acceptable. Further, if its operation is char-
‘moneyness’, it allows accumulations of power
acterized by rapid change and complexities rather
outside the control of the regime. Money is a
outside the core interests of the regime, if without
‘bearer of options’ whose power and influence must
disrupting the industrial core greater incentives and
be restrained if the command mechanism is to
risk can be placed on those peripheral agents, and if
operate properly - to determine priorities and to
non-market constraints can force the desired market
response from it, then the centre will want to insure maximal commitment to their achievement.
separate that sector from the command sphere, As Grossman ( , p. 214)
noted, ‘Money is a form of social power that may
lowering its coordination burden by shifting it to the
market. lead resources astray and is subject to only imper-
fect control by political authority.’
These considerations were indeed active in the
case of those sectors ‘left to the market’ in the Thus the power of money has to be curbed in a
Soviet Union: consumer goods retailing, the command economy by limiting balances available
acquisition of labour services, the support of to households and firms, by compartmentalizing
households in the countryside through a privatemoney into cash and ‘firm’ circuits, and by erecting
barriers and limits to the use of ‘monies’ in each
agricultural sector, and a few peripheral and inter-
category, although that undercuts the effectiveness
stitial activities. Indeed, any attempt to truly
of
‘marketize’ any other sectors or activities in the attempted decentralizations. Liquidity,
‘moneyness’, is constrained by the institutional
command economy is doomed to fail unless the loss
structures and by all the characteristics and
of fervour, of the sense of mission and urgency,
leads to abandonment of the command mechanism. conditions of the ‘sellers’ market’, rendering
Yet even the existence of these limited market ‘money’ the only non-scarce commodity, in
unusable excess to the extent the command
sectors, providing an outlet for incentive earnings
and diverted resources, exerts a continuing mechanism is effective. Monetary policy in the
corrosive pressure on the command economy and properly functioning command economy reduces to
its control mechanisms. limiting the volume of cash in the economy
(‘macro-monetary’ control) through wage fund
restrictions and cash control absorption plans of the
The Cancer of 'Money' retail sector, and the allocation of firm balances in
restricted categories (‘micro-financial’ control) in
A truly monumental challenge to the command just sufficient quantity to support the
economy lies in the role of money in any less- implementation of the plan, with confiscation of
Command Economy 1837
excess funds to prevent unauthorized activity by the more of a command over goods and services, than
firm (Garvy ). desired. This arises from a multitude of factors:
Similarly, the price system, expressed in terms errors in both physical and financial plans, inherent
of that money, must also be mobilized to the incompleteness of plans and commands due to
purpose of control. The inflexible, administratively limited information and time and to the necessity of
segmented, average cost-based prices in command aggregation, changing circumstances and shocks to
economies are a logical necessity of command- and the economy, mistakes in implementation and in
haste-based shortage. For all the problems they responding to shocks, the irregularity and
cause, all the unintended consequences and disruptions in the materials allocation system, the
distortions in the behaviour of subordinates, such behavioural response of even the most enthusiastic
prices help to keep money largely passive, at least and best-intentioned agents to these problems, and
in the core state sectors, and allow both money and so on. All of these can lead to an unexpected lack of
prices to remain instmments, rather than disrupters, funds for doing what was commanded (if only
of command. More than being ideologically implicitly), and hence disruption of commanded
justified, such prices are a response to the pragmatic performance, unless additional liquidity is provided.
and pressing requirements of running a shortage Thus monetary policy in a command economy,
economy with a rapidly developing system of once money is allowed any room for activity, must
centralized direction of enterprises and of materials be accommodating; a lack of funds can never be
allocation. allowed to disrupt planned performance, just as an
Money, however, is not so easily contained. excess of funds cannot be allowed to facilitate
Once in unobserved hands, it exercises its ‘com- unplanned or unauthorized activity. Thus the role,
mand over goods and services’ without reference to the influence, of money has a natural, inexorable
plans, commands or regime priorities. Hence, given tendency to grow: insufficient funds become an
any discretion, in any sphere of activity not directly immediate problem generating new money through
monitored agents will naturally use money in ways credits or additional allocations, while unused funds
they find desirable, placing new demands on a tend to stay hidden until ferreted out by inspection
physical system otherwise tautly planned and or accidental discovery. And as it grows, so does
characterized by general scarcity. This is facilitated the challenge to the command principle. An
by the existence of agents and spheres of activity increasing number of agents, in both the state and
outside the command system, providing ‘legitimate’ non-state sectors, has a growing ability to access
sources and uses for monies, however acquired or resources, to divert them in the name, if not always
disposed. And the possibility of acquiring money the interest, of implementing decentralized plans,
provides incentives for unauthorized activities, and thus to challenge the priorities of the political
incentives to undertake unplanned interactions and authorities. This growing challenge becomes a
reallocations. An active money vastly expands the cancer in the system, a growth that undermines its
sphere of discretion of ‘subordinate’ agents beyond health and feeds tendencies destructive of the
any authorized by a decentralizing reform, and calls priorities of the regime and its rulers.
for severe administrative restrictions, a reduction to
passivity, if it is not to disrupt the planned activities
and discretion of the central authorities. The 'Second £000001/
Yet attempts to administratively constrain the
As the command economy matures, as the messi-
influence, the ‘corruptive’ power, of money become
anic fervour with which it was imposed wanes and
increasingly futile once the ‘genie’ has been ‘let out
the use of extraordinary force diminishes in ensur-
of the bottle’. Even limited decentralizing reform,
ing compliance with commands, these challenges to
allowing money to influence some (subcategory)
command metastasize into a competing yet
production and allocation decisions, inevitably lets
loose more liquidity,
1838 Command Economy
symbiotically attached and dependent economic enthusiasm of its agents, the growth of such a
system: the second economy. This name highlights second economy appears natural.
the distinction of this sphere of economic activity Thus growing ‘monetization’, the existence of
from the officially sanctioned, ‘first’, command ready and waiting market sectors, and the decline in
economy. It is thus defined as ‘all production and the use of violent instmments of enforcement lead
exchange activity that meets at least one of two to a growing sphere and importance of activities
criteria: (a) being directly for private gain; (b) being outside the purview of ‘planning’ and ‘command’.
in some significant respect in knowing contraven- These market-mediated activities are at times
tion of existing law’ (Grossman , p. 25). supportive, helping to achieve tolerable micro-
In the Soviet Union, attempts to strengthen balance in the increasingly complex economy, but
‘material incentives’ and activate ‘the profit motive’ often are in violation of planned implementation
in order to increase the effectiveness and technical and regime values. Private interests, necessarily
efficiency of the implementation of central plans allowed some leeway, grow in significance,
and directives and to stimulate technological increasingly seizing ground from command. In the
progress and innovation, and the growing Soviet Union, the private agricultural sector,
monetization of the agricultural sector, opened the initially permitted only to secure survival of the
door to massive expansion of money supply and peasantry under the extractive pressure of rapid
eroded the barriers between the currency and the industrialization, and the consumers’ personal
enterprise bank account monetary circuits. Col- services sectors provided the basis for a ubiquitous,
lective farms and their subsidiary enterprises, if still systemically marginal, second economy.
owners of‘small means of transport’, vodka man- But then even the core industrial sectors under
ufacture and distribution, and the Caucasus repub- the command mechanism find their managers and
lics (Georgia in particular) proved particularly rich activities increasingly influenced by this illegiti-
sources of illicit (from the system’s perspective) mate, shadow market, system, as managers are often
monetization and private ‘entrepreneurial’ activity. forced to break rules and undertake illegal acts in
This both raised the spectre of inflation and opened order to do their job. Such acts, together with
the door to vastly increased opportunities for ubiquitous and protean illegal activity on private
manipulation by self-interested subordinates in the account, add up to a large underground economy
command sector. Thus the use of ‘economic levers’ characteristic of every command economy, which
greatly increased the opportunity for and incidence together with legal private activity (allowed in
of bribery, corruption, speculation, and even varying degree in different countries) both supports
‘honest’ private labour. and supplements the ‘first economy’ and is inimical
While the fundamental cause of the appearance to it While the second economy significantly adds
and growth of the ‘second economy’ undoubtedly to the supply of goods and services, especially for
lies in the congenital institutional weaknesses of the consumption, it also redistributes private income
command economy discussed above, there are a and wealth, contributes to the widespread official
number of proximate sources that make it unsur- corruption, and generally criminalizes the
prising. These include extensive price control, with population. Virtually every area of economic life is
consequent scarcity and misallocation, high taxes touched upon, and often entangled with, ‘second
on non-state activities/incomes, prohibitions of economy’ activities, while legal private activity
private activity, unmet individual consumption naturally opens a loophole for illegal trading and
needs, poorly protected impersonal (state) property, entrepreneurship, generally below the purview of
the personal power of bureaucrats and ‘gate- the authorities. And it goes hand in hand with the
keepers’, and other historical factors, including the extension of corruption, ensuring that it remains
end of terror. These provide both motives and outside of official notice.
opportunities for officially illicit activity and for the Those ‘violations’ of legality within the com-
authorities to overlook that activity. With the ageing mand sector, a ‘shadow economy’, build informal
of command and the decay in
Command Economy 1839
inter-enterprise relations which are generally ben- own weight and rigidity (Powell ). Thus the
eficial to the operation of state enterprises. They second/shadow economy provides a spontaneous
work to substantially correct the allocative failures surrogate economic reform that imparts a necessary
of the command mechanism, improving firm modicum of flexibility, adaptability and respon-
performance and hence benefiting its management, siveness to a formal set-up that is too often para-
and also provide lucrative opportunities for lyzing in its rigidity, slowness, and inefficiency. In
managers to directly benefit through the activization doing so, the second economy also provides a
of barter, personal connections, and bribery. valuable stabilizing influence on society and the
However, they also spawn further distortions in polity, making life livable and the system humanly
economic behaviour, as managers seek to generate manipulable and responsive to private inducement.
access to cash, the life blood of the ‘second It makes everyone complicit in the way things
economy’, to extract rents, and to hide their activity work, equally ‘guilty’ before state and society,
from supervisory and statistical organs. while providing an almost legitimate, and not polit-
Thus the second economy plays a dual and ically dangerous or directly destructive, outlet for
contradictory role in the command economic sys- individual initiative and entrepreneurship. Finally, it
tem. First, it addresses a number of the problems of relieves inflationary pressures (a ‘monetary over-
coordination and balance endemic to the command hang’) resulting from the command economy’s
mechanism, reallocating both producers’ and necessary combination of monetary looseness and
consumers’ goods, facilitating plan fulfilment and pricing rigidity.
the use of financial incentives, and generating new Despite this positive functional role, the second
incomes and ‘politically safe’ outlets for private economy also has a less positive systemic impact. It
initiative. Hence it becomes important for mocks the pretense of social direction and control,
enhancing consumer welfare, for production sta- subverts its egalitarian impulse by accentuating
bility, and even for social stability. The ‘second differences in access and income, and gives the lie
economy’, and in particular its ‘shadow’ side, plays to the pretense of a ‘new’ ideologically correct
an essential role in the first economy as a ‘pressure (‘Soviet’) man. Its very existence and usefulness
valve’, a release ‘fixing command’ by maintaining thus subvert the ideology of the regime, and it
micro-balance and covering ‘holes’ in economic life works against and undercuts regime priorities by
left by the mistakes or oversight of the planners and exposing the incompetence and incapacity of the
central managers. And this role becomes authorities. Its provision of alternatives weakens the
increasingly important as the economy grows more ‘plan, production, and labour discipline’ so essential
complex and diversified, and hence becomes less to the proper operation of the command mechanism.
susceptible to conscious oversight and direction. Indeed, it attacks the core of the command
As the central authorities struggle with their loss mechanism as it ‘... elevates the power of money in
of control, searching for a solution through reform, society to rival that of the dictatorship itself,
decentralization and recentralization, monetization rendering the regimes implements of rule less
and administrative restriction, agents in the effective and less certain’ (Grossman , p. 36). In
economy take advantage of gaps in control, of the particular, it corrupts officialdom and distorts
autonomy and discretion offered by growing prices, adding a (positive or negative) ‘second
liquidity of the quasimoney in the system, to deal economy margin’, both ‘in kind’ and in money,
with problems of coordination and balance, breaking prices as an effective instrument of
inconsistency of plans and commands, and control. This weakens monetized incentives for
ubiquitous shortages and scarcities. Of course they state activities by providing competing, and often
operate in light of their own partial information, and better, alternatives to them. Hence the second
in their own (private as well as official) interests, economy, and in particular the ‘shadow economy’
but in so doing save the system from collapsing in the state sector, completes the cancerous
under its development of agent autonomy, of the ability to
work outside the plan and its
1840 Command Economy
mechanism and retaining the hierarchical form of growth and the development of the economy. With
organization and the command principle. In the economic growth came increasing complexity and
hope of stimulating efficiency to revive growth growing intractability of the central planning and
rates, the decentralizing measures were accompa- economic management problem. Some
nied by a number of other ‘reforms’ relating to decentralization became essential, and increasingly
organizational structure: prices (still controlled), so as time passed, opening the door once again to
incentives, indicators, materials rationing, and so the rise of money as a significant influence on the
on. The Soviet reforms of 1965, and those in the operation and development of the economy. And
1970s and 1980s prior to perestroika, were of that that influence was only enhanced by the ageing and
kind; many similar ones took place in other Com- mellowing of the system. With the passing of
munist countries after the mid-1950s and prior to ‘terror’ as an effective incentive mechanism, the
the overthrow of Communism in 1989. On the stabilization of personnel and the regularization of
whole, such reforms had little success in addressing procedures, it became ever harder to control agent
the problems of the command economy. behaviour, to contain the distractions of money and
Bureaucratic and political obstacles apart, the the self-interests it mobilized, and to uncover the
attempt to decentralize economic decisions without rents that well- placed agents were able to extract,
bringing in a market mechanism almost inevitably thus aggravating the inherent agency problems of
leads to economic difficulties. The beneficiaries of the command economy.
devolution of decision-making lack the necessary The remaining years of the Soviet system thus
information to produce just what the economy witnessed an epic stmggle, barely perceptible at
requires or to invest to meet prospective needs, and first, but increasingly evident as reforms, decen-
the coordination of plan-subsequent command is tralizations, reorganizations and recentralizations
lost. Moreover, they may apply the additional cycled around each other in the search for a solution
power at their disposal to advance par- ticularist to the increasingly evident and destmctive
causes or to divert resources into illegal channels. malperformance and waste, and aggravating
Microeconomic disequilibria mount, and soon behavioural distortions in response thereto, gen-
superior authorities step in to recentralize on a case- erated by the struggle between the ‘command
by-case basis and the reform withers away principle’ and the weak, but inexorably emerging,
(Grossman ; Wiles , ch. 7; Kontorovich ). ‘market’. Initially reflected in the dysfunctions of
This failure of reform reflects the inherent con- the marginal and quasi-markets of the command
tradictions of the command economy framed in the economy, and in the struggle to harness a ‘passive’
irreconcilable conflict between ‘command’ and money to the purposes of command, the role of
‘money’ discussed above (Ericson ). The Soviet money grew along the ‘treadmill of reforms’ into
command economy, driven by the urgent need for the rival, if still largely subordinate and com-
and haste in industrialization and military plementary, ‘second economy’, and in particular its
development, initially relegated the influence of ‘shadow’ component, on which the ‘command
money and the market to the margins of the system, principle’ increasingly came to depend for its
where they handled areas and activities in which effectiveness. As long as the Soviet system
command had been revealed as counterproductive remained a ‘command economy’, commands had to
during War Communism. That system, the ‘less have last word, and money remained largely
than absolute command economy’, substantially relegated to the sidelines, exercising its influence
industrialized, triumphed in the Great Fatherland within the quasi-monetized instruments (‘economic
War, and recovered to an almost perfect replica of levers’) of the command mechanism and the
its pre-war self by 1950. But by then the strains of distorted markets of the second economy.
its inherent inflexibility and the bounded rationality This inherent conflict, played out over Soviet
of the system’s planners and managers began telling history, revolves around a number of fundamental
on continuing
1842 Command Economy
dualities, elemental oppositions which characterize the ‘oppositions’ in the system, until the central
these primary forces. The ‘command principle’ leadership, largely unintentionally and out of
derives most basically from the urge, the will to ignorance, destroyed the ‘command economy’ in
control, to ‘rationally’ determine and direct the the radical systemic and economic ‘restructurings’
future, exercised by a ‘gnostic’ elite, immanent in beginning with perestroika in 1987.
the Party. It knows what needs to be done, by whom
and how, and can tolerate no dissent or deviation.
Juxtaposed to this ‘Will of Society’ stand the
millions of independent ‘wills’, desires and
See Also
objectives, anarchically coordinated through ‘the
market’, whenever that set of institutions broke ► one; of •' .
through the barriers and limits placed by ► • ..
‘command’. This provides the foundation for the ► Informal Economy
eternal struggle between ‘central priorities and ► Second Economy (Unofficial Economy)
control’ and ‘agent incentives and capabilities’. ► ,? t Bud Constraint
This opposition is severely aggravated by ► Soviet Economic Reform
urgency, by ‘virtuous haste’, in the pursuit of ► ,? idet G. o C i d
overriding social goals and central objectives. For
the mobilization for, and focus of resources on,
Bibliography
these priorities trample on the information,
capabilities and goals of individual and organi- Aslund, A. 1995. How Russia became a market economy.
zational agents which must perforce implement that Washington, DC: Brookings Institution.
mobilization, implement those priorities. Berliner, J. 1957. Factory and manager in the USSR.
Cambridge, MA: Harvard University Press.
‘Effectiveness’ in the pursuit of social objectives
Bomstein, M., ed. 1973. P l a n a n d m a r k e t . New Haven:
becomes opposed to ‘efficiency’ in the attainment Yale University Press.
of any objectives, denies trade-offs based on local Eckstein, A., ed. 1971. Comparison of economic systems:
information and incentives, and hence blocks Theoretical and methodological approaches. Berkeley:
flexibility in response to changing circumstances. University of California Press.
Ericson, R. 1983. A difficulty with the ‘command’ allocation
Indeed, the single-minded pursuit of overriding mechanism. J o u r n a l o f E c o n o m i c T h e o r y 31:
objectives, of absolute priorities, naturally disrupts 1-26.
the fine coordination, the requirements of ‘balance’, Ericson, R. 1991. The classical Soviet-type economy: Nature
necessary to consistently and efficiently pursue any of the system and implications for reform. J o u r nal o f
E c o n o m i c P e r s p e c t i v e s 5 (4): 11-27.
objectives. Ericson, R. 2005. Command vs. shadow: The conflicted soul of
Throughout the history of the Soviet Union, the the Soviet economy. C o m p a r a t i v e E c o n o m i c
needs of centralization, given Soviet social goals, S y s t e m s 47 (4): 1-27.
stood in fateful opposition to the necessity to Freris, A. 1984. The Soviet industrial enterprise.
New York: St Martin’s Press.
decentralize in order to keep the system tolerably Garvy, G. 1977. M o n e y , f i n a n c i a l f l o w s a n d
functioning. The latter necessity spawned repeated c r e d i t i n t h e Soviet U n i o n . New York: Ballinger
(partial) remonetizations and a ‘second economy’ Publishing
that both shored up the operational foundations of Company.
Granick, D. 1954. M a n a g e m e n t o f t h e i n d u s t r i a l
the ‘first economy’ and undermined its long-term f i r m i n t h e USSR. New York: Columbia University
viability, corroding its ideological and systemic Press. Grossman, G. 1962. The structure and organization of
foundations. Money so unleashed intensified the the Soviet economy. Slavic Review 21: 203-222. Grossman, G.
dysfunctions and contradictions of the ‘command 1963. Notes for a theory of the command economy. Soviet
Studies 15 (2): 101-123.
economy’, spurring further repeated ‘reforms’ and
Grossman, G. 1966. Gold and the sword: Money in the Soviet
‘experiments’ that merely further aggravated the command economy. In Industrialization in two systems, ed. H.
inconsistencies, Rosovsky. New York: Wiley. Grossman, G. 1977. The ‘second
economy’ of the USSR. Problems of Communism 26 (5): 25-
40.
Commodity Fetishism 1843
their resources, their preferences, and the produc- without tools, farmers would not be able to grow
tion technologies available to them. But what is food for the workers. It is therefore evident in
really going on is a struggle over the distribution of retrospect that workers and farmers are engaged in a
the economic surplus at the point ofproduction. That collective endeavour. But it is not similarly evident
reality is opaque. Economic agents are therefore prospectively. From that vantage point, it seems
governed by the appearance of rational economic only that farmers and workers - and also the
agents maximizing payoffs to themselves. In his capitalists who provide them with means of
account of commodity fetishism, Marx shows how production - are making individual choices aimed at
this inevitable misperception helps to reproduce and bringing about the best outcomes for themselves,
sustain the underlying reality'. given the constraints they face. Even if they believe
When Marx expressly addresses this phenom- that these essentially egoistic activities are somehow
enon at the conclusion of the opening chapter of the socially beneficial, they can justify this belief by
first volume of Capital (1867), the economic agents appealing to the workings of an 'invisible hand'.
he describes are property-holding individuals. Thus Because there is no visible hand that directs the
it is not exactly capitalism that he aims to model, process, the teims of interaction appear as // they are
but 'simple commodity' production', an ahistorical forces of naUire to which individuals must
idealization. However, the cogency of his account is accommodate. Thus market relations appear as
unaffected as his analysis becomes more historical infrangible constraints that human beings are
and concrete - to the point that the direct producers obliged to operate within, not as social constructions
are, as in fiill-fledged capitalism, a property less that human beings can change. In teims that Kant
proletariat with nothing to exchange except, of introduced and that Marx, following Hegel,
course, their' own labour power. Commodity effectively assumed, freedom (autonomy) is then
fetishism is therefore a general and pervasive fact forfeit. Wills are hetero- nomously determined,
wherever capitalist social relations hold sway. Thus governed by laws of an (apparently) impersonal
the term denotes a systemic opacity at the level of other (the market system itself). To be free, we must
appearance that helps to hold economic agents in therefore take control of the aggregation mechanism
thrall by masking the exploitation of labour. wc have concocted. We do so by putting reason in
Because this misperception sustains the exploitation command - not just at the individual level of the
that engenders it, revolutionaries intent on rational economic agent, but at the societal level as
overthrowing capitalism must tear away the veil of well.
illusion by revealing the exploitation of workers What Marx says about commodity fetishism is
that exchange relations conceal. concise and intriguing. For these reasons, and
Marx does not directly address how commodity because it summarizes the veiy abstract analysis of
fetishism comes into being or how it is sustained. the commodity form with which Capital begins, his
But he does provide fragments of an explanation account of the phenomenon has always been well
when he focuses on the atomizing effects of market known. ‘Commodity fetishism’ is one of those
relations. All resource allocation mechanisms are terms that everyone associates with Marx. But, even
social in the sense that they bring together a host of in what remains of Marxist circles, the basic tenets
disparate and heterogeneous economic activities. of Marx’s account have faded from ongoing
However, where the commodity form prevails, the discussions. A number of factors have contributed
social character of market transactions is apparent to this turn of events: among them, the legacy of the
only after goods and services are produced. The so-called ‘value controversy’ of the 1970s; the
workers know that the corn they consume is efforts of mathematical economists in the 1970s and
produced by farmers, and the farmers know that the 1980s to put the categories of Marxist economic
tools they use in growing com are made by workers. analysis on a sound, analytical footing; and attempts
Everyone also knows that, without food, workers by analytical philosophers, working on Marxist
would not be able to make the tools farmers use; themes, to reconstruct and, when possible, defend
and that, core Marxist positions. The
Commodity Fetishism 1845
conclusion that has emerged is that, pace Marx, Marx treated (Smithian) political economy the same
there is nothing special about the commodification way. He assumed that it correctly describes
of labour power and therefore that the theory of ‘economic facts’. The task, then, was to interpret
surplus value cannot be sustained in the way that those facts - in order to reveal the alienation they
Marx believed. Nowadays, it is only the most express and, in so doing, to advance the emanci-
doctrinaire Marxists who uphold the labour theory patory project of Left Hegelianism. How successful
of value, the basis for Marx’s account of commod- Marx was in implementing this programme is
ity fetishism. This fact along with the decline of subject to debate. What is clear is that, as the focus
political movements that identify with the Marxist of his theoretical work turned away from Hegelian
tradition and, its inevitable consequence, waning philosophy towards political economy, history and
interest in Marx’s work itself, has, for the time politics, he became disabused of the idea that Adam
being, made commodity fetishism a matter of Smith or any other classical economist had gotten
concern mainly to historians of economic thought. political economy descriptively right. His life’s
Not long ago, the situation was quite the oppo- project, thereafter, was to rework the conceptual
site. From roughly the 1950s through the 1970s, apparatus of classical economics - more usually in
commodity fetishism played a central role in the its Ricardian, not Smithian, form - with a view to
two most important and innovative tendencies in revealing the real ‘laws of motion’ of the capitalist
Marxist theory: Marxist humanism and structuralist mode of production. In this endeavour,
Marxism. These were opposing tendencies, Feuerbachian philosophical anthropology seemed to
politically and substantively. But they converged on play no role. But, following the lead of Georg
according commodity fetishism centre stage. Lukacs ( ) several
Marxist humanists sought to de-Stalinize decades earlier, the Marxist humanists pointed out
Marxism by recovering its Left Hegelian roots. This that there was, in Capital, an explicit point of
meant reading Marx’s work through the prism of his connection - in the text on commodity fetishism. It
early writings, before he broke with his ‘erstwhile was there that Marx brought his analysis of the
philosophical conscience’, as he proclaimed in 1845 commodity form to completion. But it was also
in The German Ideology. For the Left Hegelians, there that, in modelling the commodity form, Marx
Ludwig Feuerbach’s philosophical anthropology, identified the objectification of essential human
elaborated in The Essence of Christianity (1841), traits in the process of capital accumulation. In
was fundamental. There Feuerbach ‘inverted’ the consequence, capital, becomes a ‘fetish’, a god in
theological dogma that ‘God makes Man’ by Feuerbach’s sense - one who controls economic
showing how the God idea is an ‘objectification’ of behaviour by force of (illusory) power.
essential human traits. Lacking materiality, God is Structuralist Marxists, like Louis Althusser,
purely an objectification, an ‘alienated’ expression were intent on reading Left Hegelianism out of the
of the human essence. In taking consciousness of Marxist canon. They therefore treated Marx’s
this fact, one recovers essential humanity and references to fetishes and gods as ironic figures of
becomes emancipated from the thrall of its systemic speech, even as they attempted to enlist the text on
misrepresentation. In the Paris Manuscripts (1844), commodity fetishism in the service of opposition to
Marx applied the Feuerbachian programme to Marxist humanism. Borrowing a concept from the
objects of labour; ‘objectifications’ too of essential French philosopher Gaston Bachelard (1884-1962),
humanity, but also material things and therefore not Althusser ( ) dispar
objectifications only. Feuerbach arrived at his aged Marx’s early work by asserting the existence
conclusions by ‘interpreting’ the theology of Right of an ‘epistemological break’ within the Marxist
Hegelian theologians. His working hypothesis was corpus. What he had in mind was roughly a
that they had gotten the concept of God right, but ‘paradigm shift’ - not, however, within an ongoing
that they radically misconstrued what the concept scientific practice but between pre-scientific modes
means. In the Paris Manuscripts of thought and the inception of a new science. In
Althusser’s account, two
1846 Commodity Fetishism
previously monumental epistemological breaks had we should not conclude that there is nothing viable
occurred - one that established mathematics in in the concept or in the theoretical traditions that,
ancient Greece, and one that established the until recently, magnified its importance.
sciences of nature in 17th-century Europe. Marx’s Hegelianism certainly, and structuralism possibly,
achievement was supposedly on a par with these; he still have much to teach us. The last word may not
opened up a science of history. He did so by yet have been said on the theory of surplus value,
anticipating the structuralist turn the ‘human either. If and when interest in Marx resumes, it will
sciences’ (in France mainly) would later take - first certainly be useful to revisit these issues. The notion
in linguistics and psychology, later in anthropology of commodity fetishism played a key role in mid-
and psychoanalysis. Specifically, in Capital and and late 20th-century Marxism. The core idea it
other writings of his maturity, Marx explained a articulates - that necessary misperceptions sustain
range of diverse ‘surface’ phenomena by construing the capitalist order - can again provide useful
them as effects of the workings of a relatively small insights. The concept may not be forever doomed to
number of underlying, generally invariant ‘deep’ be of historical interest only.
structures. The text on commodity fetishism lent
itself to this construal of Marx’s explanatory
practice in as much as it depicted the perceptions of
economic agents as effects of the unseen but
See Also
causally efficacious process of surplus value
extraction. Thus Marx’s account can be seen as a ►
theory of necessary (systemically induced)
misperception - consonant with notions of expla-
► Marx’s Analysis of Capitalist Production
nation that contemporaneous structuralists
endorsed. Perhaps the most innovative use
Althusser made of commodity fetishism was in his Bibliography
theory of ideology, according to which modes of Althusser, L. 1965. F o r M a r x , 1969. New York: Pantheon
production constitute experiential subjectivity by Press.
‘interpellating’ the human subjects who support or Althusser, L. 2002. Ideology and ideological state apparatus.
In L e n i n and philosophy and other
‘bear’ them. e s s a y s . New York: Monthly Review Press.
We now inhabit a different intellectual universe. Althusser, L., and E. Baliber. 1968. R e a d i n g c a p i t a l .
In the past several decades, it has come to be widely London: New Left Books, 1970.
believed, by erstwhile Marxists as much as by Cohen, G. 1988. The labour theory of value and the concept of
exploitation. In H i s t o r y , l a b o u r a n d f r e e d o m :
‘bourgeois economists’, that Marx’s focus on T h e m e s f r o m M a r x . Oxford: Clarendon Press.
production rather than exchange inhibited the Feuerbach, L. 1841. T h e e s s e n c e o f C h r i s t i a n i t y .
development of analytical economic tools. In so far Amherst: Prometheus, 1989.
as this belief is sound, the emphasis Marxists Fromm, E. 1975. M a r x ’ s c o n c e p t o f m a n . New York:
Continuum.
placed on commodity fetishism is partly to blame. Lukacs, G. 1923. History and class
The explanatory strategies of Marxist humanists c o n s c i o u s n e s s . London: Merlin Press, 1967.
and of structuralists have fallen into disrepute, too - Marx, K. 1844. Paris manuscripts (also called economic and
largely because, in both cases, though for different philosophical manuscripts). In C o l l e c t e d w o r k s , ed.
Karl Marx and Frederick Engels, vol. 3. New York:
reasons, the alleged connections between International Publishers, 1975.
appearance and reality were never satisfactorily Marx, K 1845. The German ideology. In C o l l e c t e d
explained. No sustainable account was given either w o r k s , ed. Karl Marx and Frederick Engels, vol. 5.
of how interpretation should proceed in the Marxist New York: International Publishers, 1976.
Marx, K 1867. C a p i t a l , vol. 1. Moscow: Progress Pub-
humanist case or, in the structuralist case, of how
lishers, 1965.
deep structures can be discerned in surface Roemer, J. 1981. A n a l y t i c a l f o u n d a t i o n s o f
phenomena. Thus, commodity fetishism has fallen marxist economic theory. New York:
on hard times. However, Cambridge University Press.
Steedman, I. 1981. M a r x a f t e r s r a f f a , rev. edn. New
York: Shocken.
Commodity Money 1847
BC. Coinage has developed in parallel and broadly and later in England. For a more complete discus-
similar ways in these areas. sion of medieval European coinage, see Spufford (
The metals most commonly used have been ).
gold, silver and copper (in decreasing order of Coins appear to have been used in India in the
scarcity), in varying degrees of fineness (silver early 4th century bc and were probably used before
mixed with substantial amounts of copper, called then. The earliest coins were so-called punch-
billon). Lead, tin and various copper alloys (bronze, marked coins and were adaptations of Greek
brass, potin) have also been used, although less prototypes. Coins were first used in China and the
frequently than the more common metals. The metal Far East about the same time as in India. The
is either mined or acquired through trade. The most distinctive bronze coinage with the square hole in
common method of coinage is striking with a die, the middle first appeared in the 3rd century BC.
although cast coins are also found. In many legal Early coins in eastern Islamic lands were copies of
traditions the right of coinage is a prerogative of the Byzantine gold and bronze coins; those in the East
public or central authority, although it may be were copies of Sassanian silver coins. For more on
delegated or leased to regional authorities or private coinage in India and the Far East, see Williams (
parties. This prerogative may also extend to mining. ).
In other words, the rales governing the supply of Until the 19th century, coins typically bore no
commodity money vary from government indication of face value, and their market value
monopoly to minimal regulation. could fluctuate even relative to one another. From
In Europe and the Mediterranean, coinage - an the late Middle Ages, governments increasingly
invention mythically linked to Croesus, King of sought to regulate the value of coins in some
Lydia - began near the Aegean Sea in the 6th manner, in particular assigning face value or legal
century bc. The use of money developed consid- tender value by decree. It became desirable to turn
erably in Greek and Roman times, leading to a the collection of objects used as a medium of
three-tiered system of gold, silver, and copper exchange into a stable system with fixed exchange
denominations. In the Roman empire, the provision rates between the objects. This was achieved to a
of coinage was a government monopoly. The large degree with bimetallism, a system in which
collapse of the empire in the West led, after a long gold and silver coins remained concurrently in
transition, to a purely silver-based monetary system, circulation at a constant relative price. Its heyday
with a largely decentralized provision of minting. was the mid-19th century, but beginning in 1873 the
Uniformity of coinage was restored under system was quickly abandoned, and by the First
Charlemagne but quickly disappeared along with World War countries were using either gold only or
political fragmentation. Gold returned in common (in Africa and eastern Asia) silver only.
use from the mid- 13 th century. By the 14th (Bimetallism is discussed in more detail in Redish ,
century, most mints in western Europe operated and Velde and Weber .) The development of
along similar lines, with more or less unrestricted banking in the 19th century also led to increased use
coinage on demand provided by profit-making of (convertible) notes and other monetary
mints. A great multiplicity of monetary systems instruments.
persisted, giving rise to both foreign exchange The First World War brought about the suspen-
markets (the earliest financial markets) and money sion of convertibility of the notes in many countries.
changers (the first financial intermediaries). Most countries returned to convertibility between
The first instances of token coinage (coins that 1926 and 1931, but the onset of the Great
are intrinsically useless but are claims to fixed Depression reversed the movement. After the
amounts of the commodity) appeared in the 15th Second World War the only major country whose
century in Catalonia. Notes convertible on demand currency was in any way directly tied to a
appeared in the 17th century, in Sweden commodity was the United States under the Bretton
Woods system: dollars were convertible
Commodity Money 1849
by non-residents of the United States into gold on value of money rises high enough, it becomes
demand, while other currencies of the system were worthwhile for agents to turn metal into coins at the
convertible into dollars. The link between gold and mint at the set price, thus increasing the quantity of
the dollar was severed in 1971. Fiat money money. Such a self-regulating commodity money
standards are now universal. system provides an anchor to the price level. This
has been touted as one of the advantages of a
commodity money system, particularly in the case
The Nature of Commodity Money of the gold standard.
The question of price-level determination
The definitions of commodity and fiat monies given becomes more complicated when multiple com-
above make it seem as if there is a clear distinction modity monies are made out of different commod-
between the two. It is more helpful, however, to ities. An example is the circulation of full-bodied
think of media of exchange along a continuum. An gold and silver coins. Should the unit of account be
object serving a purpose as a medium of exchange the gold coin or the silver coin? This matters
has value above its intrinsic content, reflecting the because under a commodity money system a
value of the service as a medium of exchange. monetary authority does not have the ability to set
Because the value of a commodity qua com- the exchange rate between monies of different
modity and the value as a medium of exchange can commodities forever. Thus, to the extent that the
differ, the value of all commodity monies has a fiat unit of account is used in contracts to determine the
component. A pure fiat money i s one for which this amount of future payments, the choice of the unit of
fiat component makes up its entire value. A nice account can affect the allocation of goods and
theoretical discussion of commodity and fiat monies services. This was one of the issues surrounding the
is given by Sargent and Wallace ( ). possible adoption of a bimetallic standard
mentioned above.
The inability of the monetary authority to set the
Price-Level Determination exchange rate between different monies goes away
under a pure fiat money system. Because fiat money
It is natural that the medium of exchange in an is (virtually) costless to produce, the monetary
economy is what becomes the unit of account, the authority can costlessly exchange one money for
unit in which debt contracts and the prices of goods another to maintain whatever exchange rate is
and services are expressed. It is natural because the desired between different monies that it issues.
money appears on one side of virtually every
transaction.
Because commodity money has an intrinsic Monetary Policy
value apart from that which it obtains by being a
medium of exchange, its relative price will not be The fact that a commodity is used as money alters
zero. Thus, in a commodity money economy, the its value. This is because part of the total quantity of
value of money (the inverse of the price level) is the commodity - namely, the metal locked up in the
bounded away from zero. Moreover, in a canonical form of coins, or the reserves held by the monetary
commodity money system (see below) with authority - is not available for nonmonetary uses.
unlimited minting at a set price, the value of money The allocation between monetary and non-monetary
and its quantity tend to remain within a band. If the uses is determined in equilibrium. Restrictions on
value of money falls far enough, it becomes the ability to change this allocation, such as
preferable to exercise the option and convert some restrictions on melting or exporting coins, or
of it into other, non-monetary uses, thus reducing limitations on the minting of metal, will have an
the quantity and preventing the value from falling effect on the equilibrium value of the money even if
further. Conversely, if the it has no immediate effect on
1850 Commodity Money
the allocation itself. (Since money is an asset, its the relative prices of goods and money. And, just as
valuation is forward looking.) Thus, there is scope occurs with fiat money, inflation has the effect of
for monetary policy under a commodity money transferring wealth from nominal creditors to
standard, although what constitutes monetary policy nominal debtors. Since governments generally
is different from and more limited in scope than tended to be debtors, debasements were used to
what holds under a pure fiat money standard. reduce the amount of their debts. Historically,
Monetary policy consists in actions that tend to debasements also had the secondary effect of
alter the value of money. In a commodity money increasing seigniorage revenue, since the quantity
system, the value of money is the value of the of coins minted tended to increase significantly
option we have described. (The strike price of the after debasements that involved the introduction of
option is zero, since the commodity is the money.) new coins (see Rolnick et al. ; Sargent and Smith ).
Most aspects of monetary policy with commodity Debasements were also used by governments to
money consist in modifying this option, typically by remedy malfunctions of a multiple-denomination
modifying the institutions governing the exercise of commodity money system (see Sargent and Velde ).
the option rather than by modifying the quantity of A second type of monetary policy adds or
money, which the authority usually cannot control modifies restrictions on the conversion of com-
directly. When the monetary authority is directly modity into money or money into commodity. For
involved in the provision of the money, it may example, minting might be restricted by quantity, in
directly profit from its actions. Potential profit is which case the authority decides how much to mint.
often an important consideration of monetary Minting might be unlimited but subject to a fee,
policy. called seigniorage. Governments typically charged
The canonical form of a commodity money such a fee, both to cover the actual costs of minting
standard comprises the following. One or more (called brassage) and as a tax (England was the first,
commodities are chosen to be the standard to which in 1666, to provide minting at no cost). The rate of
the monetary system will be anchored. The this tax or, equivalently, the price paid by the mint
monetary authority defines the specifications of the for bullion might be changed. These restrictions
monetary objects (weight, fineness) and defines the tended to alter the allocation of the commodity
unit of account in terms of these monetary objects. between monetary and nonmonetary uses, and
The conversion of commodity into commodity hence the value of the commodity and the money.
money and vice versa is as costless as possible. In A third type of monetary policy sets limits to the
particular, the monetary authority provides for legal tender quality of certain coins, or changes
unlimited (and even costless) conversion of the their legal tender value. Since coins did not have
commodity into monetary objects (coins or notes). face values until the 19th century, it was up to
Conversely, it places no hindrances on the monetary authorities to set, and from time to time
conversion of monetary objects into commodities alter, the legal tender values of coins. Frequently,
(coins can be melted, notes are convertible on foreign coins were authorized as legal tender at
demand), nor does it place limitations on the con- rates set for domestic coins. Countries attempting to
sumption of the commodity or its service flow (free maintain bimetallism in the face of fluctuations in
possession, unrestricted import and export of the the relative price of gold and silver often had to
commodity). The monetary objects are unlimited adjust the face value of either their gold or silver
legal tender. coins. Changes in the legal tender values could also
One type of monetary policy modifies the spec- be motivated by fiscal considerations or by attempts
ifications of monetary objects and units of account. to target a particular price level or exchange rate.
An example is debasement, which is reducing the The physical nature of the medium of exchange
commodity content of a monetary object (and, led to a particular set of concerns.
frequently, of the corresponding unit of account).
The result of debasement is inflation, since nominal
prices will be adjusted to maintain
Commodity Money 1851
Coins, like anything else, depreciate with use, It has also led to a greater scope for monetary
through wear and tear. Since coins of different policy because the supply of money can be changed
values have different usage rates, the depreciation almost costlessly. However, along with this greater
rate varied by denomination. Also, being roughly scope has come the greater potential for
constant over time, depreciation depended on the governments to use inflation to collect seigniorage
age of the coin. Finally, imperfect minting tech- revenue or to reduce the real value of their debts.
nology as well as actions by the public (clipping, How to use the freedom that commodity money
sweating) aggravated the disparities between coins. restricted is still a matter of debate.
This factor introduced heterogeneity among coins
and hindered the achievement of a stable and
uniform monetary system. Improvements in coin
production partially remedied the problem, as did
See Also
periodic recoinages.
When the monetary objects consist not only of ► Bimetallis
coins but also of paper currency or tokens that are ► Bretton W
demand promises to the commodity, a fourth type ► Fiat Mone
of monetary policy is available: suspension of
convertibility. The monetary authority can refuse to
honour the promise of convertibility for some
period of time. An example is the suspension of Bibliography
convertibility by the Bank of England between 1797
and 1819 during the wars with France. During the Bordo, M., and F. Kydland. 1996. The gold standard as a
commitment mechanism. In M o d e r n p e r s p e c t i v e s
19th century suspensions were not uncommon o n t h e g o l d s t a n d a r d , ed. T. Bayoumi, B.
during financial or fiscal emergencies, with the Eichengreen, and M. Taylor. Cambridge: Cambridge
understanding that the suspension would end after University Press.
the emergency and convertibility would be restored Kiyotaki, N., and R. Wright. 1989. On money as a medium of
exchange. J o u r n a l o f P o l i t i c a l E c o n o m y 97:
at the preexisting parity. This understanding has
927-954.
been described as a state-contingent gold standard Luschin von Ebengreuth, A. 1926. AllgemeineMiinzkunde und
(see Bordo and Kydland ). Geldgeschichte des Mittelalters und der neueren Zeit.
When there is a central bank, an additional Munich: R. Oldenbourg.
Redish, A. 2000. B i m e t a l l i s m : A n e c o n o m i c a n d
monetary tool is to change the discount rate, the
h i s t o r i c a l a n a l y s i s . Cambridge: Cambridge
interest rate at which the central bank lends reserves University Press.
to the banking system. During the gold standard Rolnick, A., F. Velde, and W. Weber. 1996. The debasement
period, this was the primary means by which central puzzle: An essay on medieval monetary history.
J o u r n a l o f E c o n o m i c H i s t o r y 56: 789-808.
banks affected the exchange rate of their money
Sargent, T., and B. Smith. 1997. Coinage, debasements, and
against the monies of other countries. Gresham’s laws. E c o n o m i c T h e o r y 10: 197-226.
Sargent, T., and F. Velde. 2002. T h e b i g p r o b l e m o f
s m a l l c h a n g e . Princeton: Princeton University
Conclusion Press.
Sargent, T., and N. Wallace. 1983. A model of commodity
Commodity money is a thing of the past; countries money. J o u r n a l o f M o n e t a r y E c o n o m i c s 12:
163-187.
worldwide now use fiat money standards. This Sputford, P. 1988. M o n e y a n d i t s u s e i n m e d i e v a l
practice has led to an efficiency gain in the sense E u r o p e . Cambridge: Cambridge University Press.
that resources that were once tied up in coins are Sussman, N., and J. Zeira. 2003. Commodity money inflation:
now available for consumption and production Theory and evidence from France in 1350-1436.
J o u r n a l o f M o n e t a r y E c o n o m i c s 50: 1769-
(perhaps prompting John Maynard Keynes to refer 1793.
to gold as the ‘barbarous relic’). Velde, F., and W. Weber. 2000. A model of bimetallism.
J o u r n a l o f P o l i t i c a l E c o n o m y 108: 1210-1234.
Williams, J., ed. 1997. M o n e y : A h i s t o r y . New York: St
Martin’s Press.
1852 Commodity Reserve Currency
humanitarian importance in case of shortages has seemed impossible. When there has been a
of foodstuffs such as rice. great piling up of stocks (as at this writing in early
1986 ), it has seemed as if mobilization of stocks
These criteria would admit most of the world’s held by the European Economic Community, the
important grains, fibres, fats and oils, beverage United States, various other governments such as
crops, primary metals - and probably a number of Brazil, cartels such as that for tin, and private
basic chemicals and forest products. Amounts of the concerns such as copper companies, might permit a
various commodities in the basket would reflect very rapid start.
their weight in world production and/or trade. Starting a CRC dining a period of widespread
The CRC basket must be large (worth several shortages could be highly inflationary; starting it
million US dollars), because for efficient trading dining a general economic downswing could mit-
each element must be a multiple of a wholesale lot. igate a recession. There is debate as to whether
This large basket-size would be appropriate, since accumulation of a reserve must stick to previously
CRC is designed as a vehicle for holding national agreed proportions of the different commodities, or
monetary reserves rather than for retail or even whether (as proposed by Keynes and more recently
wholesale trade. by Kaldor) the IMA should buy individual
IMA holdings of the various commodities must commodities at such dates and prices as seem wise.
be parcelled out to points of delivery and storage On this route, the composition of the CRC basket
along the normal trade routes of the commodities. and the selling-price offer would grow out of the
Correspondingly, IMA purchases and sales must be process of accumulation. This variant, plainly,
handled by agents at the various trading points. To would increase the similarity of IMA’s operations
tell whether at any moment baskets must be bought to those of commodity ‘stabilization’ groups and
or sold, IMA must sum up bids or offers reported by remove the impersonality held to be the central
these agents. IMA must take the initiative - virtue of the Goudriaan/Graham scheme.
instructing all agents to sell or buy - whenever the
sum of bid prices for the elements of the basket
adds up to the posted selling price for the basket, or
the sum of asked prices to the buying price.
Objections to CRC have hinged primarily on See Also
costs and/or on the difficulties of getting a CRC
system under way. There has been continuing
debate about the size of the reserve needed to
validate IMA selling offers, about the cost of
holding stocks of various suggested commodities,
etc. Such costs must be compared with benefits
from reducing cyclical fluctuations in primary References
producers’ incomes, etc. - and of doing so without
Bennett, M.K., et al. 1949. I n t e r n a t i o n a / c o m m o d i t y
engaging in commodity-by-commodity operations. stockpiling as an economic stabilizer.
Benjamin Graham used to stress that the success of Stanford: Stanford University Press.
such operations hinges on restriction of commodity Goudriaan, J. 1932. H o w t o s t o p d e f l a t i o n . London:
production, whereas success of CRC would The Search Publishing Company.
Grubel, H.G. 1965. The case against international commodity
stimulate production. For this and other reasons, the reserve currency. O x f o r d E c o n o m i c P a p e r s
cost/benefit problem is complex. 17(1): 130-135.
During episodes of worldwide commodity Hailwood, P. 1986. External economy arguments for com-
stringency, accumulation of a commodity reserve modity stockpiling: A review. B u l l e t i n of
E c o n o m i c R e s e a r c h 38(1): 25-41.
Harmon, E. 1959. C o m m o d i t y r e s e r v e c u r r e n c y .
New York: Columbia University Press.
Hart, A.G., N. Kaldor, and J. Tinbergen. 1964. The casefor an
international Commodity Reserve Currency.
1854 Common Factors
Geneva: United Nations Conference on Trade and Well-known examples of comovement in multi-
Development. 17 February 1964. E/CONF.46/p/7. variate data sets include business cycles in mac-
Conveniently accessible in N. Kaldor, E s s a y s o n e c o -
n o m i c p o l i c y - I I . London: Gerald Duckworth &
roeconomic indicators and shifts in the entire term
Co.. 1964. structure of interest rates, and researchers some-
Hayek. F.A. 1984. The future monetary unit of value. In times attribute this comovement to a small set of
M o n e y i n c r i s i s , ed. N. Barry. Siegel: Pacific underlying forces or latent ‘factors’ that influence
Institute for Policy Research.
Newbery, D.M.G.. and J.E. Stiglitz. 1981. T h e t h e o r y o f
each variable in the system. It is then convenient to
commodity price s t a b i l i z a t i o n . Oxford: think of the variation in each variable in the system
Clarendon. as the sum of two types of (unobserved)
components, one of which captures variation that is
due to 'common factors’, while the other captures
Common Factors all other variation. Models that attribute
comovement to common factors are called common
Heather M. Anderson factor models, and common factor analysis involves
the identification and study of the common factors.
Common factor models are particularly popular
in empirical settings because they offer parsimony,
and simplify estimation by reducing the number of
Abstract
parameters that need to be estimated. Economists
This article outlines and illustrates several types
will typically be interested in interpreting common
of common factor models that are found in the
factors so that they can explain why comovement
applied economics literature. These factor
occurs. Economic theory sometimes predicts
models include those based on principal
common factors. Perhaps the best-known example
components, classical factor analysis, dynamic
of this is the capital asset pricing model, in which
factor analysis and common features, and the
the (excess) return for the market portfolio is the
discussion addresses the identification and esti-
common factor in the (excess) return for each
mation of factors, as wel I as the use of common
individual stock. Another well-known example
factor models.
arises when the term structure of interest rates is
modelled, because the no arbitrage condition
Keywords
implies that the entire term structure is determined
ARM A processes; Autoregressive moving
by a single factor, which is the instantaneous
average (ARMA) processes; Canonical corre-
interest rate.
lations; Capital asset pricing model; Classical
A simple model that captures the concept of
factor models; Coincident indices; Common
common factors in a set of N time-series in
factors; Common feature models; Common
the (demeaned) vector Y, = (Yu, Y2,.............YNt)' is
trend model; Diffusion indexes; Dynamic factor
given by
(or index) models; Factor analysis; Kalman
filter; Principal component analysis; Real busi- Y, AF, + .
£r
ness cycle models; Reduced rank regressions;
Static factor models; Stone, J; Term structure of (1 )
interest rates; Time series analysis
where Ft is an /' x 1 vector that contains r common
factors, A is an N x r factor loading matrix (with
rank(A) = r < TV), and £t contains N idiosyncratic
JEL Classifications
components. With the use of II, and I, to denote the
C32
variance covariance matrices of Yt, F, and £t, it is
Economic analysis frequently involves the study of usual to assume that Z£: is diagonal, and it is also
variables that exhibit similar behaviour, and it is common to normalize the set of r factors in Ft by
often of interest to model this comovement. assuming that X/. I,..
Common Factors 1855
Model ( ) is similar to conventional factor factor loadings are also the least squares estimators
models that are often used in cross-sectional set- of the reduced rank regression given by
tings, although the variables are specified here as
time series, to facilitate discussion on dynamic Yt= A B P, + £„ (2)
(N\r) (r \ N )
factor models. If there is no serial correlation in Y,
or F„ or if estimation is undertaken as if this is the
where BYt contains the r factors F, = (/j, .. .,fr).
case, then ( ) is called a static factor model. It is
Anderson ( , ch. 11) provides a standard
usual to assume that F, and e, are jointly stationary,
reference.
that E(e,) = 0,E(F,e't) = 0, and that e, contains no
In practice, one needs to determine r before
serial dependence, but these latter assumptions can
estimating common factor models, and this is
be relaxed, depending on the type of factor model
2jLi+...+2^
under consideration. often based on the ratio given by -TJ-
/l, -+Xj,
There are many ways to identify the factors in This ratio measures the loss of information in the
( ), and standard techniques include the use of reduced rank system relative to an unrestricted
principal component analysis, factor analysis and system, and typically investigators will choose r so
canonical correlations to estimate the parameters of that this ratio is kept small. Bai and Ng ( )
various associated reduced rank regressions. More have developed model selection criteria
recently, researchers have focused on the time series that are consistent as {N, T} —> oo.
properties of multivariate data-sets, and modem Principal components are usually used for
factor models include dynamic factor (or index) dimension reduction, and economic interpretation of
models, and models that incorporate common the resulting factors is rarely straightforward.
features. These latter models incorporate various However, Stone ( ) has summarized a set of
ways of allowing the factors to follow specific series from the US national accounts, associating the
dynamic processes, or to contain specific time- first three principal components with income,
series properties. income growth and time and Chamberlain and
Rothschild ( ) have promoted the use of prin
cipal components for estimating approximate factor
models of asset prices. Stock and Watson ( )
Principal Component Models have suggested the use of diffusion
indexes (principal component factors associated with
Principal component analysis involves the intuition
large macroeconomic data sets) for forecasting key
that most of the variance in Y, will be attributable to
macroeconomic variables, and the interest here
variance in the r components in Ft. The factors Ft
centres on using information in the factors rather
are modelled as linear combinations of Yt, and their than interpreting the factors themselves.
identification is based on finding the r (orthogonal)
linear combinations of Y, that have the most vari-
ance. In practice this involves finding the eigen
values 1] > 12 > ■ ■ ■ > XN and associated eigen- Classical Factor Models
vectors /i, ..., fN of the form /, = f^Y, that are
associated with the roots of the equation |£y XI\ Classical factor models are closely related to prin-
= 0, where £v is an estimate of ~Ly. The [1, are cipal components models, but the underlying intu-
picked so that ^£y — Xj'j /?, = 0and/i'/i, = 1, and the ition and assumptions are different. In this case the
factors Ft are then defined by F, = (/j, .. ,,fr). This key assumption is that E£ is diagonal so that the e,
decomposition ensures that£ (fjej) = 0, but it implies describe idiosyncratic effects that are unique to each
that the &, (which are each linear combinations of variable in Yt, while the factors describe joint effects
(frl i,.. ,,/v)) will be correlated with each other so that in Yt. The assumptions that £(e,) = 0 and E{Fte!^j =
will not be diagonal. Principal components 0 still hold, and the s, are assumed to contain no
estimators of common factors and serial dependence. Under these
1856 Common Factors
assumptions X>- = AEpA' + E£, and estimates for A, Dynamic Factor Models
X/. and can be found by maximizing the function
Classical factor models are not well suited to
t=T multivariate analysis of time series because they
T 1
LT{AXI) = -- l n | E y | --YT&f, assume no serial correlation in eh and, if there are
z z
t=1 any dynamics in F„ then they are implicit and not
explicitly modelled. Dynamic factor models address
subject to the condition that ranMA) = r and a set of these concerns by treating the zt and Ft as
normalization restrictions that will uniquely identify autoregressive moving average (ARMA) processes.
the (r I 1)(/V I ir) parameters. Researchers often use The innovations that underlie the A processes for e,
the joint restrictions that Xp = I,. and that A'X, 'A is are assumed to be mutually uncorrelated, and
diagonal for normalization, but other normalizations uncorrelated with the innovations that underlie the
are common (see Anderson , eh. 14, for details). If F, at all leads and lags, but the factors themselves
Yt and st are normally distributed then LT (A, Es) is can be mutually correlated. Different variables in Y,
the log likelihood for Yt (if we ignore the constant can then move together because they are functions
term), but, even when Yt and st are not normally of the same factor (s), or because they are functions
distributed, the maximization of LT (A, Es) dehvers of different factors that are themselves correlated.
quasi-maximum likelihood estimates. There are The identification and estimation of small- scale
several ways of using the estimates of A and Es to dynamic factor models is sometimes based on
obtain estimates of the factors in F„ and perhaps the spectral techniques (see Geweke ; or Sargent and
best-known of these is Bartlett’s ( , ) Sims ), and use of the Kalman filter in the time
method based on generalized least squares given by domain (as in Engle and Watson , or Harvey and
Koopman ) provides an alternative approach.
F, = ( A Aj A Y,. Dynamic factor models have been particularly
popular for estimating factor models of business
cycles (as in Geweke and Singleton ), but they have
As above, it is necessary to determine r prior to also been used for studying the term structure
estimating the factors, and, on the assumption of (Singleton ) and fluctuations in employment across
normality, the likelihood ratio test statistic for different industrial sectors (Quah and Sargent ).
testing //„: r = s versus HA\ r > s is given by Recent work has shifted towards the identifica-
tion and estimation of common factors in large-
scale models, relying on the use of large A to obtain
—r[ln|fr| -ln|AA' + f e |] = — T Z;=f+1ln(l - I:),
consistent estimates of the factors. One strand of
this literature adopts a static framework and
c- 1-
standard principal components to estimate the
where the /., are the characteristic roots of A X.. A
factors, and then builds dynamic models of the
(in decreasing order) and A is estimated under the
factors. The resulting models are sometimes called
null. The test statistic is asymptotically distributed
approximate dynamic factor models. Applications
as a xl with q = [(A — sf — N — s]!2° of freedom
of this approach include Stock and Watson’s
under the null.
diflusion index ( ),
There are numerous applications of classical
and Bemanke and Boivin’s ( )
factor analysis to economic problems, and an early
estimation of a monetary policy reaction function.
example includes Stone’s ( ) factor anal Another strand of this literature allows different
ysis of the demand for N commodities. Another variables to depend on different lags of common
example includes a factor model of returns by factors. These ‘generalized dynamic factor models’
Deistler and Hamann ( ). are estimated using ‘dynamic principal compo-
nents’, which are the principal components of
Common Factors 1857
spectral density matrices at different frequencies. past history, the second factor is the next best
Applications of this latter approach include a study predictor, and so on, and the factors provide a set of
of business cycle dynamics in the United States leading indicators for Yt. When X, consists of
(Fomi and Reichlin ) and the development of a explanatory variables for Y„ then the factors are
coincident index for Europe (Fomi et al. ). often called coincident indices. One can base a test
of H0 : r = s versus HA : r > s on the test
statistic — rX'r^+1ln^l — 2, j , which has a y2
Canonical correlation-based models distribution with (m — s) (n — s) degrees of
freedom under the null.
Principal component and factor models assume that
the factors are linear combinations of the A
variables in Yt, but sometimes it is useful to assume
that the factors are linear combinations of M Common Feature Models
variables contained in another multivariate time
series denoted by X\. The variables mXt will often Common feature models are a special class of factor
include lags of the variables in Yt, but A, can also models in which the common factors have a
include variables that would be classified as statistical characteristic of interest, while the
explanatory variables in a regression context. The idiosyncratic components fail to have this
factors in ( ) can now be written in the form Ft = characteristic. Common features were first intro-
B'Xt (where rank(B) = r < min {iV, AT} ). In what duced by Engle and Kozicki ( ) when they
follows, we assume that the st in ( ) are white noise discussed serial conelation features - a situation in
and uncorrelated with Xr. which each of A variables is serially conelated, but
The main idea behind a canonical correlations there are linear combinations that are white noise.
approach is to find linear combinations of X, that Here, the presence of A r white noise linear
are strongly correlated with linear combinations of combinations implies a factor model in which there
Yu and, as for principal component models, the are r serially correlated factors (which are
estimators of common factors and factor loadings sometimes called common cycles). An earlier
are the least squares estimates of a reduced rank example of a common feature model is Stock and
regression. In this case the regression is Watson’s ( ) common trend
model which is valid when variables are
cointegrated (as in Engle and Granger ). In this case
the common factors are integrated of order one but
the remaining components (often called error
correction terms) are stationary. Other examples of
and the factors and factor loadings are related to the
common features include Vahid and Engle’s ( )
r largest roots of R = Xy'XyyXy 'XyyX,/ , which is
common trend-common cycle
the multivariate generalization of the (squared)
representation, and common nonlinearity (Anderson
correlation coefficient between two variables. If we
and Vahid ).
order these roots (also called squared canonical
The identification of common features involves
correlations) so that %>£> ... > /2, and let V,, V2, . ■
finding linear combinations of the data that do not
■ Vr be the r associated eigenvectors, then the factor
have the feature, and this can be done using a
loadings and factors are given by A,- = YyX, and
canonical correlations approach in which the
B,X, = Y.y1Y.yxY.x ' V,Xt. Anderson ( , eh. 12)
variables in Xt model the characteristic of interest.
provides a detailed To illustrate, lags of Yt are put into X, when testing
discussion of canonical correlations, while Izenman for serial correlation features in Yt, and lagged
( ) discusses the associated reduced levels are included in Xt when testing for common
rank regressions. When the variables in Xt are trends. Factors associated with the lowest eigen
simply lags of the variables in Y,, then the first values define linear combinations
factor is the best predictor of Y, based on
1858 Common Factors
that do not contain the feature, while factors Engle, R., and M. Watson. 1981. A one-factor multivariate
associated with the highest eigen values are used to time series model of metropolitan wage rates. J o u r n a l
of the American Statistical Association
model the common features. Johansen’s ( ) 76: 774-781.
procedure provides a well-known example of this, Fomi, M., and L. Reichlin. 1998. Let’s get real: A factor-
although inference in this case is based on non- analytic approach to disaggregated business cycle
standard (rather than y2) distributions because the dynamics. R e v i e w o f E c o n o m i c S t u d i e s 65:
453-473.
factors are non-stationary. Fomi, M., M. Hallin, M. Lippi, and L. Reichlin. 2000. The
A well-known example of a common feature generalized dynamic factor model: Identification and
model is the real business cycle model of King et al. estimation. R e v i e w of Economics and
( ), in which a common factor S t a t i s t i c s 82: 540-554.
Geweke, J. 1977. The dynamic factor analysis of economic
(productivity) generates the trend in output con- times-series models. In L a t e n t v a r i a b l e s i n
sumption and investment, and another factor (the s o c i o e c o n o m i c m o d e l s , ed. D. Aigner and A.
deviation of capital stock from steady state ) gen- Goldberger. Amsterdam: North-Holland.
erates the common cycle. Geweke, J., and K. Singleton. 1981. Maximum likelihood
‘confirmatory’ factor analysis of economic time series.
I n t e r n a t i o n a l E c o n o m i c R e v i e w 22: 133-137.
Harvey, A., and S. Koopman. 1997. Multivariate structural
See Also time series models. In S y s t e m a t i c d y n a m i c s i n
e c o n o m e t r i c a n d f i n a n c i a l m o d e l s , ed. C.
► Reduced Rank Regression Heij, H. Schumacher, and C. Praagman. Chichester:
► Time Series Analysis Wiley.
Izenman, A. 1980. Assessing dimensionality in multivariate
regression. In H a n d b o o k o f s t a t i s t i c s , vol. 1, ed.
Bibliography P. Krishnaiah. Amsterdam: North-Holland.
Johansen, S. 1988. Statistical analysis of cointegration vectors.
Anderson, T. 1984. An introduction to multivariate statistical Journal of Economic Dynamics and
analysis, 2nd ed. New York: Wiley. C o n t r o l 12:231-254.
Anderson, H., and F. Vahid. 1998. Testing multiple equation King, R., C. Plosser, and S. Rebelo. 1988. Production, growth
systems for common nonlinear components. J o u r n a l and business cycles II. New directions. J o u r n a l o f
o f E c o n o m e t r i c s 84: 1-36. M o n e t a r y E c o n o m i c s 21: 309-341.
Bai, J., and S. Ng. 2002. Determining the number of factors in Quah, D., and T. Sargent. 1993. A dynamic index model for
approximate factor models. E c o n o m e t r i c a 70: 191- large cross sections. In B u s i n e s s cycles,
221. i n d i c a t o r s a n d f o r e c a s t i n g , ed. J. Stock and
Bartlett, M. 1937. The statistical conception of mental factors. M. Watson. Chicago: University of Chicago Press.
B r i t i s h J o u r n a l o f P s y c h o l o g y 28: 97-104. Sargent, T., and C. Sims. 1977. Business cycle modeling
Bartlett, M. 1938. Methods of estimating mental factors. without pretending to have too much a-priori economic
N a t u r e 141: 609-610. theory. In N e w m e t h o d s i n b u s i n e s s c y c l e
Bemanke, B., and J. Boivin. 2003. Monetary policy in a data r e s e a r c h , ed. C. Sims et al. Minneapolis: Federal
rich environment. J o u r n a l of Monetary Reserve Bank of Minneapolis.
E c o n o m i c s 50: 525-546. Singleton, K. 1980. A latent time series model of the cyclical
Chamberlain, G., and M. Rothschild. 1983. Arbitrage, factor behavior of interest rates. I n t e r n a t i o n a l E c o -
structure and meanvariance analysis in large asset n o m i c R e v i e w 21: 559-575.
markets. E c o n o m e t r i c a 51: 1305-1324. Stock, J., and M. Watson. 1988. Testing for common trends.
Deistler, M., and E. Hamann. 2005. Identification of factor Journal of the American Statistical
models for forecasting returns. J o u r n a l of A s s o c i a t i o n 83: 1097-1107. ’
F i n a n c i a l E c o n o m e t r i c s 3: 256-281. Stock, J., and M. Watson. 2002. Macroeconomic forecasting
Engle, R., and C. Granger. 1987. Cointegration and error using diffusion indexes. J o u r n a l o f B u s i n e s s
correction representation, estimation and testing. a n d E c o n o m i c S t a t i s t i c s 20: 147-162.
E c o n o m e t r i c a 55: 251-276. Stone, J. 1945. The analysis of market demand. J o u r n a l
Engle, R., and S. Kozicki. 1993. Testing for common features of the Royal Statistical Society, Series
(with discussions). J o u r n a l o f B u s i n e s s a n d A 108: 286-382.’
E c o n o m i c S t a t i s t i c s 11: 369-395. Stone, J. 1947. On the interdependence of blocks of trans-
actions. J o u r n a l o f t h e R o y a l S t a t i s t i c a l
S o c i e t y , S e r i e s B , Supplement, 1-45.
Vahid, F., and R. Engle. 1993. Common trends and common
Common Land 1859
was able to support (with the aid of hay etc.) the parish boundary, meeting with those of
through the winter. neighbouring villages.
The extent to which the commons survived This was not true of all areas in the Midlands,
during the medieval period varied considerably however. Conversion of grazing to arable was more
from region to region, as the result of the interplay retarded in the Forest areas. Such areas were not
of a number of factors. Essentially, commons sur- necessarily densely wooded; the term forest was a
vived best where population densities were low and legal rather than a descriptive or environmental
where much land was unsuitable for arable term, referring to areas to which forest law applied.
agriculture. Thus large areas survived in the uplands This was a body of rules and restrictions originally
of the north and west. But there may have been intended to preserve deer for the royal chase and
more complex social and economic factors which which inter alia attempted to limit the destruction of
were also important in the preservation of common suitable habitats through the expansion of arable
grazing, especially in lowland areas, for the cultivation. In reality', forest law functioned more as
distribution of commons in medieval England does a source of revenue, for encroachments on the
not appear to be a direct and simple reflection of wastes were tolerated if a fine was paid.
demographic pressure or environmental factors. In Nevertheless, in areas like Rockingham Forest in
certain parts of the south and east of England - areas Northamptonshire, these instim- tional factors
of dispersed settlement and irregular field systems, combined with the relatively marginal naUtre of
poorly developed communal controls and local soils to preserve extensive areas of open waste.
individualistic agriculture - some communities seem As arable expanded at the expense of grazing, in
to have lacked the management structures necessary the areas of irregular field systems and dispersed
to act corporately to plough up areas of common settlement surviving commons often became foci for
grazing, even in areas of high population and settlement. The poorly developed nature of
moderately fertile soils, such as Norfolk. In addi- communal controls in such areas was probably the
tion, poor controls on the alienation of land and the stimulus for this development; farmers and
practice of partible inheritance led to an early smallholders moved to the edge of areas of common
proliferation of smallholders for whom the grazing not only for the convenience provided by
resources provided by the commons were of vital such a location, but also, as it were, to stake a visible
importance. In such areas the conversion of waste to claim to their use. The freedom to alienate land and
arable agriculture ground to a halt rather earlier than the irregular nature of field systems in these areas
in the classic open-field areas of the Midlands. made such settlement migration possible, for they
In the latter areas, communities were more allowed the acquisition of blocks of land adjacent to
cohesive and communal controls on agriculture and the common upon which farmsteads could be
land-use better developed. There were often established. This development seldom occurred in
stronger manorial controls on the alienation of land, areas of regular open-field systems, nucleated
holdings did not fragment to the same extent and settlement, and strong community controls.
there was less economic polarisation within the Thus it was mainly in the south-east and the
farming community'. It may be this that explains north-west of England that the medieval period saw
why in many of these areas so much of the wastes the development of straggling settlements around
were ploughed up in the 12th and 13th cenUtries, as the perimeter of commons. This process went
arable prices, and the need for food for furthest in parts of East Anglia, where complex
consumption, increased. Whatever the explanation, manorial organisation and the presence of
it appears that in many parts of the central substantial numbers of free tenants practising par-
Midlands, areas of common grazing were almost tible inheritance led to a proliferation of small-
entirely destroyed by the end of the 13 th century. holdings and wholesale migration away from earlier
The arable strips of many villages ran right up to village sites. Farmsteads clustered around
Common Land 1861
areas of common which thus became large village farming as specialized agricultural regions emerged
greens, leaving the parish church - marking the in the 15th century. Survival of commons also
original Saxon site of settlement - isolated in the allowed small cottagers to maintain a measure of
fields some distance away (Wade Martins ). economic independence, and the more extensive
Lowland commons have a distinctive form. commons attracted large numbers of squatters, often
They have straggling, concave outlines, formed by a part-time craftsmen. As a result, areas in which
series of rough arcs; roads funnel into the common large commons survived tended to have a reputation
where these arcs join. This characteristic shape for lawlessness. The opportunities which such areas
probably derives from the fact that commons are the offered to the poor also ensured that they often
remnants of more extensive areas of waste which experienced particularly rapid population growth in
had been continuously encroached upon for the early modem period. In Northamptonshire, for
centimes before their outline became fossilised, example, forest villages in the 17th century were on
usually in the early medieval period. The perimeters average around half as populous again as non-forest
of commons are often defined by particularly villages (Hoskins and Stamp , p. 52).
massive and ancient banks and ditches. Much enclosure of common land occurred dur-
Today, many commons are wooded, but this is ing the 16th and 17th centuries. Nevertheless, in
normally a relatively recent feature, resulting from a 1688 Gregory King estimated that there were still
relatively recent decline in the intensity of grazing. 10 million acres of heaths, moors, mountains and
By the end of their medieval period, commons had barren land in England and Wales, and a further 3
million acres of forests, parks, and commons, the
usually lost whatever woodland they had formerly
majority of which was common land. Today, the
carried; it had been destroyed by felling and over-
total area of common land in England and Wales is
grazing, and only names like ‘Wood Green’
around 1.5 million acres. Even allowing for a high
sporadically reflect their former nature. Medieval
degree of inaccuracy in King’s estimates, there has
woodland, in contrast, was not usually common
clearly been a dramatic reduction in the area of
land, but land which had been enclosed from the
common land as a result of Parliamentary
waste and over which the use and access of others
enclosures, mainly in the late 18th and early
had been limited (Rackham )•
decades of the 19th centuries. Enclosure was
By the end of the Middle Ages, there were
principally inspired by a desire on the part of the
considerable regional variations in the extent of
larger landowners to profit from the conversion of
common land. Commons survived better in the
the remaining commons to arable, or their
upland areas of the north and east than in the more
improvement as pasture, both of which were dif-
fertile lowland zone. Within the lowland areas, they
ficult or impossible where the land was subject to
tended to survive better in the south and east of
the use and access of a large number of local
England, and in the west country, than in the classic
inhabitants. The high prices of arable during the
open-field areas of the Midlands, with the exception
Napoleonic Wars were a particular stimulus to
of the forest areas, where they usually also survived
enclosure of open heaths and wastes, especially on
well.
the light soils of eastern England (Turner pp. 63-
These variations were a factor in the local and
93).
regional development of rural society in the post-
The enclosure of commons was, like the enclo-
medieval period. In particular, areas in which
sure of open fields, closely connected with the
extensive commons survived tended also to be the
decline of the small owner-occupier which had
areas in which the decline of the small freehold
continued almost uninterrupted throughout the early
farmer, which continued at varying rates throughout
modem period. As land fell into the hands of
the post-medieval period, was retarded. In areas like
relatively few people, so it became easier to obtain
the Fens, small farmers used their rights to
the agreement necessary to enclose, especially as
extensive commons to adopt forms of livestock
with the advent of Parliamentary
1862 Common Law
Enclosure a majority in favour of enclosure was Snell, K. 1985. Annals of the Labouring poor: Social change
judged on the basis of the area of land which the and agrarian England 1660-1900. Cambridge: Cambridge
University Press.
agreeing landowners held, rather than on their Turner, M.E. 1980. English parliamentary enclosure.
number. Folkestone: William Dawson.
Yet as well as being in part a consequence of the Wade Martins, P. 1975. The origins of rural settlement in East
decline of the small proprietor, enclosure of Anglia. In Recent work in rural
a r c h a e o l o g y , ed. P.J. Fowler. Bradford on Avon:
commons also served to accelerate this process. The Moonraker Press.
allotments received by those small farmers or
cottagers who were able to prove the legality of
their claim to common rights were seldom sufficient
compensation for the advantages lost through Common Law
enclosure, especially when legal and fencing costs
were taken into account. For many small farmers, P. S. Atiyah
enclosure was the final misfortune which led to their
departure from farming; for the small cottager,
enclosure often led to increased, if seasonal, reliance
on poor relief (Snell ). Common law is a system of law and legal processes
Today, the distribution of surviving common which originated in England shortly after the
land in England and Wales continues to be very Norman Conquest and after several centuries of
uneven, with more in the highland zone than in the continuous development was exported to the
lowlands. Within the lowlands, there are still fewer English colonies, and so came to be the basis of the
commons in the Midland counties than in the south law of the greater part of the United States, as well
and east, or in the west country. In lowland areas, as of Australia, New Zealand, most of Canada and
most commons are now principally valued for their (to a lesser degree) also of India, Pakistan, Bangla
amenity value, or for their role as nature reserves or Desh and many parts of Africa. The chief
Sites of Special Scientific Interest. Common land characteristic of the common law has always been
now survives as such only where it has been that its development has lain largely in the hands of
registered under the terms of the Commons the judges, and that it has therefore grown and
Registration Act of 1965. All existing common land changed incrementally, case by case, in the course
is listed in the final register, which was closed on 1 of actual litigation.
August 1972. In modem times the term ‘common law’ is used
in a variety of senses. In the broadest sense, it
continues to be used to refer to the entire system of
law originating in England which now forms the
basis of the law in the greater part of the former
See Also British Empire, often nowadays called the ‘common
law world’. In this sense the common law is often
contrasted with the ‘civil law’ which derives from
► Common Property Rights the law of ancient Rome, and today operates in most
of Western Europe, as well as in a number of other
countries (such as Japan and Egypt) which have
borrowed their law from European countries. One
Bibliography of the chief characteristics of the modem civil law is
Campbell, I., and P. Clayden. 1980. T h e l a w o f that it derives its authority from one or more basic
c o m m o n s a n d v i l l a g e g r e e n s . Henley-on- Codes of law; and it remains a principal distinction
Thames: Open Spaces Society. between common law and civil law countries that
Hoskins, W.G., andL. Dudley Stamp. 1967. T h e c o m m o n
l a n d s o f E n g l a n d a n d W a l e s . London: Collins.
the former have not generally codified their law.
Rackham, O. 1976. Trees and woodland in the British And even in
landscape. London: Dent.
Common Law 1863
common law jurisdictions (such as California, for depend upon their own preferences, their culture
example) where there does today exist a kind of and traditions.
common law Code, it differs fundamentally in But there is a second way in which the common
nature from the civil law Codes; in particular the law has traditionally been associated with the
system of precedent, and the authority of the judges freedom of the market, and this association rests
to interpret and develop such common law Codes upon the historical facts of the last three centuries.
are quite different from those recognised in civil law The concept of the Rule of Law which came to be
countries. recognized and defended in England after the
The term ‘common law’ is also often used in revolution of 1688 has been seen by many as having
various narrower senses. In the most important of favoured the development of a free market economy
these narrower senses, the common law is often in England prior to and during the early years of the
contrasted with legislation, so that the lawyer in a industrial revolution. Because of the historical fact
common-law country still thinks of legislation as a it was for a long time almost an article of faith
type of law different from the ‘common law’, which among English writers that the common law and the
is basically judge-made law. The term ‘common freedom of the market were closely associated. This
law’ is sometimes used in yet a third relevant sense view is today less strongly held in England, as a
in which it is distinguished from a body of law, result no doubt of the fact that, while Englishmen
known technically as ‘Equity’ which was originally still like to believe in the Rule of Law (despite
supplementary to the common law, and was grave doubts in some quarters as to whether this
developed in the separate Court of Chancery. Today concept has much meaning), they are by no means
common law (in this narrow sense) and ‘Equity’ are so wedded to the free market as they were. In
almost everywhere merged and administered by a America, where the Constitution of 1788
single set of courts. substantially embodied the English traditions as to
The common law (in the first two senses iden- the Rule of Law, as well as the then accepted
tified above) has traditionally been associated with ideology of the free market, the association between
the economics of the free market in at least two the two has survived rather more strongly.
different ways. First, there is a strain of thought, The reasons for the traditional belief in the close
represented in particular by Hayek ( ), which association between the common law and the
seems to suggest that a system of freedom of the market must therefore be sought in
law, like the common law, which is largely judge- history, and in particular in English history during
made, is inherently more likely to favour and protect the period from approximately 1770 to 1870, when
individual freedoms, and among them (or the free market economy was largely in process of
especially) economic freedoms. But this is an being estabhshed. And of all parts of the common
implausible and indeed eccentric claim, which law, none was more important for this purpose than
seems to involve confusion of the first two senses of the law of contract, because this was the part of the
the term of ‘common law’ referred to above. law most intimately related to the economic system.
Because most redistribution is accomplished in Indeed, the story of English law between 1770 and
modem democracies by legislative measures, it is 1870 was to a large degree the story of how the law
easy to assume that a legal system which owes little of contract was converted into the law of the free
to legislation will be more likely to recognize and market, and of how the ideology of freedom of
protect the freedom of the market, but the amount of contract became one of the great intellectual
redistribution which occurs in a legal system does movements of history (Atiyah ).
not necessarily depend upon whether that society is The first three-quarters of the 18th century was
part of the common law world. There is no a priori a period of transition in England, during which
reason to suppose that judges left to themselves by a many older ideas about contract and the market
legislature will necessarily favour the economics of were being displaced by the newer ideas which
the market. In the last analysis, the policies favoured
by judges will
1864 Common Law
gradually became dominant towards the end of the yet, little sign of a general law of contract,
century. Among the older ideas at least three can be governing all types of transaction. Then also, it
identified as particularly hostile to the laws needed remains unclear how far the contract law of this
to serve the emerging free market economy. First, period actually recognized and enforced wholly
there was a regulatory element in the law and the executory contracts, in the sense of awarding
economy dating back to Tudor times, represented damages for breach of a contract prior to any acts of
for instance by statutory controls of wages and performance or detrimental reliance by any of the
prices of many commodities, and by the parties. And finally, it is clear that, from the
apprenticeship laws which controlled entry to many standpoint of today, the law of contract in the 18th
trades with outdated and largely unnecessary century had not yet freed itself from dependence on
restrictions. Secondly, there was a paternalistic the law of property. Of course, in one sense contract
element in much contract law at this time, with the law can never be free from a dependence on
courts still being willing to relieve various classes property entitlements, because contract law is the
of persons from the consequences of bad bargains mechanism by which entitlements are exchanged;
which they had made. This paternalism was but there are clear signs in the 18th century that
particularly pronounced under various doctrines of contract law was still closely tied to property law in
Equity, such as rules for the relief of mortgagors, another sense, in the sense (for instance ) that the
rules against the enforcement of contractual proprietary aspects of many transactions were still
penalties and forfeitures, rules for the protection of regarded as more important than the promissory or
seamen and ‘expectant heirs’, and so forth. Thus, in contractual aspects. So, for instance, the right of a
the third sense of the term the ‘common law’ mortgagor to redeem the mortgaged property was
identified above, it can be said that the common law protected by the courts, even when by the terms of
was always more market- oriented than Equity. the mortgage documents he had forfeited that right
Thirdly, there was a traditional moralistic element by delay in repaying the loan. It was assumed that if
in the contract law of the 18th century, and this also the mortgagee received back his money, with
took different forms, such as the general hostility to interest and costs, he was adequately protected by
usury (as to which see Simpson , pp. 510—18), and the law, even though the contract itself would have
the attempts to regulate the way in which essential given him more extensive rights.
foods and drinks were sold by use of the traditional Dining the century beginning around 1770 these
marketing offences. The ‘moral’ roots of older law older ideas and traditions gradually gave way before
were also related to ideas about ‘just prices’ which, the ideology of freedom of contract; but it would be
though rarely openly recognized in the common wrong to think that this ideology did not have long
law, seem to have been influential at least in some roots and antecedents in still earlier periods. There
of the cases in Equity, where there are signs that the are, even in the 16th and 17th centuries, many signs
Chancellors did have some vague sense of unease if of incipient economic liberalism among the lawyers
they were asked to enforce contracts at prices which such as Coke, who bequeathed to the common law a
seemed to them very unfair, or on terms which were hatred of monopolies as well as a passion for
(in the language of the law) ‘unconscionable’. individual liberties (Wagner ). And Thomas
In addition to these specific instances of inter- Hobbes, in a well-known passage in Leviathan, had
ference with the binding force of private contracts, swept away all the medieval learning about ‘just
there were important respects in which the whole prices’ and declared that ‘[t]he value of all things
concept of a general contract law remained rela- contracted for, is measured by the Appetite of the
tively undeveloped at this time. Thus, while the law Contractors; and therefore the just value, is that
recognized and enforced specific types of contracts, which they be contended to give’ (Hobbes [ ]
such as contracts for the sale of land, contracts of 1968, p. 208 ). So the ideology of freedom
insurance and so forth, there was, as of contract certainly had origins going back well
Common Law 1865
beyond the 18th century. Nevertheless, it does seem Although the common law had always insisted that
(though the matter remains controversial) that major a promise be supported by some ‘consideration’,
changes in the law began during the course of that some reason, before it would be enforced (and to
century which gathered pace as the century that extent at least contained a paternalist element),
progressed. the growing acceptance of the subjective theory of
Certainly, a great deal occurred to change the value meant that the doctrine of consideration
character of contract law from the last quarter of the became much less important during the 19th
18th century until well into the 19th century, and century. So far instance, in Haigh v. Brooks (1840,
there is much evidence that many of these changes 113 English Reports 124) the judges enforced a
in the law were profoundly influenced by classical promise to pay £9000 in return for the giving up of a
economic theory, and perhaps still more by popular guarantee previously given by the promisor, even
versions of classical economic theory. First, the though it now appeared that the guarantee might be
relics of the Tudor regulatory economy gradually unenforceable and legally worthless. The promisor
disappeared. Wage regulation had become had valued it at £9000, said the judges; it was not
increasingly obsolete in practice during the 18th for them to say that the document was worthless.
century, and a major challenge to the older laws in For similar reasons, the prejudice against usury had
the name of freedom of contract had taken place in gradually been overcome, and the usury laws were
the celebrated case of the Gloucestershire Weavers totally repealed in England in 1854.
(1756-7), (Atiyah , pp. 73-4). By the early 19th In these ways, then, the principle that contracts
century most of the legislation authorizing the are binding and must be strictly enforced had been
fixing of wages had been repealed. So too was the greatly strengthened, and exceptional cases had
Statute of Apprentices, after many years during been whittled down by the middle of the 19th
which its operation had been gradually whittled century. In addition, other changes had occurred in
down by the judges. Secondly, the signs of the general nature of contract law, which were
paternalism which are still found in 18th-century closely related to the growing trend to see contract
Equity seem to have disappeared gradually as the law as the law of the free market. First, it was
judges hardened their hearts and toughened their during this period that a general law of contract
minds. For example, signs of an attempt to came into existence for the first time in the common
introduce implied warranties on the sale of goods law world. And the process of generalization was
for the protection of buyers, which can be detected important to the ideology of the law in a number of
in the 18th century, were largely scotched, and the respects. In particular, the generalizing of
principle of caveat emptor reasserted with full contractual ideas meant that the law had to become
vigour. The equitable doctrines allowing the courts more abstract, more broadly principled. Principles
to relieve various unfortunates from the effects of had to be developed which could be applied equally
hard bargains were gradually whittled down, to (say) commercial contracts for the sale of wheat,
although they never disappeared altogether. Third, to contracts of employment, and (for instance) to
the moralistic elements in the law were also personal contracts such as the contract to marry.
gradually whittled down. The law of contract came This abstraction may have helped the law become
increasingly to be seen to be neutrally enforcing more neutral, less inclined to pursue any
agreements which must be presumed to be redistributive tendencies, such as may exist where
beneficial to both parties. The only moral (say) there is a separate body of legal doctrine
component left in the law of contract during the dealing with contracts of employment, or with
19th century seemed to derive from the binding residential leases, or with loan transactions.
nature of promises. Next, it seems clear that another major devel-
The subjective theory of value also seems to opment during this period was the gradual shift in
have been largely accepted by the judges even emphasis in contract law away from treating
before it had been wholly accepted by economists.
1866 Common Law
contracts as present, or partly performed exchanges, another fifty years or so. Freedom of contract had,
and towards treating them as private planning apparently, reached its highest point. But although
devices, made in advance to allocate risks. The this was true of the ideology of freedom of contract
wholly executory contract became clearly among lawyers and judges, it was not really true of
recognised by the law, so that it now became the views of economists or of the politicians, or of
possible for a person to sue for damages for breach the public. By the late 19th century, neoclassical
of a pure promise, even where no performance or economists were already beginning to write
detrimental reliance had taken place. The sceptically about the sweeping effects of freedom of
justification for requiring damages to be paid in contract which had been attributed to the classical
such circumstances was never clearly enunciated, economists, and were pointing out the many
and indeed, specific justification was rarely seen to possible causes of market failure such as
be necessary. It was widely assumed that the broad information difficulties, externalities and monopoly.
principle of freedom of contract required, not only And although most of the older regulatory
that parties be left free to make their own legislation had been repealed in the first half of the
exchanges, but that the law should be available in 19th century, Parliament had at the same time been
aid of a party to enforce his claim to damages where gradually building up a completely new body of
the other failed to perform. John Stuart Mill was the regulatory enactments dealing with new industrial
first economist to point out that a policy of laissez- problems - factories, coal mines, safety at sea for
faire could not be used to justify the enforcement of seamen and emigrant passengers, public health, the
executory contracts (Mill , vol. 2, p. 386), but even adulteration of food and drink, regulation of the
modem economists do not generally pursue this line weights and measures used for sales, and so on.
of thought, though some libertarians have done so. Much of this new legislation had been a pragmatic
And finally, 19th-century contract law increas- response to perceived evils, and though some of it
ingly freed itself from its dependence on property could have been justified economically by
law. Although obviously entitlements still remain arguments concerning misinformation or
the subject matter of all contracts, contract law has externalities, much of it would have been difficult to
become much less concerned with specific items of justify except on the assumption of paternalistic or
property, and is more concerned with wealth as a redistributive motives. Some of it may have been
kind of fungible property. The reason for this was inspired by sheer impatience, an unwillingness to
basically that 19th-century contract law was give the market time to work, or a belief that the
dominated by the needs of merchants and traders, to shortterm costs of market failures were so severe
whom all property is in principle replaceable with that legislative correction was necessary without
money. A merchant can be assumed to be regard to the long term distortions this might
indifferent between a piece of property, and the produce.
value of that property. Similarly, as contracts came What is quite clear is that by the time the
to be increasingly seen as fundamentally risk- English common law and common lawyers had
allocation devices, the particular entitlements or accepted the teachings (as they were thought to be)
property to which the risks attached became less of the classical economists on freedom of contract,
important. these teachings were already somewhat out of date.
By the last quarter of the 19th century, the The result was that the mature common law of
process of developing a mature body of general contract was seriously deficient in a number of
contract law had largely been completed in England, respects. It was first of all deficient in its almost
and although a similar process took place in total neglect of the problem of externalities.
America (Horwitz ), there is ground Contracting parties were entitled to pursue their own
for believing that that was not completed for interests, regardless of the effect of their
Common Law 1867
contract on third parties, or the public. Only in the contracts, contracts of employment, residential
most extreme cases of actual illegality would the leases, and insurance contracts. Unconscionable, or
courts generally refuse to uphold a contract. Sec- unfair contracts are increasingly subjected to
ondly (although this certainly could not be laid at judicial control. Many areas of law which were
the door of the classical economists), there had formerly controlled largely by contract, such as
been, during the 19th century, a serious neglect by family law, are now subject to extensive judicial
common lawyers of the problem of monopoly. This discretionary control. Even business and
may well have been largely due to the fact that for commercial contracts are subject to vast bodies of
the greater part of this period the British economy legislative and regulatory laws, some, such as the
was itself highly competitive, and in little danger modem monopoly anti-trust laws, being designed to
from monopolies. But the complacent assumption preserve the operation of a competitive market, but
that cartels were unstable and were always much of it still being designed to restrict
vulnerable to internal or external competition was in competition or the operation of the free market.
England (though not in America) carried over by America has not gone so far down this road as
lawyers and courts into new conditions towards the Britain and other common-law countries, and
end of the nineteenth century, and well into the indeed, for a long time, in the late 19th and early
present century, when it was utterly out of date. A 20th centuries, constitutional decisions of the United
second result of this failure of the common law to States Supreme Court in the name of freedom of
keep pace with economic theory and political contract, actually prevented similar developments.
reality, was the growing gulf between the common Much legislative intervention with freedom of
law and legislation. Once again, extensive contract was, during this period, declared
legislative intervention with freedom of contract unconstitutional, frequently over the dissent of
began to become commonplace, and much of it was Justice Holmes. By the late 1930s, however, the
increasingly redistributive in character. majority of the court had largely accepted Holmes's
During the course of the present century this view, and since then, legislative intervention with
process continued at an increasing pace unti 11980 freedom of contract has not been regarded as per se
or thereabouts, since when there are signs that unconstitutional. This shift in the court opened the
history has virtually reversed itself. Disillusion with door to the same kind of regulation and intervention
the free market, particularly in England, increased which had already been taking place in Britain, and
during the great depression in the 1930s until, by although America has not, like Britain, brought
the end of World War II, a Labour Government was large-scale industries within the public sector and
elected to power with a massive majority and with a therefore partially outside the control of contract
mandate to lay the foundations for a socialist state law, most of the other legislative developments of
and a socialist economic system. Since then the British type certainly have their parallel in
England has increasingly learned to live with a America. No doubt some contracts are more
‘mixed economy’, to a large part of which the regulated in Britain, but conversely there are plenty
traditional law of contract seems irrelevant because of examples of legislative interference with freedom
the public sector is often controlled by public laws of contract in America which are not to be found in
rather than by contract law. But even in areas where Britain.
private law continues to operate, the common law These vast changes in the operation of the
of contract has become increasingly affected by common law have accompanied or brought with
legislative intervention. Virtually all types of them a change in ideology once again. Paternalism
consumer transactions are today controlled or and redistribution were, at least until around 1980,
affected to some degree by legislation, including increasingly favoured by many writers and
consumer credit
1868 Common Property Resources
teachers of contract law, as well as large sectors of many years to come British or American courts will
the electorate. Even the judges became much more be enforcing contracts according to the full rigour of
sympathetic to arguments based on concepts like the common law.
unconscionability and inequality of bargaining
power. In America, unconscionability was given
express legitimacy as a device for overturning unfair
contracts by the Uniform Commercial Code, and See Also
was increasingly used by the judges as a matter of
common law as well. Many relationships of a
contractual character (for instance, that of physician
and patient) and others of a virtually contractual
character (for instance, that between manufactures Bibliography
of products and ultimate purchasers and consumers)
Atiyah, P.S. 1979. T h e r i s e a n d f a l l o f f r e e d o m o f
are, both in American and Britain, increasingly c o n t r a c t . Oxford: Oxford University Press.
regulated by tort law rather than contract law, at Hayek, F.A. 1973. L a w , l e g i s l a t i o n a n d l i b e r t y ,
least where things go badly wrong and legal actions Rules and Orders, vol. 1. London: Routledge & Kegan
Paul.
for damages are brought based on negligent
Hobbes, T. 1651. In L e v i a t h a n , ed. C.B. Macpherson.
conduct, or on defects in the goods. In such Harmondsworth: Penguin Books. 1968.
malpractice or products liability actions the Horwitz, M.J. 1977. T h e transformation of
appropriate standards of care or quality are set by American Law 1780-1860. Cambridge,
MA/London: Harvard University Press.
judges and juries and not by the contracting parties,
Mill, J.S. 1848. P r i n c i p l e s o f p o l i t i c a l e c o n o m y .
and contractual exculpatory clauses are often denied From the fifth London edition. New York: D. Appleton &
legal validity. Co., 1908.
Since about 1980 there have been increasing Simpson, A.W.B. 1975. A h i s t o r y o f t h e c o m m o n
l a w o f c o n t r a c t . Oxford: Clarendon.
signs that the tide has turned yet again, both in
Wagner, D.O. 1935. Coke and the rise of economic liberalism.
Britain and America. Obviously, and visibly, British E c o n o m i c H i s t o r y R e v i e w 6(1): 30.
and American governments have since then been
trying to reassert the virtues of the free market and
roll back the frontiers of regulation, and in this they
are being vigorously supported by some lawyers and
Common Property Resources
law teachers in America, though not to any real
extent in Britain. It is not yet clear what the impact Jean-Philippe Platteau
of this is going to be on the future of the common
law of contract. One possible scenario is that, as in
the late 19th century, the courts will be behind the
times, but that on this occasion they will be hostile
Keywords
to the reasserted belief in the free market and will
continue to defend paternalist and redistributive Common property resources; Efficiency; Market
intervention in free contracts, particularly where one integration; Open access; Private property
of the parties to the contract is a consumer or ‘small rights; Tragedy of the commons
man’ thought to be weak in bargaining power. But
another possible scenario is that the new enthusiasm
for the free market will prove but a short-lived JEL Classifications
01
hiccup in the long-term trend towards paternalist
and redistributive policies. In either event it seems The concept of common property has become
unlikely that for famous in economics since Garett Hardin ( )
wrote his celebrated article on ‘The Tragedy ofthe
Common Property Resources 1869
Commons’. In this article, common property is critiques, is one under which a community pos-
taken to mean the absence of property rights in a sesses a collective ownership right over local
resource, or what is equivalently known as a regime natural resources. Under common property,
of ‘open access’. Under such a regime, where a therefore, a right of exclusion is assigned to a well-
right of inclusion is granted to anyone who wants to defined user group, and Hardin has created a lot of
use the resource, Hardin argued, inefficiency confusion by using the word ‘commons’ to refer to
inevitably arises in the form of over- exploitation of the alternative situation where no such right is
the resource accompanied by an over-application of granted to any agency. What is not always clear,
the variable inputs. Open access leads to efficiency however, is whether the ownership right involves
losses because ‘the average product of the variable only the ability to specify the rightful claimants to
input, not its marginal product, is equated to the the resource, or whether it also involves the ability
input’s rental rate when access is free and the to define and enforce rales of use regarding that
number of exploiters is large’ (Comes and Sandler , resource (for example, regulations about the
p. 787). The root of the problem lies in the fact that harvesting season and production tools, allowed
the average product rule does not enable the users to quotas of harvestable products of the resource, or
internalize the external cost which their decisions taxes). Baland and Platteau ( ) have coined the
impose on the users already operating in the term ‘unregulated common property’ to refer to the
resource domain. Of course, the efficiency losses former situation, while the term ‘regulated common
are conceivable only in a world of resource scarcity, property’ is used for the latter.
implying that the variable input is subject to Two polar situations can be considered on the
decreasing returns. Such losses are considerable basis of this analytically important distinction
since they amount to the dissipation of the whole between two types of common property regimes. At
resource rent. Here is the crucial intuition behind one extreme, if common property is perfectly
the open access regime: when no property right is regulated, in the sense that the rules of use designed
attached to a resource, the value of this resource is and enforced by the owner community allow a
zero in spite of its scarcity. perfect internalization of the externalities, common
Efficiency losses are to be measured not only in property becomes equivalent to private property
static but also in dynamic terms. Indeed, in an open with a sole owner from an efficiency standpoint.
access regime resource users are induced to This illustrates the general result that, absent trans-
compare average instantaneous returns with the action costs, institutions do not matter. At the other
input’s rental price even though they may well be extreme, a strictly unregulated common property in
aware that they thereby contribute to reducing the the above sense implies that, as the number of users
future stock of the resource. The problem is simply becomes quite large, over-exploitation of the
that they are forced to follow a myopic rule because resource becomes as important as under the open
there is no way in which they can reap the future access regime: the rent attached to the resource is
benefits of restraint in the present. Thus, for totally dissipated (see Platteau , ch. 3).
example, by refraining today from catching juvenile Between these two extremes we find the situa-
fish or from cutting down saplings in the forest, a tions most typically observed on the ground and
villager can receive no assurance that he or she will described in the numerous field studies devoted to
be able in the next period to catch mature fish or to this topic (see Ostrom ; Baland and Platteau , for a
fell fully grown trees. review of such studies). In such instances, rales
The main criticism levelled by numerous social ofuse exist alongside membership rales, yet they
scientists against the concept of open access is that tend to be imperfectly designed and imperfectly
the corresponding regime is rarely encountered on enforced by the village community. One key reason
the ground. The typical regime, according to these for these imperfections is the governance costs that
unavoidably plague any collective decision-making
process. Governance
1870 Common Property Resources
costs include all those costs incurred to reach a are the costs of delimiting and defending the
collective agreement and to organize a community resource ‘territory’ (Dasgupta , pp. 288-9). For
of users. They are likely to be higher when the many natural resources, the costs of dividing the
group is larger and when its membership is more resource domain appear prohibitive under the
heterogeneous (whether measured in terms of present state of technology. For example, the open
diversity of objectives or of wealth inequality). sea - or, more exactly, the fish stock contained in it -
Moreover, governance costs are enhanced by the presents insuperable difficulties for private
opportunistic tendencies of rights-holders not only appropriation. The enforcement of exclusive
to violate or circumvent collective rules but also to property rights to individual patches of the ocean
eschew efforts to create collective mechanisms of would, indeed, be infinitely costly. This is
decision-making and enforcement. Costs arising especially evident when fish species are mobile and
from these proclivities are also dependent on the move within wide water spaces, since exclusive
size of the user group: they are lower if the number rights are too costly to establish and enforce
of resource users is smaller and, at the limit, they whether over the resource or over the territory in
are nil when there is a single user. which the resource moves.
As a consequence of the aforementioned limi- The opportunity costs of privatization, for their
tations, resources are less efficiently managed under part, correspond to the benefits that are lost when
a common property regime than they could be under the common property regime is abandoned. Here,
a private ownership system. This is especially true we can think of scale economies that may be present
if, owing to their scarcity, the resources carry high not only in the resource itself but also in
values which should be reflected in high rents. complementary factors. The obvious advantage of
Population growth and market integration are thus coordinating the herding of animals so as to
two forces that tend to increase the monetary value economize on shepherd labour in extensive grazing
of the efficiency losses arising from common activities is probably the best illustration of the way
property, that is, the forgone rents. This, at least, is scale economies in a complementary factor may
the conclusion drawn by the so-called property prevent the division of a resource domain. Another
rights school of Chicago economists (see, for important category of opportunity costs is the
example, Demsetz ; Barzel ). The advantages of insurance benefits associated with common
private property appear all the more decisive as property. When returns to a resource are highly
such a regime enables users to internalize variable across time and space, the need to insure
externalities without incurring any governance against such variability is yet another consideration
costs. This is because it establishes a one-to-one that may militate against resource division. When a
relationship between individual actions and all their resource has a low predictability (that is, when the
effects: ‘A primary function of property rights is variance in its value per unit of time per unit area is
that of guiding incentives to achieve a greater high), users are generally reluctant to divide it into
internalization of externalities ...’ (Demsetz , p. smaller portions because they would thereby lose
348). the insurance benefits provided by keeping the
Nevertheless, this ignores the costs of resource whole.
privatizing natural resources, which involve both For instance, herders (fishermen) may need to
directs costs and opportunity costs. Direct costs have access to a wide portfolio of pasture lands
comprise transaction costs, such as the costs of (fishing spots) in so far as, at any given time, wide
negotiating, defining and enforcing private property spatial variations in yields result from climatic or
rights. The usual argument is that such costs other environmental factors. On the assumption that
increase with the physical base of the resource. the probability distributions are not correlated too
Thus, the wider the resource base (or the less much across spatial groupings of land or water
concentrated the resource) the higher
Common Property Rights 1871
and that they are not overly correlated over time, a Demsetz, H. 1967. Toward a theory of property rights.
system offering access to a large area within which A m e r i c a n E c o n o m i c R e v i e w 57: 347-359.
Hardin, G. 1968. The tragedy of the commons. S c i e n c e
right-holding users can freely move appears highly 162: 1243-1248.
desirable from a risk-reducing perspective. Ostrorn, E. 1990. Governing the commons: The evolution of
The conclusion of the above discussion is, institutions for collective action. Cambridge: Cambridge
therefore, that the balance of the advantages and University Press.
Peters, P. 1994. D i v i d i n g t h e c o m m o n s : P o l i t i c s ,
disadvantages of various property regimes is a priori policy, and culture in Botswana.
undetermined. Economic theory, however, does Charlottesville/London: University Press of Virginia.
provide useful guidance about which circumstances Platteau, J.-P. 2000. I n s t i t u t i o n s , s o c i a l n o r m s
are more favourable to the persistence of common a n d e c o n o m i c d e v e l o p m e n t . London: Harwood
Academic Publishers.
property or, conversely, to its demise and
replacement by private property. Furthermore,
instead of being fixed once for all, the balance sheet
is susceptible to evolution depending on the
transformation of the parameters on which the Common Property Rights
benefits and costs of privatization depend. Thus, the
direct costs of resource division may fall with Steven N. S. Cheung
technological progress. For example, the
introduction of modem borehole drilling facilitates
the privatization of common grazing areas (Peters ).
ft is therefore not only the factors which enhance In a society where individuals compete for the use
resource value but also those which reduce the of scarce resources, some mles or criteria of com-
direct costs of partitioning that may favour the petition must exist to resolve the conflict. These
private appropriation of natural resources. mles, known as property rights, may be established
in law, in regulation, in custom or in hierarchy
ranking. The structures of rights may take a variety
of forms, ranging from private property rights at
one extreme to common property rights at the other.
See Also
Most fall somewhere in between: either set of rights
► A "/-',ss to T eve opment would be rare in its purest form.
► Agriculture and Economic Development In a private property, the delimitation of the
► Land lArkets right to its use is expressed in dimensions or
► Population and Agricultural Growth characteristics inherent in the property itself. These
► rtoperty Lav/, 3"::.c::.:.:'.'c • \ rights are exclusive to some private party, are freely
► Tragedy of the Commons transferable, and the income derived from them is
not attenuated, restrained or infringed by laws or
regulations. Hence price control, taxation, and
Bibliography
social restriction of transferability may be regarded
Baland, I, and J.-R Platteau. 1996. Halting degradation of as violations of private property rights. In a
natural resources: Is there a role for rural communities? common property, there is no delimitation or
Oxford: Clarendon Press. delineation of its use rights to any private party. No
Barzel, Y. 1989. E c o n o m i c a n a l y s i s o f p r o p e r t y
one has the right to exclude others from using it,
r i g h t s . Cambridge: Cambridge University Press.
Comes, R., and T. Sandler. 1983. On commons and tragedies. and all are free to compete for its use. Hence there
A m e r i c a n E c o n o m i c R e v i e w 83: 787-792. are no exclusive use rights, no rights to be
Dasgupta, P. 1993. An inquiry into well-being and destitution. transferred, and in the limiting case, no
Oxford: Clarendon Press.
1872 Common Property Rights
net income can be derived from using the common the analysis. Attempts to reduce rent dissipation go
property. far to explain why common property in its ‘pure’
This last condition rests on an economic prop- form is seldom observed. In marine fisheries, for
osition known as the dissipation of rent. It argues example, numerous regulations govern the fishing
that because of the lack of exclusive use rights, season, the size offish caught, the boat size and the
individuals competing for the use of a common mesh size, and various licensing arrangements
property will reduce its rental value or net worth to restrict the number of boats and fishermen. The
zero. The reason is that if no one has an exclusive market value of a fishing licence, sometimes
claim to the value (i.e. rent) ofthat property, its use enormous, is one measiue of the ocean rent cap-
will invite competition to the point that each and tured. Even for public beaches, regulations of some
every competing user can earn no more than the type will often be found to govern the use of those
alternative earning of his own resources required in most in demand.
the exploitation of that common property. In other Whereas regulations and restrictions on entry in
words, under competition and with no one having a the use of a common property often serve to reduce
special advantage, a ‘prize’ that has no exclusive dissipation, the rent that can be captured is usually
claimant will be dissipated or absorbed by the costs less than if the property were privately owned. To
of other resources which must be dedicated to its reconcile this observation with constrained
winning. Hence the net value of the prize won is maximization, wc must infer that, enforcement
zero. costs aside, other transaction costs associated with
The usual examples of common property rights the changing of institutional arrangements must
cite a public beach and marine fisheries, and the restrain the formation of private property rights.
dissipation of rent typically implies excessive use or No economy can survive if the majority of its
over-exploitation. However, the dissipation may scarce resources arc commonly owned. Regulations
take the form of under- exploitation. For example, a may, indeed, reduce rent dissipation; but in the
piece of fertile land under common ownership may process they not only distort the use of the resources
be used for herd grazing, or left idle, instead of but also invite corruption and the emergence of
being planted as an orchard. special interests. An unrestrained common property,
In the real world, the complete dissipation of strictly speaking, is propertyless in ownership; if its
rent is rare indeed. This is because the supply curve structure is extended to all resources, starvation for
of labour or of other inputs may be rising (some all must result. If one rules out private property
intramarginal rent may be captured), the competing rights, then to avoid the imposition of an infinite
users may have different opportunity costs (the non- array of regulations the remaining alternative is the
marginal users may be enjoying rent), or entry may communal system or the communist state.
be restricted by regulations, by customs or by In a communist state there is no private owner
information costs. Still, with common property of productive resources: each constituent is prop-
rights some dissipation of rent is inevitable, and no ertyless, in the literal sense of the word. Since the
society can afford to surrender a large portion of its dissipation of rents associated with common prop-
valuable resources to this structure of property erties will guarantee starvation, in a communist
rights. state the rights to use resources, and to derive
A property may be held in common because its income therefrom, arc defined in terms of rank.
capUirable rent is lower than the cost of enforcing That is, stripped of all ownership rights over valu-
exclusivity. In this case, the dissipation of rent is no able and productive resources, the citizens of a
waste. However, to the extent that rent dissipation is communist state hold differing rights to use
viewed as a waste, its occurrence must be resources and to obtain income according to
attributable to the omission of some constraints in
Common Property Rights 1873
their status. In the people’s communes in China to stand aside and simply permit rent to dissipate is
under the Great Leap Forward, for example, no one inconsistent with the postulate of constrained
owns the productive resources (i.e. everyone is maximization.
property less), but comrades of different ranks What is useful and important from the stand-
enjoy different rights and privileges. ‘Rank’ as such point of economic explanation is to view whatever
has value and is subject to competition, therefore a rent dissipation does occur as necessarily a
system of ‘property’ rights is implicit. However, the constrained minimum because, under the max-
valuable rights are now defined in terms other than imization postulate, each and every individual has
the inherent properties of the productive resources. an incentive to reduce that dissipation. Behaviour
This is, in fact, the key distinction between a associated with the dissipation of rent must
private property system and a communist state: the therefore be regarded as attempts to reduce that loss,
former delineates rights in terms of certain and this altered view explains many observations.
dimensions of the productive resources themselves; That some dissipation remains must then be
the latter delineates rights in terms of a attributable to the constraints of transaction costs.
characteristic (rank) of people deprived of produc- The challenge to the economist is to specify and
tive human capital. In the communist state, the identify what these costs are and how they will vary
competition for and protection of rank will draw on under differing circumstances.
the use of valuable and productive resources
(another form of rent dissipation). Moreover, the
lack of market prices increases the cost of infor- See Also
mation, and the lack of contractual choices
increases the cost of enforcing performance. What
is saved in return are the costs of delineating and
enforcing rights in properties.
►
It is among these varied costs - broadly defined
as transaction costs - that we find the key
divergence in economic performance between the References
communist and the private property systems. If one
ignores transaction costs, the delineation of rights in Alchian, A.A. 1965. Some economics of property rights. I I
P o l i t i c o 30(4): 816-829.
terms of rank will produce the same use of Bottomley, A. 1963. The effect of the common ownership of
resources as would the delineation of rights in land upon resource allocation in Tripolitania. L a n d
properties. However, it can be convincingly argued E c o n o m i c s 39: 91-95.
that the broadly defined transaction costs are Cheung, S.N.S. 1970. The structure of a contract and the
theory of a non-exclusive resource. J o u r n a l o f L a w
generally higher with communal than with private and
rights. Communism fails, not because it does not E c o n o m i c s 13: 49-70.
work in theory, but precisely because in practice its Cheung, S.N.S. 1974. A theory of price control. J o u r n a l o f
costs of transaction are higher than those in a L a w a n d E c o n o m i c s 17: 53-71.
Cheung, S.N.S. 1982. W i l l C h i n a g o ‘ c a p i t a l i s t ’ ?
system of private property rights. Still, the
Hobart paper 94. London: IEA.
delineation of rights in ranks is a way to reduce rent Coase, R.H. 1960. The problem of social cost. J o u r n a l o f
dissipation in a propertyless state. L a w a n d E c o n o m i c s 3: 1-44.
Strictly speaking, the dissipation of rent asso- Demsetz, H. 1964. The exchange and enforcement of property
rights. J o u r n a l o f L a w a n d E c o n o m i c s 7: 11-
ciated with common property is no ‘theory’ at all,
26.
because dissipating rent merely to produce an Gordon, H.S. 1954. The economic theory of a common
equilibrium does not explain behaviour. Worse, property resource: The fishery. J o u r n a l of
P o l i t i c a l E c o n o m y 62: 124-142.
Knight, F.H. 1924. Some fallacies in the interpretation of
social cost. Q u a r t e r l y J o u r n a l o f E c o n o m i c s
38: 582-606.
1874 Common Rights in Land
Nevertheless, four major types of common land owner’s tenant. Access to the common rights was
may be distinguished. First, the archetypical form of considerably more complex and took different
common land and the one with the widest forms in different places; but it is possible to dis-
geographical distribution is variously referred to as tinguish four main forms of access. First, in
common waste, common pasture, waste, or England and some parts of the Continent, the own-
common. This land was permanently common and ership or tenancy of particular buildings or land-
most often grassland used for grazing animals. holdings was a prerequisite. Second, in many parts
Usually such land was not suitable for arable of the Continent citizenship of (as distinct from
cultivation typically because its natural fertility was residence in) a commune or a municipality which
low but sometimes for other reasons such a itself owned the common resource was necessary,
propensity to seasonal flooding. On some common sometimes in combination with a property qualifi-
wastes other resources were available, such as peat, cation. Third, in other parts of the Continent mem-
turf, gorse or wood. bership of a cooperative association which owned
Second, in many parts of Europe much of the the common resources was necessary. Membership
arable land (the land on which crops were grown) of these institutions was sometimes inherited, but
was also subject to common rights. Such land, sometimes it was attached to buildings or land (as in
known as open- fields, common fields, or common the first case). Fourth, there were cases were all
arable, was privately owned and cultivated but residents in an area had common rights. But outside
subject to common grazing. In its classic form each laigely uninhabited areas, such as northern Sweden,
farmer held a number of long thin strips of land this situation was unusual.
scattered over an extensive area and intermixed In consequence by no means all individuals or
with the strips of other farmers. Each farmer households enjoyed common rights. The proportion
cultivated his own crops on the arable. But when of the population that enjoyed common rights
the harvest was over, or in years when the land was varied considerably from one region to another and
being fallowed, all those with common rights could changed over time. Where individuals or
turn their livestock into the fields to graze. Thus the households did have common rights, the kinds and
open fields alternated between private and common levels of the rights they enjoyed were determined by
land over the course of the agricultural cycle. local regulations.
Third, common woodland for the production of
fuel and timber was widespread on the European
continent but unusual in England. This was similar Regulation
to common waste in that it was permanently in
common use. Common land was almost invariably regulated by
Fourth, common meadows, which were per- local institutions, often at the level of the individual
manent grasslands for the production of hay, were village or manor. The institutions varied but were
divided into separate blocks in private use but afterusually manor or village courts or village
the hay had been harvested were open to common assemblies or committees of some kind, with the
grazing. Thus, like the open fields, common decisions made by a group of jurors. These insti-
meadows alternated between private land and tutions normally issued sets of mles, ordinances or
common land over the agricultural cycle. by-laws which governed the usage of the commons
and set fines for the infringement of rales. Officials
or monitors were appointed to police the by-laws.
Common Rights The degree to which these institutions and their by-
laws were subject to the influence of feudal
As private property, the right to cultivate the com- overlords and the state varied considerably across
mon arable or to harvest the hay in common Europe.
meadows lay with the owner of the land or the The by-laws provided the basic regulatory
framework for managing the commons (for
1876 Common Rights in Land
on problem solving. From these experiences, indeed businessmen, and indeed accepted capitalism,
already manifest in the underlying strategy, though not necessarily on the terms given or pre-
Commons developed a theory of government as ferred by the established power structure.
alternately a mediator of conflicting interests and an Commons was one of the few American econ-
arena in which conflicting interests bargained over omists to found a ‘school’, a tradition that was
their differences; a theory of the complex carried forward by a corps of students, especially
organization - in terms of freedom, power and Selig Perlman, Edwin E. Witte, Martin Glaeser and
coercion - and evolution of the legal foundations of Kenneth Parsons. Much mid-20th-century American
capitalism, which centred in part on the composing social reform, the New Deal for example, drew on
of major structural conflicts through the mutual or reflected the work of Commons and his fellow
accommodation of interests; and a theory of workers and students.
institutions with an affirmative view of their roles in
organizing individual activity and resolving
conflict.
Selected Works
The institutions Commons studied most closely
were trade unions and government, particularly the 1893. The distribution of wealth. New York:
judiciary. He developed his theory of the economic Macmillan.
role of government in part on the basis of his study 19 05 . Trade unionism and labor problems.
of the efforts of workers to improve their market Bo s t o n: G i nn .
position and in part on the use of government by 1910-11. (With others.) Documentary history of
both enemies and friends of labour. Commons’s American industrial society. 10 vols. Cleveland:
was an interpretation of trade unions as a non- A.H. Clark.
revolutionary development, as collective action 1916. (With J.B. Andrews.) Principles of labor
seeking to do for workers what the organizations of legislation. New York: Harper.
business attempted to do for their owners and 1919. Industrial goodwill. New York: McGraw-
managers. His study of the reception given unions Hill.
and reform legislation led him to recognize the 1919-1935. (With others.) Histoiy of labor in the
critical role of the United States Supreme Court United States. 4 vols. New York: Macmillan.
(and the courts generally), and its conception of 1921. Industrial government. New York: Macmillan
what was reasonable in the development and 19 24 . The legal foundations of capitalism. N e w
application of the working rules which governed the Yo r k : Ma c m i l l a n .
acquisition and use of power in the market. 1934. Institutional economics. New York:
Accordingly, Commons developed a theory of Macmillan
property which stressed its evolution and role in 1934. Myself New York: Macmillan.
governing the structure of participation and relative 1950. The economics of collective action. Madison:
withholding capacity in the market. University of Wisconsin Press.
Commons also developed a theory of institutions
which focused on their respective different mixtures
of bargaining, rationing and managerial
transactions, all taking place within a legal frame-
work which was itself subject to change. Communications
Although Commons’s institutionalism had dif-
Roger G. Noll
ferent emphases from that of Thorstein Veblen, for
example, in that Commons stressed reform of the
capitalist framework, they shared a view of eco-
nomics as political economy and of the economy as
comprising more than the market. Unlike Veblen, The economics of communications is a loose,
Commons was not antagonistic toward somewhat vaguely defined amalgam of topics in
applied microeconomics. Although having close
1880 Communications
ties to the microeconomic theory of the economics every nation. Subsidization, nationalization and
of information, it is probably best characterized as a extremely detailed regulation of prices and attri-
subfield of industrial organization, regulation and butes of the product are common. In market-
public enterprise that deals with the commu- oriented societies, telecommunications and mail are
nications sector: telecommunications, broadcasting, nationalized or subject to economic regulation for
the print media, the performing arts and the postal much the same reasons that underpin the same
system. Of course, the activities that constitute this policies in other infra structural industries: that
list are somewhat arbitrary, but they reflect what is these industries are natural monopolies and that
both taught and studied by people in the subfield as their performance and pattern of development
well as some important economic realities that profoundly affect the development of much of the
make specialized studies of the communications rest of the economy. But even here, unique
sector a valid category among distinct intellectual externality arguments are brought forth as addi-
pursuits. First among these realities is that the tional factors to be taken into account by policy-
industries in the communications sector are closely makers. First, subscription to the telecommunica-
linked. Broadcasting competes with the performing tions network or access to mail delivery creates the
arts for both audience and inputs, and capability to receive communication from others. A
telecommunications competes with the postal ser- person who decides to mail a letter or place a
vice. Moreover, telecommunications networks are telephone call presumably considers only his or her
capable of delivering broadcast services, and vice net benefits from the communication; the
versa. Among the products over which the postal willingness to pay of the recipient (positive or
system, telecommunications and cable television negative) is not taken into account. Thus, for
compete is the delivery of the output of the print example, the extent of phone service and the pattern
media. of calling can be expected to be inefficient if each
Another unifying theme across communications personbears the full cost of, first, subscribing to the
industries is the connection of the study of the network, and then placing telephone calls. In
sector to the economics of public goods and particular, if some potential customers have too low
externalities. Communications is the production and a willingness to pay to become subscribers, but are
dissemination of information. Some aspects of the also desired objects of communications by others,
production of information are public goods, and the the number of subscribers will be too low if
dissemination and use of information can have subscribers must bear the frill cost of connecting
important external effects. Moreover, most of these them to the network. This argument constimtes the
external effects are non-economic phenomena such foundation for the ‘universal service' objective; that
as political participation, the cultural values held by is, a policy of maximizing the number of
members of a society or the level of violence. subscribers to the telephone network, and the policy
Because of the unique character of these practised nearly everywhere of adopting a price
externalities, the motives for public policy in com- structure for telephone services that subsidizes
munications are closely linked with a society’s installation charges, pay-telephone prices and/or
fundamental political and social values. Thus, monthly access charges to the local network,
freedom of speech and the extent of the right to especially for customers in high-cost areas such as
privacy, as well as the use of control of commu- rural communities.
nications to manipulate the political process, are at A second externality of the telecommunications
the heart of debates over communications policy. system is said to be its contribution to national
security. A joint product of a private
telecommunications network is a ready resource
Rationales for Government Intervention that can be commandeered and used by government
in times of national emergency, such as foreign
Not surprisingly, the role of the public sector is very
attack, natural disasters or accidents. The use of
large in the communications sector in nearly
communications to coordinate a response to
Communications 1881
such an event, then, should play a role in affecting television, a core principle of French broadcasting
the capacity and design of the telecommunications policy was to preserve French culture and values by
system, and often is the basis of an argument for limiting and censoring programmes from other
building into the system more redundancy and nations. In Germany, decentralized, regional quasi-
interconnectability than might otherwise be optimal public broadcast monopolies were created after
and than independent private concerns would World War II to protect simultaneously against
undertake on their own. These contingent needs by capture by the national government or by the
government have been said to constitute a separate national print media barons, either of which, it was
natural-monopoly argument, an example of feared, might use broadcasting to arouse nationally
‘economies of scope’ between private and public destructive political passions. And in the United
uses that can only be captured if the private system States, broadcast licensing has, until recently,
is a single, integrated whole. In the United States, enforced a long set of standards for evaluating
for example, the Department of Defense was a competitors for a given license, including personal
consistent critic of proposals to relax regulation and characteristics and other business holdings of the
increase competition in the telecommunications licensee and both performance and promise about
industry during the 1960s and 1970s. the extent of ‘public service’ programming offered
The externalities associated with the mass media by commercial as well as non-commercial
have to do with the social, political and (educational) outlets.
psychological consequences of the content of Of course, other mass media are not free of
information, and for the most part are dealt with by similar policy constraints, although the print media
scholars from disciplines other than economics. and the arts are usually accorded greater freedom in
(The main exception is research on the effects of the content of their messages than are broadcasters.
advertising, where an inconclusive debate has raged The areas of policy controversy are the definition of
for decades as to whether the informational value of the liability for slanderous attacks and the
advertising exceeds the sum of its direct costs and concomitant definition of privacy rights, the
possible resource misallocation owing to boundaries between pornography and legitimate
manipulation and misperceptions of consumers.) expression, and the principles separating sedition
The analytical foundation for the belief in the from reasonable political discourse.
importance of informational externalities is the The core economic issue in this debate is
proposition that people’s behaviour as citizens, whether the ‘marketplace for ideas’ works well
parents, consumers, workers, friends, and so on, can without intervention, or at least better than is the
be significantly affected, at least in the short run, by case with active political intervention by the very
the informational content of the mass media. Once government authorities whose security and power
one accepts this proposition, the next logical step is can be affected by the content of communications.
to entertain the idea that censorship by the state, at The argument for non-intervention is twofold.
least in principle, can prevent some of the external Positively, it is that in the end people's tastes in
diseconomies of destructive content, while ideas should be accorded the same status as then-
proactive state interventions to channel the content tastes in other goods as long as the consumption of
of the media towards greater educational and communications produces no external disecon-
otherwise uplifting content can provide additional omies. If communications cause bad behaviours,
social benefits. then if people are informed about this fact - and
The most obvious manifestations of these ideas about the punishments exacted if those behaviours
are in broadcasting. The British Broadcasting are manifest - they will efficiently anticipate this in
Corporation was founded on openly paternalistic making decisions about which communications to
principles about the potential of radio broadcasts for receive and how to treat those that are received.
educational and other uplifting purposes. Until the And, as with goods, ideas about how the world
recent move towards decentralization through cable works that prove correct will be perceived, at least
television and towards private, commercial eventually, as superior to less correct ideas.
1882 Communications
Negatively, the argument for an unregulated mar- Whether the product is a written news report, a
ketplace for ideas is a pessimistic forecast of how novel, a theatrical production, a television broad-
political intervention is likely to work: a combi- cast, or 'Dial-A-Joke' on the telephone, the problem
nation of orientation towards propaganda to scivc is fundamentally the same: producers will not
the interests of preserving the status quo and an supply a product unless they can recover the
extreme sensitivity to either vocal, organized single- opportunity cost for creating it, yet the marginal
issue groups seeking to impose their values on cost of providing the product to one more consumer
others or a tyranny of the majority that persecutes does not include any of the production costs of the
those who stray too far from current norms. information. Hence, efficient provision of
The other side of the dispute, usually advocated information requires one of the following: subsidies
more by non-economists, is rooted in observed of the production of information, or price
relationships between communications and discrimination with protection against arbitrage so
behaviour, perhaps best documented in the study of that consumers with relatively low willingness to
the effects of television on violence (especially by pay for information will not be inefficiently
children) and on the manipulation of the news for excluded. In practice, both are common.
short-term political objectives. This position regards Governments subsidize broadcasting and
the efficiency of the marketplace for ideas as performing arts by direct payments, and certain
demonstrably poor, at least in the short run; users of the postal and telecommunications system
implicitly, it accords less credence to the either directly or by engaging in price dis-
proposition that individuals are as rational - indeed, crimination (e.g. the differences in basic monthly
are even proactive - in selecting among competing service rates of telephones between residences and
communications as economic theory assumes. businesses, and the lower postal rates for circulating
Proponents of intervention especially emphasize the the print media). Of course, neither direct subsidies
unformed and manipulable attitudes of children. nor discriminatory prices are explicitly designed in
The final pervasive feature in the communica- a quantitative sense to offset the inefficiencies of
tions sector that deserves further elaboration is the private provision of public goods, so that the issue
partially non-riva Irons nature of the consumption of optimal pricing of communications services is an
of information. All information is a public good in active and still- developing field of research. The
the sense that once a new information product has focus here is on the two fields in which most of the
been created for a first user, it does not have to be work has been done: telecommunications and
created again for subsequent users: in principle, at broadcasting. Moreover, the discussion includes
least, the first use of information does not preclude research on market structure issues because of their
its use by others, in practice, this characteristic may close connection to the implications of alternative
be unimportant. Information must be disseminated pricing policies.
in some way to subsequent users, and the cost of
dissemination may exceed the cost of secondary
creation - as for example can be the case for a Pricing and Market
simple computer program. Or, information may be Structure in
very cheaply privatized so that the public goods Telecommunications
characteristic introduces no significant inefficiency
The telecommunications industry in the United
to a private market system of distribution.
States offers an array of services that until very
Nevertheless, the publicness of information is a
recently were provided as joint products by a
serious issue in an assessment of the performance of
legally protected monopoly. When the monopoly
allocational institutions in the communications
was secure and unquestioned, the pricing problem
sector, and in the design of private market processes
was to devise a price structure that recovered joint
for allocating resources the problems of publicness
and fixed costs with minimal loss of efficiency. As
must be addressed.
the natural monopoly presumption came to be
Communications 1883
called into question, the pricing problem began to by toll. These revenues could be used to construct
incorporate another dimension, to provide appro- systems in high-cost rural areas without causing
priate signals to potential competitors so that the increased prices for basic service elsewhere, again
market structure would evolve efficiently. to encourage universal service.
To understand the rudiments of the telephone Since toll calls pass through local switches they
pricing problem requires some basic knowledge impose a cost on the local network, because local
about the technical characteristics of the telecom- switches must be designed to be large enough and
munications network. The traditional telephone complex enough to accommodate them.
system is best conceptualized as having four Consequently, toll prices would bear some local
components: customer terminal equipment system costs in an efficient pricing structure. In
(a telephone, a computer terminal, a switchboard); a addition, however, toll calls also contributed to
pair of copper wires connecting each terminal ‘non-traffic sensitive’ (NTS) costs - the terminals
device to a central switch; the central switch that and copper wire - even though, by definition, the
serves the local community; and a hierarchy of magnitude of investment required for this equip-
transmission conduits and additional switches that ment was unrelated to the amount of calling.
serve to connect the local switches. Typically the Obviously, this pricing structure not only
telephone price structure has three elements: an encouraged universal service but encouraged local
installation charge for activating a customer’s calling (with a zero price at the margin) while
copper wire pairs; a basic monthly service charge discouraging long-distance calling compared to an
for renting terminal equipment and the copper wire efficiency standard. Encouraging subscriptions to
pairs; and a message toll for placing telephone calls. the system may be warranted on efficiency grounds,
The common practice is for the installation charge although the magnitude of the subscription
to cover only a fraction of installation costs as a externality has not been quantified, and so it is not
means of encouraging universal service, and for the possible to tell whether the amount of the subsidy is
basic monthly charge to entitle the subscriber to justified. Likewise, a subsidy of local calls may be
unlimited local calling - sometimes not confined to desirable, but the method of subsidization is of
other telephones connected to the same local switch, doubtful validity. The external benefit (or cost) of a
but also including calls through adjacent local call falls on the person being called, not on society
switches. Usually the basic monthly charge is much generally.
higher (by a factor of 2 or 3) for business than for Hence, the optimal pricing structure involves a
residences, but within each of these categories it sharing of the costs of calling between the parties to
tends to be approximately the same over wide a conversation, where the costs involve the
geographic areas regardless of differences in cost of operating costs of the system and the effect of calls
service. on the required capacity of the switching system.
Until about 1960, the revenues from installation Only if metering costs were large in comparison
charges and the basic monthly rate approximately with the costs of calling would it make sense not to
covered the cost of local service (including local charge for calls, but with modem electronic
switches). But as long-distance toll calls became switching metering costs are not significant, so that
more important, telephone companies increasingly one cannot justify a subsidy for local calls.
used toll revenues to cover part of the cost of the Moreover, even if one could, there is no justification
local system. This required no increase in toll for taxing long-distance calls to pay the subsidy
prices; indeed, long-distance prices generally were unless one believes that the externality of a local
falling because technological change was call is substantially more important than the
extraordinarily rapid in this segment of the system. externality of a long-distance call.
By simply letting prices fall a little more slowly The general structure of an optimal price struc-
than costs, a large and growing fraction of local ture for the telephone network, given important
network costs could be paid for externalities and natural monopoly, can be derived
as follows. Begin with a basic monthly charge that
1884 Communications
would pay the marginal cost of terminals and effect on subscriptions to the system, and hence
copper wire connections to the local switch, and toll very little effect on efficiency. Finally, differences
charges on all calls that pay the marginal cost of the between residential and business basic monthly
switching and transmission facilities that arc traffic charges would exist only if their externality value
sensitive. These prices need further adjustment, for differed, they imposed different costs on the system
they may collect too much or too little total revenue. or they had different price elasticities.
But prior to this adjustment they must also be Obviously, the pricing structure of telephone
uniformly adjusted downwards to account for the service has never reflected these principles. Until
externality of subscribing and calling (assuming that the 1970s, government officials perceived the extent
people like to receive phone calls). The adjustment of inefficiency of the pricing structure as something
for the toll rate can simply be passed on to the of an academic issue and largely ignored it. But
recipient of the call; however, the basic monthly technological change and the false signals to
rate must come down for everyone. At this point the entrants from the price structure have led to strong
likely case is that the basic monthly rate does not pressures for competitive entry into the formerly
cover the NTS costs, so that further adjustments secure telephone monopoly. Computer technology
must be made. One possibility is a subsidy paid has vastly diversified the demand for
from an economy-wide tax, but more likely the telecommunications, as well as vastly increased its
additional revenues will come from the rate magnitude, and computers and other advances in
structure of the telephone company. The first-best microelectronics have altered the technology of
solution is to raise infra-marginal prices, such as the supply. Examples of the broad range of new
cost of the first few calls made per month, computer-based services include on-line
producing what amounts to quantity discounts for connections to mainframes and data bases for
all types of calls. Alternatively, one could adopt technical and business use, automatic teller
Ramsey pricing, raising the price the most for machines, remote sensing for protection against
services with relatively inelastic demand. fires and burglars, and reservation services. Each of
The resulting price structure would have a these uses has somewhat different technical
number of veiy interesting features. Call recipients requirements, so that the optimal market structure
would pay a share of the price of a call. To for the industry may well be to have a product-
implement this so as only to charge for calls with a differentiated oligopoly, even if each has
positive benefit, the shared cost would start a decent unexploited economics of scale. Moreover, the
interval after the call is answered, and subscribers greater demand created by these technical advances
who desired it could be permitted to designate in allows considerable exploitation of scale economies
advance that they would bear the frill cost of their even in a segmented system.
calls. All prices would be built on marginal costs, On the supply side, advances in electronics have
which means peak-load pricing of calling and prices changed the basic character of the local network. No
for both basic access and calling that were higher in longer arc high-density networks built of dedicated
high-cost areas. To the extent that Ramsey prices copper wire pairs for each terminal. Instead, micro-
were invoked, they would most certainly rely electronic technology allows multiple signals on the
primarily on basic monthly charges, for this has by same wires, and small-scale switches distributed
far the lowest demand elasticity: estimates range throughout the local network that serve to
between —0.02 and -0.10. Thus, even if there is a concentrate lines from many terminals into a
significant externality associated with subscribing smaller number of active circuits, taking advantage
to the network, the Ramsey pricing method for of the fact that not all terminals arc in use
paying for it involves raising the price of basic simultaneously. This reduces the unit cost of
access. Or, putting the matter another way, ignoring capacity and hence the significance of scale
this externality in setting prices will have very little economies in the local network. Moreover, it
undermines a cornerstone of the optimal-pricing
structure that was developed
Communications 1885
above by eliminating most of the NTS costs. If, as the face of diminished or perhaps even non-existent
is becoming the case, customers own their terminal natural monopoly they must erect barriers to
equipment, and if line concentration begins when a competitive entry aimed at the overcharged
small number of terminals are aggregated into a customers.
single pathway to the first switch, then nearly all of
the system that is owned by the local telephone
company consists of traffic- sensitive investment. Prices and Market
Hence, the trend should be away from reliance on Structure in Broadcasting
the basic monthly charge and towards greater
The most common way to pay for broadcasting is to
reliance on message tolls for calling.
provide signals to the audience at no charge, and to
The failure to adjust the pricing system to the
rely on either advertising or government subsidies
realities of costs and technology adds to the pres-
as to the source of revenues. In one sense such an
sure for competitive entry in the parts of the system
arrangement seems to fit the fact that broadcasts are
where prices are higher than the costs of service.
a classic public good; the marginal cost to the
Specifically, the attempt to tax longdistance service
broadcaster of one more person receiving a
in order to subsidize local calling makes relatively
broadcast is zero, and consumption among members
intense users of long-distance service ripe
of the audience is completely non-rivalrous. Hence,
candidates for a competitive longdistance supplier.
any attempt to charge a viewer for a programme can
Large companies with many telephone lines who
introduce inefficiency to the extent that anyone is
can provide their own concentrated connections
thereby excluded from participation who also has a
between their facilities have a strong incentive to
positive willingness to pay to join the audience.
bypass subscriptions to the local network. And
The difficulty with free broadcasting, however,
other electronic pathways for communications, such
is that it does not necessarily result in programmes
as cable television and point-to-point uses of the
that maximize the net willingness to pay of the
radio spectrum, can be exploited to bypass the
audience. Ignoring for a moment the frictions in the
telecommunications network.
political process and the incentives of political
Thus far, government officials responsible for
actors to manipulate programme content to their
telecommunications policy whether as operators of
private benefit, both subsidized and advertiser-
public enterprises or regulators of private utilities,
supported television lead broadcasters to measure
have focused more on the structural aspects of the
their success primarily on the basis of the size of
problem than on pricing issues. Even in the United
audience. In a subsidized system, the objective
States, which has perhaps the greatest commitment
would be to make certain that political support is as
to competition, government policy regarding
high as possible, and in an advertising system, in
entrants has been the binding constraint on the
which the broadcaster is selling the attention of
growth of competition, and the price structure is
viewers, revenues are more or less proportional to
still replete with inefficiencies. The likely
audience size. The issue in both cases is not whether
explanation for this phenomenon is the belief by
audience satisfaction is maximized but whether it is
political actors that the cost of local service to
kept high enough for a large enough number of
residences is the price that is most visible politically
people to maximize revenues from a payment
and that rationalized pricing will cause residential
system that is not based on the intensity of
service to become more expensive, either from
preferences but on the number of satisfied cus-
raising basic monthly charges or from message toll
tomers. In particular, small groups with intense
for local calls. To avoid raising residential price,
willingness to pay for an unusual type of program-
political actors therefore believe that they have to
ming material will generally not have their pref-
keep some other prices above the cost of service,
erences satisfied even if their aggregate
and to maintain these prices in
1886 Communications
willingness to pay exceeds that of a large audience inefficiency inherent in this system is the cost of
for the traditional mass-audience programme. privatizing broadcasts so that on either a per pro-
Three means are available for coping with this gramme or per channel basis they can be sold. Prior
state of affairs. One is to expand the number of to the extensive development of cable television in
broadcast options until all groups are satisfied. the United States, the common speculation was that
Suppose that there is a large mass audience and a privatization of broadcasts would cause a diversion
series of small, specialized ones. As the number of from traditional mass audience programming, with
stations expands, the audience for mass pro- more activity in cultural programmes, educational
grammes each can expect will be the total mass broadcasting and public affairs. The expectation
audience divided by the number of stations, Even- was based upon the belief that higher-income
tually, there will be enough stations so that the groups were more interested in diversity and would
largest specialized taste will constitute a larger have more influence in determining the content of
audience than the share of the mass audience the for-pay systems. In practice, this expectation has
next station could expect to capture, so that a not been realized. The new cable-oriented networks
strategy to maximize audience will lead to spe- for the most part offer programming that is like that
cialization. In the United States, this is more or less provided by off-air broadcasters, such as movies,
the policy with respect to radio broadcasting. In the sports events and regular series. The principal
early years of radio, the Federal Communications exception is in public affairs, where national cable
Commission tried to assure diversity in commercial news and public events networks have succeeded.
broadcasting by specifying the format (e.g. type of Educational and cultural programming, however,
programmes) that a station could broadcast. has been largely unsuccessful. The inference to be
Recently, station formats have been deregulated, yet drawn is that scarcity in television stations caused
the multiplicity of categories remain, in much the an excess demand for television, but primarily for
same fashion that there is a broad spectrum of programmes that were much like those featured by
magazines and books by type of material. off-air stations, largely oriented towards the mass
In television, the strategy of increasing the audience.
number of stations is more difficult to follow. The third means for increasing diversity is to
Television stations consume far more radio fre- create a single, multi-outlet monopoly broadcasting
quency space than radio stations, and no nation has entity. If such an entity seeks to maximize total
thus far been willing to allocate enough high- audience, it will not have as much incentive to
quality radio spectrum to television to provide much duplicate mass programming, because audience
of a test as to whether specialization might take substitution from one channel to the next will have
place in a more extensive industry. An unplanned no value. A second or third mass-audience station
test, however, is under way in Italy, where in the will increase the size of the total audience, but the
1970s the courts declared that the government had evidence indicates that the effect is small compared
no constitutional right to limit the number of to audience diversion. For example, in the United
television stations, and largely unregulated entry States the first television station captures a little less
has taken place on a massive scale. It is too soon to than half of the potential audience available in
tell what the ultimate outcome of this system will prime evening viewing time. No matter how many
be. additional stations are added, the maximum viewing
The second mechanism to produce more diver- share appears to be about 80%, and this is almost
sity in television is to allow the audience to express totally achieved after three or four stations are
a willingness to pay, commonly by installing cable operating. This suggests that a multichannel
television with much higher capacity than off-air monopolist would either diversify programming on
television and charging customers on the basis of the second or third channel, or simply elect not to
the number and type of channels that they elect to broadcast on more than one or two channels,
receive. The depending on the relationship
Communications 1887
between the value of a net increment to the audience Second, the network schedule becomes less vul-
and the costs of adding another channel. nerable to political attack, for centralized govern-
American public television provides an example ment officials who might seek to control it face a
of a novel method of support, for it is one of the few collective of over 150 station licensees, who, in
attempts to implement a decentralized decision turn, are actively using contributions patterns to
process for acquiring a public good (here make decisions about which programmes to
programmes). The first component of the system is acquire.
the method of public financing, which involves The American system of financing public tele-
multiple year, advance funding to erect some barrier vision does not, of course, have pristine efficiency
to political manipulation of programmes. The public properties; neither the voluntary audience contri-
funds are then divided into three components: a butions nor the mechanism whereby stations select
budget for experimental programming that is spent programmes is an incentive-compatible mechanism.
by an independent, quasipublic entity (the Nevertheless, in the inherently imperfect world of
Corporation for Public Broadcasting); a budget for public goods acquisition, they appear to perform
the technical operation of the national network remarkably well, and experimental investigations in
(Public Broadcasting Service); and a direct subsidy a laboratory setting suggest that the method of
of local stations. This subsidy is based in part on the acquiring programmes can be productively
success of the station in obtaining private employed in a variety of settings for collective
contributions from its audience. Thus the station decisions.
subsidy amounts to an attempt to overcome the free-
rider problem faced by viewers by providing
matching funds for their contributions. Remaining Issues
The second component of the system is the
mechanism by which stations decide which pro- Two aspects of the communications sector make it a
grammes will appear on the network. This is ripe area for continuing study. First is the rapidly
accomplished by a combination voting and price evolving technology of supply and demand, and the
system. The price of a programme for each station second is the pervasive and changing influence of
is determined by the size of the community it political processes on the structure and performance
serves, and stations then vote on each programme of the sector.
proposal. If some stations vote against the proposal, With changing technology has come a signifi-
the prices faced by the supporters are increased by cant change in the pattern of demand for services.
an amount necessary to allow the programme to This suggests that historical patterns of use and
cover its costs, and voting proceeds again. The estimates of service-specific demand are unreliable
process continues until programmes are either predictors of the future. Yet relatively little research
purchased or discarded; usually fewer than a dozen has investigated how changing technology - lower
iterations are required to reach a decision. Stations costs, greater possibilities of use, more technical
voting against a programme are excluded from capabilities - have affected key aspects of demand:
broadcasting it; however, stations may later join the the rate of growth by service and customer category
group paying for it by paying a premium price. and the own-and cross-elasticities of demand.
The programme acquisition process decentral- Changing technical possibilities and demand
izes network programming to the stations, thereby should also feed back into the political forces that
serving two ends. First, because the station budgets guide the development of the sector. Most advanced
depend on contributions, or the voluntary industrialized nations are in the midst of transition
willingness to pay of the audience, there exists a in at least some policies regarding communications,
feedback mechanism from the audience to the such as the privatization and introduction of
network that is similar to a pay-TV system. competition in telecommunications in Japan and
Great Britain, and the
1888 Communism
elimination of the state broadcasting monopolies in Park, R.E. 1972. Prospects for cable in the 100 largest
France and Italy. These changes deserve study on television markets. B e l l J o u r n a l o f E c o n o m i c s
3(1): 130-150.
two counts: how these dramatic policy changes Park, R.E. 1975. New television networks. B e l l J o u r n a l
affect performance, and what political forces they o f E c o n o m i c s 6(2): 607-620.
may be creating that will shape policy and industry Rosse, J. 1967. Daily newspapers, monopolistic competition,
structure in the future. and economies of scale. A m e r i c a n E c o n o m i c
R e v i e w 52(2): 522-533.
A period of rapid change is one in which Rosse, J., J. Dertouzos, M. Robinson, and S. Wildman. 1979.
important new knowledge is likely to be forth- Economic issues in mass communications industries. In
coming. One can anticipate that a summary of the Proceedings of the symposium of media
economics of communications a decade or two c o n c e n t r a t i o n , vol. I, 40-192. Washington, DC:
Federal Trade Commission.
hence will contain significant and surprising new Snow, M. 1986. Marketplace for telecommunications:
insights. Regulation and deregulation in industrialised democracies.
White Plains: Longman.
Spence, A.M., and B. Owen. 1977. Television programming,
monopolistic competition and welfare. Q u a r t e r l y
See Also J o u r n a l o f E c o n o m i c s 91(1): 103-121.
Spitzer, M. 1985. Controlling the content of print and
broadcast. S o u t h e r n C a l i f o r n i a L a w R e v i e w
58(6): 1349-1405.
Steiner, P. 1952. Program patterns and preferences, and the
workability of competition in radio broadcasting.
References Q u a r t e r l y J o u r n a l o f E c o n o m i c s 66(2): 194-
223.
Bloch, H., and M. Wirth. 1984. The demand for pay services Taylor, L. 1980. Telecommunications demand: A survey and
on cable television. I n f o r m a t i o n E c o n o m i c s a n d critique. Cambridge, MA: Ballinger.
P o l i c y 1(4): 311-332. von Weiszacker, C. 1984. Free entry into telecommunications?
Brock, G. 1981. The telecommunications industry: The I n f o r m a t i o n E c o n o m i c s a n d P o l i c y 1(3):
dynamics of market structure. Cambridge, MA: Harvard 197-216.
University Press.
Coase, R. 1959. The Federal Communications Commission.
J o u r n a l o f L a w a n d E c o n o m i c s 2(1): 1^40.
Courville, L., A. de Fontenay, and R. Dobell. 1983. E c o -
nomic analysis of telecommunications.
Amsterdam: North-Holland.
Communism
Evans, D. (ed.). 1971. B r e a k i n g u p b e l l . Amsterdam:
North-Holland. Ernest Mandel
Levin, H. 1971. T h e i n v i s i b l e r e s o u r c e . Baltimore:
Johns Hopkins Press.
Machlup, F. 1980. The production and distribution of
knowledge in the United States. Princeton: Princeton
University Press. The term ‘communism’ was first used in modem
Mitchell, B. 1978. Optimal pricing and local telephone
times to designate a specific economic doctrine (or
services. A m e r i c a n E c o n o m i c R e v i e w 68(4):
517-537. regime), and a political creed intending to introduce
Network Inquiry Special Staff. 1980. N e w t e l e v i s i o n such a regime, by the French lawyer Etienne Cabet
n e t w o r k s : E n t r y , j u r i s d i c t i o n , o w n e r s h i p in the late 1830s; his works, especially the utopia
a n d r e g u l a t i o n . Washington, DC: Federal Ulcarie, were influential among the Paris working
Communications
Commission.
class before the revolution of 1848. In 1840, the first
Noll, R. 1985. ‘Let them make toll calls’: A state regulator’s ‘communist banquet’ was held in Paris - banquets
lament. A m e r i c a n E c o n o m i c R e v i e w 75(2): 52- and banquet speeches were a common form of
56. political protest under the July monarchy. The term
Noll, R., M.J. Peck, and J.J. McGowan. 1973. E c o n o m i c
aspects of television regulation.
spread rapidly, so that Karl Marx could entitle one
Washington, DC: Brookings. of his first political articles of 16 October 1842 ‘Der
Owen, B. 1975. E c o n o m i c s a n d f r e e d o m o f Kommunismus und die Augsburger Allgemeine
e x p r e s s i o n . Cambridge, MA: Ballinger.
Owen, B., J. Beebe, and W. Manning. 1974. T e l e v i s i o n
Communism 1889
Zeitung’. He noted that ‘communism’ was already cooperatives promoted for example by the English
an international movement, manifesting itself in industrialist and philanthropist Robert Owen.
Britain and Germany besides France, and traced its Weitling himself created such a community,
origin to Plato. He could have mentioned ancient significantly called Communia. Although they were
Jewish sects and early Christian monasteries too. generally established by a selected group of
In fact, some of the so-called ‘Utopian social- followers who shared common convictions and
ists’, in the first place the German Weitling, called interests, these agrarian communities did not
themselves communists and spread the influence of survive long in a hostile environment. The nearest
the new doctrine among German itinerant contemporary extension of these early communist
handicraftsmen all over Europe, as well as among settlements are the kibbutzim in Israel.
the more settled industrial workers of the Rather rapidly, and certainly after the appear-
Rhineland. Under the influence of Marx and Engels, ance of the Communist Manifesto, communism
the League of the Just (Bund dez Gerechten) they came to be associated less with small communities
had created, changed its name to Communist set up by morally or intellectually selected elites,
League in 1846. The League requested the two but with the general movement of emancipation of
young German authors to draft a declaration of the modem working class, if not in its totality at
principle for their organization. This declaration least in its majority, encompassing furthermore the
would appear in February 1848 under the title main countries (wealth-wise and population-wise)
Communist Manifesto, which would make the of the world. In the major theoretical treatise of
words ‘communism’ and ‘communists’ famous the their younger years, The German Ideology, Marx
world over. and Engels stated emphatically:
Communism, from then on, would designate
Empirically, communism is only possible as the act of
both a classless society without property, without dominant peoples ‘all at once’ and similtaneously,
ownership - either private or nationalized - of the which presupposes the universal development of
means of production, without commodity produc- productive forces and the world intercourse bound
tion, money or a state apparatus separate and apart up with them. .. .The proletariat can thus only exist
worldhistorically, just as communism, its activity,
from the members of the community, and the can only have a ‘world- historical’ existence.
social-political movement to arrive at that society.
After the victory of the Russian October revolution And, earlier in the same passage,
... This development of productive forces (which at
in 1917, that movement would tend to be identified
the same time implies the actual empirical existence
by and large with Communist parties and a of men in their world-historical, instead of local,
Communist International (or at least an being) is an absolutely necessary practical premise,
‘international communist movement’), though there because without it privation, is merely made general,
and with want the struggle for necessities would
exists a tiny minority of communists, inspired by
begin again, and all the old filthy business would
the Dutch astronomer Pannekoek, who are hostile to necessarily be restored ... ([ -6] 1976,
a party organization of any kind (the so-called P-49).
‘council communists’, Ratekommunisten). That line of argument is to-day repeated by
The first attempts to arrive at a communist most orthodox marxists (communists), who find in
society (leaving aside early, medieval and more it an explanation of what ‘went wrong’ in Soviet
modem Christian communities) were made in the Russia, once it was isolated in a capitalist
United States in the 19th century, through the environment as a result of the defeat of revolution
establishment of small agrarian settlements based in other European countries in the 1918-1923
upon collective property, communally organized period. But many ‘official’ Communist Parties still
labour and the total absence of money inside their stick to Stalin’s particular version of communism,
boundaries. From that point of view, they differed according to which it is possible to
radically from the production
1890 Communism
successfully complete the building of socialism and That vision of transition towards communism as
communism in a single country, or in a small an essentially evolutionary process obviously has
number of countries. preconditions: that the countries engaged on that
The radical and international definition of a road already enjoy a relatively high level of
communist society given by Marx and Engels development (industrialization, modernization,
inevitably leads to the perspective of a transition material wealth, stock of infrastructure, level of
(transition period) between capitalism and com- skill and culture of the people, etc.), created by
munism. Marx and Engels first, notably in their capitalism itself; that the building of the new
writings about the Paris Commune - The Civil War society is supported by the majority of the popu-
in France - and in their Critique of the Gotha lation (i.e. that the wage-earners already represent
Programme [of the German social- democratic the great majority of the producers and that they
party], Lenin later - especially in his book State and have passed the threshhold of a necessary level of
Revolution - tried to give at least a general sketch of socialist political class consciousness); that the
what that transition would be like. It centres around process encompasses the major countries of the
the following ideas: world.
The proletariat, as the only social class radically Marx, Engels, Lenin and their main disciples
opposed to private ownership of the means of and co-thinkers like Rosa Luxemburg, Trotsky,
production, and likewise as the only class which has Gramsci, Otto Bauer, Rudolf Hilferding, Bukharin
potentially the power to paralyse and overthrow et al. - incidentally also Stalin until 1928 - dis-
bourgeois society, as well as the inclination to tinguished successive stages of the communist
collective cooperation and solidarity which are the society: the lower stage, generally called ‘social-
motive forces of the building of communism, ism’, in which there would be neither commodity'
conquers political (state) power. It uses that power production nor classes, but in which the individual’s
(‘the dictatorship of the proletariat’) to make more access to the consumption fund would still be
and more ‘despotic inroads’ into the realm of strictly measured by his quantitative labour input,
private property and private production, substituting evaluated in hours of labour; and a higher stage,
for them collectively and consciously (planned) generally called ‘communism’, in which the
organized output, increasingly turned towards direct principle of satisfaction of needs for cvciyonc
satisfaction of needs. This implies a gradual would apply, independently of any exact measure-
withering away of market economy. ment of work performed. Marx established that
The dictatorship of the proletariat, however, basic difference between the two stages of com-
being the instrument of the majority to hold down a munism in his Critique of the Gotha Programme,
minority, does not need a heavy apparatus of full- together with so much else. It was elaborated at
time functionaries, and certainly no heavy apparatus length in Lenin's State and Revolution.
of repression. It is a state sui generis, a state which In the light of these principles, it is clear that no
starts to wither away from its inception, i.e. it starts socialist or communist society' exists anywhere in
to devolve more and more of the traditional state the world today. It is only possible to speak about
functions to selfadministrating bodies of citizens, to ‘really existing socialism’ at present, if one intro-
society in its totality'. This withering away of the duces a new, ‘reductionist’ definition of a socialist
state goes hand in hand with the indicated withering society, as being only identical with predominantly
away of commodity production and of money, nationalized property of the means of production
accompanying a general withering away of social and central economic planning. This is obviously
classes and social stratification, i.e. of the division different from the definition of socialism in the
of society between administrators and classical marxist scriptures. Whether such a new
administrated, between 'bosses' and 'bossed over’ definition is legitimate or not in the light of
people. historical experience is a matter of political and
philosophical judgement. It is in any case another
matter altogether than ascertaining
Community Indifference Curves 1891
whether the radical emancipatory goals projected which reflects the collective individual preferences
by the founders of contemporary communism have of agents. Scitovksy’s notion of a ‘community
been realized in these really existing societies or indifference curve’ essentially allows the analytical
not. This is obviously not the case. convenience of social indifference curves, in certain
circumstances, without having to assume a specific
Bergson-Samuelson social welfare function or
having to assume the restrictive assumptions on
See Also agents’ preferences needed to guarantee that agents
act collectively as a single individual.
► Central Planning The definition of a community indifference
► Collective Agriculture curve is basically simple. Suppose there are m
► Full Communism commodities and n agents. Let x denote a com-
► Marx, Karl Heinrich (1818-1883) modity vector (as m-vector with non-negative
►:: coordinates) and u[ a utility function representing
► Planned Economy agent z’s preferences. We will assume that ui is
► Socialist Economies monotone increasing and quasi-concave. Given a
vector u = (u1 ,...,unj of utility numbers, the
References community indifference curve at u', CIC(u'), is
defined to be the set of all commodity vectors x
Marx, K., and Engels, F. 1845-6. T h e G e r m a n such that there is a distribution (x 1; ..., x„) of
i d e o l o g y . As in Karl Marx and Frederick Engels, commodity vectors satisfying YtX, = x and M,(X,) =
C o l l e c t e d w o r k s , vol.5. London: Lawrence &
Wishart, 1976.
ui,i= 1,..., n, and there is no x < x , x / x which also
has this property. Thus one can obtain any vector x
e CIC(u') by fixing the quantities of all but one
good and minimize the amount of the remaining
good subject to achieving u'. As pointed out by
Samuelson ( ), the community
Community Indifference Curves
indifference curve can be interpreted as a ‘dual’ to
Wayne Shafer the utility possibility frontier. The utility possibility
frontier, for a given x, is the set of all vectors u' of
utility numbers achievable by a Pareto efficient
distribution of x to the agents. Let U (x) denote the
utility possibility frontier for the commodity vector
Keywords
x. Then it is easy to see that CIC(ii ) {x : u e U(x)}
Bergson-Samuelson social welfare functions; and that U(x) =
Community indifference curves; Comparative \u : x c CIC{u )\.
statics; Scitovsky, T.; Utility possibility We will now describe the most important prop-
frontiers erties of community indifference curves. First, each
CIC(u') looks like the indifference curve of a
monotone quasi- concave utility function. That is,
JEL Classifications the set of vectors x such that x > x 1 for some x1 €
Dll
CIC(u') is a convex set. For example, when m 2,
The idea of a community indifference curve, as the CIC(w') is a curve with a diminishing marginal rate
term is commonly used, is due to Scitovsky ( ). of substitution. Second, unlike the utility possibility
The genesis of the idea is the fact that comparative frontier, the community indifference curve is
statics and welfare analysis in economic models is essentially an ordinal concept, that is it does not
simplified considerably if there is a social prefer- depend on the choice of utility functions
ence ordering over aggregate commodity bundles representing agents’ preferences, in the
1892 Community Indifference Curves
following sense. Suppose, for each i, u and v, are x' G CIC(u ) and an x' C CIC(u') such that x lies
two utility functions representing agent i’s prefer- above CIC(u ), in which case x is also ‘better’ than
ences, and let (x,, ..., x„) be a Pareto efficient x'. Thus it is important to realize that community
allocation to the agents. Define 1 / = [ufxQ,.. u„(x„)\ indifference curves cannot be used to define a social
and v = [v^xD, ...v„(x„)]. Then CIC (ii ) = CIV(v'). ordering of aggregate output vectors. Nevertheless,
Clearly, community indifference curves can be community indifference curves can still be a useful
parameterized by a given Pareto efficient allocation analytical device. For example, consider a market
of goods rather than a given vector of utilities. economy with two produced goods. Consider an
Third, assuming smooth utility functions, the equilibrium in which all consumers face the same
marginal rate of substitution for any two prices, in terms of the aggregate output vector x' and
commodities on a community indifference curve is the vector of utilities ii obtained by the agents.
equal to the common marginal rate of substitution of Graphically this equilibrium can be represented by
each agent. Specifically, pick any x e C/C(z/), and drawing the production possibility frontier and
let (x,,... ,x„) be the Pareto efficient allocation of .r CIC(i/), noting they meet at x'. The slope of the
such that ufx) = u . i 1 , . . T h e n for any two production possibility frontier at x' represents the
commodities h and h', the marginal rate of price ratio faced by firms, and the slope of the
substitution of h and h' evaluated at x € CIC(u') is CIC(i/) at x' the common price ratio faced by
equal to the marginal rate of substitution of h for /?' consumers. If firms and consumers face the same
at x, on agent i’s indifference curve through x;. price ratio, then the CIC(u') must be tangent to the
Fourth, and very important, community indifference production possibility frontier at x. Thus no feasible
curves are not, in general, ‘indifference’ curves in x can be produced which can make all agents better
the sense of being level curves of some function. off, so the situation is Pareto optimal. If, however,
Pick any x, and u , u e U(x ), such that u' / u . Then firms face different prices than the agents because
by definition, x c CIC(ur) fi CIC(u ). Thus CIC(ur) of, for example, taxes or tariffs, then the slope of
must either coincide with CIC(u) or intersect the CIC(i/ ) will be different from the slope of the
properly. The condition for two community indif- production possibility frontier, and thus the two
ference curves never to intersect properly is then curves will intersect properly. In this case there
that CIC(u') = CIC(u ) for all u', u G U(x), for all x. must exist an x' on the production possibility
It turns out that this is true if and only if the agents frontier which lies above CIC(i/), so the original
have identical homothetic preferences, in which situation is Pareto inefficient.
case the family of all community indifference
curves will coincide with the family of indifference
curves for the common preferences of the agents.
From the above definition and properties, the
following observation constitutes the basic use of See Also
community indifference curves: if the economy is
currently at a vector of utility numbers u', then x' is ► Arrow’s Theorem
a commodity vector which lies above CIC(u') if and ► iptimality and Efficiency
only if there is some distribution of x' to the agents ► ocial Welfare Function
which will achieve a vector of utilities u" such that ► Velfare Economics
u > u'. In this sense, x' is ‘better’ than any x c
CIC(i/). However, since from above community Bibliography
indifference curves can intersect properly, it may
Samuelson, P.A. 1956. Social indifference curves. Q u a r -
also be that there is a u such that
t e r l y J o u r n a l o f E c o n o m i c s 70 (1): 1-22.
Scitovsky, T. 1942. A reconsideration of the theory of tariffs.
R e v i e w o f E c o n o m i c S t u d i e s 9 (2): 89-110.
Comparative Advantage 1893
likely to specialize the smaller she is compared with each country. Suppose we take the opposite extreme
England in the sense defined above and the more of completely specific labour in each sector, so that
world demand is skewed towards the consumption each country produces a fixed combination of cloth
of wine relative to the consumption of cloth. and wine, with no possibility of transformation. In
this case, labour in the import- competing sector in
each country must necessarily lose, as a result of
The Gains from Trade trade, while labour in each country’s export sector
must gain. It can be shown, however, that trade will
Viewed as a ‘positive’ theory, the principle of com-
improve potential welfare in each country in the
parative advantage yields predictions about (a) the
Samuelson ( )
direction of trade: that each country exports the
sense that the utility-possibility frontier with trade
good in which it has the lower comparative oppor-
will dominate the corresponding frontier without
tunity cost ratio as defined by the technology in that
trade, so that no one need be worse off, and at least
country, and about (b) the terms of trade: that it is
some one better off, if lump-sum taxes and transfers
bounded above and below by these comparative
are possible (Samuelson ).
cost ratios. From a ‘normative’ standpoint the prin-
ciple implies that the citizens of each country
become ‘better off as a result of trade, with the International Factor Mobility and World
extent of the gains from trade depending upon the Welfare
degree to which the terms of trade exceed the
domestic comparative cost ratio. It is the ‘norma- Another very important normative issue is the
tive’ part of the doctrine that has always been the question of the relationship between the free- trade
more controversial, and it is therefore necessary to equilibrium and world efficiency and welfare. In the
evaluate it with the greatest care. Ricardian model world welfare in general will not
In Ricardo’s example the total labour force in be maximized by free trade alone. In the numerical
each country is presumably supplied by an aggre- example considered here Ricardo stresses the fact
gate of different households, each having the same that England can still gain from trade even though
relative productivity in the two sectors. Thus all she has an absolute advantage in the production of
households in each country must become better off both goods, her productivity being greater in both
as a result of trade if the terms of trade lie strictly in cloth and wine, though comparatively greater in
between the domestic comparative cost ratios. The cloth. Suppose that labour in Portugal could
import-competing sector in each country simply produce at English levels, if it moved to England;
switches over instantaneously and costlessly to that is, the English superiority is based on climatic
producing the export good (moving to the opposite or other ‘environmental’ factors and not on
comer of its linear production- possibilities frontier, differences in aptitude or skill. Then, if labour was
in terms of the familiar geometry), obtaining the free to move, and in the absence of ‘national’
desired level of the other good by imports, raising sentiment, all production would be located in
utility in the process. When one country is England, and Portugal would cease to exist. The
incompletely specialized, then all households in that former Portuguese labour would be better off than
country remain at unchanged utility levels, all of the under free trade, since their real wage in terms of
gain from trade going to the individuals in the wine will now be two units instead of one. The
‘small’ country. Thus we have a situation in which English labour would be worse off, if the terms of
everybody gains, in at least one country, while trade were originally better than 0.5 wine per unit of
nobody loses in either country, as a result of trade. cloth, but it is easy to show that they could be
This very strong result depends upon Ricardo’s sufficiently compensated since the utility-possibility
assumption of perfect occupational mobility in frontier for the world economy as a whole is moved
out by the integration of the labour forces.
Comparative Advantage 1895
The case when each country has an absolute ranking the countries in terms of the ratio of their
advantage in one good is more interesting. As is productivities in the two goods, with country 1
easy to see, from Findlay ( ), this case will having the greatest relative efficiency in cloth and
involve a movement of labour to the country with country n in wine. World demand and the sizes of
the higher real wage under free trade, increasing the the labour forces will determine the ‘marginal’
production of this country’s exportable and reducing country j, with countries 1 to j exporting cloth and
that of the lower-wage country under free trade. The (/' + 1) to n exporting wine.
terms of trade turn against the higher-wage country The simultaneous consideration of comparative
until eventually the real wage is equalized. The advantage with many goods and many countries
terms of trade that achieve this equality of real presents severe analytical difficulties. Graham (
wages will be equal to the ratio of labour ) considered several elaborate
productivities in each country’s export sector; that numerical examples, his work inspiring the Roch-
is, the ‘double factoral’ terms of trade will be unity. ester theorists McKenzie ( ) and Jones ( )
This solution of free trade combined with perfect to apply the powerful tools of activity analysis to
labour mobility will achieve not only efficiency for this particular case of a linear general equilibrium
the world economy as a whole but equity as well. model. It is interesting to note in connection with
‘Unequal exchange’ in the sense of Emmanuel ( mathematical programming and activity analysis
) would not exist, while liberal, that Kantorovich ( ) in his celebrated book on
utilitarian and Rawlsian criteria of distributive planning for the Soviet economy worked out an
justice would be satisfied as well, as pointed out in example of optimal specialization patterns for fac-
Findlay ( ). Despite all this, it still seems tories that corresponds exactly to the Ricardian
utopian to expect a policy of ‘open borders’, in model of trade between countries.
either direction, for the contemporary world of
nation-states.
The Three-Factor Ricardian Model
Extensions of the Basic Ricardian Model While most of the literature on the Ricardian trade
model has concentrated on the model of Chapter 7
The two-country, two-good Ricardian model was of the Principles in which it appears that labour is
extended to many goods and countries by a number the sole scarce factor, his more extended model in
of subsequent writers, whose efforts are described in the Essay on Profits has been curiously neglected,
detail by Haberler ( ) and Viner though the connections between trade, income
( ). In the case of two countries and n goods distribution and growth which that analysis
the concept of a ‘chain of comparative advantage’ explores are quite fascinating. The formal structure
has been put forward, with the goods listed in of the model was laid out very thoroughly in
descending order in terms of the relative efficiency Pasinetti ( ). The economy produces two
of the two countries in producing them. It is readily goods, com and manufactures, each of which has a
shown that with a uniform wage in each country all one-period lag between the input of labour and the
goods from 1 to some number j must be exported, emergence of output. Labour thus has to be
while all goods from (J + 1) to n must be imported. supported by a ‘wage fund’, an initially given stock
The number j itself will depend upon the relative that is accumulated over time by saving out of
sizes of the two countries and the composition of profits. Com also requires land as an input, which is
world demand. Dombusch et al. ( ) generalize in fixed supply and yields diminishing returns to
this result to a continuum of successive increments of labour. The wage-rate is
goods in an extremely elegant and powerfiil model given exogenously in terms of com, and
that has been widely used in subsequent literature. manufactures are a luxury good consumed only by
An analogous chain concept applies to the case of the land-owning class, who obtain rents determined
two goods and n countries, this time by the marginal product of land.
1896 Comparative Advantage
Profits are the difference between the marginal comparative advantage will depend upon the slopes
product of labour and the given real wage, which is of these curves at their autarky equilibria, which are
equal to the marginal product ‘discounted’ by the endogenous variables depending upon the sizes of
rate of interest, in this model equal to the rate of the ‘wage fund’ in relation to the supply of land and
profit, defined as the ratio of profits to the real wage the consumption pattern of landowners, as well as
that has to be advanced a period before. Momentary the technology for the two goods.
equilibrium determines the relative price of com As Burgstaller ( ) points out, however, the
and manufactures, the rent per acre and the rate of steady-state solution of the model restores the linear
profit, as well as the output levels and allocation of structure of the pattern of comparative advantage.
the labour force between sectors. The growth of the The zero profit rate in the steady state requires the
system is at a rate equal to the product of the rate of maiginal product of labour to be equal to the given
profit and the propensity to save of the capitalist real wage, and this implies a fixed land-labour ratio
class. It is shown that the system approaches a and hence output per unit of labour in com. We thus
stationary state, with a monotonically falling rate of once again have two fixed technical coefficients, so
profit and rising rents per acre. that the slope of the linear production-possibilities
The opportunity to import com more cheaply frontier is once again an exogenous indicator of
from abroad will have significant distributional and comparative advantage.
growth consequences. Just as Ricardo argued in his The ‘neo-Ricardian’ approach of Steedman (
case for the repeal of the Com Laws, cheaper , ) considers more general time-phased
foreign com will reduce domestic rents and raise the structures of production. Technology alone deter-
domestic rate of profit, and thus the rate of growth.
mines negatively sloped wage-profit or factor- price
The approach to the stationary state is postponed,
frontiers, any point on which generates a set of
though of course it cannot be ultimately averted,
relative product prices and hence a pattern of
while the growth consequences for the com exporter
comparative advantage relative to another such
are definitely adverse. The main doctrinal
economy.
significance of this wider Ricardian model,
however, is to reveal the extent to which the
subsequent ‘general equilibrium’ or ‘neoclassical’ Factor Proportions and the Heckscher-
approach to international trade is already present Ohlin Model
within the Ricardian framework. For one thing, the
pattern of comparative advantage itself depends While J.S. Mill, Marshall and Edgeworth all made
upon the complex interaction of technology, factor major contributions to trade theory, the concept of
proportions and tastes. In his Chapter 7 case the comparative advantage did not undergo any evo-
pattern of comparative advantage is exogenous, lution in their work beyond the stage at which
simply given by the four fixed technical coefficients Ricardo had left it. They essentially concentrated on
indicating the productivity of labour in cloth and the determination of the terms of trade and on
wine in England and Portugal. The production- various comparative static exercises. The interwar
possibility frontiers for each country are linear, and years, however, brought fundamental advances,
comparative advantage is simply determined by the stemming in particular from the work of the Swedes
relative magnitudes of the slopes. As demonstrated Heckscher ( ) and Ohlin ( ). The
in Findlay ( ), however, the Essay on Profits development of a diagrammatic apparatus to handle
model implies a concave production-possibilities general equilibrium interactions of tastes,
frontier at any moment, since there are diminishing technology and factor endowments by Haberler (
returns to labour in com even though the marginal ), Leontief ( ), Lemer ( ) and others
productivity of labour in manufactures is constant. culminated in the rigorous establishment of trade
With two countries the pattern of theory and comparative advantage as a branch of
neoclassical general equilibrium theory.
Comparative Advantage 1897
capital formation and the concept of a natural question posed by Samuelson ( ) of whether
resource ‘frontier’ into the Heckscher-Ohlin trade equalizes not only the marginal product or
framework. rental of capital but the rate of interest itself.
Jones, R.W., and P. Neary. 1984. Positive trade theory. In b a l a n c e o f p a y m e n t s , ed. R. Baldwin et al.
H a n d b o o k o f i n t e r n a t i o n a l e c o n o m i c s , ed. Chicago: Rand McNally.
R.W. Jones and P.B. Kenen, vol. 1. Amsterdam: North- Samuelson, P.A. 1971. Ohlin was right S w e d i s h J o u r n a l
Holland. o f E c o n o m i c s 73: 365-384.
Kantorovich, L. 1965. T h e b e s t u s e o f e c o n o m i c Steedman, I. 1979a. T r a d e a m o n g s t g r o w i n g
r e s o u r c e s . Cambridge, .VIA: Harvard University e c o n o m i e s . Cambridge: Cambridge University Press.
Press. Steedman, I., ed. 1979b. F u n d a m e n t a l i s s u e s i n
Kenen, P.B. 1965. Nature, capital and trade. J o u r n a l o f t r a d e t h e o r y . London: Macmillan.
P o l i t i c a l E c o n o m y 73: 437 460; Erratum, Slight/, .1. 1970. Factor-price equalization in a dynamic
December, 658. economy. J o u r n a l o f P o l i t i c a l E c o n o m y 78:
Kingman, P.R. 1979. Increasing returns, monopolistic 456—4SS.
competition and international trade. J o u r n a l o f Stolper, W.. and P.A. Samuelson. 1941. Protection and real
I n t e r n a t i o n a l E c o n o m i c s 9: 469 479. wages. R e v i e w o f E c o n o m i c S t u d i e s 9: 58-73.
Lancaster, K.J. 1980. Intra-industry trade under perfect Trefler, D. 1995. The case of missing trade and other
monopolistic competition. Journal of mysteries. A m e r i c a n E c o n o m i c R e v i e w 85:
I n t e r n a t i o n a l E c o n o m i c s 10: 151-175. 1029-1046.
Learner, E.P. 1984. Sources of international comparative Vanek, .1. 1963. The natural resource content of U.- S. Foreign
advantage: Theory and evidence. Cambridge, MA: MIT trade 1870-1955. Cambridge. MA: MIT Press.
Press. Vincr, .1. 1937. Studies in the theory of international trade. New
Lemer, A.P. 1952. Factor prices and international trade. York: Harper.
E c o n o m i c a 19: 1-15.
Leontief, W.W. 1933. The use of indifference curves in the
analysis of foreign trade. Q u a r t e r l y J o u r n a l o f
E c o n o m i c s 47: 493-503.
Leontief, W.W. 1954. Domestic production and foreign trade:
The American capital position re-examined. E c o n o m i c
Comparative Statics
I n t e r n a t i o n a l e 7: 9-38.
Lemer, A.P. 1932. The diagrammatic representation of cost John Nachbar
conditions in international trade. E c o n o m i c a 12: 345-
356.
Lemer, A.P. 1952. Factor prices and international trade.
E c o n o m i c a 19: 1-15.
MacDougall, G.D.A. 1951. British and American exports. Abstract
E c o n o m i c J o u r n a l 6 1: 697 724.
Comparative statics in competitive general
McKenzie, L.W. 1954. Specialization and efficiency in world
production. R e v i e w o f E c o n o m i c S t u d i e s 21 equilibrium (GE) environments provide insight
(3): 165 180. into the operation of GE models and a means to
Ohlin, B. 1933. I n t e r - r e g i o n a l a n d i n t e r n a t i o n a l confront GE models with data. This article
t r a d e . Cambridge, MA: Harvard University Press. focuses on a canonical comparative statics
Oniki, H., and H. Uzawa. 1965. Patterns of trade and
investment in a dynamic model of international trade.
prediction: in an exchange economy, aggregate
R e v i e w o f E c o n o m i c S t u d i e s 32: 1 5 38. endowment changes are negatively related to
Pasinetti, L. 1960. A mathematical formulation of the equilibrium price changes. In particular, an
Ricardian system. R e v i e w o f E c o n o m i c S t u d i e s increase in the aggregate endowment of a com-
27: 78-98.
Ricardo, D. 1951. T h e w o r k s a n d c o r r e s p o n d e n c e
modity lowers its equilibrium price.
o f D a v i d R i c a r d o , ed. P. Sraffa. vols. 1 and 4.
Cambridge: Cambridge University Press. Keywords
Samuelson, PA. 1948. International trade and the equalization
of factor prices. E c o n o m i c J o u r n a l 58: 163 184.
Aggregate excess demand; Asset pricing;
Samuelson, PA. 1949. International factor price equalization Comparative statics; Competitive general equi-
once again. E c o n o m i c J o u r n a l 59: 181 197. librium; Debreu-Mantel-Sonnenschein theorem;
Samuelson, PA. 1950. Evaluation of real national income. Demand shocks; Endowment changes; General
Oxford Economic Papers 2: 1 29.
Samuelson, PA. 1953. Prices of factors and goods in general
equilibrium; Gross substimtes; Hicks, J.; Partial
equilibrium. R e v i e w o f E c o n o m i c S t u d i e s 21: equilibrium; Production economies; Rybcyznski
1-20. theorem; Stolper-Samuelson theorem; Supply
Samuelson, PA. 1962. The gains from international trade once shocks; Weak axiom of revealed preference
again. E c o n o m i c J o u r n a l 72: 820 829.
Samuelson. P.A. 1965. Equalization by trade of the interest
rate along with the real wage. In T r a d e , g r o w t h
Comparative Statics 1901
because they do not require estimating normaliza- Brown, D., and R. Matzkin. 1996. Testable restrictions on the
tion vectors, but they are harder to interpret. equilibrium manifold. E c o n o m e t r i c a 64:1249-1262.
Chiappori, P., I. Ekelund, F. Kiibler, and H. Polemarchakis.
As already noted, CS-type reasoning can be 2004. Testable implications of general equilibrium the-
extended to production economies (see Quah ; ory: A differentiable approach. J o u r n a l of
Nachbar ). CS-type reasoning can also be extended M a t h e m a t i c a l E c o n o m i c s A O : 105-119.
Echenique, F., and A. Manelli. 2005. C o m p a r a t i v e
to asset pricing environments (see Quah ).
statics, English auctions, and the
For shocks to preferences rather than endow- S t o l p e r - S a m u e l s o n t h e o r e m . Mimeo. Tempe:
ments or technologies, the analog of CS is Arizona State University.
Grandmont, J. 1992. Transformations of the commodity
Ap ■ Ax < 0 , space, behavioral heterogeneity and the aggregation
problem. J o u r n a l o f E c o n o m i c T h e o r y 57: 1-
35.
where Ax is the change in equilibrium consumption.
Hicks, J. 1939. V a l u e a n d c a p i t a l . Oxford: Clarendon
Profit maximization implies that this inequality Press.
holds for any price normalization. In this respect, Hildenbrand, W. 1983. On the law of demand.
the analysis of demand shocks is trivial compared E c o n o m e t r i c a 51: 997-1018.
Kehoe, T. 1987. Comparative statics. In T h e n e w
with the analysis of supply shocks.
P a l g r a v e : A d i c t i o n a r y o f e c o n o m i c s , ed.
Interest in comparative statics has helped moti- J. Eatwell, M. Milgate, andP. Newman, vol. 1.
vate research on the uniqueness, regularity, and Basingstoke: Palgrave.
stability of equilibria (see Kehoe ). Note that some Kiibler, F. 2003. Observable restrictions of general equi-
of the comparative statics results cited above (for librium with financial markets. J o u r n a l of
E c o n o m i c T h e o r y 110: 137-153.
example, the Brown-Matzkin results and the /7,\v Nachbar, J. 2002. General equilibrium comparative statics.
CS result) do not assume uniqueness or stability. E c o n o m e t r i c a 1 9 : 2065-2074.
Finally, perhaps the most famous comparative Nachbar, J. 2004. General equilibrium comparative statics:
statics results are the Stolper-Samuelson theorem The discrete case with production. J o u r n a l o f
M a t h e m a t i c a l E c o n o m i c s 40: 153-163.
and its dual, the Rybcyznski theorem (for a recent Quah, J. 1997. The law of demand when income is price
treatment, see Echenique and Manelli ). Stolper- dependent. E c o n o m e t r i c a 65: 1421-1442.
Samuelson links changes in factor prices with factor Quah, J. 2003. Market demand and comparative statics when
intensities and changes in output prices. goods are normal. J o u r n a l o f M a t h e m a t i c a l
E c o n o m i c s 39: 317-333.
Rybcyznski links changes in final goods production Snyder, S. 1999. Testable restrictions of Pareto optimal public
with factor intensities and changes in factor good provision. J o u r n a l o f P u b l i c E c o n o m i c s
supplies. Although it is possible to embed these 71:97-119.
results within a highly restricted GE model, they are Williams, S. 2002. Equations on the derivatives of an initial
Compensated Demand
See Also
Tatsuo Hatta
x
= h(p,p), (1 ) ep{p,p)=h{p,p). (7)
where x and p are the consumption and price This identity usually referred to as the S h e p h a r d -
vectors, while p is the utility level. We call h the S a m u e l s o n L e m m a was obtained by Hicks
compensated (or Hicks) demand function. Formally, (1946, p. 331), Samuelson ( , p. 68,
it may be defined as the solution function of the -54, pp. 15-16), and Shephard ( ).
following minimization problem: To prove the Shephard-Samuelson Lemma, let x
be an expenditure-minimizing vector that yields p
minp'x subject to u(x) = p, (2) where u is at the price p , i.e.,
x*=h(p*,p). (8)
the utility function.
Define the gain function g by
Slutsky Compensation
Mosak’s equality has played an important role in endowment bundle, the utility level is hardly
index number theory. The Laspeyres index, which changed by a slight change of the price vector away
is widely adopted in practice, is based on the from the initial one. Equations ( ), ( )
Slutsky compensation, since it indicates the change and ( ), yield
in income that would be needed in the current year
in order to buy the commodity bundle bought in the s(p,x) = h\p,w{p,x)\.
base year. Under the Slutsky substitution effect, the
individual can be no worse off and is likely to be Differentiating this with respect to p and noting ( ),
better off since he is able to purchase at least the we immediately have ( ).
bundle he had before the price change. Thus he is
‘overcompensated’ for the price change (see
Samuelson , pp. 4-5). The price index that truly Historical Notes
reflects the utility change should be based on the
Slutsky ( ) first established the homogeneity
Hicks compensation. Such an index is difficult to
and symmetry conditions on the substitution matrix.
compute because utility levels are not observable.
Since he did not have the concept of utility-
Mosak’s equality reveals, however, that for small
maintaining compensation or the compensated
price changes the Laspeyres index is a good
demand function, he derived these properties for the
approximation to the ‘ideal’ index.
Slutsky compensated substitution matrix sp \p. m(p,
Let us now formally state Mosak’s equality.
y)] rather than for the Hicksian matrix hp [p. v(p,y)\.
Define the function s by
During the 1930s, Hicks and Allen ( ) and Hicks (
, 1946) gave verbal
s(p,x) = m(p,p'x).
interpretations to the substitution matrix in terms of
( 22 ) the Hicksian compensation. But in their formal
derivation of its properties, they defined the sub-
The function s is the demand function with the fixed stitution matrix to be the Slutsky substitution
endowment bundle x. We call sp \p, m (p, y)\ the matrix, as is clear from the following passage from
Slutsky substitution matrix. It represents the the Mathematical Appendix of Hicks (1946, p.
variation in demand when the price change is 309):
accompanied by an income compensation that .. .it follows from the equation that the substitution
term represents the effect on the demand for xs of a
keeps the original consumption bundle m(p,y) on
change in the price of xr combined with such a
the budget plane. Mosak’s Equality may now be change in income as would enable the consumer, if he
expressed as chose, to buy the same quantities of all goods as
before, in spite of the change in Pr.
sp\p,m(p,y)\ = hp\p,v{p,y)\. ( 23 ) Thus they too did not explicitly state the func-
tion h, much less gave a name to it.
To prove this, first define the function w by To the writer’s knowledge, Samuelson ( ) is
the first author who explicitly stated ( ) and derived
w(p,x) = v(p,p'x). ( 24 ) Hicksian rules directly from it, though he did not
The value of w(p,x) represents the maximized give a name to it (see Samuelson , (43) on p. 103
utility level when the endowment bundle .r is given. and (99) onp. 114). Subsequently, Samuelson ( , p.
Differentiating this and applying Roy’s identity, we 8, nl) gave a heuristic proof of the
get symmetry and the negative-semidefiniteness condi-
tion of tiie Slutskycompensated substitution matrix
wp\p,m(p,y)]=0. ( 25 ) as envelope properties, i.e. in a spirit very much
similar to the one given above. In this path-breaking
Thus, if the utility maximizing bundle under a proof, however, he relied upon the indirect utility
certain price vector happens to be equal to the
1908 Compensated Demand
junction rather than the expenditure junction, thus ( ) and Harbeiger’s ( ) analysis of the
without using the mirror image minimization prob- excess burden of taxation, H i c k s ’ s ( ) compen
lem or the compensated demand junction. sating and equivalent variation that illuminate the
McKenzie ( ) and Karlin ( ) concept of consumers’ surplus, and Alonso’s ( )
explicitly rent bidfunction, which keeps a consumer’s
defined the function h, and derived its properties by utility constant regardless of the location he
taking full advantage of the Shephard- Samuelson chooses. A number of economists of the Chicago
Lemma. However, they had to use a global property School, including Friedman ( ), Bailey ( ),
of the expenditure function in deriving the negative and Becker ( ), used the concept of the Hicks
semidefiniteness condition, which is a local compensation in various welfare analyses. Each of
property. The proof of this condition employed these authors gave a different name to the concept.
above is solely based on a local minimization Friedman called its graph the Marshallian demand
condition, and serves as the mirror image curve contrary to the current usage of this term;
counterpart of Samuelson’s ( ) proof for the Bailey, the constant-real-income demand curve, and
Slutsky substitution matrix. Diamond and McFad- Becker, the pure demand curve.
den (1973) attribute this proof method to Gorman The explicit use of the compensated demand
(see Gorman ). The above proof of Mosak’s junction gave rise to dramatically clearer restate-
equality is due to Hatta and Willke ( ). ments and proofs of many existing theorems. Its
Silberberg ( ) considered the general max usefulness has reached far beyond that, however.
imization problem with possibly many constraint Since the early 1970s, this function has been used
functions where both target and constraint functions for the analyses of the welfare impacts of para-
may be non-linear. Extending Samuelson’s ( ) metic shifts in various general equilibrium models,
proof method, he showed that the compen as stated by Ohyama ( ), Takayama
sated solution function of that problem satisfies ( ), Diamond and MacFadden ( ), Dixit
generalized forms of the symmetry and non- ( ) and Hatta ( , ), and comprehen-
negative definite conditions, as long as the sively studied by Dixit and Norman ( ) and
constraint functions do not contain shift parameters. Woodland ( ).
Silberberg’s proof boils down to Gorman’s in the The history of the compensated demand func-
standard expenditure minimization problem. Hatta ( tion is curious. The properties of its derivatives
) extended the concept of the compen were known and the concept of Hicks compensation
sated demand function to the case where the same used iu many fields of economics before the
shift parameters may appear simultaneously in both function i tself was stated or named. Perhaps this is
target and constraint functions in the general because economists has an unconscious reluctance
problem. He showed that the properly compensated in putting the elusive concept of the utility level as a
solution function in that problem satisfies variable of a function. Once explicitly stated and
generalized forms of symmetry and non-negative well understood, however, the compensated demand
definite conditions. He also established an envelope function has found a powerful use in welfare
theorem that contains both the Shephard-Samuelson economics, precisely because it has the utility level,
Lemma and Roy’s Identity as special cases. His rather than income, as an explicit variable.
proof integrates Samuelson’s ( ) and
Gorman’s into one. The global char
acterization of the compensated demand function
by McKenzie and Karlin was extended into duality
theory, as surveyed by Diewert ( ). See Also
In many branches of economics outside the
demand theory, the concept of the compensated
demand function was implicitly used without being
explicitly stated. Examples are Hotelling’s
Compensating Differentials 1909
Keywords
Compensating differentials; Hedonic wage
function; Labour economics; Rosen, S.; Super-
stars, economics of; Unobserved skill; Urban
economics; Urban environment and quality of
life; Wage heterogeneity, sources of; Wage
premium; Worker heterogeneity
John S. Chipman
Conclusion Abstract
The compensation principle holds that one of
A job’s wage is not a sufficient statistic for its two possible states constitutes an improvement
quality. Coal miners are paid a relatively high wage over the other if the gainers could compensate
but the work is dangerous and unpleasant. A major the losers for their losses and still be at least as
research agenda in labour economics investigates well off as in the original state. The conflict
how much people implicitly pay for non-market job between potentiality and actuality - one situation
attributes. Credible estimates of wage compensating is judged better than another if everybody could
differentials for living in less polluted cities or be made better off in the new situation even
working in risky industries would greatly aid policy though some in fact become worse off - ensures
analysis that seeks to measure the benefits of that the compensation principle does not allow
environmental and safety regulation. for value-free policy decisions.
Compensation Principle 1913
Keywords
though, at the particular moment, such a person
Autarky; Baldwin envelope; Barone, E.; might have a cold and the other one not. The
Bergson-Samuelson social-welfare function; compensation principle is usually used to make
Bickerdike, C. F.; Caimes, J. E.; Compensation comparisons in this sense; one state of the economy
principle; Competitive equilibrium; Consumer is sounder, healthier, more robust, or has greater
surplus; Cost-benefit analysis; Deadweight loss; productive potential, than another. What this
Debreu, G.; Dupuit, A.-J.-E. J.; implies is that states under comparison are usually
Exchange; Excise taxes; Free trade; Funda- not unique, singleton states but composite ones, or
mental theorem of welfare economics; Gains sets of states. Formally, the objects being compared
from trade; Game theory; General equilibrium; are usually sets of commodity bundles that could be
Hicks, J. R.; Hotelling, H.; Ideal money; made available to the aggregate of consumers,
Income-compensation function; Interpersonal described in the literature as ‘situations’ in contrast
utility comparisons; Kaldor, N.; Kuznets, S.; to single ‘points’ in such sets (cf. Baldwin ).
Lemer, A. R; Little, I. M. D.; Lump sum taxes; Examples of comparisons in which the com-
Marshall, A.; Mishan, E. J.; Monopoly; New pensation principle is typically used are those
welfare economics; Optimal tariffs; Pantaleoni, between (a) a perfectly competitive system of
M.; Pareto optimality; Partial equilibrium; industrial organization and an imperfectly com-
Pigou, A. C.; Proportional income tax; Revealed petitive one; (b) free trade and no trade (or
preference; Ricardo, D.; Robbins, L. C.; restricted trade); (c) the state of an economy before
Samuelson, P. A.; Scitovsky indifference and after a war, or depression, or change in
surface; Scitovsky, T.; Taxation of income; productive techniques. Most but not all of these
Transferable utility; Utility-possibility frontier; types of comparisons are relevant to policy deci-
Value judgements; Viner, J. sions; and the policy decisions are usually not of an
ad hoc type (for which the compensation principle
would hardly be appropriate) but of a fundamental
nature concerning the underlying system of
JEL Classifications
B4 industrial organization and trade.
Inasmuch as the principle can be applied with-
The term ‘compensation principle’ refers to the out the need to make interpersonal comparisons,
principle that, in comparing two alternative states in some of its more ardent proponents have maintained
which a given community of persons might find that it is ‘value-free’. However, there can be no
itself, one of the states constitutes an improvement doubt that it does require acceptance of some value
over the other (in the weak sense including equiv- judgements, since the Pareto criterion itself
alence) if it is possible for the gainers to compen- constitutes one - albeit a widely accepted one.
sate the losers for their losses and still be at least as Another value judgement implicit in the principle as
well off as in the original state. it has usually been applied is that each individual is
If the hypothetical compensation is actually car- the best judge of his or her own wellbeing; while
ried out, the principle reduces to the Pareto crite- also quite widely accepted, this one is obviously
rion: all are at least as well off, in one state controversial, and in fact government policy
compared to the other. There is no need to invoke measures are often called for precisely in those
the compensation principle in such a case. On the instances where it is clearly an untenable
other hand, if the principle is used to compare two assumption. But the most important and contro-
unique alternative states in which a community versial way in which value judgements enter into
might find itself, neither of which is Pareto- the compensation principle is in the conflict
superior to the other, the principle seems quite between potentiality and actuality: one situation is
arbitrary unless interpreted in a broader context. judged better than another if everybody could be
There is a sense in which one person might be said made better off in the new situation even though
to be basically healthier than another even some in fact become worse off. This
1914 Compensation Principle
lacuna in the principle has led Little ( ) and , p. 272) and Marshall ( , p. 447; 1920,
Mishan ( p. 467) who used the concept of consumers’ surplus
) to formulations in which compen
to compare the losses of consumers (say from a
sation tests are combined with explicit distributional
value judgements, and Samuelson ( bridge toll or an excise tax) with the gains to the
,
government. The demonstration that the former
) into a full-fledged ethical system in which
exceed the latter, so that consumers cannot be
compensation is carried out to the extent that the
ethical norms dictate. compensated for their losses out of the government
revenues, provided a convincing case for the
In many applications the compensation principle
superiority of income tax to an excise tax (or for the
is difficult to formulate in a precise manner unless
one assumes absence of externalities in con- superiority of government subsidization of bridge
construction to its financing of it by tolls), and at
sumption, so it is usually formulated (but with some
the same time provided scientific prestige and great
notable exceptions - for example, Coase ) under the
intuitive appeal to a method that was able to reach
assumption that each person’s welfare depends only
on his or her own consumption of goods and such a definitive conclusion and furnish a measure
of the ‘deadweight loss’.
services. In most applications, the data available for
making comparisons are, almost inevitably, limited While Dupuit and Marshall used partial-
equilibrium analysis, Pareto (
to aggregative information on the actual state of the , p. 58) was
economy in each situation; much of the work in the first to introduce the concept into general-
equilibrium theory, in the course of an article
applying the principle therefore consists in using
devoted to proving the optimality of competitive
economic theory to make inferences from the actual
observations concerning underlying conditions in equilibrium. In the first part of this article
the economy. By its nature, the compensation (summarized by Sanger ), Pareto used as his
criterion of optimality the sum of individual
principle is limited in its application to comparing
alternative states (or sets of states) of a givenutilities; in the second part, however - acknowl-
edging the criticisms and suggestions of Pantaleoni
community of individuals; thus, it cannot be applied
and Barone (both admirers of Marshall, which
(at least not literally ) to historical comparisons of a
Pareto was not) - he reformulated the problem so as
country’s condition over time (since the population
has changed) or to comparisons of the living to sum not the utilities of different consumers but
conditions of different countries (since the the quantities they consume. His criterion of
optimality (1894, p. 60) was that it should be
populations are different). However, extensions of
impossible for one person to gain without another
the principle to cover such comparisons are possible
losing - ‘Pareto optimality’ - a criterion that had
provided suitable additional empirical assumptions
also been introduced by Marshall ( , pp. 449-50;
and value judgements are accepted; for example, if
all individuals are assumed to have identical 1920, pp. 470-1). A more
preferences, one could ask whether there exists arefined version of Pareto’s argument later appeared
in the Corns (Pareto -7, vol. 1, pp. 256-62; vol. 2, pp.
redistribution of income in each period (or country)
88-94).
such that each individual in the one situation would
be better off than each individual in the other. This The proposition formulated by Pareto ( )
anticipated what has now come to be known as the
would obviously entail additional value judgements
along with the additional empirical assumptions. ‘fundamental theorem of welfare economics’,
namely, that every competitive equilibrium is
Pareto optimal and, conversely, every Pareto opti-
Historical Development: From Dupuit to mum can be sustained by a competitive equilibrium.
Hotelling Pareto considered the problem faced by a socialist
state striving to attain an outcome in which it was
The compensation principle may be traced back to impossible for one person to gain without another
Dupuit ( , pp. 359-60; Arrow and Scitovsky losing. The Ministry of Justice would concern itself
with problems of income
Compensation Principle 1915
distribution, and the Ministry of Production with the rest will still be worse off than originally.
resource allocation and choice of production coef- (Barone used what is now known as the criterion of
ficients. A weakness of Pareto’s argument was that revealed preference to make inferences concerning
he assumed a price system already to be established preferences from data on prices and incomes). Such
- perhaps our socialist state needs the prices of its a state was described by Pareto and Barone as
capitalist neighbours to guide it. Pareto further ‘destruction of wealth’, and its measure by
assumed that each individual’s budget constraint aggregate income loss at the competitive-
was adjusted by the addition of a parameter (a equilibrium prices provided an alternative to the
lump-sum subsidy or tax) controlled by the deadweight loss considered by Dupuit and
government. The government’s objective was to Marshall. Barone ( ) also related
maximize the sum of these parameters, which he his arguments to those of Marshallian consumers’-
showed was equal to aggregate profit - the value of surplus analysis.
commodities consumed less the value of factor Lemer ( ) invoked the compensation prin
services supplied, equal to the value of firms’ ciple in his proposed method for measuring
output less the outlay on their factor inputs. If it monopoly power, describing it as ‘a loss to the
were possible to increase all the parameters, the consumer which is not balanced by any gain reaped
existing situation would not be Pareto optimal; if by the monopolist’. In this paper Lemer also
their sum were a maximum, it would not be formulated, apparently independently, the concept
possible to increase one of them without decreasing of Pareto optimality.
another, and the outcome would be Pareto optimal.
Hotelling ( ) made a noteworthy contribu
Pareto showed that maximization of aggregate
tion by providing an alternative demonstration of
profit at the given prices, subject to the resource-
the inferiority of excise taxes to income taxes, using
allocation and production- function constraints,
the compensation principle directly. He considered
would lead to cost- minimization and zero profits.
a single individual consuming n commodities in
(For mathematical details of Pareto’s arguments see
amounts qj and facing market prices pj. Prior to the
Chipman , pp. 88-92). Pareto summarized this result
imposition of the excise taxes (or tolls), the
by stating (1896-7, vol. 2, p. 94):
Free competition of entrepreneurs yields the same individual consumes a bundle q° at pricesp° and
values for the production coefficients as would be income (or fixed component of income) m°, which
obtained by determining them by the condition that maximizes a utility function U(q) subject to the
commodity outputs should be chosen in such a way
budget constraint p° ■ q = m\ Subsequent to the
that, for some appropriate distribution, maximum
ophelimity would be achieved for each individual in introduction of taxes, market (tax-inclusive) prices
society. and after-tax income are p1 and tn respectively, and
The last clause was Pareto’s unfortunately a bundle q is chosen which maximizes U(q) subject
awkward way of stating the criterion of Pareto to p1 ■ q = m . The government collects r=(/?1 —
optimality. p°)-c^ — (m1 m°)
Barone ( ), who had originally spurred in revenues. Since the government is assumed to
Pareto on to this line of argument, developed it collect [p'j — p^j ■ q'j in taxes on commodity j.pj
further himself. He noted that a competitive equi- must be identified with the production cost after the
librium has the property that aggregate profit is at a tax (as well as with the market price = production
maximum at the equilibrium prices, hence, for any cost before the tax); this is a fairly restrictive
feasible departure from this equilibrium, valuing assumption, since it implies that the tax does not
consumption and factor services at the equilibrium affect production costs. (In this respect Hotelling’s
prices, some individuals may gain and others will treatment is less general than Dupuit’s and Mar-
lose, the losses outweighing the gains so that, even shall’s, involving infinite elasticities of supply). We
if the gainers part with all their gains, may denote the ad valorem excise-tax rate on
commodity j by tj = p)/p<j — L and a proportional
income-tax rate by to = 1 - ni1 /m° (negative
1916 Compensation Principle
taxes are interpreted as subsidies). The govern- charitable interpretation of Hotelling’s contribution
ment’s revenues are see Silberberg ).
Hotelling went on to assert that his proposition
n
r=
Yl lJph 'j + = 0, could be extended to many consumers (though no
jI details or proof were provided), and he proceeded to
examine the consumers’-surplus measure of loss \
assumed zero since the government distributes the
(pl -p°) ■ (q1 - q ) = \rIp]- (</' -</') (whereTisa
total proceeds of these excise taxes back to the
diagonal matrix of excise-tax rates tj). He also made
consumer (or taxes the consumer if these are neg-
some general observations ( , p. 267) that, to
ative). The consumer’s budget constraint after the
this day, constitute what is probably the best state-
imposition of the taxes is
ment to be found of the philosophy underlying the
compensation principle.
]C(i1 ij)phj o 'o)w°.
The Years of the New Welfare
These two equations together imply that q1 satisfies Economics
the budget constraintp° ■ q = m°, hence ql was in
the consumer’s original budget set. Therefore, In the cases to which the compensation principle
setting aside the ‘infinitely improbable ... was applied by Dupuit, Marshall, Lemer and
contingency’ that q and q1 lie on the same indif- Hotelling, compensation was made between the
ference surface, Hotelling concluded ( p. 252) that class of consumers on the one hand and a govern-
‘if a person must pay a certain sum of money in ment or a monopolist on the other. While Pareto
taxes, his satisfaction will be greater if the levy is and Barone had discussed compensation between
made directly on him as a fixed amount than if it is different classes of consumers (as had Hotelling in
made through a system of excise taxes which he can his general remarks) their work was unknown to
to some extent avoid by rearranging his production English-speaking economists until the publication
and consumption’. in 1935 of the English translation of Barone’s work.
Unfortunately Hotelling overlooked the fact that Even this seems not to have struck home, however,
if tj = I for all j then the government’s budget since Kaldor ( ) cited pas-
constraint implies p° ■ q1 = —m°to/t, whence /(, = sages from Harrod ( ) and
—t and q1 = q°. That is, a system of uniform ad Robbins ( ) to
valorem excise taxes is equivalent to a proportional the effect that, since movement towards free trade
income tax. This was pointed out by Frisch ( ) would affect different classes differently, no sci-
and accepted by Hotelling ( ). entific statement could be made concerning the
As Frisch made clear, what Hotelling really proved beneficial effect of free trade without making
was the non-optimality of a system of non- interpersonal comparisons of utility.
proportional excise taxes or subsidies when selling Kaldor ( ) proceeded to sketch an argu
prices are given. If these selling prices are equal to ment to the effect that removal of an import duty
marginal costs, Hotelling’s theorem shows that (using the classical example of repeal of the Com
market prices should be proportional to marginal Laws) would result in a situation in which the
costs. Since incomes are fixed in Hotelling’s losses incurred by the landlords could be compen-
formulation, income taxes may be regarded as sated by the gains (through lower import prices)
lump-sum taxes. If institutional consideration make obtained by the other consumers. Such an argument
excise taxes impossible for one commodity (say cannot be correct, however, since, as Kaldor ( )
leisure), then they must be zero for all commodities pointed out only a year later, it follows
and optimality requires that prices be equal to from Bickerdike’s theory of optimal tariffs that a
marginal costs. (For a less country can gain from the imposition of a suffi-
ciently small duty, and, as Graaff ( ) and
others later demonstrated, the compensation prin-
ciple can be used to show that, with suitable
Compensation Principle 1917
compensation, all persons can gain. Unless the rate consumed, under both (balanced) free trade and
of com duty was above the optimal tariff rate, the autarky the budget equation p ■ x = p ■ y holds.
opposite conclusion would follow to that indicated Letting superscripts 0 and 1 denote equilibrium
by Kaldor ( ). values under autarky and free trade respectively, it
A previous attempt by Pareto ( ) to show follows that
by means of the compensation principle that a tariff
would lead to ‘destruction of wealth’ was defective, pl-xl-wl-ll >pl-x ° - w 1 -1°.
since he assumed trade to be balanced in domestic
Assuming all N individuals to be identical in their
prices and thus he failed to take account of the
preferences and ownership of factors, and dividing
improvement in the terms of trade and the beneficial
this inequality through by the number of indi-
effect of the tariff revenues.
Other attempts prior to 1939 to make the case viduals, it states that each person chooses (x'/A, l1
for free trade suffered from vagueness both in IN) under free trade when (x°/A, PiIV) is available,
specifying the criterion of gain and in specifying the hence (if/?1 / p°) each person prefers (x'/A, l1 IN) to
alternative with which free trade was being (x°/A, PlN). Therefore free trade is Pareto- superior
compared. Ricardo ( , p. 25) stated: ‘There are to autarky.
two ways in which a country may be benefited by Samuelson went on to assert ( , p. 204)
trade - one by increase of the general rate of that, if the assumption of identical individuals is
profits ... the other by the abundance of commod- dropped, then, although it could no longer be said
ities, and by a fall in their exchangeable value, in that each individual was better off under free trade,
which the whole community participate’. According ‘it would always be possible for those who desired
to Caimes ( , p. 418), ‘the true trade to buy off those opposed to trade, with the
criterion of the gain on foreign trade [is] the degree result that all could be made better off. This
in which it cheapens commodities, and renders them argument went unchallenged until Olsen ( )
more abundant’. A hint of a compensation principle pointed out that, if compensation were
is found in Viner ( pp. 533—4): paid from gainers to losers, a new equilibrium price
free trade ... necessarily makes a v a i l a b l e to the
constellation p1 would result, and the argument no
community a s a w h o l e a greater physical real longer follows. For this reason Samuelson’s 1939
income in the fomr of more of a l l commodities, results has come to be known as the gains-from-
and ... the state ... can, by appropriate supplementary
trade theorem for the ‘small- country case’, though
legislation, make certain that removal of duties shall
result in more of e v e r y commodity for e v e i y this interpretation was not suggested by Samuelson
class of the community. at the time. But this description of Samuelson’s
result is inaccurate. Generalizing his argument we
Like Kaldor’s statement, this is formally incorrect;
can say that if (*!.<) are the allocations of (x, ll) to
but it was sufficiently suggestive to stimulate
individual i, where f , xti ri and fy!! , l\ = I', the given
Samuelson ( ) into providing a formal proof
the allocations (xj, l)) of (x1, /') under free trade one
of a gains-from-trade theorem, albeit under very
can find Pareto-optimal allocations (x®, if) of (y\ P)
restrictive assumptions.
under autarky such that
Samuelson ( ) assumed that an open econ
omy had a locus <p(y, l) = 0 of efficient combina- 1
• xj - w1 ■ l) >p' -x" H- • /f for
tions of outputs y and (variable) factor services /,
and asserted that vectors of prices p and factor z = 1, 2, .. . ,A .
rentals w in competitive equilibrium would be such
that aggregate profit p ■ y - w ■ l is a maximum. This proves that for any free-trade equilibrium it is
This is the same as the proposition of Pareto ( ), possible to find a weakly Pareto-inferior Pareto-
and Barone ( ) referred to above. Let optimal autarky equilibrium. It does not prove the
ting x denote the bundle of commodities obverse proposition that for any autarky
1918 Compensation Principle
equilibrium it is possible to find a weakly Pareto- allocation A'1 of v1. This follows from the same
superior free-trade equilibrium. A general gains- argument that establishes the Pareto optimality of
from-trade theorem was therefore yet to be the assumed competitive equilibrium in period 1.
established, but Samuelson had provided an The non-existence of an allocation X of y° such that
important first step. XRX1, where n(X') = y1, constituted for Hicks the
Hicks ( ) ushered in the ‘new welfare eco definition of an ‘increase in real social income’.
nomics’ with a synthesis building on Hotelling ( Kuznets ( ) pointed out by an example
) and Kaldor ( ) and based on the com that, in the case considered by Hicks, it could also
pensation principle, making it possible, according to be true that there is no allocation X of y1 which is
him, to make policy proposals in favour of weakly Pareto superior to the actual allocation X° of
economic efficiency which were free of value y°. Accordingly he suggested that Hicks’s criterion
judgements. Hicks’s most original contribution be supplemented by the condition that there should
(Hicks ) was his attempt to apply the compensation exist an allocation X of y1 that is weakly Pareto
principle to data on a country’s real national income. superior to the actual allocation X° of y°. But while
This was a natural thing to try to do, since Pigou’s ( the latter criterion impliesp° ■ y1 > p° ■ y°, it is not
) main work was implied by it, so a national-income comparison
devoted to evaluating a country’s welfare by using current and base prices would still not yield
national-income comparison, and it was largely Kuznets’s criterion.
Kuznets’s criticism of Hicks was similar to the
Pigou’s resort to interpersonal comparisons in order
objection raised by Scitovsky ( ) to the criterion
to justify this that was the object of Robbins’s (
proposed by Kaldor ( ). According to
) criticism.
Scitovsky’s interpretation of Kaldor, an allocation
Hicks’s ( ) basic tool was the ‘revealed-
X1 of y1 is better than an allocation X° of y°, if there
preference’ comparison which had been employed
exists a reallocation A ofy , which is Pareto superior
by Barone ( ) and Hotelling
to X°. Scitovsky objected that this gave preference
( ). If observations are available at times
to the status quo ante, and besides, he pointed out
0 and 1 of a country’s national income in period-1
that the criterion was internally inconsistent in the
prices, and it is recorded that p1 ■ y1 > p' ■ y° (where
sense that it allowed two such pairs (X, y ) to be
p\ yl are vectors of prices and outputs at time t),
superior to each other. He therefore proposed that
what can be inferred? In the first place, to make any
Kaldor’s test be supplemented by the criterion that
headway one must assume that the observed there exist a reallocation X° ofy° that is Pareto
situations are competitive equilibria. Let us define inferior to X1.
an allocation of a commodity bundle x as an N x n The literature on ‘compensation tests’ suffered
matrix X whose /th row, .r„ is the bundle of n from ambiguity as to the domain of definition of the
commodities allocated to individual i, and whose relations and internal inconsistency of the relations.
row sum a{X) = , v, is equal to .r. It was pointed out by Gorman ( )
As between two bundles x?,xjconsumed by indi- that the relations were intransitive. It was shown in
vidual i, let us define xjRpf to mean that x} is Chipman and Moore ( ) that the
preferred or indifferent to xf by individual i, where Hicks-Kuznets and Scitovsky double criteria, as
Rt is a continuous, convex, monotonic total order, well as the national-income comparisons in terms of
with Pt denoting strict preference and /, indifference. base- and current-year prices, could lead to cycles of
(This relation assumes the absence of externalities in three competitive equilibria each superior to its
consumption). Finally, let XlRX° (resp. X1 PX°) mean successor.
that X1 is weakly (resp. strictly) Pareto-superior to The definitive contribution to the subject of
X° (i.e. xjRjxf for all i, resp. xjRixf for all i and national-income comparisons was that of Samuelson
xjRixf for some i). Then, from the real-income ( ) who introduced what Chipman and
comparisonp1 ■yl>pI ■ y°, Hicks noted that there
does not exist an allocation Xof y° that is weakly
Pareto-superior to the actual
Compensation Principle 1919
proof of a gains-from-trade theorem, as well as of free trade. A similar approach was indicated by
the proposition that a country could gain by a tariff. Vanek ( ).
Kemp ( ) noted that Samuelson’s 1939 Grandmont and McFadden ( ) and
theorem implied that for any point on the free- trade Chipman and Moore ( ) both used the concept
utility-possibility frontier, the autarky utility- of an income-distribution policy to establish the
possibility frontier must pass below it; he reasoned gains-from-trade theorem. In Chipman and Moore
that, as a result, for any point on the autarky utility- this policy was chosen to be one that maximizes a
possibility frontier, the free-trade utility-possibility separable Bergson-Samuelson social-welfare
frontier must pass above it. If this argument can be function. A standard argument is used to show that
accepted, it follows that for every allocation^ C social utility is at least as high under free trade as
A(Y°) where Y° is the autarkic production- under autarky. It remains to show that a function
possibility set, there exists a (weakly) Pareto- W(u) can be chosen so that the corresponding
superior allocation A 1 £ A(Yh) where Y] is the free- distribution policy ensures that an increase in social
trade consumption-possibility set. Then free trade is utility implies an increase in each individual’s
superior to autarky by Samuelson’s 1950 criterion. utility. This is achieved by choice of W(u) = miiij
The trouble with this argument, however, is that (iij — uf)/cj where ct > 0 and it1- is the level of
it requires that one can define a free-trade utility- utility achieved by individual i under autarky.
possibility frontier (or consumption- possibility
frontier) with the strong topological property of
homeomorphism to the (N — 1)- dimensional unit General-Equilibrium Theory
simplex (intuitively, absence of ‘holes’). That this
need not always be possible, was shown by Otani ( The compensation principle is used in the proof of
, p. 149), and indeed the theorem that every competitive equilibrium is
admitted by Kemp and Wan ( , p. 513). It is Pareto-optimal (Arrow , pp. 516, 519; Koopmans ,
always possible if world prices are fixed, beyond p. 49; Debreu , pp. 94-5), in the sense that arbitrary
our country’s control. In that case the free-trade allocations of feasible output bundles among
consumption-possibility set Y1 is the budget set consumers are assumed possible, regardless of
enclosing the production-possibility set resource-ownership constraints. A pair (A0, p°) is a
Y° (cf. Samuelson , p. 821), and the gains- from- competitive equilibrium for the production-
trade theorem follows immediately from the possibility set Y if J^'RX for all X C A(Y) satisfying
property I1 S
RY . In similar fashion the Xp° < A°p° and y°p° < yp° for all y £ Y, where y° =
famous ‘Baldwin envelope’ (Baldwin ) defines a ff(A°) C_ Y. Pareto-optimalify means that one
well-behaved consumption-possibility set cannot find an X £ A(Y) such that XPX°. The proof is
containing the production-possibility set, from by contradiction: XPX° implies Xp° > A°p° (the
which one can prove the superiority of restricted vector inequality being weak in all components and
trade (with an optimal tariff) to autarky (cf. strict
Samuelson ). in at least one) hence taking column sums,
0 - 0 0
For the general case in which a country can yp >yp ■
influence world prices, a method was shown by The converse theorem, that every Pareto opti-
Kenen ( ). If all but 1 of the N individuals are mum can be sustained by a competitive equilibrium,
constrained to have the same level of satisfaction requires stronger assumptions which are awkward to
under trade as achieved under autarky, a net state (cf. Arrow , p. 518; Koopmans , p. 50; Debreu ,
production-possibility set can be constructed which p. 95). The basic idea of the proof (Koopmans , pp.
indicates the amount available for the Mh person. It 50-52; Debreu , p. 96) can be sketched in terms of
remains only to show that the Mh person will gain the concept of a Scitovsky ( )
from a movement from autarky to indifference surface. If A0 is a Pareto-optimal
Compensation Principle 1921
allocation for a closed, convex production- this other than the historical accident that con-
possibility set K then the interior of the Scitovsky sumers’ surplus developed as a partial- equilibrium
set of X° can be written pkx°k + f°r tool. He remarked that consumers’ surplus is not
some k. needed for the first task, since lack of fulfilment of
Defining the allocation X1 by x\pkxQk and x\Rt xf the proportionality between marginal utilities and
for i ^ k we have X1 PX° hence X1 £ A(Y). Therefore marginal costs provides the needed information
the interior of the Scitovsky set does not intersect Y, immediately. For the second task, he was not
and these convex sets can be separated by a content with a ranking of the non- optimal states,
hyperplane defining the equilibrium prices. It is then but with measuring the size of their deviations from
verified that at these prices the properties of a optimality, which of coiuse would provide such a
competitive equilibrium are satisfied. ranking. Thus, the staunch ordinalist in consumer
Debreu ( , p. 590) introduced an alternative theory became an equally ardent cardinalist in
equilibrium concept according to which the condi- consumer theory.
tion that consumer preferences be maximized sub- Hicks’s concepts of compensating and equiva-
ject to their budget constraints was replaced by the lent variation (Hicks ) may most conveniently be
condition that consumer expenditures be minimized defined in terms of the minimum- income or
subject to the constraints that the bundles considered income-compensation functions of McKenzie () and
be at least as desirable as the equilibrium bundles. Hurwicz and Uzawa
(The second of the above theorems follows more ( ). Denoting the ith consumer’s demand
easily under this alternative definition). For a given function by x,- = h^p, in,) (where x, and p are ;?-
set of positive-valued utility functions representing vectors), and defining the indirect preference
consumer preferences, Arrow and Hahn ( , p. relation R* by (p°,mk) R* (/;', m\) if and only if h,
108) called this a ‘com (p°, )Rtht (px, m\), the income-compensa
pensated equilibrium’. As a means of proving exis- tion function is defined by
tence of the latter they studied the utility-possibility
frontier or ‘Pareto frontier’ ( , p. 96), and pi(p;p°,m°) i n f {m , : (p,ml)R*(p0,m^) }.
obtained a new proof of the result of Chipman and
Moore ( ) that the set of Pareto-optimal Following Chipman and Moore ( ), the gen
allocations X of Y (the ‘contract curve’) and the eralized compensating variation in going from (p\
utility-possibility frontier are topologically homeo- w?) to {p, nij) is defined as
morphic to the unit simplex of dimension one less
than the number of individuals. These results were Ci(p, mp,p°, »if) = mt - Pi (p;p°, »i?)
further developed by Moore ( ).
and the generalized equivalent variation by
Ei(p,mi;p0,m<}) = Pi(p0;p,nii) - mk.
Cost-Benefit Analysis
These reduce to Hicks’s concepts when m,- = mf.
Hicks ( , p. 112) made an interesting distinc The compensating variation expresses for each
tion between two tasks of welfare economics: (1) consumer the amount of money income he or she
the study of (Pareto)-optimal organizations of the would be willing to give up (or the negative of the
economy and (2) the study of deviations from such amount by which he or she would have to be
optima. More precisely, the first was concerned with compensated), at the new prices, to make up for the
when there was a deviation and the second with the change in prices and income. One of the reasons for
size of the deviation. He also identified these two the great appeal of the concept is that these are
tasks with general- and partial-equilibrium analysis amounts that can be added up over the set of
respectively, although there appears to be no consumers. In Hicks’s words ( , p. 127):
justification for
1922 Compensation Principle
the general test for a particular reform being an improvement ordering between pairs of price-
i m p r o v e m e n t is that the gainers should gain
income pairs (p‘, A/) as follows. Let A* (p, M) be
sufficiently for them to be able to compensate the
the set of (n + /V)-tuplcs (p, m) such that 22^=1 m‘ =
losers and still remain gainers on balance. This test
M, and let R* be the relation such that (p°, m°)
would be carried out by striking the balance of the
Compensating Variations. R*(p\ m1) if and only if (p°,m9 )R* (p\ m\) for i =
Denoting by m' the vector of N incomes in state t, 1,2,..., N. Then we define the dual KHS relation
and by M' their sum, we can define a dual potential- >R* by
Choosing price-income pairs (p°, m°) and (p1, ml) Boadway ( ) considered the relationship
satisfying this definition, since p,(p‘‘, p, mj) is an between the condition of positive summed com-
indirect utility function representing/?* for t 0 or 1, pensating variations and the fulfilment of com-
we have P;(p';p°,w;f) > Pi(p';p 1,»il-) for all pensation tests and came to the negative conclusion
individuals i, hence that the former was neither necessary nor sufficient
for satisfaction of the latter in general, but was
sufficient in the case of identical and homothetic
M° = Y^m<i preferences. Foster ( ) showed
1=1
that, if there are no price distortions (but not
otherwise), satisfaction of the compensation tests
i=1 2=1 implies satisfaction of the ‘cost-benefit criterion’
(positive summed compensating variations). This
so one obtains a multi-consumer analogue to the conclusion is in accord with the above inequalities.
compensating variation from the formula What about the Hicksian tenet that the size of
the compensating variation is important so that one
can compare two suboptimal states? This would
M ° =^2fj.i(p°;p1,m1i)^
2=1 require one to be able to conclude that, if the
compensating variation from state 0 to state 2 is
positive and greater than the compensating variation
M1 - =0.
2=1 from state 0 to state 1, then state 2 should be
superior to state 1 in terms of the dual KHS
Likewise for the equivalent variation, ordering. But this is not true even in the case of the
single consumer. It was shown in Chipman and
o=y>,(Pv.m”) Moore (1980) that the function C, (p, mt; p\ m°)
2=1 cannot be an indirect utihty function for unrestricted
domain (p, m,) > 0, and can be if m[ is held constant
-M° ^ ^pi(p°;p1,m!1)-M°. i=i
if and only if preferences are homothetic, and if p\
In the latter case the same indirect utihty is held constant if and only if preferences are
‘parallel’ with respect to commodity 1. If
functions are summed on both sides of the
preferences are identical and homothetic, since p, =
inequality sign; it is a case where Benthamites and
pis homogeneous of degree 1 in mi, I p,(p°;p, mi)
compensationists can find common ground.
so exact
aggregative analogues are obtained to both the
Compensation Principle 1923
compensating and equivalent variations. If the that the exchange in commodity 2 would be con-
equivalent variation, which is an indirect utility stant, but in commodity 1 ‘indeterminate’; see the
function, is used, restrictions on consumer prefer- reply by Berry , on behalf of Marshall, and Marshall
ences are not needed, and the problem of finding an 1891, p. 756; 1920, p. 845). The above form for the
adequate indicator of the size of the deviation from utility function has been rediscovered many time,
a given Pareto optimum is satisfactorily resolved. by Wilson ( ), Samuelson ( ),
and others; cf. Chipman and Moore ( ,
p. 115). Barone ( , p. 213n) gave the name
Game Theory ‘ideal money (numeraire)’ to a good with a constant
marginal utility (commodity 1 in the above). For the
One of the striking aspects of von Neumann and
case c, = c for all i, these ‘parallel’ preferences (cf.
Morgenstem’s theory of games ( ) was not
Boulding ) yield a special case of the family of
only its postulate of measurability of utility but also
aggregable Antonelli-Gor- man-Nataf demand
that of its transferability between players. Since this
functions referred to above.
was introduced as a positive rather than a normative
assumption, it has met with even greater resistance
of the part of economists than the hedonist calculus. Concluding Observations
Indeed, it was not until Debreu and Scarf ( )
showed how game the As Scitovsky ( ) pointed out, the compensa
ory could be liberated from this restriction with tion principle has been used in two quite different
their development of the concept of the core of an ways. Prior to Hicks ( ), it was used only to
economy that game theory began to be taken really compare efficient with inefficient states of a given
seriously by economists. The replacement of economy with a given technology or trading system.
transferability of utilities by transferability of Starting with Hicks ( ), its use was
commodities bears a striking resemblance to the extended to comparison of efficient states of an
replacement in welfare economics of the calculus of economy under different technologies. It has turned
utilities by the principle of compensation. out that, in order for national-income comparisons
In some branches of game theory the assump- to provide a correct indicator of potential- welfare
tion of transferable utility is still retained, but it has improvement, very strong conditions are required
been made somewhat more plausible, or at least concerning similarity of individual preferences:
interpretable, by means of the postulate that the locally, the Antonelli- Gorman-Nataf conditions,
utility functions of all individuals are linear in one and, globally, identical homothetic preferences. It is
distinguished commodity used for making side not even enough to assume that aggregate demand
payments (cf. Owen , p. 122). These utility can be generated by an aggregate preference
functions have the form relation - for example, that preferences are
homothetic and relative income- distribution
Ui{xi\, Xi2i —, ) CjXj\ V/ (x/2, —, XLn ).
constant (cf. Chipman and Moore ). Even in such
cases, strong value judgements (such as acceptance
This form of the utility function goes back to
of a particular Bergson-Samuelson social-welfare
Edgeworth ( , p. 237n) and even earlier
function) are required in order to draw welfare
(though in garbled form) to Auspitz and Lieben (
conclusions from national-income comparisons.
, p. 471). In Edgeworth it was used to illus
When attention is restricted to the efficient
trate the phenomenon of exchange when the mar-
operation of an economy with a given technology, it
ginal utility of one commodity serving as money
turns out that, in most cases of interest, the ranking
was held constant, in accordance with one possible
of consumption-possibility criterion sets according
interpretation of Marshall’s theory of consumers’
to the Kaldor-Hicks-Samuelson criterion follows
surplus. (In the case n = 2 he showed
from their ranking by set-inclusion.
1924 Compensation Principle
This does not mean, however, that the set- inclusion Arrow, K.J., and T. Scitovsky, eds. 1969. R e a d i n g s i n
is always obvious or easy to prove. w e l f a r e e c o n o m i c s . Homewood: Irwin.
Auspitz, R., and R. Lieben. 1889. U n t e r s u c h u n g e n
The KHS ordering of consumption-possibility i i b e r d i e T h e o r i e d e s P r e i s e s . Leipzig:
sets could be given simply a factual interpretation as Duncker & Humblot.
indicating the ‘productive potential’ of an economy. Baldwin, R.E. 1948. Equilibrium in international trade: A
But if it is given a normative interpretation then it diagrammatic analysis. Q u a r t e r l y J o u r n a l o f
E c o n o m i c s 62: 748-762.
obviously involves a value judgement, since a more Baldwin, R.E. 1954. A comparison of welfare criteria.
efficient outcome, if it is not Pareto-superior, can R e v i e w o f E c o n o m i c S t u d i e s 21: 154-161.
obviously be judged worse in terms of some social- Barone, E. 1894. Sulla ‘consumers’ rent’. G i o r n a l e d e g l i
welfare function. E c o n o m i s t i . Series 2, 9, September, 211-224.
Barone, E. 1908. II Ministerio della produzione nello stato
Samuelson’s ( ) model of the ‘good soci
colletivista. G i o r n a l e d e g l i E c o n o m i s t i Series
ety’, elegant though it is, is too sweeping for most 2. 37: 267-293; 391-414. English translation: The ministry
economists to accept, and it begs the question of of production in the collectivist state. In C o l l e c t i v i s t
how the social-welfare function will be chosen. e c o n o m i c p l a n n i n g , ed. F.A. Hayek. London:
Routledge & Regan Paul, 1935.
Little’s ( ) and Mishan’s ( ) attempts to
Berry, A. 1891. Alcune brevi parole sulla teoria del baratto di
link plausible distributional value judgements with A. Marshall. G i o r n a l e d e g l i E c o n o m i s t i .
compensation criteria have encountered Series 2, 2, June, 549-553.
unresolvable logical difficulties (cf. Chipman and Boadway, R.W. 1974. The welfare foundations of cost- benefit
analysis. E c o n o m i c J o u r n a l 84: 926-939. A reply,
Moore ). The hope that the compensation principle
E c o n o m i c J o u r n a l 86 (1976), 358-361.
would allow policy decisions to be made free of Boulding, K.E. 1945. The concept of economic surplus.
value judgements has not been fulfilled. A m e r i c a n E c o n o m i c R e v i e w 35: 851-869.
Nevertheless, much has been learned about the Caimes, J.E. 1874. Some leading principles of political
economy newly expounded. New York: Harper & Brothers.
interrelationships among values, facts, and policies,
Chipman, J.S. 1976. The paretian heritage. Revue europeenne
and it can certainly be said that the development of des sciences societies et Cahiers Vilfredo Pareto 14 (37):
the compensation principle has led to clearer 65-171.
thinking about economic policy issues. Chipman, J.S. 1982. Samuelson and welfare economics. In
Samuelson and neoclassical economics,
ed. G.R. Feiwel. Boston: Kluwer-Nijhoff Publishing.
Chipman, J.S., and J.C. Moore. 1971. The compensation
principle in welfare economics. In P a p e r s i n
See Also q u a n t i t a t i v e e c o n o m i c s , ed. A.M. Zarley, vol. 2.
Lawrence: University Press of Kansas.
Chipman, J.S., and J.C. Moore. 1972. Social utility and the
► Social Welfare Function
gains from trade. J o u r n a l o f I n t e r n a t i o n a l
► Welfare Economics
E c o n o m i c s 2: 157-172.
Chipman, J.S., and J.C. Moore. 1973. Aggregate demand, real
Bibliography national income, and the compensation principle.
I n t e r n a t i o n a l E c o n o m i c R e v i e w 14: 152-181.
Chipman, J.S., and J.C. Moore. 1976a. Why an increase in
Antonelli, G.B. 1886. S u l l a t e o r i a m a t e m a t i c a
GNP need not imply an improvement in potential wel-
d e l l a e c o n o m i a p o l i t i c a . Pisa: Folchetto.
fare. K y k l o s 29 (3): 391-418.
English translation: On the mathematical theory of
Chipman, J.S., and J.C. Moore. 1976b. The scope of con-
political economy. In P r e f e r e n c e s , u t i l i t y , a n d
sumer’s surplus arguments. In Evolution,
d e m a n d , ed. J.S. Chipman, L. Hurwicz, M.K. Richter,
w e l f a r e , a n d t i m e i n e c o n o m i c s , ed. A.
and H.F. Sonnenschein. New York: Harcourt Brace
Tang, F.M. Westfield, and J.S. Worley. Lexington:
Jovanovich, 1971.
Heath.
Arrow, K.J. 1952. An extension of the basic theorems of
Chipman, J.S., and J.C. Moore. 1978. The new welfare
classical welfare economics. In P r o c e e d i n g s o f t h e
economics, 1939-1974. I n t e r n a t i o n a l E c o n o m i c
second Berkeley symposium on
R e v i e w 19: 547-584.
mathematical statistics and probability,
Chipman, J.S., and J.C. Moore. 1979. On social welfare
ed. J. Neyrnan. Berkeley/Los Angeles: University of
functions and the aggregation of preferences. J o u r n a l
California Press.
o f E c o n o m i c T h e o r y 21: 111-139.
Arrow, K.J., and F.H. Hahn. 1971. G e n e r a l
c o m p e t i t i v e a n a l y s i s . San Francisco: Holden-
Compensation Principle 1925
Chipman, J.S., and J.C. Moore. 1980a. Real national income Hicks, J.R. 1941. The rehabilitation of consumers’ surplus.
with homothetic preferences and a fixed distribution of R e v i e w o f E c o n o m i c S t u d i e s 8: 108-116.
income. E c o n o m e t r i c a 48: 401-422. Reprinted in Arrow and Scitovsky (1969).
Chipman, J.S., and J.C. Moore. 1980b. Compensating Hicks, J.R. 1942. Consumers’ surplus and index-numbers.
variation, consumer’s surplus, and welfare. A m e r i c a n Review of Economic Studies 9: 126-137.
E c o n o m i c R e v i e w 70: 933-949. Hicks, J.R. 1957. A r e v i s i o n o f d e m a n d t h e o r y .
Coase, R.H. 1960. The problem of social cost. J o u r n a l o f Oxford: Clarendon Press.
L a w a n d E c o n o m i c s 3: 1-44. Hotelling, H. 1938. The general welfare in relation to
de Graaff, J. V. 1949. On optimum tariff structures. R e v i e w problems of taxation and of railway and utility rates.
o f E c o n o m i c S t u d i e s 17: 47-59. Reprinted in E c o n o m e t r i c a 6: 242-269. Reprinted in Arrow and
Arrow and Scitovsky (1969). Scitovsky (1969).
de Graaff, J.V. 1957. T h e o r e t i c a l welfare Hotelling, H. 1939. The relation ofprices to marginal costs in
e c o n o m i c s . Cambridge: Cambridge University Press. an optimum system. E c o n o m e t r i c a 7: 151-155. A
Debreu, G. 1951. The coefficient of resource utilization. final note, E c o n o m e t r i c a 7: 158-159.
E c o n o m e t r i c a 19: 273-292. Hurwicz, L., and H. Uzawa. 1971. On the integrability of
Debreu, G. 1954. Valuation equilibrium and Pareto optimum. demand functions. In P r e f e r e n c e s , u t i l i t y , a n d
Proceedings of the National Academy of d e m a n d , ed. J.S. Chipman, L. Hurwicz, M.K. Richter,
S c i e n c e s 40: 588-592. and H.F. Sonnenschein. New York: Harcourt Brace
Debreu, G. 1959. T h e o r y o f v a l u e . New York: Wiley. Jovanovich.
Debreu, G., and H. Scarf. 1963. A limit theorem on the core of Kaldor, N. 1939. Welfare propositions in economics and
an economy. I n t e r n a t i o n a l E c o n o m i c R e v i e w interpersonal comparisons of utility. E c o n o m i c
4: 235-246. J o u r n a l 49: 549-552. Reprinted in Arrow and
Dupuit, J. 1844. De la mesure de l’utilite des travaux publics. Scitovsky (1969).
Annales des Fonts et Chaussees, Kaldor, N. 1940. A note on tariffs and the terms of trade.
Memoires et documents relatifs a l ’art E c o n o m i c a , N.S. 7: 377-380.
des constructions et au service de Kannai, Y., andR. Mantel. 1978. Non-convexifiable Pareto
I ’ i n g e n i e u r . Series 2, 2, 2e semestre, 332-375, PI. 75. sets. E c o n o m e t r i c a 46: 571-575.
English translation: On the measurement of the utility of Kemp, M.C. 1962. The gains from international trade.
public works, in Arrow and Scitovsky (1969). Economic Journal 72: 803-819.
Edgeworth, F.Y. 1891. Osservazioni sulla teoria matematica Kemp, M.C., and H.Y. Wan Jr. 1972. The gains from free
delTeconomia politica con riguardo speciale ai principi di trade. I n t e r n a t i o n a l E c o n o m i c R e v i e w 13: 509-
economia di Alfredo Marshall. G i o r n a l e d e g l i 522.
E c o n o m i s t I . Series 2, 2: 233-245. Ancora a proposito Kenen, P.B. 1957. On the geometry of welfare economics.
della teoria del baratto. Giornale d e g l i E c o n o m i s t i . Quarterly Journal of Economics 71: 426—447. Koopmans,
Series 2, 2: 316-318. Abridged English translation: On the T.C. 1957. Three essays on the state of economic science. New
determinateness of economic equilibrium, in F.Y. York: McGraw-Hill.
Edgeworth, Papers relating t o p o l i t i c a l e c o n o m y , Kuznets, S. 1948. On the valuation of social income -
vol. 2. London: Macmillan., 1925. Reflections on professor Hicks’ article. E c o n o m i c a ,
Foster, E. 1976. The welfare foundations of cost-benefit N.S. 15: 1-16, 116-131.
analysis - A comment. Economic Journal 86: 353-358. Lemer, A.P. 1934. The concept of monopoly and the mea-
Frisch, R. 1939. The Dupuit taxation theorem. surement of monopoly power. R e v i e w of
E c o n o m e t r i c a ! : 145-150. A further note on the Economic
Dupuit taxation theorem. Econometrica 7: 156-157. Studies 1: 157-175.
Gorman, W.M. 1953. Community preference fields. Little, I.M.D. 1950. A critique of welfare economics.
E c o n o m e t r i c a 2 1: 63-80. 2nd ed. London: Oxford University Press. 1957.
Gorman, W.M. 1955. The intransitivity of certain criteria Marshall, A. 1890. P r i n c i p l e s o f e c o n o m i c s .
used in welfare economics. Oxford Economic Papers, N.S. London: Macmillan. 2nd ed, 1891; 8th ed, 1920.
7: 25-35. McKenzie, L.W. 1957. Demand theory without a utility index.
Grandmont, J.M., and D. McFadden. 1972. A technical note Review of E c o n o m i c S t u d i e s 24: 185-189. Mishan, E.J.
on classical gains from trade. Journal of Internat i o n a l 1969. W e l f a r e e c o n o m i c s : A n a s s e s s m e n t .
E c o n o m i c s 2: 109-125. Amsterdam: North-Holland.
Harrod, R.F. 1938. Scope and method of economics. Eco- Moore, J.C. 1975. The existence of‘compensated equilibrium’
n o m i c J o u r n a l 48: 383^412. and the structure of the Pareto efficiency frontier.
Hicks, J.R. 1939. The foundations of welfare economics. International E c o n o m i c R e v i e w 16: 267-300.
E c o n o m i c Journal 49: 696-712. Nataf, A. 1953. Sur des questions d’agregation en
Hicks, J.R. 1940. The valuation of social income. econometrie. P u b l i c a t i o n s d e I ' I n s t i t u t d e
E c o n o m i c a , N.S. 7: 105-124. S t a t i s t i q u e de l'Universite d e P a r i s 2 (4): 5-61.
Olsen, E. 1958. Udenrigshandelens gevinst [The gains of
international trade]. Nationalokonomisk
1926 Competing Risks Model
i of leaving the state it is in, each risk giving rise to individuals are heterogeneous. It may even be
its own exit destination, which can also be denoted sensible from the individual’s point of view to use
by i. One may then define random variables Tt their privately observed exogenous exit rates into
describing the duration until risk i is materialized. destinations j as inputs for the optimal strategy
Only the smallest of all these durations Y := min, T, affecting the exit rate into destination i (i / j) (see,
and the corresponding actual exit destination, which for example, van den Berg ). Erroneously assuming
can be expressed as Z := argmin, 7], are observed. independence leads to incorrect inference, and in
The other durations are censored in the sense that fact the issue of whether the durations 7} are related
all is known is that then- realizations exceed Y. is often an important question in its own right.
Often those other durations are latent or Unfortunately, the joint distribution of all Tt is
counterfactual, for example if T, denotes the time not identified from the joint distribution of Y, Z, a
until death due to cause i. result that goes back to Cox ( ). In particular,
In economics, the most common application given any specific joint distribution, there is a joint
concerns individual unemployment durations. One distribution with independent durations Tt that
may envisage two durations for each individual: one generates the same distribution of the observable
until a transition into employment occurs, and one variables Y, Z. In other words, without additional
until a transition into non-participation occurs. We structure, each dependent competing risks model is
observe only one transition, namely, the one that observationally equivalent to an independent
occurs first. Other applications include the duration competing risks model. The marginal distributions
of treatments, where the exit destinations are relapse in the latter can be very different from the hue
and recovery, and the duration of marriage, where distributions.
one risk is divorce and the other is death of one of Of course, some properties of the joint distri-
the spouses. More generally, the duration until an bution are identified. To describe these it is useful
event of interest may be right-censored due to the to introduce the concept of the hazard rate of a
occurrence of another event, or due to the data continuous duration variable, say W. Formally, the
sampling design. The duration until the censoring is hazard rate at time t is 9(t): = limj, j 0 Pr (W e [t, t +
then one of the variables T h dt))/dt. Informally, this is the rate at which the
Sometimes one is interested only in the distri- duration W is completed at t given that it has not
bution of Y. For example, an unemployment insur- been completed before t. The hazard rate is the
ance (UI) agency may be concerned only about the basic building block of duration analysis in social
expenses on UI and not in the exit destinations of sciences because it can be directly related to indi-
recipients. In such cases one may employ standard vidual behaviour at t. The data on Y, Z allow for
statistical duration analysis for empirical inference identification of the hazard rates of 7' at t given that
with register data on the duration of UI receipt. T > t. These are called the ‘crude’ hazard rates. If
However, in studies on individual behaviour one is the I) are independent, then these equal the ‘net’
typically interested in one or more of the marginal hazard rates of the marginal distributions of the Tj.
distributions of the 7). If these variables are known We now turn to a number of approaches that
to be independent, then again one may employ overcome the general non-identification result for
standard duration analysis for each of the T t competing risks models. In econometrics, one is
separately, treating the other variables 7} ( j / i ) as typically interested in covariate or regressor effects.
independent right-censoring variables. But often it The main approach has therefore been to specify
is not clear whether the 7) are independent. Indeed, semi-parametric models that include observed
economic theory often predicts that they are regressors X and unobserved heterogeneity terms V.
dependent, in particular if they can be affected by With a single risk, the most popular
the individual’s behaviour and
1928 Competing Risks Model
duration model is the mixed proportional hazard the survivors at I, due to the well-known fact that
(MPH) model, which specifies that 6 (t\ X x, V) = \J/ V i and X are dependent conditional on survival (i.e.
(t) exp (xfjyVfor some function Vis unobserved, and conditional on 7) > t > 0) even though they are
the composition of the survivors changes selectively independent unconditionally. Second, if V\ and V)
as time proceeds, so identification from the are dependent, this affects the distribution of V)
observable distributions of T\X is non-trivial. among the survivors at t, which in turn affects the
However, it holds under the assumptions that XJL| observable crude hazard of T2 at t given that '/'i > t.
Vand var(X) > 0 and some regularity assumptions In sum, the variation in this crude hazard with
(see van den Berg , for an overview of results). exp(x'j8i) for given exp(x'p2) is informative on the
With competing risks, the analogue of the MPH dependence of the durations. An analogous
model is the multivariate MPH (MMPH) model. argument holds for the crude hazard rate
With two risks, corresponding to cause 7=1.
Note that identification is not based on exclusion
0 i( f | x , V)=\//1(t) exp(xJ/]1)Vi a nd restrictions of the sort encountered in instrumental
d2lt\x, V)=ij/2lt) exp{x'pi)V2. variable analysis, which require a regressor that
where 7), T 2 \X, Pare assumed independent, so that affects one endogenous variable but not the other.
a dependence of the durations given X is modelled Here, all explanatory variables are allowed to affect
by way of their unobserved determinants V\ and V2 both duration variables - they are just not allowed to
being dependent. Many empirical studies have affect the duration distributions in the same way.
estimated parametric versions of this model, using Identification with regressors was first established
by Heckman and Honore ( ), who considered a
maximum likelihood estimation.
somewhat larger class of models than the MMPH
The semi-parametric model has been shown to
model and accordingly imposed stronger conditions
be identified, under only slightly stronger condi-
on the support of X.
tions than those for the MPH model (Abbring and
Although the MPH model is identified from
van den Berg ). Specifically, Var(X) > 0 is
single-risk duration data where we observe a single
strengthened to the condition that the vector X
spell per subject, there is substantial evidence that
includes two continuous variables with the
estimates are sensitive to misspecification of
properties that (a) their joint support contains a non-
functional forms of model elements (see van den
empty open set in R’, and (b) the vectors [il, [>2 of
Berg , for an overview). This implies that estimates
the corresponding elements of Pi and p2 form a
of MMPH models using competing- risks data
matrix (jSj, P2) of full rank. Somewhat loosely, X
should also be viewed with caution. It is advisable
has two continuous variables that are not perfectly to include additional data. For example, longitudinal
collinear and that act differently on di and d 2 . Note survey data on unemployment durations subject to
that, with such regressors, one can manipulate right-censoring can be augmented with register data
cxp(x![>]) while keeping exp(x'/?2) constant. The or retrospective data not subject to censoring (see
two terms exp(x'PP are identified from the for example van den Berg et al. ). More in general,
observable erode hazards at i 0 because at / 0 no one may resort to ‘multiple-spell competing risks’
dynamic selection due to the data, meaning data with multiple observations of Y,
unobserved heterogeneity has taken place yet. Now Z for each subject. For a given subject, such
suppose one manipulates x in the way described observations can be viewed as multiple independent
above. If 7\, T 2 \X are independent, then the draws from the subject-specific distribution of Y, Z,
observable erode hazard rate of T2 at t > 0, given on the assumption that the unobserved
that 71 > t, does not vary along. But, if 71; 72|A are heterogeneity terms V,, V2 are identical across the
dependent, then this crude hazard rate does vary spells of the subject. Here, a subject can denote a
along, for the following reason. First, changes in single physical unit, like an individual, for which
explx'/^) affect the distribution of unobserved we observe
heterogeneity V\ among
Competing Risks Model 1929
two spells in exactly the same state, or it can denote We end this article by noting some connections
a set of physical units for which we observe one between competing risks models and other models.
spell each. Multiple-spell data allow for First, they are related to switching regression
identification under less stringent conditions than models, or Roy models. For example, if l',\X, Vin
single-spell data. Abbring and van den Berg ( ) the MMPH model have Weibull distributions, then
showed that such data identify models that we can write log I) = x,j, + i = 1,2) (for example,
allow for full interactions between the elapsed van den Berg et al. ), where we observe 7) iff J) < /}
durations t and x in 6 t{t\ x, V), and, indeed, allow the (/ / /). Second, competing risks models are building
corresponding effects to differ between the first and blocks of multivariate duration models, notably
the second spell. The assumptions on the support of models where one of the durations is always
X are similar to above. Fermanian ( ) developed observed (for example, 7) captures the moment of a
a nonparametric kernel estimator of the Heckman treatment and T2 is the observed duration outcome
and Honore ( ) model. of interest).
Another approach to deal with nonidentification We have considered only continuous-time
of dependent competing risks models is to duration variables 7) that have different realizations
determine bounds on the sets of marginal and joint with probability 1. Recently, semi-parametric and
distributions that are compatible with the observable nonparametric results have been derived for
data. Peterson ( ) discrete-time or interval-censored competing risks
derived sharp bounds in terms of observable quan- models and models where different risks can be
tities. They are often wide. In case of the marginal realized simultaneously (see for example Bedford
distributions of two sub-populations distinguished and Meilijson ; van den Berg, van Lomwel and van
by a variable X, the bounds associated with the Ours ; Honore and Lleras-Muney ). The
different X may overlap, whether or not X biostatistical literature contains many studies in
(monotonically) affects (one of) the marginal which specific assumptions are made on the
distributions. With overlap, the causal effects of X dependence structure of the two durations Th
cannot even be signed. enabling inference on the marginal distributions
Bond and Shaw ( ) combined bounds with from data on Y, Z (see for example Moeschberger
regressors. In the case of a single binary regressor, and Klein , for a survey).
the only substantive assumption made is that there
exist increasing functions g and h such that 7), T2\
X=0 equals g('I'\), h(T2)\X = 1 in distribution. In
words, the dependence structure is invariant to the See Also
values of the regressors, so the latter affect only the
marginal distributions. Specifically, the copula (and
therefore Kendall’s x) of the joint distribution is
invariant to the value of X. The assumption is
satisfied by the aforementioned competing risks
models with regressors. Clearly, by itself the
assumption is insufficient for point identification. Bibliography
The bounds concern the regressor effects on the
Abbring, J., and G. van den Berg. 2003. The identifiability of
marginal distributions. If it is assumed that X affects the mixed proportional hazards competing risks model.
the marginal distributions of Tt in terms of first- Journal of the Royal Statistical Society,
order stochastic dominance, the bounds are S e r i e s B 65: 701-710.
sufficient to sign the effect of X on at least one of Bedford, T., and I. Meilijson. 1997. A characterization of
marginal distributions of (possibly dependent) lifetime
the marginal distributions (so, in case of MMPH variables which right censor each other. A n n a l s o f
models, also on at least one of the individual S t a t i s t i c s 25: 1622-1645.
marginal distributions conditional on V). Bond, S., and J. Shaw. 2006. Bounds on the covariate-time
transformation for competing-risks survival analysis.
L i f e t i m e D a t a A n a l y s i s 12: 285-303.
1930 Competition
Cox, D. 1959. The analysis of exponentially distributed life- the concept in dealing with conditions of per-
times with two types of failure. J o u r n a l o f t h e sistent and imperfectly predicted change will be
R o y a l S t a t i s t i c a l S o c i e t y . S e r i e s B 21:
411-421. removed only when economics possesses a
Fermanian, J. 2003. Nonparametric estimation of competing developed theory of change.
risks models with covariates. J o u r n a l of
M u l t i v a r i a t e A n a l y s i s 85: 156-191.
Keywords
Heckman, J., and B. Honore. 1989. The identifiability of the
competing risks model. B i o m e t r i k a 76: 325-330. Bilateral monopoly; Clark, J.; Cliffe Leslie, T.;
Honore, B., and A. Lleras-Muney. 2006. Bounds in competing Coalitions; Comparative statics; Competition;
risks models and the war on cancer. E c o n o m e t r i c a Competitive equilibrium; Contracting; Cournot,
74: 1675-1698.
A.; Creative destruction; Darwin, C.; Dcmsctz,
Moeschberger, M., and! Klein. 1995. Statistical methods for
dependent competing risks. L i f e t i m e Data H.; Edgeworth, F.; Entrepreneur- ship; Factor
A n a l y s i s 1: 195 204. price equalization theorem; Fisher, L; Industrial
Peterson, A. 1976. Bounds for a joint distribution function organization; Innovation; .lenkin, F.; Jevons, W.;
with fixed sub-distribution functions: application to
Joint action; Kirzner, 1.; Knight, F.; Labour
competing risks. P r o c e e d i n g s o f t h e N a t i o n a l
A c a d e m y o f S c i e n c e s I S : 11-13. markets; Laissez faire; Law of indifference; Law
van den Berg, G. 1990. Search behaviour, transitions to of one price; Law of supply and demand; Long-
nonparticipation and the duration of unemployment. run equilibrium; Malthus, T.; Market price;
E c o n o m i c J o u r n a l 100: 842-865. Markets; Marshall, A.; Mathematics and
van ilcn Berg, G. 2001. Duration models: Specification,
identification, and multiple durations. In H a n d b o o k economics; Mill, .1. S.; Monopoly; Oligopoly;
o f e c o n o m e t r i c s , ed. J. Heckman and E. Learner, Pareto, V.; Perfect competition; Perfect
Vol. 5. Amsterdam: North-Holland. information; Perfect markets; Pigou,
van den Berg, G., M. Lindeboom, and G. Ridder. 1994. A. ; Profit-maximixing behaviour; Rate of
Attrition in longitudinal panel data, and the empirical
analysis of dynamic labour market behaviour. J o u r n a l
return; Recontracting; Resource mobility;
o f A p p l i e d E c o n o m e t r i c s 9: 421-435. Schumpeter, .1.; Second best; Senior, N.; Smith,
van den Berg, G., van Lomwel, A., and van Ours, J. 2003. A.; Stigler, G. J.; Thornton, H.; Walras,
Nonparametric estimation of a dependent competing risks L. ; Workable competition
model for unemployment durations. Discussion Paper No.
8l>x. Bonn: l/.A.
JEL Classifications
B0
Competition
Competition is a rivalry between individuals (or
George J. Stigler groups or nations), and it arises whenever two or
more parties strive for something that all cannot
obtain. Competition is therefore at least as old as
man's history, and Darwin (who borrowed the
concept from economist Malthus) applied it to
Abstract
species as economists had applied it to human
Competition arises whenever two or more
behaviour.
parties strive for something that all cannot
A concept that is applicable to two cobblers or a
obtain. The classical economists felt no need for
thousand shipowners or to tribes and nations is
a very precise definition of competition because
necessarily loosely drawn. When Adam Smith
they viewed monopoly as highly exceptional. In
launched economics as a comprehensive science in
the late 19th century competition became the
1776, he followed this usage. He explained why a
subject of intense analysis; the concept of
reduced supply of a good led to a higher price: the
perfect competition emerged as the standard
'competition [which] will immediately begin’
model of economic theory and as first
among buyers would bid up the price. Similarly if
approximation in the concrete studies of applied
the supply become larger, the price would sink
microeconomics. The limitations of
Competition 1931
more, the greater ‘the competition of the sellers’ employments, and that knowledge is less commonly
(Smith [ ] 1976, pp. 73—4). Here competition possessed than knowledge within one market. It
was very much like a race: a race to obtain part of often requires a good deal of time to disengage
reduced supplies or to dispose of a part of increased resources from one field and install them elsewhere.
supplies. Almost nothing except a number of buyers Both of these conditions were recognized by Smith,
and sellers was necessary for competition to who spoke of the difficulty of keeping secret the
operate. And the greater the number of each, the existence of extraordinary profits, and of the long
greater the vigour of competition: mn sometimes required for the attainment of
If this capital [sufficient to trade in a town] is divided
equality of rates of return.
between two different grocers, their competition will For the next three-quarters of a century the
tend to make both of them sell cheaper, than if it prevailing treatment of competition followed the
were in the hands of one only; and if it were divided practice of Smith. One can find occasional hints of a
among twenty, their competition would be just so
more precise definition of competition, well
much the greater, and the chance of their combining
together, in order to raise the price, just so much the illustrated by Nassau W. Senior:
less, (ibid., pp. 361-2)
But though, under free competition, cost of produc-
With such a loose concept, there was little occasion tion is the regulator of price, its influence is subject
to much occasional interruption. Its operation can be
to speak of one market as being more or less supposed to be perfect only if we suppose that there
competitive than another, although this very pas- are no disturbing causes, that capital and labour can
sage presented the commonsense idea that larger be at once transferred, and without loss, from one
numbers of rivals increased the intensity of employment to another, and that every producer has
frill information of the profit to be derived from
competition. every mode of production. But it is obvious that these
The competition of grocers in a town pertained suppositions have no resemblance to the truth. A
to competition within a market or an industry. Smith large portion of the capital essential to production
made much of the competition of different markets consists of buildings, machinery, and other
implements, the results of much time and labour,
or industries for resources, and he developed what and of little service for any except their existing
has always remained the main theorem on the purposes... few capitalists can estimate, except upon
allocation of resources in an economy composed of an average of some years, the amount of their own
private, competing individuals or enterprises. The profits, and still fewer can estimate those of their
neighbours. ( , p. 102)
argument may be stated: Each owner of a
productive resource will seek to employ it where it Senior is hinting at a concept of perfect competi-
will yield the largest return. As a result, under tion, but the hint is not pursued.
competition each resource will be so distributed that The classical economists felt no need for a
it yields the same rate of return in every use. For if a precise definition because they viewed monopoly as
resource were earning more in one use than another, highly exceptional: Harold Demsetz has counted
it would be possible for its return in the lower- only one page in 90 devoted to monopoly in The
yielding use to be increased by reallocating it to the Wealth of Nations and only one in 500 in Mill’s
higher-yielding use. And this theorem led to what Principles of Political Economy. Indeed the word
John Stuart Mill called the most frequently ‘monopoly’ was usually restricted to grants by the
encountered proposition in economics: ‘There sovereign of exclusive rights to manufacture, import
cannot be two prices in the same market’ (Mill , or sell a commodity; witness the entry in the Penny
Book II, eh. IV, s. 3). Cyclopedia ( ):
The competition of different markets or indus- It seems then that the word monopoly was never
used in English Law, except when there was a royal
tries for the use of the same resources called grant authorizing some one or more persons only to
attention to some problems which are less important deal in or sell a certain commodity or article.
within a single market such as the grocery trade in a If a number of individuals were to unite for the
town. One must possess knowledge of the purpose of producing any particular article or com-
modity, and if they should succeed in selling such
investment opportunities in these different
1932 Competition
article very extensively, and almost solely, such denied the fact that prices were determined by the
individuals in popular language would be said to Taw of supply and demand’, particularly within
have a monopoly. Now, as these individuals have no
advantages given them by the law over other per-
labour markets. He employed bizarre examples,
sons, it is clear they can only sell more of their such as supply and demand curves which coincided
commodity than other persons by producing the over a vertical range, to show that price could be
commodity cheaper and better. (XV, p. 341) indeterminate or unresponsive to changes in supply
The ability of rivals to seek out and compete away or demand. These objections naturally called forth
supernormal profits, unless prevented by legal responses, from both J.S. Mill (Collected Works, V)
obstacles, was bebeved to be the basic reason for and Fleeming Jenkin, a famous engineer.
the pervasiveness of competition. The most persuasive reason for the increasing
In the last third of the 19th century the concept attention to the concepts of economics was the
of competition became the subject of intense study. gradual move of economic studies to the univer-
The most popular reason given for this attention is sities, which proceeded rapidly in the last decades of
that the growth of large-scale enterprises, including the century. The expanding use of mathematics was
railroads, public utilities, and finally great one major symptom of the development of the
manufacturing enterprises, made obvious the fact formal and abstract theory of economics by Walras,
that a simple concept of competition no longer fit Pareto, Irving Fisher and others. That formalization
the economy of an industrial nation such as would scarcely be possible without a more precise
England. specification of the nature of competition, and the
A second source of misgiving with the broad precise specification of the nature of competition,
definition of competition is that it might not lead to and the replies to Thornton’s criticisms were a
the uniformity of returns to a resource predicted by precursor to this literature.
the theory. The Irish economist Cliffe Leslie The groundwork for the development of the
repeatedly made this charge: concept of perfect competition was laid by Augustin
Cournot in 1838 in his Mathematical Principles of
Economists have been accustomed to assume that
wages on the one hand and profits on the other are,
the Theory of Wealth. He made the first systematic
allowing for differences in skill and so forth, equal- use of the differential calculus to study the
ized by competition, and that neither wages nor implications of profit-maximizing behaviour.
profits can anywhere rise above ‘the average rate’, Starting with the definition, Profits = Revenue —
without a consequent influx of labour or of capital
bringing things to a level. Had economists, however,
Costs, Cournot sought to maximize profits under
in place of reasoning from an assumption, examined various market conditions. He faced the question:
the facts connected with the rate of wages, they would How does revenue (say, pq) vary with output (qfi
have found, from authentic statistics, the actual The natural answer is to define competition as that
differences so great, even in the same occupation,
that they are double in one place what they are in
situation in which p does not vary with q - in which
another. Statistics of profits are not, indeed, the demand curve facing the firm is horizontal. This
obtainable like statistics of wages; and the fact that is precisely what Cournot did:
they are not so, that the actual profits are kept a The effects of competition have reached their limit,
profound secret in some of the most prominent when each of the partial productions D k [the output
trades, is itself enough to deprive the theory of equal of producer k ] is i n a p p r e c i a b l e , not only with
profits of its base. ( , pp. 158-9) reference to the total production D = F ( p ) , but
also with reference to the derivative F ' ( p ) , so that
The easiest way to combat such criticisms was not the partial production D k could be subtracted from
to confront them with data - that path was not D without any appreciable variation resulting in the
chosen for many years - but to define competition price of the commodity. (Cournot [ ] 1927,
in such a way as to ensure the desired results such p. 90)
according to his theory of oligopoly, the excess of perfectly free competition, so that any one will
exchange with any one else upon the slightest
price over marginal cost approached zero as the
advantage appearing. There must be no conspiracies
number of like producers became large. The argu- for absorbing and holding supplies to produce
ment is as follows: unnatural ratios of exchange. (Jevons , pp. 86, 87)
Let the revenue of the firm be q,p, and let n
identical firms have the same marginal costs, MC. By perfect knowledge Jevons meant only that each
Then the equation for maximum profits for one firm trader in a market knew the price bids of every other
would be trader. The second condition ruled out any joint
actions by two or more traders, without his noticing
p + qfdp/dq) = MC. that with knowledge so perfect as to know the
behaviour of rivals, there might appear the very
The sum of n such equations would be np + conspiracies he ruled out. The two conditions
dictated that ‘there cannot be two prices for the
q(dp/dq) = nMC, for nqt q. This last equation may same kind of article’ in a perfect market, which he
called the Taw of indifference’.
be written, p = MC —p/nE, The merging of the concepts of competition and
the market was unfortunate, for each deserved a full
where E is the elasticity of market demand (Cournot
and separate treatment. A market is an institution
, p. 84).
for the consummation of transactions. It performs
Cournot believed that this condition of compe-
this function efficiently when every buyer who will
tition was fulfilled ‘for a multitude of products, and,
pay more than the minimum realized price for any
among them, for the most important products’.
class of commodities succeeds in buying the
Cournot’s definition was enormously more
commodity, and every seller who will sell for less
precise and elegant than Smith’s so far as the
than the maximum realized price succeeds in selling
treatment of numbers was concerned. A market
the commodity. A market performs these tasks more
departed from unlimited competition to the extent
efficiently if the commodities are well specified and
that prices exceeded the marginal cost of the firm,
if buyers and sellers are fully informed of their
and the difference approached zero as the number of
properties and prices. Also a complete, perfect
rivals approached infinity. This definition, however,
market allows buyers and seller to act on differing
illuminated only the effect of number of rivals on
expectations of future prices. A market may be
the power of individual firms to influence the
perfect and monopolistic or imperfect and
market price, on Cournot’s special assumption that
competitive. Jevons’s mixture of the two has been
each rival believed that his output decisions did not
widely imitated by successors, of course, so that
affect the output decisions of his rivals. It therefore
even today a market is commonly treated as a
bore only on what we term market competition.
concept subsidiary to competition.
Cournot did not face the question of the role of
Edgeworth was the first economist to attempt a
information possessed by traders, and this question
systematic and rigorous definition of perfect com-
was taken up by William Stanley Jevons in 1871 in
petition. His exposition deserves the closest scrutiny
his Theory of Political Economy. He characterized a
in spite of the fact that few economists of his time
perfect market by two conditions:
(1.) A market, then, is theoretically perfect only when
or ours have attempted to disentangle and uncover
all traders have perfect knowledge of the conditions the theorems and conjectures of the Mathematical
of supply and demand, and the consequent ratio of Psychics ( ), probably the
exchange; ... (2.) ... there must be most elusively written book of importance in the
history of economics. His exposition was the most
influential in the entire literature.
The conditions of perfect competition are stated
as follows:
1934 Competition
T h e f i e l d o f c o m p e t i t i o n with reference to a there be one seller and two buyers, and let the seller
contract, or contracts, under consideration consists of
gain all the benefits of the sale: each buyer is
all individuals who are willing and able to recontract
about the articles under consideration ... charged the maximum price he would pay rather
There is free communication throughout a n o r - than withdraw from the market. If now a second
m a l competitive field. You might suppose the con- seller appears, he will find it advantageous to offer
stituent individuals collected at a point, or connected better terms to the two buyers: ‘It will in general be
by telephones - an ideal supposition [1881], but
sufficiently approximate to existence or tendency for
possible for one of the [sellers] (without the consent
the purposes of abstract science. of the other), to recontract with the two [buyers], so
A p e r f e c t field of competition professes in that for all those three parties the recontract is more
addition certain properties peculiarly favourable to advantageous than the previously existing contract’
mathematical calculation; ... The conditions of a
p e r f e c t field are four; the first pair referable to the
(ibid., p. 35). As the numbers of traders on each side
heading m u l t i p l i c i t y or continuity, the second to increase, the price approaches the competitive
d i v i d e d n e s s or fluidity. equilibrium level where no individual trader can
I. An individual is free to r e c o n t r a c t with any influence it.
out of an indefinite number, .. . A defect in this argument is that it ignores the
II. Any individual is free to c o n t r a c t (at the same fact that if the traders on one or both sides of the
time) with an indefinite number;
market, be they 2, or 2000 or 2,000,000, join
... This condition combined with the first together they can do better individually than by
appears to involve the indefinite divisibility competing. If traders on each side join, however,
of each article of contract (if any X deal with there will be bilateral monopoly, not competition.
an indefinite number of Fs he must give each Edgeworth gives no reason why the combination of
an indefinitely small portion of .r); which traders tails to take place. Only in modem times has
might be erected into a separate condition. the reason for independent behaviour by rivals been
III. Any individual is free to r e c o n t r a c t with
another independently of, w i t h o u t t h e
established: the costs of reaching and enforcing
c o n s e n t being required of, any third party,... agreements on joint action increase with both the
IV. Any individual is free to c o n t r a c t with number of rivals and the complexity of the trans-
another independently of a third party; ... actions. At a certain level - quite possibly with only
The failure of the first [condition] involves the two traders under some conditions - the costs of
failure of the second, but not vice versa; and the joint action exceed the gain to at least some of the
third and fourth are similarly related (Edgeworth , traders, and independent behaviour emeiges.
pp. 17-19). Edgeworth’s ‘conjecture’, as it is now often
The essential elements of this formidable list of called, that a unique, competitive price would
conditions are two: emerge when the number of traders became large,
has given rise to a modem literature vast in scope
1. There are an indefinitely large number of inde- and often highly advanced in its mathematical
pendent traders on each side of a market (the techniques (for references, see Hildenbrand ). One
Cournot condition). result in this literature is that in the case of a large
2. Each trader can costlessly make tentative con- (infinite) number of traders, no coalition of a portion
tracts with everyone (hence the divisibility of of the traders can exclude traders outside the
commodities) and alter these contracts coalition from trading at the price-taking
(recontract) so long as a more favourable con- equilibrium.
tract can be made. The result is perfect knowl- Edgeworth’s introduction of the requirement
edge (the Jevonian condition). that the commodity or service that is traded be
Edgeworth gave an intuitive argument for the highly divisible is a response to the following
problem:
need for an indefinitely large number of traders on
both sides of a market. It proceeds as follows. Let Suppose a market, consisting of an equal number of
masters and servants, offering respectively wages
and service; subject to the condition that no man can
Competition 1935
serve two masters, no master employ more than one free movement of resources is even approached in
man; or suppose equilibrium already established the real economy. Nor is the assumption of perfect
between such parties to be disturbed by any sudden
influx of wealth into the hands of the masters. Then
mobility necessary to eliminate monopoly power in
there is no d e t e r m i n a t e , and very generally a market: in the Victorian age, the price of wheat of
u n i q u e , arrangement towards which the system Iowa was set in Liverpool even though
tends under the operation of, may we say, a law of transportation costs were substantial. The
Nature, and which would be predictable if we knew
beforehand the real requirements of each, or of the
assumption is usually necessary to attain strict
average, dealer; ... (Edgeworth , p. 46). equality in the price of a good at every point (the
law of one price ), although even this is not strictly
Consider the simple example: a thousand masters
true (as in the factor price equalization theorem).
will each employ a man at any wage below 100; a
Clark also demanded that the economy be stationary
thousand labourers will each work for any wage
for perfect competition, a condition we shall return
above 50. There will be a single wage rate: knowl-
to later.
edge and numbers are sufficient to lead a worker to
All the elements of a concept of perfect com-
seek a master paying more than the going rate or a
petition were in place by 1900, and this concept
master to seek out a worker receiving less than the
increasingly became the standard model of eco-
market rate. But any rate between 50 and 100 is a
nomic theory thereafter. The most influential
possible equilibrium. But if a single worker leaves
statement of the conditions for perfect competition
the market, the wage will rise to 100, and if a single
was made by Frank H. Knight in his doctoral
employer withdraws, the wage will fall to 50. This
dissertation, Risk, Uncertainty and Profit ( ).
ability of a single trader to affect the price arises
The conditions were stated in extreme form; for
because of the lumpiness of the article traded (here
example, ‘There must be perfect, continuous,
a worker’s labour for a given period). Once a
costless intercommunication between all individual
worker can work for two masters, the withdrawal of
members of the society’ (Knight , p. 78) -
one worker in a thousand will reduce the available
so Jones in Seattle would know the price of potatoes
hours of work per day to each employer by only
and be able costlessly to ship to Smith in Miami a
8/1000 hours or 4.8 minutes per day, with only
bushel of potatoes at every moment of time.
negligible influence upon the wage rate.
Of course these conditions are not necessary,
Alternatively, a distribution of wage offers and
but only sufficient, to achieve the competitive
demands would also eliminate the indeterminacy
equilibrium. For example, if even a considerable
and market power.
fraction of buyers knows that seller A is charging
Edgeworth’s analysis was limited to competition
more than B for a given commodity, their patronage
within a market, and it was left to John Bates Clark
may be quite enough to force A to reduce his price
to emphasize the need for mobility of resources if
to that of B. Nor are the various conditions
the return on each resource was to be equalized in
independent of one another: for example, if it is
every use.
.. .there is an ideal arrangement of the elements of very cheap for either a commodity or its buyers or
society, to which the force of competition, acting on sellers to move between two places, that will insure
individual men, would make the society conform. that the prices in the two places will be widely
The producing organism actually shapes itself about known.
his model, and at no time does it vary greatly from
it ... We must use assumptions boldly and advisedly,
Along with the development of the concept of
make labour and capital absolutely mobile, and competition as a standard component of the theory
letting competition work in ideal perfection. (Clark , of prices and the allocation of resources, it acquired
pp. 68, 71) a growing role as the criterion by which to judge the
Perfect and free mobility of resources is of efficiency of actual markets. Adam Smith had
course an even more extreme assumption than the already advanced the proposition that output was
other conditions required for perfect competition maximized in a private enterprise economy with
because there is less reason to believe that competition. If each owner of a
1936 Competition
resource maximized the return from his resources, in products and production methods was the very
then (in the absence of ‘external’ effects of one essence of competitive capitalism. He argued that
person’s actions on others) aggregate output would the displacing of one product or method by another,
be maximized. This theorem (labelled ‘on a process which he called creative destruction, made
maximum satisfaction’) was developed and qual- the concept of perfect competition irrelevant to
ified by Leon Walras (1874), Alfred Marshall either positive analysis or welfare judgements. If the
(1980), Pareto (1895-6, 1907), Pigou (1912) and a monopoly that reduced output, compared to
host of modem economists. competition, by 10 per cent in one year, increased
Competition is much too central a concept in output by 100 per cent over the next two decades,
economics to remain unaffected when economists then monopoly might be preferred to stagnant
change their interests or analytical methods. We competition.
may illustrate this fact by the problem of economic It is crucial to this argument that monopoly
change. provides large, though temporary, rewards to suc-
In a regime of change, of growing population cessful innovators but competition does not:
and capital or innovations or new consumer
But perfectly free entry into a n e w field may make
demands, the problem of defining competition is it impossible to enter it at all The introduction of new
much more difficult than it is for the stationary methods of production and new commodities is
economy. Unless the change is predictable with hardly conceivable with perfect - and perfectly
precision, knowledge must necessarily be incom- prompt - competition from the start. And this means
that the bulk of what we call economic progress is
plete and errors and lags in adaptation to new incompatible with it. As a matter of fact, perfect
conditions can be large. For this reason, indeed, J.B. competition is and always has been temporarily
Clark believed that perfect competition was suspended whenever anything new is being
achievable only in the stationary economy. introduced - automatically or by measures devised
for that purpose - even in otherwise perfectly
Even short-run changes in market price raise the competitive conditions. (Schumpeter , pp. 104-5)
question: is the change in price initiated by a
particular seller or buyer, and if so, is this trader not Schumpeter relies on instantaneous rivalry to
facing a negatively sloping demand curve or a eliminate the incentives to innovation under com-
positively sloping supply curve? The infinitely petition, and the conclusion would not hold if
elastic supply and demand curves of perfectly competition is defined in terms of long-run
competitive equilibrium seem inapplicable to equilibrium.
periods of changing market conditions. Some Nevertheless the issue is not disposed of so
economists nevertheless retain the condition that easily. If change is continuous rather than sporadic,
individual traders cannot influence price by intro- long-run equilibria will never be frilly achieved.
ducing a hypothetical auctioneer who announces Several economists have emphasized that
price changes. alterations in the concept of competition are called
A partial adaptation of the competitive concept for in periods of historical change. Kirzner has
to change is made by making it a long-run equi- emphasized the role of entrepreneiuial rivalry in
librium concept. Even if resources are not costlessly competition, whereas such rivalry is nonexistent in
mobile and even if entrepreneurs do not have a perfectly competitive equilibrium. Demsetz has
perfect foresight, one can analyse the rate of proposed a concept of laissez-faire competition, in
approach of returns on resources to equality. If an which freedom of resoiuces to move into any use is
industry experiences a once-for-all large change, it the central element. Such realistic reversions to the
could be in competitive equilibrium before and after competitive concept of the classical economists
the change, and the equilibria could be studied by have not been systematically formalized into
competitive theory (comparative statics). theoretical models.
This adaptation did not satisfy Joseph The concept of perfect competition, or indeed
Schumpeter, who believed that incessant change any theoretically precise concept of competition,
Competition 1937
will not be met by the actual condition of compe- continue to play a major role in the economics of an
tition in any industry. John Maurice Clark made the enterprise economy.
most influential effort to create a concept of
‘workable competition’ which would serve as a
See Also
working rule for public policies which seek to
preserve or increase competition.
Clark emphasized the fact that if one requisite of
perfect competition is absent, it may be desirable
that a second requisite also be unfulfilled. For
example, with instantaneous mobility but imperfect
knowledge, members of an occupation would keep Bibliography
shifting back and forth between two cities, always
overshooting the amount of migration which would Clark, J.B. 1899. T h e d i s t r i b u t i o n o f w e a l t h .
London: Macmillan.
equalize wage rates. This propensity to overshoot
Clark, J.M. 1940. Toward a concept of workable competition.
equilibrium would be corrected with less mobility A m e r i c a n E c o n o m i c R e v i e w , June; reprinted
of labour. This problem was later formalized as the in R e a d i n g s i n t h e s o c i a l c o n t r o l o f
theory of the ‘second best’. i n d u s t r y . Philadelphia: Blakiston, 1942.
Cliffe Leshe, T.E. 1888. E s s a y s in political
The essence of the concept of workable com-
e c o n o m y a n d m o r a l p h i l o s o p h y . London:
petition was the belief that Tong- run curves, both Longmans Green.
of cost and of demand, are much flatter than short- Cournot, A. 1838. Researches into the mathematical principles
run curves, and much flatter than the curves which of the theory of wealth. Reprinted. New York: Macmillan,
1927.
are commonly used in the diagrams of theorists’
Demsetz, H. 1982. Economic, legal and political dimensions of
(J.M. Clark , p. 460). This correct and sensible view competition. Amsterdam: North-Holland.
led to a proliferation of studies, usually in doctoral Edgeworth, F.Y. 1881. M a t h e m a t i c a l p s y c h i c s .
dissertations, of individual industries, in which the London: Kegan Paul, 1932.
workableness of competition in each industry was Hildenbrand, W. 1974. C o r e a n d e q u i l i b r i a o f a
l a r g e e c o n o m y . Princeton: Princeton University
appraised. Unfortunately there were no objective Press.
criteria to guide these judgements, and there was no Jevons, W.S. 1871. T h e t h e o r y o f p o l i t i c a l
evidence that the studies were accepted by the e c o n o m y . London: Macmillan.
governmental agencies which administered Kirzner, I.M. 1973. Competition and
e n t r e p r e n e u r s h i p . Chicago: University of Chicago
competitive policies. Press.
The popularity of the concept of perfect com- Knight, F.H. 1921. R i s k , u n c e r t a i n t y a n d p r o f i t ,
petition in theoretical economics is as great today as Part 2. Boston: Houghton Mifflin Co.
it has ever been. The concept is equally popular as Marshall, A. 1890. P r i n c i p l e s o f e c o n o m i c s .
London: Macmillan.
first approximation in the more concrete studies of Mas-Colell, A., ed. 1982. Noncooperative approaches to the
markets and industries that comprise the field of theory of perfect competition. New York: Academic Press.
‘industrial organization’ (applied microeconomics). McNulty, P.J. 1967. A note on the history of perfect com-
The limitations of the concept in dealing with petition. J o u r n a l o f P o l i t i c a l E c o n o m y 75:
395-399.
conditions of persistent and imperfectly predicted Mill, J.S. 1848. P r i n c i p l e s o f p o l i t i c a l e c o n o m y .
change will not be removed until economics In C o l l e c t e d w o r k s , ed. J.M. Robson. Toronto:
possesses a developed theory of change. Even University of Toronto Press, 1965.
within a stationary economic setting the concept is Nutter, G.W. 1951. The extent of enterprise monopoly in the
United States, 1899-1939. Chicago: University of Chicago
being deepened by mathematical economists (see
Press.
Mas-Colell ). Meanwhile the central elements of Penny Cyclopedia ofthe Societyfor the Diffusion of Useful
competition - the freedom of traders to use their Knowledge. 1839.
resources where they will, and exchange them at Schumpeter, J.A. 1942. C a p i t a l i s m , s o c i a l i s m a n d
d e m o c r a c y . New York: Harper & Bros.
any price they wish - will
Senior, N.W. 1836. P o l i t i c a l e c o n o m y . London: W.
Clowes.
Shepherd, W.G. 1982. Causes of increased competition in the
1938 Competition and Efficiency
Smith, A. 1776. T h e w e a l t h o f n a t i o n s . Glasgow ed. situations, and that the term ‘industrial competition’
Oxford: Oxford University Press. 1976. be applied when mobility across industries is
Stigler, G.J. 1957. Perfect competition, historically con-
templated. J o u r n a l o f P o l i t i c a l E c o n o m y 65:
present. The work that I shall discuss deals with the
1-17. converse possibility: perfectly contestable markets,
Stigler, G.J., and R. Sherwin. 1985. The extent of the market. situations in which competition may not necessarily
J o u r n a l o f L a w a n d E c o n o m i c s 28: 555-585. exist within a particular market, but firms (and
Thornton, W.T. 1869. O n l a b o u r . London: Macmillan.
resources) are assumed to be perfectly mobile
across industries.
The role of entry and exit in assuring the equal-
Competition and Efficiency ization of returns across markets is not logically
limited those cases in which it is technologically
John C. Panzar feasible for the market to be populated by a large
number of firms, each capable of achieving an
efficient scale of operation. It may be expected that
the lure of profits might serve to make relevant
The association between economic efficiency and certain aspects of competitive theory even under
competition goes back at least as far as Adam conditions of ‘natural monopoly’. The most striking
Smith’s ‘invisible hand’ metaphor. Indeed, a goodly practical illustration of this point was the recent
portion of the vast body of subsequent work in deregulation of airlines in the United States. This
value theory has dealt with the normative issues took place, in part, because the free mobility of
arising from the workings of the competitive resources (aircraft) across markets led policy
economy. Thus any short essay on the topic must be makers to believe that satisfactory economic
somewhat idiosyncratic, focusing upon the points performance could be achieved without the
which are of greatest interest to the author. stultifying effects of economic regulation. This,
Therefore, I shall limit my attention to the despite the fact that most city-pair airline markets
properties of the (static, partial equilibrium) are natural monopolies and none can be expected to
economic model of perfect competition and how its support the large numbers of firms required by the
use has recently been extended to add to our perfectly competitive model. Thus the need to
understanding of a larger range of real world extend at least part of the competitive paradigm to
markets. I must leave to others the tasks of sorting incorporate such cases had become apparent.
out the importance of competition in, for example, In a classic article, Demsetz ( ) set forth
the Schumpeterian process of ‘creative destruction’, one way to break the commonly perceived link
the aggregation and transmission of society’s stock between monopoly provision of certain increasing
of information, or the evolutionary progress of returns services and monopoly conduct on the part
technological advance. Fortunately for my of the firm providing the service at any point in
purposes, the historical development of the time. By pointing out that the impossibility of
competitive model has been thoroughly analysed by competition within the market need not preclude
Stigler ( ). The formulation of effective competition for the market, Demsetz
the model, as we know it today, was completed in raised a fundamental challenge to the conventional
the work of Knight ( ). It is interesting to wisdom that the only effective ways to deal with a
note that the last refinement to be added was the technological natural monopoly were through
free mobility of resources across industries: i.e. the economic regulation or public enterprise.
entry and exist of firms. In his insightful concluding Demsetz chose to elaborate this idea in the
section, Stigler points out that competition can context of a franchise bidding scheme, in which the
flourish within a market without this last ingredient. franchise was to be awarded, not to the firm
(Consider an agricultural market with Ricardian offering the greatest lump sum payment to the
rents.) He suggested that the term ‘market municipal coffers, but to the firm offering to
competition’ be used to describe such
Competition and Efficiency 1939
serve the market at the lowest price. Subsequent price. While feasibility requires financial viability,
authors have criticized this as a policy proposal, it does not preclude the positive profits which may
focusing on the problems raised by considerations attract entry. Therefore, the neoclassical notion of
of sunk costs and incomplete contracts, from which long-run competitive equilibrium must encompass
Demsetz explicitly sought to abstract. However, some additional restrictions. More specifically,
there is another sense in which the franchise
bidding example may have been an unfortunate Definition 2 A long-run competitive equilibrium is
expository choice. Because it introduced a new any FIC which also has the property that py — c (y)
institution between the firm and the market - the < 0 for ally.
franchise auctioneer - this illustration may have While this characterization of long-run com-
obscured the link between the analysis of petitive equilibrium may be unfamiliar, it is equiv-
competition for the market and the earlier notion of alent to the standard notion of price taking firms
the role of free entry and exit in ensuring effective earning zero economic profits by equating marginal
industrial competition. Furthermore, Demsetz’s cost to price. (To see this, note that since profits are
simple bidding scheme cannot handle the realistic nonpositive for all output levels, the fact that py1
cases in which the monopolist produces two or C(y‘) > 0 means that output level y1 maximizes the
more technologically related services. rth firm’s profits. This, in turn, implies that MC(yl)
The theory of contestable markets developed by = p if firm i is producing.)
Baumol et al. ( ) is most usefully viewed Characterizing competitive equilibrium via
as an attempt to extend the neoclassical (partial Definitions and has the advantage of focusing
equilibrium ) theory of long-run competitive equi- attention on the role played by potential entry. The
librium to the case of increasing returns to scale. In strictures of Definition can be interpreted to mean
so doing, they developed a model which achieved that the firms in an industry in long-mn competitive
the Demsetz solution to the monopoly problem as equilibrium act as if they were policed by potential
the result of a market equilibrium process. This entrants prepared to enter the market in pursuit of
extension was accomplished by emphasizing the any profit opportunity calculated at current market
role played by potential entry in characterizing the prices. While this lack of attention to the possibility
role defining properties of long- run competitive of retaliatory price responses by rivals reflects the
equilibrium. To see this reinterpretation most noncooperative spirit of the competitive paradigm,
clearly, the following definitions are necessary: it ignores the response of consumers to a change in
the market price. Therefore it is useful to consider
Definition 1 A Feasible Industry Configuration making potential entrants ‘less optimistic’ in the
(FIC) is a collection of firms, i = 1, . . . , m output following sense:
vectors for each, y1, ..., and a market price vector p
such that each firm earns non-negative profits and Definition 3 A Sustainable Industry Configuration
the total quantity supplied equals the quantity (SIC) e ise any FIC e
which also satisfies the condition
demanded; i.e. py‘ — C(y‘) >0, for all i = l, ... ,m that p y - C(y ) < 0 for all pe < p andye < D(pe).
and J2y‘ = d(p) - where C is the (multiproduct) Thus firms in a SIC behave as if the market
were policed by potential entrants that calculate the
minimum cost function and D the market demand
profitability of entry under the assumption that
function.
incumbent firms’ prices remained unchanged, but
Feasibility surely reflects the minimal condi-
that do take account of the reality that consumers
tions one would expect to prevail in long-run
can be induced to purchase a larger quantity only at
industry equilibrium in a private enterprise econ-
a lower price. Put another way, an FIC is also an
omy: All firms must earn non-negative profits and
SIC when no potential
the total quantity supplied by firms equals the
amount demanded by consumers at the market
1940 Competition and Efficiency
entrant can anticipate earning a positive profit by when they exist, sustainable industry configurations
quoting a price at or below that prevailing in the are solutions to the Second Best optimization
market and serving all or a part of the resulting problem: maximize welfare (as measured, for
demand. The following semantic clarification example, by the sum of producers’ and consumers’
completes the characterization of a contestable surpluses) subject to the constraint that firms earn
market: non-negative profits. Clearly, when increasing
returns to scale render marginal cost pricing
Definition 4 A perfectly contestable market is one unprofitable, the best that can be done, in the
in which perfectly free entry and exit ensure that the absence of lump sum transfers and discriminatory
only possible long-ran equilibria are SICs. or non-linear pricing, is to set price equal to average
An immediate implication of Definitions , , , cost.
and is: However, even this level of performance can no
longer be guaranteeed once one moves to the
Proposition 1 Any long-run competitive equilib- realistic realm of multiple products. For example, a
rium is an SIC, but not conversely. Thus all monopolist producing two or more products can, in
perfectly competitive markets are perfectly con- general, find an infinite number of price com-
testable, but not all perfectly contestable markets binations which will yield it exactly zero economic
are perfectly competitive. profits. Some of these prices and resulting market
The proof follows from the fact that the condi- demand quantities may represent SICs. Call this set
tions which an FIC must satisfy in order to be a P. If the underlying cost and demand functions are
long-run competitive equilibrium are stronger than sufficiently well-behaved, there will exist a unique
those required of an SIC. Thus a long-run constrained welfare maximizing price vector p*.
competitive equilibrium is, by construction, an SIC. The most desirable efficiency result would be for
To see that the converse is not true, consider the the set P to consist of the single element p*.
case in which the average costs of production fall Unfortunately, it is easy to construct examples in
throughout the relevant range; i.e. at least as far as which P does not contain p*, as well as cases in
the intersection of the average cost curve and the which P is empty. What efficiency properties does
market demand curve. The point of intersection, the this generalized process of industrial competition
Demsetz outcome, characterizes a sustainable possess when extended beyond the realm of perfect
industry configuration, since profits are non- competition? The results that pertain generally he
positive and no point on or below the demand curve entirely on the cost side.
can yield non-negative profits at a lower price.
However this outcome is clearly not a long-run
competitive equilibrium because price is equal to Proposition 2 In any SIC, the industry’s output is
average cost which, by hypothesis, is strictly greater divided among the firms in a way that minimizes
than marginal cost. The above demonstration points total industry costs.
out the fact that the concept of contestable markets The proof is by contradiction. Consider an ini-
can be applied beyond the large-numbers case of tial SIC composed of m firms producing output
perfect competition. However it also raises vectors, y1, . . . , ym, at market prices p. Suppose,
questions about the efficiency properties of such contrary to hypothesis, that there exists an alterna-
markets. The fact that equilibrium may iiwolve a tive group of Affirms with output vectors, z1,..., zk,
price greater than marginal cost means that the First that could produce the current industry output at a
Best optimality properties of the competitive model lower total cost That is JUz' = D(p) = JA/, but Y,fV)
need no longer apply. What efficiency properties, < £,C(y'). Then the new group, in total, would earn
then, can be associated with equilibria in positive economic profits at the initial price p. This
contestable markets? In the case of single product is true because, by hypothesis, total revenues would
markets it is intuitively clear (and straightforward to be equal, but total costs would be lower for the
prove) that, alternative group, while the initial
Competition and Selection 1941
behaviour; Satisficing; Schumpeter, J.; generate this sort of result, they are the basis for a
Schumpeterian competition; Selection equilib- critique of its generality and an appraisal of the
rium; Simon, H.; Static equilibrium; Survival of strength of the cratch on which standard theory
the fittest; Technological opportunity; Winter, S. leans. They also provide a helpful entry-way to the
G. much broader class of evolutionary models in which
mimicry results fail to hold. This entry-way has the
convenient feature that the return path to standard
JEL Classifications theory is well marked; the sense in which
BO evolutionary theory subsumes portions of standard
theory becomes clear.
Under competitive conditions, a business firm must
The concept of competition need not, of course,
maximize profit if it is to survive - or so it is often
be considered only in the context of perfectly
claimed. This purported analogue of biological
competitive equilibrium. In a broader sense of the
natural selection has had substantial influence in
term, any nontrivial selection model in which the
economic thinking, and the proposition remains
‘fit’ prosper and the ‘unfit’ do not is a model of a
influential today. In general, its role has been to
‘competitive’ process. The process need not have a
serve as an informal auxiliary defence, or cratch, for
static equilibrium, or any equilibrium, and it may
standard theoretical approaches based on
easily lead to results that are clearly non-
optimization and equilibrium. It appeared explicitly
competitive by the standards of industrial
in this role in a provocative passage in Milton
organization economics.
Friedman’s famous essay on methodology
The remainder of this essay first considers in
(Friedman , eh. 1), and it seems that many
more detail the theoretical links between selection
economists are familiar with it in this context only.
processes and competitive equilibrium outcomes. It
There is, however, an alternative role that the
then examines a more interesting and less well-
proposition can and does play. It serves as an
explored area that involves selection and, in a broad
informal statement of the common conclusion of a
class of theorems characterizing explicit models of sense, competition; Schumpeterian competition.
economic selection processes. A model in this class
posits, first, a range of possible behaviours for the Competitive Equilibrium as a
firm. This range must obviously extend beyond the Selection Outcome
realm of profit maximization if the conclusion of
the argument is to be non-trivial, and it must include The intention here is to describe the heuristic basis
behaviour that is appropriately termed ‘profit of existing examples of this type of theorem, or,
maximizing’ if the conclusion is to be logically alternatively, to describe the basic recipe from
attainable at all. The model must also characterize a which an obviously large class of broadly similar
particular dynamic process that in some way results could be produced. There may be other basic
captures the general idea that profitable firms tend recipes, as yet unknown. There certainly are ways to
to survive and grow, while unprofitable ones tend to ignore individual instructions of the recipe and yet
decline and fail. A stationary position of such a preserve the result, though at the cost of delicately
process is a ‘selection equilibrium’. contrived adjustments in other assumptions.
Models of this type occupy an important but (To avoid confusion, it should be noted at the
non-central position in evolutionary economic outset that the word ‘equilibrium’ is used in two
theory (Nelson and Winter ). They establish that the different senses in this discussion, the ‘no incentives
equilibria of standard competitive theory can indeed to change behaviour’ sense employed in economic
be ‘mimicked’ (in several different senses) by the theory and the ‘stationary position of a dynamic
equilibria of selection models. More importantly, by process’ sense that is common outside of
making explicit the strong assumptions that
apparently are required to
Competition and Selection 1943
economics. The point of the discussion is, in positive profit that is potentially achievable at
fact, to relate these two equihbrium ideas in a selection equihbrium scale.
particular way.)
In standard theory, expansion in response to
(1) Constant returns to scale must prevail in the profitability may be seen as an aspect of the firm’s
specific sense that the supply and demand profit-seeking on the assumption that it regards
functions of an individual firm at any particular prices as unaffected by its capacity decisions. In
time are expressible as the scale (or ‘capacity’ ) turn, this ordinarily requires that the firm in
of that firm at that time multiplied by functions question be but one of an indeterminately large
depending on prices, but not directly on scale number of firms that all have access to the same
or time. Increasing returns to scale must be technological and organizational possibilities.
excluded for familiar reasons. Decreasing While the assumption that firms have identical
returns must be excluded because they will in production sets and behavioural rules is common
general give rise to equilibrium ‘entrepreneurial and appears inoffensive in orthodox theorizing, it is
rents’ which could be partially dissipated by very much at odds with evolutionary theory. The
departures from maximization without orthodox view comes down to the assertion that all
threatening the survival of the firm. Thus, for productive knowledge is freely available to one and
example, the U-shaped long ran average cost all - perhaps it is all in the public library. By
curve of textbook competitive theory does not contrast, evolutionary theory emphasizes the role of
provide a context in which selection necessarily firms as highly individualized repositories of
mimics standard theory if competitive productive knowledge, not all of which is
equilibrium would require some firms to be on articulable. From the evolutionary perspective, the
the upward sloping portion of the curve. fact that mimicry theorems rely on assumptions of
unimpaired access to a public knowledge pool is by
(2) Firms must increase scale when profitable and itself sufficient to make it clear that the selection
decrease scale (or go out of business entirely) argument can provide only a weak and shaky crutch
when unprofitable. Alternatively, profitability for standard competitive theory.
of a particular firm must lead to entry by
perfect imitators of that firm’s actions. In the (3) A firm that is breaking even with a positive
absence of such assumptions, it is plain that output at prevailing prices must not alter its
there w i l l in general be equilibria with non- behaviour; a potential entrant that would only
zero profit levels, which under assumption (1) break even at prevailing prices must not enter.
cannot mimic the competitive result. While the This assumption is needed to assure that the
‘decline or fail’ assumption is a plausible competitive equilibrium position is in fact a
reflection of long-run breakeven constraints stationary position of the selection process.
characteristic of actual capitalist institutions, no
such realistic force attaches to the requirement Models of natural selection in biology do not
that profitability lead to expansion. If firms do typically involve this sort of assumption, but neither
not pursue profits in the long-run sense of do they conclude that only the fittest genotypes
expanding in response to positive profitability, survive - the biological analogue of the proposition
stationary positions may involve positive discussed here. Rather, they show how constant
profits. Such stationary positions fail to mimic gene frequencies come to prevail as the selection
competitive equilibria for that reason alone forces that tend to eliminate diversity come into
(given constant returns), but they also introduce balance with mutation forces that constantly renew
once again the possibility that the short-run it. A strictly analogous treatment of economic
behavioural responses of surviving firms may selection would be much more appealing than the
dissipate some of the sort of result discussed here.
1944 Competition and Selection
It would admit that occasional disruptions may equilibrium must clearly be one in which some
arise from random behavioural change, or from firms take actions that are profit maximizing in that
over-optimistic entrants. Thus, potentially at least, competitive equilibrium, and in this sense are profit
it could better serve the purpose of establishing the maximizers. But this observation does not imply
point that the results of standard competitive theory that the survivors in the selection equilibrium
are in some sense robust with respect to its possess maximizing rules, and in general it is not
behavioural assumptions. Unfortunately, standard necessary that survivors be maximizers in this
theory offers no clue as to what this sense might be. stronger sense. (Proof: Consider a competitive
It is plain that the adjustment processes of the equilibrium with constant returns to scale. Restrict
system are centrally involved, and there is no the firms’ supply and demand functions to be
behaviourally plausible theory of adjustment that is constant up to a scale factor at the values taken in
the dynamic counterpart in the disciplinary par- the given equilibrium. Embed this static equilibrium
adigm of static competitive equilibrium theory. in a dynamic adjustment system in which firms’
Within the limits defined by the requirement for scales of output respond to profitability in
a strictly static competitive outcome, the most accordance with assumption (2). Then the given
plausible approach combines the idea of charac- competitive equilibrium becomes a selection
terizing the firms in the selection process by their equilibrium - since the only techniques in use make
‘rules of behaviour’ - an idea advanced in a seminal zero profit - but the firms are not profit maximizers
paper by Armen Alchian ( ) - with Her- in the stronger sense.)
bert Simon’s idea of satisficing ( ). In the The second point extends the first. The notion
simplest version, each firm simply adheres of profit maximizing behavioural rules itself rests
unswervingly to its own deterministic behavioural on the conceptual foundation of a production set or
rule (or ‘routine’, in the language of Nelson and function that is regarded as a given. In evolutionary
Winter ). Such a rule subsumes or implies the theory, however, it is the rules themselves that are
firm’s supply and demand functions, and given the regarded as data and as logically antecedent to the
conditions set forth in (1) and (2) above, a constant values (actions) they yield in particular
environment evokes a constant response. Satisficing environments. Thus, in this context, a problem
may be introduced as a complication of this picture arises in interpreting the basic idea of a selection
by an assumption that a firm that sustains losses equilibrium mimicking a standard competitive one:
over a period of time will search for a better there is no obvious set of ‘possibilities’ to which
behavioural rule; this adds behavioural plausibility one should have reference.
to the adjustment process but does not introduce the The most helpful approach here emphasizes
possibility that random rule change might disrupt internal consistency. Assumptions about the struc-
an otherwise stationary competitive equilibrium ture of what is ‘possible’ can be invoked without
position. the additional assumption that there is a given set of
possibilities - for example, additivity and divis-
(4) The final requirement can be succinctly but ibility may be assumed without implying that the
inadequately stated as ‘some firms must actu- set of techniques to which these axioms apply is a
ally be profit maximizers’. Although this for- given datum of the system. Such an approach
mulation does adequately cover some simple provides a basis for discussing whether a particular
cases, it does not suggest the depth and subtlety selection equilibrium is legitimately interpretable
of the issues involved. as a competitive equilibrium given the other
assumptions in force. Along this path one can
Two points deserve particular emphasis here.
explore a rich variety of selection equilibrium
The first is the distinction between profit maxi-
situations that may be thought of as competitive
mizing rules of behaviour (functions) and profit
equilibria. Precisely because the variety is so rich,
maximizing actions. In general, a selection equi-
to know only that an outcome is interpretable in this
librium that mimics a particular competitive
fashion is to know very little about it.
Competition and Selection 1945
In the light of formal analysis of selection and predictions put forward in Schumpeter’s var-
models of the sort described above, how strong is ious writings. Regardless of the verdicts ultimately
the crutch that selection provides to standard rendered on particular points, everyday observation
theory? For many analytical purposes, it is a crucial repeatedly confirms the appropriateness of his
weakness that the crutch relates only to equilibrium emphasis on the centrality of innovation in
actions and not to behavioural rules; it is from the contemporary capitalism. It confirms, likewise, the
knowledge that the rules are maximizing that the inappropriateness of the continuing tendency of the
results of comparative statics derive. A selection economics discipline to sequester topics related to
system disturbed by a parameter change from a technological change in subsectors of various
‘mimicking’ equilibrium does not necessarily go to specialized fields, remote from the theoretical core.
a new ‘mimicking’ equilibrium, let alone to one that The purpose of the present discussion is to
is consistent, in standard theoretical terms, with the assess the relationships of selection and competition
information revealed in the original equilibrium. from a Schumpeterian viewpoint, that is, to extend
More fundamentally, selection considerations the discussion above by considering what difference
cannot compensate for the inadequacies of standard it makes if firms are engaged in inventing,
theory that arise from the basic assumption that discovering and exploring new ways of doing
production possibilities are given data of the things. Plainly, one difference it makes is that
system. ‘competition’ must now be understood in the broad
sense that admits a number of additional
dimensions to the competitive process, along with
Schumpeterian Competition price-guided output determination. In particular,
costly efforts to innovate, to imitate the innovations
In two great works and in many other writings,
of others, and to appropriate the gains from
Joseph Schumpeter proclaimed the central impor-
innovation are added to the firm’s competitive
tance of innovative activity in the development of
repertoire.
capitalism. His early book, The Theory of Economic
Selection now operates at two related levels.
Development, focused on the role and contribution
The organizational routines governing the use made
of the individual entrepreneur. From today’s
of existing products and processes in every firm
perspective the work remains enormously insightful
interact through the market place, and the market
and provocative but may seem dated; the image of
distributes rewards and punishments to the
the late 19th-century captains of industry lurks
contenders. These same rewards and punishments
implicitly in the abstract account of the
are also entries on the market’s scorecard for the
entrepreneur. The late work, Capitalism, Socialism
higher level routines from which new products and
and Democracy, is likewise insightful, provocative
processes derive - routines involving, for example,
and a bit anachronistic. In this case, the
expenditure levels on innovative and imitative
anachronism derives from the predictions of a
R&D efforts. Over the longer term, selection forces
future in which the innovative process is bureau-
favour the firms that achieve a favourable balance
cratized, the role of the individual entrepreneur is
between the rents captured from successive rounds
frilly usurped by large organizations, and the
of innovation and the costs of the R&D efforts that
sociopolitical foundations of capitalism are thereby
yield these innovations.
undercut. Present reality does not correspond
In formal models constructed along these lines,
closely to Schumpeter’s predictions, and it seems
it is easy to see how various extreme cases turn out.
increasingly clear that he greatly underestimated the
One class of cases formalizes the cautionary tale
seriousness of the incentive problems that arise
told by Schumpeter ( , p. 105), in which
within large organizations, whether capitalist
competition that is ‘perfect - and perfectly prompt’
corporations or socialist states.
makes the innovative role non-viable.
Substantial literatures have accumulated around
a number of specific issues, hypotheses
1946 Competition and Selection
Sufficiently high costs of innovation and low costs model firms apply to innovative effort and their
of imitation (including costs of surmounting any innovative achievements. The long mn behaviour of
institutional barriers such as patents) will lead to the the model as a whole depends critically on the
eventual suppression of all firms that continue to answers provided for a set of key questions relating
attempt innovation, and the system will settle into a to technological opportunity. Does the individual
static equilibrium. (The character of this firm face diminishing returns in innovative
equilibrium may, however, depend on initial achievement as it applies additional resources over
conditions and on random events along the evo- a short period of time? If so, from what ‘ fixed
lutionary path; the production set ultimately arrived factors’ does the diminishing returns effect arise,
at is an endogenous feature of the process.) One can and to what extent are these factors subject to
also construct model examples to illustrate the change over time either by the firm’s own efforts or
cautionary message ‘innovate or die’, the principal by other mechanisms? Are selection forces to be
requirement being simply a reversal of the cost studied in a context in which technological
conditions stated above. opportunity presents more or less the ‘same
With the exception of some extreme or highly problem’ for R&D policy over an extended period,
simplified cases, models of Schumpeterian com- or is the evolutionary sorting out of different
petition describe complex stochastic processes that policies for the firm a process that proceeds
are not easily explored with analytical methods. Of concurrently with historical change in the criteria
course, the activity of writing down a specific that govern the sorting?
formal model is often informative by itself in the Technological opportunity is said to be constant
sense that it illuminates basic conceptual issues and if R&D activity amounts to a search of an
poses key questions about how complex features of unchanging set of possibilities - in effect, there is a
economic reality can usefully be approximated by a meta-production set or meta-production function
model. Some additional insight can then be obtained that describes what is ultimately possible.
using simulation methods to explore specific cases Increasing technological opportunity means that
(Nelson and Winter , Part V; Winter ). One of the possibilities are being expanded over time by causal
most significant benefits from simulation is the factors exogenous to the R&D efforts in question -
occasional discovery of mechanisms at work that implying that, given a level of technological
are retrospectively ‘obvious’ and general features of achievement and a level of R&D effort, the effort
the model. will be more productive of innovative results if
The discussion that follows pulls together a applied later. With constant technological
number of these different sorts of insights, empha- opportunity, returns to R&D effort must eventually
sizing in particular some issues that do not arise in be decreasing, approaching zero near the boundary
the related theoretical literature that explores var- of the fixed set of possibilities.
ious Schumpeterian themes using neoclassical It is all too obvious that it may be very difficult
techniques (For the most part these neoclassical to develop an empirical basis for modelling tech-
studies explore stylized situations involving a single nological opportunity in an applied analysis of a
possible innovation, and thus do not address issues particular firm, industry or national economy. There
relating to the cumulative consequences of dynamic is no easy escape from the conundrum that observed
Schumpeterian competition. See Kamien and innovative performance reflects both opportunity
Schwartz ( ) and Dasgupta and endogenously determined effort, not to mention
( ) for references and perspectives on this the fact that neither performance nor effort is itself
literature.) easily measured or the even more basic question of
A fundamental constituent of any dynamic whether analysis of the past can illuminate the
model of Schumpeterian competition is a model of future. These difficulties in operationalizing the
technological opportunity. Such a model establishes concept of technological opportunity do not,
the linkage between the resources that unfortunately, in any way diminish its critical role
in Schumpeterian competition.
Competition and Selection 1947
endowments of two countries starting to compete to be continuously changing in ways that could not
would lead to specialization up to the point where, be predicted by counting heads or hectares.
at the margin, the relative costs of each pair of In the 1950s and 1960s, when industrial inter-
goods were the same in each country. In fact, with national competition became much more open and
increasing returns to scale, equal relative marginal many countries were newly industrializing, trade
costs are no guarantee of equilibrium. Competitive both ways in different variants of the same products
advantage may accrue to an industry in the country grew prodigiously. It has become clear that
where home demand expands and is protected for a international competition drives intraindustry trade
time, so that relative costs fall as output increases. in imperfectly competitive differentiated markets;
This so-called ‘infant industry’ case is still the most that in very many markets today’s net exporter is
striking illustration of the inadequacy of simple tomorrow’s importer; and that international
static theorems about the effects of international competition involves the international transfer of
competition upon trade flows. capital and knowhow as well as of goods.
The account of trade flows due to Heckscher and Something is needed then to supplement the
Ohlin (namely that it depends on differing input explanation of broad directions of trade advantage
requirements for different goods and differing input in terms of fixed differences in factor endowments
endowments in different countries) remains the and given technology. There has arisen a less
most important observation about international determinate, more detailed account of disequilib-
competitive advantage. rium change in trade flows under conditions of
This model of specialization through trade can imperfect competition. This account makes it clear
be modified to cope quite well with such market how international competition leads to two-way
imperfections as transport and other transaction flows of trade within the same industry. Where
costs, limited information, a degree of imperfect there are economies of scale in each of eight plants,
competition in factor markets and limited trade four in each country, producing four variants of one
barriers. It becomes seriously strained however product, a likely outcome is that four of the eight
when one tries to use it to explain the results of will survive, perhaps two in each country, and in
international competition in the world of many each country the loyal customers of each type will
traders and many commodities. The number of import or buy at home accordingly. Net trade flows
separate ‘factors of production’ has to be increased might be zero but gross trade might be half of total
to equal the number of traded commodities and consumption. Or two countries, one rich and one
defined to suit the observed trade flows. poor, might have developed at home (through
Competitive advantage became something domestic competition) lower relative costs in the
‘revealed’, rather than predictable by taking a ‘superior’ variant in the richer country and in the
census of resources. Cheap US skilled labour might ‘down-market’ variant in the poorer country. As
be redefined as capital; but since physical capital trade becomes easier, with cheaper transport and
was also cheap in the US the paradox which communications or lower tariff barriers, the poor
Leontief observed - that the US tended to import country exports its variant to the poorer consumers
capital-intensive goods and export labour- intensive in the richer country and vice versa. As incomes
- could not be resolved this way. grow in both countries, demand in both will shift
On another view, out of which grew an alter- more and more towards the rich country’s variant,
native and more robust model of international and the poorer country loses overall market share.
competition, high technology was regarded as Where the pressures of competition are leading
complementary to (skilled) labour explaining the to successive shifts in state-of-the-art technology in
bias of US trade. But a theory which relied first on a discontinuous way, competitive advantage may
separating factors of production but then required move from the technological leader nation to the
them to be lumped together is not a good theory. fast followers as they acquire this
Furthermore, competitive advantages were found
1950 Competition in International Trade
technology; and back to the leader again with the an economy, open to fierce international competi-
next major breakthrough, as has been observed in tion, which has lost its place in this way and is
industries as different as radio receivers and rapidly de-industrializing.
textiles. The typical market in international trade now-
These models of changing comparative advan- adays is one in which there is a small number of
tage owe much to Joseph Schumpeter. They focus firms each with a strong home base but competing
upon the temporary monopoly advantages that in each other’s markets and in thud markets. Price
accrue to the innovator, extended sometimes by cuts are typically quickly matched, so oligopolists
patent protection and economies of scale. compete (and try to secure their segment of the
As the technology gap is closed by imitators market) by differentiating their products. Cars,
competing away the monopoly profits, the tech- clothes and computers are therefore both imported
nological leader nation, like Schumpeter’s indi- and exported as international competition drives
vidual innovator, jumps to new dry steppingstones individual producers to specialize.
in today’s ‘sunrise’ industries. The technology gap Much of the most recent work on international
is moved but kept open. competition examines how trade develops under
Whenever this evolution involves new products, conditions of imperfect competition. Intraindustry
it becomes closely linked to a now well- established trade flows are affected by increasing returns to
view of the development of product markets: scale in individual product lines, by product
namely, that products move through a life cycle of differentiation, by competition in non- price terms
youthful growth as comparative luxuries; into (design, reliability', durability, variety, packaging,
explosive growth as they are mass-produced and servicing, distribution, delivery speed, advertising)
become available to poorer consumers; into and by the scope for vertically integrated
maturity, stagnation and sometimes death as they corporations to choose the separate locations of the
are superseded by a more efficient or desirable way different stages of production of the final product.
of satisfying the same underlying needs. As With intra-industry trade specialization, the
products move through their life cycles, threat of re-entry into each other’s specialisms, as
international competitive advantage will tend to well as the possibilities of new entiy, limit
shift from developed technological leader monopoly power. Other things being equal, entry is
economies to newer or poorer industrializing easier where world demand is growing fast and
economies, particularly those with large domestic when the background technology is changing fast.
markets allowing homegrown scale economies. Trade under conditions of imperfect competition
Technology gap and product life-cycle theories tends to lead to increasing ‘narrow market’ spe-
do not explain why some economies lead and others cialization but decreasing ‘wide market’ monopoly
follow. The underlying point is that industrial power, together with lower costs and equal or
leadership creates a kind of human and physical greater breadth of consumer choice.
capital infrastructure, usually reflected in vigorous Two final international competition issues are
R&D. This intellectual property is partly external to important: the occasionally perverse effects of
the individual competitors in the leader economies increased price competitiveness through currency
and acts as a catalyst for new processes and depreciation; and the effect upon trade flows of the
products. There is a virtuous circle at work so long competitive options open to MNCs.
as resources can move freely from yesterday’s Suppose, through successive devaluations, one
staples to tomorrow’s winners. When structural country’s relative unit labour costs fall. (Relative
adaptation is slow, international competition can unit labour costs are a better measure of price
remove a historical leadership position from an competitiveness than relative export prices because
economy whose high relative wages then prevent it the latter catch only actual transactions, not lost
from competing later in the cycle with newly- orders.) The benefit to its trade in manufactures may
industrializing fast-followers. The UK is be short-lived. Orders wi 11 come in at
Competition Policy 1951
the most price-sensitive end of each of its industrial Caves, R.E., and R.W. Jones. 1981. W o r l d t r a d e a n d
markets. Since this is usually the less sophisticated p a y m e n t s : A n i n t r o d u c t i o n , 3rd ed. Boston:
Little Brown.
end, the nation’s industries tend to become less Cornwall, J. 1977. M o d e m c a p i t a l i s m , i t s g r o w t h
competitive in non-price terms: to be impelled to a n d t r a n s f o r m a t i o n . London: Martin Robertson,
specialize in those product versions whose demand ch. 10.
grows least fast as incomes grow. In the longer run, Gmbel, H., and P. Lloyd. 1975. Intra-industry trade: The theory
and measurement of international trade in differentiated
scale economies (and overall market share) may be products. New York: Wiley.
lost. Paradoxically, devaluation may have its best Houthakker, H.S., and S.P. Magee. 1969. Income and price
chance of increasing trade competitiveness when elasticities in world trade. T h e R e v i e w of
domestic competition is limited, so that home E c o n o m i c s a n d S t a t i s t i c s 51(2): 111-125.
Kanamori, H. 1968. Economic growth and exports. In
currency prices can be raised in price-inelastic Economic growth: The Japanese
markets, and the higher margins be used to e x p e r i e n c e s i n c e t h e M e i j i E r a , ed. L. Klein
overcome non-price disadvantages in world and K. Onkawa. Homewood: Richard Irwin.
markets. Leontief, W.W. 1934. Domestic production and foreign trade.
E c o n o m i c a I n t e m a z i o n a l e 7: 3-32. Reprinted in
Large firms can move technology, capital and American Economic Association, R e a d i n g s i n
management skills, as well as goods, and change i n t e r n a t i o n a l e c o n o m i c s , Homewood: Richard
domestic factor supplies by training. International Irwin, 1968.
competition drives firms to look for the lowest cost Linder, S. 1961. A n e s s a y o n t r a d e a n d
t r a n s f o r m a t i o n . Stockholm: Almqvist & Wiksell.
way of creating value-added. It is often cheaper for
Schumpeter, J.A. 1942. C a p i t a l i s m , s o c i a l i s m a n d
a MNC to transfer key inputs than to ship final d e m o c r a c y . New York: Harper.
products. Exporting, licensing, local assembly or Stout, D.K. 1977. International price competitiveness, non-price
packaging, joint ventures with local producers, and factors and export performance. London: National
Economic Development Office.
complete local production and marketing are
Thirlwall, A.P. 1980. Balance of payments theory and the UK
alternative modes of international competition. experience. London: Macmillan.
Each mode has private benefits and costs. The Venables, A.J. 1985. International trade, trade and industrial
choice between them is affected by the expected policy and imperfect competition: A survey. Centre for
Economic Policy Research Discussion Paper No. 74, Oct
reactions of competitors and by the policies of the
Vernon, R. (ed.) 1970. T h e t e c h n o l o g y f a c t o r i n
consumer governments. Protection, and its threat, international trade. National Bureau of
and dramatic improvements in communications in Economic Research Conference Series No. 22. New York.
recent years have led to a large increase in Wells, L. 1972. International trade: The product life cycle
approach. In T h e p r o d u c t l i f e c y c l e a n d
international competition between local subsidiaries
i n t e r n a t i o n a l t r a d e , ed. L. Wells. Cambridge,
of MNCs and an increase in the proportion of trade
in intermediate goods within vertically integrated
firms. The overall volume of trade is nowadays a
poor measure of the importance or strength of
Competition Policy
international competition.
Alan Hughes
See Also
► International Trade
The content and direction of competition policy is
inevitably conditioned by the domestic structure
References and international relations of the economy in which
Caves, R.E. 1971. International corporations: The industrial the policy is applied. Since my discussion, although
economics of foreign investment. E c o n o m i c a 38: theoretical, is essentially concrete, I will direct my
1-27. analysis to the case of competition policy in the
UK. However, the argument may readily be
generalized to other economies.
1952 Competition Policy
link between market behaviour and some notion of distribution of income. It is inevitable that the
economic efficiency we obviously need to define a achievement of the kind of efficiency we have
concept of competition. The best- developed notion described above as the objective of UK economic
of competition in formal economic analysis is that policy will involve changes in the distribution of
inherent in the state of affairs known as perfect income. It not least between different industrial
competition. It is easy to show that, under certain nations, and between sectors of the domestic
very restrictive conditions, perfect competition will economy. For all these reasons we have preferred to
lead to economic efficiency in the traditional Pareto adopt another approach to competition to use in
sense in which resources are so allocated that no relation to our efficiency criterion set out earlier.
one can be made better off without someone else This approach analyses competition, not as a
being made worse off. Unfortunately, both the state of affairs, but as a dynamic process linking
notion of perfect competition and the related Pareto structural change with market behaviour. Compe-
efficiency criterion are too restrictive to serve as tition is taken to mean that range of activities aimed
useful foundations for the analysis of competition at meeting the objectives of one producer at the
policy. The state of affairs embodied in perfect expense of others, and is thus defined in the
competition assumes away nearly all those aspects business sense of the word, as a process involving
of business behaviour with which competition rivalry between producers. Competitive rivalry
policy is concerned (such as rivalry in terms of takes both the form of contests within existing
price setting, innovation in products and processes, markets, and the form of potential entry into new
and advertising) and ignores the static welfare gains areas when prospective returns appear relatively
to be had by improving the internal organization of attractive. It includes rivalry in terms of price, but
enterprises, in favour of an analysis concentrating also in terms of altered or improved techniques of
exclusively on the gains to be had by reallocating production or products, and in terms of the provi-
resources between perfectly internally efficient sion of information to consumers about products.
producers. Moreover, even within its own restricted All these forms of rivalry have consequences for the
ambit, it yields few helpful decision rules for an level and rate of growth of the technical efficiency
imperfect world to adopt since it is unfortunately of production and standards of consumption, for the
the case that if the conditions necessary for perfect allocation of resources between industries, and for
competition are unavoidably absent in any single the evolution of the structures of markets
market then nothing can be said in general about the themselves.
correct behaviour to be followed in the rest of the There are a number of theories from which we
economy. In particular, it does not follow, for may choose, which involve notions of competition
example, that the perfectly competitive partial as a dynamic process. The best known are those of
equilibrium solution of setting prices equal to Marx ( -94), Schumpeter ( ),
marginal cost, so that normal profits are earned in Downie ( ) and Clark ( ). This entry is
all sectors but the affected one, is the next best based on the characterization of the competitive
solution to setting prices equal to marginal cost process by Downie. This is because he provides in a
everywhere. This ‘problem of the second best’ relatively simple manner, an analysis which is both
means that appropriate pricing rules can only be more precise for our purposes than Clark, and
derived after a piecemeal approach to individual avoids the complications imposed by the richer,
cases in which the input-output links with the more cosmic, implications of the Marxian and
distorted sectors are examined in order to gauge in Schumpeterian analyses of capitalism. Moreover,
which direction, and to what extent, price should the Downie model includes the main beneficial
diverge from marginal cost. The Pareto efficiency effects usually claimed to follow from competitive
criterion is, of course, limited because of its rivalry, in terms of cost efficiency, resource
inability to deal with those economic changes which allocation, and technical progress, whilst
involve alterations in the emphasizing the possible structural changes flowing
from competition which may change its
1954 Competition Policy
nature and effects. We can then, following Downie, scale, may limit the power of losers to respond
sketch an outline of a simple competitive process in actively enough to prevent the transfer mechanism
a given market and consider its efficiency from leading to the concentration of ever more
implications. output in fewer hands. Thus, in the absence of
In any market we may expect that firms with the revival the end product of the competitive process
lowest cost structure (inherited from the past), will may be domination of the market by the single
for any given market price have the highest profits producer whose efficiency in the past has
to finance expansion. In their own pricing policy, outstripped all rivals.
firms are expected to set a price which promises to However, before this stage is reached, the pri-
attract customers and provide sufficient retained vate gains from collusion between remaining pro-
profits to finance the capacity necessary to serve ducers may become apparent, since it can offer both
them in a balanced manner. In this way, within lower costs of competition and a higher market
markets, relatively low costs will be associated with price. The costs of adding to the stock of goodwill
relatively fast growth, and a competitive transfer of accumulated by past advertising can be reduced,
market shares from the less to the more efficient and the necessity to indulge in competitive process
will occur. Moreover, this transfer mechanism may and product innovation is modified. Moreover,
have a feedback effect on efficiency in the sense collusion can offer the leading firms higher margins
that the previously less efficient will be threatened by concerted pricing policies, at least in the short
with an ever-diminishing share of the market unless run, and possibly in the long run too, if the scales of
they can improve their cost position, for instance by production and advertising created by the
introducing more recent innovations in production competitive process itself form effective barriers to
technique. entry. Thus the competitive process seems to end
The transfer and innovation mechanisms within inevitably in dominant-firm monopolistic or
markets thus have beneficial effects, both upon oligopolistic structures, and in the suppression of
allocative efficiency, by transferring output and the process itself. This reduces the pressure to
resources to the currently most efficient producers, increase the level of efficiency through time and, in
and upon technical progressiveness, by encouraging the case of collusion, to reduce the dispersion of
the introduction of the least-cost techniques efficiency. Market price is higher, and to some
available. The outcome of this competitive process, extent output is lower than it might otherwise be.
however, in terms of the changing allocation of High-cost plants and firms can remain in production
output between different firms over time, and the at the higher market price and the incentive to
impact of this upon competitive behaviour itself, improve products and production methods falls.
will depend ultimately on the answer to a number of Thus, both levels and rates of growth of efficiency
empirical questions about the relationship between in any market may be worsened by the cessation of
past and future efficiency, and company size and the competitive process in structural conditions
performance. If past success is repeated, and which may themselves limit the chances of
resulting gains in relative size offer efficiency substantial new entry.
advantages in themselves, through economies of In terns of this analysis the objective of com-
scale or enhanced innovative ability, then particular petition policy would appear to be the identification
markets may come to be dominated by ever fewer, and regulation of those structures, restrictions on
ever larger, films as a result of the transfer entry and kinds of behaviour which arrest the
mechanism, whilst the innovative mechanism, competitive process, and produce the efficiency
acting as a spur for past losers to improve losses described above. This clearly poses problems
performance, may not be powerful enough to offset of establishing links between structure and
this tendency. Past failure may raise the desire, but behaviour, of specifying behaviour inimical to the
inhibit the ability', to recover. Innovation in process competitive process and of establishing the exis-
or product may be expensive, and low profits, or tence of markets where the competitive process has
relatively small atrophied, and where it appears that efficiency
Competition, Austrian 1955
gains are possible. In the case of an open economy approach towards large-firm dominance than the
such as the United Kingdom the relevant markets, current legislation already adopts. This is not to
and standards of comparison for efficiency, are deny that competition policy is necessary, but to
international in character, and we must recognize assert that an appropriate competition policy must
that in many industries the degree of concentration be designed in the light of existing economic
of purely domestic producers will be of little use in conditions; of the evidence of the effects of various
assessing the fierceness of the transfer mechanism, forms of market structure and behaviour; and of the
because of the importance of foreign competition. overall objectives of economic policy.
However, more fundamental problems have also
to be recognized. It may be that a competitive
process of the kind outlined above may not be the See Also
only, or the most efficient, means of achieving the
► mti-ti ust Policy
same ends, either before or after the stage is reached ► Mark Structure
at which the possibilities of market dominance or
collusion arise. Thus, we must consider planned or
cooperative solutions to the problem of raising Bibliography
efficiency. Moreover, there is no guarantee that the
Clark, J.M. 1961. Competition as a dynamic process.
end result of the competitive process itself, in terms Washington, DC: Brookings Institution.
of the levels and rates of change of operating Downie, J. 1958. T h e c o m p e t i t i v e p r o c e s s . London:
efficiency of producers, and their distribution Duckworth.
Marx, K. 1867-94. C a p i t a l London: Lawrence & Wishart,
between activities, will, for any individual country,
1970-72.
necessarily meet the overall efficiency objective NEDO. 1978. C o m p e t i t i o n p o l i c y . London: HMSO/
outlined earlier, in terms of output, inflation, National Economic Development Office.
employment and international trade performance Schumpeter, J. A. 1942. C a p i t a l i s m , s o c i a l i s m a n d
d e m o c r a c y . London: George Allen & Unwin, 1968.
targets. There is then, for both the above reasons,
Singh, A. 1977. The UK industry and the world economy: A
the possibility of conflict, in industrial policy case of de-industrialisation? C a m b r i d g e J o u r n a l
methods, between the promotion of arm’s-length o f E c o n o m i c s 1(2): 113-36.
competitive behaviour through competition policy,
and the possible reduction in such behaviour
encouraged by other aspects of government policy
in pursuit, for instance, of industrial strategy. The
possibility of conflict is most obvious in relation to
Competition, Austrian
restrictive trade practices, but is also present in the
creation or encouragement of domestic dominant- Paul J. McNulty
firm positions through government-sponsored
mergers or rationalization schemes (NEDO ).
These conclusions run in some cases contrary to
the majority of recent academic argument in this
Keywords
area, where increasing emphasis has been paid to
the claimed welfare losses due to merger, increased Allocative efficiency; Austrian economics;
monopoly power, and restrictive trade practices. Competition, Austrian; Creative destruction;
However, in the context of current UK industrial Entrepreneurship; Imperfect competition;
performance and within the current overall Innovation; Market power; Mises, L.E. von;
economic policy framework a detailed scrutiny of Monopolistic competition; Monopoly; Perfect
the current evidence for the UK yields no competition; Schumpeter, J.A
foundation for a generally more aggressive
JEL Classifications
BO
1956 Competition, Austrian
The essence of Austrian economics is its emphasis describes an equilibrium situation but says nothing
on the ongoing economic process as opposed to the about the competitive process which led to that
equilibrium analysis of neoclassical theory, equilibrium. Indeed, it robs the firm of all business
Austrian concepts of competition reflect this activities which might reasonably be associated
emphasis. Indeed, one of the central challenges by with the verb ‘to compete’ (von Hayek ). Thus,
Austrians to the neoclassical model, and a common firms in the perfectly competitive model do not
denominator of virtually all Austrian economics, is raise or lower prices, differentiate their products,
the rejection of the concept of perfect competition. advertise, try to change their cost structures relative
In this respect, a number of economists who cannot to their competitors, or do any of the other things
be considered Austrian in all aspects of their work, done by business firms in a dynamic economic
share, nonetheless, the Austrian emphasis on actual system. This was precisely the reason why
market activities and processes - for example, Schumpeter insisted on the irrelevance of the
Joseph Schumpeter ( ), J.M. Clark ( ), concept of perfect competition to an understanding
Fritz Machlup ( ) of the capitalist process.
and others. For Schumpeter, any realistic analysis of com-
When the concept of competition entered eco- petition would require a shift in analytical focus
nomics at the hands of Adam Smith and his pre- from the question of how the economy allocates
decessors, it was not clearly defined, but it generally resources efficiently to that of how it creates and
meant entry by firms into profitable industries (or destroys them. The entrepreneur, a neglected figure
in classical and neoclassical economics, is the
exit from unprofitable ones) and the raising or
central figure in the Schumpeterian analytical
lowering of price by existing firms according to
framework. The entrepreneur plays a
market conditions. There was little recognition, and
disequilibrating role in the market process by
virtually no analysis, of entrepreneurship as it might
interrupting the ‘circular flow’ of economic life,
be reflected in these and other forms of competition,
that is, the ongoing production of existing goods
but there was a recognition that business firms do in
and services under existing technologies and
most situations have some control over market
methods of production and organization. He does
prices, with the degree of control varying inversely
this by innovating - that is, by introducing the new
with the number of firms in the industry. These
product, the new market, the new technology, the
basic ideas, expanded and supplemented, are
new source of raw materials and other factor inputs,
generally compatible with most modem Austrian
the new type of industrial organization, and so on.
analysis. The result is a concept of competition grounded in
What is objectionable to Austrian economists is cost and quality advantages which Schumpeter felt
the neoclassical concept of perfect competition, is much more important than the price competition
developed during the 19th and early 20th centuries. of traditional theory and is the basis of the ‘creative
The development began with Cournot ( ), destruction’ of the capitalist economic process. It
whose concern it was to specify as rigor produces an internal efficiency within the business
ously as possible the effects of competition, after the firm, the importance of which for economic welfare
process of competition had reached its limits. His is far greater, Schumpeter argued, than the
conceptualization of this situation was a market allocative efficiency of traditional economic theory
structure in which the output of any one firm could (Schumpeter ).
be subtracted from total industry output with no His emphasis on the advantages of the firm’s
discernible effect on price. Later contributions by internal efficiency led Schumpeter to a greater
Jevons, Edgeworth, J.B. Clark and Frank Knight led tolerance for large-scale business organizations,
to the model of perfect competition as we know it even for those enjoying some degree of monopoly
today (Stigler ; McNulty )• power, than was typical of many more traditional
The trouble with the concept from the Austrian theorists of his time. This is a not uncommon
point of view, as Hayek has emphasized, is that it
Competition, Austrian 1957
characteristic of Austrian economics. Hayek, for economic change. Kirzner’s entrepreneur plays an
example, makes the distinction between entrenched equilibrating role; the changes he brings about are
monopoly, with its probable higher- than-necessary responses to the mistaken decisions and missed
costs, and a monopoly based on superior efficiency opportunities he detects in the market. Unlike
which does relatively little harm since in all Schumpeter’s entrepreneur, he is not so much the
probability it will disappear, or be forced to adjust creator of his own opportunities as a responder to
to market conditions, as soon as another firm the hitherto unnoticed opportunities that already
becomes more efficient in providing the same or a exist in the market. Thus, in the competitive market
similar good or service (von Hayek ). And that is process, the Schumpeterian and Kirznerian
precisely Schumpeter’s point. The ground under entrepreneurs may complement each other - the one
even large-scale enterprise is constantly shaking as creating change, the other responding to it.
a result of the competitive threat from the new firm, Austrian dissatisfaction with the perfectly com-
the new management, or the new idea. petitive model extends to the theories of imperfect
Schumpeter’s competitive analysis was less a and monopolistic competition. Hayek’s and
defence of monopoly power than of certain business Kirzner’s criticisms are the same as of perfect com-
activities which were judged to be monopolistic petition, namely, that the analysis is limited to an
only from the comparative standpoint of the model equilibrium situation in which the underlying data
of perfect competition. He insisted that the quality are assumed to be adjusted to each other, whereas
of a firm’s entrepreneurship was of far greater the relevant problem is the process through which
significance than its mere size. adjustment occurs. Schumpeter criticized monop-
The leading contemporary Austrian theorist of olistic competition for its continued acceptance of
competition is Israel Kirzner ( ). Kirzner’s an unvarying economic structure and forms of
approach draws on the analysis of market processes industrial organization. Nonetheless, the incorpo-
and the concept of ‘human action’ developed earlier ration into economic theory of quality competition
by Ludwig von Mises. For von Mises, and sales efforts, complementing the traditional and
entrepreneurship is human action in the market limited focus on price competition, as well as the
which successfully directs the flow of resources efforts on the part of some industrial organization
toward the fulfillment of consumer wants (von specialists and institutional economists to analyse
Mises ). Kirzner’s more fully developed theory of and explain actual market processes, are
competition is based on the idea that the means - developments that are generally within the Austrian
end nexus of economic life is not given but is itself tradition.
subject to creative human action. This creative role
Kirzner defines as entrepreneurship, and it is
essentially the ability to detect new but desired See Also
human wants, as well as new resources, techniques,
► Austrian 5
or other ways through which to satisfy them.
► :o
Whether he discovers new wants or new means of
► E cati \ c D.
satisfying old ones, the Kirznerian entrepreneur is
the one who sees and exploits what others fail to
notice - the profit opportunities inherent in any Bibliography
situation in which the prices of factor inputs fall
C l a r k , J . M . 1 9 6 1 . Competition as a dynamic process.
short of the price of the final product. Washington, DC: Brookings Institution.
There is a difference between Kirzner’s theory C o u r n o t , A . A . 1 8 3 8 . Recherches sur les principes
of entrepreneurship and that of Schumpeter. mathematiques de la theorie des richesses. P a r i s :
Schumpeter’s entrepreneur is a disequilibrating Hachette.
Kirzner, I. 1973. Competition and entrepreneurship. Chicago:
force in the economic system; he initiates University of Chicago Press.
1958 Competition, Classical
Machlup, F. 1942. Competition, pliopoly, and profit. Pts. competition (notably the conditions which ensure
I-II. E c o n o m i c a , N.S. 9, Pt. I, 1-23, Pt. II, 153-73. price-taking) are often read back, illegitimately, into
McNulty, P. 1967. A note on the history of perfect compe-
tition. J o u r n a l o f P o l i t i c a l E c o n o m y 75: 395-399.
classical discussions of competition.
Schumpeter, J. 1942. C a p i t a l i s m , s o c i a l i s m , a n d The mechanisms which determine the economic
d e m o c r a c y . New York: Harper & Row. 1962. behaviour of industrial capitalism are not self-
Stigler, G. 1957. Perfect competition, historically contem- evident. As a form of economy in which production
plated. J o u r n a l o f P o l i t i c a l E c o n o m y 65: 1-17.
von Hayek, F.A. 1948. The meaning of competition. In
and distribution proceed by means of a generalized
I n d i v i d u a l i s m a n d e c o n o m i c o r d e r , ed. F.A. process of exchange (in particular by the sale and
Hayek. London: Routledge. purchase of labour), it possesses no obvious direct
von Mises, L. 1949. H u m a n a c t i o n . New Haven: Yale mechanisms of economic and social coordination.
University Press.
Yet, in so far as these operations constitute a
system, they must be endowed with some degree of
regularity, the causal foundations of which may be
revealed by analysis. The first steps in economic
Competition, Classical investigation which accompanied the beginnings of
industrial capitalism consisted of a variety of
John Eatwell attempts to identify such regularities, often by
means of detailed description and enumeration, as
in the works of Sir William Petty, and hence to
establish the dominant causes underlying the
behaviour of markets. But what was required was
Keywords
not simply the description and classification which
Cantillon, R.; Capital accumulation; Classical
precedes analysis, but abstraction, the
theory of value and distribution; Competition,
transcendence of political arithmetic (Smith , p.
classical; Intrinsic value; Market price; Marx, K.
501).
H.; Natural price; Neoclassical economics;
The culmination of the search for a coherent
Perfect competition; Perfect liberty; Petty, W.;
abstract characterization of markets, and hence the
Quesnay, F.; Rate of profit; Ricardo, D.; Smith,
foundation of modem economic analysis, is to be
A.; Turgot, A. R. J.
found in Chapter 7 of Book I of Adam Smith’s
Wealth of Nations - ‘Of the Natural and Market
Price of Commodities’. In this chapter Smith pre-
JEL Classifications
BO sented the first satisfactory formulation of the
regularity inherent in price formation. The idea,
Only through the principle of competition has partially developed earlier by Cantillon, and by
political economy any pretension to the character of Turgot in his discussion of the circulation of money,
a science. So far as rents, profits, wages, prices, are was that
There is in every society ... an ordinary or average
determined by competition, laws may be assigned
rate of both wages and profits ... When the price of
for them. Assume competition to be their exclusive any commodity is neither more nor less than what is
regulator, principles of broad generality and sufficient to pay the rent of land, the wages of labour,
scientific precision may be laid down, according to and the profits of stock employed ... according to
their natural rates, the commodity is then sold for
which they will be regulated. (Mill , p. 242)
what may be called its natural price.
In all versions of economic theory ‘competi-
tion’, variously defined, is a central organizing and that
concept. Yet the relationship between different The natural price ... is, as it were, the central price,
to which the prices of all commodities are continually
definitions of competition and differences in the gravitating. Different accidents may sometimes keep
theory of value have not been fully appreciated. In them suspended a good deal above it, and sometimes
particular, the characteristics of ‘perfect’ force them down somewhat below
Competition, Classical 1959
it. But whatever may be the obstacles which hinder It is obvious that the annual products which can be
them from settling in this center of repose and derived from capitals invested in these different
continuance, they are continually tending towards it. employments are mutually limited by one another,
(Smith 177< , p. 65) and that all are relative to the existing rate of interest
on money. (Turgot , p. 70)
Thus the natural price encapsulates the persistent
element in economic behaviour. And that persis- However, Turgot neither related the determination
tence derives from the ubiquitous force of compe- of the rate of profit to production in general - he
tition: or, as Smith put it, the condition of ‘perfect accepted the Physiocratic idea that the incomes of
liberty’ in which ‘the whole of the advantages and the industrial and commercial classes were ‘paid’
disadvantages of the different employments of by agriculture - nor developed the conceptual
labour and stock must ... be either perfectly equal or framework which linked the formation of prices and
continually tending to equality’ (p. Ill), for the of the rate of profit to the overall organization of the
natural price is ‘the price of free competition’ (p. economy. These were to be Smith’s achievements:
68). If ... the quantity brought to market should at any
The relationship between competition and the time fall short of the effectual demand, some of the
establishment of what Petty called ‘intrinsic value’ component parts of its price must rise above their
natural rate. If it is rent, the interest of all other
had been discussed in the works of Petty,
landlords will naturally prompt them to prepare
Boisguillebert, Cantillon and Harris as the outcome more land for the raising of this commodity; if it is
of rival bargaining in price formation, competition wages or profit, the interest of all other labourers
being the greater when the number of bargainers and dealers will soon prompt them to employ more
labour and stock in preparing and bringing it to
was such that none has a direct influence on price.
market. The quantity brought thither will soon be
Quesnay expressed the formation of competitive sufficient to supply the effectual demand. All the
prices as being ‘independent of mens’ will... far different parts of its price will soon sink to their
from being an arbitrary value or a value which is natural rate, and the whole price to its natural price.
(Smith , p. 65)
established by agreement between the contracting
parties’ (inMeek ,p. 90), but he did not relate the So in a competitive market there will be a tendency
organization of production to the formation of for the actual prices (or ‘market prices’ as Smith
prices in competitive markets. Consideration of that called them) to be relatively high when the quantity
relationship required the development of a general brought to market is less than the effectual demand
conception of the role of capital, and with it the (the quantity that would be bought at the natural
notion of a general rate of profit formed by the price) and relatively low when the quantity brought
competitive disposition of capital between alterna- to market exceeds the effectual demand. This
tive investments (Vaggi ). working of competition was known as the ‘Law of
A significant step in this direction was made by Supply and Demand’. The working of competition
Turgot, who both conceived of the process of which constitutes the ‘Law’ do not identify the
production as part of the circulation of money: phenomena which determine natural prices. The
We see ... how the cultivation of land, manufactures ‘Law’ of supply and demand should not be
of all kinds, and all branches of commerce depend confused with supply and demand ‘theory’, that is,
upon a mass of capitals, or movable accumulated the neoclassical theory of price determination which
wealth, which, having been first advanced by the was to be developed one hundred years later. Nor
entrepreneurs in each of these different classes of
work, must return to them every year with a regular should Smith’s discussion of the tendencies of
profit ... It is this continual advance and return of concrete market prices be confused with supply and
capitals which constitutes w h a t o u g h t t o b e demand function, which are loci of equilibrium
called the circulation of money. prices.
(Turgot , p. 148)
Adam Smith’s conception of ‘perfect liberty’
and saw that the structure of investments would consists of the mobility of labour and stock between
tend to be that which yielded a uniform rate of different uses - the mobility that is
profit:
1960 Competition, Classical
centralization becomes one of the greatest levers of Theory proceeds by the extraction from reality
its development. (Marx , p. 778n) of those forces which are believed to be dominant
Like Smith and Ricardo, Marx, relates the and persistent, and the formation of those elements
development of competition to the establishment of into a formal system, the solution of which is to
the general rate of profit: determine the magnitude or state of the variables
under consideration. It is obvious that the solution
What competition, first in a single sphere, achieves is
a single market value and market price derived from
will not, except by a fluke, correspond to the actual
the individual values of commodities. And it is magnitudes of the variables ruling at any one time,
competition of capitals in various spheres, which first for these will be the outcome not solely of the
brings out the price of production equalising the elements grouped under the heading ‘dominant and
rates of profit in the different spheres. The latter
persistent’, but also of the myriad of other forces
process requires a higher stage of capitalist produc-
tion than the previous one. (Marx , p. 180) excluded from the analysis as transitory, peculiar or
specific (lacking general significance) which may,
It is in his conception of the circuit of capital
at any moment, exert a more or less powerful effect.
that Marx best portrays capitalist competition. The
Nonetheless, the practice of analysis embodies the
image is one of capital as a homogeneous mass of
assumption that the forces comprising the theory
value (money) seeking its maximum return. Profits
are dominant, and that the determined magnitudes
are created by embodying capital in the process of
will, on average, tend to be established. In any
production, the commodity outputs of which must
satisfactory analytical scheme these magnitudes
be realized, that is, returned to the homogeneous
must be centres of gravitation, capturing the
money form to be reinvested. Competition is thus
essential character of the phenomena under
characteristic of the capitalist mode of
consideration.
accumulation; mobility and restructuring are two
The importance of Smith’s use of competition is
aspects of the same phenomenon.
now apparent. Theory cannot exist in a vacuum.
Marx’s general conception of capital as a system
Simply labelling forces dominant is not enough.
corroborates Quesnay’s notion of an economy oper-
These forces must operate through a process which
ating ‘independent of men’s will’. This does not
establishes their dominance and through which the
mean that there may not be circumstances in which
‘law-governed’ nature of the system is manifest.
individual capitals exercise some control in
That process is competition, which both enforces
particular markets - indeed, such limitations may be
and expresses the attempt of individual capitals to
necessary for the accumulation process to proceed
maximize profits. Thus important aspects of the
in certain lines. Capital removes only those barriers
behaviour of a capitalist market economy may be
which limit its accumulation. The market control
captured at a sufficient level of generality to permit
exercised in some lines of modem industry is not
the formulation of general causal statements, that is,
necessarily a limitation but may be a prerequisite of
to permit analysis. Without this step, which
production on an extended scale. Aggregate capital
constitutes the establishment of what was called
flows discipline the actions of individual capitals,
above the method of analysis, it would have been
and hence endow the system with the regularity
impossible to develop any general form of
manifest in the perpetual tendency, successfully
economic theory.
contradicted and recreated, towards a general rate of
The classical theory of value and distribution
profit and associated prices.
may be shown to provide a logically coherent
Competition not only establishes the object of
explanation of the determination of the general rate
analysis, natural prices and the general rate of
of profit and hence of natural prices (prices of
profit, but makes meaningful analysis possible,
production) taking as data (see Sraffa ):
since it allows the operations of the capitalist
economy to be characterized in a manner which (a) The size and composition of social output;
permits theoretical statements of general validity to (b) The technique in use; and
be made about them. (c) The real wage.
1962 Competition: Marxian Conceptions
organizing and equilibrating force in an exchange profits determined through market interactions.
society, bringing about natural prices as centres of Price differentiation even for homogeneous prod-
gravity for market prices through capital flows from ucts is assumed to exist under disequilibrium con-
areas with low rates of returns to areas with high ditions ( , ch. 10). Monopoly firms are
rates. Yet compared with the theory of perfect considered exceptional cases as ‘temporary
competition, classical free competition was defined monopolies’ ( , p. 178) when the
more in terms of economic behaviour than of demand exceeds supply for a considerable period or
market structure (Stigler ; McNulty ; Eatwell ). as ‘natural monopolies’ ( , p. 861)
Marx’s concept of competition, rooted in the when there is ownership of land or natural resources
classical theory of free competition, also refers to ( , pp. 178, 861).
the behavioural activities of the capitalist firm. In production activities, the reorganization of
Marx, however, more than the classics, cast serious the firm and technical change are seen as the main
doubts on the stability properties of the competitive weapons of competition (Marx , pp. 623). The goal
process, and he conceptualized competition as inter- of the firm is to capture a transient surplus profit
firm dynamics carried out through reorganization of and to transform it into long-run growth potentials,
the firm and technical change. In this it somewhat leading to disequilibria and imbalances through
resembles the modem theory of oligopolistic rivalry irreversible technical change and innovation, taking
(Friedman ) and Schumpeter’s notion of place not in time- continuous form but in discrete
competition as a process of ‘creative destruction’ ( steps. Moreover, competition through technical
). change results not in the existence of one optimal
technique but in the coexistence of multiple
techniques, and the weighted average technique is -
The Dynamics of Competition in Marx excluding some exceptional cases such as
decreasing returns to scale and rent - considered the
The marxian concept of competition though already
regulating technique determining the long-run
adumbrated in his early writings (
normal price ( , ch. 10).
/8) is systematically developed in his later
Contrary to those forces generating disequilibria
work ( /3, , 1893, 1894). It is derived
and imbalances through inter-firm dynamics
from his theory of the behaviour of the capitalist
competition is also conceived as a balancing force.
firm (Kuruma ). The driving force for economic
Capital as a homogeneous fund (money capital)
change and growth is the goal of the capitalist firm
seeks its maximum return by flowing between
to grow and to expand (‘the selfexpansion of
sectors (‘competition between industries’, , ch.
capital’). From the inter-firm dynamics results
10). Free mobility of
economic evolution, accumulation and growth, but
labour and capital, no artificial or natural barriers
also the downfall of old firms and the centralization
for its entry or exit and sufficiently widespread
of capital (Marx ch. 25) by which the competition
knowledge of fields of investment are considered
and rivalry become fiercer. Firms are not conceived
preconditions for the free flow of funds. In Marx (
as powerless economic agents adjusting passively
, ch. 10) as in Smith ( , ch. 7), and
toward parametrically given techniques, prices and
Ricardo ( , ch. 4) a dynamical process is con
quantities but as actively seeking the reorganization
ceived in which capital funds flow into industries
of production and market activities in the context of
with high rates of return away from industries with
rivals’ possible reactions. Firms also are not seen as
low rates of return. Thus the relative output
price takers but rather as pricesetting firms with
proportions in industries will change, creating
their market shares adjusting through the reaction of
imbalances of supply and demand. These, in turn,
the rivals or as quantitysetting firms ( , ch. 10) with
cause relative market prices and profit rates to
prices and
change, tending to establish for the economic
system long-run prices of production as centres of
1964 Competition: Marxian Conceptions
gravity for market prices. Yet the stability proper- theorists regarding (i) the causes of the monopo-
ties of such a dynamic process were not demon- lization; (ii) the determination of the mark-ups and
strated rigorously. The arguments were put forward the rates of return; (iii) the different role of inverse
intuitively by analogy with Newton’s theory of the demand function and quantity reactions in their
planetary system that profit rates fluctuate or theory of price setting firms; (iv) the impact of the
oscillate within a bounded interval and actual pricesrise of oligopolies and firms size on technical
gravitate around their long-run production prices. change; and (v) the impact of large oligopoly firms
Differentials of profit rates between industries and on the stability of the economic system (increasing
firms were expected to exist for a shorter or longer stability or instability with stagnation tendencies).
period due to disequilibrium dynamics and due to Yet in spite of these differences, post- marxian
speed and ease of adjustment varying from one theory is influenced by the theory of imperfect
industry to another ( , p. 208). Though Marx competition arising in the 1930s, and competition is
anticipated thus identified more with market structure than with
possible institutional and structural changes, due torival behaviour. Moreover, the theory of mark-up
‘concentration and centralization of capital’, he didpricing was built more on a partial equilibrium view
not, however, assume that inter-firm and inter- and thus not well- founded in an interdependent
industry competition would become less severe with economic system. Though in the writings of some
the evolution of capitalism. of the post- Marxian scholars the existence of large
oligopoly firms does not preclude rivalry and
competition (in particular concerning technical
Post-Marxian Theory change, see Baran and Sweezy , eh. 3), post-
marxian writers seemed to have considered the
In the post-marxian theory since Hilferding ( ) the
theory of imperfect competition a more adequate
elimination of competition and the
framework for their analysis of advanced
delay and disruption of the formation of a
capitalism.
general profit rate through monopolization
became the main theoretical concern. Three
causes are posited as reasons for monopolization:
Recent Discussions
industrial concentration, increasing constraints
for the mobility of capital (in particular due to In recent discussions there is a certain revival of the
high proportion of fixed capital in total capital concepts of competition of the classics and Marx, in
outlay), and collusion (cooperative behaviour particular concerning the role of competition for (1)
and cartels). In this view these three causes industrial and corporate price and profit
result in monopoly prices and the persistence of determination; (2) technical change and innovation;
differential profit rates between industries and and (3) the formation of a general rate of profit.
size classes of firms (Sweezy ). For those theories,
the large firms are conceived as economic units 1. In new contributions attempts have been made to
endowed with discretionary price setting power elaborate a theory of mark-up pricing for large
determining their own environment (Kalecki , corporations in the context of a dynamic theory
; Sweezy of competition and long-run prices of
; Baran and Sweezy ; Eichner ). Here the production. In this context the economic
ideas of mark-up pricing, target rate of return behaviour of large corporations is explained
pricing and entry-preventing pricing have more in terms of change of the production
replaced the classical and marxian theory of pro- processes and the organization of the firm and
duction prices (natural prices, prices of less in terms of a change of market structures
production). (Clifton , ; Semmler ) as was
Given this general trend in post-marxian theory
there are, however, many differences among
Competition: Marxian Conceptions 1965
Roemer, J.E. 1979. Continuing controversy on the falling rate virtues - imperfectly or monopolistically com-
of profit: Fixed capital and other issues. C a m b r i d g e petitive practices.
J o u r n a l o f E c o n o m i c s 3(4): 379-398.
Schumpeter, J. 1943. C a p i t a l i s m , s o c i a l i s m a n d Questions of intervention on grounds of
d e m o c r a c y . London: George Allen & Unwin. allocative inefficiency have continued to hinge
Semmler, W. 1984a. C o m p e t i t i o n , m o n o p o l y a n d on the existence of classical monopoly profits.
d i f f e r e n t i a l p r o f i t r a t e s . New York: Columbia
By the 1970s, the weight of empirical evidence
University Press.
Semmler, W. 1984b. Marx and Schumpeter on competition, and the acknowledged fact of intensified global
transient surplus profit and technical change. competition served to eliminate the credibility of
E c o n o m i e A p p l i q u e e 37(3-4): 419-455. the market concentration doctrine derived from
Shaikh, A. 1978. Political economy and capitalism: Notes on
Dobb’s theory or crisis. C a m b r i d g e J o u r n a l o f
perfect and imperfect competition (see Demsetz ,
E c o n o m i c s 2(2): 233-251. ).
Smith, A. 1776. An inquiry into the nature and causes of the At a more fundamental level, Joan Robinson
wealth of nations, ed. E. Cannan. London: Methuen, 1961. was persuaded to abandon imperfect competition
Steedman, I. 1984. Natural prices, differential profit rates,
and the classical competitive process. The
in favour of trying to more fully develop a
Manchester School of Economic and classical line of analysis only partially worked
S o c i a l S t u d i e s 52(2): 123-140. out by the classical economists and largely
Stigler, G.J. 1957. Perfect competition, historically con-
ignored ever since (see Clifton )•
templated. J o u r n a l o f P o l i t i c a l E c o n o m y
65(1): 1-17. Yet, after a decade of deregulation and the
Sweezy, P.M. 1939. Demand under conditions of oligopoly. strongest sentimentality to let the free market
J o u r n a l o f P o l i t i c a l E c o n o m y 47: 568-573. reign, evidence of static and dynamic ineffi-
Sweezy, P. M. 1942. T h e t h e o r y o f c a p i t a l i s t ciencies in industry is accumulating (Business
d e v e l o p m e n t . New York: Monthly Review Press,
1968. Week ). Has the sheer intensity of competition in
the rate of economic change and the pace of
economic life become so severe as to hinder
economic efficiency even under the strongest
Competitive Market Processes possible tendencies to equalized returns in the
market? Competition, however complex and full
J. A. Clifton of discontinuities, is still evident as a systematic
and general force in the empirically observed
fact that accounting rates of return across firms
and industries tend toward uniformity over time.
1. Do fully competitive price signals from intense This dynamic tendency is stronger among
rivalry in the market justify the moral sentiment larger than smaller firms and is stronger today
of laissez-faire? On grounds of distributive than a century ago (see Singh and Whittington
justice among risk-takers, the answer has chapter 6; Brazen , , ,
generally been ‘yes’ throughout the history of pp. 239-40). But it is not explained by the
economic analysis. neoclassical theory of perfect competition, which
In considerations of optimum economic requires atomism of independent agents imder
efficiency, however, the answer seems to have static premises of maximization. It is not
become more difficult over the course of devel- explained by imperfect or monopolistic com-
opment. The cheapening of commodities petition for stable positions of some degree of
witnessed by the classical economists is the most monopoly power are less and less in evidence all
virtuous example of efficient competitive market the time. Yet this dynamic tendency is not
processes in which the distribution of returns associated with any optimum or unique state of
tends to be equalized. Against the theoretical industrial efficiency, as under perfect competi-
standard of perfect competition, non-price forms tion. Finally, the intensity of competitive
of competition came to be viewed in the 1930s
as second-class
1968 Competitive Market Processes
rivalry that leads to this tendency cannot be affairs, moving capital from areas of lower to
measured by neoclassical standards - the number areas of higher returns.
of firms in a market. It exists primarily under When finance is committed to industry as
market conditions of concentrated oligopoly. fixed capital, it is at once immobilized for its
It seems pointless to try to reconstitute the economic life. It does not have the character of
general theory of competitive value by still more putty which enables it to be moulded for any use
a priori game theorizing which only adds to the promising today’s highest return in the market.
false perception of indeterminacy and lack of The greatest barriers to capital mobility existed
systematic generality in ‘price behaviour’ under for the single factory enterprise which typified
contemporary market conditions. A recent organizational structures in the United States in
alternative has been to apply game theory to the 1840s. Railroad firms created the first degree
perfect competition (see Mas-Colell ). What used of capital mobility directly within the enterprise
to be a static state of affairs distinguished by the by pioneering the coordinated, multi-unit
absence of any and all rivalry is now a non- organization. From the 1890s on, a degree of
cooperative equilibrium, independent of the capital mobility across industries was added by
number of agents, that may entail dynamic integrated manufacturing companies and by
strategies of M periods contingent on past mass retailers. Truly diversified industrial
histories. corporations began appearing in the 1920s and
This re-introduces the long forgotten classi- by the 1930s, mass retailers were national in
cal principle that interdependent, dynamic scope (see Chandler , for a definitive history).
rivalries are what lead to the tendency toward Beyond these structural elements in the
uniformity in returns across the price system. A development of capital mobility in the firm, the
possible virtue of the approach is that not all number of competitive strategies available to it
games need have positive sum outcomes, so the from economies of large-scale organization and
question of competitive rivalry and economic the intensity of the search for competitive
efficiency is left open, not closed as in the pure advantage available from large budgets and
neoclassical doctrine of perfect competition. staffs have also increased. Product innovations
With all the intellectual baggage imposed by from a permanent R&D staff, advertising
perfect and imperfect competition, however, is it campaigns, takeovers and divestitures, together
not preferable to start fresh by examining and with price and credit competition give the firm
explaining in classical price-theoretic terms that added flexibility in responding to changes in
systematic empirical tendency toward uniformity market conditions and in initiating them.
in returns? The first point is that the institutional Free capital mobility is not synonymous with
conditions for free capital mobility in the the ability of atomistic firms or individual agents
industrial context of fixed capital have developed to move freely throughout the economy,
gradually and progressively over the course of whatever utopian analogies with a system of
economic development during the past two perfect liberty and individual freedom that may
hundred years. They are to be found in the first conjure up. What matters is the freedom of
instance not in the atomistic enterprise but in the capital, however organized, so to move. As
evolution of the organizational structure and theoretical constructs, perfect and imperfect
competitive strategies of today’s representative competition left a vision of capitalist develop-
firm, the industrially and geographically diver- ment that is at complete odds with the actual
sified, publicly held corporation. Top manage- historical development of conditions of free
ment in industry has increasingly assumed the capital mobility. In this view, which is also
role once reserved for bankers in day-to-day espoused by many non-neoclassical economists,
barriers to free capital mobility have
Competitive Market Processes 1969
grown with the evolution of large corporations, The very interdependence in decisionmaking
and the system has become less competitive, not between oligopolistic firms is what causes that
more competitive. ebb and flow of business and profits across
Even beyond considerations of corporate firms, industries and markets so as to render the
organization and strategy, free capital mobility ex post rate of return fully competitive and
is nowhere more folly developed in history than beyond the control of the individual firm.
in the institutions of today’s capital markets. Unfortunately, game theory was used for
Ever more integrated on a world scale, ever decades to deny the generality of contemporary
more innovative in the range of ‘products’ and competitive behaviour rather than to explain its
services offered, the large firms which dominate most systematic feature in the convergence of
these markets have such powerful and all- accounting rates of return over the long run.
encompassing information networks as to A virtue of the neoclassical theory of perfect
approximate the economic assumption of perfect competition was to provide a readily quantifiable
information in the short run, if not rational means of measuring the intensity of competition
expectations in the long run. - by the number of firms in a market. In
The acceptance of market processes in ever consideration of non-price forms of competition,
more spheres of human existence beyond basic this precision in economic theory became lost,
needs is a third sense in which free capital appearing in lieu of theory as an industrial
mobility is more highly developed today. Scale organization ‘paradigm’ of market structure and
economies in automobile production are not conduct and performance. Can quantitative
barriers to entry into new fields of endeavour precision be resurrected in a general theory of
like the child care industry. Finally, with the competitive value for the modem age?
growth of labour-intensive services as a Observation tells us that the intensity of
proportion of the economy, more businesses rivalry in contemporary markets can be mea-
take on the characteristics of merchant capital sured by the frequency and voracity of changes
once again in history since even learned human in market conditions - the sum total of strategic
capital is more malleable than fixed stock. moves and countermoves made by firms in that
2. Beyond considerations of free capital mobility market per unit of time. The common denom-
in explaining the uniformity in returns are other inator among all types of competition is to what
key issues that fall outside the scope of perfect measured degree does the action move business
and imperfect competition, whether or not and profits from one sphere to another or one
amalgamated with game theory. firm to another.
If today’s oligopolistic firms are the slaves of There is a clear analogy to perfect competi-
the market as never before in history, in what tion that can be made here. Were oligopolists in
sense are they ‘pure price takers’? Such corpo- a market limited to the type of action an atom-
rations are entirely unable to dictate their ex post istic firm entering that market could take, pure
rate of return in the market, whatever their ex price taking behaviour would emerge as the
ante pricing behaviour. It is with the ex post rate frequency of such strategic moves and coun-
of return that the theory of competitive price is termoves increased without limit. Price for a
concerned, and that will be determined by many homogeneous good would be bid down to its
forms and intensities of competitive behaviour normal competitive minimum not by unlimited
in the market, of which a suggested mark-up ex entry by one small firm after another, but by an
ante is only one. Partial equilibrium mark-up unlimited number of atomistic-like strategic
theories have never comprehended the moves by the competing oligopolists.
difference between ex ante and ex post and err Game theory to date appears to have over-
in believing ex ante pricing discretion implies looked the primacy of numbers of actions in
some degree of ex-post monopoly power.
1970 Competitive Market Processes
Competitive Market
Processes,
Fig. 1 Efficiency and
Inefficiency in Competitive neoclassical range of
Resource Allocation *For - range of --------^ - sub-optimal
neoclassical theory, this competitive —
axis measures the number behaviour competitive
behaviour with
of competitors, where the
limit, represented by the a fully equalized
returns
On
vertical dashed line, is the o
familiar large numbers case o
of atomistic or perfect d
competition. For classical
theory, the axis measures
the frequency and
magnitude of changes in
market conditions. There is
no limit to the intensity of
such competitive behaviour \4>
________________i______________________________V- -
p '
Intensity of Competition*
former implies efficiency where no such impli- of return. The curve y n expresses the propo-
cation is warranted a priori, while the latter sition that the more competition, the lower the
implies an absence of highly energetic behaviour degree of monopoly, and the stronger the ten-
from constantly striving, an implication at dency toward uniformity in returns around a
complete odds with the facts. A more neutral normal rate of profit p. All inefficiency is due to
term like ‘competitive behaviour’ seems the absence of competition in sufficient degree,
preferable. and may be measured as social welfare losses
4. If frilly competitive price signals can exist under like the area y a n.
different degrees of industrial efficiency, then The principle justification for laissez-faire
the moral sentiment of laissez-faire is not so through history has been that ‘competition
readily justified in a competitive free enterprise without limit’ must always enhance the general
system. Welfare economics must focus on welfare by improving static or dynamic effi-
competition as both virtue and vice, rather than ciencies, as expressed in the positive slope of
competition as virtue and monopoly as vice, as cni. Competition in effect can never become so
in the past fifty years. Consider Fig. , which intense, or of a character or complexity, that it
relates the intensity of competition to the degree pushes a market or an economic system beyond
of economic efficiency. In modem economic point n in the long run. In neoclassical theory,
doctrine there are three unambiguous situations: this is expressed as an increase in the number of
pure monopoly firms without limit tending to produce a state of
(point y), perfect competition (point 7i) and the perfect competition.
long mn shutdown point beyond which a firm Yet once we admit that ruinous competition
cannot cover its total costs (point 0). has existed in history, is there no range of sub-
In the context of a single industry, ruinous optimal competitive behaviour between n and
competition is rightly viewed as self-correcting <p? Competition that is sufficiently intense to
by market forces. Therefore, the entire scope of bid away all excess profits, but too intense to
economic investigation is believed to have been maximize efficiency and the general welfare?
between 0 and (5. The curve CT TI expresses the Fully competitive market processes that lead to
sentiment that the more competition the better sub-optimal outcomes - zero sum or even neg-
for efficiency as measured by the rate ative sum games?
1972 Competitive Market Processes
If and only if such business practices are energetic, can hardly be called ‘maximizing
isolated in one or a few markets will they be self- behaviour’.
correcting by the market. If they are, or have The growing inability to protect positions of
become, systemic throughout the economy, there differential rent or supra-normal profits for a
is no reason to believe they will be self- period necessary to sustain some of the most
regulating in the market in a way which leads to productive forms of risk-taking entrepreneurial
movement from a position like 5 to the unique behaviour is caused by the very intensity of
point of optimum efficiency associated with competition in contemporary market processes.
equality of returns, n. The crowding out of these Schumpeterian forms
I submit that today’s general competitive of dynamically efficient market processes is a
equilibrium in resource allocation lies at a point third social welfare loss that exists in today’s
like 5 and that the free enterprise system in an laissez-faire atmosphere.
atmosphere of laissez-faire is experiencing social The capitalization of finance on pure finance
welfare losses of the form n a S, not of the form rather than real asset creation has become
n /> y from monopolistic distortions. almost an epidemic of market processes that are
There is no distortion in price signals asso- of dubious value to the general welfare and that,
ciated with contemporary social welfare losses. moreover, increase the cost of capital for
They exist in a climate of intensely competitive productive uses. For example, the increase in
market processes where the tendency toward takeover divestiture type activities is associated
equality of returns is stronger, not weaker. The with the creation of a distinct market for
real issue is becoming whether all this incessant corporate control which simply changes the
change still represents a Schumpeterian process distribution of ownership and or control of
of creative destruction or an inefficient process existing productive assets.
of ‘destructive creations'. 5. All seem to be agreed that competition has
Free capital mobility has become so highly become more intense in recent decades and
developed in financial markets and top man- especially in recent years. I continue to main-
agement behaviour in corporations that it has led tain, as well, that there has been a secular
to the virtual collapse of the long period in increase in the intensity and complexity of
setting aspiration levels for the rate of return on competition over the course of capitalist devel-
real capital formation in industry. This increase opment and that the free enterprise system
in the intensity of competition is generating an continues to develop fundamentally along the
ongoing bias against efficiency-enhancing forms lines of ever greater capital mobility.
of strategic corporate behaviour in favour of But it is also my contention that over the
stop-gap or crisis management forms of course of capitalist development and especially
competition such as ‘asset juggling’, which does evident in recent years in America, the intensity
not affect the quality of products or the of competition has become so great as to hinder
efficiency with which they are produced, industrial efficiency. Change for the sake of
distributed and sold. change rather than for economic and social
The rate of change in and complexity' of progress. Competition, that engine of prosperity
market conditions to which the film must that has propelled us forward for two centuries,
respond strategically has accelerated, not only in now seems to be of a character that it is holding
product and input markets, but also in economic us back.
policy variables here and abroad. The intensity'
of these competitive pressures is leading to the This suggests a very different role for economic
creation of corporate cultures that are very risk doctrine and public policy than either laissez-faire
averse, and to decision-making of strictly or the regulation of monopolistically competitive
bounded rationality' that, however practices. It implies that intervention in the market
which reduces the intensity or scope of certain
Computation of General Equilibria 1973
economy, and a description of the current state of changes in economic policy or in the environment
productive knowledge possessed by each of the in which the economy finds itself. The effects of a
firms engaged in manufacturing or in the provision major shock to the economy ofthe United States -
of services. The model then yields a complete such as the four-fold increase in the price of
determination of the course of prices and interest imported oil which occurred in late 1973 - can be
rates over time, levels of output and the choice of studied by contrasting equilibrium prices, real
techniques by each firm, and the distribution of wages and the choice of productive techniques both
income and patterns of saving for each consumer. before and after the event in question. Generations
The Walrasian model is essentially a generali- of economists have used the Walrasian model to
zation, to the entire economy and to all markets analyse the terms of trade, the impact of customs
simultaneously, of the ancient and elementary unions, changes in tariffs and a variety of other
notion that prices move to levels which equilibrate issues in the theory of International Trade. And
supply and demand. No intellectual construction of much of the literature in the field of Public Finance
this scope, designed to address basic questions in a is based on the assumption that the competitive
subject as complex and elusive as economics, can model is an adequate description of economic
be described as simply true of false - in the sense in reality.
which these terms are used in mathematics or In these discussions the analysis is frequently
perhaps in the physical sciences. The assertions of conducted in terms of simple geometrical diagrams
economic theory are not susceptible to crisp and whose use places a severe restriction on the number
immediate experimental verification. Moreover, the of consumers, commodities and productive sectors
Walrasian model disregards obvious aspects of that can be considered. This is in contrast to formal
human motivation which are of the greatest mathematical treatments of the Walrasian model,
economic significance and which cannot be which peimit an extraordinary generality' in the
addressed in the language of our subject: economic elaboration of the model at the expense of
theory is mute about our affective lives, about our immediate geometrical visualization. Unfortunately,
opposing needs for community and individual however, it is only under the most severe
assertion, and about the non-pecuniary determinants assumptions that mathematical analysis will be
of entrepreneurial energy. capable of providing unambiguous answers
There are, in addition, aspects of economic concerning the direction and magnitude of the
reality which are capable of being described in the changes in significant economic variables, when the
framework of the Walrasian model but which must system is perturbed in a substantial fashion. In order
be assumed away in order for the model to yield a for a comparative analysis to be carried out in a
determinate outcome. Uncertainty' about the future multi-sector framework it is necessary to employ
is an ever-present fact of economic life, and yet the computational techniques for the explicit numerical
complete set of markets for contingent commodities solution of the highly nonlinear system of equations
required by the Arrow-Debreu treatment of and inequalities which represent the general
uncertainty is not available in practice. Economies Walrasian model.
of scale in production are a central feature in the
rise of the large manufacturing entities which
dominate modem economic activity; their The Use of Fixed-Point Theorems in
incorporation into the Walrasian model requires the Equilibrium Analysis
introduction of non-convex production possibility
sets for which the competitive equilibrium will One ofthe triumphs of mathematical reasoning in
typically fail to exist. economic theory has been the demonstration of the
In spite of its many shortcomings, the Walrasian existence of a solution for the general equilibrium
model - if used with tact and circumspection - is an model of an economy, under relatively mild
important conceptual framework for evaluating the assumptions on the preferences of consumers and
consequences of the nature of production possibility sets (see
Computation of General Equilibria 1975
fp) = Ax,
2. pA < 0.
Computation of General
Equilibria, Fig. 3
Computation of General Equilibria 1977
Let prices be normalized so as to lie on the unit by Enrico Barone in 1908, entitled ‘The Ministry of
simplex S = {p = (pltp2,-- 5>i=l}- Production in a Socialist Economy’. Barone, and
The set of prices p for which pA<() is termed the subsequently Oskar Lange ( ), accepted the
dual cone of the production possibility set generated Walrasian model - with suitable transfers of income
by the activity matrix A. Its intersection with the - as an adequate description of ideal economic
unit simplex is a convex polyhedron C consisting of activity in an economy in which the means of
those normalized prices which yield a profit less production were collectively owned. In the absence
than or equal to zero for all activities. of markets, prices, levels of output and the choice of
We construct a continuous mapping of S into productive techniques were to be obtained by an
itself as follows: for each p in .S' consider the point explicit numerical solution of file Walrasian system.
P +ftp)\ a point which is generally not on the unit A key feature of Barone’s analysis was the concept
simplex itself. We then define g(p) - the image of p of the ‘technical coefficients of production’ - the
under the mapping - to be that point in C which is input/output coefficients associated with those
closest, in the sense of Euclidean distance, to P activities in use at equilibrium. Barone’s contention
+
.f{p)' is ^en an elementary application of the Kuhn- was that the equilibrium could be found - by an
Tucker Theorem to show that a fixed point of this extremely laborious calculation which might indeed
mapping is, indeed, an equilibrium price vector. claim a significant share of the national product -
only if die correct activities were known in advance.
For Barone, rational economic calculation in a
socialist economy was defeated by the many oppor-
tunities for substitution in production: the particular
The Equilibrium Model as a Tool for activities in use at equilibrium would be impossible
Policy Evaluation to determine by a prior computation. It is instructive
to quote Barone on this point.
Brouwer’s original proof of his theorem was not The determination of the coefficients economically
most advantageous can only be done in an e x p e r i -
only difficult mathematically, but it was decidedly m e n t a l way: and not on a s m a l l s c a l e , as
non-constructive; it offered no method for effec- could be done in a laboratory; but with experiments
tively computing a fixed point of the mapping. on a v e r y l a r g e s c a l e , because often the
Brouwer did, in fact, reject his own argument during advantage of the variation has its origin precisely in
a new and greater dimension of the undertaking.
the later ‘intuitionist’ phase of his career, in which Experiments may be successfid in the sense that they
he proclaimed the acceptability of only those may lead to a lower cost combination of factors; or
mathematical conclusions obtained by constructive they may be unsuccessfid, in which case the
procedures. In spite of the many simplifications in particular organization may not be copied and
repeated and others will be preferred, which
the proof of Brouwer’s Theorem offered during the e x p e r i m e n t a l l y have given a better result.
subsequent half-century, it was not until the mid- The Ministry of Production could not do without
1960s that constructive methods for approximating these experiments for the determination of the
fixed points of a continuous mapping finally made e c o n o m i c a l l y most advantageous technical
coefficients if it would realize the condition of the
their appearance on the scene (Scarf ) - aided by the minimum cost of production which is e s s e n t i a l
development of the modem electronic computer and for the attainment of the maximum collective
by the rapid methodological advances in the welfare.
discipline of operations research. It is on this account that the equations of the
equilibrium with the maximum collective welfare are
In the early decades of this century, the question
not soluble a p r i o r i , on paper.
of the explicit numerical solution of the general
equilibrium model was an active topic of discussion
- not by numerical analysts - but rather by
economists concerned with the techniques of
An Elementary Algorithm
economic planning in a socialist economy. The issue Barone’s negative conclusion is certainly valid if
was raised in the remarkable paper published
the full production possibility set, including all of
1978 Computation of General Equilibria
the possibilities for substitution in production, is not of parameters required to describe demand func-
known to the central planner. In this event, tions, initial endowments and the production pos-
numerical calculation is impossible, and Lange’s sibility set is considerable, and in practice the
suggestion, made some 20 years later, may be constraint of reproducing the current equilibrium
appropriate: the problem can be turned on its head must be augumented by a variety of additional
and the market, itself, can be used as a mechanism statistical estimates in order to specify the model.
of discovery as well as a giant analogue computer. The limitations of data in the form required by the
But if the production possibility set can be Walrasian model inevitably make this estimation
explicitly constructed, substitution - in and of itself procedure less than fully satisfactory.
- does not seem to me to be a severe impediment to The second step in the exercise is to calculate
numerical computation. the solution after the proposed policy changes are
At the present moment, some 20 years after the explicitly introduced into the model. In some cases
introduction and continued refinement of fixed- the policy variables being studied can be directly
point computational techniques, I have in my incorporated as parameters in the equations whose
possession a small floppy disk with a computer solution yields the equilibrium values; if the
program which will routinely solve - on a personal changes are small, their effects on the solution may
computer - for equilibrium prices and activity levels be obtained by differentiating these equations and
in a Walrasian model in which the number of solving the resulting linear system for the
variables is on the order of 100. (The authors of the corresponding changes in the equilibrium values
program suggest that examples with 300 variables themselves. This approach was adopted by Leif
can be accommodated on a mainframe computer. ) Johansen ( ) and by Arnold
Substantial possibilities of substitution, if known in Harberger ( ) in his study of the incidence of a
advance, offer no difficulty to the successful tax on corporate profits. The use of this method in
functioning of this algorithm. In my opinion, the policy analysis continues in Norway, and it forms
modem restatement of Barone’s problem is rather the basis of the amibitious programme carried out
that even 300 variables are extremely small in by Peter Dixon and his collaborators in Australia (
number in contrast to the millions of prices and ). If, on the other hand, the policy changes
activity levels implicit in his account. The are large, the equilibrium position may be shifted
computer, while expanding our capabilities substantially, and its determination may require the
immeasurably, has taught us a severe lesson about use of more sophisticated computational methods.
the role of mathematical reasoning in economic Fixed-point algorithms can be divided into two
practice and forced us to shift our point of view major classes: those based on the elements of
dramatically from that held by our predecessors. differential topology, surveyed by Smale ( ),
We realize that our preoccupations are not with and those which are combinatorial in nature. The
universal laws which describe economic most elementary of the combinatorial algorithms for
phenomena with full and complete generality, but approximating a fixed point of a continuous
rather with intellectual formulations which are an mapping of the unitsimplex S = {(x = (xj,x2,.. xn)\xi
imperfect representation of a complex and elusive > = 1} begins by dividing the sim
reality. The application of general equilibrium plex into a large number of small subsimplices as
theory to economic planning, and more generally to illustrated in Fig. . In our notation the simplex is of
the evaluation of the consequences of changes in dimension n 1 and has faces of dimension n 2, .. .,1.
economic policy, must be based on highly It is a requirement of the subdivision that the
aggregated models whose conclusions are at best intersection of any two of the subsimplices is either
tentative guides to action. empty or a full lower dimensional face of both of
An exercise in comparative statics is begun by them.
constmcting a general equilibrium model whose Each vertex of the subdivision will have asso-
solution reflects the economic situation existing ciated with it an integer label selected from the set
prior to the proposed policy change. The number
Computation of General Equilibria 1979
Computation of General 3
Equilibria, Fig. 4
(1, 2,..n). When the method is applied to the each coordinate is not increasing very much under
determination of a fixed point of a particular map- the mapping. Since the sum of the coordinate
ping, the labels associated with a vertex will depend changes is by definition zero, the image of any point
on the mapping evaluated at that point. For the in the completely labelled subsimplex will be close
moment, however, the association will be arbitrary to itself, and such a point will therefore serve as an
aside from the requirement that a vertex on the approximate fixed point of the mapping. A formal
boundary of the simplex will have a label i only if proof of Brouwer’s Theorem requires us to
the /th coordinate of that vertex is positive. construct a sequence of finer and finer subdivisions,
The remarkable combinatorial lemma demon- to find, for each subdivision, a completely labelled
strated by Emanuel Spemer ( ) in his doctoral simplex, and to select a convergent sequence of
thesis is that at least one subsimplex must have all these simplices tending to a fixed point of the
of its vertices differently labelled. Assuming this mapping.
result to be correct, let us consider a mapping of the Spemer’s Lemma may be applied to the equi-
simplex in which the image of the vector ,r = Oi,.. librium problem directly. For simplicity, consider
.,xn) is fix) = [/}(*),..., /,(*)]. The requirement that the model of exchange in which the market excess
the image be on the simplex implies that f(x) > 0 and demand functions are given by g(p), with p on the
that Yfi{x) = 1. It follows that for every vertex of the unit price simplex. As before, we subdivide the
subdivision v, unless v is a fixed point of the simplex and associate an integer label from the set
mapping, there will be a least one index i for which (1,..., n) with each vertex v of the subdivision,
fi(y) vi <0. If we select such an index to be the label according to the following rule: the label i is to be
associated with the vertex v, then the assumptions of selected from the set of those indices of which gi(p)
Spemer’s Lemma are clearly satisfied, and the < 0. It is an elementary consequence of Walras’s
conclusion asserts the existence of a simplex whose Law that a selection can be made which is
vertices are distinctly labelled. consistent with the assumptions of Spemer’s
If the simplicial subdivision is very fine, the Lemma, and there will therefore be a subsimplex all
vertices of this sub-simplex are all close together; at of whose vertices bear distinct labels. By virtue of
each vertex a different coordinate is decreasing the particular labelling rule, any point in such a
under the mapping, and by continuity every point in completely labelled simplex will be an approximate
the small subsimplex wi 11 have the property that equilibrium price vector in the sense that all excess
demands, at this price, will be either negative or, if
positive, very small.
1980 Computation of General Equilibria
Spemer’s original proof of his combinatorial has the label 1, this simplex is completely labelled
lemma was not constructive; it was based on an and our search terminates; otherwise we move to an
inductive argument which required a complete adjacent simplex by removing the vertex whose
enumeration of all completely labelled simpli- ces label agrees with that of v and replacing it with that
for a series of lower dimensional problems. In order unique other vertex yielding a simplex in the
to develop an effective numerical algorithm for the subdivision. As the process continues, we are, at
determination of such a simplex let us begin by each step, at a simplex whose vertices bear the
embedding the unit simplex, and its subsimplices, in labels 2 , . . w i t h a single one of these labels
a larger simplex T, as in Fig. . The larger simplex is appearing on a pair of vertices. Precisely two n 2
subdivided by joining its n new vertices to those dimensional faces have a complete set of labels
vertices of the original subdivision lying on the 2,..,,n. The simplex has been entered through one of
boundary of the unit simplex. The assumptions of these faces; the algorithm proceeds by exiting
Spemer’s Lemma permit the new vertices to be through the other such face.
given distinct labels from the set (1,..., n), in such a The argument first introduced by Lemke ( ) in
way that no additional completely labelled simplices his study of two person non- zero sum
are generated. For concreteness, let the new vertex games was carried over by Scarf ( ) to show
receiving the label i be denoted by v'. that the above algorithm never returns to a simplex
We begin our search for a completely labelled previously visited and never requires a move
simplex by considering the simplex with vertices outside of T. Since the number of simplices is
v2,..., v" and one additional vertex, say v*. If v* finite, the algorithm must terminate, and termina-
tion can only occur when a completely labelled
simplex is reached.
Computation of General Equilibria 1981
Improvements in the Algorithm the boundary. That such a mapping cannot exist is
known as the Non-Retraction Theorem, an assertion
The algorithm can easily be programmed for a which is, in fact, equivalent to Brouwer’s Theorem.
computer, and it provides the most elementary The impossibility of such a mapping reinforces our
numerical procedure for approximating fixed points conclusion that a completely labelled simplex does
of a continuous mapping and equilibrium prices for exist.
the Walrasian model. Since its introduction in 1967, Select a point c interior to one of the boundary
the algorithm, in this particular form, has been faces of T and consider the set of points which map
applied to a great number of examples of moderate into c; that is, the set of x for which h(x) c. As Fig.
size, and it performs sufficiently well in practice to indicates, this set contains a piecewise linear path
conclude that the numerical determination of beginning at the point c, and trans- versing precisely
equilibrium prices is a feasible undertaking. The those simplices encountered in our elementary
algorithm does, however, have some obvious algorithm. There are however, other parts of the set
drawbacks which must be overcome to make it {x|/i(x)=c}: closed loops which do not touch the
available for problems of significant size. For boundary of T and other piecewise linear paths
example, the information which yields the labelling connecting a pair of completely labelled simplices.
of the vertices, and therefore the path taken by the Stated somewhat informally, the general conclusion,
algorithm, is simply the index of a coordinate which of which this is an example, is that the inverse
happens to be decreasing when the mapping is image of a particular point, under a piecewise linear
evaluated at the vertex. More recent algorithms mapping from an n dimensional set to an n— 1
make use of the full set of coordinates of the image dimensional set, consists of a finite union of interior
of the vertex instead of a single summary statistic. loops, and paths which join two boundary points
Second, this primitive algorithm is always ini- (see Milnor , for the differentiable version).
tiated at the boundary of the simplex. If the To see how this observation can be used, con-
approximation is not sufficiently good, the grid size sider the product of the unit simplex S and the
must be refined, and a recalculation, which makes closed unit interval [0, 1]; that is, the set of points
no use of previous information, must be performed. (x, 1) with x in S and 0 < / < 1, as in Fig. .Extend
It is of the greatest importance to be able to initiate the mapping from the unit simplex to this large set
the algorithm at an arbitrary interior point of the by defining F(x,f)=(l—t) f(x)+tx* , with x* a
simplex selected as our best a priori estimate of the preselected point on the simplex, taken to be an
answer. estimate ofthe true fixed point. The set ofpoints for
The following geometrical setting (Eaves and which F(x,t) x 0 is, by our general conclusion, a
Scarf ) for the elementary algorithm suggests the finite union of paths and loops. Precisely one of
form these improvements can take. Let us construct these paths intersects the upper boundary of the
a piecewise linear mapping, h(x), of T into itself as enlarged set. If the path is followed, its other end-
follows: for each vertex v in the subdivision let point must lie in the face / 0 and yield a fixed point
h( v) v , where i is the label associated with v. We of the original mapping.
then complete the mapping by requiring h to be The path leading to the fixed point can be
linear in each simplex of the subdivision. The followed on the computer in several ways. We can,
mapping is clearly continuous on Tand maps every for example, introduce a simplicial decomposition
boundary point of T into itself. Moreover, every of the set S x [0, 1] and approximate />' by a
subsimplex in the subdivision whose vertices are piecewise linear mapping agreeing with F on the
not completely labelled is mapped, by /?, into the vertices of the subdivision. Following the path then
boundary of T. If none of the simplices were involves the same type of calculation we have
completely labelled, this construction would yield a become accustomed to in carrying out linear
most improbable conclusion: a continuous mapping programming pivot steps. There are a great many
of T into itself which is the identity on the boundary variations in the mode of simplicial subdivision
and which maps the entire simplex into
1982 Computation of General Equilibria
Computation of General
Equilibria, Fig. 7
computed. Codenotti et al. ( ) give an over There are H agents with endowments eh e M++1^
view on recent developments along this line. and utility functions uh : 11 /
* ' —>
In this article I will not discuss any of these M. It is assumed throughout this article that utility
practical aspects of the solution of large-scale functions are smooth in the sense of Debreu ( ) -
models. I will instead focus on the following two that is, utility is C2, strictly increasing,
unrelated developments in the computation of strictly quasi-concave, exhibits non-zero Gaussian
general equilibria in economics. curvature and indifference curves do not cut the
axes.
1. The computation of equilibria in models with There are / assets available for trade. In each
time, uncertainty and missing asset markets. state s, asset j pays a bundle of commodities a,{s)
2. The computation of all equilibria and the rela- C_ M£. It is without loss of generality to assume that
tionship between exact and approximate equi- the LS x /matrix
libria in the standard Arrow-Debreu model.
/ai(l) •••
Models with Asset Markets \ai(S) .. . aj(S) J
Due to their essential static nature, standard com- has frill rank /. Allowing assets to pay in different
putable general equilibrium models suffer from an commodities is crucial when one wants to extend
oversimplified treatment of uncertainty. Agents the model to several time periods and long-lived
either solve a static problem or have myopic expec- securities.
tations, and the model can therefore not explicitly In the following, it will be useful to write
incorporate investment and saving decisions. The commodity prices as
general equilibrium model with incomplete asset
markets (GEI model) provides a basic framework P=(P(0),P(1)........pISDeV""1-1
with several agents and several commodities to
incorporate uncertainty and financial markets. See,
for example, Magill and Qunizii ( ) for
an overview of the literature. The computation of
equilibria in these models is challenging because in and the S x /asset payoff matrix (as a function of
some specifications equilibria fail to exist while in spot prices p(\)... p(S)), R(p), as
others they are often numerically unstable.
Kehoe and Prescott ( ) argue that real busi (p{\)-ax{\) ... p(l)-aj(l)\
ness cycle models provide an alternative way to m: •• :
\p{S)-ax{S) . .. p(S) ■ aj(S) J
extend computable general equilibrium to models
with time and uncertainty. There is now a large In part of the discussion we assume an exogenous
literature on the computation of equilibria in short-sale constraint, that is, there is a number 0 < K
dynamic stochastic economies. This is reviewed < oo such that the two-norm of an agent’s portfolio
elsewhere in this dictionary; see approximate must always be less than or equal to K. One can
solutions to dynamic models (linear methods); see then write an agent’s aggregate excess demand
also Judd ( ). function as the solution of his maximization
In the standard GEI model there are two time problem in the GEI economy.
periods (Kubler and Schmedders , show how the
problem of computation of equilibria in multiperiod (zh{p),(ph{p)) = arg max life1' + z) s.t.p
finance models can be essentially reduced to the ■z
two period case) and S possible states of the world
in the second period. There are L perishable
=
0(p(l) • z(l)>... ,p(S) ■
commodities available for trade at each state. z(S)) T = R(P) •‘PIMI < - K
1986 Computation of General Equilibria (New Developments)
iE^(p)ii<e-
hI e x t e n d e d pa yo f f ma t r i x R* (p) by
R*(p) = [Rip), f ip)]
A General Algorithm
and let R* i{p) be R*(p) with the f’th column
Although generally R(p) will have full rank ./, there deleted. For the constrained agents h = 2, ...,
will be so-called ‘bad prices’ at which the rank of //define
R(p) drops. When there are no short sale
constraints, that is, K oo, this leads to a discon- = a rg ma x uh{eh + z) s . t p ■ z = 0
tinuity of excess demand. Scarf’s algorithm fails: no
matter how fine the simplicial subdivision, if the {1y{\)-z{\),...,p{S)-z{S))T =R*_i(p)-p.
algorithm terminates at somc/J, one cannot
necessarily infer a bound on |z(/i)| and hence cannot Now consider a family of homotopies, indexed by i
find an e-equilibrium.
Homotopy continuation methods (see Garcia (H \
zii
(p)+tYyip,R-iip))
and Zangwill ; Eaves ) turn out to be ideally suited Hi(p,t,6) = h=2
for this numerical problem. In order to solve a R*(p)6
\e-e-i
system of equations/(.r) = 0,/: X—> Y, the basic
idea underlying homotopy methods is to find a Jo prove existence of a homotopy path, Brown et al.
smooth map H: X x [0, 1] —> Y with ( ) show that LJ/1H~1 contains a smooth
path connecting the starting point to a solution at t=
H{x, 1) = /(,v) and H(x, 0) = g(x), 1.
While generically in endowments a homotopy
where g : A * Y has a known unique zero. The map path turns out to exist, the algorithm is hardly
H is called a smooth homotopy. In using homotopy applicable in medium-sized problems, since the
methods it is crucial to set up the function, H, to number of homotopies one has to consider can
ensure that there is a smooth path that connects become quite large. An alternative is to focus on
(rs,0) with y(rs) = 0 to some (I, 1) with f{x) = 1. models with K < oo (or alternatively models with
Brown et al. ( ) develop a homotopy algo transaction costs) or to consider algorithms which
rithm which can be shown to be globally conver- might fail in a small class of problems but which
gent in that it finds an e-equilibrium for any e > 0 in are generally more efficient.
a finite number of steps. Following the so-called
Cass-trick, it is usefiil to introduce an unconstrained
agent, that is, to define the first agent maximization
problem as
Computation of General Equilibria (New Developments) 1987
Grobner Basis Brown, D.J., P.M. DeMarzo, and B.C. Eaves. 1996. Com-
For given polynomials f\,..fk in 0[x] the set puting equilibria when asset markets are incomplete.
E c o n o m e t r i c a 64: 1-27.
Codenotti, B., S. Pemmaraju, and K Varadarajan. 2004.
Algorithms column: The computation of market equi-
/= = <fv ■■■,/*) libria. A C M S I G A C T N e w s 35(4): 23-37.
Debreu, G. 1972. Smooth preferences. E c o n o m e t r i c a 40:
603-615.
Eaves, B.C. 1972. Homotopies for the computation of fixed
is called the ideal generated by/i, It turns
points. M a t h e m a t i c a l P r o g r a m m i n g 3: 1-22.
out that under conditions which can often be shown Faugere, J.C. 1999. A new efficient algorithm for computing
to hold in practice, the so-called ‘reduced Grobner Grobner bases (f4). J o u r n a l o f P u r e a n d
basis’ of this ideal, /, in the lexicographic term A p p l i e d A l g e b r a 139: 61-88.
Ferris, M.C., and J.S. Pang. 1997. Engineering and economic
order has the shape
applications of complementarity problems. S I A M
R e v i e w 39: 669-713.
# = { X \ - ?i(-v„),.v2 - q 2{ x „ ) , . . . - Garcia, C., and W. Zangwill. 1981. P a t h w a y s t o
qn_fx„),r(x„)} s o l u t i o n s , fixed p o i n t s , a n d e q u i l i b r i a .
Englewood Cliffs: Prentice Hall.
Herings, P.J.J., and K Schmedders. 2006. Computing
where r is a polynomial of degree d and the q, are equilibria in finance economies with incomplete markets
polynomials of degree d — 1. and transaction costs. E c o n o m i c T h e o r y 27: 493-
This basis can be computed exactly, using 512.
Judd, K. 1998. N u m e r i c a l m e t h o d s i n e c o n o m i c s .
Buchberger’s algorithm (recently, much more
Cambridge: MTT Press.
efficient versions of the basic algorithm have been Kehoe, T.J., and E.C. Prescott. 1995. Introduction to the
developed; see for example Faugere ). The number symposium, the discipline of applied general equilibrium.
of real solutions to the original system then equals E c o n o m i c T h e o r y 6: 1-11.
Kubler, F. 2001. Computable general equilibrium with
the number of real solutions of the univariate financial markets. E c o n o m i c T h e o r y 18: 73-96.
polynomial r(.) which can be determined exactly by Kubler, F., and K. Schmedders. 2000. Computing equilibria
Sturm’s method (see Sturmfels , for details). The in stochastic finance economies. C o m p u t a t i o n a l
E c o n o m i c s 15: 145-172.
roots of/-(.) can be approximated numerically with
Kubler, F., and Schmedders, K. 2006. Uniqueness of equi-
standard methods and the remaining solution to the libria in semi-algebraic economies. Discussion paper,
original system is linear in these roots. Northwestern University.
Kubler and Schmedders ( ) use the method Magill, M.J.P., and M. Qunizii. 1996. T h e o r y o f
i n c o m p l e t e m a r k e t s . Cambridge: MIT Press.
to test for uniqueness of equilibria in semi-
Richter, M.K., and K.-C. Wong. 1999. Non-computability of
algebraic classes of economies. competitive equilibrium. E c o n o m i c T h e o r y 14: 1-
27.
Scarf, H. 1967. On the computation of equilibrium prices. In
Ten economic studies in the tradition of
See Also i r v i n g f i s h e r , ed. W.J. Fellner. New York: Wiley.
Schmedders, K. 1998. Computing equilibria in the general
equilibrium model with incomplete asset markets.
Journal of Economic Dynamics and
C o n t r o l 2 2 : 1375-1403.
Schmedders, K. 2004. Homotopy path-following with
easyhomotopy: Solving nonlinear equations for
economic models. Working paper, Northwestern
University.
Shoven, J.B., and J. Whalley. 1992. A p p l y i n g g e n e r a l
Bibliography e q u i l i b r i u m . Cambridge: Cambridge University
Press.
Sommese, A.J., and C.W. Wampler. 2005. The numerical
Basu, S., R. Pollack, and M.-F. Roy. 2003. A l g o r i t h m s i n
solution of systems ofpolynomials arising in engineering
r e a l a l g e b r a i c g e o m e t r y . New York: Springer.
and science. Singapore: World Scientific Press.
Blum, L., F. Cucker, M. Shub, and S. Smale. 1998. C o m -
Sturmfels, B. 2002. S o l v i n g s y s t e m s o f p o l y n o m i a l
p l e x i t y a n d r e a l c o m p u t a t i o n . New York:
e q u a t i o n s , CBMS Regional Conference Series in
Springer.
Computational Methods in Econometrics 1991
JEL Classification
C15
1992 Computational Methods in Econometrics
x=arg min ^(ys - xJsb)2 of the support is the single unknown parameter that
5=1 needs to be estimated, while c is known. Two
parametric estimation methods with particularly
where x's is the sth row of matrix X and h is a k / 1
attractive statistical properties are the generalized
vector of real numbers defining the regression plane
method of moments (GMM) and the method of
Xb. Under the assumption that X has hill column
maximum likelihood (MLE). Indeed, for relatively
rank k, the solution to this ordinary least squares
large sample sizes these two methods are
minimization problem is the I inear-in-v expression
comparably attractive in terms of statistical
[i (X'X) X!y. which only requires the matrix
properties, while they differ drastically in terms of
operations of multiplication and inversion. Suppose,
computational requirements: the GMM solution is
however, that Gauss had chosen instead as his
6gmm = f ys — c, thus requiring only the simple
measure of distance the stun of absolute value of the
calculation of the sample mean y, while the MLE
deviations, and defined instead:
involves the highly nonlinear operation of finding
the minimum of the data vector y, 6mie =
s
) = vg minV|ys-A>| (2) min(y1,... ,ys).
b In the following section we discuss in turn the
s =1
leading classes of methods that are of particular
The vector fl that solves the second minimi- importance in modem econometrics, while section “
zation is known as the least absolute deviations ” introduces the con
(LAD ) estimator and has no closed-form matrix cept of parallel processing and describes its current
expression. In fact, calculation of p requires highly value and future promise in aiding dramatically
nonlinear operations for which computationally econometric computation.
efficient algorithms were developed only in the
1970s. To give a concrete example, consider the
intercept-only linear regression model where X is
the S x 1 vector of ones. Then the single P Computational Methods
coefficient that solves ( ) is the sample mean of y, Important for Econometrics
while ft that solves ( ) is the sample median of y.
The latter is orders of magnitude more difficult to The advancement of computational methods for
compute than the former since it involves sorting y econometrics relies on understanding the interplay
and finding the value in the middle, while the between the disciplines of econometric theory,
former simply adds all elements of y and divides by computer science, numerical analysis, and applied
the sample size. Clearly, it could be quite mathematics. In the five subsections below we
misleading if p and p where compared solely in discuss the leading classes of computational
terms of statistical properties without any methods that have proven of great value to modem
consideration of their substantially different econometrics.
computational requirements.
Matrix Computation and Specialized
A second example in a similar vein is the
following parametric estimation problem. Suppose a Languages
sample of size S is observed on a single variabley. It To start with the fundamental econometric frame-
is believed that each observation^ is work of linear regression, the sine qua non of
econometric computation is the ability to program
and perform efficiently matrix operations. To this
end, specialized matrix computer languages have
( 1 y y2 ... /-i\
y i y
y2 y ••• r
1 y
W~l y2 y i /
requiring the inversion of an equi-correlated var- problems in computational economics, for example,
iance covariance matrix Q of dimension S x S, the problem of optimal control.) Algorithms for
where S is of the order of the product of the number optimizing functions of many variables are a key
of available observations in the cross- section component in the collection of tools for
dimension times the number available in the time econometric computation. The suitability of a cer-
dimension. For modem panel data-sets, this can tain algorithm to a specific optimization econo-
exceed 300,000, thus making the calculation of Q 1 metric problem depends on the following
infeasible even on today’s supercomputers, let alone classification:
with the slide rales available in 1966. Fuller and
Battese ( ), however,1. Algorithms that require the calculation of first
showed that the equi-correlated nature of the one- and possibly second derivatives Versus algo-
factor random effects model made calculation of the rithms that do not. Clearly, if the function to be
GLS estimator equivalent to an OLS problem, optimized is not twice continuously differen-
where the dependent variable y and the regressors X tiable (as is the case with LAD) or even dis-
are simple linear combinations of the original data continuous (as is the case with the maximum
score estimator for the semiparametric analysis
v„, „rllY,..., xklt and its time ^averages yu,x\it, ... ,xku
defined by of the binary response model - see Manski
{ylt
and y,, =yit — /f,, and analogously for the regressor ), algorithms that require differentiability
variables. This realization allowed the calculation of will not be suitable. The leading example of an
the GLS estimator without the need for inverting the algorithm not relying on derivatives is the non-
usually problematically large Q matrix. linear simplex method of Nelder and Meade (
Another important case where a theoretical ).
development in methodology led to a dramatic 2. Local Versus global algorithms. Optimization
lowering of the computational burden and hence algorithms of the first type (for example, Gauss-
allowed the calculation of models that would oth- Newton, Newton-Raphson, and Bemdt et al. (
erwise have had to wait perhaps for decades for )) search for an optimum in the vicinity
sufficient advancements in computer technology is of the starting values fed into the algorithm.
the simulation-based inference for Limited This strategy may not necessarily lead to a
Dependent Variable models, associated with the global optimum over the full set of parameter
name of Daniel McFadden ( ). See section space. This is of particular importance if the
“ ” below, McFadden, Daniel function to be optimized has multiple local
and simulation-based estimation. optima, where typically the estimator with the
desirable statistical properties corresponds to
Optimization locating the overall optimum of the function. In
Many econometric estimators with attractive sta- such cases, global optimization algorithms (for
tistical properties require the optimization of a example, simulated annealing and genetic
(generally ) nonlinear function of the form: optimization algorithm) should be employed
instead.
q=aig m ax F(6; data) ( 3)
9 Special methods are necessary for constrained
optimization, where a function must be maximized
over a vector of unknown parameters 0 of dimen- or minimized subject to a set of equality or
sion p, typically considerably larger than 1. Exam- inequality constraints. These problems, in general
ples are: the method of maximum likelihood, considerably more demanding than unconstrained
minimum-distance (OLS, LAD, GMM), and other optimization, can be handled through three main
extremum estimators. (The need to optimize alternative approaches: interior, exterior and re-
functions numerically is also important for certain parameterization methods.
Comprehensive reviews of optimization
methods in econometrics can be found in Goldfeld
Computational Methods in Econometrics 1995
• Simulation-based Bayesian and classical infer- in the case of simulation-based estimators. See
ence, where integrals are approximated through Nagumey ( ) for a discussion of parallel com
computer simulation (known as Monte Carlo putation in econometrics.
integration). Particularly important methods in An alternative approach for parallel computation
this context are the following: frequency that does not involve a single computer with many
simulation; importance sampling; and Markov processors has been developed recently and offers
chain Monte Carlo methods (the leading considerable promise for computational
exponents being Gibbs resampling and the econometrics. Through the use of specialized
Metropolis/Hastings algorithm). A related class computer languages, many separate computers are
of methods, known as variance-reduction harnessed together over an organization’s intranet
simulation techniques, includes control variates or even over the internet, and an econometric
and antithetics. See Geweke ( ) and computation task is distributed across them. The
Hajivassiliou et al. ( ) for reviews. See also benefits of this approach depend critically on the
simulation-based estimation. relative burden of the overhead of communicating
across the individual computers when organizing
the splitting of the tasks and then collecting and
Parallel Computation processing the separate partial results. Such
distributed parallel computation has the exciting
Parallel processing, where a computation task is potential of affording formidable super-computing
broken up and distributed across different com- powers to econometric researchers with only
puters, is a technique that can afford huge savings modest computer hardware.
in terms of total time required for solving partic-
ularly difficult econometric problems. For example,
the simulation-based estimators mentioned in the
previous section exhibit the potential of significant
See Also
computational benefits by calculating them on
computers with massively parallel architectures, ► Longitude: Data Analysis
because the necessary calculations can be organized ► addi ?om 1537)
in essentially an independent pattern. An example ► i Ton-' v:: Programming
of such a computer is the Connection Machine CM- ► i f o i m : .
5 at the National Center for Supercomputing
Applications in Illinois with 1024 identical ► o . - rs in Econometrics
processors in a multiple- instruction/multiple-data ► Simulation-Based Estimation
(MIMDI) configuration. The benefits of such a
parallel architecture on the problem of solving an
Bibliography
econometric optimization classical estimator not
involving simulation can also be substantial, since Abramowitz, M., and I. Stegun. 1964. H a n d b o o k o f
such estimators involve the evaluation of m a t h e m a t i c a l f u n c t i o n s . Washington, DC:
contributions to the criterion (for example, National Bureau of Standards.
likelihood) function in the case of independently Balestra, P., and M. Nerlove. 1966. Pooling cross-section and
time-series data in the estimation of a dynamic model.
and identically distributed (i.i.d.) observations. E c o n o m e t r i c a 34: 585-612.
Since typical applications in modem applied Belsley, D. 1974. Estimation of system of simultaneous
econometrics using cross-sectional and longitudinal equations and computational specifications of GREM-
data sets involve several thousands of i.i.d. LIN. A n n a l s o f E c o n o m i c a n d S o c i a l
M e a s u r e m e n t 3: 551-614.
observations, the potential benefits of parallel Bemdt, E.K., B.H. Hall, R.E. Hall, and J.A. Hausman. 1974.
calculations of such estimators should be obvious. Estimation and inference in nonlinear structural models.
The benefits of a massively parallel computer Annals of Economic and Social
architecture become even more pronounced M e a s u r e m e n t 3: 653-666.
Computer Industry 1997
Davis, P.J., and P. Rabinovitz. 1984. M e t h o d s o f Zellner, A., L. Bauwens, and H. VanDijk. 1988. Bayesian
n u m e r i c a l i n t e g r a t i o n . New York: Academic. specification analysis and estimation of simultaneous
Dennis, J.E., and R.B. Schnabel. 1984. U n c o n s t r a i n e d equation models using Monte Carlo methods. J o u r n a l
optimization and nonlinear equations. o f E c o n o m e t r i c s 38: 73-90.
Englewood Cliffs: Prentice-Hall.
Drud, A. 1977. An optimization code for nonlinear econo-
metric models based on sparse matrix techniques and
reduced grades. A n n a l s o f E c o n o m i c a n d
S o c i a l M e a s u r e m e n t 6: 563-580. Computer Industry
Efron, B. 1982. T h e j a c k k n i f e , t h e b o o t s t r a p ,
a n d o t h e r r e s a m p l i n g p l a n s , CBMS-NSF
Shane Greenstein
monographs No. 38. Philadelphia: SIAM.
Fuller, W.A., and G.E. Battese. 1973. Transformations for
estimation of linear models with nested-error structure.
Journal of the American Statistical
A s s o c i a t i o n 68: 626-632.
Geweke, J. 1988. Antithetic acceleration of Monte Carlo Abstract
integration in Bayesian inference. J o u r n a l o f Commercial computing has grown to include an
E c o n o m e t r i c s 38: 73-90. extraordinary range of economic undertakings.
Geweke, J. 1996. Monte Carlo simulation and numerical
integration. In H a n d b o o k o f c o m p u t a t i o n a l
In any given era, computing markets are
e c o n o m i c s , vol. 1, ed. H. Amman, D. Hendrik, and J. organized around platforms - a cluster of tech-
Rust. Amsterdam: North-Holland. nically standardized components that buyers use
Goldfeld, S., and R. Quandt. 1972. N o n l i n e a r m e t h o d s together to make the wide range of applications.
i n e c o n o m e t r i c s . Amsterdam: North-Holland.
There has been an increasing secular trend in the
Golub, G.H. 1969. Matrix decompositions and statistical
calculations. In S t a t i s t i c a l c o m p u t a t i o n , ed. number of firms that possess the necessary
R.C. Milton and J.A. Milder. New York: Academic. technical knowledge and commercial
Hajivassiliou, V.A., and PA. Ruud. 1994. Classical estimation capabilities to bring to market some component
methods using simulation. In H a n d b o o k of
or service. While general improvements in
e c o n o m e t r i c s , vol. 4, ed. R. Engle and D. McFadden.
Amsterdam: North-Holland. technical capabilities are readily apparent, it is
Hajivassiliou, V.A., D.L. McFadden, and P.A. Ruud. quite difficult to calculate the productivity
1996. Simulation of multivariate normal rectangle improvements arising from increased investment
probabilities and derivatives: Theoretical and compu-
in and use of computing.
tational results. J o u r n a l o f E c o n o m e t r i c s 72(1,
2): 85-134.
Judd, K. 1996. Approximation, perturbation, and projection Keywords
methods in economic analysis. In H a n d b o o k o f
c o m p u t a t i o n a l e c o n o m i c s , vol. 1, ed. H.
Computer industry; Economic growth; Infor-
Amman, D. Hendrik, and J. Rust Amsterdam: North- mation technology; Innovation; Moore’s law
Holland.
Manski, C. 1975. Maximum score estimation of the stochastic
utility model of choice. J o u r n a l o f E c o n o m e t r i c s
JEL Classifications
3: 205-228.
McFadden, D. 1989. A method of simulated moments for
L63
estimation of multinomial discrete response models.
E c o n o m e t r i c a 57: 995-1026. The commercial computing industry accounts for a
Nagumey, A. 1996. Parallel computation, l a H a n d b o o k large fraction of economic activity. From its
o f c o m p u t a t i o n a l e c o n o m i c s , vol. 1, ed. H. military and research origins in the late 1940s, it
Amman, D. Hendrik, and J. Rust Amsterdam: North-
Holland.
spread into the commercial realm and has since
Nelder, J.A., and R. Meade. 1965. A simplex method for grown to include an extraordinary range of eco-
function minimization. C o m p u t e r J o u r n a l 7: 308- nomic undertakings. Many economists believe this
313. expansion of applications for computing has been a
Press, W.H., B.P. Flannery, S.A. Teukolsky, and W.T.
Vetterling. 2001. Numerical recipes in
driver of economic growth.
Fortran 77: T h e art of scientific Computing aids the automated tracking of
c o m p u t i n g . Cambridge: Cambridge University Press. transactions, a function that finds use, for
Quandt, R. 1983. Computational problems and methods. In
1998 Computer Industry
example, in automating billing, managing the segments. Trained engineers or programmers made
pricing of inventories of airline seating, and up the technical user base, while the commercial
restocking retail outlets in a geographically dis- market was geared more towards administrators,
persed organization. It also facilitates the coordi- secretaries and office assistants.
nation of information-intensive tasks, such as the The most popular platform in the late 1980s and
dispatching of time-sensitive deliveries or emer- 1990s differed from the prominent platforms of
gency services. Computing also enables perfor- earlier years. The personal computer (PC) began in
mance of advanced mathematical calculations, the mid- 1970s as an object of curiosity among
useful in such diverse activities as calculating technically skilled hobbyists, but became a common
interest on loans and generating estimates of office tool after the entry of IBM's design. Unlike
underground geological deposits. Computer- aided prior computing platforms, this one has diffused
precision also improves the efficiency of processes into both home and business use. From the
such as manufacturing metal shapes or the beginning, this platform involved thousands of large
automation of communication switches, to name and small software developers, third-party
just two. peripheral equipment and card developers, and a
In any given era, computing markets arc orga- few major players. In more recent experience,
nized around platforms - a cluster of technically control over the standard has completely passed
standardized components that buyers use together to from IBM to Microsoft and Intel. Microsoft
make the aforementioned wide range of appli- produces the Windows operating system and Intel
cations. Such platforms involve long-lived assets, produces the most commonly used microprocessor.
both components sold in markets (that is, hardware For this reason the platform is often called Wintcl.
and some software) and components made by The networking and internet revolution in the
buyers (that is, training and most software). late 1990s is responsible for blurring once- familiar
Important computing platforms historically include distinctions. These new technologies have made it
the UNIVAC, the IBM 360 and its descen- dents, feasible to build client-server systems within large
the Wang minicomputers, IBM AS/400, DEC VAX, enterprises and across ownership boundaries. It
Sun SPARC, Intel Windows PC, Unix/Linux, and, employs internet-based computing systems
after the mid-1990s, TCP/IP- based client-server networked across potentially vast geographic
platforms linked together. distances, supporting the emergence of a ‘network
Vendors tend to sell groups of compatible of networks’.
products under umbrella strategies aimed at the Despite frequent and sometimes dramatic tech-
users of particular platforms. In the earliest eras of nical improvements in specific areas of technology,
computing markets, the leading films integrated all many features of the most common platforms in use
facets of computing and offered a supply of goods tend to persist or change very slowly. Many durable
and services from a centralized source. In later eras, components make up platforms. And, though they
the largest and most popular platforms historically lose their market value as they become obsolete in
included many different computing, comparison with frontier products, they do not as
communications and peripheral equipment films, quickly lose their ability' to provide a flow of
software tool developers, application software services to users. Consequently, new technology
writers, consultants, system integrators, distributors, tends to be most success- fill when new components
user groups, news publications and service enhance and preserve the value of previous
providers. investments, a factor that creates demand for
Until the early 1990s, most market segments ‘backward compatible’ upgrades or improvements.
were distinguished by the size of the tasks to be It also creates a demand for support and service
undertaken and by the technical sophistication of activities to reduce the costs of making the
the typical user. Mainframes, minicomputers, transition from old to new.
workstations, and personal computers, in decreasing
order, constituted different size-based market
Computer Industry 1999
Control over changes to design and other aspects hardware and software products. It is also coinci-
of technical standards shapes the backward dent with their increasing geographic reach. In
compatibility for key components. Control of these addition, as in other manufacturing processes, the
decisions is coincident with platform leadership - increasing use of sophisticated information
determining the rate and direction of change in technology helps coordinate design and production
technical features of components around which involving firms from many countries and
other firms build their businesses. In each platform, continents.
it is very rare to observe more than a small number While the spawning of new information tech-
of firms acquiring leadership positions. Since such nology businesses in North America has tended to
positions have been historically associated with be concentrated in a small number of locations,
high firm profitability, firms compete fiercely for such as the Boston area and Santa Clara Valley
market dominance in component categories where (popularly known as Silicon Valley), every other
standards are essential. Not surprisingly, facet of the supply chain for computing involves
competitive behavioiu affiliated with obtaining and firms headquartered and operating in a much wider
retaining market leadership does occasionally set of locations. In North America, these range from
receive attention from antitrust authorities. Seattle, Austin, Los Angeles, the greater New York
Though innovative change in computing began area, Denver-Boulder, Washington DC, the North
well prior to the invention of the integrated circuit, Carolina Research Triangle, Chicago, and virtually
in popular discussion advances in computing have all major cities in the United States. The supply
become almost synonymous with advances in chain for many complementary components has
microprocessors. This is due to an observation by also been associated with many firms in Western
Gordon Moore, who co-founded and became Europe and as well as in India, Israel, South Korea,
chairman at Intel. In 1965 he foresaw a doubling of Singapore, Taiwan and China. Even more
circuits per chip every two years. This prediction widespread are computing service firms, which
about the rate of technical advance later became follow business and home users dispersed across
known as ‘Moore’s law’. In fact, microprocessors the globe.
and DRAMS have been doubling in capability Despite this geographic dispersion since the
every 18 months since the mid-1970s. 1950s, US companies have retained leadership in
Moore’s prediction pertained narrowly to inte- generating new platforms and commercializing
grated circuits. However, a similar pattern of frontier technologies in forms that most users find
improvement - though with variation in the rate - valuable. Part of this results from the persistence of
characterizes other electronic components that go platform leadership for a time within a segment. In
into producing a computer or that are addition, US firms have historically been ascendant
complementary with computing in many standard whenever platform leadership has changed.
uses. This holds for disk drives, display screens, However, this pattern seems likely to change in the
routing equipment, and data-transmission capacity, 21st century, as non-US firms already have found
to name a few. Such widespread innovation creates leadership positions in producing components of
opportunities for new entry and rearrangements in many platforms and in related areas of electronics,
the conditions of supply. such as consumer electronics, communication
Accordingly, there has been an increasing sec- equipment and specialized software.
ular trend in the number of firms that possess the While general improvements in technical capa-
necessary technical knowledge and commercial bilities are readily apparent, it is quite difficult to
capabilities to bring to market some component or calculate the productivity improvements arising
service of value to computing users. This factor from increased investment in and use of computing.
alone explains the increasing complexity of supply There is no question that existing computing
chains for the supply of most computing activities have become less expensive, while new
capabilities have been achieved. This has allowed
2000 Computer Science and Game Theory
economic actors to attain previously unobtainable Brensnahan, T., and F. Malerba. 1999. Industrial dynamics
outcomes. This shift in economic possibilities has and the evolution of firm’s and nations’ competitive
capabilities in the world computer industry. In S o u r c e s
generated a restructuring of organizational routines, o f i n d u s t r i a l l e a d e r s h i p , ed. D. Mowery and R.
market relationships, and other activities associated Nelson. Cambridge, UK: Cambridge University Press.
with the flow of goods, which inevitably improves Brynjolfsson, E., and H. Lorin. 2000. Beyond computation:
Information technology, organizational transformation
the economy’s ability to transform inputs into
and business performance. J o u r n a l o f E c o n o m i c
consumer welfare. P e r s p e c t i v e s 14(4): 23-48.
Yet altering the business use of computing can Dedrick, J., and K. Kraemer. 2005. The impacts of IT on firm
be slow. It often demands large adjustment costs and industry structure: The personal computer industry.
C a l i f o r n i a M a n a g e m e n t R e v i e w 47(3): 122-
and gradual learning about which organizational
142.
processes can best employ advances in computing. Flamm, K. 2003. The new economy in historical perspective:
It can involve a reallocation of decision rights and Evolution of digital electronics technology. In N e w
discretion inside a large organization, especially e c o n o m y h a n d b o o k , ed. D. Jones. San Diego:
Academic Press/Elsevier.
when business units alter a wide array of Greenstein, S. (ed.). 2006. T h e i n d u s t r i a l e c o n o m i c s
intermediate routine processes (such as billing, o f c o m p u t i n g . Northampton: Edward Elgar.
account monitoring, and inventory management) or Jorgenson, D., and C. Wessner (eds.). 2005.
the coordination of services (such as the delivery of Deconstructing the computer: Report of a symposium.
Washington, DC: National Academies Press.
data for decision support). Moreover, the largest McKinsey Global Institute. 2001. U.S. productivity growth,
changes come from altering many complementary 1995-2000: Understanding the contribution of information
activities that respond to new and unanticipated technology relative to other factors. Washington, DC:
McKinsey and Co.
opportunities, setting off new waves of invention.
Each wave’s productivity effect is interwoven with
others.
Along with these improvements the boundaries
of the ‘computing market’ have changed. A Computer Science and Game Theory
hardware-based definition for the computing market
was barely adequate in the 1960s and is no longer Joseph Y. Halpern
adequate for economic analysis. However, there is
no consensus about what alternative framing will be
appropriate for understanding value creation,
supplier behaviour, and user adoption in computing Abstract
in the 21st century. Work at the intersection of computer science and
game theory is briefly surveyed, with a focus on
the work in computer science. In particular, the
See Also following topics are considered: various roles of
computational complexity in game theory,
► Diffusion of Technology including modelling bounded rationality, its role
► General urpose Technologies in mechanism design, and the problem of
► -iformation Ic .. ..gy I Wc: computing Nash equilibria; the price of anarchy,
1:1 that is, the cost of using decentralizing solution
to a problem; and interactions between
► Internet, Economics of the distributed computing and game theory.
► Technical Change
Bibliography
Keywords
Bresnahan, T., and S. Greenstein. 1999. Technological Algorithmic knowledge; Algorithmic mecha-
competition and the structure of the computer industry. nism design; Bayesian networks; Bounded
J o u r n a l o f I n d u s t r i a l E c o n o m i c s 47(1): 1-40. rationality; Byzantine agreement; Cheap talk;
Computer Science and Game Theory 2001
JEL Classifications
The influence of computer science in game theory
CO; C6 has perhaps been most strongly felt through com-
plexity theory. I consider some of the strands of this
research here. There are a numerous basic texts on
complexity theory that the reader can consult for
Introduction more background on notions like NP-completeness
and finite automata, including Hopcroft and Ullman
There has been a remarkable increase in work ( ) and Papadimitriou
at the interface of computer science and ( )•
game theory in the past decade. Game theory
forms a significant component of some major Bounded Rationality
computer science conferences (see, for example, One way of capturing bounded rationality is in
Kearns and Reiter ; Sandholm and Yokoo ); leading terms of agents who have limited computational
computer scientists are often invited to speak at power. In economics, this line of research goes back
major game theory conferences, such as the World to the work of Neyman ( ) and Rubin-
Congress on Game Theory 2000 and 2004. In this stein ( ), who focused
article I survey some of the main themes of work in on finitely repeated
the area, with a focus on the work in computer Prisoner’s Dilemma. In n-round finitely repeated
2 1
science. Given the length constraints, make no Prisoner’s Dilemma, there are 2 " strategies (since
attempt at being comprehensive, especially since a strategy is a function from histories to {cooperate,
other surveys are also available, including Halpem ( defect}, and there are clearly 2" 1
), Linial ( ), Papadimitriou histories of length < n). Finding a best response to a
( ), and a comprehensive survey book particular move can thus potentially be difficult.
(Nisanetal. ). Clearly people do not find best responses by doing
The survey is organized as follows. I look at the extensive computation. Rather, they typically rely
various roles of computational complexity in game on simple heuristics, such as ‘tit for tat’ (Axelrod ).
theory in section “ Such heuristics can often be captured by finite
”, including its use in modelling bounded automata; both Neyman and Rubinstein thus focus
rationality, its role in mechanism design, and the on finite automata playing repeated Prisoner’s
problem of computing Nash equilibria. In section ”, Dilemma. Two computer scientists, Papadimitriou
I consider a game- theoretic problem that originated and Yannakakis ( ), showed
in the computer science literature, but should be of that if both players in an n-round Prisoner’s
interest to the game theory community: computing Dilemma are finite automata with at least 2" 1
the price of anarchy, that is, the cost of using a states, then the only equilibrium is the one where
decentralizing they defect in every round. This result says that a
finite automaton with exponentially many states
2002 Computer Science and Game Theory
can compute best responses in Prisoner’s Dilemma. whether there exists a correlated equilibrium
We can then model bounded rationality by (Aumann ) where each player gets a payoff of at
restricting the number of states of the automaton. least r is computable in polynomial time. In general,
Neyman ( ) showed, roughly speaking, that if qsts regarding correlated equilibrium seem easier
the two players in «-round Prisoner’s Dilemma are than the analogous qsts for Nash equilibrium; see
modelled by finite automata with a number of states Papadimitriou ( ) and
in the interval \nyk, //'] for some k, then collaboration Papadimitriou and Roughgarden ( ) for fur-
can be approximated in equilibrium; more precisely, ther examples. Chu and Halpem ( ) prove
if the payoff for (cooperate, cooperate) is (3, 3) similar NP-completeness results if the game is
there is an equilibrium in the repeated game where represented in extensive form, even if all players
the average payoff per round is greater than 3 j. for have the same payoffs (a situation that arises
each player. Papadimitriou and Yannakakis ( ) frequently in computer science applications, where
sharpen we can view the players as agents of some designer,
this result by showing that if at least one of the and take the payoffs to be the designer’s payoffs).
players has fewer than 2c‘n states, where Conitzer and Sandholm ( ) give a
= -p-j-pj—y, then for sufficiently large n, there compendium of hardness results for various qsts
is an equilibrium where each player’s average pay- regarding Nash equilibria.
off per round is greater than 3 — e. Thus, compu- Nevertheless, there is a sense in which it seems
tational limitations can lead to cooperation in that the problem of finding a Nash equilibrium is
easier than typical NP-complete problems, because
Prisoner’s Dilemma.
every game is guaranteed to have a Nash
There have been a number of other attempts to
equilibrium. By way of contrast, for a typical NP-
use complexity-theoretic ideas from computer sci-
complete problem like prptal satisfiability, whether
ence to model bounded rationality (see Rubinstein ,
or not a prptal formula is satisfiable is not known.
for some exs). However, it seems that there is much
Using this observation, it can be shown that if
more work to be done here.
finding a Nash equilibrium is NP-complete, then NP
= coNP. Recent work has in a sense completely
Computing Nash Equilibrium characterized the complexity of finding a Nash
Nash ( ) showed every finite game has a Nash equilibrium in normal-form games: it is a PPAD-
equilibrium in mixed strategies. But how hard is it complete problem (Chen and Deng ; Daskalis et
to actually find that equilibrium? On the positive al. ). PPAD stands for ‘polynomial parity argument
side, there are well known algorithms for computing (directed case)’; see Papadimitriou ( ) for a formal
Nash equilibrium, going back to the classic Lemke- definition and
Howson ( ) algorithm, with a spate of examples of other PPAD problems. It is believed
recent improvements (see, for example, Govindan that PPAD-complete problems are not solvable in
and Wilson ; Blum et al. ; Porter et al. ). Moreover, polynomial time, but are simpler than NP-complete
for certain classes of games (for example, problems, although this remains an open problem.
symmetric games, See Papadimitriou ( ) for an
Papadimitriou and Roughgarden ), there are known overview of this work.
to be polynomial-time algorithms. On the negative
side, many qsts about Nash equilibrium are known Algorithmic Mechanism Design
to be NP-hard. For example, Gilboa and Zemel ( The problem of mechanism design is to design a
) showed that, for a game pre game such that the agents playing the game, moti-
sented in normal form, deciding whether there vated only by self-interest, achieve the designer’s
exists a Nash equilibrium where each player gets a goals. This problem has much in common with the
payoff of at least r is NP-complete. Interestingly, standard computer science problem of designing
Gilboa and Zemel also show that computing protocols that satisfy certain specifications (for
example, designing a distributed protocol that
Computer Science and Game Theory 2003
achieves Byzantine agreement; see section “ related to combinatorial auctions. For a detailed
”). Work on discussion and references see Cramton et al. ( ); I
mechanism design has traditionally ignored com- briefly discuss a few of the issues
putational concerns. But Kfir-Dahav et al. ( ) involved here.
show that, even in simple settings, optimizing social Suppose that there are n items being auctioned.
welfare is NP-hard, so that perhaps the most Simply for a bidder to communicate her bids to the
common approach to designing mechanisms, auctioneer can take, in general, exponential time,
applying the Vickrey-Groves-Clarke (VCG) since there are 2" bundles. In many cases, we can
procedure (Clarke ; Groves ; Vickrey ), is not going identify a bid on a bundle with the bidder’s
to work in large systems. We might hope that, even valuation of the bundle. Thus, we can try to care-
if we cannot compute an optimal mechanism, we fully design a bidding language in which a bidder
might be able to compute a reasonable can communicate her valuations succinctly. Simple
approximation to it. However, as Nisan and Ronen ( information-theoretic arguments can be used to
, ) show that, for every bidding language, there will be
show, in general, replacing a VCG mechanism by valuations that will require length at least 2" to
an approximation does not preserve truthfulness. express in that language. Thus, the best we can
That is, even though truthfully revealing one’s type hope for is to design a language that can represent
is an optimal strategy in a VCG mechanism, it may the ‘interesting’ bids succinctly. See Nisan ( ) for
no longer be optimal in an approximation. an overview of various bidding
Following Nisan and Ronen’s work, there has been languages and their expressive power.
a spate of papers either describing computationally Given bids from each of the bidders in a com-
tractable mechanisms or showing that no binatorial auction, the auctioneer would like to then
computationally tractable mechanism exists for a determine the winners. More precisely, the
number of problems, ranging from task allocation auctioneer would like to allocate the m items in an
(Archer and Tardos ; Nisan and Ronen ) to cost- auction so as to maximize his revenue. This prob-
sharing for multicast trees (Feigenbaum et al. ) lem, called the winner determination problem, is
(where the problem NP-complete in general, even in relatively simple
is to share the cost of sending, for example, a movie classes of combinatorial auctions with only two
over a network among the agents who actually want bidders making rather restricted bids. Moreover, it
the movie) to finding low-cost paths between nodes is not even polynomial-time approximable, in the
in a network (Archer and Tardos ). sense that there is no constant d and polynomial-
The problem that has attracted perhaps the most time algorithm such that the algorithm produces an
attention is combinatorial auctions, where bidders allocation that gives revenue that is at least 1/d of
can bid on bundles of items. This becomes of optimal. On the other hand, there are algorithms
particular interest in situations where the value to a that provably find a good solution, seem to work
bidder of a bundle of goods cannot be determined well in practice, and, if they seem to be taking too
by simply summing the value of each good in long, can be terminated early, usually with a good
isolation. To take a simple example, the value of a feasible solution in hand. See Lehmann et al. ( ),
pair of shoes is much higher than that of the for an overview of the results in
individual shoes; perhaps more interestingly, an this area.
owner of radio stations may value having a licence In most mechanism design problems, compu-
in two adjacent cities more than the sum of the tational complexity is seen as the enemy. There is
individual licences. Combinatorial auctions are of one class of problems in which it may be a friend:
great interest in a variety of settings including voting. One problem with voting mechanisms is
spectrum auctions, airport time slots (that is, take- that of manipulation by voters. That is, voters may
off and landing slots), and industrial procurement. be tempted to vote strategically rather than ranking
There are many complexity-theoretic issues the candidates according to their true preferences, in
the hope that the final outcome will be
2004 Computer Science and Game Theory
more favourable. This situation arises frequently in explosion of work, leading to a better understanding
practice; in the 2000 US presidential election, of the communication complexity for many
American voters who preferred Nader to Gore to important economic allocation problems; see Segal
Bush were encouraged to vote for Gore, rather than ( ) for an overview. Two important
‘wasting’ a vote on Nader. The classic Gibbard- themes in this work are understanding the role of
Satterthwaite theorem (Gibbard ; Satterthwaite ) price- based market mechanisms in solving allo-
shows that, if there are at least three alternatives, cation problems with minimal communication, and
then in any nondictatorial voting scheme (that is, designing mechanisms that provide agents with
one where it is not the case that one particular voter incentives to communicate truthfully while having
dictates the final outcome, irrespective of how the low communication requirements.
others vote), there are preferences under which an
agent is better off voting strategically. The hope is
that, by constructing the voting mechanism The Price of Anarchy
appropriately, it may be computationally intractable
In a computer system, there are situations where we
to find a manipulation that will be beneficial. While
may have a choice between a centralized and a
finding manipulations for the plurality protocol (the
decentralized solution to a problem. By ‘central-
candidate with the most votes wins) is easy, there
ized’ here, I mean that each agent in the system is
are well-known voting protocols for which manipu-
told exactly what to do and must do so; in the
lation is hard in the presence of three or more
decentralized solution, each agent tries to optimize
candidates. See Conitzer et al. ( ) for a sum
his own selfish interests. Of course, centralization
mary of results and further pointers to the literature.
comes at a cost. For one thing, there is a problem of
Communication Complexity enforcement. For another, centralized solutions tend
Most mechanisms in the economics literature are to be more vulnerable to failure. On the other hand,
designed so that agents truthfully reveal their a centralized solution may be more socially
preferences. However, in some settings, revealing beneficial. How much more beneficial can it be?
one’s full preferences can require a prohibitive Koutsoupias and Papadimitriou ( ) formal
amount of communication. For example, in a ized this question by considering the ratio of the
combinatorial auction of m items, revealing one’s social welfare of the centralized solution to the
full preferences may require revealing what one social welfare of the Nash equilibrium with the
would be willing to pay for each of the 2"' 1 worst social welfare (assuming that the social
possible bundles of items. Even if in 30, welfare function is always positive). They called
this requires revealing more than one billion num- this ratio the price of anarchy; and proved a number
bers. This leads to an obvious qst: how much of results regarding the price of anarchy for a
communication is required by various mechanisms? scheduling problem on parallel machines. Since the
Formal work on this question in the economics original paper, the price of anarchy has been studied
community goes back to Hurwicz ( ) in many settings, including traffic routing
and Mount and Reiter ( ); their definitions (Roughgarden and Tardos ), facility location games
focused on the dimension of the message space. (for example, where is the best place to put a
Independently (and later), there was active work in factory) (Vetta ), and spectrum sharing (how should
computer science on communication complexity, the channels in a WiFi network be assigned)
number of bits of communication needed for a set (Halldorsson et al. ).
of n agents to compute the value of a function / : To give a sense of the results, consider the
xni=l 0, —> X , where each agent i knows 9j C 0, traffic-routing context of Roughgarden and Tardos (
(Think of 6t as representing agent fs type.) Recently ). Suppose that the travel time on a
there has been an road increases in a known way with the congestion
on the road. The goal is to minimize the
Computer Science and Game Theory 2005
average travel time for all drivers. Given a road are assumed to be n soldiers, up to t of which may
network and a given traffic load, a centralized be faulty (the t stands for traitor)', n and / are
solution would tell each driver which road to take. assumed to be common knowledge. Each soldier
For example, there could be a rule that cars with starts with an initial preference, to either attack or
odd-numbered licence plates take road 1, while retreat. (More precisely, there are two types of
those with even-numbered plates take road 2, to nonfaulty agents - those that prefer to attack, and
minimize congestion on either road. Roughgarden those that prefer to retreat.) We want a protocol that
and Tardos show that the price of anarchy is guarantees that (1) all nonfaulty soldiers reach the
unbounded if the travel time can be a nonlinear same decision, and (2) if all the soldiers are
function of the congestion. On the other hand, if it is nonfaulty and their initial preferences are identical,
linear, they show that the price of anarchy is at most then the final decision agrees with their initial
4/3. preferences. (The condition simply prevents the
The price of anarchy is but one way of com- obvious trivial solutions, where the soldiers attack
puting the ‘cost’ of using a Nash equilibrium. no matter what, or retreat no matter what.)
Others have been considered in the computer sci- The problem was introduced by Pease et al. (
ence literature. For example, Tennenholtz ( ) ), and has been studied in detail since
compares the safety level of a game - the optimal then; Chor and Dwork ( ), Fischer ( ), and
amount that an agent can guarantee himself, inde- Linial ( ) provide overviews. Whether the
pendent of what the other agents do - to what the Byzantine agreement problem is solvable depends
agent gets in a Nash equilibrium, and shows, for in part on what types of failures are considered, on
interesting classes of games, including load- whether the system is synchronous or asynchro-
balancing games and first-price auctions, that the nous, and on the ratio of n to t. Roughly speaking, a
ratio between the safety level and the Nash equi- system is synchronous if there is a global clock and
librium is bounded. For example, in the case of agents move in lockstep; a ‘step’ in the system
first- price auctions, it is bounded by the constant e. corresponds to a tick of the clock. In an asynchro-
nous system, there is no global clock. The agents in
the system can mn at arbitrary rates relative to each
Game Theory and other. One step for agent 1 can correspond to an
Distributed Computing arbitrary number of steps for agent 2 and vice versa.
Synchrony is an implicit assumption in essentially
Distributed computing and game theory are inter-
all games. Although it is certainly possible to model
ested in much the same problems: dealing with
games where player 2 has no idea how many moves
systems where there are many agents, facing
player 1 has taken when player 2 is called upon to
uncertainty and having possibly different goals. In
move, it is not typical to focus on the effects of
practice, however, there has been a significant
synchrony (and its lack) in games. On the other
difference in emphasis between the two areas. In
hand, in distributed systems, it is typically a major
distributed computing, the focus has been on
focus.
problems such as fault tolerance, asynchrony,
Suppose for now that we restrict to crash fail-
scalability, and proving correctness of algorithms;
ures, where a faulty agent behaves according to the
in game theory, the focus has been on strategic
protocol, except that it might crash at some point,
concerns. I discuss here some issues of common
after which it sends no messages. In the round in
interest. Most of the discussion in the remainder of
which an agent fails, the agent may send only a
this section is taken from Halpem ( ).
subset of the messages that it is supposed to send
To understand the relevance of fault tolerance
according to its protocol. Further suppose that the
and asynchrony, consider the Byzantine agreement
system is synchronous. In this case, the following
problem, a paradigmatic problem in the distributed
rather simple protocol achieves Byzantine
systems literature. In this problem, there
agreement:
2006 Computer Science and Game Theory
• In the first round, each agent tells every other sends the message to j. Thus, j also decides to attack.
agent its initial preference. A symmetric argument shows that if j decides to
• For rounds 2 to t + 1, each agent tells every attack, then so does i.
other agent everything it has heard in the pre- It should be clear that the correctness of this
vious round. Thus, for example, in round 3, protocol depends on both the assumptions made:
agent 1 may tell agent 2 that it heard from agent crash failures and synchrony. Suppose instead that
3 that its initial preference was to attack, and Byzantine failures are allowed, so that faulty agents
that it (agent 3) heard from agent 2 that its initial can deviate in arbitrary ways from the protocol; they
preference was to attack, and it heard from agent may Tie’, send deceiving messages, and collude to
4 that its initial preferences was to retreat, and fool the nonfaulty agents in the most malicious
so on. This means that messages get ways. In this case, the protocol will not work at all.
exponentially long, but it is not difficult to In fact, it is known that agreement can be reached in
represent this information in a compact way so the presence of Byzantine failures iff t < n/3, that is,
that the total communication is polynomial in n, iff fewer than a third of the agents can be faulty
the number of agents. (Pease et al. ). The effect of asynchrony is even
• At the end of round t+ 1, if an agent has heard more devastating: in an asynchronous system, it is
from any other agent (including itself) that its impossible to reach agreement using a deterministic
initial preference was to attack, it decides to protocol even if t = 1 (so that there is at most one
attack; otherwise, it decides to retreat. failure) and only crash failures are allowed (Fischer
et al. ). The problem in the asynchronous setting is
Why is this correct? Clearly, if all agents are that if none of the agents have heard from, say,
correct and want to retreat (resp., attack), then the agent 1, they have no way of knowing whether agent
final decision will be to retreat (resp., attack), since 1 is faulty or just slow. Interestingly, there are
that is the only preference that agents hear about randomized algorithms (that is, behavioural
(recall that for now we are considering only crash strategies) that achieve agreement with arbitrarily
failures). It remains to show that if some agents high probability in an asynchronous setting (Ben-
prefer to attack and others to retreat, then all the Or ; Rabin ).
nonfaulty agents reach the same final decision. So Byzantine agreement can be viewed as a game
suppose that i and j are nonfaulty and i decides to where, at each step, an agent can either send a
attack. That means that i heard that some agent’s message or decide to attack or retreat. It is essen-
initial preference was to attack. If it heard this first tially a game between two teams, the nonfaulty
at some round t' <t + 1, then i will forward this agents and the faulty agents, whose composition is
message to j, who will receive it and thus also unknown (at least by the correct agents). To model it
attack. On the other hand, suppose that i heard it as a game in the more traditional sense, we could
first at round t + 1 in a message from . Thus, this imagine that the nonfaulty agents are playing against
message must be of the form % said at round t a new player, the ‘adversary’. One of the
that... that 12 said at round 2 that /'/ said at round 1 adversary’s moves is that of ‘corrupting’ an agent:
that its initial preference was to attack. ’ Moreover, changing its type from ‘nonfaulty’ to ‘faulty.’ Once
the agents i\v ..., /, + 1 must all be distinct. Indeed, it an agent is corrupted, what the adversary can do
is easy to see that ik must crash in round k before depends on the failure type being considered. In the
sending its message to i (but after sending its case of crash failures, the adversary can decide
message to /'/, ,), for which of a corrupted agent’s messages will be
k 1............/, for otherwise i must have gotten the delivered in the round in which the agent is
message from ik, contradicting the assumption that i corrupted; however, it cannot modify the messages
first heard at round t + 1 that some agent’s initial themselves. In the case of Byzantine failures, the
preference was to attack. Since at most t agents can adversary essentially gets to make the moves for
crash, it follows that /, ,, the agent that sent the agents that have been
message to i, is not faulty, and thus
Computer Science and Game Theory 2007
corrapted; in particular, it can send arbitrary tolerance and asynchrony to standard problems in
messages. game theory; see, for example, Eliaz ( ),
Why has the distributed systems literature not Monderer and Tennenholtz ( , ) and the
considered strategic behaviour in this game? Crash definitions in the next section. This seems to be an
failures are used to model hardware and software area that is ripe for further developments. One such
failures; Byzantine failures are used to model development is the subject of the next section.
random behaviour on the part of a system (for
example, messages getting garbled in transit),
software errors, and malicious adversaries (for
example, hackers). With crash failures, it does not Implementing Mediators
make sense to view the adversary’s behaviour as
strategic, since the adversary is not really viewed as The question of whether a problem in a multiagent
having strategic interests. While it would certainly system that can be solved with a trusted mediator
make sense, at least in principle, to consider the can be solved by just the agents in the system,
probability of failure (that is, the probability that without the mediator, has attracted a great deal of
the adversary corrupts an agent), this approach has attention in both computer science (particularly in
by and large been avoided in the literature because the cryptography community) and game theory. In
it has proved difficult to characterize the probability cryptography, the focus on the problem has been on
distribution of failures over time. Computer secure multiparty computation. Here it is assumed
components can perhaps be characterized as failing that each agent i has some private information xt.
according to an exponential distribution (see Fix functions//, ..., /n. The goal is to have agent i
Babaoglu , for an analysis of Byzantine agreement learn f(xu ..., x,j without learning anything about Xj
in such a setting), but crash failures can be caused for j / i beyond what is revealed by the value off
by things other than component failures (faulty (xj ..., x„). With a trusted mediator, this is trivial:
software, for ex); these can be extremely difficult to each agent i just gives the mediator its private value
characterize probabilistically. The problems are x, the mediator then sends each agent i the value
even worse when it comes to modelling random f(xi, ..., x„). Work on multiparty computation
Byzantine behaviour. (Goldreich et al. ; Shamir et al. ; Yao ) provides
With malicious Byzantine behaviour, it may conditions under which this can be done. In game
well be reasonable to impute strategic behaviour to theory, the focus has been on whether an
agents (or to an adversary controlling them). equilibrium in a game with a mediator can be
However, it is often difficult to characterize the implemented using what is called cheap talk - that
payoffs of a malicious agent. The goals of the is, just by players communicating among
agents may vary from that of simply trying to delay themselves (cf. Barany ; Ben-Porath ; Forges ;
a decision to that of causing disagreement. It is not Gerardi ; Heller ; Urbano and Vila ). As suggested
clear what the appropriate payoffs should be for in the previous section, the focus in the computer
attaining these goals. Thus, the distributed systems science literature has been in doing multiparty
literature has chosen to focus instead on algorithms computation in the presence of possibly malicious
that are guaranteed to satisfy the specification adversaries, who do everything they can to subvert
without making assumptions about the adversary’s the computation, while in the game theory literature
payoffs (or nature’s probabilities, in the case of the focus has been on strategic agents. In recent
crash failures). work, Abraham et al. ( ) and Abraham
Recently, there has been some work on adding et al. ( ) considered deviations by both ratio
strategic concerns to standard problems in distrib- nal players, who have preferences and try to max-
uted computing; see, for example, Alvisi et al. ( ) imize them, and players who can viewed as
and Halpem and Teague ( ). malicious, although it is perhaps better to think
Moving in the other direction, there has also been
some work on adding concerns of fault
2008 Computer Science and Game Theory
of them as rational players whose utilities are not like Kazaa or Gnutella, it would seem that no
known by the other players or mechanism designer. rational agent should share files. Whether or not
I briefly sketch their results here; the following you can get a file depends only on whether other
discussion is taken from Abraham et al. ( ). people share files. Moreover, there are disincentives
The idea of tolerating deviations by coalitions of for sharing (the possibility of lawsuits, use of
players goes back to Aumann ( ); more bandwidth, and so on). Nevertheless, people do
recent refinements have been considered by Moreno share files. However, studies of the Gnutella net-
and Wooders ( ). Aumann’s defini work have shown almost 70 per cent of users share
tion is essentially the following. no files and nearly 50 per cent of responses are
from the top one per cent of sharing hosts (Adar and
Definition 1 is a k-resilient ’ equilibrium if, for Huberman ).
all sets C of players with | C | < k, it is not the One reason that people might not respond as we
case that there exists a strategy x=* ■ such that expect is that they have utilities that are different
ufx^ c, <7^—c) > for all i G C . from those we expect. Alternatively, the players
As usual, the strategy (x c, a - c) is the one may be irrational, or (if moves are made using a
where each player i e C plays x, and each player i computer) they may be playing using a faulty
C plays c r A s the prime notation suggests, this is computer and thus not able to make the move they
not quite the definition we want to work with. The would like, or they may not understand how to get
trouble with this definition is that it suggests that the computer to make the move they would like.
coalition members cannot communicate with each Whatever the reason, it seems important to design
other during the game. Perhaps surprisingly, strategies that tolerate such unanticipated
allowing communication can prevent certain equi- behaviours, so that the payoffs of the users with
libria (see Abraham et al. , for an ex). Since we ‘standard’ utilities do not get affected by the
should expect coalition members to communicate, nonstandard players using different strategies. This
the following definition seems to capture a more can be viewed as a way of adding fault tolerance to
reasonable notion of resilient equilibrium. Let the equilibrium notions.
cheap-talk extension of a game T be, roughly
speaking, the game where players are allowed to Definition 3 A joint strategy cr > is t-immune if,
communicate among themselves in addition to for all T C N with |T| < t, all joint strategies T=>, and
performing the actions of F and the payoffs are just all i £ T, we have u,{a x - i ) > w,(cr=').
as in F. The notion of /-immunity and F-resilienee
address different concerns. For t immunity, we
Definition 2 is a k-resilient equilibrium in a game consider the payoffs of the players not in T, and
I if a - is a k-resilient' equilibrium in the cheap-talk require that they are not worse due to deviation; for
extension of F (where we identify the strategy at in resilience, we consider the payoffs of players in C,
the game F with the strategy in the cheap-talk game and require that they are not better due to deviation.
where player i never sends any messages beyond It is natural to combine both notions. Given a game
those sent according to cr,). F, let I ’.be the game that is identical to r except that
A standard assumption in game theory is that the players in T are fixed to playing strategy x.
utilities are (commonly) known; when we are given
a game we are also given each player’s utility. Definition 4 o ' is a (k, t) -robust equilibrium if is
When players make decisions, they can take other t-immune and, for all T C N such that | T\ < t and
players’ utilities into account. However, in large all joint strategies x , a - _T is a ^-resilient strategy
systems it seems almost invariably the case that of I ’) .
there will be some fraction of users who do not To state the results of Abraham et al. ( )
respond to incentives the way we expect. For and ( ) on implementing mediators, three
example, in a peer-to-peer network games need to be considered: an underlying
Computer Science and Game Theory 2009
game I , an extension T^of T with a mediator, and a • Ifn < 2k + 21, then mediators cannot, in general,
cheap-talk extension Tct of 1 ’. Assume that 1 is a be e-implemented, even with broadcast
normal-form Bayesian game: each player has a type channels. Moreover, even assuming cryptogra-
from some type space with a known distribution phy and polynomially bounded players, the
over types, and the utilities of the agents depend on expected running time of an implementation
the types and actions taken. Roughly speaking, a depends on the utility functions of the players
cheap talk game implements a game with a mediator and e.
if it induces the same distribution over actions in • Ifn S' k ' t, then, assuming cryptography and
the underlying game, for each type vector of the polynomially bounded players, mediators can be
players. With this background, I can summarize the e-implemented using cheap talk, but if n# 2k +
results of Abraham et al. ( ) 2t, then the running time depends on the utilities
and ( ). in the game and s.
• If n <k+3t, then even assuming cryptography,
• If n > 3 k + 31, a (k, t) -robust strategy a=’ with polynomially bounded players, and a (k + f)
a mediator can be implemented using cheap talk -punishment strategy, mediators cannot, in
(that is, there is a (k, /(-robust strategy a=’' in a general, be e -implemented using cheap talk.
cheap talk game such that <r > and a=’' induce • Ifn > k + t, then, assuming cryptography,
the same distribution over actions in the under- polynomially bounded players, and a public- key
lying game). Moreover, the implementation infrastructure (PKI), we can e-implement a
requires no knowledge of other agents’ utilities, mediator.
and the cheap talk protocol has bounded running
time that does not depend on the utilities. The proof of these results makes heavy use of
• If/; < 3/c — 3/, then, in general, mediators techniques from computer science. All the possibil-
cannot be implemented using cheap talk without ity results showing that mediators can be
knowledge of other agents’ utilities. Moreover, implemented use techniques from secure multiparty
even if other agents’ utilities are known, medi- computation. The results showing that if n < 3k +
ators cannot, in general, be implemented with- 3t, then we cannot implement a mediator without
out having a (k +1)-punishment strategy (that is, knowing utilities, and that, even if utilities are
a strategy that, if used by all but at most (k + I ) known, a punishment strategy is required, use file
players, guarantees that every player gets a fact that Byzantine agreement cannot be reached if t
worse outcome than they do with the equilib- < n/3; the impossibility result for n < 2k - 3t also
rium strategy) nor with bounded running time. uses a variant of Byzantine agreement.
• If n > 2k + 3t, then mediators can be A related line of work considers implementing
implemented using cheap talk if there is a mediators assuming stronger primitives (which
punishment strategy (and utilities are known) in cannot be implemented in computer networks); see
finite expected running time that does not Izmalkov et al. ( ) and Lepinski et al.
depend on the utilities. ( ) for details.
• lfn#2k+3t then med i ators cannot, in general, be
implemented, even if there is a punishment
strategy and utilities are known. Other Topics
• If n > 2k + 2t and there are broadcast channels
There are many more areas of interaction between
then, for all e, mediators can be e -implemented
computer science than I have indicated in this brief
(intuitively, there is an implementation where
players get utility within e of what they could survey. I briefly mention a few others here.
get by deviating) using cheap talk, with Interactive Epistemology
bounded expected running time that does not Since the publication of Aumann’s ( ) seminal
depend on the utilities. paper, there has been a great deal of activity in
2010 Computer Science and Game Theory
trying to understand the role of knowledge in order to understand subtleties regarding basic issues
games, and providing epistemic analyses of solution such as rationality. To the extent that game theory is
concepts; see Battigalli and Bonanno ( ) used to tackle larger, more practical problems, it
for a survey. In computer science, there has been a will become important to find efficient techniques
parallel literature applying epistemic logic to reason for describing and analysing games. By way of
about distributed computation. One focus of this analogy, 2n — 1 numbers are needed to describe a
work has been on characterizing the level of probability distribution on a space characterized by
knowledge needed to solve certain problems. For n binary random variables. For n 100 (not an
example, to achieve Byzantine agreement common unreasonable number in practical situations), it is
knowledge among the nonfaulty agents of an initial impossible to write down the probability
value is necessary and sufficient. More generally, in distribution in the obvious way, let alone do
a precise sense, common knowledge is necessary computations with it. The same issues will surely
and sufficient for coordination. Another focus has arise in large games. Computer scientists use
been on defining logics that capture the reasoning of graphical approaches, such as Bayesian networks
resource-bounded agents. A number of approaches and Markov networks (Pearl ), for representing and
have been considered. Perhaps the most common manipulating probability measures on large spaces.
considers logics for reasoning about awareness, Similar techniques seem applicable to games; see,
where an agent may not be aware of certain for example, Kearns et al. ( ), Roller and Milch
concepts, and can know something only if he is ( ), and La Mura ( ) for specific
aware of it. This topic has been explored in both approaches, and Kearns ( ) for a recent over
computer science and game theory; see Dekel et al. view. Note that representation is also an issue when
( ), Fagin and Halpem we consider the complexity of problems such as
( ), Halpem ( ), Halpem and Rego computing Nash or correlated equilibria. The
( ), Heifetz et al. ( ), and Modica and complexity of a problem is a function of the size of
Rustichini ( , ) for some of the work in the input, and the size of the input (which in this
this active area. Another approach, so far considered case is a description of the game) depends on how
only by computer scientists, involves algorithmic the input is represented.
knowledge, which takes seriously the assumption
that agents must explicitly compute what they Learning in Games
know. See Fagin et al. ( ) for an There has been a great deal of work in both com-
overview of the work in epistemic logic in computer puter science and game theory on learning to play
science. well in different settings (see Fudenberg and Levine
, for an overview of the work in game theory). One
Network Growth line of research in computer science has involved
If we view networks as being built by selfish learning to play optimally in a rcinforccmcnt-
players (who decide whether or not to build links), lcaming setting, where an agent interacts with an
what will the resulting network look like? How does unknown (but fixed) environment. The agent then
the growth of the network affect its functionality? faces a fundamental tradeoff between exploration
For example, how easily will influence spread and exploitation. The question is how long it takes
through the network? How easy is it to route traffic? to learn to play well (to get a reward within some
See Fabrikant et al. ( ) and fixed e of optimal); see Brafman and Tennenholtz (
Kempe et al. ( ) for some recent computer ) and Kearns
science work in this burgeoning area. and Singh ( ) for the current state of the art.
A related question is efficiently finding a strategy
Efficient Representation of Games minimizes regret - that is, finding a strategy that is
Game theory has typically focused on ‘small’ guaranteed to do not much worse than the best
games, often two- or three-player games, that are strategy would have done in hindsight (that is,
easy to describe, such as Prisoner’s Dilemma, in
Computer Science and Game Theory 2011
even knowing what the opponent would have done). Aumann, R. 1959. Acceptable points in general cooperative n-
See Blum and Mansour ( ) for a recent person games. In Contributions to the theory of games,
ed. A. Tucker and R. Luce, vol. TV, 287-324. Princeton:
overview of work on this problem. Princeton University Press.
Aumann, R.J. 1976. Agreeing to disagree. Annals of Sta-
tistics 4: 1236-1239.
See Also Aumann, R.J. 1987. Correlated equilibrium as an expression
of Bayesian rationality. Econometrica 55: 1-18.
► Computation oi Axelrod, R. 1984. The evolution of cooperation. New York:
Basic Books.
► Computational Methods it Econometrics Babaoglu, O. 1987. On the reliability of consensus-based fault-
► Computing in Mechanism E 1 sign tolerant distributed computing systems. ACM
► )ata Mining Translation on Computer Systems 5: 394-416.
► ilectronic Commerce Barany, I. 1992. Fair distribution protocols or how the players
replace fortune. Mathematics of Operations Research
► ipistemic Game Theory: An Ov TO V
17: 327-340.
► istemic Game Theory: E ^ ? = Battigalli, P., and G. Bonanno. 1999. Recent results on belief,
► Mathematics of Networks knowledge and the epistemic foundations of game theory.
► Mechanism Design (New Developments) Research in Economics 53: 149-225.
Ben-Qr, M. 1983. Another advantage of free choice:
► Rationality, Bounded
Completely asynchronous agreement protocols. I n Pro-
► /oting P aradoxes ceedings of the second ACM symposium on principles
of distributed computing, 27-30.
Acknowledgment The work for this article was supported in Ben-Porath, E. 2003. Cheap talk in games with incomplete
part by NSF under grants CTC-0208535 and ITR-0325453, information. Journal of Economic Theory 108: 45-71.
by ONR under grant N00014-02-1- 0455, by the DoD Blum, A., and Y. Mansour. 2007. Learning, regret minimi-
Multidisciplinary University Research Initiative (MUR1) zation, and equilibria. In Algorithmic game
program administered by the ONR under grants N00014-01- theory, ed. N. Nisan, T. Roughgarden, E. Tardos, and V.
1-0795 andN00014-04-1-0725, and by AFOSR under grant Vazirani. Cambridge: Cambridge University Press.
F49620-02-1-0101. Thanks to Larry Blume, Christos Blum, B., C.R. Shelton, and D. Roller. 2003. A continuation
Papadimitriou, Ilya Segal, Eva Tardos, and Moshe method for Nash equilibria in structured games. In
Tennenholtz for useful comments. Proceedings of the 18th international joint conference
on artificial intelligence (1JCA1 ‘03), 757-764.
Bibliography Brafman, R.I., and M. Tennenholtz. 2002. R-MAX: A general
polynomial time algorithm for near-optimal
reinforcement learning. Journal of Machine Learning
Abraham, I., D. Dolev, R. Gonen, and J. Halpem. 2006.
Research 3: 213-231.
Distributed computing meets game theory: Robust
Chen, X., and X. Deng. 2006. Settling the complexity of 2-
mechanisms for rational secret sharing and multiparty
player Nash equilibrium. In Proceedings of the 47th
computation. In Proceedings of the 25 th ACM symposium
IEEE symposium on foundations of computer science,
on principles of distributed computing, 53-62. Abraham, I.,
261-272.
Dolev, D., and Halpem, J. 2007. Lower bounds on
Chor, B., and C. Dwork. 1989. Randomization in Byzantine
implementing robust and resilient mediators. Available at
agreement. In Advances in computing research 5:
V/:; Ev crg/abs/CTE 35'5. Accessed 19 June 2007.
Randomness and computation, ed. S. Micali, 443-497.
Adar, E., and B. Huberman. 2000. Free riding on Gnutella.
Greenwich: JAI Press.
First Monday 5(10).
Chu, F., and J.Y. Halpem. 2001. A decision-theoretic
Alvisi, L., A.S. Ayer, A. Clement, M. Dahlin, J.P. Martin, and
approach to reliable message delivery. Distributed
C. Porth. 2005. BAR fault tolerance for cooperative services.
Computing 14: 359-389.
In Proceedings of the 20th ACM symposium on operating
Clarke, E.H. 1971. Multipart pricing of public goods. Public
systems principles (SOSP 2005), 45-58. Archer, A., and E.
Choice 11: 17-33.
Tardos. 2001. Truthful mechanisms for one-parameter
Conitzer, V., and T. Sandhokn. 2003. Complexity results about
agents. In Proceedings of the 42nd IEEE symposium on
Nash equilibria. In Proceedings of the 18 th international
foundations of computer science, 482-491.
joint conference on artificial intelligence {IJCAI ‘03),
Archer, A., and E. Tardos. 2002. Frugal path mechanisms. In
765-771.
Proceedings of the 13th ACM-SIAM symposium on
Conitzer, V., T. Sandholm, and J. Lang. 2007. When are
discrete algorithms, 991-999.
elections with few candidates hard to manipulate?
Journal of the ACM 54(3), Article 14.
2012 Computer Science and Game Theory
Cramton, P., Y. Shoham, and R. Steinberg. 2006. Combi- Halpem, J.Y. 2003. A computer scientist looks at game theory.
natorial auctions. Cambridge, MA: MIT Press. Games and Economic Behavior 45: 114-132.
Daskalis, C., P.W. Goldberg, and C.H. Papadimitriou 2006. Halpem, J.Y., and L.C. Rego. 2007. Reasoning about
The complexity of computing a Nash equilibrium. In knowledge of unawareness. Games and Economic
Proceedings of the 38th ACM symposium on theory of Behavior. Also available at
computing, 71-78. . Accessed 24 June 2007.
Dekel, E., B. Lipman, and A. Rustichini. 1998. Standard state- Halpem, J.Y., and V. Teague. 2004. Rational secret sharing
space models preclude unawareness. Econometrica 66: and multiparty computation: Extended abstract. In Pro-
159-173. ceedings of the 36tli ACM symposium on theory of
Eliaz, K. 2002. Fault-tolerant implementation. Review of computing, 623-632.
Economic Studies 69: 589-610. Heifetz, A., M. Meier, and B. Schipper. 2006. Interactive
Fabrikant, A., A. Luthra, E. Maneva, C.H. Papadimitriou, unawareness. Journal of Economic Theory 130: 78-94.
and S. Shenker. 2003. On a network creation game. In Heller, Y. 2005. A minority-proof cheap-talk protocol.
Proceedings of the 22nd ACM symposium on Unpublished manuscript.
principles of distributed computing, 347-351. Hopcrofl, J.E., and J.D. Ullman. 1979. Introduction to
Fagin, R., and J.Y. Halpem. 1988. Belief, awareness, and automata theory, languages and computation. New
limited reasoning. Artificial Intelligence 34: 39-76. York: Addison-Wesley.
Fagin, R., J.Y. Halpem, Y. Moses, and M.Y. Vardi. 1995. Hurwicz, L. 1977. On the dimensional requirements of
Reasoning about knowledge. Cambridge, MA: MIT informationally decentralized Pareto satisfactory pro-
Press A slightly revised paperback version was published cesses. In Studies in Resource Allocation Processes, ed.
in 2003. K.J. Arrow and L. Hurwicz, 413-424. New York:
Feigenbaum, J., C. Papadimitriou, and S. Shenker. 2000. Cambridge University Press.
Sharing the cost of muliticast transmissions (preliminary Izmalkov, S., S. Micali, and M. Lepinski. 2005. Rational
version). In Proceedings of the 32nd ACM symposium secure computation and ideal mechanism design. In
on theory of computing, 218-227. Proceedings of the 46th IEEE symposium foundations
Fischer, M.J. 1983. The consensus problem in unreliable of computer science, 585-595.
distributed systems. In Foundations of computation Kearns, M. 2007. Graphical games. In Algorithmic game
theory, vol. 185, ed. M. Karpinski. Lecture Notes in theory, ed. N. Nisan, T. Roughgarden, E. Tardos, and V.
Computer Science. Berlin/New York: Springer, Vazirani. Cambridge: Cambridge University Press.
pp. 127-140. Kearns, M.J., and M.K. Reiter. 2005. Proceedings of the
Fischer, M.J., N.A. Lynch, and M.S. Paterson. 1985. sixth ACM conference on electronic commerce (EC
Impossibility of distributed consensus with one faulty ‘05). New York: ACM. Table of contents available at
processor. Journal of the ACM 32: 374-382. . Accessed 19 June 2007.
Forges, F. 1990. Universal mechanisms. Econometrica 58: Kearns, M., and S.P. Singh. 1998. Near-optimal reinforcement
1341-1364. learning in polynomial time. In Proceedings of the 15th
Fudenberg, D., and D. Levine. 1998. The theory of learning international conference on machine learning. 260-268.
in games. Cambridge, MA: MIT Press. Kearns, M., M.L. Littman, and S.P. Singh. 2001. Graphical
Gerardi, D. 2004. Unmediated communication in games with models for game theory. In Proceedings of the 17th
complete and incomplete information. Journal of conference on uncertainty in artificial intelligence
Economic Theory 114: 104-131. (UA12001), 253-260.
Gibbard, A. 1973. Manipulation of voting schemes. Kempe, D., J. Kleinberg, andE. Tardos. 2003. Maximizing the
Econometrica 41: 587-602. spread of influence through a social network. In
Gilboa, I., and E. Zemel. 1989. Nash and correlated equi- Proceedings of the ninth ACM S1GKDD international
librium: Some complexity considerations. Games and conference on knowledge discoveiy and data mining, 137-
Economic Behavior 1: 80-93. 146.
Goldreich, O., S. Micali, and A. Wigderson. 1987. How to play Kftr-Dahav, N.E., D. Monderer, and M. Tennenholtz 2000.
any mental game. In Proceedings of the 19th ACM Mechanism design for resource-bounded agents. In Inter-
symposium on theory of computing, 218-229. national conference on multiagent systems, 309-316.
Govindan, S., and R. Wilson. 2003. A global Newton method Roller, D., and B. Milch. 2001. Structured models for
to compute Nash equilibria. Journal of Economic multiagent interactions. In Theoretical aspects of ratio-
Theory 110: 65-86. nality and knowledge: Proceedings of the eighth con-
Groves, T. 1973. Incentives in teams. Econometrica 41: 617- ference (PARK 2001), 233-248.
631. Koutsoupias, E., and C.H. Papadimitriou. 1999. Worst- case
Halldorsson, M.M., J.Y. Halpem, L. Li, and V. Mirrokni. equilibria, hr Proceedings of the 16th conference on
2004. On spectrum sharing games. In Proceedings of the theoretical aspects of computer science, vol. 1563,
23rd ACM symposium on principles of distributed Lecture Notes in Computer Science. Berlin: Springer, pp.
computing, 107-114. 404^413.
Halpem, J.Y. 2001. Alternative semantics for unawareness.
Games and Economic Behavior 37: 321-339.
Computer Science and Game Theory 2013
Kushilevitz, E., andN. Nisan. 1997. C o m m u n i c a t i o n Papadimitriou, C.H. 2001. Algorithms, games, and the
c o m p l e x i t y . Cambridge: Cambridge University Press. internet. In P r o c e e d i n g s o f t h e 3 3 r d A C M
La Mura, P. 2000. Game networks. In Proceedings of the 16th s y m p o s i u m o n t h e o r y o f c o m p u t i n g , 749-
conference on uncertainty in artificial intelligence 753.
(UAI2000), 335-342. Papadimitriou, C.H. 2005. Computing correlated equilibria in
Lehmann, D., R. Muller, and T. Sandholm. 2006. The winner multiplayer games. In P r o c e e d i n g s o f t h e 3 7 t h
determination problem. In Combinatorial ACM symposium on theory of computing,
a u c t i o n s , ed. P. Cramton, Y. Shoham, and R. 49-56.
Steinberg. Cambridge, MA: MIT Press. Papadimitriou, C.H. 2007. The complexity of finding Nash
Lemke, C.E., and J.J.T. Howson. 1964. Equilibrium points of equilibria. In A l g o r i t h m i c g a m e t h e o r y , ed. N.
bimatrix games. J o u r n a l o f t h e S o c i e t y f o r Nisan, T. Roughgarden, E. Tardos, and V. Vazirani. Cam-
I n d u s t r i a l a n d A p p l i e d M a t h e m a t i c s 12: bridge: Cambridge University Press.
413—423. Papadimitriou, C.H., and T. Roughgarden. 2005. Computing
Lepinski, M., S. Micali, C. Peikert, and A. Shelat. 2004. equilibria in multi-player games. In P r o c e e d i n g s o f
Completely fair SFE and coalition-safe cheap talk. In the 16th ACM-SIAM symposium on
Proceedings of the 23rd ACM symposium d i s c r e t e a l g o r i t h m s , 82-91.
p r i n c i p l e s o f d i s t r i b u t e d c o m p u t i n g , 1-10. Papadimitriou, C.H., and M. Yannakakis. 1994. On com-
Linial, N. 1994. Games computers play: Game-theoretic plexity as bounded rationality. In P r o c e e d i n g s o f
aspects of computing. In H a n d b o o k o f g a m e the 26th ACM symposium on theory of
t h e o r y , ed. R.J. Aumann and S. Hart, vol. 2. Amster- c o m p u t i n g , 726-733.
dam: North-Holland. Pearl, J. 1988. Probabilistic reasoning in intelligent systems.
Modica, S., and A. Rustichini. 1994. Awareness and San Francisco: Morgan Kaufmann.
partitional information structures. T h e o r y a n d Pease, M., R. Shostak, and L. Lamport. 1980. Reaching
D e c i s i o n 37: 107-124. agreement in the presence of faults. J o u r n a l o f t h e
Modica, S., and A. Rustichini. 1999. Unawareness and A C M 27: 228-234.
partitional information structures. G a m e s a n d E c o - Porter, R., E. Nudelman, and Y. Shoham. 2004. Simple search
n o m i c B e h a v i o r 27: 265-298. methods for finding a Nash equilibrium. In P r o -
Monderer, D., and M. Tennenholtz. 1999a. Distributed games. ceedings of the 21st national conference
G a m e s a n d E c o n o m i c B e h a v i o r 28: 55-72. o n a r t i f i c i a l i n t e l l i g e n c e ( A A A I ' 0 4 ) , 664-
Monderer, D., and M. Tennenholtz. 1999b. Distributed games: 669.
From mechanisms to protocols. In P r o c e e d i n g s o f Rabin, M.O. 1983. Randomized Byzantine generals. In
the 16th national conference on Proceedings of the 24th IEEE symposium on foundations of
a r t i f i c i a l i n t e l l i g e n c e (AAAI '99), 32-37. computer science, 403—409.
Moreno, D., and I. Wooders. 1996. Coalition-proof equi- Roughgarden, T., and E. Tardos. 2002. How bad is selfish
librium. Games a n d E c o n o m i c B e h a v i o r 17: 80- routing? J o u r n a l o f t h e A C M 49: 236-259.
112. Rubinstein, A. 1986. Finite automata play the repeated
Mount, K.., and S. Reiter. 1974. The informational size of prisoner’s dilemma. J o u r n a l of Economic
message spaces. Journal of Economic Theory 8: 161-192. T h e o r y 39: 83-96.
Nash, J. 1950. Equilibrium points in n-person games. Pro- Rubinstein, A. 1998. Modeling bounded
ceedings of the National Academy of Sciences 36: 48^ r a t i o n a l i t y . Cambridge, MA: MIT Press.
Neyman, A. 1985. Bounded complexity justifies cooperation in Sandholm, T., and Yokoo, M. 2003. T h e s e c o n d
finitely repeated Prisoner’s Dilemma. Economic Letters i n t e r n a tional joint conference on A u t o n o m o u s
19: 227-229. A g e n t s a n d Multiagent Systems (A4M4S 2003). Table
Nisan, N. 2006. Bidding languages for combinatorial auctions. of contents available at
In Combinatorial auctions. Cambridge, MA: MIT Press. . Accessed 25 June 2007.
Nisan, N., and A. Ronen. 2000. Computationally feasible VCG Satterthwaite, M. 1975. Strategy-proofness and Arrow’s
mechanisms. In Proceedings of the second ACM conditions: Existence and correspondence theorems for
conference on electronic commerce (EC '00), 242-252. voting procedures and social welfare functions. Journal of
Nisan, N., and A. Ronen. 2001. Algorithmic mechanism Economic Theory 10: 187-217.
design. Games and Economic Behavior 35: 166-196. Segal, I. 2006. Communication in economic mechanisms. In
Nisan, N., T. Roughgarden, E. Tardos, and V. Vazirani. 2007. Advances in economics and e c o n o m e t r i c s : T h e o r y
Algorithmic game theory. Cambridge: Cambridge and application, ninth world congress
University Press. ( E c o n o m e t r i c society monographs), ed. R. Blundell,
Papadimitriou, C.H. 1994a. Computational complexity. W.K. Newey, and T. Persson, 222-268. Cambridge:
Reading: Adison-Wesley. Cambridge University Press.
Papadimitriou, C.H. 1994b. On the complexity of the parity Shamir, A., R.L. Rivest, and L. Adelman. 1981. Mental poker.
argument and other inefficient proofs of existence. Jour- In The mathematical gardner, ed. D.A. Klamer, 37-43.
nal of Computer and System Sciences 48: 498-532. Boston: Prindle, Weber, and Schmidt.
Tennenholtz, M. 2002. Competitive safety analysis: Robust
decision-making in multi-agent systems. J o u r nal o f
2014 Computing in Mechanism Design
Urbano, A., and J.E. Vila. 2002. Computational complexity Expressive commerce; Fielded combinatorial
and communication: Coordination in two-player games. auctions; Fielded expressive auctions; Gibbard-
Econometrica 70: 1893-1927.
Urbano, A., and J.E. Vila. 2004. Computationally restricted
Satterthwaite th; Incentive compatibility;
unmediated talk under incomplete information. Maximin voting rule; Non-truth- promoting
Economic Theory 23: 283-320. mechanism; Mechanism design; Performance
Vetta, A. 2002. Nash equilibria in competitive societies, with profile tree; Plurality voting rale; Preference
applications to facility location, traffic routing and
auctions. In Proceedings of the 43rd IEEE symposium
determination; Preference elicitation; Pull-
on foundations of computer science, 416-425. pushmechanism; Revelation principle; Strategic
Vickrey, W. 1961. Counterspeculation, auctions and computing; Tree search; Vickrey auction;
competitive sealed tenders. Journal of Finance 16: 8-37. Vickrey-Clarke-Groves (VCG) mechanism
Yao, A. 1982. Protocols for secure computation (extended
abstract). In Proceedings of the 23rd IEEE symposium
on foundations of computer science, 160-164.
JEL Classifications
Cl
Keywords
Algorithmic mechanism design; Automated
mechanism design; Borda voting rule; Bounded Computing by the Centre
rationality; Combinatorial auctions; Complexity
theory; Computing by the centre; Computing in Computing by the centre plays significant roles in
mechanism design; Deliberation equilibrium; mechanism design. In the following three subsec-
Elicitor; ex post equilibrium; tions I will review three prominent directions.
Computing in Mechanism Design 2015
Truthful bidding can be made a dominant strat- the supply chain several levels deep as a side effect.
egy by applying the Vickrey-Clarke-Groves (VCG) On the high end, such an auction can have tens of
mechanism to a CA. Such incentive com- patibihty thousands of items (multiple units of each), millions
removes strategic complexity of the bidders. The of bids, and hundreds of thousands of side
mechanism works as follows. The optimal constraints. Expressive mechanisms have also been
allocation is used, but the bidders do not pay their designed for settings beyond auctions, such as
winning bids. Instead each bidder pays the amount combinatorial exchanges, charity donations and
of value he takes away from the others by taking settings with externalities.
some of the items. This value is measured as the Basically all of the fielded expressive auctions
difference between the others’ winning bids’ prices use the simple pay-your-winning-bids pricing mle.
and what the others’ winning bids’ prices would There are numerous important reasons why few, if
have been had the agent not submitted any bids. any, use the VCG mechanism. It can lead to low
This mechanism can be executed by determining revenue. It is vulnerable to collusion. Bidders would
the winners once overall, and once for each agent not tell the truth because they do not want to reveal
removed in turn. (This may be accomplishable with their cost structures, which the auctioneer could
less computing. For example, in certain network exploit the next time the auction is conducted, and
auctions it can be done in the same asymptotic so on (Sandholm ; Rothkopf )•
complexity as one winner determination - Basically all of the fielded expressive auctions
Hershberger and Suri •) use tree search for winner determination. In prac-
Very few canonical CAs have found their way to tice, modem tree search algorithms for the problem
practice. However, auctions with richer bid scale to the large and winners can be determined
expressiveness forms (that are more natural in the optimally. If winner determination were not done
given application and more concise) and that optimally in a CA, the VCG mechanism can lose its
support expressiveness also by the bid taker have truth-dominance property (Sandholm ). In fact, any
made a major breakthrough into practice (Sandholm truthful sub-
; Bichler et al. ). This is sometimes called optimal VCG-based mechanism for CAs is unrea-
expressive commerce to distinguish it from vanilla sonable in the sense that it sometimes does not
CAs. The widest area of application is currently allocate an item to a bidder even if he is the only
industrial soincing. Tens of billions of dollars worth bidder whose bids assign non-zero value to that
of materials, transportation and services are being item (Nisan and Ronen ).
soinced annually using such mechanisms, yielding
billions of dollars in efficiency improvements. The Algorithmic Mechanism Design
bidders’ expressiveness forms include different Motivated by the worry that some instances of NP-
forms of flexible package bids, conditional hard problems may not be solvable within
discounts, discount schedules, side constraints (such reasonable time, a common research direction in
as capacity constraints), and often hundreds of cost theory of computing is approximation algorithms.
drivers (for example, fixed costs, variable costs, They trade off solution quality for a guarantee that
transshipment costs and costs associated with even in the worst case, the algorithm runs in
changes). The item specifications can also be left polynomial time in the size of the input.
partially open, and the bidders can specify some of Analogously, Nisan and Ronen ( ) pro
the item attributes (delivery date, insurance terms, posed algorithmic mechanism design: designing
and so on), in alternate ways. The bid taker also approximately optimal mechanisms that take the
specifies preferences and constraints. Winner centre a polynomial number of computing steps
determination then not only decides who wins what, even in the worst case. However, this is more
but also automatically configures the items. In some difficult than designing approximately optimal
of these events it also configures algorithms because the mechanism has to motivate
the agents to tell the truth.
Computing in Mechanism Design 2017
Lehmann et al. ( ) studied this for CAscustom design of mechanisms for every setting. The
with single-minded bidders (each bidder being onlysetting can be described by the agents’ (discretized)
interested in one specific package of items). Theytype spaces, the designer’s prior over types, the
present a fast greedy algorithm that guarantees a desired notion of incentive compatibility (for
example, dominant strategies vs. Bayes-Nash
solution within a factor %/#iterns of optimal. They
implementation), the desired notion of participation
show that the algorithm is not incentive compatible
with VCG pricing, but is with their custom pricingconstraints (for example, ex interim, ex post or
none), whether payments are allowed, and whether
scheme. They also identify sufficient conditions for
any (approximate) mechanism to be incentive the mechanism is allowed to use randomization. )
This can yield better mechanisms for previously
compatible (see also Kfir- Dahav et al. ). There has
been substantial follow-on work on subclasses of studied settings because the mechanism is designed
single- minded CAs. for the specific setting rather than a class of settings.
Lavi and Swamy ( It can also be used for settings not previously
) developed a technique
for a range of packing problems with which any ^- studied in mechanism design.
approximation algorithm (that is, algorithm that For almost all natural (linear) objectives, all
variants of the design problem are J\fV-complete if
guarantees that the solution is within a factor k of
the mechanism is not allowed to use randomization,
optimal) that also bounds the integrality gap of the
linear programming (LP) relaxation of the problem but randomized mechanisms can be constructed for
by k can be used to construct a ^approximation all these settings in polynomial time using linear
mechanism. The LP solution, scaled down by k, can programming. Custom algorithms have been
developed for some problems in each of these two
be represented as a convex combination of integer
categories. (Even the latter category warrants
solutions, and viewing this convex combination as
research. While the linear programme is polynomial
specifying a probability distribution over integer
in the size of the input, the input itself can be
solutions begets a VCG-based randomized
exponential in the number of agents.) Structured
mechanism that is truthful in expectation. For CAs
representations of the problem can also make the
with general valuations, this yields an 0( \/#ilems)
design process drastically faster.
-approximate mechanism.
Beyond the general setting, automated mech-
In a different direction, several mechanisms
anism design has been applied to specific settings,
have been proposed where the agents can help the
such as creating revenue-maximizing CAs (without
centre find better outcomes. This is done either by
the need to discretize types)- Likhodedov and
giving the agents the information to do the centre’s
Sandholm (a recognized problem that eludes
computing (Banks et al. ; Land et al. ; Parkes andanalytical characterization; even the two-item case
Shneidman ), or by allowing the agents to change is open), reputation systems (Jurca and Fairings ),
what they told the mechanism based on the safe exchange mechanisms (Sandholm and
mechanism’s output and potentially also based on Ferrandon ), and supply chain settings
what other agents told the mechanism (Nisan and (Vorobeychik et al. ). Automated mechanism design
Ronen ). In VCG-based mechanisms, an agent software has recently also been adopted by several
benefits from lying only if the lie causes the mechanism design theoreticians to speed up their
mechanism to find an outcome that is better overall.
research.
It turns out that even multistage mechanisms can
Automated Mechanism Design
be designed automatically (Sandholm et al. ).
Conitzer and Sandholm ( ) proposed the idea
Furthermore, automated mechanism design has
of automated mechanism design: having a com-
been applied to the design of online mechanisms
puter, rather than a human, design the mechanism.
(Hajiaghayi et al. ), that is,
Because human effort is eliminated, this enables
2018 Computing in Mechanism Design
mechanisms that execute while the world changes protocols that are easy to manipulate, so that they
-for example, agents enter and exit the system. become hard to manipulate (Conitzer and Sandholm
). Specifically, before the original protocol is
executed, one pairwise elimination round is
Computing by the Agents executed among the candidates, and only the
winning candidates survive to the original protocol.
I will now move to discussing computing by the
This makes the protocols ./UP-hard, # Phard (#P-
agents.
hard problems are at least as hard as counting the
Mechanisms That Are Hard to Manipulate number of solutions to a problem in P), or even
This section demonstrates that one can use the fact PSPACE-hard (PSPACE-hard problems are at least
that agents are computationally limited to achieve as hard as any problem that can be solved using a
things that are not achievable via any mechanism polynomial amount of memory ) to manipulate
among perfectly rational agents. constructively, depending on whether the schedule
A seminal negative result, the GibbardSat- of the pre-round is determined before the votes are
terthwaite th, states that if there are three or more collected, randomly after the votes are collected, or
candidates, then in any non-dictatorial voting the scheduling and the vote collecting are carefully
scheme there are candidate rankings of the other interleaved, respectively.
voters, and preferences of the agent, under which All of the hardness results of the previous
the agent is better off voting manipulatively than paragraph rely on both the number of voters and the
truthfully. One avenue around this impossibility is number of candidates growing. The number of
to construct desirable general non-dictatorial voting candidates can be large in some domains, for
protocols under which finding a beneficial example when voting over task or resource
manipulation is prohibitively hard computationally. allocations.
There are two natural alternative goals of However, in other elections - such as presi-
manipulation. In constructive manipulation, the dential elections - the number of candidates is
manipulator tries to find an order of candidates that small. If the number of candidates is a constant,
he can reveal so that his favourite candidate wins. both constructive and destructive manipulation are
In destructive manipulation, the manipulator tries to easy (in P), regardless of the number of voters
find an order of candidates that he can reveal so that (Conitzer et al. ). This holds even if the voters are
his hated candidate does not win. These are special weighted, or if a coalition of voters tries to
cases of the utility-theoretic notion of improving manipulate. On the bright side, when a coalition of
one’s utility, so the hardness results, discussed weighted voters tries to manipulate, complexity can
below, carry over to the usual utility-theoretic arise even for a constant number of candidates: see
setting. Tables and . Another lesson from that table is that
Unfortunately, finding a constructive manipu- randomizing over instantiations of the mechanism
lation is easy (in V) for the plurality, Borda and (such as schedules of a cup) can be used to make
maximin voting rules (Bartholdi et al. ), which are manipulation hard.
commonly used. On the bright side, constructive As usual in computer science, all the results
manipulation of the single transferable vote (STV) mentioned above are worst-case hardness. Unfor-
protocol is AfP-hard (Bartholdi and Orlin ) (as is tunately, under weak assumptions on the preference
manipulation of the second order Copeland distribution and voting rule, most instances of any
protocol (Bartholdi et al. ), but that hardness is voting rule are easy to manipulate (Conitzer and
driven solely by the tie-breaking rale). Even better, Sandholm ).
there is a systematic methodology for slightly All of the hardness results discussed above hold
tweaking voting even if the manipulators know the non-
manipulators’ votes exactly. Under weak
Number of candidates:
2
Borda V
Computing in Mechanism Design
Veto V
2019
STV V
Computing in
Mechanism
Plurality with runoff Design,
V
Table 1 Complexity of •'V P -complete
Copeland V
Maximin constructive weighted
V Ap-complete Ap-
coalitional manipulation Arp_,complete complete
Randomized cup V A"P-complete
Cup V A"P-complete
Plurality V
■AP-
complete
A'P-
coniplctc
-A^P-
Computing
in Mechanism Design,
Table 2 Complexity of
destructive weighted
coalitional manipulation
into account that valuations can be determined to report their performance profile trees to the centre,
different degrees and that an agent may want to which then coordinates the deliberations incre-
invest effort to determine others’ valuations as well mentally as agents report deliberation results. Cur-
- even in privatevalue settings.) rent research also includes designing mechanisms
Table shows in which settings strategic where strategic computing occurs but its waste-
computing can and cannot occur in deliberation fulness is limited.
equilibrium. This depends on the auction mech-
anism. Interestingly, it also depends on whether the Preference Elicitation by the Centre To reduce the
agent has limited computing (for example, owning agents’ preference determination effort, Conen and
a desktop computer that the agent can use until the Sandholm ( ) proposed a
auction’s deadline) or costly computing (for framework where the centre (also known as elic-
example, being able to buy any amount of itor) explicitly elicits preference information from
supercomputer time where each cycle comes at a the agents incrementally on an as-needed basis by
cost). posing queries to the agents. The centre thereby
The notion of deliberation equilibrium can also builds a model of the agents’preferences, and
be used as the basis for designing new mechanisms, decides what to ask, and from which agent, based
which hopefully work well among agents whose on this model. Usually the process can be
computing is costly or limited. Unfortunately, there terminated with the provably correct outcome while
is an impossibility (Larson and Sandholm ): there requiring only a small portion of the agents’
exists no mechanism that is sensitive (the outcome preferences to be determined. Multistage mecha-
is affected by each agent’s strategy), preference nisms can yield up to exponential savings in pref-
formation independent (does not do the erence determination and communication effort the
computations for the agents; the agents report agents need to go through compared to single-stage
valuations), non-misleading (no agent acts in a way mechanisms (Conitzer and Sandholm )•
that causes others to believe his true type has zero The explicit preference elicitation framework
probability), and deliberation-proof (no strategic was originally proposed for CAs (but the approach
computing occurs in equilibrium, that is, agents has since been used for other settings as well, such
compute only on their own problems). Current as voting). For general valuations, an exponential
work involves designing mechanisms that take part number of bits in the number of items for sale has
in preference formation in limited ways: for to be communicated in the worst case
example, agents
2022 Computing in Mechanism Design
and each agent is motivated to follow the protocol Acknowledgment This work was funded by the National
in terms of all three types of action. The only Science Foundation under TTR grant HS0427858, and a
Sloan Foundation Fellowship. I thank Felix Brandt, Christina
centralized party needed is a bank that can extract Fong, Joe Halpem, and David Parkes for helpful comments.
payments from the agents.
Cryptography is a powerful tool for achieving
privacy when trying to execute a mechanism in a
distributed way without a centre, using private
communication channels among the agents. Con- Bibliography
sider first the setting with passive adversaries, that Banks, J.S., J. Ledyard, and D. Porter. 1989. Allocating
is, agents that faithfully execute the specified dis- uncertain and unresponsive resources: An experimental
tributed communication protocol, but who try to approach. R A N D J o u r n a l o f E c o n o m i c s 20: 1-
infer (at least something about) some agents’ pri- 25.
Bartholdi, J. Ill, and J. Qrlin. 1991. Single transferable vote
vate information. resists strategic voting. S o c i a l C h o i c e a n d
W e l f a r e 8: 341-354.
• If agents are computationally limited - for Bartholdi, J. Ill, C. Tovey, and M. Trick. 1989. The com-
example, they are assumed to be unable to factor putational difficulty of manipulating an election. S o c i a l
C h o i c e a n d W e l f a r e 6: 227-241.
large numbers - then arbitrary functions can be
Bichler, M., A. Davenport, G. Hohner, and J. Kalagnanam.
computed while guaranteeing that each agent 2006. Industrial procurement auctions. In
maintains his privacy (except, of course, to the C o m b i n a t o r i a l a u c t i o n s , ed. Cramton, Shoham,
extent that the answer of the computation says and Steinberg. Cambridge, MA: MIT Press.
Blumrosen, L., and N. Nisan. 2005. On the computational
something about the inputs) (Goldreich et al. ).
power of iterative auctions. In P r o c e e d i n g s o f t h e
Thus the desire for privacy does not constrain ACM conference on electronic
what social choice functions can be c o m m e r c e , 29-43. Vancouver: ACM Press.
implemented. Brandt F, andSandholm T. 2004. (Im)possibility of uncon-
ditionally privacypreserving auctions. In
• In contrast, only very limited social choice
Proceedings
functions can be computed privately among of the international conference on autonomous agents and
computationally unlimited agents. For example, multi-agent systems. 8 1 0 - 1 7 .
when there are just two alternatives, every Brandt F., and T. Sandholm. 2005. Unconditional privacy in
social choice. In Proceedings of the conference on
monotonic, non-dictatorial social choice func-
theoretical aspects of rationality and knowledge. 207-18.
tion that can be privately computed is constant Conen W., and T. Sandholm. 2001. Preference elicitation in
(Brandt and Sandholm ). With special structure combinatorial auctions: Extended abstract. In Pro-
in the preferences, this impossibility can ceedings of the ACM conference on electronic commerce,
256-9. More detailed description of algorithmic aspects in
sometimes be avoided. For example, with the
Proceedings of the IJCAI01 workshop on economic
standard model of quasi-linear utility, first- price agents, models, and mechanisms. 71-80.
auctions can be implemented privately; second- Conitzer V., and T. Sandholm. 2002. Complexity of mech-
price (Vickrey) auctions with more than two anism design. In Proceedings of the conference on
bidders cannot (Brandt and Sandholm uncertainty in artificial intelligence. 103-10.
Conitzer V., and T. Sandholm. 2003. Universal voting
)•
protocol tweaks to make manipulation hard. In Pro-
A more general model is that of active ceedings of the international joint
conference on artific i a l i n t e l l i g e n c e . 781-8.
adversaries who can execute the distributed Conitzer V., and T. Sandholm. 2004. Computational criti-
communication protocol unfaithfully in a coor- cisms of the revelation principle. In C o n f e r e n c e o n
dinated way. A more game-theoretic model is that logic and the foundations of game and
of rational adversaries that are not passive, but not d e c i s i o n t h e o r y . Earlier versions: AMEC-03, EC-
04.
malicious either. For a brief overview of such work, Conitzer V., and T. Sandholm. 2006. Nonexistence of voting
see computer science and game theory. rules that are usually hard to manipulate. In
Proceedings of the national conference
on artificial intelligence.
2024 Computing in Mechanism Design
Conitzer, V., T. Sandholm, and J. Lang. 2007. When are Proceedings of the national conference on artificial
elections with few candidates hard to manipulate? intelligence. 267-74.
J o u r n a l o f t h e A C M 54(3): 14. Monderer D, and M. Tennenholtz. 1999. Distributed games:
Cramton, P., Y. Shoham, and R. Steinberg, ed. 2006. (\ > m - From mechanisms to protocols. In P r o c e e d i n g s o f
b i n a t o r i a l a u c t i o n s . Cambridge, MA: MIT Press. the national conference on artificial
Feigenbaum, J., C. Papadimitriou, R. Sami, and S. Shenker. i n t e l l i g e n c e . 32-7.
2005. A BGP-based mechanism for lowest cost routing. Muller R. 2006. Tractable cases of the winner determination
D i s t r i b u t e d C o m p u t i n g 18: 61-72. problem. In P Cramton, Y. Shoham and R. Steinberg.
Feigenbaum J., and S. Shenker. 2002. Distributed algorithmic Nisan N, and A. Ronen. 2000. Computationally feasible VCG
mechanism design: Recent results and future directions. mechanisms. In P r o c e e d i n g s o f t h e A C M
In P r o c e e d i n g s of the international c o n f e r e n c e o n e l e c t r o n i c c o m m e r c e . 242-
workshop on discrete algorithms and 52.
methods for mobile computing and Nisan. N.. and A. Ronen. 2001. Algorithmic mechanism
c o m m u n i c a t i o n s . 1 13. design. G a m e s a n d E c o n o m i c B e h a v i o r 35:
Goldreich O., S. Micali and A. Wigderson. 1987. How to play 166 196.
any mental game or a completeness theorem for protocols Nisan. N.. and I. Segal. 2006. The communication require-
with honest majority. In P r o c e e d i n g s o f t h e ments of efficient allocations and supporting prices.
s y m p o s i u m o n t h e o r y o f c o m p u t i n g . 218-29. J o u r n a l o f E c o n o m i c T h e o r y 129: 192-224.
Hajiaghayi, M.T., R. Kleinberg,and T. Sandholm. 2007. Parkes D.. and J. Shneidman. 2004. Distributed
Automated online mechanism design and prophet implementations of generalized Vickrey Clarke Groves
inequalities. In P r o c e e d i n g s o f t h e n a t i o n a l auctions. In P r o c e e d i n g s o f t h e
conference on artificial intelligence. international conference on autonomous
Hershberger J, and S. Suri. 2001. Vickrey prices and shortest a g e n t s a n d m u l t i - a g e n t s y s t e m s . 261 -268.
paths: What is an edge worth? In P r o c e e d i n g s o f Petcu A., B. Faltings. and D. Parkes. 2006. MDPOP: Faithful
the symposium on foundations of distributed implementation of efficient social choice
c o m p u t e r S c i e n c e . 252 9. problems. In Proceedings of the
Jurca R, and B. Fallings. 2006. Minimum payments that international conference on autonomous
reward honest reputation feedback. In P r o c e e d i n g s a g e n t s a n d m u l t i - a g e n t s y s t e m s . 1397 404.
of the At' M conference on electronic Rothkopf, M. 2007. Thirteen reasons why the Vickrey Clarke
c o m m e r c e . 190 9. Groves process is not practical. O p e r a t i o n s
Kfir-Dahav N, D. Monderer, and M. Tennenholtz. 2000. R e s e a r c h 55: 191-197.
Mechanism design for resource bounded agents. In Rothkopf, M., A. Pekec, and R. Harstad. 1998. Computa-
Proceedings of the international tionally manageable combinatorial auctions. M a n a g e -
c o n f e r e n c e o n m u l t i - a g e n t s y s t e m s . 309- m e n t S c i e n c e 44: 1131-1147.
315. Sandholm T. 1993. An implementation of the contract net
Land, A., S. Powell, and R. Steinberg. 2006. PAUSE: A protocol based on marginal cost calculations. In P r o -
computationally tractable combinatorial auction. In ceedings of the national conference on
C o m b i n a t o r i a l a u c t i o n s , ed. Cramton, Shoham, a r t i f i c i a l i n t e l l i g e n c e . 256-62.
and Steinberg. Cambridge, MA: MIT Press. Sandholm, T. 2000. Issues in computational Vickrey auctions.
Larson Is.. 2006. Reducing costly information acquisition in International Journal of Electronic
auctions. In P r o c e e d i n g s o f t h e a u t o n o m o u s C o m m e r c e 4 : 107-129 .Early version in ICMAS-96.
a g e n t s a n d m u l t i - a g e n t s y s t e m s . 1167-74. Sandholm, T. 2002a. Algorithm for optimal winner deter-
Larson Is., and T. Sandholm. 2001. Costly valuation com- mination in combinatorial auctions. A r t i f i c i a l
putation in auctions. In P r o c e e d i n g s o f t h e I n t e l l i g e n c e 135: 1-54 .Earlier versions: ICE-98
theoretical aspects of rationality and keynote, Washington U. tech report WUCS-99-01 Jan.
k n o w l e d g e . 169-182. 1999, IJCAI-99.
Larson Is, and T. Sandholm. 2005. Mechanism design and Sandholm, T. 2002b. eMediator: A next generation electronic
deliberative agents. In Proceedings of the international commerce server. C o m p u t a t i o n a l I n t e l l i g e n c e
conference on autonomous agents and multi-agent systems. 18: 656-676 .Earlier versions: Washington U. tech report
650 656. WU-CS-99-02 Jan. 1999, AAAI-99 Workshop on AI in
Lavi IT, and C. Swamy. 2005. Truthful and near-optimal Ecommerce, AGENTS-00.
mechanism design via linear programming. In P r o - Sandholm T. 2006. Optimal winner determination algorithms.
ceedings of the symposium on In P Cramton, Y. Shoham and R. Steinberg.
f o u n d a t i o n s o f c o m p u t e r s c i e n c e . 595 604. Sandholm, T. 2007. Expressive commerce and its application
Lehmann D., R. Muller, and T. Sandholm. 2006. The winner to sourcing: How we conducted $35 billion of generalized
determination problem. In P Cramton, Y. Shoham and R. combinatorial auctions. .1/ M a g a z i n e 28(3): 45-58.
Steinberg. Sandholm T, and Boutilier C. 2006. Preference elicitation in
Lehmann, D., L.I. O’Callaghan, and Y. Shoham. 2002. Truth combinatorial auctions. In P. Cramton, Y. Shoham and
revelation in rapid, approximately efficient combinatorial R. Steinberg.
Comte, Isidore Auguste Marie Francois Xavier (1798-1857) 2025
Sandholm T, V. Conitzer, and C. Boutilier. 2007. Automated Comte’s social philosophy encompassed all
design of multistage mechanisms. In P r o c e e d i n g s o f aspects of life, which he believed to be the har-
the international joint con ference on
a r t i f i c i a l i n t e l l i g e n c e . 1500-6.
monious working together of two inseparable
Sandholm T., and V. Ferrandon. 2000. Safe exchange planner. elements - an organism and its environment. The
In Proceedings of the international conference on multi- ‘true’ methods of science were empirical (or
agent systems. 255-62. inductive), and Comte, as did his mentor Saint-
Sandholm, T., K. Larson, M. Andersson, O. Shehory, et al.
1999. Coalition structure generation with worst case
Simon, believed that they could be applied, mutatis
guarantees. A r t i f i c i a l I n t e l l i g e n c e 111: 209-238. mutandis, to all branches of thought including the
Shneidman J, and D.C Parkes . 2004. Specification faithfulness social sciences, of which economics was a part. He
in networks with rational nodes. In P r o c e e d i n g s o f succinctly described this basis in the System of
the ACM symposium on principles of
d i s t r i b u t e d c o m p u t i n g . 88-97.
Positive Polity. ‘It [the positivist synthesis of all
Vorobeychik Y, C. Kiekintveld, and M. Wellman. 2006. knowledge] rests at every point upon the
Empirical mechanism design: Methods, with unchangeable Order of the World. The right
application to a supply chain scenario. In Proceedings of understanding of this order is the principal subject
the ACM conference on electronic commerce. 306-15.
of our thoughts; its preponderating influence
determines the general course of our feelings; its
gradual improvement is the constant object of our
actions ...’ (vol. 1, p. 21).
‘Gradual improvement’ came about in three
Comte, Isidore Auguste Marie Francois successive stages of understanding the cause of
Xavier (1798-1857) phenomena. In the ‘Law of Three Stages’, Comte
argued that man’s conception of ‘order’ or the
Robert B. Ekelund Jr. interrelationships of causes and effects, went first
through a religious or theological stage, the
theological-primitive content of which diminished
progressively from fetishism, polytheism and
monotheism. The metaphysical or a priori stage
Auguste Comte, co-founder of sociology and
followed the theological and represented man’s
positivist philosopher, was bom at Montpellier in
attempt to discover ‘order’ by reason. The final
1798 and died in Paris in 1857. A student at the
stage in understanding order was the positive stage
Ecole Polytechnique until he was dismissed for
wherein science, or the knowledge of relationships
disobedience and incorrigible behaviour, Comte
between disparate phenomena, brought man to a
became the secretary to Henri de Saint- Simon in
kind of perfection. Within this latter stage, Comte
1818, a position he held until 1824. Over this period
argued that a second great law obtained - that of
Comte developed the kernel of positivist philosophy
decreasing generality and increasing complexity of
and, along with Saint- Simon, modem sociology.
understanding. Here, sciences progressed in a
The irascible Comte spent the rest of his life - often
definite ordering, each dependent on the previous
in the face of frequent poverty and desperate
one, from mathematics to astronomy to physics to
personal circumstances including mental
chemistry to biology and, finally, to sociology.
breakdowns and a failed marriage - establishing,
Comte’s system, its broadly empirical method-
altering and working out positivism as a
ology, and its precept that society was an organism
philosophical system of knowledge and as the
which evolved under constraints that were
foundation for the ‘science of society’. Comte’s
themselves ultimately altered by social activities
chefs d’oeuvre included the encyclopedic Cours
and behaviour, had a measurable impact upon
dephilosophie positive (1830—42) and his System
classical economics and upon the form that
of Positive Polity (1851-4).
2026 Comte, Isidore Auguste Marie Francois Xavier (1798-1857)
economic analysis would take in the neoclassical reinforced Ricardo’s method which became the
period and beyond. J.S. Mill introduced Comte’s dominant method of economists in the 20th century.
ideas to England and, for a time at least, was deeply
influenced by the French philosopher. Mill’s Logic
(1843) contains copious and favourable references
to Comte’s inverse- deductive (empirical-historical)
method for use in the broader and more important See Also
investigation of integrated social studies. In the end,
however, Mill reaffirmed the a priori ‘Ricardian’
method for the narrower concerns of political
economy. Again, Comte’s influence surfaced in
Mill’s Principles of Political Economy (1848) in the Selected Works
form of the famous ‘statics and dynamics’
distinction and in Mill’s admissions that the ultimate 1830—42. Cours de philosophic positive, 2 vols,
aim of social science was a broader conception of 5th ed. Trans. H. Martineau; 3rd ed. London,
the process than political economy had to offer. In 1893.
his Principles, however, Mill reaffirmed the 1851—4. Systeme de politique positive, ou Traite
(admittedly provisional) a priori deductive method de sociologie institutant la religion de
as the essential and proper one for studying political l’humanite (Trans: J.H. Bridge et al. as System
economy. Ultimately, Mill and a number of Comte’s of positive polity). London, 1875—7; New
followers totally rejected the ‘religion of humanity’ York: Burt Franklin Reprint,
that Comte later made of his positivist principles 4 vols, 1968.
(1851-77). Mill was especially aghast at the 18 99 . Lettres inedites de John Stuart Mill a
infringements on individual liberty that Comte’s Auguste Comte, publiees avec les reponses de
quasi-medieval ‘Catholicism without Christianity’ Comte et un introduction par Lucien Levy-
envisioned. In his Autobiography Mill attacked Bruhl. Pa r i s : F e l i x A l c an .
Comte’s planned society as ‘the completest system
of spiritual and temporal despotism which ever yet
References
emanated from a human brain, unless possibly that
of Ignatius Loyola’ ( , p.149). Abrams, P. 1968. T h e origins of British
In the late classical and early neoclassical period s o c i o l o g v 1 8 3 4 - 1 9 1 4 . Chicago: University of
Comte’s ideas received some reinforcement at the Chicago Press.
Adelman, P. 1971. Frederic Harrison and the ‘Positivist’
hands of the British historicists, notably John Kells
attack on orthodox political economy. H i s t o r y o f
Ingram. But his impact, especially on methodology P o l i t i c a l E c o n o m y 3( 1): 170-189.
and on his belief that political economy should be Caimes, J.E. 1870. M. Comte and political economy. F o r t -
subordinate to sociology, was successfully nullified n i g h t l y R e v i e w 13: 579-602.
Ekelund Jr., R.B., and E. Olsen. 1973. Comte, Mill and
by the efforts of John Elliott Caimes (). While
Caimes: The positivist-empiricist interlude in late clas-
admitting that empiricism sical economics. J o u r n a l o f E c o n o m i c I s s u e s
was a necessary adjunct to economic theory, Caimes 7(3): 383-416.
defended an essentially abstract and a priori method Hayek, F.A. 1955. The counter-revolution of science, studies in
in political economy. Caime’s ideas on method were the abuse of reason. New York: The Free Press.
Ingram, J.K. 1899. A history of political
replicated by the leading methodologist of the e c o n o m y . London: A. and C. Black, 1915.
neoclassical and even post- neoclassical periods, Mill, J.S. 1865. A u g u s t e C o m t e a n d p o s i t i v i s m .
John Neville Keynes. The attempt to infuse Comtian London: N. Triibner.
and other broader methods into political economy, Mill, J.S. 1873. A u t o b i o g r a p h y . New York: Columbia
University Press, 1924.
in other words, Seligman, B.B. 1969. The impact of positivism on economic
thought. H i s t o r y o f P o l i t i c a l E c o n o m y 1(2):
256-278.
Concentration Measures 2027
The term concentration (also firm concentration, The summation in this case is taken over the set of
industry concentration or market concentration) all N firms in the market. This index lies between
refers to aspects of the distribution of firm size zero and 1: if there is only one firm in the market,
within a specific market or industry that have so that the market has the highest possible con-
traditionally been used to characterize the degree of centration, the index is 1. If, on the other hand, there
competitiveness in the market. Even though the size are many equally sized firms in the market, the
of firms can be measured using many different index will be close to zero. By squaring the
variables, such as employment or assets, the sales individual market shares, this index gives relatively
level is the most commonly used size measure. greater weight to the market shares of large firms.
Accordingly, if very few firms serve a very large Conversely, the addition of one small firm to the
portion of the market, it is said that the given market dilutes somewhat the market share of larger
market is highly ‘concentrated’, whereas if no firms, and has a marginal negative effect on the
single firm has a large share of sales it is said that index, which is consistent with any notion of market
the market is not ‘concentrated’. Since con- concentration. Any value of this index can
centration is an important reflection of the under- correspond to multiple
lying market structure, its measurement is an
2028 Concentration Measures
market configurations, being in that sense less with long-distance phone companies and Internet
illustrative of the actual concentration of a companies as providers of communication services.
market than a set of //-firm concentration rates. The Internet, on the other hand, competes in some
On the other hand, this index can be easily instances with cable and satellite companies, radio
correlated with other market characteristics and stations and even newspapers as sources of news
is therefore very useful for statistical analysis. and entertainment. What exactly the relevant market
Other less commonly used concentration faced by mobile phone companies is will depend on
measures include entropy coefficients, the Gini the type of issue being addressed. Accordingly, the
coefficient and measures of the variance of concentration measures will change depending on
market shares across firms within a market. The the adopted market definition.
entropy coefficient is usually computed using the On the other hand, even if the market is well
following formula: defined, computed concentration measures may not
reflect at all the real competitive structure of the
e= (£). market. For example, even in markets as highly
concentrated as the market for computer processors,
the dominant firms have to account for the invisible
This index takes value zero if there is only one
competition of potential entrants. The same happens
firm in the market and grows as market concen-
in regional markets where outside firms are kept at
tration decreases. The interpretation of this coef-
bay by few local films with a combination of low
ficient is complicated, because its formula
prices and high transportation costs. Computed
weights both large and small firms less heavily
concentration measures for specific markets cannot
than mid-size firms and grows rmbormdedly as
account for this unobserved competition and may
market concentration decreases. It is therefore
therefore lead to wrong conclusions regarding the
less commonly used than the Herfindahl index.
underlying behaviour of films. In these instances, a
The Gini coefficient is commonly used to char-
behavioural measure, such as die Lemer index,
acterize the income or wealth inequality within a
which measures the relative size of films' markups,
society. Its drawback as a measure of market con-
may be a better indicator of the competitive
centration is that it is useful only to measure the
structure of the market.
concentration of firms’ sizes within a market, given
There is a body of empirical literature that uses
a number of firms. So according to the Gini
market concentration measures across industries to
coefficient a duopolistic market with two firms of
approximate the underlying differences in indus-
equal size is as concentrated as a market with 100
tries' competitiveness. They were then used to infer
firms with identical size. The same drawback
statistically the relationship between market
applies for the use of measures of the variance of
‘structure’ and market ‘performance’. For example,
firm size - whose definition is simply the sample
correlations of R&D expendirtire and market con-
variance of firm sizes.
centration were computed to investigate whether
A11 the concentration measures mentioned
films in concentrated markets were more or less
above arc vciy sensitive to the acUtal market
likely to innovate than films in more competitive
definition that is used. In markets for differentiated
markets. The value of such correlations is limited
products, for example, products may face a
because the observed concentration may be both a
continuum of similar products, and determining
cause and an effect of individual films' behaviour
which similar products exactly constitute a market
and the relationship is shaped by the specifics of the
is not always easy. Take the specific example of the
industry. In order to avoid the ambiguities of such
market faced by US mobile phone services: with
an inter-industry approach, the more recent empir-
just a handfiil of national providers it is
ical microeconomic literature has generally focused
concentrated given the standard concentration
instead on the understanding of firm behaviour
measures. These national films, nevertheless, are
within specific industries, for which the use of
also competing with local companies in various
concentration measures is less relevant.
segments of the market and even
Concentration Ratios 2029
industrial census. Only manufacturing industries are weighted average degree of four-firm concentration
covered, although ratios on urban US banking in US manufacturing industries has been actually
markets are prepared by other sources). about 60 per cent, rather than the 40 per cent
Econometric analysis has made extensive use indicated by the raw census ratios.
of these ratios, with scores of papers reporting Properly adjusted to eliminate these biases, a
regressions relating concentration to price-cost concentration ratio is an excellent descriptive
patterns, growth rates, efficiency, rates of statistic. It conveys the degree of oligopoly 'tight-
innovation, etc. In fact, concentration ratios ness'. It can show changes in structure pretty accu-
reigned as the research focus for analysis of rately. Thus the market power indicated by a ratio
market power and its effects. Indeed, from 1939 of 53 or 63 may be a matter of debate, but a rise in
to about 1970 the abundant availability of the ratio from 53 to 63 for a given industry strongly
ratios reinforced the tendency to regard (perhaps suggests that there has been a rise in market power.
over-regard) oligopoly as a central issue. With By 1975, the ratios have provided enormously
the growing focus on individual market shares, valuable lessons, in several directions. They were
the role of concentration ratios became less the workhorse statistic in describing the structure of
important in the 1970s. Even so, the ratios hundreds of industries, in case sUidics of industries,
continue to be an indispensable descriptive in antitrust investigations and in other straightfor-
statistic, used widely in research into the several ward treatments. They had become a pivotal basis
elements of structure and their effects. for antitrust policy choices, such as in merger cases,
Before summarizing some of the results, one where the degree of concentration was one basis for
needs to note that the ratios arc subject to a deciding whether to oppose the merger. During
serious technical fault. A correct definition of 1960-75 they gave rise to many scores of regression
the market is important, if true concentration analyses, which tried to relate concentration to the
within the market is to be measured. The US possible effects of market power.
Bureau of the Census has a detailed system of The correlation of concentration with price
standard industrial classification (SIC). All sectors behaviour drew the largest volume of testing. These
are grouped by numbers, ranging from SIC 1 to estimations were commonly plagued by the use of
100. The manufacturing sector covers ‘industry uncorrected, raw census ratios, which introduced
groups’ 20 through 39, and so forth. Differing substantial errors. Even so, a broad and significant
degrees of fineness are given. The five-digit correlation did emerge, enough to establish a
product level is, on balance, about correct in presumption that the theoretical effects of oligopoly
fitting the average scope of true markets. Yet most market power on pricing activity do occur in
research has focused on the much broader four- practice. The most successful testing involved
digit 'industries’, which now number about 450 multivariate models of five to eight independent
in the US. variables, including filter variables such as capital
About half of these four-digit census 'indus- intensity and growth, and other structure- related
tries’ depart seriously from correct market variables such as market shares, advertising
boundaries. The use of raw census ratios has intensity and capital-requirements barriers to entry.
undermined a good deal of the past research on Another important line of research attempted to
structure and its effects. Commonly, the ratios explain concentration as the dependent variable.
arc too broad, lumping together distinct products Growth rates were one possible causative variable,
and geographic markets (for example, the while economies of scale were another. Growth
national four-firm US ratio is 14 per cent for emerged as a very weak influence, and scale econo-
newspapers, but concentration ratios in true local mics Uuncd out to explain only a limited amount of
newspaper markets probably average close to 100 actual concentration in important national markets.
per cent). Some cases go the other way, with the The weakness of these results may, in part, have
official ratios too high (for example, imports are not reflected errors in the concentration ratios
included in the ratios, and this is important for steel, themselves.
television sets, shoes, cameras, automobiles and Finally, concentration's possible effects on
many others). innovation, stability' and wealth distribution
Adjusting the ratios to fit true market conditions
Condillac, Etienne Bonnot de, Abbe de Mureau (1714-1780) 2031
have been explored. These studies often were forced Mann, H.M. 1966. Seller concentration, barriers to entry, and
to adopt quite creative approaches, in light to the rates of return in thirty industries, 1950-1960. R e v i e w
o f E c o n o m i c s a n d S t a t i s t i c s 48: 296-307.
data problems. Again, the general patterns have Mansfield, E. 1964. Industrial research and development
confirmed the main predictions of theory, that high expenditures. J o u r n a l o f P o l i t i c a l E c o n o m y
concentration tends to affect results much as high 72: 319-340.
monopoly power does. High concentration was Scherer, EM. 1980. I n d u s t r i a l m a r k e t s t r u c t u r e
a n d e c o n o m i c p e r f o r m a n c e , 2nd ed. Boston:
associated, on the whole, with slowed innovation, Houghton Mifflin.
greater instability of production and the Shepherd, W.G. 1969. Market power and racial discrimi-
disequalizing of the distribution of wealth. nation in white-collar employment. A n t i t r u s t
Yet testing continues on all of these points, and B u l l e t i n 14:141-161.
Shepherd, W.G. 1970. M a r k e t p o w e r a n d e c o n o m i c
the concentration ratios have tended only to suggest w e l f a r e . New York: Random House.
patterns, not resolve them. Studies since 1970 have Shepherd, W.G. 1979. T h e e c o n o m i c s o f i n d u s t r i a l
focused more on individual market shares, showing o r g a n i z a t i o n . Englewood Cliffs: Prentice-Hall.
them to have stronger effects than concentration, Shepherd, W.G. 1982. Causes of increased competition in the
US economy, 1939-1980. R e v i e w o f E c o n o m i c s
just as theory predicts. Though the use of
a n d S t a t i s t i c s 64: 613-626.
concentration ratios in regression analysis may have Stigler, G.J. (ed.). 1955. B u s i n e s s c o n c e n t r a t i o n
peaked, the ratios (adjusted as appropriate) will a n d p r i c e p o l i c y . Princeton: Princeton University
undoubtedly continue as a main basis for describing Press.
Stigler, G.J., and J.K. Kindahl. 1970. T h e b e h a v i o r o f
the degree of market power in a wide array of
i n d u s t r i a l p r i c e s . New York: Columbia University
markets in the US, UK, Canada, Japan and certain Press for the National Bureau of Economic Research.
other countries. US Bureau of the Census. 1980. C o n c e n t r a t i o n r a t i o s
References
Keywords
Bain, J.S. 1956. B a r r i e r s t o n e w c o m p e t i t i o n . Baudeau, N.; Condillac, E. B. de; Exchange;
Cambridge, MA: Harvard University Press. Galiani, F.; Le Trosne, G. F.; Physiocracy;
Bain, J.S. 1968. I n d u s t r i a l o r g a n i z a t i o n , Revisedth Price; Turgot, A. R. J.; Value; Verri, P.
ed. New York: Wiley.
Blair, J.M. 1972. E c o n o m i c c o n c e n t r a t i o n . New
York: Harcourt, Brace, Jovanovich.
JEL Classifications
Chamberlin, E.H. 1933. The theory of
B31
m o n o p o l i s t i c c o m p e t i t i o n , 8th ed. Cambridge,
MA: Harvard University Press. 1962. Philosopher and economist. Bom at Grenoble, the
Collins, N., and L.E. Preston. 1968. C o n c e n t r a t i o n a n d third son of a well-to-do aristocratic family,
price-cost margins in manufacturing
i n d u s t r i e s . Berkeley: University of California Press.
Condillac took his name from an estate purchased
Comanor, W.S., and R.H. Smiley. 1975. Monopoly and the by his father in 1720. As a sickly child with poor
distribution of wealth. Q u a r t e r l y J o u r n a l o f eyesight he had little early education and was
E c o n o m i c s 89: 177-194. apparently still unable to read by the age of 12.
Comanor, W.S., and T. Wilson. 1975. A d v e r t i s i n g a n d
After his father’s death in 1727 he moved to Lyon
m a r k e t p o w e r . Cambridge, MA: Harvard University
Press. to live with his oldest brother, continuing his
Goldschmid, H.J., H.M. Mann, and J.F. Weston (eds.). 1974. education at its Jesuit college. Through this brother
Industrial concentration: The new he may have first met Jean Jacques Rousseau, who
l e a r n i n g . Boston: Little, Brown.
was tutor to
2032 Condillac, Etienne Bonnot de, Abbe de Mureau (1714-1780)
his nephews in 1740 and became a life-long friend. The impetus for Condillac’s writing Le Com-
His second brother, l’Abbe de Mably, took merce et le Gouvernement has been ascribed to a
Condillac to Paris in c. 1733 to study theology at desire to assist his friend Turgot in the difficulties
Saint Sulpice and the Sorbonne. He was ordained in he faced in 1775 as finance minister over the grain
1740 and for the rest of his life ‘ever faithful to die riots induced by his restoration of the free trade in
Christian church, would always wear his cassock, grain (Le Roy , p. xxv; Knight , p. 232). This fits
always remain l’Abbe’ (Lefevre , p. 11). with the work’s unqualified support for free trade in
For the next 15 years he lived the life of a Paris general and the grain trade in particular (1776, esp.
intellectual, studying the philosophy of Descartes, pp. 344-5, which seems directly inspired by the
Malebranche, Leibniz and Spinoza, ‘to whose Paris events of 1775). Writing the book may also be
speculative systems he formed a life-long aversion, explained as a return favour for Turgot’s assistance
preferring the English philosophers Locke (who in getting Condillac (1775) published (cf. Knight ,
particularly influenced his thinking), Berkeley, pp. 13, 232). Despite Condillac’s strong support for
Newton and rather belatedly, Bacon’ (Knight , pp. this major part of Physiocratic policy and his close
8-9). In this period he published the works which adherence to other aspects of Physiocracy, his
made his philosophical reputation: the Essay on the argument that manufacturing was productive
Origin of Human Knowledge (1746), the Traite des brought critical replies from Baudeau and Le
Systemes (1749), his most famous philosophical Trosne ( ). In
work Treatise on the Sensations (1754) described as this context it may be noted that his work bears
the ‘most rigorous demonstration of the [18th- little direct Physiocratic influence, the major influ-
century] sensationalist psychology’ (Knight , p. 12) ence being Cantillon ( ), the only work
and his Traite des Animaux (1755). directly cited apart from Plumard de Dangeul ( ). It
Apart from giving him entry to the Paris salons, is, however, possible to detect some
where at Mile de Lespinasse’s salon he is reputed to influence from the economics of Turgot, Galiani
have first met Turgot, another life-long friend (Le and Verri on the theory of value, price and com-
Roy , p. ix), his intellectual reputation gained him petition (cf. Spengler , p. 212).
the position of tutor to Louis XV’s grandson, the As published, the work is divided into two
Duke of Parma. From 1758 to 1767 he resided in parts. The first provides the elements of the science.
Parma. Because of its prime minister’s economic Its starting point is the foundation of value, which
development policies, inspired by a mixture of Condillac finds in the usefulness of an object rela-
‘mercantilism, physiocracy and the ideas of tive to subjective needs making relative scarcity the
Goumay’, Condillac developed an interest in key variable determining value. Value is distin-
economic matters, an interest ‘indirectly confirmed guished from price because price can only originate
by his known contacts with the Italian political in exchange. It is determined by the competition
economists, Beccaria and Gherardo’ (Knight , pp. between buyers and sellers guided by their subjec-
231-2). In 1768 he returned to Paris, but by 1773 tive estimation of value. Gains from exchange arise
had retired to his estate of Flux near Beaugency, from differences in value; for Condillac, value
where he died in 1780. During the last decade of his cannot exchange for equal value. Although
life he published his Cours d’Etudes (1775), his Condillac did discuss the costs of acquiring com-
work on economics (1776), a text on logic (1780) modities, his emphasis is on exchange, trade and
for use in Polish Palatinate schools, and price. Exchange presumes surplus production and a
commenced the unfinished La Langue des Calcids need for consumption. Hence trade inspires and
(1798). In 1752, he became a member of the Royal animates production and is essential to increasing
Prussian Academy; in 1768 after his return from wealth. Only simple pictures of production are
Parma he was elected to the French Academy. His presented: farm labourers producing prime neces-
works have been frequently collected, most recently sities of food and materials; artisans transforming
by Le Roy ( -51). raw materials into essentials and luxuries; traders
who circulate these products at home and abroad.
By this circulation trade distributes the annual
Condorcet, Marie Jean Antoine Nicolas Caritat, Marquis de (1743-1794) 2033
Jack Hirshleifer
invasions of civilized cities and empires in ancient Static and dynamic elements are both importantly
times, and European colonial imperialism in the involved in conflict or settlement processes. In
modem era. Yet, between contending parties there game theory terms, the payoff environment,
will almost always be some element of com- represented by the familiar normal-form matrix, is
plementary interests, an opportunity for mutual gain the static element. The dynamic element may be
represented by the potential settlement region PSR. called the protocol ofplay, as pictured in the game
Orthodox economics has always emphasized the tree, the protocol specifies the allowable step-by-
scope of mutual benefit, even to the point of losing step moves in the light of the players’ information at
sight of conflict; certain dissident schools, notably each stage.
the Marxists, have committed the opposite error. A few very simple payoff environments are
While a detailed analysis cannot be provided here, shown in Matrices 1-4. The numbers in each cell
among the factors underlying the relative material indicate ordinally ranked payoffs for each player, 1
profitability of fighting versus negotiating are being the poorest outcome in each case. In Matrix 1,
wealth differentials, Malthusian pressures, military ‘Land or Sea’, the environment is characterized
technology, and the enforceability of agreements. by completely antithetical
In contrast with the materialistic approach, (constantsum) payoffs. The other three matrices -
attitudinal theories of conflict direct attention to the ‘Chicken’, ‘Reciprocity’ and ‘Prisoners’ Dilemma’
respective preference functions. An issue which has - represent several of the many different possible
excited considerable interest concerns the relative mixed-motive situations combining an element of
weights assignable to genetic versus cultural opposition of interests with an opportunity for
determinants of attitudes. One extreme viewpoint, mutual gain.
for example, regards xenophobic wars of family The simplest protocol to analyse is one-round
against family, of tribe against tribe, or nation sequential play, first Row selects one of his options,
against nation, as biologically ‘normal’ in the then Column makes his move in the light of Row’s
human species. An opposite interpretation pictures choice, and the game ends. In a sequential-play
man as an innately compliant being, who has to be protocol it is always possible to find a ‘rational’
culturally indoctrinated into bellicosity. solution. If Column can be relied to choose his best
final move then Row, knowing
Defend by Defend
land by sea Soft Tough
Attack by
land 1,2 2,1 Soft 3,3 2,4
Attack by 2,1 1,2 Tough 4,2 1,1
sea
this, can calculate his best first move accordingly. In the payoff environment of ‘Chicken’ (Matrix
(This process results in what is called a ‘perfect 2), while the opportunities remain highly
equilibrium’.) In contrast, where the protocol dic- antithetical there is now a mutual interest in
tates that players in a single-round game choose avoiding the disastrous (1,1) outcome that comes
simultaneously - or, equivalently, where each about when both play Tough. In contrast with ‘Land
chooses in ignorance of the other’s move - solution or Sea’, in the ‘Chicken’ payoff environment the
concepts are harder to justify. The most commonly advantage lies with tht first-mover. Specifically,
employed is called the ‘Nash equilibrium’ (or Row should rationally play Tough, knowing that
‘equilibrium point’), a pair of strategies from which Column then has to respond with Soft. For, Column
neither player would want to diverge unilaterally. must accept the bad (payoff of 2) to avoid the worst
In the ‘Land or Sea’ payoff environment, under (payoff of 1). If the protocol dictates simultaneous
the one-round sequential-move protocol, it is the moves, however, once again the players are groping
second-mover or defender who has the advantage. If in the dark. Under the Nash equilibrium concept
Row moves first, for example, Column can always they choose probabilistically, which implies that the
successfully counter; e.g. if Row attacks by land, disastrous (1,1) outcome will indeed occur a
Column will defend by land. Hence the (1,2) percentage of the time. There is a suggestive
payoff-pair is the outcome regardless of Row’s application of this model to industrial conflict. If
initial move. In military terms the defence has an union (or management) becomes committed to play
intrinsic advantage whenever the attacker must Soft, it will be at a disadvantage in negotiations -
visibly commit his forces to one or another line of the other side will then surely play Tough. But if
attack. And, of course, where the defence has such both play Tough, there is no hope for peaceful
an advantage neither party is motivated to initiate settlement. Hence each side should rationally adopt
warfare through aggression. But if ‘Land or Sea’ is a ‘mixed’ strategy, with the consequence that strikes
played under the simultaneous- move protocol, both and lockouts will occur in a certain fraction of the
parties are groping in the dark and little can be said dealings.
with confidence. (Here the Nash equilibrium would The ‘Reciprocity’ payoff environment (Matrix
have each side choosing its move at random, in 3) is more rewarding to cooperative behaviour. The
effect tossing a coin.) idea is that each player would
Fold Retaliat
e
Refrai Conflict and Settlement 2039
n 2,3 2,3
Attack 3,1 1,2
Conflict and Settlement, Matrix 5 Matrix 6
Fold
Fig. 4 Retaliat DETERRENCE DETERRENCE
e
Refrai WITHOUT REQUIRING
n 2,3 2,3 COMMITMENT COMMITMENT
Attack 3,2 1,1
answer Soft with Soft - leading to the mutually Imagine now they are given a chance to shift
preferred (4,4) payoffs - but failing this, would strategies under a sequential-move protocol. As
respond to Tough with Tough. If the sequential- first-mover, Row would be happy to change from
move protocol applies, the first-mover would then Tough to Soft if only he could rely upon Column to
always rationally choose Soft, and so the ideal (4,4) respond in kind. But Row may, mistakenly, believe
payoff-pair should be achieved. But under the that Column’s payoffs are as in ‘Chicken’, from
simultaneous-move protocol, with each party in the which he infers that Column would stand pat with
dark about the other’s move, again the outcome is Tough. Row would therefore not shift from Tough,
quite unclear. In fact there are three Nash equilibria: hence Column in his turn would not change either.
pure-strategy solutions at (4,4) and (2,2), and a (Some authors have gone so far as to attribute all or
mixed-strategy solution as well. almost all of human conflict to such mistaken ‘self-
Finally, in the famous ‘Prisoners’ Dilemma’ fulfilling beliefs’ about the hostility of opponents,
payoff environment (Matrix 4) the parties are likely but of course this pattern is only one of many
to find themselves in the Defect-Defect ‘trap’ with possibilities.)
(2,2) payoffs, even though (3,3) could be achieved
were each to play Cooperate. Here the ‘trap’ takes
hold under both sequential- move and Commitment and Deterrence
simultaneous-move protocols.
The preceding discussion could only be sug- In some circumstances the second-mover in point of
gestive, limited as it was to 2-player single-round time (Column) may be able to commit himself to a
games, within that category to only a few two- given response strategy before Row makes his first
strategy symmetrical payoff environments, and move. While Column thereby surrenders freedom of
finally to the very simplest protocols - excluding, choice, doing so may be advantageous. Consider
for example, all negotiations and communications threats and promises. A threat is a commitment to
between the parties. Space limitations permit undertake a second-move punishment strategy even
comment upon only a few additional points: where execution thereof is costly. A promise
similarly involves commitment to a costly reward
strategy. Matrices 5 and 6 illustrate how a threat
Perceptions works. Row’s choices are Attack or Refrain, while
Column’s only options are to Retaliate or Fold if
Standard game models assume that players know Row attacks. Column’s problem, of coiuse, is to
not only their own payoffs but also their oppo- deter Row’s attack. In Matrix 5 Column prefers to
nents’. Unintentional error on this score, or else Retaliate if attacked, a fact that - given Row’s
deliberate deception, may play a crucial role. Sup- preference - suffices for deterrence. Commitment is
pose two parties in the ‘Reciprocity’ payoff envi- not required. (Since Column prefers to Retaliate,
ronment of Matrix 3 find themselves initially there is no need to commit himself to do so.) In
playing Tough-Tough with outcome (2,2). Matrix 6 the Column player prefers to turn the other
cheek; if attacked,
2040 Conflict and Settlement
he would rather Fold than Retaliate. Unfortunately, (However, there is some reason to believe, the
this guarantees he will be attacked! (Note that here technology of attack through long-range
it is not excessive hostility, but the reverse, that weapons has now so come to prevail over the
brings on conflict.) But if Column could commit defence that a single world-state is indeed
himself to Retaliate, for example by computerizing impending.)
the associated machinery beyond the possibility' of
his later reneging, then deterrence succeeds. In Going into the basis for increasing returns, at
short, if a pacific player can reliably threaten to do any moment the stronger in battle can inflict a
what he does not really want to do, he won’t have to more-than-proportionatc loss upon his opponent,
do it! (Needless to say, so dangerous an thus becoming progressively stronger still. Impor-
arrangement is not to be casually recommended.) tant special cases of this process arc modelled via
Lanchester’s equations. In combat, in the ideal case
where all the military units distribute their fire
The Technology of Struggle equally over the enemy’s line, the process equations
arc:
Conflict is a kind of ‘industry’ in which different
'films' compete by attempting to disable opponents. AB/At = - ARR
Just as the economist without being a manager or AR/At =k B
B
beyond all reasonable prior calculations, as expe- aggression, invasion attempts can be expected to
rienced for example at Verdun. occur. Invasive and counter-invasive effort absorb a
On the other hand, an effective substitute for very substantial fraction of society’s resources in
force size is superior organization. An integrated every possible social structure, whether egalitarian
military unit is far more powerful than an equally or hierarchical, liberal or totalitarian, centralized or
numerous conglomeration of individual fighters, decentralized. Furthermore, every form of human
however brave. Organizational superiority, far more social organization, whatever else can be said for or
than superiority in weapons, explains why small against it, must ultimately meet the survival test of
I-’ll top can expeditionary contingents in early internal and external conflict.
modem times were able to defeat even vast
indigenous forces in America, Africa and Asia.
Battles are thus often a contest of organizational Notes on the Literature of Conflict (of
forms; the army whose command structure first Special Relevance for Economists)
cracks under pressure is the loser.
As for diminishing returns, in the simplest case Classical military thought from Machiavelli to
an equilibrium is achieved at a geographical Clausewitz to Liddell Hart, though rarely analytical
boundary such that: in the economist’s sense, remains well worth study.
An excellent survey is Edward Mead Earle ( ).
Modem work in this classical genre
~ SB*B = — SRXR
understandably concentrates upon the over-
Here MB and MR are military strengths at the whelming fact of nuclear weaponry and the problem
respective home bases, Sfi and .SR are decay gra- of deterrence; the contributions of Herman Kahn (
dients, and xB and xR are the respective distances , ) are notable. There is of course
from base. The condition of equality determines the a huge historical literature on conflict and war. An
allocation of territory. interesting economics-oriented interpretive history
The ‘social physics’ of straggle is of course far of modem warfare is Geoffrey Blainey ( ).
more complex than these simplistic initial models William H. McNeill ( ) examines the
suggest. There are more or less distinct offence and course of military organization and technology from
defence technologies, first-strike capability is not antiquity to the present, emphasizing the social and
the same as retaliatory strength, countering economic context. On a smaller scale John Keegan (
insurgency is a different problem from central land ) provides a valuable picture of
battle, etc. how men, weapons, and tactics compete with and
complement one another on the battlefield. There is
also a substantial body of statistical work
Conflict, Society and Economy attempting in a variety of ways to summarize and
classify the sources and outcomes of wars; the best
Conflict theory can help explain not only the size known is Lewis F. Richardson ( ).
and shape of nations, but the outcomes of com- Mathematical analysis of military activity, that is,
petition in all aspects of life: contests among social quantifiable modelling of the clash of contending
classes, among political factions and ideologies, forces, is surprisingly sparse. The classic work is
between management and labour, among Frederick William Lanchester ( [1956]).
contenders for licences and privileges (‘rent- The modem analysis of conflict, typically com-
seeking’), between plaintiffs and defendants in law bining the theory of games with the rational-
suits, among members of cartels like OPEC, decision economics of choice, is represented by
between husband and wife and sibling and sibling three important books by economists: Thomas C.
within the family, and so on. Whenever resources Schelling ( ), Kenneth E. Boulding
can be seized by
2042 Congestion
chosen. Some degree of congestion is typically stadium, the stock capacity constraint is the more
socially optimal. important. It should also be noted that a congestible
The economic theory of congestion has much in facility can take the form of a network of
common with the theory of clubs and local public congestible elements of capacity; a natural dis-
goods (Scotchmer ). The two literatures examine tinction is then between link congestion (for
similar issues, but the economic theory of example, highway links) and nodal congestion (for
congestion has a policy perspective, while the example, traffic intersections).
theory of clubs and local public goods focuses on To develop the theory, we consider a particular
decentralized provision. congestible facility having a single element of
Formally, we may define congestion as follows. capacity and identical users, that is in a steady state:
Consider a congestible facility in a steady state, that a road of uniform width connecting a single entry
comprises I congestible elements. (Congestible point A and a single exit point B, for which an
elements for a sports stadium, for example, include increase in traffic flow increases travel time and an
nearby roads, parking facilities, the ticket office, increase in road width reduces it. In this context, the
washrooms, concessions, and seating.) Element i is deterioration of quality of service with an increase in
characterized by a flow capacity, kh and a stock usage is the increase in travel time from an increase
capacity, Ki: the flow capacity is the maximum in traffic flow.
throughput per unit time, the stock capacity the We start with the short-run problem of deter-
maximum number of users at a point in time. mining optimal flow and its decentralized attain-
Similarly, the level of usage is described in terms of ment, holding road width fixed. Let/denote flow, w
the throughput of congestible element /, nh and the road width, t = t( f w) the travel time function with
number of users at a point in time, Nt. The (functional subscripts denote partial derivatives) tf >
congestible facility provides J dimensions of quality 0 and < 0 and p the value of time. Then the cost to
of service, with the level of dimension j indicated an individual driver of travelling from A to B, the
by sj. user cost, is p t( f w). Total user costs per unit time
Letting k, K,n, N and s denote the corresponding equal flow times user cost: pft{ f w). The social cost
vectors, per unit time from increasing flow by one unit, with
capacity held fixed, the short-run marginal social
s = S(k,K,n,N). ( 1) cost, is pt( f, w) + fifty (f, w). The first term is the
user cost of the extra driver; the second, the
Congestion occurs when there is at least one com-
congestion externality cost. A driver imposes a
bination of j and i for which sj is monotone
congestion externality by slowing other drivers
decreasing in n[ (flow congestion) or N] (stock
down; increasing steady- state flow by one car
congestion), that is, when some dimension of
increases each car’s travel time by tf(f,w) and social
quality of service falls as the throughput or stock of
cost by pfiy (f w).
users of some congestible element of capacity
Figure displays short-run equilibrium. p denotes
increases. This is the static or steady-state definition
trip price, D(p) the aggregate trip demand function,
of congestion. The dynamic definition of congestion
and uc(f) and snnsc(f ) the user cost and short-run
adds time subscripts to s, k, K, n and N, and appends
marginal social cost as a function of / holding w
equations of motion relating stocks and flows for
fixed. With no toll, a user’s trip price equals his user
the various elements of capacity.
cost, and equilibrium occurs where the demand and
For some congestible elements, such as a turn-
user cost functions intersect, with flow fe. Assuming
stile, the bottleneck in the Vickrey ( ) bottle
that the marginal social benefit from a trip equals the
neck model of traffic congestion, or a switching
corresponding marginal willingness to pay, the
circuit, the flow capacity constraint is the more
optimum occurs where the demand and short-run
important; for others, such as a telephone line, an
marginal social cost curves intersect, with flow/*.
elevator, a swimming pool, or seating at a football
Thus, with no toll, equilibrium flow is excessive.
2044 Congestion
Congestion,
Fig. 1 Demand and supply
on a congested highway
Efficiency obtains when economic agents face the per unit time from flow/ and C(w) the amortized
social costs of their decisions and derive the social capital cost of road width w. (We ignore the com-
benefits from them. In the notoll case, the price of a plications that arise when the congestible facility is
trip falls short of its marginal social cost since a sufficiently large that its construction alters factor
driver does not pay for slowing down other drivers. prices.) The social surplus generated by the road
Following Pigou ( ), the standard (per unit time) equals social benefit minus social
remedy for internalizing the congestion externality cost, and social cost equals total user cost plus
is to impose a toll equal to the congestion amortized capital cost:
externality cost, evaluated at the social optimum: t*
in Fig. . This causes the trip price function to shift SS(f, w) = B(f) — pftif, w ) - C{w) .
up from uc (/) to iic( f ) + t* and equilibrium flow to (2 )
fall to the optimal level.
The above argument illustrates the general It is easily seen from ( ) that the road width that
principle that efficient utilization of a congestible maximizes social surplus is that which minimizes
facility requires that the price equal short-run social cost. This means that, when the long-run
marginal social cost and the toll the congestion planning problem is solved, production is carried
externality cost. Different user types - for example, out according to the long-run cost structure, and the
cars and trucks - may impose different congestion short-run marginal cost pricing (which is again)
externality costs. Efficiency then requires that the required for optimal flow is equivalent to long-run
toll be differentiated according to user type. marginal cost pricing:
We now turn to the long-run planning problem
in which both road width and flow are choice p = LRMC.
variables. We then consider decentralization of the
(3 )
optimum. Let B (/) denote the social benefit
Now, recall the basic result of production theory
Congestion 2045
whether LRAC is constant, decreasing or increasing. and Stiglitz ), rationing, or nonlinear pricing
Combining this with ( ), we have the result that, (Wilson ); administrative costs may preclude fine-
when LRAC is constant,/? = LRA C holds at a long- tuning the fare according to distance travelled or
run optimum. This is equivalent to equality between time of day, leading to variants of Ramsey pricing
the total value of output and the total cost of output. (Mohring ); the authority may face a deficit
Since total user cost is a component of both, this constraint, requiring it to price above marginal
equality implies equality between toll receipts and social cost (Boiteux ); with
amortized capital cost. Thus, in the case of constant distortionary taxation, the social cost of financing
long-run average cost, the revenue raised from the an extra dollar of transit authority deficit may
optimal toll exactly covers the capital cost significantly exceed one dollar (Vickrey ); and the
ofproviding a road of optimal width. This is known government may choose to deviate from marginal
as the ‘self-financing’ result. It was first derived by social cost pricing to provide the public transit
Mohring and Harwitz ( ) and authority with higher-powered incentives (Laffont
subsequently generalized by Strotz ( ) (For a and Tirole ) or to achieve political objectives. These
geometric derivation, see Amott and Kraus ). considerations will also cause second-best capacity
The self-financing result extends to congestible to deviate from first-best capacity.
facilities with multiple elements of capacity, mul-
tiple dimensions of quality of service, and multiple
user groups. If a congestible facility exhibits
constant long-run average costs, provision of the
facility can be decentralized via competing ‘clubs’; See Also
competition will result in each club charging each
user a fee for use of its congestible facility equal to
the congestion externality cost he imposes, and
choosing optimal capacity.
The above theory was developed on the ► iViwi
assumption of a steady state. In the extension to
treat nonstationary dynamics, which is conceptually
straightforward, the distinction between flow
externalities and stock externalities becomes
sharper.
Bibliography
The theory relates to first-best pricing and
capacity choice when congestion is the only exter- Amott, R., and M. Kraus. 2003. Principles of transport
nality. When usage entails other externalities, such economics. In Handbook of transportation science, ed.
as pollution, firstbest pricing should take these into R. Hall, 2nd ed. Boston: Kluwer.
Atkinson, A., and J. Stiglitz. 1980. Lectures on public
account. In any policy context, additional practical
economics. New York: McGraw-Hill.
constraints that mle out attainment of the full first- Boiteux, M. 1956. Sur la gestion des monopoles publics
best allocation need to be considered. These are astreints a l’equilibre budgetaire. Econometrica 24: 22-
treated by applying second-best theory (Diamond 40.
Diamond, P., and J. Mirrlees. 1971. Optimal taxation and
and Mirrlees ). Consider, for example, the pricing
public production I: Production efficiency and II: Tax
problem facing a public transit authority. The rules. American Economic Review 61 (8-27): 261-278.
underpricing of urban auto travel may call for the Laffont, J.-J., and J. Tirole. 1993. A theory of incentives in
underpricing of mass transit (Levy-Lambert ; procurement and regulation. Cambridge, MA: MIT
Press.
Marchand ); since optimal lump-sum redistribution
Levy-Lambert, H. 1968. Tarification des services a qualite
is infeasible for informational reasons, the authority variable: Application aux peages de circulation.
may choose to sacrifice some efficiency to improve Econometrica 36: 564-574.
equity by charging lower fares to needy groups Marchand, M. 1968. A note on optimal tolls in an imperfect
environment. Econometrica 36: 575-581.
(Atkinson
2046 Conglomerates
Mohring, H. 1970. The peak-load problem with increasing particularly diverse external expansion. It has in
returns and pricing constraints. American Economic particular been used to mean a company which has
Review 60: 693-705.
Mohring, H., andM. Harwitz. 1962. Highway benefits: An
by a deliberate strategy of external growth, often
analyticalframework. Evanston: Northwestern Univer- away from declining sectors, developed a highly
sity Press. diversified product range which cannot easily be
Pigou, A. 1947. A study in public finance. 3rd ed. London: characterized in terms of a single, or well defined,
Macmillan.
Scotchmer, S. 2002. Local public goods and clubs. In
group of production technologies, a single set of
Handbook of public economics, ed. A. Auerbach and M. major competitors, or a stable place in a well
Feldstein, vol. 4. Amsterdam: North-Holland. defined industry group (Steiner 1975; Weston ).
Strotz, R. 1965. Urban transportation parables. In The public (And in the US context to have financed that
economy of urban communities, ed. J. Matgolis.
Washington, DC: Resources for the Future.
expansion with issues of paper rather than cash,
Vickrey, W. 1959. Statement on the pricing of urban street accompanied by accounting techniques for
use. Hearings: US Congress, Joint Committee on consolidating acquired companies designed to boost
Metropolitan Washington Problems, 11 Nov, 454—477. earnings per share and make future paper issues
Vickrey, W. 1969. Congestion theory and transport invest-
even more profitable (Blair ; Steiner 1975).)
ment American Economic Review Proceedings 59: 251-
260. Although diversified businesses predate World
Wilson, R. 1993. Nonlinear pricing. Oxford: Oxford Uni- War II (often in the form of financial holding
versity Press. companies in Europe and Japan) conglomerate
companies in the above sense are essentially a
postwar phenomenon, and have been associated
with the widespread adoption of decentralized
Conglomerates divisionalized management structures. In particular,
in the United States the growth of merger activity in
Alan Hughes the 1960s was dominated by diversified
acquisitions, the most spectacular of which were
associated with the emergence of a group of par-
ticularly aggressive conglomerates. For instance,
between 1961 and 1969 ITT, already a very large
The overall output of a firm may be composed of
multinational telecommunications company,
activity in more than one product market. The
acquired amongst other concerns, the largest US
growth of individual firms will be composed of
bakery, the largest US hotel chain, and the largest
changes in the scale of their activities in each of the
US house builder, the second largest US car rental
markets in which they operate and in the numbers
service and a number of large insurance and finance
of those markets. In any period these changes will
companies. Gulf and Western over a similar period
consist of horizontal expansion in the market(s) in
acquired companies in sugar, tobacco, steel, paper,
which they operated at the beginning of the period
banking, insurance and motion pictures (Blair ).
and entry into new markets; where there is a
These are extreme examples of a general longer
supplier or buyer relationship with the original
run tendency for diversification to increase in the
market then this expansion will be vertical
post-war period in all the major industrial
integration. Expansion which fits neither of these
economies (Berry ; Jacquemin and De Jong ;
categories is termed diversifying or conglomerate
Utton ). Case studies of corporate growth strategies,
expansion. Growth in any of these directions may
and estimates of levels of, and changes in,
be by the purchase of new assets (internal growth),
diversification in the 1960s and 1970s reveal that
or by the purchase of existing assets through
rapid unrelated product expansion is outweighed by
takeover or merger (external growth). Although it is
expansion based on a related product, or ‘narrow
common to refer to non-horizontal and non-vertical
spectrum’ diversification strategy
expansion as diversified, or conglomerate, the latter
term also has a more specific connotation
emphasizing
Conglomerates 2047
(e.g. outside a fairly finely defined (say 3 or 4 digit) mutual forebearance with accepted spheres of
primary product group but within a broader (say 2 influence agreed with other conglomerates. More
digit) industry, of which the primary group is a part difficult to detect may be other effects claimed to
(Wood ; Channon ; Rumelt ; Berry ; Biggadike ; arise from reductions in potential competition,
Utton ; Caves et al. ; Spruill ). Nevertheless, these where it is argued, for instance, that entry by a large
studies also suggest an increase in the importance of conglomerate may deter other likely entrants, or
unrelated product expansion and a situation has now lead to subsequent anti-competitive behaviour,
been reached where the largest companies in the which could not, or otherwise would not, occur.
major industrial countries have, by the long-term Where entry is by merger it may also be argued that
pursuit of such strategies, come to occupy leading this is at the expense of new investment in the
positions in many different industries (Shepherd ; market, either by incumbents, or the new entrant
Blair ; Utton ) and where the market in corporate itself (Markham ; Steiner ; Scherer ). Examples can
takeovers, and divisionalized management be found in the US, and in the UK and Europe, of
structures, permit the easy pursuit of further most these practices. There is little to suggest
conglomerate growth by external means (Mueller ). however that these are persistent, typical, or
This has inevitably raised questions about the pervasive features of conglomerate behaviour.
relationship between market competition and Where they have been most prominent they appear
conglomerate growth, and about the effects of to have been due at least as much to individual
conglomerate merger upon corporate performance. market power as to overall conglomerate strength
Estimating the impact of conglomerate activity (Markham ; Scherer ; Utton ).
in an industry upon levels of, and trends in, its Corresponding to claims of the anticompetitive
concentration are surrounded by empirical prob- losses which may follow from the spread of
lems. These arise from lack of precise data on the conglomeration are claims of likely benefits. Here it
market shares of individual firms in different is claimed that such firms may allocate more
industries, and on the evolution over time of those resources per unit of sales to research and
shares. Similar problems limit attempts to measure development, since the chances of utilizing spinoffs
the impact of conglomerate entry, either by merger, and unexpected findings within the organization are
or new investment. The evidence suggests that higher (Nelson ); may experience economies of
conglomerate mergers have had little impact on scope (Panzar and Willig ) and may enjoy lower
levels of, or trends in market concentration in the costs of raising capital on the stock market in
US (Goldberg , ) whilst in response to more stable earnings streams, and the
the UK and the US neither the presence of diver- reduced risk of bankruptcy that conglomerate
sified firms, nor new entry by diversification, seems spread may bring. There is no consistent evidence
to lead to increased levels of concentration. If to suggest that these effects lead to any superiority
anything the reverse seems to be the case (Berry ; in profit performance, either for the individual firm,
Utton ). Effects on competitive behaviour rather or for industries in which conglomerate firms play
than market structure are a little better documented. an important role, some studies finding positive,
Here the argument is that operating over many and others negative, effects on profit levels or
markets enhances power in each of them stability (Rhoades , ; Utton ;
individually (Edwards ). Thus it is argued that Caves et al. ; Kelly ). There is, however, some
conglomerate firms may impose reciprocal buying evidence to support the view that diversified firms
pressures upon suppliers to encourage them to use, have higher R&D inputs and patent outputs than
as inputs, the products of other divisions of the specialized firms, although there is an obvious
parent conglomerate; may employ predatory pricing problem of causation involved (Wood ; Kennedy
in newly entered markets, cross- subsidized by and Thirlwall ; Scherer ). The effects of
activities elsewhere; and practice conglomerate merger have been much
2048 Conglomerates
more extensively investigated than the impact of ). From this point of view appropriately orga-
conglomerate firms as such. Those studies for the nized conglomerates are efficient mini-capital
US which examine periods beyond the stock market markets, and represent that potent source of new
conglomerate boom years of the late 1960s show entry by mobile capital, and adjustment away from
such merging conglomerate companies either doing declining sectors which lies at the heart of the
less well, or at best as well, as other companies (or competitive adjustment process (Clifton , and, from
portfolios of shares) in terms of profitability, profit a different perspective, Weston ). Moreover, the
stability and shareholder returns. This is especially wider the spread of industries covered by any firm,
so when profitability measures are used which and the wider the threat of takeover of the
allow for the loan financing techniques used to inefficient, or sleepy, or of entry by new
build up the most spectacular conglomerate empires investment, the more forceful this argument
in the 1960s merger boom (Mueller ). Evidence for becomes. On the other hand conglomerate firms are
other countries is more fragmentary but that for the not all embracing in their industrial coverage, their
UK, for instance, suggests that conglomerate acquisitions may not be especially driven by
mergers perform relatively well in terms of industrial logic or production efficiency, and it may
profitability compared to mergers in general, though be argued that in addition they reduce the efficiency
neither outperform companies relying on internal of operation of the capital market itself, given that
growth (Meeks ; Cosh et al. ). financial reporting by large firms is notoriously
Most of this work may be set in the context of a aggregative.
static neoclassical perspective to trade off monopoly In this sense the defining characteristics of
welfare losses against efficiency gains. On this basis conglomerates are to be found in their internal
it would be hard to mount a significant case for or financial and administrative arrangements, which
against conglomerates. To look at the problem in have in principle freed them from the particular
this way is however, to distract attention from what constraints of individual product markets and pro-
may be regarded as more fundamental questions duction technologies. They represent the latest
about the working of the economic system as a boundary between organization and the market
whole. There are three interrelated issues here. (Coase ) and as such fit uneasily into any generally
What are the comparative advantages of organizing accepted model of the workings of the macro or
economic activity on the basis of interfirm market micro economy as a whole.
processes as opposed to intrafirm administrative and
organizational processes, what is the impact of the
spread of conglomeracy upon the flow of economic See Also
resources between alternative prospective uses, and
how does diversification and the spread of
conglomeracy affect the flow of information upon
which a market economy is based.
On one view of the world the growth of
divisionalized conglomerate companies may be Bibliography
regarded, in many instances, as a superior resource
allocation mechanism to interfirm market Berry, C.H. 1975. Corporate growth and diversification.
Princeton: Princeton University Press.
transactions. Internal administrative allocative Biggadike, E.R. 1979. Corporate diversification entry
decisions to move resources between the company’s strategy and performance. Boston: Harvard University,
individual markets and divisions, it is argued, are Graduate School of Business.
based upon more and better information than that Blair, J.M. 1972. Economic concentration. New York:
Harcourt Brace Jovanovich.
available to outsiders in the stock market Caves, R.E. 1980. Industrial organisation, strategy and
(Williamson , ; Chandler , structure. Journal of Economic Literature 18(1): 64-92.
Caves, R.E., et al. 1980. Competition in the open economy.
Cambridge, MA: Harvard University Press.
Conjectural Equilibria 2049
Chandler, A. 1962. Strategy and structure: Chapters in the in 1963. Review of Economics and Statistics 55(2): 146-155.
history of the industrial enterprise. New York: Anchor Rhoades, S.A. 1974. A further evaluation of the effect of
Books. diversification on industry profit performance. R e v i e w
Chandler, A. 1977. The visible hand: The managerial revolution o f E c o n o m i c s a n d S t a t i s t i c s 56(4): 557-559.
in American business. Cambridge, MA: Harvard University Rumelt, R.P. 1974. S t r a t e g y structure and
Press. economic performance. Boston: Harvard
Channon, D.F. 1973. The strategy and structure of British Graduate School of Business.
enterprise. London: Macmillan. Scherer, F.M. 1979. I n d u s t r i a l m a r k e t s t r u c t u r e
Clifton, J.A. 1977. Competition and the evolution of the a n d e c o n o m i c p e r f o r m a n c e , 2nd ed. Chicago:
capitalist mode of production. C a m b r i d g e J o u r n a l Rand McNally, 1980.
o f E c o n o m i c s 2: 137-151. Shepherd, W.G. 1970. M a r k e t p o w e r a n d e c o n o m i c
Coase, R.H. 1937. The nature of the firm. E c o n o m i c a 4 : w e l f a r e . New York: Random House.
386-405. Spruill, C.R. 1981. C o n g l o m e r a t e s and the
Cosh, A., A. Hughes, and A. Singh. 1980. The causes and e v o l u t i o n o f c a p i t a l i s m . Carbondale: Southern
effects of takeovers in the United Kingdom. In T h e Illinois University Press.
d e t e r m i n a n t s a n d e f f e c t s o f m e r g e r s , ed. Steiner, P.O. 1976. M e r g e r s : M o t i v e s , e f f e c t s ,
D.C. Mueller. Cambridge, MA: O.G.H. Publishers. p o l i c i e s . Ann Arbor: University of Michigan Press.
Edwards, C.D. 1955. Conglomerate bigness as a source of Utton, M.A. 1979. Diversification and
power. In B u s i n e s s c o n c e n t r a t i o n a n d p r i c e c o m p e t i t i o n . Cambridge: Cambridge University
p o l i c y , ed. G. Stigler. New York: National Bureau for Press.
Economic Research. Weston, J.F. 1980. Industrial concentration, mergers and
Goldberg, L.G. 1973. The effect of conglomerate mergers on growth. In Mergers and economic
competition. J o u r n a l o f L a w a n d E c o n o m i c s e f f i c i e n c y , vol. I. Washington, DC: U.S. Government
16(1): 137-158. Printing Office.
Goldberg, L.G. 1974. Conglomerate mergers and concen- Williamson, O.E. 1975. Markets and hierarchies: Analysis and
tration ratios. R e v i e w o f E c o n o m i c s a n d anti-trust implications. New York: Free Press.
S t a t i s t i c s 56(2): 303-309. Williamson, O.E. 1985. T h e e c o n o m i c i n s t i t u t i o n s
Hughes, A., and A. Singh. 1980. Mergers concentration and o f c a p i t a l i s m . New York: Free Press.
competition in advanced capitalist economies: An Wood, A.J.B. 1971. Diversification, merger and research
international perspective. In D e t e r m i n a n t s a n d expenditure: A review of empirical studies. In T h e
e f f e c t s o f m e r g e r s , ed. D.C. Mueller. Cambridge, c o r p o r a t e e c o n o m y , ed. R. Harris and A.J.B.
MA: O.G.H. Publishers. Wood. London: Macmillan.
Jacquemin, A.P., and H.W. De Jong. 1977. E u r o p e a n
i n d u s t r i a l o r g a n i s a t i o n . London: Macmillan.
Kelly, M. 1980. The effects of diversification on market
structure and monopoly power. In M e r g e r s a n d Conjectural Equilibria
e c o n o m i c p e r f o r m a n c e , ed. K. Cowling et al.
Cambridge: Cambridge University Press. F. H. Hahn
Kennedy, C., and A.P. Thirlwall. 1972. Technical progress: A
survey. E c o n o m i c J o u r n a l 82: 11-72.
Markham, J.W. 1973. C o n g l o m e r a t e e n t e r p r i s e
a n d p u b l i c p o l i c y . Boston: Harvard Graduate
School of Business. Abstract
Meeks, G. 1977. Disappointing marriage.
In imperfectly competitive economies, agents
Cambridge: Cambridge University Press.
Mueller, D.C. 1969. A theory of conglomerate mergers. must take note of the effects of their decisions
Q u a r t e r l y J o u r n a l o f E c o n o m i c s 83(4): 643- on the market environment. Such effects, being
659. uncertain, are the subject of conjecture. Even if
Mueller, D.C. 1977. The effects of conglomerate mergers. A
conjectures are not derivable from some first
survey of the empirical evidence. J o u r n a l o f
B a n k i n g a n d F i n a n c e 1(4): 315-347. principles of rationality, conjectural theories are
Mueller, D.C. (ed.). 1980. The determinants and effects of of interest because they attempt a general
mergers: An international comparison. New York: O.G.H. equilibrium analysis of non-perfect competition.
Publishers.
The conjectural approach takes proper and
Nelson, R. 1959. The simple economics of basic scientific
research. J o u r n a l o f P o l i t i c a l E c o n o m y 67: explicit note of the perceptions by individuals of
297-306. their market environment; it is possible that
Panzar, J.C., and R.D. Willig. 1981. Economies of scope. what is the case may depend on what agents
American Economic Review: Papers and
believe to be the case.
P r o c e e d i n g s 71(2): 268-272.
2050 Conjectural Equilibria
with elements a («,, a,). Then a conjecture Cj is a conjectures are less arbitrarily imposed on the theory
map from A x A, to AT written as not much has been gained - almost any pair of
actions could be a conjectural equilibrium. A first
Ci = 6i(a, a')- attempt to remedy this is to ask that conjectures be
correct (rational). If that is to succeed in any simple
Its interpretation is this: given the actions ofthe fashion it will be necessary to suppose that each
two agents (a), C, is the action of j conjectured by i agent has a unique best action under this conjecture.
to be result from his choice of <1 ,. (In a more This is very limiting and it means that some of the
general formulation the conjecture can be a prob- classical duopoly problems cannot be resolved in
ability distribution but that is not considered here.) this way.
We require conjectures to be consistent'. Let the status quo again be a*. Then if 0j and 02
are correct conjectures it must be that
6i(a, at) = aj ( 1)
V 2{e* 1 (a*,a 2 ),e* 1 [(a u a* 1 ) ; e* 2 (a*,a 2 )\}
This says that if agent i continues in his action at x > vijflj, 6'1 [(aj,a 2 ), d\\ } all flj = A2.
then he conjectures that j will do likewise. (This use (3)
of the word ‘consistent’ is not that of Bresnahan ,
and others who use it to mean ‘correct’.) vi { 6 * 2 ( a * , a 2 ) , [(aj, a 2 ) , 0*2{a*, a2)] }
Suppose now that there is a function v from A
to R2, written as v(a) = \\’\{a), v2(a)], which gives > v i | a s , 0 [ ( a [ , a 2 ), a j | a l l f l j = A i. ( 4 )
the payoffs to the agents as a function of their joint
action a. Consider a* to be one such joint action.
A rational conjectural equilibrium is then a con-
One says that a* is a conjectural equilibrium for the
jectural equilibrium a* (with conjectures 9\(j, 0*2 (•)
two agents if
which satisfy ( ) and ( )). It must be re-emphasized
that such an equilibrium may not exist for some A
v1 [«/, 6j(a*,aj)\ < Vi[a*,6i(a*)] all a; eA,-,/ =
1,2 and v (see Gale ; Hahn )•
However, the idea is simple and, where appli-
(2)
cable, coherent. It has however been criticized (in a
That is, the joint action a* is a conjectural equi- somewhat intemperate and muddled paper) by
librium if no agent, given his conjecture, believes Makowski ).This criticism appears to have had some
that he can improve his position by deviating to a appeal to some game theorists who like to think of
different action. games in extensive form (which they sometimes like
It is not the case that conjectural equilibrium, as to call dynamic). The criticism is this: when agent 1
defined, always exists. For instance in the case of a deviates from a* he is interested in the payoffs
duopoly in a homogeneous product where the action which he will get given this deviation and agent 2’s
is ‘setting the price’, v may not be concave and a response. This payoff Makowski thinks of as
sensible conjecture may have discontinuities. One accruing in the ‘period’ after agent l’s deviation. But
thus needs special assumptions to ensure existence when agent 2 responds in that period he is interested
or one must face the possibility that agents do not in this payoff in the period following this response.
chose actions but probability distributions over So the agents expect ‘the game to end’ in different
actions (mixed strategies); for example, Kreps and periods (Makowski , p. 8). Moreover, after agent 2
Wilson ( ) in has responded, agent 1, in his turn, will again want
their work on sequential equilibrium employ con- to respond, that is, deviate from the deviation he
jectures which are probability distributions. started with. This criticism is then illustrated with an
Supposing that a conjectural equilibrium exists, example in which one agent
one may reasonably argue that until
2052 Conjectural Equilibria
expects the other to return to the status quo after he > u u(0/)
*($)'*{* yj
has deviated from it. (6)
All of this, however, is wrong. Firstly, if one i,j=l,2,6le6.
wants to give a time interpretation to conjectures
and so forth, then actions must be thought of as This is just a Nash equilibrium where conjectures
strategies. That is, the deviating agent deviates in are interpreted as strategies (Hart ).
one or more elements of his plan over the whole While this is still quite demanding, it is signif-
length of the game (perhaps infinite). Under correct icantly weaker than ( ). If equilibria exist they may
conjectures responses and counter-responses are be ‘bootstrap equilibria’, that is, they will depend on
taken into account in evaluating the benefits of beliefs about the actions of others, which beliefs
deviation. Hence, and secondly, a deviating agent is may be incorrect. There is certainly no ground for
in this situation never surprised by the response of believing that they will be efficient.
the other, which therefore does not lead him to One can go one step further in the direction of
further revise his deviation. On the definition, agent plausibility by requiring that conjectures be rea-
1 expects the response to his deviation to be Oja*, sonable only for small, or infinitesimal, deviations
ax). Suppose this gives a2 which is correct. Then that from the status quo. After all, large experiments are
agent knows that the new status quo will be (ai_ a f) likely to be costlier than small ones. This will allow
= a and if he has calculated benefits correctly he a larger class of reasonable conjectures and
will not wish to deviate again. equilibria.
However, there is the following to be said in
favour of Makowski’s criticism. Deviations in General Conjectural Equilibrium
strategies may not be observable by the other agent.
It is fair to say that at present general equilibrium
Therefore in traditional duopoly models with a
theory is in some way complete only for a perfectly
sequential structure the re-interpretation of actions
competitive economy, that is, one where the returns
as strategies may be inappropriate. There is some
to an individual agent are just equal to the contri-
evidence that in the duopoly literature with
bution which he makes (Makowski ; Ostroy ). In
conjectures the consequent difficulties have not
general (although there are exceptions) such an
always been appreciated. It is also the case that too
economy exists when it is large (for example, it
little attention has been paid to the assumption of a
consists of a non-atomic continuum of agents). But
unique best response on which the above
there is now another possibility: an economy can be
formulation depends.
perfectly competitive if agents conjecture that their
An alternative to rational conjectures are rea-
market actions will have no effect on the prices at
sonable conjectures (Hahn ). A conjecture is
which they can trade.
reasonable if acting on any other conjecture would
The following assertion will be clear from what
lower profits given the conjectures of other firms.
has already been discussed. Let us say that an
Suppose that 6 is the set of all possible consistent
economy is intrinsically perfectly competitive if it
conjectures. For any 9, c d; assume that there is a
satisfies the no-surplus condition. Then perfectly
unique optimizing choice of output by firm i of
competitive conjectures are rational if an economy
yjOj. Then z’s conjecture (fj c 9 is reasonable if
is intrinsically perfectly competitive. But perfectly
given/th conjecture Of.
competitive conjectures can be reasonable even
when the economy is not intrinsically perfectly
competitive. That is, conjectures may be such that,
if an agent acts on any conjecture other than the
v, a i i ei e e.
perfectly competitive one, his profits will be lower.
(5) For instance, this may even be the case for two
duopolists with constant marginal costs whose
But then a reasonable conjectural equilibrium is a conjectures refer to the price
pair (dp d!}) each in 0 such that
Conjectural Equilibria 2053
charged by the rival firm. It will also be clear that if Given any y £ F we think of each household as
we do not require conjectures to be either rea- endowed with a certain strictly positive stock of
sonable or rational then, in general, conjectures can non-produced goods and 0;,;T; of the production of
be found to support a competitive equilibrium in an firm j. To avoid unnecessary complications assume
economy which is not intrinsically perfectly 0;,; (j = 1, ... , n). to be such that if zh is the stock of
competitive. non-produced goods owned by household h:
In a general equilibrium context it is not clear
what it is that firms are supposed to conjecture. In For ally eF : zh + e'vyj ^ 0 aH h- (7)
some sense the conjecture must refer to the reaction j
of the whole economy to the action of the
conjecturing agent. In other words, it is not obvious Households consume both types of goods. Hence
how to define a game which adequately represents for any y e F there is now an associated pure
the economy. But in what sense? exchange economy where each household’s
Consider an economy with n produced goods endowment is given by ( ). Making the usual
and m non-produced goods. For simplicity suppose assumptions there will exist at least one equilibrium
that all firms are single-product firms and that all \p(y), w(y)]. Suppose for the moment that there is
only one for each y e F.
firms producing the same good are alike, including
Now firm j in this equilibrium observes \p(y),
their conjectures. There are very many households
w(y)] and will deviate from y} (if it deviates at all) if
whose reasonable conjectures are always the
it can thereby increase its conjectured profits. Let
competitive one. Households receive the profits of
firms. Since the action of any one firm can affect the
prices at which households can trade it is not at all
clear what it is in the households’ interest that the
firms should maximize (Gabszewicz and Vial ). If p{y)My)Jj
all households are alike it could be their common
utility function, but that seems far removed from the
be the conjectural profit function of firm j. Then y°,
world. I shall arbitrarily assume that firms maximize
p(y°),w(y°) is a conjectural equilibrium if for ally =
then- profits in terms of one of the non-produced
1,. •n:
goods, say the first. This is arbitrary but it seems to
me equally dubious to suppose that firms always
choose in the ‘best interests of shareholders’, p(y°),w(y°),y0
especially when that interest is often difficult and (8)
x > 7ij ' p ( y ° ) My ° ) , y j all y,eF.
sometimes impossible to define.
Let p e R", w e R™ 1 be the price vectors in
terms of good m of produced and non- produced Such a conjectural equilibrium will exist if all
goods respectively (so wm= 1). Let yk G Yj c R"+m be TIJ(-) are quasi-concave, an assumption for which
the production of firm j where y,; > 0 is its output of there is scant justification (Hahn ).
good j, ytl < 0 is an input of good i, produced or non- If we demand that conjectures be rational then
produced. Let y Ly;-, where yy C_ all j. Let z c conjectured and actual profit must coincide for all
R™ be the endowment yt(thc two coincide for yk = yk by the requirement
of non-produced goods and that conjectures be consistent). One proceeds as
follows. Let yk y{)k . Given the conjectures of the
remaining firms find the conjectural equilibrium of
F = {y| y > (0, -z)} the economy p{y*(k), w\y*(k), y*(kj\), where y(k) is
the vector y with yk in the Ath place and condition
so that F is the set of feasible net production vectors ( ) is not imposed for firm k. One then requires that
Y. Let 0h/ be the share of household h in firm/. for allyk> 0
2054 Conjectural Equilibria
equilibrium (that is, that the conjecture be infini- constraints on others. For instance, unemployment
tesimally or ‘first order’ rational). It turns out that statistics do not tell us whether workers have chosen
this extra requirement does not much restrict con- not to work or whether they are constrained in their
jectures, nor thus the set of equilibria which can be sale of labour. None the less, this objection has
generated by some conjectures. The reason roughly some force and needs further study with proper
is this: in conjectural equilibrium, when conjectured attention to the information of agents.
profit functions are twice differentiable, the partial The other objection is that these kinked con-
derivative of the conjectured profit function of firm jectures are not explained. That is tme if explanation
j with respect to its own output much vanish. turns on what an agent knows or can learn. None the
Suppose the economy to be in such an equilibrium less, the hypothesis seems to be to have
and consider an infinitesimal output deviation by psychological verisimilitude. If I can always sell my
firm k. To find the equilibrium which ensues, labour at the going wage there is little occasion for
differentiate all equilibrium relations, other than that the difficult conjecturing of what would happen if I
for firm k, with respect to the output of firm k. raised my wage. This is not so if I find that I cannot
Amongst these will be the condition that the find employment at the going wage.
marginal profit conjectured of every firm (other than In any event these simpler approaches allow one
k), be zero. Hence differentiation of that condition to incorporate traditional monopolistic competition
will yield second-order terms. But we can choose in a general equilibrium framework. Of course,
these arbitrarily since we are requiring only first- some of the assumptions such as concave
order rationality. One can show in fact that these conjectured profit functions are strong. On the other
second-order terms can be chosen so as to make the hand, one can now allow for a certain amount of
first-order conjectured change in profit of any firm k increasing returns (Silvestre ).
correspond to the actual change. (Details in Hahn .)
Hence first-order rationality imposes few
restrictions. Some Conclusions
Both Hahn ( ) and Negishi ( ) have
also considered kinked conjectures. The idea is this. The conjectural approach has this merit: it takes
If an agent can transact at the going price as much proper and explicit note of the perceptions by
as he desires his conjectures are competitive. If he is individuals of their market environment. Economic
quantity constrained (for example, if a firm cannot theory perhaps too often neglects the possibility that
sell an amount determined by equality between what is the case may depend on what agents believe
marginal cost and price) his conjectures are non- to be the case. Historians and others have long since
competitive. That is, he considers that a price studied the intimate mutual connection between
change is required to relax the quantity constraint. beliefs and events but economists have not made
The fixprice methods of Dreze ( ) and much headway here. The conjectural approach is
others can be interpreted as an extreme form of such perhaps a small beginning. For it deals with the
conjectures - for instance to relax a constraint on theories agents hold and this must plainly enter into
sales, price, it is conjectured, must be reduced to our theory of agents.
zero. In particular one should not pay too much
To such conjectures there have been two objec- attention to the objection that conjectures may not
tions. Firstly, they assume that an agent’s conjec- be derivable from some first principles of
tures are not influenced by constraints on others. For rationality. It seems to me quite proper to find their
instance, a firm which can hire as much labour as it description in history. Nor, as has been argued, will
wants at the going wage while workers cannot sell an appeal to learning render conjectures in some
as much as they like does not conjecture that it sense objectively justifiable. This is clear from the
could have the same amount of labour at a lower discussion of reasonable
wage. To this one can answer that it is not easy for
an agent to observe the quantity
2056 Conspicuous Consumption
conjectures and from the costs of experimentation. Gabszewicz, J.J., and J.D. Vial. 1972. Oligopoly ‘a la
For hundreds of years witches were burned in the Cournot’ in general equilibrium analysis. Journal of
Economy Theory 4: 381-400.
light of a reasonable theory which few would now Gale, D. 1978. A note on conjectural equilibria. Review of
regard as having proper objective correlatives. Economic Studies 45 (1): 33-38.
There is no reason to suppose that it is possible for Hahn, F.H. 1977. Exercise in conjectural equilibria.
businesses or governments now to do better than Scandinavian Journal of Economics 79: 210-226.
Hahn, F.H. 1978. On non-Walrasian equilibria. Review of
some of the best minds of the past. Economic Studies 45: 1-17.
From a more immediately relevant standpoint, Hart, O. 1982. Reasonable conjectures, Theoretical eco-
conjectural theories are of interest because they nomics paper no. 61. STICERD, London School of
attempt a general equilibrium analysis of nonperfect Economics.
Kreps, D.M., and R.B. Wilson. 1982. Sequential equilibria.
competition. It is good to know that in a proper Econometricas 50: 863-894.
sense perfectly competitive economies can be Makowski, L. 1980. A characterisation of perfectly com-
viewed as limiting Cournot conjectural equilibrium petitive economies with production. Journal of Eco-
economies (Novshek and Sonnenschein ). But this nomic Theory 22: 208-221.
Makowski, L. 1983. 'Rational conjectures' aren't rational
knowledge does not contribute to the study of
and 'reasonable conjectures ' aren't reasonable , Dis-
properly imperfectly competitive economies. Again cussion paper no. 60. SSRC Project on isk,
the study of fixprice equilibria has borne some Information and Quantity Signals, Cambridge
fruits, but not those which were first sought by University.
Negishi, T. 1960. Monopolistic competition and general
Trillin ( ) when he proposed a
equilibrium. Review of Economic Studies 28: 196-202.
framework for general equilibrium with monopo- Negishi, T. 1979. Micro-economic foundations of Keynesian
listic competition. If it is the case that actual macro-economics. Amsterdam: North- Holland.
economies are not perfectly competitive nor that Novshek, W., and H. Sonnenschein. 1978. Cournot and
Walras equilibrium. Journal of Economic Theory 19:
they behave ‘as if’ they were, then the task set by
223-266.
Triffin requires serious attention, and it is likely that Ostroy, J. 1980. The no-surplus condition as a characteri-
con jectural theories will have a role to play. sation of perfectly competitive equilibrium. Journal of
Recent developments in game theory (for Economic Theory 22: 183-207.
Silvestre, J. 1977. A model of general equilibrium with
example, Kreps and Wilson ) suggest that these two
monopolistic behavior. Journal of Economic Theory 16:
conjectures will have to play a part. Indeed, quite 425-442.
generally in that theory players conjecture that their Triffin, R. 1940. Monopolistic competition and general
opponent is ‘rational’ in an appropriate sense. It is equilibrium theory. Cambridge, MA: Harvard Univer-
sity Press.
not the case that the conjectural equilibrium
Ulph, D. 1983. Rational conjectures in the theory of oli-
approach is an alternative to the game theoretic one. gopoly. International Journal of Industrial Organization
1 (2): 131-154.
See Also
Conspicuous Consumption
Bibliography F. Stankovic
Arrow, K.J. 1959. Toward a theory of price adjustment.
In M. Abramovitz et al., The allocation of economic
resources. Stanford: Stanford University Press.
Bresnahan, T.F. 1981. Duopoly models with consistent
Keywords
conjectures. American Economic Review 71: 934—945.
Dreze, J. 1975. Existence of equilibrium under price rigid- Conspicuous consumption; McCulloch, J. R.;
ity and quantity rationing. International Economical Rae, J.; Say, J.-B.; Scarcity; Smith, A.; Vanity;
Review 16: 301-320. Veblen effect; Veblen, T
Constant and Variable Capital 2057
part of capital advanced in wages (Marx Vol. I, ch. relationship between capitalists and labourers. In
6). These definitions come from his concept of fact definition ( ) can be rewritten as
Value: he defined the value of commodities as the
amount of labour directly and indirectly necessary
to produce commodities (Vol. I, ch. 1). In other
words, the value of commodities is the sum of C
and N, where C is the value of the means of C/N reflects the character of the process of pro-
production necessary to produce them and N is the duction and N/V reflects the class relationship
amount of labour used that is directly necessary to between capitalists and labourers. C/N is the ratio of
produce them. The value of the capital advanced in the amount of labour necessary to produce the
the means of production is equal to C. means of production to the amount of labour
However, the value of the capital advanced in directly bestowed, which is completely determined
wages is obviously not equal to N, because it is the by the material condition in the process of
value of the commodities which labourers can buy production, while N/V is the ratio of the amount of
with their wages, and has no direct relationship with labour which labourers actually expend to the
the amount of labour which they actually expend. amount of labour that is necessary in order to
Therefore, while the value of the part of capital that produce commodities which labourers can purchase
is advanced in the means of production is with their wages. If labourers are forced to work
transferred to the value of the products without longer with less wages, this ratio must rise.
quantitative change, the value of the capital Marx proposed to call the value-composition of
advanced in wages undergoes quantitative change in capital, insofar as it is determined by the material
the process of transfer to the value of the products. condition of the process of production, ‘the organic
This is the reason why Marx proposed the composition of capital’. More explicitly, ‘The
definitions of constant capital C and variable capital value-composition of capital, inasmuch as it is
V. determined by, and reflects, its technical com-
The definition of constant capital and variable position, is called the organic composition of
capital must not be confused with the definition of capital’ (Capital, Vol. Ill, ch. 8). However, as
fixed capital and liquid capital. Fixed capital is a shown above, the value composition of capital is not
part of constant capital which is totally used in determined by the material condition of the process
production process but transfers its value to prod- of production alone. So it is better to introduce the
ucts only partially. Liquid capital is a part of ratio C/N in the place of the organic composition of
constant capital which is totally used up and trans- capital, which is determined only by the material
fers its whole value within one production process. condition in the process of production. In order to
So constant capital is composed of both fixed avoid confusion, I call this ratio the ‘organic
capital and liquid capital, and on the other hand composition of production’. This is the ratio of dead
liquid capital belongs partly to constant capital and labour to living labour, which Marx himself
partly to variable capital. frequently used in Das Kapital.
Marx introduced the concept ‘value-
composition of capital’, /t, which is defined as the
ratio of constant capital C to variable capital V: Variable Capital and Source of Profit
B (10)
h > aiti + x2 — t2
u>o (11)
T > Bt2.
(1 3)
N>V (14)
a2T\B
x2B.
(1 - ) Organic Composition and Production
Price
Inequality ( ) is rewritten as
The concept of organic composition of capital (19 )
plays an important role in Marx’s analysis of prices.
T>B The price of production (Ricardo’s ‘natural
price’) that gives every industry the equal rate of
By ( ), ( ) and ( ) the above becomes profit is determined by the following equations:
( 20 )
' ' Pl = (1 + r)(alPl + n w) (24)
T > Bt2.
[] = ( h w ~ P i ) ( — A>0. (30)
\T1 T2 /
fi| P-AppaAf-l.
(3i)
t2<P2 TI>T2
r
where suffixes t, t + 1 denote periods.
= JTv e = s/v (35
> The RHS of ( ) is an increasing function of e. If
we take the limiting value as e tends to infinity, we
where e is the rate of exploitation.
have
He asserted that if the organic composition of
capital n increases sufficiently then the rate of profit
r must inevitably decrease. This is the famous Taw 1
rt+1 - t , <----------r,.
of the tendency for the rate of profit to fall’ '7+1
(Capital, Vol. Ill, Chap. 13).
Many people have criticized this theorem. They Therefore we conclude, if vt+\ > 1 !ru then
have said that if the rate of exploitation e increases rt+\ - r t < 0 .
sufficiently, r may increase in spite of the increase The above reasoning can be restated. The
of p So r does not necessarily decrease, even if ji reciprocal of the organic composition of production
increases sufficiently (Robinson ; Sweezy ). Such a sets an upper limit to the rate of profit, because
critique overlooks the logic of Marx’s argument.
Marx stated:
5 ^S + V N
(38)
c+vK c ~c
Constant and Variable Capital 2063
From ( ), ( ), ( ), we know
Many people thought that even if C/N rises amount of employed labourer, Nt + i must inevi-
sufficiently, still if constant capital C also increases tably become less than Nt, however high the rate of
then the absolute level of employment can also accumulation of capital may be (Okishio ). The rate
increase, though less than proportionately to of accumulation of capital AC/C itself is bounded
constant capital (Oppenheimer ). But by reasoning by the reciprocal of the organic composition. From (
similar to that used for ‘the tendency of the rate of ) and ( )
profit to fall’, we can prove that if organic
composition rises sufficiently, then the absolute AC N _ 1
level of employment must actually decrease. ~C< V
The organic composition of production in the fth so that, because it is reasonable to assume that the
period vt is defined as growth rate of labour supply is non-negative, we
can say that if the organic composition rises
sufficiently the rate of unemployment inevitably
rises. Though Marx did not state this explicitly, we
think that this is what he wanted to say.
In analysing Marx’s theorem on the movement
The accumulation of constant capital AC = Ct+1 - C, of the rate of profit and employment, we have
is financed from surplus value S. accepted his central assumption that the organic
composition of production rises sufficiently over
C,+1 — C t < S t . (46) time. However, there arises the problem: under
what conditions do capitalists choose techniques
The surplus value A is a part of the amount of living
that have sufficiently high organic compositions of
labour which labourers expend
production?
Marx seemed to think that the rise in labour
S, < Nt. (47)
productivity and the rise in the organic composition
are two aspects of the same thing. But these two do
By ( ), we obtain, not always go together. Marx himself knew that if
labour productivity in the means of production
sector rises very high then even if technical
Af/+i — N t —- - -C,+i------C t
V/+1 v, composition rises, still the value composition may
-----(C/+i — C , ) + C , (------- remain constant or decrease.
V/+1 \V, | | v , As to the capitalists’ introduction of new tech-
niques we have the following propositions:
From ( ) and ( ) we get (1) If the real wage rate remains constant and
capitalists introduce new techniques which
N , + i - N , < — S t + C A — --------- raise the rate remains of profit (calculated at the
V,+ l \V/+1 V , current prevailing prices and wage) then the
N, ^ ( 1 1
<--------Y C , I----------- new general rate of profit does not decrease,
V/+1 \v,+i v, whatever the organic composition may be.
= —!—(1 + v,-v(+1)0 (2) If the real wage rate rises and capitalists adapt
V/+1V, to this situation with the introduction of new
techniques, then the new general rate of profit
we can say, if (1 + v, - v, +1) < 0 then Nt+l - Nt < 0.
Therefore, if the organic composition of production
in the t + 1th period, vt+i, increases sufficiently so as
to exceed 1 + vt, then the
Constitutional Economics 2065
does is higher than the one which would be Sweezy, P.M. 1942. The theory of capitalist development:
expected if such a new technique were not Principles of Marxian political economy. New York:
Oxford University Press.
introduced.
Constitutional Economics
James M. Buchanan
See Also
the choices and activities of economic and political agents is incorporated in the analysis of alternative
agents, the rules that define the framework within sets of constraints.
which the ordinary choices of economic and
political agents are made. In this sense, con-
stitutional economics involves a ‘higher’ level of Constitutional Economics and
inquiry than orthodox economics; it must incor- Classical Political Economy
porate the results of the latter along with many less
As suggested, Constitutional Economics is related
sophisticated subdisciplines. Normative consider-
to classical political economy and it may be con-
ations enter the analysis in a much more complex
sidered to be an important component of a more
manner than through the artificially straightforward
general revival of the classical emphasis, and par-
efficiency criteria. Alternative sets of rules must be
ticularly as represented in the works of Adam
evaluated in some sense analogously to ranking of
Smith. (The closely related complementary com-
policy options within a specified institutional
ponents are discussed briefly in section “
structure, but the epistemological content of the
”.) One obvious aim of the
‘efficiency’ criteria becomes more exposed.
classical political economists was to offer an
The constitutional economist, precisely because
explanation and an understanding of how markets
the subject matter is the analysis of alternate sets of
operate without detailed political direction. In this
rales, has nothing to offer by way of policy advice
respect, orthodox neoclassical economics follows
to political agents who act within defined rales. In
directly in the classical tradition. But the basic
this sense, constitutional economics is not
classical analysis of the working of markets was
appropriately included within ‘policy science’ at all.
only a necessary step toward the more compre-
At another level, however, the whole exercise is
hensive purpose of the whole exercise, which was
aimed at offering guidance to those who participate
that of demonstrating that, precisely because mar-
in the discussion of constitutional change. In other
kets function with tolerable efficiency indepen-
words, constitutional economics offers a potential
dently of political direction, a powerful normative
for normative advice to the member of the
argument for constitutional structure exists. That is
continuing constitutional convention, whereas
to say, Adam Smith was engaged directly in
orthodox economics offers a potential for advice to
comparing alternative institutional structures,
the practising politician. In a real sense,
alternative sets of constraints within which
constitutional economics examines the choice of
economic agents make choices. In this comparative
constraints as opposed to the choice within
analysis, he found it essential to model the working
constraints, and as this terminology suggests, the
properties of a non-politicized economy, which did
disciplinary attention of economists has almost
not exist in reality, as well as the working properties
exclusively been placed on the second of these two
of a highly politicized mercantilist economy, which
problems.
could be directly observed.
A preliminary illustration of the distinction may
There is no need here to enter the lists on either
be drawn from the economics of monetary policy.
side of the ‘ideas have consequences’ debate. We
The constitutional economist is not directly
know that the economy of Great Britain was
concerned with determining whether monetary ease
effectively de-politicized in the late 18th and early
or monetary restrictiveness is required for furthering
19th centuries, and from the analysis of Smith and
stabilization objectives in a particular setting. On
his classical fellow travellers there emerged both
the other hand, he is directly concerned with
positive understanding of economic process and
evaluating the properties of alternative monetary
philosophical argument for a particular regime. The
regime (e.g. rale-directed versus discretionary, fiat
normative argument for laissez faire was, perhaps
versus commodity standards). The ultimate
inevitably, intermingled with the positive analysis
objective of analysis is the choice among the
of interaction within a
institutions within which political agents act. The
predicted behaviour of these
Constitutional Economics 2067
particular structure of constraints, essentially those prosperity? This central question of social philos-
that describe the minimal, protective, or night- ophy requires continuing contributions from many
watchman state. Economics, as a social science, specialists in inquiry, surely including those of the
emerged, but in the process attention was diverted constitutional economists. By their focus directly on
from the institutional structure. Even the predicted the ultimate selection of a set of constraining rules
normative reaction against the overly zealous within which ordinary social interaction takes place,
extension of the laissez faire economics argument constitutional economists remove themselves at
was couched in ‘market failure’ terms, rather than least one stage further from the false position of
in the Smithian context of institutional comparison. ‘social engineer’ than their counterparts in orthodox
The early socialist critique of market order, both in economics. Precisely because there is no apparently
its Marxist and non-Marxist variants, was almost simple evaluative criterion analogous to ‘allocative
exclusively negative in that it elaborated putative efficiency’ at hand, the constitutional economist is
failures of markets within an unexamined set of less tempted to array alternatives as if an
legal-political rules while it neglected analysis of unexamined criterion commands universal assent.
the alternative rules that any correction of the The artificial abstraction of ‘ social utility’ is likely
alleged failures might require. Only with the to be less appealing to those who concentrate on
debates on socialist calculation in the decades prior choices among constraints than to those who
to World War II did the issues of comparative examine choices within constraints.
structure come to be examined. If, however, there is no maximand, how can
It was only in the half-century after these ultimate normative consequence emerge? In this
debates that political economy, inclusively defined, respect, one contribution lies at the level of positive
returned, in fits and starts, to its classical tradition. analysis rather than in a too-hasty leap into
Given the legal order of the protective state (the normative evaluation. Classical political economy
protection of property and the enforcement of contains the important principle of spontaneous
contracts ), we now know that under some coordination, the great discovery of the 18th cen-
conditions ‘markets fail’ when evaluated against tury. This principle states that, within the legal
idealized criteria, whether these be ‘efficiency’, umbrella of the minimal state and given certain
‘justice’, or other abstract norms. We also know conditions, the market ‘works’. Even if in the
that ‘politics fails’ when evaluated by the same principle’s modern embellishment we must add
criteria. Any positive analysis that purports to be of ‘warts and all’, we still have come a long way
use in an ultimate normative judgment must reflect toward a more comprehensive understanding of the
an informed comparison of the working properties alternatives for social order. To economics the
of alternative sets of rules or constraints. This extent that his efforts expand the public under-
analysis is the domain of Constitutional Economics. standing of this principle, in application to all
institutional settings, the constitutional economist
remains under less apparent compulsion to advance
his own privately preferred ‘solutions’ to the
ultimate choice among regimes.
Classical political economy emerged from moral Care should be taken not to claim too much for
philosophy, and its propounders considered their Constitutional Economics, especially if a narrow
efforts to fall naturally within the limits of philo- definition is used. As noted earlier, this research
sophical discourse. As a modem embodiment, programme, by designation, emerged in the 1970s
Constitutional Economics is similarly located, to describe efforts at analysing the effects of
regardless of disciplinary fragmentation. How can
persons live together in liberty, peace and
2068 Constitutional Economics
alternative sets of rales, as opposed to analyses of closely parallel Constitutional Economics, with, of
choices made within existing and unexamined course, an historical rather than a comparative
structures. In a more comprehensive overview of emphasis (North and Thomas ).
developments after World War II, Constitutional
Economics takes its place among an intersecting set
of several research programmes, all of which have Presuppositions
roots in classical political economy. Critical
Constitutional Economics, along with the related
emphases differ as among the separate programmes,
research programmes mentioned above, shares a
but each reflects efforts to move beyond the
central methodological presupposition with both its
relatively narrow confines of orthodox neoclassical
precursor, classical political economy, and its
economics.
counterpart in modem neoclassical microeconom-
In continental Europe, the whole set of sub-
ics. Only individuals choose and act. Collectivities,
disciplines is included under the rubric ‘The New
as such, neither choose nor act and analysis that
Political Economy’. Within this set we can place (1)
proceeds as if they do is not within the accepted
Public Choice, from which Constitutional
scientific canon. Social aggregates are considered
Economics emerged; (2) Economics of Property
only as the results of choices made and actions
Rights; (3) Law and Economics or Economic
taken by individuals. The emphasis on explaining
Analysis of Law; (4 ) Political Economy of Regu-
non-intended aggregative results of interaction has
lation; (5) the New Institutional Economics, and (6)
carried through since the early insights of the
the new Economic History. Defined imperial-
Scottish moral philosophers. An aggregative result
istically, Constitutional Economics would parallel
that is observed but which cannot, somehow, be
the inclusive term and embrace all of these pro-
factored down and explained by the choices of
grammes, since some attention is drawn in each
individuals stands as a challenge to the scholar
case to the legal-political constraints within which
rather than as some demonstration of non-
economic and political agents choose. Differences
individualistic organic unity.
can be identified, however, and it may be useful to
Methodological individualism, as summarized
summarize some of these here, even if detailed
above, is almost universally accepted by economists
discussion of the other research programmes cannot
who work within mainstream, or non-Marxian,
be attempted.
traditions. A philosophical complement of this
Public Choice, in its non-constitutional aspects
position that assumes a central role in Constitutional
of inquiry, concentrates attention on analyses of
Economics is much less widely accepted and is
alternative political choice structures and on
often explicitly rejected. A distinction must be
behaviour within those structures. Its focus is on
drawn between the methodological individualism
predictive models of political interactions, and is a
that builds on individual choice as the basic unit of
preliminary but necessary stage in the more general
analysis and a second presupposition that locates
constitutional inquiry. The economics of property
the ultimate sources of value exclusively in
rights, law and economics, and the political
individuals.
economy of regulation remain somewhat closer to
The first of these presuppositions without the
orthodox economic theory than Constitutional
second leaves relatively little scope for the
Economics or Public Choice. The standard
derivation of constitutional structures from indi-
efficiency norm remains central to these
vidual preferences. There is no conceptual nor-
subdisciplines, both as an explanatory benchmark
mative bridge between those interests and values
and as normative ideal. The new institutional eco-
that individuals might want to promote and those
nomics is directed more toward the interactions
non-individualistic values that are presumed to
within particular institutional forms rather than
serve as ultimate normative criteria. The whole
toward the comprehensive structure of political
constitutional exercise loses most if not all of its
rales (Furabotn and Richter ; Frey ). Some elements
raison d'etre in such a setting. If the ultimate
of the new economic history
Constitutional Economics 2069
values which are to be called upon to inform the groups) into agreement. If no such scheme can be
choices among institutions are non- individualistic, arranged, the observing political economist remains
then there is, at best, only an instrumental argument silent. The Wicksellian contribution allowed the
for using individually expressed preferences in the modem economist to bring the comparative analysis
process of discovering those values. of rules or institutions within a methodological
On the other hand, if the second presupposition framework that utilizes and builds on the efficiency
concerning the location of the ultimate sources of criterion, which, when interpreted as indicated, does
value is accepted, there is no other means of not require departure from either of the
deriving a ‘logic of rules’ than that of utilizing individualistic presuppositions previously
individually expressed interests. At base, the second discussed.
presupposition implies democracy in governance,
along with the accompanying precept that this
structure of decision-making only takes on Homo Economics in Constitutional
normative legitimacy with the prefix ‘constitu- Choice
tional’ appended to it.
Constitutional Economics, as distinct from the
complementary research programme on political
Wicksell as Precursor constitutions that are within the boundaries of law,
political science, sociology and other disciplines,
The single most important precursor to Constitu- goes beyond the logical presuppositions of indi-
tional Economics in its modem variant is Knut vidualism to incorporate nontautological models of
Wicksell, who was individualist in both of the individual utility maximization. Homo economicus
senses discussed above. In his basic work on fiscal takes a central role in comparative institutional
theory (Finanztheoretische Untersuchungen, inquiry. Individuals are assumed to seek their own
), Wicksell called attention to the significance interests, which are defined so as to retain
of the rales within which choices are made by operational content.
political agents, and he recognized that efforts at Two quite different arguments can be made in
reform must be directed toward changes in the rules support of this postulate in Constitutional Eco-
for making decisions rather than toward modifying nomics. The first is based simply on methodological
expected results through influence on the behaviour consistency. To the extent that individuals are
of the actors. modelled as utility maximizers as they participate in
In order to take these steps, Wicksell needed market relationships, there would seem to be no
some criterion by which the possible efficacy of a basis for postulating a shift in motivation as they
proposed change in rules could be judged. He behave within non-market constraints. There is at
introduced the now-familiar unanimity or consensus least a strong presumption that individuals do not
test, which is carried over into Constitutional undergo character transformation when they shift
Economics and also allows the whole research from roles as buyers or sellers in the market-place
programme to be related closely to the contractarian to roles as voters, taxpayers, beneficiaries, politi-
tradition in political philosophy. The relationship cians, or bureaucrats in the political process. A
between the Wicksellian and the Paretian criteria is more sophisticated reason for postulating con-
also worthy of note. If only individual evaluations sistency in behaviour lies in the usefulness of the
are to count, and if the only source of information model for the whole exercise of institutional com-
about such evaluations is the revealed choice parison. If the purpose is to compare the effects of
behaviour of individuals themselves, then no alternative sets of constraints, some presumption of
change could be assessed to be ‘efficient’ until and behavioural consistency over the alternatives is
unless some means could be worked out so as to necessary in order to identify those differences in
bring all persons (and results that are attributable to the differences in
constraints.
2070 Constitutional Economics
demonstrate, however, that government, if given the relationship with the efforts to derive criteria for
authority to tax A, will tend to levy a tax that will political and economic rules of social interaction.
generate more revenue than would be forthcoming Economists, as well as other social scientists and
under an authority to tax B. The equi- revenue social philosophers, have come increasingly to
alternatives may not be effective political recognize that the untrammelled interplay of
alternatives under any plausibly acceptable interest-group politics is unlikely to further objec-
modelling of the behaviour of political agents. Once tives for distributive justice. Analysis of how this
this simple point is recognized, the normative politics operates in the making of fiscal transfers
significance of the neoclassical ranking of tax suggests that principled adjustments in the posttax,
instruments is reduced. Discussion shifts necessarily post-transfer distribution of values is only likely to
to the level of interaction between political decision be achieved if the institutional rules severely restrict
structures and fiscal institutions. the profitability of investment in attempts to subvert
A second application of Constitutional Eco- the transfer process.
nomics is found in the post-Keynesian discussion of Further applications include the regulatory
budgetary policy. The Keynesian advocacy of the constitutions, along with the organization of public
use of governmental budgets to accomplish enterprises. In its inclusive definition, Constitutional
macroeconomic objectives was based on a neglect Economics becomes the analytical route through
of the political decision structure. The proclivity of which institutional relevance is reintroduced into a
democratic governments to prefer spending over sometimes sterile social science. In its less inclusive
taxing, and hence to bias budgets toward deficit, is definition, Constitutional Economics, along with its
readily explained in elementary public choice related and complementary research programmes,
theory (Buchanan and Wagner ). This essential step restores ‘political’ to ‘economy’, thereby bringing a
in public choice reasoning leads naturally to inquiry coherence that was absent during the long hiatus
into the relationships between the constraints that during which ‘economics’ made putative claims to
may be placed on political choice and predicted independent status.
patterns of budgetary outcomes. Out of this
intensely practical, and important, application of
Constitutional Economics emerged the intellectual See Also
bases for the normative argument that, in the post-
Keynesian era when moral constraints on political
agents have lost much of their previous
effectiveness, formal rules limiting deficit financing
may be required to insure responsible fiscal
decisions. In the modem setting, such rales would
limit spending rates. But it is perhaps worth noting
►
that, in the political environment of Sweden in the
1890s, Wicksell advanced analytically similar
proposals for reform in the expectation that, if the References
suggested reforms should be implemented, public
Brennan, G., and J.M. Buchanan. 1980. T h e p o w e r t o
sector outlay would increase.
tax: Analytical foundations of the fiscal
The analysis of alternative rules for ‘the transfer c o n s t i t u t i o n . Cambridge: Cambridge University
constitution’ represents a third application of Press.
constitutional economics. With the 1971 publication Brennan, G., and J.M. Buchanan. 1985. T h e r e a s o n o f
rules: Constitutional political economy.
of John Rawls’s A Theory of Justice, renewed
Cambridge: Cambridge University Press.
attention came to be placed on principles of Buchanan, J.M. 1974. T h e l i m i t s o f l i b e r t y :
distributive justice. Although explicitly pre- B e t w e e n a n a r c h y a n d l e v i a t h a n . Chicago:
constitutional, Rawls’s work has a close University of Chicago Press.
Buchanan, J.M., and G. Tullock. 1962. T h e c a l c u l u s o f
consent: Logical foundations of
constitutional democracy. Ann Arbor:
2072 Constitutions, Economic Approach to
Buchanan, J.M., and R.E. Wagner. 1977. D e m o c r a c y i n normative vs positive;; Corruption; First
deficit: The political legacy of Lord
amendment; Harsanyi, J; Landes, W; Negative
K e y n e s . New York: Academic.
Frey, B. 1984. A new view of economics: Comparative externalities; Party systems; Philadelphia con-
analysis of institutions. S c e l t e P u b b l i c h e 1: 17-28. vention; Posner, R; Rawls, J; Rent seeking;
Furubotn, E.G, Richter, R. (eds). 1980. T h e n e w i n s t i t u - Social contract theory; Social welfare function;
t i o n a l e c o n o m i c s - A s y m p o s i u m . Zeitschrift
Tullock, G
fur die gesamte Staatswissenschaft, 140.
Flayek, F.A. 1973-1979. L a w , l e g i s l a t i o n , a n d
l i b e r t y , 3 vols. Chicago: University of Chicago Press.
Flume, David. 1741. On the interdependency of parliament. In JEL Classification
Essays, moral, political and literary. K1
London: Oxford University Press, 1963.
McKenzie, R. 1982. B o u n d t o b e f i ' e e . Palo Alto:
Hoover Press.
The economic approach to constitutions applies the
McKenzie, R. (ed.). 1984. C o n s t i t u t i o n a l e c o n o m i c s . methodology of economics to the study of
Lexington: Lexington Books. constitutions, just as public choice applies this
Mill, J.S. 1861. Considerations on representative government. methodology to the full range of topics of political
In E s s a y s o n p o l i t i c s a n d s o c i e t y , vol. XIX of
Collected works of J.S. Mill. Toronto: University of
science.
Toronto Press, 1977. The economic approach to constitutions began
North, D.C., and R.P. Thomas. 1973. T h e r i s e o f t h e with The Calculus of Consent by James Buchanan
western world: A new economic history. and Gordon Tullock ( , hereafter B&T).
Cambridge: Cambridge University Press.
Theirs was largely a normative analysis of what
Rawls, J. 1971. A t h e o r y o f j u s t i c e . Cambridge, MA:
Harvard University Press. ought to go into a constitution. Their main findings
Wicksell, K. 1896. Finanztheoretische and the literature that grew out of their work are
U n t e r s u c h u n g e n . Jena: Gustav Fischer. Central reviewed first, after which the positive stream of the
portions of this work were published in English
literature is discussed.
translation as ‘A new principle of just taxation’. In
Classics in the theory ofpublic finance,
ed. R.A. Musgrave and A.T. Peacock. London:
Macmillan, 1959.
Normative Research on Constitutions
under which a B&T constitution maximizes a higher required majorities become optimal to
Harsanyian SWF. implement ~ x, with unanimity being optimal when
Another innovation in The Calculus was to the asymmetry in payoffs is very large. Mueller (
introduce the external costs of collective decisions , , ch. 14) employed this anal
(B&T, pp. 63-8). When a collective choice is made ysis to explain why placing rights to act into a
without the consent of all members of the constitution would maximize the expected utilities
community, the decision can make some members of those writing it.
worse-off. The votes of those favouring the decision
thus impose a negative externality on those
opposing it. The smaller the majority required to Positive Research on Constitutions
pass an issue, the more likely it is that an individual
is on the losing side. However, the amount of time The positive literature of constitutions falls into two
required to make a collective decision is also likely categories: studies of constitutional conventions
to increase with the required majority. The optimal and of the consequences of constitutions. The
majority minimizes the sum of collective decisions’ second category is obviously very large, and so I
external and decision-making costs. provide only the flavour of this type of work.
There is nothing in B&T’s costs-minimization- Charles Beard’s work ( ) might well be
approach that implies that the optimal majority is regarded as the first economic analysis of the Phil-
likely to be a simple majority, and thus their adelphia Convention. Beard stressed the self-
approach does not account for this rule’s ubiquitous interest of the participants, and claimed that the
use. The approach does imply the widespread use of final product reflected the interests of the landown-
the simple majority rule, if one of the two cost ing aristocracy. In an equally cynical analysis,
curves - most plausibly decision-making costs - has Landes and Posner ( , p. 893) claimed that the
a sharp discontinuity at 50% (Mueller , pp. 76-8). First Amendment was a result of pressure from
Rae ( ) used the two-stage approach to ‘publishers, journalists, pamphleteers, and others
provide a completely different normative justifi- who derive pecuniary and nonpecuniary income
cation for the simple majority rale. At the consti- from publication and advocacy of various sorts’.
tutional stage, each individual is uncertain of Case studies of constitutional conventions confirm
whether he will favour x or - x in future votes on the importance of the self-interest of the
these binary issues. The expected gain if an indi- participants in determining the constitution’s
vidual favours x and x wins equals the expected loss content. For example, representatives from small
if x wins and the individual favours ~ x. Rae further parties favour mles that produce proportional
assumed that the probability of favouring x equals representation and low percentage thresholds for
the probability of favouring ~ x. An egoist chooses taking seats in the parliament. Representatives from
the voting rule that minimizes the probability that large parties favour the reverse. If delegates are
she favours x in the future and ~ x is imposed, or selected geographically, the constitution protects
that she favours ~ x and x is imposed. The simple geographic interests. (For further discussion and
majority is the only rule satisfying this condition. references to the literature, see Elster , and
(For additional discussion and references see, Rae Mueller , ch. 21). Econometric analyses confirm
and Schickler .) these findings. McGuire and Ohlsfeldt ( )
Mueller ( ) generalized the two-stage and McGuire ( ) concluded
approach to show that the optimal majority for that the votes of delegates to the Philadelphia con-
binary choices depends on the relative payoffs from vention reflected both their personal interests and
the two issues. (Riley , presents a game theoretic those of their constituencies. Eavey and Miller (
analysis of a two-stage constitutional process.) As ) reached the same conclusion from the vot
the loss to those favouring x rises relative to the ing patterns of those who ratified the Pennsylvania
gain to those favouring ~ x, and Maryland constitutions.
A key decision facing any constitutional convention
is whether to design institutions that will
2074 Constitutions, Economic Approach to
produce a two-party system or a multiparty system. Harsanyi, J. 1955. Cardinal welfare, individualistic ethics, and
In practice, this choice appears to rest upon the interpersonal comparisons of utility. Journal of Political
Economics 63: 309-321.
number of representatives elected from each Landes, W., and R. Posner. 1975. The independent judiciary
electoral district (Taagepera and Shugart ; Lijphart ; in an interest-group perspective. Journal of Law and
Mueller , chs. 8-10). Recent theoretical and Economics 18: 875-901.
empirical work by Persson and Tabellini ( , , Lijphart, A. 1990. The political consequences of electoral laws,
1945-85. American Political Science Review 84: 481-
, ) and 496.
Persson et al. ( ) demonstrates the economic McGuire, R. 1988. Constitution making: A rational choice
importance of this choice. They find more rent model of the Federal Convention of 1787. American
seeking, more cormption, more redistribution and Journal of Political Science 32: 483-522.
McGuire, R., and R. Ohlsfeldt. 1986. An economic model of
larger deficits in multiparty systems. Presidential
voting behavior over specific issues at the constitutional
systems lead to smaller governmental sectors convention of 1787. Journal of Economic History 46: 79-
because they generally contain stronger checks and 111.
balances than parliamentary systems. (For a review Mueller, D. 1973. Constitutional democracy and social
welfare. Quarterly Journal of Economics 87: 60-80.
and references to other contributions, see Persson
Mueller, D. 1991. Constitutional rights. Journal of Law,
and Tabellini .) Economics, and Organization 7: 313-333.
Mueller, D. 1996. Constitutional democracy. Oxford:
Oxford University Press.
Conclusions Mueller, D. 2001. The importance of uncertainty in a two-
stage theory of constitutions. Public Choice 108: 223-258.
Mueller, D. 2003. Public choice III. Cambridge: Cambridge
There are two kinds of people in the world: those University Press.
who believe that constitutions matter and those who Persson, T., and G. Tabellini. 1999. The size and scope of
do not. The contributors to the literature reviewed government: Comparative politics with rational politi-
cians. European Economic Review 43: 699-735.
here fall into the former category. Their work helps
Persson, T., and G. Tabellini. 2000. Political economics -
illustrate why and in what way constitutions matter, Explaining economic policy. Cambridge, MA: MIT
and further illustrates the fruitfulness of Press.
undertaking an economic approach to the study of Persson, T., and G. Tabellini. 2003. Economic effects of
constitutions. constitutions. Cambridge, MA: MIT Press.
Persson, T., and G. Tabellini. 2004a. Constitutions and
economic policy. Journal of Economic Perspectives 18:
75-98.
Persson, T., and G. Tabellini. 2004b. Constitutional rules and
See Also fiscal policy outcomes. American Economic Review 94:
25-45.
Persson, T., G. Roland, and G. Tabellini. 2000. Comparative
► Indiana . Jame M. (Born 1919) politics and public finance. Journal of Political
► V. -: - Ratioi Economy 108: 1121-1161.
Rae, D. 1969. Decision-rules and individual values in
constitutional choice. American Political Science
Bibliography Review 63: 40-56.
Rae, D., and E. Schickler. 1997. Majority rule. In Perspec-
Beard, C. 1913. An economic interpretation of the constitution tives on public choice, ed. D.C. Mueller. Cambridge,
of the United States. New York: Macmillan, 1941. MA: Cambridge University Press.
Buchanan, J., and G. Tullock. 1962. The calculus of consent. Rawls, J. 1971. A theory of justice. Cambridge, MA:
Ann Arbor: University of Michigan Press. Belknap.
Eavey, C., and G. Miller. 1989. Constitutional conflict in state Riley, J. 2001. Constitutional democracy as a two-stage game.
and nation. In The federalist papers and the new In Constitutional culture and democratic rule, ed. J.
institutionalism, ed. B. Grofman and D. Wittman. New Ferejohn, J. Rakove, and J. Riley. Cambridge:
York: Agathon Press. Cambridge University Press.
Elster, J. 1991. Arguing and bargaining in two constituent Taagepera, R., and M. Shugart 1989. Seats and votes. New
assemblies. Mimeo. Storrs Lectures, Yale Law School. Haven: Yale University Press.
Consumer Durables 2075
and durable demands based on a quadratic utility of the type defined in ( ) above. As long as these are
function with quadratic adjustment costs and linear functions of (q,p qy _ b qy _ 2, ...) a user cost
rational expectations. Apart from the coefficients on price py can be defined. Then, since the
financial assets he finds a reasonable degree of intertemporally separable form of the utility func-
coherence between the two equations and the tion permits two-stage budgeting (see Gorman, on
durables equation fits well and its residuals are well SEPARABILITY below), there exist demand
behaved. Muellbauer and Pashardes ( , functions
revised 1987) build persistence effects into
zpreferences in a rather more general way though, Z
U = SiiPnA) (11)
in the context of a quadratic utility function, the
effect is similar to Weissen-berger’s. Since their Where
empirical results on annual British data for a system
of nine demand functions, one of which is for
A=
durables, are rather satisfactory, it is worth exam- 2=1
ining the approach.
From ( ), purchases of good i
They assume that the utility function is
intertemporally separable in transformed quantities
Zjf. Qu = \ai ~ (1 - Si)]Sit-i + 1 , a‘ tit- (12)
Oi
and ought to be treated as such in econometric assumed to be the same for all consumers. Equation
work. In practice, however, new and used durables ( ) suggests the use of Probit or Logit
may not be perfect substitutes, new prices may be analysis to examine ownership variations on micro-
set by oligopolistic producers and the second-hand data (see McFadden 1973, McFadden
market, despite its imperfections, may be more ). Given information on rental expenditure
demand responsive in its prices. Nevertheless, the defined as pDS = pDgD(x, p, pD\ b, e) if gD( ) > 1,
above model is a bench-mark that raises issues Tobit analysis is the appropriate technique (see
which applied economists working on the demand Tobin ).
for durables need to face. Equation ( ) also has implications for
The other source of data on the demand for the ‘quasi’-Engel curves which link ownership in an
durables is household surveys. In such cross- income bracket to the income level. Suppose that (
sections, the discreteness of ownership must be ) holds if € < 6(x, p, pD, b). Then the pro
explicitly recognized. The classic paper by Farrell ( portion of households with budget .r and charac-
) was one of the first to analyse the threshold teristics b owning the durable is
effects which govern ownership. The modem eco-
'Q /»oo
nomic treatment in its simplest form can be f(x,b,e) de/ f ( x , b , e ) d e (17)
explained as follows. Let P° be the rental price of a —oo —oo
durable good and the budget constraint be
where /() is the joint probability density of x, b and
e. Provided the budget .r can be plausibly linked to
pq + p DS = x
observable income, this provides a justification for
(1 4) the ‘quasi’-Engel curves estimated, for example, by
Aitchison and Brown ( ),
Suppose S = 1 if the durable is owned and S 0 if Cramer ( ), Pyatt ( ), and Bonus ( ).
not. Let the single period utility function be These studies have not always, however, given as
much attention to the household characteristics b as
u = v(q,S,b, e ) they might have done. If b and e were inde-
where b is a vector of observable household char- pendently distributed of x and 0 monotonic in x,
acteristics and s summarizes unobservable ones in a then there must be a sigmoid relationship between
scalar. If S = 0, we can solve for q = x/p from the the level of x and the proportion owning the durable
budget constraint and if .S' I , q (x - l>n)/P. Thus the given the sigmoid shape of the cumulative
durable is owned if the utility from owning v[(x distribution function for x. In practice, though there
pD)/p, 1, b, e ] exceeds that from not owning v[xip, is quite a high correlation between such household
0, b, C ]. These solved out utility functions are characteristics as size and income, empirical
termed ‘indirect utility functions’. This ownership ‘quasi’-Engel curves are usually sigmoid in shape.
criterion is still valid if, in fact, durables of this type In aggregate, as average income rises over time
vary in size, performance, luxuriousness or other there is also a sigmoid relationship between the
characteristics that can be summarized in a quality average income level and the aggregate proportion
index. 5=1 then refers to the minimum quality owning a durable. As Deaton and Muellbauer ( , pp.
available while .S' > 1 for higher qualities. 370-1) note, this sigmoid
Maximizing (15) subject to ( ) then gives shape could be partly due to the sigmoid shape of
conventional demand functions, the cumulative distribution function of income and
given S ^ l , q = g{x,p,pP; b, e ), S = g°(x, p, //’; b, partly due to other causes of diffusion such as
C ) and the durable is owned if epidemic models of the spread of a disease which
may be appropriate when new goods such as
gD(x,p,pD-,b,(E:) >\. television are introduced.
For simplicity the discussion above has taken the
(1 6) case of a single type of durable. But as
This article examines the interplay between the efficiency with which the economy achieves this
theory and evidence in the study of consumers’ goal is the criterion by which alternative systems,
expenditure and its composition. Although institutions and policies are to be judged. Within a
models have been developed from the theory of capitalist economy, such considerations lead to an
consumption that illuminate much of the examination of the relationship between prices and
available data, many standard presumptions of consumption behaviour, and theoretical
economics lack substantial bodies of evidence development and empirical analysis have been a
such as central theories in the natural sciences major continuous activity since the middle of the
enjoy. last century. Even older is the tradition of using
individual household budgets to dramatize poverty,
Keywords and the relationship between household incomes and
Almost ideal demand system (AIDS); ARIMA household expenditure patterns has occupied social
process; Bernoulli utility functions; Compen- reformers, statisticians and econometricians since at
sated demand; Consumer expenditure; Con- least the 18th century. In more modem times, it has
sumers’ expenditure; Consumption function; been recognized that the study of public finance and
Convexity; Demand curve; Demand functions; of taxation depends on a knowledge of how price
Denver Income Maintenance Experiment; changes affect the welfare and behaviour of
Econometrics; Elasticity; Engel curves; Engel’s individuals, and the recent development of optimal
law; Euler equation; Fisher, I.; Fixed needs tax theory and of tax reform analysis has placed
model; Flexible functional forms; Friedman, M.; additional demands on our understanding of the
Generalized axiom of revealed preference links between prices, expenditures and welfare.
(GARP); Generalized Leontief system; Hicks, In the last fifty years, aggregate consumption
J.; Household budgets; Inflation; Intertemporal has become as much of an object of attention as has
utility functions; Keynes, J.; King, G.; Kuznets, its composition, and in spite of a common
S.; Law of demand; Life cycle hypothesis; Life- theoretical structure, there has been a considerable
cycle income model; Linear expenditure system; division of labour between macro economists,
Lucas, R.; Marginal rate of substitution; interested in aggregate consumption and saving, and
Michigan Panel Study of Income Dynamics micro economists whose main concern has been
(PSID); Modigliani, F.; Non-linear optimization; with composition, and with the study of the effects
Optimal taxation; Permanent income model; of relative prices on demand. The interest of
Prices; Ramsey, F.; Random walk model; macroeconomics reflects both long-term and short-
Revealed preference theory; Risk aversion; term interests. What is not consumed is saved,
Roy’s identity; Samuelson, P.; Saving ratio; saving is thrift and the basis for capital formation, so
Shephard’s lemma; Slutsky matrix; Stigler, G.; that the determinants of saving are the determinants
Stone, J.; Stone-Geary utility function; of future growth and prosperity. More immediately,
Substitution effect; Tax reform analysis; Unit aggregate consumption accounts for a large share of
root model; Utility maximization national income, typically more than three-quarters,
so that fluctuations in behaviour or ‘consumption
shocks’ have important consequences for output,
employment, and the business cycle. Since Keynes’s
General Tlieoiy, the consumption function, the
JEL Classifications relationship between consumption and income, has
E2
played a central role in the study of the macro-
The study of consumers’ expenditure, both in total economy. Since the 1930s, there has been a con-
and in composition, has always been of major tinuous flow of theoretical and empirical
concern to economists. Neoclassical economics sees developments in consumption function research,
the delivery of individual consumption as the main
object of the economic system, so that
2082 Consumer Expenditure
and some of the outstanding scientific achievements are vectors of consumption in periods 1 to T, and /
in economics have been in this field. (•) is a quasi-concave function that is nondecreasing
In this essay, the major themes will be the inter- in each of its arguments. Several things about this
play between theory and evidence in the study of formulation are worth brief discussion. The function
consumers’ expenditure and its composition. If /(•) yields the utility that would be obtained from the
economists have any serious claim to being scien- consumption vector under certainty, and it
tists, it should be clearly visible here. The best represents the utility from a life-time of
minds in the profession have worked on the theory consumption; the indices 1 to T therefore represent
of consumption and on its empirical implementa- age with 1 the date of birth and T that of death. The
tion, and there have always been more data avail- expectation operator is required because choice is
able than could possibly be examined. I hope to made subject to uncertainty, not about the choices
show that there have been some stunning successes, themselves, which are under the consumer’s control,
where elegant models have yielded far from obvious but about the consequences of current choices for
predictions that have been well vindicated by the future opportunities. It is not possible to travel
evidence. But there is much that remains to be done, backward through time, so that choices once made
and much that needs to be put right. Many of file cannot be undone, and yet the cost of current
standard presumptions of economics remain just consumption in terms of future consumption
that, assumptions unsupported by evidence, and foregone is uncertain, as is the amount of resources
while modem price theory is logically consistent that may become available at future dates. The
and theoretically well developed, it is far from consumer must therefore travel through life, filling
having that solid body of empirical support and in the slots in ( ) from left to right as best as he or
proven usefulness that characterizes similar central she can, and at time (or age) t, everything to the left
theories in the natural sciences. will be fixed and unchangeable, whether now seen
to be optimal or not, while everything ahead of t is
A Simple Theoretical Framework subject to the random buffeting of unexpected
changes in interest rates, prices, and incomes. The
Almost all discussions of consumer behaviour begin solution to this sort of maximization problem has
with a theory of individual behaviour. I follow been elegantly characterized by Epstein ( );
neoclassical tradition by supposing that such here I shall work
behaviour can be described by the maximization of with something that is more restrictive but more
a utility function subject to suitable constraints. The useful and note in section “
axioms that justify utility maximization are mild, ” below some phenomena that are better
see any microeconomic text such as Varian ( handled by the more general model.
/1984) or Deaton and Intertemporal utility functions are frequently
Muellbauer ( ), so that utility maximization assumed to be intertemporally additive, so that the
should be seen as no more than a convenient preference rankings between consumption bundles
framework that rules out the grossest kind of in any two periods or ages are taken to be
behavioural inconsistencies. The assumptions that independent of consumption levels in any third
have real force are those that detail the constraints period. If so, the utility function ( ) takes the more
facing individuals or else put specific structure on mathematically convenient form
utility functions. Perhaps the most general
specification of preferences that could be considered T
is one that is written * = (2 )
r= 1
Ui=El{f{ql,q2,...,qt,...,qT)} ( 1)
Note that by writing utility in the form ( ), since the
where u, is utility at time t, Et is the expectation
expectation operator is additive over states of the
operator for expectations formed at time /, q! to qT
world preferences are in effect assumed to be
Consumer Expenditure 2083
simultaneously additive over both states and Write the maximized value of utility from the
periods, an assumption that can be formally period t problem as ij/t(xt, pt), where i]/(■) is a
defended, see Gorman ( ) and Browning standard indirect utility function. The original
et al. ( ). It has the consequence that risk aver intertemporal utility function then takes the form
sion and intertemporal substitutability become two
aspects of the same phenomenon. Individuals that T-t
dislike risk, and will pay to avoid it, will also n, =E,^21J/r (xl+r,pl+r). ( 4)
attempt to smooth their consumption over time and i=0
will require large incentives to alter their preferred The constraints under which this function is max-
consumption and saving profiles. Note also that the imized are most conveniently analysed through the
additive structure of ( ) means that, unlike the case conditions governing the evolution of wealth from
of ( ), previous decisions are irrelevant for current period to period. If A, is the (ex-dividend) value of
ones. For decision-making at time t, bygones are assets at the start of period t, N,t is the nominal
bygones, and conditional on asset and income holdings of asset i with price Pit, d„ is the dividend
positions, future choices are unaffected by what has on i paid immediately before the beginning of t, and
happened in the past. There can therefore be no v, is income in period t, then
attempt to make up for lost opportunities, nor can
such phenomena as habit formation be easily A,1 i = 'y ' Nit(Pi, 11 I dlt 11) (5)
modelled. i
Because utility in ( ) is intertemporally sepa-
rable, maximization of life-time utility implies that, Y] NitPit = A,+ yt-xt. ( 6)
i
within each period, the period subutility function v,
( •) must be maximized subject to whatever total it Conditions ( ) and ( ) determine how wealth evolves
is optimal to spend in that period. The period by from period to period, and the picture is completed
period allocation of consumption expenditure to by requiring that the consumer’s terminal assets be
individual commodities need not, therefore, be positive, i.e.
planned in advance, but can be left to be determined
when that period or age is reached, and period t dr+i > 0 (7)
allocation will follow according to the rule
To solve this problem, the technique of backward
maximize v, (qt) subject to pt ■ qt= xh (3) recursion is used. This rests on the observation that
it is impossible to know what to do in period t
where pt is the price vector corresponding to q, and without taking into account the problem in period (t
xt is the total amount to be spent in t. Problem ( ) is + 1), nor that in (t + 1) without thinking about (t +
one of standard (static) utility maximization, though 2), and so on. However, in period T there is no
note that xt is not given to the consumer, but is future, so that looking ahead from date t, we can
determined by the wider intertemporal choice write subutility in period T in terms of that period’s
problem. Nevertheless, not the least advantage of price and inherited assets, and we write this as v- /;
the intertemporally additive formulation is its i.e.
implication that the composition of expenditure
follows the standard utility maximization rale. It
vj = vj'(Aj) = i / ' T{AT +yT>Pt)- ( 8)
allows separate attention to be given to demand
analysis on the one hand, i.e. to the problem ( ), and
to the consumption function on the other hand, this Given this, the consumer can look ahead from
being understood to be the intertemporal allocation period t to period (T — 1) and foresee that the
of resources, i.e. the determination of xt. problem then will be to choose the composition of
assets N so as to maximize vr_i, where
2084 Consumer Expenditure
= max [^r_j(Ar_! +yT_l-N- PT-1 , pT-i) +Ej-I {i;r[N many of the implications of the theory of con-
sumption. Note first that it is little more than the
■ (Pj + dj)}} 1.
standard result that the marginal rate of substitution
(9)
between today’s and tomorrow’s consumption
At the next stage, assets in (T — 2) will be allocated should be equal to the relative price. However, the
so as to trade off the benefits of consumption in (T equation is set in a multiperiod framework, not a
— 2) versus the benefits of AT_X in vT_\ in ( ) above two-period one, and it explicitly recognizes the
and again yielding a maximized value vT_2. As we uncertainty in both asset returns and in the value of
follow this back through time, the consumer finally money in subsequent periods. The equation also
reaches the current period t, where he or she faces holds for all i, i.e. for all assets, so that the result
an only slightly complicated version of the usual also has implications for asset pricing as well as for
‘today tomorrow’ trade-off; the asset vector N must consumption and saving, and for this reason the
be chosen to solve the problem, model is often referred to as the consumption-asset
pricing model. I shall return to these implications
u, = ma x i (i,{A, + y, - N ■ Pt,pt) below.
N The theory as presented above is the modem
+ E,{vl+l[N •( / > , + , +dt+1)}}. equivalent of the life-cycle theory of consumption
that dates back to Irving Fisher ( ) and Frank
(1 0) Ramsey ( ), and that had its modem genesis in
the papers by Modigliani and Brumberg ( )
From this sequence of problems, several and (1954, published ). Modigliani and Brumberg’s
important results readily follow. First, consider the treatment differs from the above only in not
derivatives of each of the functions vr(Ar) which explicitly modelling uncertainty, and by including
represent the marginal value of an extra unit of only a single asset. The modem version appears
currency for the remaining segment of life time first in Breeden ( ) and in
utility from r through to T. By the envelope theorem Hall ( ), see also Grossman and Shiller ( ).
(see for example Dixit ( ) for a
good exposition), it is legitimate to differentiate
through the maximization problem, from which Predictions and Evidence
v'(A, ) = d\j/r/dxr = Xr, say, (11) One of the most important implications of the
theory above, and of Eq. ( ) in particular, is that the
so that Xr is the marginal utility of money in period
evolution of consumption over the fife- cycle is
r. Secondly, the maximization of ( ) with respect to
independent of the pattern of income over the life-
portfolio choice gives the relationship, for each
cycle. The asset evolution Eqs. ( ) and ( ) allow
asseti,
consumers to borrow and lend at will, so that the
only ultimate constraint on their consumption is one
Pud^Jdx, =E,{(Pit+1 + dil+i)d\jjl+Jdxt+\}
of life-time solvency. In consequence, consumption
(12) patterns are free to follow tastes, the evolution of
which, defining the asset return Rh , as (Pit+1 + family structure, or the different needs that come
du+\)/Pit< and using ( ) can be rewritten in the with ageing, provided that in the end total life-time
simple form expenditure lies within (total) life-time resources,
whether from inherited wealth or from labour
X, = E,(X,+iRj,+1). income. It is often assumed that tastes are such that
consumers prefer to have a relatively smooth
(1 3) consumption stream, and this can be illustrated
from a special case of Eq. ( ).
This equation, in current parlance often referred to Assume that the within-period utility function is
as the ‘Euler equation’, can be used to derive
Consumer Expenditure 2085
homothetic so that if/(x, p) is <p(x/a(p)) for some is a permanent income theory of consumption,
linearly homogeneous function a(-), and that </?(•) where permanent income is the annuity value of
has the isoelastic from with elasticity (1 — <r). lifetime wealth, though the lifetime interpretation is
Life-time utility takes the form only one of the many that are offered in Friedman’s
( ) original statement. Whether
T-t life-cycle or not, linking consumption to future
M, = ^ (i + S)~r [.xt+r/a(pt+r)]1_<r (14)
r=0 incomes has important consequences. First, con-
sumption will respond only to ‘surprises’ or
where <5 is the rate of pure time preference, and a ‘shocks’ in income; changes in income that have
> 0 is the coefficient of relative risk aversion and been foreseen are already discounted in previous
the reciprocal of the intertemporal elasticity of behaviour and should not induce any changes in
substitution. Equation ( ) can be used to evaluate ( ), plans. Of course, this does not mean that con-
and gives immediately sumption will not change along with changes in
income; a change may have been planned in any
case, and some proportion of any actual change may
£[{(1 + r*+i)/(l + 8)}{ct/ct+iy] = 1 (15)
well have been unforeseen. However, if a
where rt+ \ is the real after tax rate of interest from / substantial fraction of the regular changes in income
to / + 1 on any asset, and ct is real consumption, x/a over the business cycle are foreseen by consumers,
(/;,). Equation ( ) shows that, if expectations are or if unanticipated fluctuations in income are
fiilfilled, consumption will grow over the life- cycle regarded as only temporary with limited
if the real rate of interest is greater than the rate of consequences for total life time resources, then
pure time preference, and vice versa, while with i~f consumption will not respond very much to cyclical
= 8, consumption is constant with age. These results fluctuations in income. Aggregate consumption is
are of course an artefact of the specific assumptions indeed much smoother than is aggregate income,
about utility, and for any real household and this has been traditionally accepted as an
consumption can be expected to vary predictably important piece of confirmatory evidence. I shall
with age according to patterns of family formation, take up the matter again below when I deal with the
growth, and ageing; Modigliani and Ando ( ) recent econometric evidence.
have suggested that consumption The distinction between measured income and
per ‘equivalent adult’ might be constant over the permanent income is also important for the inter-
life-cycle. But whatever the shape of preferences, pretation of cross-sectional evidence. Since mea-
there need be no relationship between the profiles of sured income can be regarded as an error-ridden
consumption and of income; income can be saved proxy for permanent income, the regression of
until it is needed, or borrowed against if it is not yet consumption on measured income will be biased
available. downward (rotated clockwise) compared with the
Independent of the life-time pattern of con- true regression of consumption on permanent
sumption is its level, which under the life-cycle income. Crosssectional regressions, or time-series
model is determined by the level of total life-time regressions of simple Keynesian consumption
resources, so that individuals with the same tastes functions will therefore tend to understate the long-
but with higher incomes or higher inherited assets run marginal propensity to consume. Well before
will have higher levels of consumption throughout the work on life-cycle models, Kuznets ( )
their lives. If the future were entirely predictable, showed that the long-run saving ratio in
the consumption plan at any point in time could be the United States had been roughly constant in spite
decided with reference to the level of total wealth, of repeated cross-sectional analyses showing that
this being the value of financial assets and the the saving ratio rose with income, and the life- cycle
discounted present value of current and future theory could also readily account for these findings.
incomes. In this sense, the life-cycle model It is interesting to note that the constancy of the
saving ratio is far from being well
2086 Consumer Expenditure
established as an empirical fact; the evidence for uncertainty about the date of death and about
other countries with long-run data is very mixed, possible medical expenses near the end of life
and even the United States saving ratio is clearly (Davies ), partly there may be statistical problems
influenced in the long-run by technical change, of measurement (Shorrocks ), and partly consumers
migration patterns, and demographic shifts, see may wish to leave bequests. However, most
Kuznets ( ) and Deaton ( ). Life-cycle countries’ tax systems penalize donors who do not
and permanent income theories also predict that pass on assets prior to death, so the reason for the
households with atypically high income will tend to size of actual bequests remains something of a
save a great deal of it, a prediction which explained mystery. Bemheim et al. ( ) have gone so far as to
the apparently anomalous finding that black suggest that parents
households tend to save more than white retain their wealth until death in order to control
households at the same level of measured income; their heirs and to solicit attention from them. They
since blacks typically have lower household income claim empirical support for a positive relationship
than whites, those with the same measured income between visits by children to their parents and
can be expected to have a higher transitory parents’ bequeathable assets; visits are apparently
component. especially frequent to rich sick parents, but not at all
The Modigliani and Brumberg life-cycle story frequent to poor sick parents. Related to the dispute
was also important because it offered a story of about the reason for bequests is a parallel dispute on
capital accumulation in society as a whole that their importance in the transmission of the capital
relied on the way in which people made preparation stock, see the original contribution by Kotlikoff and
for their own futures, particularly for their future Summers ( ) and Modigliani's
retirement. In a stationary life-cycle economy, in reply, summarized in his ( ) Nobel lecture.
which there is neither economic nor population The life-cycle and permanent income models
growth, aggregate saving is zero, and the old, as also provided the econometric specifications for a
they dissave, pass on the ownership of the capital generation of macro econometric models. Ando and
stock to the next generation who are, in turn, saving Modigliani ( ) suggested a simple form
for their own retirement. With either population or for the aggregate consumption function in which
income growth, the aggregate scale of saving by the real aggregate consumption was a linear function of
young would be greater than that of dissaving by expected real labour income, YL, and of the real
the old, so that, to a first approximation, the value of financial assets, i.e.
aggregate saving ratio, while in the long ran
independent of the level of national income, would c, = acE,(YL) + 8W,.
depend on the sum of its population and per capita ( 16 )
real income growth rates. Modigliani ( ), in
his Nobel address, has In practical econometric work, the expectation was
given an account of how very simple stylized typically replaced by a linear function of current
models of saving and refinement yield quite accu- and past values of labour income, a procedure that
rate predictions of the saving ratio and of the ratio can be formally justified by modelling labour
of wealth to national income, and the predictions income as a linear ARIMA process, a topic to which
about the growth effects have been repeatedly borne I shall return below. Wealth or a subset of wealth
out in international comparisons of saving rates, see was included as data allowed, although sometimes
Modigliani ( ), Houthakker ( , the return to wealth was included with labour
), Leff ( ) and Surrey ( ). Perhaps the income which could then be replaced by total
only problem with these interpretations is that there income, so that, with smoothing, ( )
is little evidence that the old actually dissave, becomes a permanent (total) income model of
except by running down state social security or consumption. A favoiuite variant, suggested in
pension schemes; see for example Mirer ( ). Friedman ( ), was to model permanent
Partly, this may be a rational response to income as an infinite moving average of current
income with geometrically declining weights,
Consumer Expenditure 2087
There is overwhelming evidence from a large has expectation zero at time t. Equation ( )
number of countries, see in particular Koskela and shows that, in the absence of ‘news’, consumption
Viren ( , ), that saving increased with will grow or decline at a steady rate g, so that
inflation in the 1970s, even when we allow for real nothing that is known by the consumer at time / or
income and its various lags. Such a finding is also earlier should have any value for predicting the
consistent with the life-cycle theory since unan- deviation of the rate of change of consumption from
ticipated inflation imparts a negative shock to real its constant mean. The result is often referred to as
assets, so that risk-averse, low inter-temporal elas- the ‘random walk’ property of consumption, though
ticity consumers will save to replace the lost assets the theory does not predict that vt+\ has constant
so as to avoid the chance of low consumption later. variance, so that, strictly speaking, the stochastic
It is also possible to explain the relationship process is not a random walk.
through the confusion between relative and absolute For someone used to thinking about the con-
price changes that is engendered by unanticipated sumption function as the relationship between
inflation in an environment in which goods are consumption and income, Eq. ( ) is notable for the
bought sequentially, see Deaton ( ), but it apparent absence of any reference to income. But of
would be hard to devise a test that would separate course income can appear through the stochastic
this from the life-cycle explanation. But if inflation term vt +1 if current income contains new
was indeed the cause of the failure of the empirical information about its own value or about future
consumption functions, then it is a standard enough values of income, and this will generally be the
story. An important variable was omitted from the case. The random walk model does not predict that
analysis, it had not been very variable in the past so consumption should not respond to current income.
that its omission was hard to detect, and economists It does however predict that, conditional on lagged
had not been imaginative enough to perceive its consumption, past income or changes in income
importance in advance. The Lucas critique is only should not be correlated with the current change in
one of the many problems that can beset an consumption, and a considerable amount of effort
econometric equation, and it does not seem to have has recently gone into testing this proposition. In
been the fatal one in this case. Hall’s ( ) original
The second research direction, the joint exam- paper, to the surprise of the author and of much of
ination of income and consumption, has proved the profession, the model worked well for an
more productive. The first important step was taken aggregate of United States consumption of non-
by Hall ( ), who pointed out that durables and services. The level of consumption
Eq. ( ) implies that, as an approximation con certainly depends on its own lagged value, but the
sumption should follow a random walk with drift. addition of one or more lagged values of income or
To see why, assume that the real interest rate r is of further lagged values of consumption did not
constant and known, and write ( ) in the form significantly add to the explanatory power of the
model. Hall examined the role of the number of
c
t+ I = {(1 + + r )} c t a + e<+1 (19) other lagged variables and discovered that lagged
stockmarket prices had predictive power for the
where the expectation at t of £t+\ is zero. Equation (
change in consumption, so that he concluded by
) is exact, but a convenient expres
formally rejecting the model. However, the over-
sion can be reached by factoring ct out of the right whelming impression was favourable, at least rel-
hand side, taking logarithms, and approximating. ative to expectations.
This gives Hall’s test procedures are attractive because
they do not depend on the properties of the income
lnc,+i = Inc, + g + v,+i (20) process, and focus only on consumption and its
where g is positive or negative as r is greater than lags. But robustness comes at the price of power,
or less than S, and the ‘innovation’ v,_ b like e,+, and later work has devoted considerable attention to
the joint properties of consumption and real
Consumer Expenditure 2089
income. Perhaps the natural route to modelling is to this should be ascribed to lack of degrees of free-
find a representation of real income as a stochastic dom; for example Blinder and Deaton consistently
process, typically as some sort of ARIMA. Once find that unanticipated changes in wealth affect
this is known, changes in income can be consumption and that anticipated changes do not.
decomposed into anticipated and unanticipated Third, even in a bivariate income- consumption
components using the standard forecasting formulae model, Campbell ( ) has found
from statistical time series analysis, so that it that the model is largely consistent with the time-
becomes possible to test whether consumption series evidence. Campbell recognizes the possibility
responds to one but not to the other. The random of time-series feedback from lagged consumption to
walk model seemed not to survive these tests so income, and models saving and the change in
well. Papers by Flavin ( ) and by Hayashi income as a bivariate vector- autoregressive system
( ) showed that, for United States data, con in which each series is regressed on lagged values
sumption is sensitive to anticipated changes in of both. The structure of this representation then
income, something that should not be the case in a turns out to be very close to what it would have to
thoroughgoing life-cycle model in which consumers be if the life-cycle rational expectations model were
are efficiently looking into the future. The correct. The conflict between Campbell’s results
phenomenon became known as the ‘excess sensi- and the excess sensitivity findings are presumably
tivity’ result, and was typically ascribed to the accounted for by the feedback from saving to
existence of a substantial number of consumers who changes in labour income, since his model is
wish to borrow against future income but are unable otherwise compatible with the earlier ones.
to do so. Such liquidity constrained consumers can Similarly mixed findings are also being uncov-
be expected to consume all their available income, ered from longitudinal panels that follow individual
so that their consumption will increase one for one households over time. In contrast to the situation
with all income changes, whether anticipated or not. with labour supply, there are few panel data in the
However, it is not clear that the excess sensi- United States that cover household consumption,
tivity finding is itself robust. First, it is becoming and most work has used the data on expenditure on
increasingly recognized that the problems of food that is contained in the Michigan Panel Study
econometric testing in the time-series models are of Income Dynamics (PSID). In an elegant paper,
more severe than had been generally supposed. The Hall and Mishkin ( ) found results that
time series of both consumption and income are were in accord with the
non-stationary, and it sometimes seems as if excess sensitivity results; there is a strong negative
hypothesis testing in models involving non- correlation in their data between changes in
stationary variables is like building on shifting consumption and changes in lagged income that is
sands; see Mankiw and Shapiro ( , ) inconsistent with the view that only surprises in
and Durlauf and Phillips ( ) for some of the income should matter. However, since in their data
problems. Second, there are a large number of changes in income are negatively correlated over
variables other than income which can affect con- time, a negative correlation between the lagged
sumption, so that, according to ( ), surprises in income change and the change in consumption can
wealth and in inflation should affect consumption, be interpreted as a positive correlation between
as should the level of real interest rates. Adding consumption changes and changes in actual income,
even a few of these variables reduces degrees of as predicted by the model of liquidity constraints.
freedom and diminishes the probability of being Hall and Mishkin conclude that these results would
able to reject the basic model. Both Bean ( ) be consistent with a model in whi ch about one fifth
and Blinder and Deaton ( ) find that time- of consumers were unable to borrow as much as
series models of consumption with several variables they wished. Once again, these results were
are more easily reconciled with the theory than are supported by other similar evidence, see in
the simple two variable models. Not all of particular Zeldes ( ) and
2090 Consumer Expenditure
Bemanke ( ), also using the PSID, Runkle parameters can be estimated by minimizing a qua-
( ), using data from the Denver Income Main- dratic form that can be thought of as a weighted
tenance Experiment, and Hayashi ( ) using sum of squares of the left-hand side of ( ); see
panel data from Japan. However, one potential Hansen and Singleton for details. If the model were
problem with the use of panels is the importance of true, this minimized value ought to be small, so that
errors of measurement in such data. There is a with more instruments than parameters, the
considerable body of evidence that PSID income generalized method of moments procedure yields a
changes are subject to very substantial reporting test-statistic that is diagnostic for model adequacy.
errors, see in particular Altonji ( ), Duncan Test procedures based directly on the Euler
and Hill ( ), and Abowd and Card ( ). conditions have several notable advantages. As was
Altonji and Siow ( ) have recently estimated the case for Hall’s procedures, few assumptions
a model similar to Hall and Mishkin’s using the have to be made about the structure of the income
PSID but with allowance for measurement error, process, and the model satisfies the best
and they find little conflict with the view that professional standards of seeking a direct confron-
consumption responds only to news. However, it is tation between theory and data with as few
unclear, at least to this reader, whether the approximations and supplementary assumptions as
acceptance of the model represents low power once possible. The model can also be readily extended to
errors of measurement are allowed for, or whether test the implications of the consumption asset
such errors really offer a plausible explanation for pricing model by repeating the tests using the
Hall and Mishkin’s findings. returns on a range of alternative assets, see ( )
A more formal line of research has attempted to above. Hansen and Singleton’s study, as well as
estimate the Euler condition ( ) directly, thus several others, find that the test statistics are much
avoiding the approximations made by Hall and by too large to be consistent with the theory and so
others. Rewrite (15) once more, this time as reject the intertemporal model implied by the Euler
conditions. Given the apparent superiority of the
(1 + 7"f+l)(c*+l) a ~ (l+^)(c;) a = £;+l (21) tests, these results have been accorded a great deal
of weight in the literature. However, while I believe
where, as before £,+! is orthogonal to any variable
that Hansen and Singleton’s work represents a very
known in period t or earlier. Hansen and Singleton (
important methodological advance, I think that
) proposed that the parameters in ( ) be
there are good reasons for not treating their results
estimated by a generalized methods of moments
as a definitive rejection of life-cycle theory. The
scheme. Suppose that we have two variables or
high level of technique that is embodied in deriving
instruments zu and z2t, each known at time t, so that
the Euler equation, not to mention the complexity of
we have Et(zit £, + j) 0 for i 1,2. We can
generalized methods of moments estimation, should
then estimate the two unknown parameters, a and
not blind us to the very simple, even simple-
A, by equating sample and theoretical moments,
minded, economic story that underlies these models.
and solving the two equations, i = 1, 2
Fundamentally, the Euler equation says that the
marginal rate of substitution between today’s and
tomorrow’s consumption should be equal to the rate
T '^[z„{(l +/V+iKo+i) a — (1 + £>XC<) °1] = 0-
1=0
of return on assets between today and tomorrow, so
(22) that estimation of the Euler equation, unlike the
Hall or excess-sensitivity tests, focuses very directly
If, as is typically the case, we have more than two on the relationship between real interest rates and
z-variables, then it will not generally be possible to changes in real consumption, and the model will not
choose the two parameters so that ( ) is exactly fit the data if there is no close association between
zero. Instead, the vector can be made as small as the two. And it only takes a very cursory
possible, or more specifically, the
Consumer Expenditure 2091
inspection of United States time-series data to see attempts to isolate the effect, careful studies have
that there is no such association. Real consumption tended to be unable to do so, or at least to find
grew in all but one year between 1954 and 1984, effects that are at all robust, or that can be replicated
while real after-tax interest rates were as often on even slightly different data sets or data periods.
negative as positive, so that consistency with the Economic theories or policy prescriptions that rely
theory would require that the pure rate of time on intertemporal substitution of consumption in
preference be negative. Nor is there any association response to changes in real interest rates are not
between the rate of growth of consumption and the well-buttressed by any solid body of empirical
level of real after-tax interest rates, see Deaton ( evidence.
) for some data. But this Another useful approach to testing the life- cycle
in no way reflects badly on the life-cycle theory. As model is to consider the stylized facts of the income
was made perfectly clear in the original Modigliani and consumption processes, and to see whether
and Brumberg papers, and it is the essence of the consumption behaves in the way that is to be
life-cycle model, aggregate consumption cannot be expected given the stochastic process of income.
expected to behave like individual consumption. Most people who have studied the time series for
Imagine a stationary economy with neither quarterly real disposable income in the United
population nor real income growth, in which there States agree that, like GDP, the series can be
is an excess of real interest rates over the rate of parsimoniously described by a model that is linear
pure time preference, and in which all consumers in its first two lags, i.e. an autoregression of the
have identical additive lifetime preferences with form
isoelastic subutility functions. In such an economy,
each individual has a consumption path that is y, = «i + + oc3y;_2 + ut (23)
growing over time, but aggregate consumption is
where u, is the income innovation, that part of
constant, a result that is achieved by old people
current income that cannot be anticipated from
dying and being replaced by young people who
previous observation of the series. Of course, real
have much lower consumption levels relative to
income is not a stationary series, but has a strong
their incomes. Unless we believe that there is some
upward trend, and there is considerable
automatic and immediate relationship between real
disagreement about the nature of this trend, what is
interest rates, time preference and growth, as would
the economic story behind it, and how it should be
obtain for example along a ‘golden age’ growth
modelled. One possibility is that real income
path, or unless we believe that consumers have
contains a deterministic time trend, so that there is
infinite lives, then there is no reason at all to
some sort of equilibrium growth path that cannot be
suppose that aggregate consumption should look at
altered by shocks to the economy. Shocks certainly
all like the life-cycle path of a representative
exist, but they cause only short term temporary
consumer. Representative agent models are
deviations from the path and have little or no long-
frequently useful, and it is not very constructive to
term temporary deviations from the path and have
dismiss macroeconomics because it requires
little or no long-term significance. In this view, Eq.
implausible aggregation assumptions. However, the
( ) applies to the deviations of income from trend,
life-cycle model provides a well-worked-out
not to income itself; equivalently, ( ) can be
account of individual and aggregate saving, an
modified by including a linear or quadratic time
account that is consistent with a good deal of other
trend. The alternative view is that there is no
evidence and theory, and it does not predict that
deterministic trend, but that the rate of change of
aggregate consumption should be consistent with
income is a stationary stochastic series with
the intertemporal optimization conditions for a
constant mean. In practice, this can look very like
single individual. The general question of the
the previous model, but there is the vital conceptual
effects of interest rates on consumption is
difference that in the second, non-deterministic
something that has remained in dispute for a long
model, there is nothing that will
time, and in spite of repeated
2092 Consumer Expenditure
ever bring income back to any deterministic path. In trend are themselves close to non-stationarity. Since
consequence, shocks to current income have both process are special cases of ( ) with
permanent and long-lasting effects. The version of ( the inclusion of a trend, and since each assumes
) that corresponds to this view can be written. parameter values that are very close to one another,
one might think (and hope) that the two models
would have very similar implications. But it is easy
(Or - y - 1) -1’} = p { ( y t-1 - ^-2) - y }
t + u,
to see this is not true. If permanent income is taken
(24)
as the annuity value of discounted future incomes,
then ( ) implies that any innovation u, to current
which can readily be seen to be a special case of (
income, because it will persist forever, and because
), though note that it is the case where the time
it can be expected to be followed by another
series possesses a unit root, or is stationary in first
infinitely persistent innovation of the same sign,
differences. For ( ) to be a valid specialization of
will change permanent income by more than the
( ), the quadratic equation with the a’s of ( ) as
amount of the innovation. Equation ( ) below gives
coefficients must have a unit root, hence the term.
the formula for the change in permanent income, if
Equation ( ) appears to fit the data well and the
the real interest rate is r, and if real income follows
parameter p turns out to be around 0.4, so that ( )
( ), see Flavin ( ) or Deaton ( ),
says that if the increase in real income in one
quarter is greater than its long term mean, then the
next quarter’s increase is also likely to be above the
mean, though by less. While the longterm mean of
+
the rate of change of income is constant and equal Ay? (25)
>■)
to y, good fortune (positive w’s) and bad fortune +1 -
(negative m’s) never have to be paid for (or made
up), since shocks are immediately consolidated into so that the change in permanent income is between
the income level, and growth goes on in the same one and a half and twice as large as the innovation
way as before, but from the new base. As Campbell in current income. By contrast, fitting the
and Mankiw ( ) have emphasized, the unit root deterministic model yields a much smaller effect,
model with the change in permanent income about one
exhibits shock persistence, while the deterministic fifth of the shock in measured income. Since
trend model does not; they suggest that shock consumption should change by about the same
persistence is what we should expect if supply amount as does permanent income, the fife- cycle
shocks predominate over demand shocks, with the model, together with the unit root formulation,
reverse in standard Keynesian models where shocks yields the uncomfortable prediction that con-
are typically attributed to fluctuations in aggregate sumption should be more variable than income over
demand. the business-cycle, not less. If the unit root model is
It turns out that it is almost impossible to tell correct, then the life-cycle and permanent income
these two processes apart on United States time- models can be rejected because they predict what
series data. Processes with unit roots are inherently they were designed to predict, that consumption is
difficult to tell apart from processes that are smooth relative to real income! The deterministic
stationary around deterministic trends, and the tests model gives no such problems, but as yet we have
that are available, Dickey and Fuller (), Phillips and no way of being sure that it is correct, unless, of
Perron ( ), certainly can- coinse we assume from the start that the life-cycle
not reject the hypothesis that ( ) is a valid spe- story is true.
cialization of ( ). Nor would the There is insufficient space in this essay to fol-
tests convince a low these issues further, or to discuss in detail the
believer in the deterministic model that income does evidence for and against the two formulations of the
not have a deterministic trend, even though it wi 11 stochastic process governing real income; the
readily be recognized that the deviations from interested reader can refer to Deaton ( ) and
Consumer Expenditure 2093
to the evidence on persistence in GDP presented by therefore have consequences for utility in subse-
Campbell and Mankiw ( ) and by quent periods, something that will be taken into
Cochrane ( ). There are a number of possible account by a forward looking consumer. In the case
solutions to these puzzles, and a great deal of of durable goods, the assumption of perfect capital
empirical work remains to be done, though I suspect markets effectively converts durable into non-
that the time-series data on income are durable goods, with the price of a unit of stock for
insufficiently long to allow the isolation of the very one period being the implicit rental or user cost, the
long-run properties on which the permanent income latter being defined as the sum of interest cost,
theory rests, see in particular the interesting paper depreciation, and expected capital loss, see for
by Watson ( ). example Diewert ( ) or Deaton
and Muellbauer ( , ch. 13).
However, various authors, Houthakker and
Variations on the Basic Theme Taylor ( ) perhaps being the first, have
extended the durable model to encompass ‘psychic’
There exist many interesting developments of the stocks which, like physical stocks, are augmented
basic life-cycle model, and I have space to discuss by purchases and diminished by depreciation, but
only a few. I have already mentioned the role of unlike physical stocks, can either increase or
liquidity constraints, and many people would take it decrease utility. The latter case covers habit
as transparent that many consumers do not have formation; consumption of an addictive good
access to unlimited credit, or else face borrowing generates pleasure now, but engenders a hungry
rates that are higher than the rates at which they can habit that is pleasureless but costly in the future.
lend. Of course, many consumers may be able to The model has been given an elegant formulation in
smooth their consumption without recourse to two papers by Spinnewyn ( , ).
borrowing, and the borrowing needs of many others As an example, see also Muellbauer ( ), take
may be met by the typically rather good markets in the utility function
home mortgages. For consumers who nevertheless
wish to borrow but cannot, then- spending will be T-t
closely tied to their actual income. For some of the ut = ^2 + d)~kv(ct+k - <xct+k-1)
( 26 )
theoretical and empirical literature on this point see k=0
Flemming ( ),
Dolde and Tobin ( ), and Hayashi ( ). where a is a measure of habit formation. Spinnewyn
The theoretical consequences of uncertainty about maximizes this function with respect not to c,, but
the date of death have been worked out by Yaari ( with respect to the ‘net’ quantities z, = c, — y.c, _ i,
), and as argued above, play a possibly and shows how to rewrite the budget constraint so
important part in the explanation of the saving as to define corresponding prices of the z s that
behaviour of the elderly. reflect not only market prices of the goods, but also
Another line of research is the possible relaxa- the costs of consumption now in terms of pleasure
tion of the assumption that preferences are foregone later. Under certainty, and looking ahead
intertemporally additive. Allowing all periods (or from time t, the full shadow price of an additional
ages) to interact with all other periods in an unit of consumption now is
unrestricted way, as in Eq. ( ), would be much too
general to be useful, and the search has been for T-t
proportional to permanent income, so that con- facts is done by this story. All the evidence quoted
sumption itself will adjust only sluggishly to above looks across different individuals at different
changes in permanent income with habits causing a points in their life-cycles, while the theory says that
drag. Other formulations of non-separable pref- the same individual will change his or her hours of
erences can be found in the papers by Kydland and work along with changes in the real wage over the
Prescott ( ), and by Eichenbaum et al. life-cycle. Time-series and panel data from the
( ), both of which are concerned to reconcile United States and time-series of cross-sections from
fluctuations in the aggregate economy with the the United Kingdom suggest that this is simply not
behaviour of a single representative agent. the case, see for example Mankiw et al. ( ),
Many of the models discussed so far assume that Ashenfelter and Ham
the consumption function actually exists, hence ( ), Ashenfelter ( ), and Browning et al.
taking for granted the essentially keynesian ( ). Even MaCurdy’s ( ) more postive
assumption that income is given to the consumer, study provides only very weak evidence, see in
and is not chosen together with consumption. A particular Altonji ( ). The joint consumption
considerable body of work has grown up in the last and labour supply story fares even less well than the
ten years that is concerned with the simultaneous labour supply model alone, and there is clear
choice of labour supply and consumption in a life- evidence that the way in which consumption and
cycle setting. Heckman ( ) and Ghez hours fluctuate over the cycle (sometimes together
and Becker ( ) are among the pioneers of this and sometimes in opposite directions) is not con-
approach. Unlike the price of goods, the price of sistent with the way in which they move together
leisure tends to show a systematic pattern over the over the life-cycle. The attempt to provide a unified
life-cycle, so that, if consumers are free to choose theory of business and life-cycles has been an
their hours, and if they can freely borrow and lend interesting and important one, but it cannot be said
so as to transfer resources between periods, it will to have been successful.
pay them to work hardest during those periods in I have been somewhat cavalier in my treatment
their life-cycles when the rewards for doing so are of aggregation issues, choosing to emphasize them
highest, and to take their life-time leisure when when I believe them to be important, for example in
wage rates are low and leisure is cheap. There is the fitting of Euler conditions, and ignoring them
superficial evidence in favour of this story, and when it has been convenient to do so. Attempts to
Ghez and Becker, followed by Smith ( ) and do better than this have not been notably successful.
Browning et al. ( ), all find that workers tend Formal conditions that allow aggregation in
to work longest hours in middle age when wage consumption function models are typically too
rates are high and the lowest number of hours at the restrictive to be useful, so that, in theory, changes in
beginning and end of the economically active life, the distribution of income should have detectable
when wage rates are relatively low. Consumption effects on aggregate consumption. However,
also tends to peak in the middle age, and this can be attempts such as that by Blinder ( ) to link the
brought into the story by assuming that distribution of income
consumption and leisure are complements, so that to consumption have not been notably successful,
the lack of leisure in middle age is partially perhaps because the income distribution is not
compensated by high levels of expenditure. This variable, or because it changes smoothly enough
elegant fable has also been made much of in over time to preserve a stable relationship between
equilibrium theories of the business cycle, which average income and average consumption. There is
accounts ‘unemployment’ as a voluntary vacation also an issue of aggregation over goods in order to
taken when the real wage is low and leisure is on define real consumption at all, even at the level of
sale, see in particular Lucas and Rapping ( ) the individual agent. In the derivation in section “
and Lucas ( ). ” above,
There now exists a growing volume of literature I made the convenient assumption that within-
that shows just how much violence to the period preferences were homothetic, so that an
Consumer Expenditure 2095
index number of real consumption could be formed. consumer behaviour. Much practical work con-
But homotheticity, although very convenient for tinues in the tradition of King, paying little attention
studying the consumption function, is very to formal theory, concerning itself instead with
inconvenient for studying the allocation of finding empirical regularities. For a firm studying
expenditure among goods since it implies that the the demand for its product, or for anyone interested
within-period total expenditure elasticities of each in establishing a single price elasticity, this
good are all equal to unity. Fortunately, there are probably remains the best approach; the major
aggregation results of Gorman’s ( ), see also developments in econometric technique and
Deaton and Muellbauer ( , eh. 5) for an empirical formulation have not been much
exposition that allows us to have the best of both concerned with, or relevant to, these very practical
worlds, at least if we remain with intertemporally questions. The pragmatic approach (the term comes
additive preferences. If the single-period indirect from Goldberger’s famous but unpublished ( )
utility function iJ/(x, p) takes the form known as the study ), probably reached its peak with the
‘generalized Gorman polar form’ publication of Richard Stone’s great monograph,
(Stone ), and much is still to be learned by a careful
' K X , P) = F[x/a(p)\ + b(p) ( 28 ) study of Stone’s procedures for measuring income
and price elasticities. However, in this essay, I shall
where a(p) and b(p) are linearly homogeneous
follow the literature, and follow its more
functions of prices and F(-) is monotone increasing,
methodological approach.
then the real expenditure index x/a(p) can serve as
The theory outlined in section “
an indicator of real consumption just as in the
” above suggests that the
homothetic case. This happens because when the demand functions of an individual consumer can be
consumer chooses the allocation of life-time derived by maximizing a utility function v(q)
expenditure over periods so as to maximize the subject to a budget constraint p ■ q = x, where x is
intertemporal sum of terms like ( ), the total expenditure. In the analysis here, x is chosen at
b(p) terms are irrelevant. However, the intraperiod some previous level of decision making, but
demand functions that correspond to ( ) traditionally it is treated as if it were a datum by the
do not display unitary elasticities unless the b(p) is consumer, the utility maximization yields a vector q
identically equal to zero, and quite general func- that is some function g(x, p), say, of total
tional forms are permitted. There is therefore no expenditure and prices. These demand functions
real conflict between the analysis of the consump- cannot simply be any functions, but must have
tion function on the one hand, and the analysis of certain properties as a result of their origins in
demand on the other. It is to the latter that I now utility maximization. Obviously, the total value of
turn. the demands should be equal to total outlay x, the
‘adding-up’ property, and it must be true that
proportional changes in x and in p do not have any
Theoretical and Empirical Demand effect on quantities demanded, the ‘homogeneity’
Functions or ‘absence of money illusion’ property. Somewhat
less obvious are the famous symmetry and
Demand functions are the relationships between the negativity properties. These apply to the Slutsky (
purchase of individual goods, income or total ) matrix, S, the typical element of which is
expenditure, prices, and a variety of other factors defined as
depending on the context. Economists have
attempted to make empirical links between demand Sij = dqi/dpj + qpjqj/dx. (2 9)
and price since Gregory King’s famous demand
curve for wheat, see Davenant ( ), As any intermediate text shows, see for example
and since the middle of the 19th century, there has Deaton and Muellbauer ( , ch. 2), the
been a great development in the theory of Slutsky matrix must be symmetric and negative
2096 Consumer Expenditure
semi-definite. The symmetry property is not readily him to obtain parameter estimates and a good fit to
turned into simple intuition; negativity implies that interwar British data for a six commodity disag-
the diagonal elements of the matrix are nonpositive, gregation of expenditures. This was a major
a proposition often referred to as ‘the law of breakthrough, not only in demand analysis, but also
demand’. The four properties, adding-up, in applied econometrics in general. Indeed, much of
homogeneity, symmetry and negativity, essentially demand analysis for a decade or so after Stone’s
exhaust the implications of utility maximization, so paper consisted of applying better algorithms and
that any empirical demand functions that satisfy faster computers to the fitting of Stone’s model to
them can be regarded as having been generated by different data sets.
utility maximization, or by rational choice, with The linear expenditure system offers a demand
‘rational’ defined, following Gorman ( ), as model for a system of, say n goods, and requires
‘having smooth strictly quasi-concave only 2« — 1 parameters, a degree of parsimony that
preferences, and being greedy’. was very important in allowing the model to be
Stone ( ) was the first to attempt to use estimated on very short time-series data. However,
this theory directly to confront the data. He started such economy brings its own price, and the linear
from a (general) linear expenditure system of the expenditure system is very restrictive in the sort of
form behaviour that it can allow. In particular, and
pathological cases apart, the model cannot allow
Pfli = X! avPj + b‘x (3
°) inferior goods (goods the demand for which falls as
j total outlay increases), nor can it allow goods to be
complements rather than substitutes. (As defined by
where a,, and bt are unknown parameters. Stone Hicks ( ) goods i and
showed that, in general, the system ( ) does not j are complements if the (/', /jth term in the Slutsky
satisfy the four requirements, but will do so if, and matrix is negative, so that the utility compensated
only if, the parameters are restricted so that the cross-price response of i to an increase in the price
model can be written in the form of j is positive.) Normal (non-inferior) goods that
are substitutes for one another may be the most
Pfli = Pill + lh( x
P ■ y) (31) important case, but they do not encompass every-
thing that we might want to study. The linear
with the /(-parameters summing to unity. In this expenditure system also implies that the marginal
form the model is known as the linear expenditure propensity to consume each good is the same no
system. As Samuelson ( -8) and Geary matter what is the total to be spent, and many cross-
( -50) had earlier shown, the utility function section studies of household budgets have suggested
corresponding to ( ) has the form that this is not in fact the case.
Unfortunately, it is quite difficult to write down
u
= X) X) AM?/ “ ?/) > utility functions that will lead to more general
demand functions than those of the linear expen-
(32) diture system, nor is there any obvious way of
i generalizing Stone’s procedure of writing down
functions and making them consistent with the
sometimes refened to (somewhat inappropriately) theory. Progress was only really made once applied
as the Stone-Geary utility function. It can be demand analysis started using ‘dual’ formulations of
thought of as a sum of Bernoulli utility functions of preferences to specify demands. In the demand
the quantity of each good above the minimal y’s. context, duality refers to a switch of variables, from
Stone’s achievement lay not in deriving the quantities to prices, so that utility becomes a
demand functions, but in thinking to estimate them. function, not directly of quantities consumed, but
The demand functions ( ), even if fitted to the data indirectly of prices and total expenditure. This
by least-squares, require non-linear optimization, indirect utility formulation is given by
and Stone invented a simple and not very efficient
scheme, but one that allowed
Consumer Expenditure 2097
the function ij/(x, p), already used above, and this is not just to generate demand functions serendipi-
simply the maximum attainable utility from total tously, but to generate good and useful ones
outlay x at prices p. Since ij/(x, p) = u, and the deliberately.
function is monotone increasing in x, it can be There are many examples that could be cited
inverted to give x c(u, p), known as the ‘cost from the literature. One of the most widely used in
function’, since it gives the minimum necessary the translog model which was first proposed in
cost that is required to reach the utility level u. By a 1970 by Jorgenson and Lau, see Christensen et al.
theorem usually attributed to Shephard ( ) ( ) for a convenient reference. To derive the
and to Uzawa ( ), these two functions contain translog, write the indirect utility function in terms
a complete representation of preferences; provided of the ratios of prices to outlay, r = p/x, and
preferences are convex, and provided the functions approximate the indirect utility function as a second
satisfy homogeneity and convexity (or concavity) order polynomial in the logarithms of r. Application
conditions, preferences can be reconstructed from of Roy’s identity yields demand functions in which
knowledge of either of the two functions. It is also the budget share of each good is the ratio of two
very easy to move from either cost or indirect functions, each of which is linear in the logarithms
utility functions to the demand functions. For the of the price to outlay ratios. Estimation of these
indirect utility function, we have Roy’s identity rational functions, like estimation of the linear
(Roy ). expenditure system, requires the use of non-linear
maximization techniques. A related model, the
q = - V p i j/{x,p)/\i/x{x,p) = g ( x, p ) (3 3) ‘almost ideal demand system’ (AIDS) has been
proposed by Deaton and Muellbauer ( ),
which immediately yields demand functions from
and I use this to illustrate some of the
preferences in a form that are suitable for estima-
issues that arise with the current generation of
tion, while for the cost function, we have
demand models. The AIDS is specified by the
Shepard’s Lemma ( ),
logarithm of its cost function which takes the form
q = Vpc(u,p) = Vpc[il/(x,p),p] = g ( x, p ) (3 4)
In c{u,p) = ao + o^ln pk + 0.5
where, as in ( ), the operator V denotes a vector of k k
partial derivatives. X ^Jkm^ /frln Pm
m
Demand analysis now had a high road to spec-
ification. Think of some quasi-convex decreasing
+ u exp P* j> (35)
function of the ratios of price to total outlay and call
it an indirect utility function, or think of some
function of utility and prices that is increasing in its so that, applying Shephard’s lemma and
arguments and linearly homogeneous and concave rearranging, we have demand functions
in prices and call it a cost function. Either way, and
with only simple differentiation, new (and Ptq jx EE Wi
sometimes) interesting demand functions will be
generated. Alternatively, and even more = ai + PM X /P) + 7'7ln Pj (36)
j
importantly, it is possible to use theory to aid and
check out empirical knowledge. If it is known that where P is a linearly homogeneous price index, the
the marginal propensity to spend on food is a form of which can readily be inferred from ( ). The
declining function of total expenditure, or if it is parameters of the model must satisfy certain
thought likely that some goods do not depend very restrictions if ( ) is to be a proper (log) cost
directly on the prices of other goods, it is relatively function, and ( ) a proper system of demand
straightforward to find out what preferences (if any) functions. The matrix of y-parameters can be
will yield the result. It becomes possible,
2098 Consumer Expenditure
taken to be symmetric in ( ), but must be so in Savin ( ) and Bera et al. ( ) for further
( ), and its rows and columns must add to zero evidence.
for the homogeneity and adding-up properties to be The AIDS model, like the translog and several
satisfied. The /(-parameters can be positive or others, e.g. Diewert’s ( ) ‘generalized
negative, with positive values indicating luxury Leontief’ system, fall into the class of ‘flexible
goods, and negative values necessities. The main functional forms’. This criterion of flexibility, first
advantage ofthe AIDS model in time-series appli- proposed by Diewert ( ), is an important guar
cations is that the price index P can typically be antee that the model is sufficiently richly parame-
approximated by some known price index selected trized so as to allow estimation of what are thought
before estimation, so that the demand system is to be the main parameters of interest, typically the
linear in its parameters. In consequence, it can be total expenditure elasticities, and the matrix of own
estimated by ordinary least squares on an equation and cross-price elasticities. A ‘second order’
by equation basis, at least if the symmetry of the y- flexible functional form is one that has sufficient
mattix is ignored. The homogeneity restrictions can parameters, so configured, that it is possible to set
be tested equation by equation using a t- or F-test, the value of the function, and of its first and second
and while imposing or testing symmetry requires an partial derivatives to any arbitrary set of
iterative procedure, estimation can be done by (theoretically permissible) values. By applying
straightforward iterated restricted generalized least- Roy’s identity or Shephard’s lemma, it is clear that
squares, see Barten (1979) or Deaton ( ) for a cost or indirect utility function that is a second
further discussion. order flexible functional form will yield demand
The results of estimating the AIDS model are functions that are first-order flexible, so that it is
sufficiently similar to those from other models and possible for estimation to yield any set of price and
other studies, see e.g. Barten ( ), Deaton expenditure elasticities that are consistent with
( ), Christensen et al. ( ), and many utility theory. For empirical work, such a guarantee
others, that perhaps they can be taken as represen- is important, because it ensures that the elasticities
tative. What typically seems to happen is that the are being measured, not assumed. Contrast, for
homogeneity restrictions appear not to be satisfied, example, the linear expenditure system ( ) with the
so that in the application of AIDS to British data, AIDS model ( ). Both could be fitted to the same set
Deaton and Muellbauer found, for example, that the of data, and the parameter estimates of each could
F-test for transport had a value of 172 compared be used to generate a complete set of expenditure
with the 5 per cent critical value of 4.8. Results on and price elasticities. But the linear expenditure
symmetry from AIDS and other systems are more system is not a flexible functional form, and so its
mixed, and it now seems clear that testing estimated elasticities are not independent of one
symmetry is not usually possible given the amount another, as is apparent from the fact that there are
of data typically available in time series, or put 2n — 1 parameters compared with the total number
more positively, that there is no convincing of potentially independent elasticities, which is (n
evidence against symmetry. The difficulty is that — 1)(1 + n/l). (There are n 1 independent demand
symmetry involves a set of restrictions across equations, each of which has an expenditure
different equations, so that unlike homogeneity, elasticity, and n price elasticities; however, one
which involves tests within each equation, exact, price elasticity per equation is lost to homogeneity,
small sample tests are not available. Researchers and symmetry imposes a further (n l)(n 2)/2
have therefore fallen back on asymptotically valid constraints.) The linear expenditure system does not
tests, and it turns out that these work very badly for therefore measure all the price and income
the usual sort of samples, especially when there are elasticities, but determines them by a mixture of
more than a very small number of goods in the measurement and assumption, the main assumption
demand system. The papers by Laitinen ( ) being that of additive preferences, see Deaton () for
and Meisner ( ) further details. The AIDS, by contrast, has exactly
first established the problem, see also Evans and
Consumer Expenditure 2099
the right number of parameters to allow for inter- important properties that are not assured. First, there
cepts and a full set of elasticities, so that when it (or is no guarantee that the predicted budget shares will
the translog, or the generalized Leontief) is necessarily lie between zero and one, so that there
estimated, so is the full set of elasticities. may be regions of price space in which the
Being able to do this is a great step forward in estimated model yields nonsensical predictions.
methodology, but just as the linear expenditure Second, there is no way that the AIDS can be
system probably asks too little of modem data, guaranteed to have a negative semi-definite Slutsky
(although not of the data available to Stone and the matrix for all prices, at least not without restricting
early pioneers of the systems approach), the second- parameters to the point where the model ceases to
order flexible functional forms probably ask too be a flexible functional form. The parameters could
much, or equivalently, put too little structure on the be chosen so as to satisfy negativity for some
problem. The consequences show up in large particular combination of prices and outlay, but
standard errors, a high frequency of apparently there will be no guarantee that the law of demand
chance correlations, and a lack of robustness to will be satisfied elsewhere. In the translog model, it
functional form changes within the class of flexible is possible to impose a restriction that guarantees
functional forms, in other words, in all the standard negativity everywhere, but the model with the
symptoms of over-parametrization. These problems restriction has the property that all estimated own
are particularly acute for the measurement of price price elasticities must be less than minus one,
elasticities, because in most time-series data, independently of whether this is in fact true, and it
commodity prices tend to move together with almost certainly is not, see Diewert and Wales (
relatively little variation in relative prices. And ). A demand system
although the focus of most research on demand is described as ‘regular’ if it has anegative definite
analysis over the last thirty years has been on the Slutsky matrix and predicts positive demands, and
estimation and testing of price responses, there is several empirical studies, see e.g. Wales ( )
certainly no consensus on what numbers, if any, are for one of the first, found that estimated flexible
correct. Estimates obtained from the linear functional forms were not regular over disturbingly
expenditure system are not credible because they large regions of even the parameter space used to
are forced to satisfy an implausibly restrictive estimate them. Caves and Christensen ( ), and
structure, while those from flexible functional forms later Barnett and Lee ( ) and
are not credible because the data are not informative Barnett et al. ( ), investigated the same prob
enough to supplement the lack of prior structure. lem theoretically by taking a known utility function,
Some intermediate forms are clearly required. choosing the parameters of flexible functional forms
One of the attractions of flexible functional to match its level and derivatives at a point, and
forms is their ability to approximate quite general then mapping out the regions of price space in
forms for preferences. However, the models so far which the systems remained regular. The results at
considered offer only approximations, and there is least for the translog and the generalized Leontief
no guarantee that they have satisfactory global model, were not good.
properties. Partly this is the standard problem that a These regularity issues may seem of limited
fitted model will be forced to give a reasonable importance in practice, but this is far from being the
account of the data over the sample used for case. One of the major reasons for being interested
estimation, but may predict very badly elsewhere. in complete empirical demand systems is to be able
But there are other deeper issues. Taking the AIDS to examine the consequences of price changes,
as an example, estimation of ( ) subject to sym particularly of price changes that follow changes in
metry and homogeneity will produce a system of government policy. The United States relies
estimated demand functions that will satisfy adding- relatively little on indirect taxation as a source of
up, homogeneity and symmetry for all values of x public finance, but such is not the case in most of
and p. However, there are two other Europe, and the vast majority of developing
countries maintain complex systems of
2100 Consumer Expenditure
price wedges, particularly for foods and for agri- expenditure Eq. ( ) can clearly be justified as a
cultural production. The effects of such systems Taylor approximation to any set of homogeneous
cannot be predicted without good information on demand functions, and yet the imposition of only
how demands respond to price changes, nor can symmetry generates the demand system ( ) which
reforms be intelligently discussed. However, esti- comes from the additive utility function ( ).
mated demand systems that are not regular are not a Additivity is not imposed, but linear expen
great deal of help. All of the theory of welfare diture systems are only symmetric if they are
economics, of consumer surplus, of optimal taxation additive. Similarly many flexible function forms are
and of tax reform, assumes that demand behaviour only globally regular if they are homothetic, see for
is generated by utility maximization at the example, Blackorby et al. ( ). Several
individual level, and implementation without recent studies have proposed alternative ways of
regularity risks internal contradiction. For example, making functional approximations. Gallant ( ) has
if compensated demand functions slope upwards, proposed using Fourier series approximations while
the government can generate a deadweight gain by Barnett ( ) has suggested that
imposing a distortionary tax. Of course, it may not Laurent series can be used to generate demand
be the empirical work that is wrong, but the theory models with good properties. Gallant’s models are
that we used to try to model behaviour. If so, the even more heavily parametrized than standard
estimated demand functions are still not useful, flexible functional forms, and there must be some
since we now have no idea what to do with them. question as to the suitability of trigonometrical
But I doubt that evidence goes so far; it is not that functions for demand functions. Barnett’s ‘miniflex
behaviour itself is irregular, but that we have not yet Laurent’ model does not use the full flexibility of
found a good modelling strategy that contains a the Laurent series, but appears to have quite good
reasonable amount of prior information to approximation and regularity properties in practice,
supplement the paucity of data, and at the same time see Barnett and Lee ( ) and Barnett
can deliver global regularity if it is warranted by the et al. ( ); even so, its estimation is complex,
evidence. and many of the parameters have to be estimated
A number of interesting experiments are cur- subject to inequality constraints.
rently under way that involve new modelling A second line of current research has abandoned
techniques. One possibility is that the Taylor series the standard approach of econometric analysis,
expansions that motivate most flexible functional taking instead a completely non-parametric
forms are themselves inadequate to the task. In approach. Since many of the difficulties discussed
particular, Taylor approximations lose their ability above arise from choice of functional form, it is
to approximate if they are also asked to possess useful to ask how far it is possible to go without
other properties of the functions that they are assuming any functional form at all. We know from
approximating. For example, we might want to test standard revealed preference theory that two
whether or not preferences are additively separable, observed vectors of prices and quantities can be
as in the linear expenditure system. One strategy inconsistent with utility maximization; if bundle one
would be to write down some second-order is chosen when bundle two is available, so that
approximation to preferences, estimate the resulting bundle one is revealed preferred to bundle two, then
demand model, and then test whether or not the no subsequent choice should reveal bundle two to
conditions imposed on the demands by additivity be preferred to bundle one. Before embarking on
are satisfied. But this will not work in general, the exercise of fitting some specific utility function
because there may be no additive system of demand to any finite collection of price and quantity pairs,
equations that has the precise functional form one might then ask whether the collection is
demanded by the approximation. The same conceivably consistent with any set of preferences.
phenomenon is well illustrated by Stone’s If it is, then contradictions between an estimated
derivation of the linear expenditure system itself. system and the theory must be a matter of
The original general linear inappropriate functional form. The
Consumer Expenditure 2101
conditions for utility consistency of a finite set of problems of measurement. For many purposes, the
data were originally derived by Afriat ( ), who theory is only useful if it is capable of delivering a
proposed a condition called cyclical consistency. description of the data that is reasonably
Much later Varian ( ) not only provided an parsimonious. There is also something rather simple
accessible and clear account of Afhat’s results, but minded about non-parametric techniques that tends
also recast the cyclical consistency condition into a to be disguised by the sophisticated and elegant
‘generalized axiom of revealed preference (GARP)’ expositions that have been given them by Varian
that runs as follows. A bundle q is strictly directly and others. Consider a very simple theory that says
revealed preferred to a bundle q if p'q' > p'q, while variable x should move directly with variable y as,
q‘ is revealed preferred to q, if there exists a for example, in the Euler Eq. ( ) above which says
sequence, j, k,..., m such that p'q1 > p‘q>, p’q* > that, under certainty consumption should grow from
p>qk,..pmqm > pmq, so that c{ is directly or indirectly period t to t + 1 if and only if the real interest rate
(weakly) revealed preferred to q. GARP is satisfied from t to t + 1 is greater than some fixed constant.
if for all q revealed preferred to cj, it is not true that A non- parametric test on a finite set of data would
cj is strictly directly revealed preferred to q\ and accept the theory if, in fact, x, and y always did
given GARP the data can be rationalized by a move together, and reject it if x and y ever moved in
continuous, strictly concave, and non-satiated utility opposite directions. That such testing procedures
function. Differentiability can also be ensured by a are widely employed in the press and by the
sight strengthening of GARP, see Chiappori and uninformed public is no reason for treating them
Rochet ( ). GARP is readily tested for any seriously in economics.
given set of data by checking the pairwise inequal- I have so far discussed the formulation and
ities and using a simple algorithm provided by estimation of demand functions, meaning the rela-
Varian to map out the patterns of indirect revealed tionships between quantities, outlay, and prices, and
preference. Repeated applications of the method to this has been the topic of most applied demand
time-series data have nearly always confirmed the analysis over the last thirty years. However, there is
consistency of the data with the theory. In an older tradition of demand analysis, in which the
retrospect, it is clear that violations of GARP cannot object of attention is household budget data, and
occur unless some budget lines intersect, so that if, this literature has recently been enjoying something
over time, economic growth has resulted in the of a revival. Since household budget data typically
aggregate budget line moving steadily outward with come from a cross-section of households over a
little change in slopes, GARP is bound to be short period of time, usually within a single year,
satisfied. (However, post-war United States data prices are treated as common to all sample points,
budget planes do occasionally intersect, and Bronars so that the focus of attention becomes the
( ) has recently shown that hypothet relationship between demand and outlay and the
ical demands generated by selecting random points influence of household composition on the pattern
on the actual budget lines would more often than not of household expenditures. The oldest, and perhaps
fail GARP. ) only law of economics, Engel’s Law that the share
The contradictions between the parametric and of food in the budget declines as total outlay
non-parametric approaches can perhaps be resolved increases, comes from Engel’s (1857, published )
by thinking of the latter as a modelling technique study of Belgian working-class families, and early
that uses a very large number of parameters, so that empirical studies of demand were almost inevitably
the failure of the parametric models to fit theory to based on household surveys (see Stigler ( ) for
data can be thought of as failure to parametrize the a mas
models sufficiently richly. But I have already argued terly review). The modem study of Engel curves,
that these models already have too many the relationships between expenditure and total
parameters, and adding more would only exacerbate outlay, begins (and almost ended) with Prais and
the already serious Houthakker ( ). Prais and Houthakker studied
2102 Consumer Expenditure
the shapes of Engel curves, the relationship between quadratic expenditure system, see Howe et al. ().
demand and households, particularly in relation to Given Gorman’s results, and the empirical
the choice of quality, a topic that has subsequently success of the Working form, it and its quadratic
been unjustly neglected. The functional forms for generalization deserve wide use in the analysis of
Engel curves that Prais and Houthakker examined budget studies. There is also accumulating evidence
became the staple menu for most subsequent that such forms are indeed necessary. Thomas (
studies, even though only one of their forms, the ), in a wide-ranging examination
linear Engel curve, is capable of satisfying adding- of household survey data from developing coun-
up, and the linear form typically performs very tries, has shown that Engel’s Law itself does not
badly on the data. Since 1955 a number of other appear to hold among the very poor, so that, in
Engel curves have been proposed, notably the many cases, the share of the budget devoted to food
lognormal Engel curve of Aitchison and Brown ( at first rises with total outlay before falling in
), and Leser’s conformity with the Law.
( ) revival of the form suggested much earl i er Prais and Houthakker also proposed a much-
by Holbrook Working ( ). Working’s form, used formulation for the effects of household
which apparently escaped the attention of Prais and composition on behavioiu. It can be written
Houthakker, makes the budget share of each
commodity a linear function of the logarithm of AwMO) = f i { x / m 0 {a ) }
total outlay. The formulation is particularly useful,
(3 8)
for not only is it capable of accounting for most of
the curvature that is discovered in empirical Engel where a is a vector of household demographic
curves, but it is also consistent with utility theory, characteristics (perhaps a list of numbers of people
and corresponds to the case where the welfare in each age and sex category) and /M, and m0 are
elasticity of the cost of living is independent of scalar valued functions known as the ‘specific’ and
income. Gorman ( ) has provided a general ‘general scales’ respectively. In this literature,
characterization theorem for Engel curves of the scales are devices that convert family structure into
form numbers of equivalent adults, so that a family of
two adults and two children might be two
PMi = X!atk (?) ^ (*) (37)
equivalent adults for theatre entertainment, three
k equivalent adults for food, and six equivalent adults
for milk. The general scale is supposed to reflect the
and has shown that the £*(•) functions can be overall number of equivalent adults, so that the
powers of .r (polynomial Engel curves), or .r Prais and Houthakker model is a simple
multiplied by powers of log x (Engel curves relating generalization of the idea that per capita demand
budget shares to powers of the logarithm of outlay ), should be a function of per capita outlay. Barten (
or have trigonometric forms. This last form includes ), in a very important paper, took up the
Fourier representations of Engel curves, while the Prais-Houthakker idea of specific scales, but
first two allow Taylor or Laurent expansions for the assumed that the arguments of the household utility
expenditure/outlay and for the share/log-outlay function were the household consumption levels
forms. The Working-Engel curve is the first each deflated by the corresponding specific scale.
member of Gorman’s ‘share to log’ class, and the The consequences of Barten’s formulation are
theorem tells us that we may add quadratic or similar to those of Prais and Houthakker, but
higher order terms to improve the fit. However, embody the additional insight that changes in
Gorman’s paper contains a remarkable result; the family composition affect the effective shadow
matrix of the a-coefficients in ( ) prices of goods, so that demographic changes will
has rank at most equal to three. In consequence, the exercise, not only income, but also substitution
share to log and log-squared Engel curves are as effects on the pattern of demand. The story is often
general as any, as are the Engel curves of the summarized by the phrase, ‘if you have a wife and
child, a penny bun costs three-pence’,
Consumer Expenditure 2103
quoted in Gorman ( ), but the really far- also at Nicholson ( ). Poliak and Wales ( )
reaching substitution effects of children are prob- weigh in on the opposite side, and claim that it is
ably on time use and labour supply, particularly of impossible to measure child costs from expendi-
women. ture data. My own position is argued in Deaton
Since household surveys typically contain large and Muellbauer ( ); there are certainly grave
samples of households, there is less need for theory problems to be overcome in moving from the
to save degrees of freedom, and it is possible to analysis of household survey data to the measure-
estimate quite general functional forms that link ment of the costs of children, and it is clear that
expenditures to household composition patterns and identifying assumptions must be made that are
then to interpret the results in terms of the various more severe and more controversial than those
models. In addition, neither the Prais-Houthakker required, for example, to go from demand func-
nor the Batten model seem to yield easily tions to consumer surplus. But that does not mean
implemented functional forms, e.g. linear ones, nor that it is not possible for such assumptions to be
is it clear that either model is even identified on a proposed and to be sensibly discussed.
single cross-sectional household survey in which all
prices are constant, see for example Muellbauer ( See Also
) and Deaton
( ). However, some empirical results for the ^ Beauests 3-HQ. the Life
two models can be found in Muellbauer ( , Cvcle
) and in Poliak and Wales ( , ) who ► Consumer Expenditure lTe
also examine Gorman’s ( ) extension of and. the State of Research)
Barten’s model in which additional people are ^ ) ernann Tiieorv
supposed to bring with them fixed needs for par- ► Eder Equations
ticular commodities. The fixed needs model is close ► Eiedman, Milton •• 2 200
to the formulation proposed by Rothbarth ( ) for ► Rational Expectations
measuring the costs of children.
Rothbarth pointed out that there are certain com- Bibliography
modities, adult goods, that are not consumed by
children, so that when children are added to a Abowd, J.M., and D. Card. 1985. The covariance structure
household, the only effects on the household’s ofearnings and hours changes in three panel data sets.
consumption of adult goods will be the income Princeton University, mimeo.
Afriat, S.N. 1967. The construction of a utility function
effects that reflect the fact that, with unchanged from expenditure data. International Economic Review
total resources, the household is now poorer. 8: 67-77.
Deaton et al. ( ) have recently attempted to Aitehison, J., and J.A.C. Brown. 1957. The lognormal
test Rothbarth’s contention, and in their Spanish distribution. Cambridge: Cambridge University Press.
Altonji, J. 1986. Intertemporal substitution in labor
data it seems possible to identify a sensible group of supply: Evidence from micro data. Journal of Political
adult goods, the expenditure on each of which Economy 94: S176-S215.
changes with additional children in the same way as Altonji, J., and A. Siow. 1985. Testing the response of
they change in response to changes in outlay. consumption to income changes with (noisy) panel
data. Industrial Relations Section Working Paper No.
Studies of the effects of family composition on 186. Princeton University, mimeo.
household expenditure patterns have frequently Ando, A., and F. Modigliani. 1963. The life-cycle hypoth-
been concerned, not only with estimating demands, esis of saving: Aggregate implications and tests. Amer-
but also with attempts to measure the ‘cost’ of ican Economic Review 53: 55-84.
Ashenfelter, 0.1984. Macroeconomic analyses and micro-
children. It would take me too far afield to do economic analyses of labor supply. Presented to
justice to this topic here. Readers interested in this Camegie-Rochester Conference, Bal Harbor, Florida,
controversial area should perhaps start with November 1983.
Rothbarth ( ), who in a few pages makes a Ashenfelter, O., and J. Ham. 1979. Education, unemploy-
ment, and earnings. Journal of Political Economy 87:
very simple and quite convincing case, and look
S99-116.
2104 Consumer Expenditure
Barnett. W.A. 1983. New indices of money supply and the Caves, D.W., and L.R. Christensen. 1980. Global properties of
flexible Laurent demand system. J o u r n a l of flexible functional forms. A m e r i c a n E c o n o m i c
B u s i n e s s a n d E c o n o m i c S t a t i s t i c s 1:7 23. R e v i e w 70: 422-432.
Barnett. W.A.. and Y.W. Lee. 1985. The global properties of Chiappori. P.-A.. and.I.-C. Rochet. 1987. Revealed prefer-
the miniflex Laurent, generalized Leontief, and translog ences and differentiable demand. E c o n o m e t r i c a 55:
flexible functional forms. E c o n o m e t r i c a 53: 1421- 687-691.
1437. Christensen, L.R.. D.W. Jorgenson, and L.J. Lau. 1973.
Barnett. W.A.. Y.W. Lee, and .VI. Wolfe. 1985. The three Transcendental logarithmic production frontiers.
dimensional global properties of the miniflex Laurent, R e v i e w o f E c o n o m i c s a n d S t a t i s t i c s 55: 28-
generalized Leontief, and translog flexible functional 45.
forms. J o u r n a l o f E c o n o m e t r i c s - . 3-31. Cochrane, J.H. 1986. How big is the random walk in GNP?
Barten. A.P. 1964. Family composition, prices, and expen- Department of Economics. University of Chicago,
diture patterns. In E c o n o m e t r i c a n a l y s i s f o r processed.
n a t i o n a l e c o n o m i c p l a n n i n g , ed. P.E. Hart, G. Davenant, C. 1699. Essay upon the probable methods of making
Mills, and J.K. Whitaker. London: Butterworth. a people gainers in the balance of trade. London.
Barten. A.P. 1969. Maximum likelihood estimation of a Davidson. J.E.H.. et al. 1978. Econometric modelling of the
complete system of demand equations. E u r o p e a n aggregate time-series relationship between consumers
E c o n o m i c R e v i e w 1: 7-23. expenditure and income in the United Kingdom.
Bean. C.R. 1985. The estimation of surprise models and the E c o n o m i c J o u r n a l 88: 661 692.
surprise consumption function. Centre for Eco- nomic Davies, J.B. 1980. Uncertain lifetime, consumption and
Policy Research (London). Discussion Paper dissaving in retirement. J o u r n a l o f P o l i t i c a l
No.54, mimeo. E c o n o m y 89:561-577.
Bora, A.K., R. Byron, and C.M. Jarque. 1981. Further Deaton, A.S. 1974a. The analysis of consumer demand in the
evidence on asymptotic tests for homogeneity in large United Kingdom. 1900 1970. E c o n o m e t r i c a 42: 341
demand systems. E c o n o m i c s L e t t e r s 8:101 105. 367.
Bemanke, B.S. 1984. Permanent income, liquidity, and Deaton, A.S. 1974b. A reconsideration of the empirical
expenditure on automobiles: Evidence from panel data. implications of additive preferences. E c o n o m i c
Q u a r t e r l y J o u r n a l o f E c o n o m i c s 99: 587- J o u r n a l 84: 338 348.
614. Deaton, A.S. 1975. The structure of demand in Europe 1920
Bcrnhcim, B.D., A. Schleiffer, and L.H. Summers. 1985. 1970. In T h e F o n t a n a e c o n o m i c h i s t o r y o f
Bequests as a means of payment. J o u r n a l o f E u r o p e , ed. C.M. Cippola, vol. 5. London: Collins-
P o l i t i c a l E c o n o m y : 1045 1076. Fontana.
Blackorby, C., D. Primont, and R.R. Russell. ll>77. On testing Deaton, A.S. 1977. Involuntary saving through unanticipated
separability restrictions with flexible functional forms. inflation. A m e r i c a n E c o n o m i c R e v i e w 6 1 :
J o u r n a l o f E c o n o m e t r i c s 5: 195 209. 81)1) 910.
Blinder, A.S. 1975. Distribution effects and the aggregate Deaton, A.S. 1986a. Demand analysis. In H a n d b o o k o f
consumption function. J o u r n a l o f P o l i t i c a l e c o n o m e t r i c s , ed. Z. Griliches and M. Intriligator,
E c o n o m y 83: 447 475. vol. 3. Amsterdam: North-Holland.
Blinder, A.S., and A.S. Deaton. 1985. The time series Deaton, A.S. 1986b. Life-cycle models of consumption: Is the
consumption function revisited. B r o o k i n g s P a p e r s evidence consistent with the theory? N B I I R Working
o n E c o n o m i c . i c t i v i t y 2: 465 5 1 1 . Paper No. 1910, processed.
Breeden, D. ll>7‘>. An intertemporal asset pricing model with Deaton, A.S., and J. Muellbauer. 1980a. An almost ideal
stochastic consumption and investment opportunities. demand system. A m e r i c a n E c o n o m i c R e v i e w
J o u r n a l o f F i n a n c i a l E c o n o m i c s 7: 265-296. 1 0 : 312-326.
Bronars, S.G. 1987. The power of non-parametric tests of Deaton, A.S., and J. Muellbauer. 1980b. E c o n o m i c s a n d
preference maximization. E c o n o m e t r i c a 55: 693 698. c o n s u m e r b e h a v i o u r . New York: Cambridge
Browning, M.J., A.S. Deaton, and M.J. Irish. 1985. A University Press.
profitable approach to labor supply and commodity Deaton, A.S., and J. Muellbauer. 1986. On measuring child
demands over the life cycle. E c o n o m e t r i c a 53: 503- costs, with applications to poor countries. J o u r n a l o f
543. P o l i t i c a l E c o n o m y 94: 720-744.
Campbell, J.Y. 1687. Does saving anticipate declining labor Deaton, A.S., J. Ruiz-Castillo, and D. Thomas. 1985. The
income? An alternative test of the permanent income influence of household composition on household
hypothesis. E c o n o m e t r i c a 55: 1249-1273. expenditure patterns: Theory and Spanish evidence.
Campbell, J.Y., and N.G. Mankiw. 1986. Are output fluc- Woodrow Wilson School, Princeton University,
tuations transitory? National Bureau of Economic processed.
Research Working Paper 1916, processed. Dickey, D.A., and W.A. Fuller. 1981. Likelihood ratio statistics
for autoregressive time series with a unit root.
E c o n o m e t r i c a 49: 1057-1072.
Consumer Expenditure 2105
Diewert, W.E. 1971. An application of the Shephard duality Ghez, G., and G.S. Becker. 1975. The allocation of time and goods over
theorem: A generalized Leontief production function. the life-cycle. New York: Columbia University Press.
Journal of Political Economy 79: 481-507. Goldberger, A.S. 1967. Functional form and utility: A review
Diewert, W.E. 1973. Functional forms for profit and trans- of consumer demand theory. Social Systems Research
formation functions. Journal of Economic Theory 6: 284-316. Institute, University of Wisconsin, processed.
Diewert, W.E. 1974. Intertemporal consumer theory and the Gorman, W.M. 1959. Separable utility and aggregation.
demand for durables. Econometrica 42: 497 516. Econometrica 27: 469-481.
Diewert, W.E., and T.J. Wales. 1987. Flexible functional Gorman, W.M. 1976. Tricks with utility functions. In Essays in
forms and global curvature conditions. Econometrica 55: 43- economic analysis, ed. M. Artis and A.- R. Nobay. Cambridge:
68. Cambridge University Press.
Dixit, A.K. 1976. Optimization in economic theory. Oxford: Oxford Gorman, W.M. 1981. Some Engel curves. In Essays in the theory and
University Press. measurement of consumer behaviour in honour of Sir Richard Stone, ed. A.S.
Dolde, W., and J. Tobin. 1971. Monetary and fiscal effects on Deaton. Cambridge: Cambridge University Press.
consumption in consumer spending and monetary policy: Gorman, W.M. 1982. Facing an uncertain future. IMSS
The linkages. Boston: Federal Reserve Bank of Boston, Technical Report No.359, Stanford University, processed.
Conference Series no. 5. Grossman, S.J., and R.J. Shiller. 1981. The determinants of
Duncan, G.J., and D.H. Hill. 1985. An investigation of the the variability of stock market prices. American Economic
extent and consequences of measurement error in labor Review, Papers and Proceedings 71: 222-227.
economic survey data. Journal of Labor Economics: 508-532. Hall, R.E. 1978. Stochastic implications of the life cycle-
Durlauf, S.N., and P.C.B. Phillips. 1986. Trends versus permanent income hypothesis: Theory and evidence.
random walks in time-series analysis. Cowles Foundation Journal of Political Economy 86: 971-987.
Discussion Paper No.788. Yale University, New Haven, Hall, R.E., and F.S. Mishkin. 1982. The sensitivity of
processed. consumption to transitory income: Estimates from panel
Eichenbaum, M.S., L.P Hansen, and K. Singleton 1984. A data on households. Econometrica 50: 461-481.
time series analysis of representative agent models of Hansen, L.P, andKJ. Singleton. 1982. Generalized instru-
consumption and leisure choice under uncertainty. mental variables estimation of non-linear rational
Graduate School of Industrial Administration, Camegie- expectations models. Econometrica 50: 1269-1286.
Mellon University, Pittsburgh, mimeo. Hayashi, F. 1982. The permanent income hypothesis: Esti-
Engel, E. 1895. Die lebenkosten Belgischer Arbeiter- Familien mation and testing by instrumental variables. Journal of
frixher und jetzt International Statistical Institute Bulletin 9: 1-74. Political Economy 90: 895-916.
Epstein, L. 1975. A disaggregate analysis of consumer choice Hayashi, F. 1985a. Permanent income hypothesis and
under uncertainty. Econometrica 43: 877-892. consumption durability: Analysis based on Japanese
Evans, M.K. 1967. The importance of wealth in the con- panel data. Quarterly Journal of Economics: 183-206c.
sumption function. Journal of Political Economy 75: 335-351. Hayashi, F. 1985b. Tests for liquidity constraints: A critical
Evans, G.B.A., and N.E. Savin. 1982. Conflict among the survey. Osaka University and NBER, processed. Pre-
criteria revisited; the W, LR, and LM tests. Econometrica 50: sented at the Fifth World Congress of the Econometric
737-748. Society, Cambridge, MA, August 1985.
Fisher, I. 1930. The theory of interest. New York: The Macmillan Heckman, J.J. 1971. Three essays on the supply of labor and
Company. the demand for goods. Unpublished PhD thesis, Princeton
Flavin, M. 1981. The adjustment of consumption to changing University.
expectations about future income. Journal of Political Economy Hicks, J.R. 1939. Value and capital. Oxford: Oxford University
89: 974—1009. Press.
Flemming, J.S. 1973. The consumption function when capital Houthakker, H.S. 1961. An international comparison of
markets are imperfect: The permanent income hypothesis personal saving. Bulletin of the International Statistical Institute 38: 55-
reconsidered. Oxford Economic Papers 25: 160-172. 70.
Friedman, M. 1957. A theory of the consumption function. Princeton: Houthakker, H.S. 1965. On some determinants of saving in
Princeton University Press. developed and underdeveloped countries. In Problems in
Gallant, A.R. 1982. Unbiased determination of production Economic Development, ed. A.G. Robinson. London: Macmillan.
technologies. Journal of Econometrics 20: 285-323. Houthakker, H.S., and L.D. Taylor. 1970. Consumer demand in the
Geary, R.C. 1949-50. A note on ‘a constant utility index of the United States: Analysis and projections. 2nd ed. Cambridge, MA:
cost of living’. Review of Economic Studies 18,65-66. Harvard University Press.
2106 Consumer Expenditure
Howe, H., R.A. Poliak, andT.J. Wales. 1979. Theory and time Modigliani, F. 1986. Life cycle, individual thrift, and the
series estimation of the quadratic expenditure system. wealth of nations. American Economic Review 76: 297 313.
Econometrica 47: 1231-1247. Modigliani, F., and A. Ando. 1957. Tests of the life-cycle
Koskela, E., and M. Viren. 1982a. Saving and inflation: Some hypothesis of savings. Bulletin of the Oxford Institute of Economics and
international evidence. Economics Letters 9: 337 344. Statistics 19: 99-124.
Koskela, E., and M. Viren. 1982b. Inflation and savings: Modigliani. F.. and R. Brumberg. 1954. Utility analysis and
Testing Deaton’s hypothesis. Applied Economics 14: 579 590. the consumption function: An interpretation of cross-
Kotlikoff, L.J., and L.H. Summers. 1981. The role of section data. In Post-Keynesian economics. ed. K.K. Kurihara.
intergenerational transfers in aggregate capital accumu- New Brunswick: Rutgers University Press.
lation. Journal of Political Economy 89: 706-732. Modigliani. F.. and Brumberg. 1979. Utility analysis and
Kuznets, S. 1946. National income: A summary of findings. National aggregate consumption functions: An attempt at inte-
Bureau of Economic Research. New York: Arno Press. gration. In The collected papers of Franco Modigliani, ed. A. Abel.
Kuznets, S. 1962. Quantitative aspects of the economic growth vol. 2. Cambridge. MA: MIT Press.
of nations: VU: The share and structure of consumption. Muellbauer. J. 1977. Testing the Barten model of household
Economic Development and Cultural Change 10: 1 92. composition effects and the cost of children. Economic Journal
Kydland, F.E., and E.C. Prescott. 1982. Time to build and 87: 460M87.
aggregate fluctuations. Econometrica 50: 1345 1370. Muellbauer. J. 1980. The estimation of the Prais-Houthakker
Laitinen, Is.. 1978. Why is demand homogeneity so often model of equivalence scales. Econometrica 48: 153-176.
rejected? Economics Letters 1: 187-191. Muellbauer. J. 1985. Habits, rationality and the life-cycle consumption function.
Leff, N. 1969. Dependency rates and saving rates. Economic Nuffield College, Oxford, mimeo.
Journal 59: 886-896. Nicholson, J.F. 1976. Appraisal of different methods of
Leser, C.E.V. 1963. Forms of Engel functions. Econometrica 31: estimating equivalence scales and their results. Review of
694-703. Income and Wealth 22: 1 11.
Lucas, R.E. 1976. Econometric policy evaluation: A critique. Phillips, P.C.B., and P. Perron. 1986. Testing for a unit root in
In The Phillips curve and labor markets, Camegie-Rochester a time series regression. Cowles Foundation Discussion
Conference Series on Public Policy 1, ed. K. Brunner and Paper No.795. Yale University, New Haven, processed.
A. Meltzer. Amsterdam: North-1 lolland. Poliak. R.A.. and T.J. Wales. 1979. Welfare comparisons and
Lucas, R.E. 1981. Introduction. In Studies in business cycle theory, ed. equivalent scales. American Economic Review 69: 216-221.
R.E. Lucas. Cambridge, MA: MIT Press. Poliak. R.A.. and T.J. Wales. 1980. Comparisons of the
Lucas, R.E., and L. Rapping. 1969. Real wages, employment, quadratic expenditure system and translog demand sys-
and inflation. Journal of Political Economy 77: 721 754. tem with alternative specifications of demographic effects
MaCurdy, T.E. 1981. An empirical model oflabor supply in .Econometrica 48: 595-612.
a life-cycle setting. Journal of Political Economy 89: 1059 1085. Poliak, R.A., andT.J. Wales. 1981. Demographic variables in
Mankiw, N.G., and M. Shapiro. 1985. Trends, random walks, demand analysis. Econometrica 49: 1533-1551.
and tests of the permanent income hypothesis. Journal of Prais, S.J., and H.S. Houthakker. 1955. The analysis of family budgets.
Monetary Economics 16: 165-174. Cambridge: Cambridge University Press.
Mankiw, N.G., and M. Shapiro. 1986. Do we reject too Ramsey, F.P. 1928. A mathematical theory of saving. Eco-
often? Small sample properties of tests of rational nomicJournal 38: 543 559.
expectations models. Economics Letters 20: 139 145. Rothbarth, E. 1943. Note on a method of determining
Mankiw, N.G., J.J. Rotenrberg, and L.H. Summers. 1985. equivalent income for families of different composition.
Intertemporal substitution in macroeconomics. Quarterly Appendix 4 in War-time patterns of saving and spending, ed. C.
Journal of Economics 100: 225-251. Madge, Occasional Paper 4, National Institute of
Meisner, J.F. 1979. The sad fate of the asymptotic Slutsky Economic and Social Research, London.
symmetry test. Economics Letters 2: 231-233. Roy, R. 1943. De I’utilitee: contribution a la theorie des choix. Paris:
Mirer, T.W. 1979. The wealth-age relationship among the Herman.
aged. American Economic Review 69: 435-443. Runkle, D.E. 1983. Liquidity constraints and the permanent income hypothesis:
Modigliani, F. 1970. The life-cycle hypothesis of saving and Evidence from paneI data. Mi l", processed.
inter-country differences in the saving ratio. In Induction, Samuelson, PA. 1947 8. Some implications of linearity.
growth and trade: Essays in honour of Sir Roy Harrod, ed. W.A. Eltis et Review of Economic Studies 15: 88 90.
al. Oxford: Clarendon Press.
Consumer Expenditure (New Developments and the State of Research) 2107
The state of research on consumer expenditure up In its simplest incarnation the PILC model considers
to the mid-1980s is described in consumer expen- a finite horizon, no uncertainty and very simple
diture. Here, we provide an overview of recent preferences. In such a situation, it is simple to
developments on the intertemporal model of con- translate the basic intuition of the model, to which
sumer behaviour under uncertainty. We organize we referred above, into a closed form solution for
our discussion around what has been the workhorse consumption that depends not just on current
model for the analysis of dynamic consumption income but on the total amount of resources
behaviour - the life-cycle permanent income model. available to an individual and intertemporal prices.
Although our discussion of the intertemporal model The problem of this specification, of course, is its
is self-contained, it is not meant to be an exhaustive lack of realism. Not only do consumers in reality
survey of this large literature. We do not cover face much more complicated intertemporal envi-
demand analysis, despite the many exciting ronments, but it is likely that these complications
developments that have occurred in recent years. have a first-order effect on consumption choices.
The permanent income life-cycle (PILC) model, Therefore, the simplest version of the model is a
introduced during the 1950s by Modigliani and useful way to convey the main ideas behind PILC,
Brumberg ( ) and Friedman ( ), still plays but it needs to be complicated considerably to be of
an important role in the consumption literature. The use for policy analysis.
PILC model can be loosely defined as a framework The introduction of uncertainty in the model,
where individuals maximize utility over time given which makes it much more realistic, complicates the
a set of intertemporal trading opportunities. Con- problem enormously. The first formalizations of the
sumption at different points in time is treated as life-cycle model under uncertainty date back to the
different commodities, so that, given intertemporal 1970s (Bewley ). Typically, one assumes that
trading opportunities, consumption in a given consumers maximize expected lifecycle utility
period depends on total (life-cycle) resources and choosing consumption and, in more general
(intertemporal) prices. Optimal consumption settings, leisure and financial asset holdings.
choices are such that the ratio of (expected) mar- Consumers are assumed to know the stochastic
ginal utilities of consumption at different times nature of their environment. Even with many
equals the ratio of intertemporal prices. Therefore, simplifications on the nature of preferences, the
the relationship between consumption and total model does not yield closed form solutions for
resources is likely to depend on preferences (and in consumption, except in the most special cases.
particular on the elasticity of intertemporal sub- MaCurdy ( , see also ) uses dynamic
stitution and the rate at which the future is optimization techniques to derive necessary con-
discounted) and on interest rates (as they represent ditions for the optimal solution of the intertemporal
intertemporal prices). If we allow for uncertainty, as optimization problem faced by consumers. The
we discuss below, risk will also enter as a poten- attractiveness of this approach lies in the fact that it
tially important determinant of consumption. cuts through the necessity of solving the model
This model can generate implications and completely, which is a very hard task indeed, to
insights for many important questions not only in focus on some useful implications of the model. In
macroeconomics but also in public finance, and has particular, these contributions focus on the basic
therefore attracted much attention, both first order condition, the so-called Euler equation,
theoretically and empirically. Recent research has that equates the ratio of marginal utilities to
stressed the need to look at preferences on the one intertemporal prices.
hand and markets on the other, as the policy The first macro paper to take this approach is
implications are the result of both. Hall ( ): under strong assumptions on prefer
ences and returns, (non-durable) consumption is a
random walk, that is:
Consumer Expenditure (New Developments and the State of Research) 2109
An equation such as ( ) can be log-linearized to The great merit of the model with prudence is
obtain (see Hansen and Singleton ): that it highlights the need to save for rainy days
even if sunny days are equally likely. An increased
AlogC,+i = k + ^log(l + rf,,) + e(+1. (8) variance in the shocks to income reduces
consumption even if expected income does not
change. In the case of discrete variables, such as
Although consumption appears on the left- hand
unemployment or illness, changes in first and
side of Eq. ( ), this equation is not a consumption
second moments occur simultaneously, but this is
function, but an equilibrium condition. It cannot
not the case for continuous variables. The ability to
explain or predict consumption levels: consumption
distinguish between first and second moment
is crucially determined by the residual term £,+1 and
effects is of crucial importance in the analysis of
there is nothing that tells us what determines such a
public policy, because of its social insurance
term or how this term changes with news about
characteristics.
income, interest rates or any other relevant variable.
The solution of the Bewley model with more
The Euler equation for a single asset can iden-
general utility functions has to be computed
tify the elasticity of intertemporal substitution, a key
numerically or rely on approximations. Several
parameter for the evaluation of the welfare costs of
studies in the early 1990s took up the challenge of
interest taxation (Boskin ; Summers ) and for the
characterizing such solutions. Deaton ( )
analysis of real business cycles (King and Plosser ).
studied a model with power utility and infinite life.
The joint estimation of several Euler equations can
Deaton considered the existence of liquidity or no-
help identify the pure discount rate parameter
borrowing constraints, and showed that impatient
(governing patience), but also shed light on risk
consumers would hold limited assets to insure
aversion, given that different assets typically have
against low income draws. Carroll ( ) instead
different risk characteristics.
covered the case of finite lives,
The derivation of a closed form solution for
and showed that, if consumers are sufficiently
consumption when certainty equivalence does not
impatient and their labour income is subject to both
hold was first successfully tackled by Caballero (
permanent and temporary shocks, they set
, ). Caballero ( ) took the Flavin
consumption close to income. The model with
model with known finite life, and constant absolute
impatient consumers under labour income uncer-
risk aversion (CARA) preferences, and showed that,
tainty has been labelled ‘the buffer stock model’,
when the optimal consumption age profile is flat
because saving is kept to the lowest level compat-
with no uncertainty, it is increasing with income
ible with the need to buffer negative income shocks.
uncertainty. This change in the slope of the
Later work by Attanasio et al. ( )
consumption profile was described as precautionary
and Gourinchas and Parker ( ) clarifies the
saving, because early in life consumers save more if
role played by age-related changes in demographics
labour income is more uncertain. Later work by
and the hump-shape age profile of labour income in
Gollier ( ) and Carroll and
generating income tracking for relatively young
Kimball ( ) established that a similar result
consumers (micro data show that financial asset
holds whenever the third derivative of the utility
accumulation starts around age 40). Hubbard et al. (
function is positive, a feature of preferences labelled
, ) show instead
prudence. Both CARA and power utility exhibit
how precautionary motives interact with the insur-
prudence. The presence and size of precautionary
ance properties of Social Security in the United
savings is a matter of great relevance for public
States.
policy, in so far as public insurance schemes
Many of the papers cited in the preceding par-
covering such risks as unemployment, health and
agraph consider relatively simple versions of the
longevity should reduce the need for consumers to
life-cycle model. In particular, a single non-durable
accumulate assets.
commodify is assumed and preferences are
assumed to be additively separable
2112 Consumer Expenditure (New Developments and the State of Research)
with leisure and over time. While this greatly The life-cycle model with various sources of
simplifies the solution, the construction of a more uncertainty and generic preferences generates
realistic and complex model has become an decision rules and behaviour of great complexity.
important area of research. This development Consumption and saving choices depend in an
follows from the recognition that, for many pur- unknown fashion on eveiy single aspect of the
poses, and in particular for policy analysis, a model stochastic environment faced by the consumer, for
that delivers consumption as a function of instance on the entire distribution of fitUtre wages
exogenous variable is a very useful tool indeed. and earnings opportunities, on pension
This area of research has to deal with two arrangements, on the asset markets the consumer
important issues. First of all, the model can become can access, on mortality risks and so on and so
very quickly, from a numerical point of view, very forth. The Euler equation approach allows
difficult to solve. The large number of state researchers to deal in a rigorous fashion with
variables that characterize the solution of extremely rich models and yet derive relatively
reasonably realistic models and the consideration of simple implications to test some aspects of the
discrete choices and non-convexities linked to model and, with the help of additional assumptions,
transaction costs can push the numerical capabilities to identify some of the structural parameters that
of even very powerful computers. Second and even inform individual behaviour. We now understand
more importantly, if one wants to obtain solutions that Euler equations can be used to determine what
for consumption in a dynamic context, one has to type of preferences fits the available data and can
characterize completely the stochastic environment therefore provide one of the building blocks
in which the consumer lives. This contrast sharply (preferences) in the study of the questions above.
with the Euler equation approach that allowed the We also know that the presence of liquidity
researcher to be agnostic about most aspects of the constraints does not necessarily produce violations
environment and, under certain conditions, avoid of Euler equations because, even when liquidity'
solving difficult problems, such as labour supply, constraints are present, they might be rarely
housing and other durable choices and so on. The binding.
Euler equation would hold regardless of the The Euler equation is robust to a number of
presence of non-convexities and other type of market imperfections, but is silent about how con-
difficulties connected with these choices. These, sumption or its growth reacts to specific news about
instead have to be frilly specified if one wants to shocks, changes in interest rates, taxation and so on.
work with a model that delivers a solution for It is therefore useless for specific policy analysis. In
consumption. These two difficulties constitute other words, while the parameters of an Euler
limits for the research in this area that, in all equation can be estimated in a wide set of
likelihood will not be overcome in the near future. circumstances, and one can use the equation to test
the specification of the model, none of these results
will provide an answer to questions like what is the
effect of a change in taxation or interest rates on the
level of consumption and saving?
The Empirical Evidence on the PILC This important shortcoming of the Euler equa-
Model tion approach explains why such an approach,
which has informed and dominated the large
Since its introduction in the 1950s, there is no
empirical literature on the validity of the life- cycle
consensus about the empirical relevance of the
permanent income model is virtually absent in the
PILC model. While the model it is one of the main
public economics literature on, say, the effect of
tools in modern macroeconomics and public
pension reforms on saving or on the effect of
finance, its empirical performance is mixed. In this
changes in the taxation of interest on saving. And
section, we discuss two branches of the literature.
yet the conceptual framework that is
Consumer Expenditure (New Developments and the State of Research) 2113
behind the study of these issues is the same as that • What is the magnitude of the relevant preference
used to study consumption behaviour. parameters?
Policy analysis requires instead the availability
of a consumption function, that is, a relation that Tests of the Model
explains consumption as a function of those As mentioned above, a prediction of the model is
variables that the consumer can take as exogenous that changes in consumption cannot be predicted by
at any given moment. Only in the simplest versions expected changes in income or any other variable
of the life-cycle model is it possible to derive an known to the consumer at time t — 1. This is the
analytical expression for the consumption function. essence of the Hall ( ) test and of many
In general, given a set of assumptions on preferenceothers. Evidence that consumption can be predicted
parameters and market and non-market by lagged variables has been interpreted as
indicative of liquidity constraints, myopic
opportunities, one has to rely on numerical solutions
and/or approximations. behaviour, misspecification of preferences and so
A less ambitious but potentially profitable on. The relationship between consumption and
approach that does not require numerical methods income has received considerable attention. The
or incredibly rich data-sets is the estimation of first to observe that the life-cycle model predicts no
reduced form equations, whose specification is relation between the life-cycle profile of income
informed by the life-cycle model. These are par- and consumption was Thurow ( ).
ticularly useful in situations in which one analysesThurow argued that the fact that consumption
large (and possibly exogenous) changes to some of tracked income over the life-cycle was a rejection of
the likely determinants of consumption or saving. the main implications of the PILC model. To this
Such studies can address substantive issues and argument, essentially identical to many others
even test some aspects of the life-cycle model. proposed subsequently, Heckman ( ) replied
that non-separability between consumption and
Examples of studies of this kind include the reaction
of consumption (and saving) to changes in pension leisure could explain such a relationship.
entitlements (Attanasio and Brugiavini ; Attanasio Despite this early exchange, after Hall ( )
and Rohwedder ; Miniaci and Weber ), to swings in a
the value of important wealth components (such as large fraction of the literature based on consumption
housing, Attanasio and Weber ) and to changes in Euler equations focused on the relationship between
specific taxes (Parker ; Souleles ; Shapiro and predictable changes in income and expected
Slemrod ). consumption growth. Hall and Mishkin ( ),
Below we review the empirical evidence on the as well as Campbell and Mankiw ( ,
PILC model, organizing it in two subsections. First ) all report violations of this prediction, and
we start with the empirical evidence derived from label this finding ‘excess sensitivity’. Excess sen-
Euler equations. We then move on to evidence that sitivity can be explained by the presence of liquidity
constrained consumers, or of rule-of-thumb
considers the levels of consumptions, rather than its
changes. consumers, that is, consumers who let their expen-
diture track their income as a way to avoiding the
complexities of choosing the optimal consumption
Evidence from Euler Equations path. However, consistently with Heckman’s ( )
Two important empirical issues can be addressed argument, excess sensitivity can be recon
with the study of Euler equations: • ciled with the intertemporal optimization model if
more general, and sensible, utility functions are
used. In particular, if one assumes that leisiue
affects utility in a non-additive way, consumption
• What is the empirical relevance of the model? changes respond to predictable labour income
Is there a sensible specification of preferences changes, whether or not leisiue is a freely chosen
that fits the observed data? variable. Finally, and importantly, the aggregation
2114 Consumer Expenditure (New Developments and the State of Research)
issue proves to be important. Attanasio and Weber ( the reciprocal of the coefficient of relative risk
) show that results obtained with improperly aversion. Not many empirical papers have worked
aggregated micro data are consistent with results with preferences that allow for these two parameters
obtained with aggregate data and indicate rejections to be disjoint.
of the model that instead disappear with properly An influential paper by Hall ( ) claimed
aggregated data and rich enough preference that this parameter is close to zero. This finding has
structures. been challenged on various grounds. Attanasio and
To summarize the discussion so far, it seems Weber ( , ) point out that
that while simple tests of the life-cycle model seem aggregation bias could be responsible for such a low
to reject the implications from the model and in estimate: they estimated a much higher elasticity
particular those derived from Euler equations, it is (around 0.8) using UK and US cohort data (that is,
possible to find specification of preferences that do data from repeated cross-sections, consistently
a good job at fitting the available data, especially for aggregated over individuals bom in the same years).
households that are headed by prime-aged In the macro literature little attention has been
individuals. Aspects that are crucial for fitting the paid to the possibility that the EIS may differ across
data are the use of household level data, allowing consumers, particularly as a function of their
for changes in consumption needs induced by consumption. A simple way to capture the notion
changes in family composition and the use of that poor consumers may be less able to smooth
preferences specifications that allow for the consumption across periods and states of nature is
marginal utility of consumption to depend on labour to assume that the utility function does not depend
supply. on total (non-durable) consumption, but rather on
the difference between consumption and needs.
Estimation of Preference Parameters Thus we could retain the analytical attraction of
Recent research on consumption and saving has power utility, but have (C - C*) as its argument,
singled out three preference parameters for atten- where C* is an absolute minimum that the
tion: the elasticity of intertemporal substitution, the consumer must reach in each and every period. This
relative risk aversion parameter and the subjective functional form is known as Stone-Geary utility in
discount rate. The size of these parameters has demand analysis, and is the simplest way to
important implications in many applications of the introduce non-homotheticity in a demand system.
model, ranging from macroeconomics to public One could interpret ‘external habits’ (Abel ;
finance to financial economics. Campbell and Cochrane ) as a special way to
Perhaps surprisingly, not much evidence has parameterize C* (by making it a fraction of past
been accumulated on the discount factor from the consumption of other consumers). Attanasio and
estimation of Euler equations. This can be explained Browning ( ), Blundell et al. () and
by the fact that in log-linearized versions of the Atkenson and
Euler equation, the parameter is not identified, while Ogaki ( ) are among the few examples of
non-linear versions of the model are ridden by a papers that explicitly allow for wealth- dependent
number of econometric problems, particularly in EIS.
relatively small samples of the type used in Euler Demographics might also affect preferences,
equation estimation (see Attanasio and Low ). and might explain consumption changes and the
As for the distinction between the elasticity of shape of the consumption age profile, as argued by
intertemporal substitution (EIS) and the coefficient Attanasio et al. ( ) as well as Browning and
of risk aversion, it is absent in the most popular Ejmaes ( ).
specifications used in the literature: a model where
consumers maximize expected utility and Evidence from the Levels of Consumption
preferences are iso-elastic and additively separable As stressed above, the Euler equation imposes some
over time. In such a situation, the EIS is restrictions on the dynamics of consumption
Consumer Expenditure (New Developments and the State of Research) 2115
but, on its own, does not determine the level of individuals have access to, on their imperfections
consumption. If one neglects numerical compli- and on the nature of the equilibrium they give rise
cations, a solution for consumption can be obtained to. The implications of complete markets would be
by considering jointly the Euler equation and the very different from those one would derive if
sequence of budget constraints faced by the liquidity constraints or other markets imperfections
consumer as well as his or her initial wealth and a were prevalent.
terminal condition. As noted by Sargent ( ),
Flavin ( ) and later by Campbell
( ), the Euler equation and the intertemporal Insurance and Credit Markets
budget constraint imply a number of crossequation
restrictions for the joint time series processes of So far we have taken the assets the consumer can
consumption and income. When one is able to use to move resources over time as given and, in the
obtain a closed form solution for consumption, as is simplest versions of the model, we have made very
the case with quadratic utility, these restrictions can strong assumptions on this crucial aspect. For
be easily expressed in terms of a linear time series instance, we have assumed that consumers can
model, and tested. borrow and lend at a fixed interest rate. The reality
Some of these restrictions are also implied by is, obviously, much more complex and, from a
the Euler equation, while others are not. In partic- theoretical point of view, very many different
ular, the restrictions on the contemporaneous cor- environments have been studied. In particular, the
relation between income and consumption are not possibilities open to a consumer depend on the
implied: as we stressed above, the Euler equation is market arrangements available. Below we discuss
silent about how news about income is translated several of these market arrangements and briefly
into news about consumption. mention their implications for the determination of
Campbell and Deaton ( ) and West ( ) consumption.
proposed a test that links the innovation to perma-
nent income to consumption and presented evidence Perfect Insurance Markets
that aggregate consumption seems to be If markets are complete and consumers can trade a
‘excessively smooth’ in that it does not react full set of contingent claims without cost, individual
enough to news about income. Campbell and risk will be completely diversified. In such a
Deaton make a connection between excess sensi- situation, a number of results deliver very useful
tivity and excess smoothness. Within the certainty predictions. In particular, it can be shown that a
equivalent model, they jointly model the con- competitive equilibrium is symmetric and it is
sumption and income processes as a vector auto- therefore possible to characterize the properties of
regression, assuming that income has a unit root competitive equilibria by considering the problem
plus some persistence. In this context, consumption of a fictitious social planner, which, given a set of
changes reflect the permanent income innovation Pareto weights, maximizes social welfare. A strong
more than one-to-one: not only is the income shock implication of perfect markets is that the mar ginal
permanent, but it also predicts future, smaller utility of different consumers will move
shocks of the same sign. This implies that over the proportionally over time. The implication is very
business cycle consumption should be more volatile intuitive: the social planner faces a unique resource
than income. But in actual aggregate data constraint, and marginal utility of all individuals,
consumption is smoother than income: this is multiplied by the appropriate (and ar bitrary) Pareto
labelled ‘excess smoothness’, and is shown to be weight, will be equal to the multiplier associated to
exactly equivalent to excess sensitivity. this unique constraint. As a consequence, marginal
Clearly the implications of a given set of utility will move proportionally. If utility is
intertemporal preferences for policy relevant isoelastic, consumption moves proportionally.
questions depend crucially on the markets These implications, stressed by Townsend ( ),
have been tested in several
2116 Consumer Expenditure (New Developments and the State of Research)
papers (Cochrane ; Attanasio and Davis ; Hayashi mentioned that, in a model with finite lives and a
et al. ). non-zero probability that income would be zero in
each time period, standard regularity conditions on
Many Assets the utility function imply that a consumer will never
When there are many assets, one can derive an want to borrow. If income is bounded away from
Euler equation such as ( ) for each of the assets for zero, then the maximum the consumer will want to
which the consumer is not at a comer. The Euler borrow is the present discounted value of the
equations for consumption with different assets minimum value of income repeated in the future.
naturally ties up with asset pricing equations. This This type of constraint has been sometimes referred
approach to asset pricing was developed by to as a ‘natural’ liquidity constraint. Notice that
Breeden ( ) and Lucas ( ), and extended such a constraint does not imply a violation of the
to the case of non-additive separability of con- Euler equation. If the restriction to borrowing is
sumption and leisure in an incomplete markets tighter, the Euler equation will instead be
setting by Bodie et al. ( ). The model we occasionally violated. And, even in periods in which
sketched above is quite restrictive: the relative riskit is not violated, the level of consumption will be
aversion parameter is inversely related to the affected by the possibility that the constraint will be
elasticity of intertemporal substitution: Epstein and binding in the future. As Hayashi ( ) explains, the
Zin ( ) show how this restriction can be presence of an
relaxed in a more general model with power utility operative, albeit not binding, liquidity constraint is
where the timing of uncertainty resolution matters equivalent to a shortening of the planning horizon
(see also Epstein and Zin ; Attanasio and Weber ). or an increase in the discount rate. Evidence can be
Interestingly, an Euler equation for an asset obtained by noting that consumers who are liquidity
holds even if there are important imperfections in constrained will not be sensitive to changes in the
some other assets. As long as the consumer is level of the interest rate. As they will be at a kink of
exploiting a given margin to move resources over an intertemporal budget constraint, the demand for
time, an equation such as 7 will apply. If the loans will be inelastic to changes in the slope of
interest rate for a given asset changes with the levelsuch an intertemporal budget constraint: the interest
of the asset, then the Euler Eq. ( ) will have to be rate.
augmented with a term reflecting this effect
(Pissarides ). Endogenous Liquidity Constraint
In recent years, several studies have tried to model
Liquidity Constraints the shortcomings of credit and insurance markets by
The Euler equation will be violated when the allowing for specific imperfections and frictions
consumer is able, for some reason, to borrow explicitly. The two main causes of imperfections
against future income. In such a situation, Eq. ( ) that have been considered are: (a) private and
will hold as an inequality and the marginal utility of asymmetric information and (b) the inability to
current consumption will be higher than the present perfectly enforce contracts. Models of this type can
discounted value of future consumption. Consumers be seen as ways to endogenize specific market
who are liquidity constrained will be very sensitive structures (such as one where consumers have
to changes in current income. This case has access to a single asset in which they cannot
received a considerable amount of attention in the borrow). In an influential paper, for instance, Cole
literature. Many of the tests of violation of the Euler and Kocherlakota ( ) show that an econ
equation, such as Zeldes ( ), have focused on the omy where individuals have a single bond in which
so-called they can borrow can be derived as a constrained
‘excess sensitivity’ of consumption changes to equilibrium outcome where individuals have private
predictable changes in income. It should be information both on their income and on their
savings.
Consumer Expenditure (New Developments and the State of Research) 2117
Further Extensions and Alternative goods). For this reason micro data seem preferable.
Models The simplest way to introduce habits (or
durability) of consumption is to write the utility
While the evidence on the relevance of the life- function as follows:
cycle model is still inconclusive, a number of
empirical puzzles have directed attention to more - y'x,-i\z,)
complex preference structures. In particular, the
equity premium puzzle and the evolution of (9)
aggregate saving rates in high-growth economies where x is a vector of goods or services and z is any
(South East Asia) has led macroeconomists to other variable that affects marginal utility
incorporate habits into the model. However, there is (demographics, leisure, other goods that are not
still little formal evidence on the empirical explicitly modelled). The y parameters are positive
relevance of habits in micro data. The widely for goods that provide services across periods
documented retirement consumption puzzle (that is, (durability), negative for goods that are addictive
a sudden drop of consumption at retirement ) as (habit formation) or zero for goods that are fully
well as a number of more or less anecdotal pieces of non-durable, non-habit forming (Hayashi ).
evidence on the inadequacy of saving for retirement The Euler equations corresponding to ( )
and other forms of ‘irrational’ behaviour, have been involves x at four different periods of time, and
interpreted as potentially supportive of time- their estimation typically requires panel data. High-
inconsistent preferences. The most elegant way to quality consumption panel data are rare, and this has
introduce time-inconsistent preferences is provided limited the scope for empirical analysis. Meghir and
by the hyperbolic discounting assumption Weber ( ) have used Consumer
(Laibson ). Expenditure Survey (CEX ) quarterly data on food,
transport and services (and a more flexible
Habits
specification of intertemporal non-separabilities
Habits cause consumers to adjust slowly to shocks
than is implied by Eq. ), and found no evidence of
to permanent income, thus potentially explaining
either durability or habits once leisure, stock of
the excess smoothness of aggregate consumption,
durables and cars as well as other conditioning
but also increase the utility loss associated with
variables are taken into consideration.
consumption drops, and may therefore help explain
Similarly negative evidence on habits has been
the equity premium puzzle.
reported by Dynan ( ), using Panel Study of
Habits can take various forms: today’s marginal
Income Dynamics (PSID) annual food at home data.
utility may depend on the consumer’s own past
Carrasco et al. ( ) use Spanish panel data
consumption level (internal habits ) or the past
and find some evidence for habits.
consumption level of other consumers (external
The few studies that have used micro data on
habits). This latter model seems to work better on
non-durable consumption items to investigate the
aggregate data (Campbell and Cochrane ), even
issue find little or no evidence of habits, at least
though a recent survey by Chen and Ludvigson (
once preferences capture the presence of non-
) challenges this conclusion.
separabilities between goods and leisure.
Empirical macro-evidence on the presence of
habits is mixed, and this may be due to the very
Durable Goods
nature of aggregate consumption data, as stressed in
The presence of durable goods has received less
Dynan ( ). The serial correlation of aggre
attention in the micro-based literature than in the
gate consumption growth is affected by time
macro-literature, which has stressed the importance
aggregation (Heaton ), by aggregation over
consumers, and by data construction methods of their high volatility to explain business cycle
(particularly for the services from durable fluctuations (Mankiw ; Chah
et al. ).
2118 Consumer Expenditure (New Developments and the State of Research)
The simplest way to introduce durable goods role, if they can be used as collateral to obtain a
loan that pays for current consumption (Alessie et
into the analysis is to let the stock of durables affect
utility (on the assumption that services are al. ). A typical example could be the ability to
proportional to the stock), and to posit a relation remortgage a house, or to borrow 100 per cent of
between current stock, St, previous stock, St i, and the value of a newly purchased car.
current purchases q, (or maintenance and repairs) in Even if one is not interested in modelling dura-
physical terms like: ble goods, the existence of a stock of durables
should not be neglected when estimating preference
S; = (l-p)S,_,+<7; (10) parameters if utility is not additive in non-durable
goods and durable services. Significant effects of
where p is a constant depreciation rate. This leads durable goods (cars) on the Euler equation for non-
to the standard first-order condition for the durable durables have been found in UK data (Alessie et
good, according to which the relevant price is the al. ), and US data (Padula ).
user cost.
Typically, durable goods are costly to adjust, Quasi-Hyperbolic Discounting
because of transaction costs (resale markets are The widely documented consumption puzzle (that is
dominated by information problems, known as the the sudden drop of consumption at retirement, see
‘lemon’ problem, and search costs are non- Hamermesh ; Banks et al. ; and Bemheim et al. ), as
negligible). Sometimes these costs are modelled as well as a number of more or less anecdotal pieces of
a convex, differentiable function (Bemanke ), but evidence on the inadequacy of saving for retirement
the recent literature has stressed the need to take and other forms of ‘irrational’ behaviour, have been
into account their non-differentiable nature interpreted as potentially supportive of time-
(Grossman and Laroque ; Eberly ; Attanasio ; inconsistent preferences. The most elegant way to
Bertola et al. ). This generates infrequent introduce time-inconsistent preferences is provided
adjustment: consumers do not adjust continuously by the quasi-hyperbolic discounting assumption
in response to depreciation, or income and price (Laibson ). Consumers maximize the expected
shocks, but wait until the actual stock hits value of the following life-time utility index:
either a lower limit, s, or an upper limit, S,
and then adjust it to a target level. An T-t
interesting feature of this literature is that u(ct)+^Mcl+d (11)
aggregate behaviour reflects changes in both T= 1
Evidence from the consumer expenditure survey. Campbell, J.Y., and N.G. Mankiw. 1990. Permanent
Journal of Political Economy 103: 1121 1157. income, current income, and consumption. Journal of
Attanasio. O.P.. J. Banks. C. Meghir, and G. Weber. 1999. Business and Economic Statistics 8: 265 279.
Humps and bumps in life-time consumption. Journal Campbell, J.Y., and N.G. Mankiw. 1991. The response of
of Business and Economic Statistics 17: 22 35. consumption to income: A cross-country investigation.
Banks. J., R. Blundell, and S. Tanner. 1998. Is there a European Economic Review 35: 723 756.
retirement-savings puzzle? American Economic Carrasco, R., J.M. Labeaga. and J.D. Lopez-Salido. 2005.
Review 88: 769-788. Consumption and habits: Evidence from panel data.
Bemanke. B. 1985. Adjustment costs, durables and aggre- Economic Journal 115: 144-165.
gate consumption. Journal of Monetary Economics 15: Carroll. C.D. 1992. The buffer-stock theory of saving:
41-68. Some macroeconomic evidence. Brookings Papers on
Bemheim, B.D.. J. Skinner, and S. Weinberg. 2001. What Economic Activity 1992(2): 61-156.
accounts for the s anation in retirement wealth among Carroll. C.D. 1997. Buffer-stock saving and the life cycle/
U.S. households1? American Economic Review 91: 832- permanent income hypothesis. Quarterly Journal of
857. Economics 112: 1-55.
Bertola. G.. L. Guiso. and L. Pistaferri. 2005. Uncertainty Carroll, C.D.. and VI.S. Kimball. 1996. (On the concavity
and consumer durables adjustment. Review of Eco- of the consumption function. Econometrica 64: 981-
nomic Studies 72: 973-1007. 992.
Bewley, T.F. 1977. The permanent income hypothesis: A Chah. E.Y.. V.A. Ramey, and R.M. Starr. 1995. Liquidity
theoretical formulation. Journal of Economic Theory constraints and intertemporal consumer optimization:
16: 252 259. Theory and evidence from durable goods. Journal of
Blinder. A., and A. Deaton. 1985. The time series con- Money, Credit and Banking 27: 272 287.
sumption function revisited. Brookings Papers on Eco- Chen, X.. and S.C. Ludvigson. 2004. A land of addicts? An
nomic Activity 1985(2): 465-521. empirical investigation of habits-based asset pricing
Blundell. R., .VI. Browning, and G. Meghir. 1994. Con- models. New York: New York University Press.
sumer demand and the life-cycle allocation of house- Choi, J.J., D. Laibson, B.C. Madrian, and A. Metrick.
hold expenditures. Review of Economic Studies 61: 57- 2006. Saving for retirement on the path of least
80. resistence. In Behavioral public finance: Toward a New
Bodie, Z., R.C. Merton, and W.F. Samuelson. 1992. Labor Agenda, ed. Is. McCaffrey and J. Slemrod. New York:
supply flexibility and portfolio choice in a life-cycle Russell Sage Foundation.
model. Journal of Economic Dynamics and Control 16: Cochrane, .1.11. 1991. A simple test of consumption insur-
427 449. ance. Journal of Political Economy 99: 957 976.
Boskin, M.J. 1978. Taxation, saving and the rate of interest Cole. H.L.. andN.R. Kocherlakota. 2001. Efficient alloca-
Journal of Political Economy. S3 S28. tions with hidden income and hidden storage. Review
Breeden, D.T. 1979. An intertemporal asset pricing model with of Economic Studies 68: 523 542.
stochastic consumption and investment opportunities. Journal Deaton. A. 1991. Saving and liquidity constraints.
of Financial Economics 7: 265 296. Econometrica 59: 1221 1248.
Browning, M., and T.F. Crossley. 2000. Luxuries are easier to Dynan. K. 2000. I labit formation in consumer
postpone: A proof. Journal of Political Economy 108: 1022- preferences: Evidence from panel data. American
1026. Economic Review 90:391 406.
Browning, M., and M. Ejmaes. 2002. Consumption and children. Eberly, J.C. 1994. Adjustment of consumers' durables
Working Paper No. 2002 6, Centre for Applied stocks: Evidence from automobile purchases. Journal
Microeconometrics, University of Copenhagen. of Political Economy 102: 403-436.
Caballero, R.J. 1990. Consumption puzzles and precautionary Epstein, L.G., and S.E. Zin. 1989. Substitution, risk aver-
savings. Journal of Monetary Economics 25: 113-136. sion, and the temporal behavior of consumption and
Caballero, R.J. 1991. Earnings uncertainty and aggregate wealth asset returns: A theoretical framework. Econometrica
accumulation. American Economic Review 81: 859-871. 57: 937 969.
Campbell, J.Y. 1987. Does saving anticipate declining labor Epstein, L.G., and S.E. Zin. 1991. Substitution, risk aver-
income? An alternative test of the permanent income sion, and the temporal behavior of consumption and
hypothesis. Econometrica 55: 1249 1273. asset returns: An empirical analysis. Journal of Politi-
Campbell, J.Y., and J. Cochrane. 1999. Force of habit: A cal Economy 99: 263-286.
consumption-based explanation of aggregate stock market Flavin, M.A. 1981. The adjustment of consumption to
behavior. Journal of Political Economy 107: 205-251. changing expectations about future income. Journal of
Campbell, J.Y., and A. Deaton. 1989. Why is consumption so Political Economy 89: 974 1009.
smooth? Review of Economic Studies 56: 357 373. Friedman, M. 1957. A theory of the consumption, function.
Princeton: Princeton University Press.
Gollier, C. 1995. The comparative statics of changes in risk
revisited. Journal of Economic Theory 66: 522 536.
Gourinchas, P.-O., and J.A. Parker. 2002. Consumption
over the life cycle. Econometrica 70: 47-89.
Consumer Expenditure (New Developments and the State of Research) 2121
Grossman, S.J., and G. Laroque. 1990. Asset pricing and MaCurdy, T.E. 1981. An empirical model of labor supply
optimal portfolio choice in the presence of illiquid in a life-cycle setting. Journal of Political Economy 89:
durable consumption goods. Econometrica 58: 25-51. 1345-1370.
Gul, F., and W. Pesendorfer. 2001. Temptation and self- MaCurdy, T.E. 1999. An essay on the life cycle: Charac-
control. Econometrica 9: 1403-1435. terizing intertemporal behavior with uncertainty,
Gul, F., and W. Pesendorfer. 2004. Self-control and the taxes, human capital, durables, imperfect capital
theory of consumption. Econometrica 72: 119-158. markets, and nonseparable preferences. Research in
Hall, R.E. 1978. Stochastic implications of the life cycle- Economics 53: 5-46.
permanent income hypothesis: Theory and evidence. Mankiw, G.N. 1982. Hall’s consumption hypothesis and
Journal of Political Economy 86: 971-987. durable goods. Journal of Monetary Economics 10:
Hall, R.E. 1988. Intertemporal substitution in consump- 417^425?
tion. Journal of Political Economy 96: 339-357. Meghir, C., and G. Weber. 1996. Intertemporal non-
Hall, R.E., and F.S. Mishkin. 1982. The sensitivity of separability or borrowing restrictions? A disaggregate
consumption to transitory income: Estimates from analysis using a U.S. consumption panel.
panel data on households. Econometrica 50: 461-481. Econometrica 64: 1151-1181.
Hamermesh, D.S. 1984. Consumption during retirement: Miniaci, R., and G. Weber. 1999. The Italian recession of
The missing link in the life cycle. Review of Economics 1993: Aggregate implications of microeconomic evi-
and Statistics 66: 1-7. dence. Review of Economics and Statistics 81: 237-249.
Hansen, L.P., andK.J. Singleton. 1982. Generalized Modigliani, F., and R. Brumberg. 1954. Utility analysis
instrumental variables estimation of nonlinear and the consumption function: An interpretation of
rational expectations models. Econometrica 50: 1269- cross-section data. In Post Keynesian economics, ed. K.
1286. Kurihara. New Brunswick: Rutgers University Press.
Hansen, L.P., and K.J. Singleton. 1983. Stochastic con- Padula, M. 1999. Euler equations and durable goods.
sumption, risk aversion, and the temporal behavior of Working Paper No. 30, CSEF.
asset returns. Journal of Political Economy 91: 249- Parker, J.A. 1999. The reaction of household consumption
265. to predictable changes in social security taxes. Ameri-
Harris, C., and D. Laibson. 2001. Dynamic choices of can Economic Review 89: 959-973.
hyperbolic consumers. Econometrica 69: 935-957. Pissarides, C.A. 1978. Liquidity considerations in the the-
Hayashi, F. 1985. The permanent income hypothesis and ory of consumption. Quarterly Journal of Economics
consumption durability: Analysis based on Japanese 92: 279-296.
panel data. Quarterly Journal of Economics 100: 1083- Poterba, J. 1988. Are consumers forward looking? Evi-
1113. dence from fiscal experiments. American Economic
Hayashi, F. 1987. Tests for liquidity constraints: A critical Review 78: 413-418.
survey. In Advances in econometrics 11: Fifth world Sargent, T.J. 1978. Rational expectations, econometric
congress, ed. T. Bewley. Cambridge: Cambridge Uni- exogeneity, and consumption. Journal of Political
versity Press. Economy 86: 673-700.
Hayashi, F., J. Altonji, and L. Kotlikoff. 1996. Risk- Shapiro, M.D., and J. Slemrod. 2003. Consumer response
sharing between and within Families. Econometrica to tax rebates. American Economic Review 93: 381-396.
64: 261-294. Sinai, T., and N.S. Souleles. 2005. Owner-occupied hous-
Heaton, J. 1993. The interaction between time- ing as a hedge against rent risk. Quarterly Journal of
nonseparable preferences and time aggregation. Economics 120: 763-789.
Econometrica 61: 353-385. Souleles, N.S. 1999. The response of household consump-
Heckman, J.J. 1974. Life cycle consumption and labor tion to income tax refunds. American Economic
supply: An explanation of the relationship between Review 89: 947-958.
income and consumption over the life cycle. American Summers, L. 1981. Capital taxation and capital accumula-
Economic Review 64: 188-194. tion in a life-cycle growth model. American Economic
Hubbard, R.G., J. Skinner, and S.P. Zeldes. 1994. The Review 71: 533-544.
importance of precautionary motives in explaining Thurow, L.C. 1969. The optimum lifetime distribution of
individual and aggregate saving. Carnegie-Rochester consumption expenditures. American Economic
Conference Series on Public Policy 40: 59-125. Review 59: 324-330.
Hubbard, R.G., J. Skinner, and S.P. Zeldes. 1995. Precau- Townsend, R.M. 1994. Risk and insurance in village India.
tionary saving and social insurance. Journal of Politi- Econometrica 62: 539-591.
cal Economy 103: 360-399. West, K.D. 1988. The insensitivity of consumption to news
King, R.G., and C.I. Plosser. 1984. Money, credit, and about income. Journal of Monetary Economics 21: 17-
prices in a real business cycle. American Economic 33.
Review 14: 363—380. Zeldes, S.P. 1989. Consumption and liquidity constraints:
Laibson, D. 1997. Golden eggs and hyperbolic An empirical investigation. Journal of Political Econ-
discounting. Quarterly Journal of Economics 62: 443— omy 97: 305—346.
477.
Lucas, R.E. Jr. 1978. Asset prices in an exchange
2122 Consumer Surplus
JEL Classifications
If ACSk is positive (negative), the price change
Dll is judged to have increased (decreased) the welfare
of individual k. Is it ordinally equivalent to the
How does the market power exercised by firms change in utility? A necessary condition is that the
influence consumer welfare? What is the effect of demand function is generated by a rational
excise taxes on households with different levels of consumer who maximizes utility subject to a budget
income? Does governmental regulation increase the constraint. Unless consumers have optimized and
welfare of consumers? Topical issues such as these are at the boundaries of their budget sets, it is
indicate that the measurement of welfare is a impossible to assess the welfare effects of changes
fundamental element of public policy analysis. in prices and incomes. (That is, demands must be
Indeed, a full consideration of taxes, subsidies, ‘integrable’ and consistent with a well-behaved
transfer programmes, health care reform, regulation, utility function. Hurwicz and Uzawa ( ),
environmental policy, the social security system,
and educational reform must ultimately
Consumer Surplus 2123
in utility. That is, EVk is positive if and only if measure of the change in lifetime welfare. Define V{
V1 > V°. to be the maximum level of lifetime welfare when
The utility level at which ( ) is evaluated is the profile of prices and interest rates are {pj} and
important for multiple comparisons of price and {r)} and denote the corresponding time path of
income changes. The generalized equivalent var- utility as {V^ } . The reference prices and interest
iation will give an ordering of outcomes that is rates, {pj} and {/f }, yield a lifetime utility level of
identical to that based on utility levels. If ( ) is V°L and within-period utilities {V®,}. Keen’s exact
evaluated at V/0 = V'fp0, Y{)k, Ak), we obtain the measure of the change in lifetime welfare, evaluated
generalized compensating variation: at the reference prices, is exactly analogous to the
generalized equivalent variation:
CVt =E{p°,V°,Ak) -E(p\V°,Ak) + (Y\-Y°)
= E{p\V\Kk)-E(p\V°,Ak). AW t = Q t ({p?},{i®},V>)
(7)
-Q*({p?}.K}.v2).
Because the utility levels are ‘cardinalized’ using
different prices for each set of binary comparisons,
the ordering of multiple outcomes based on ( ) need The concepts of the equivalent and compensat-
not match the ordering based on utility levels. ing variation can also be extended to cases in which
Chipman and Moore ( ) have shown the choices made by consumers are discrete rather
that consistent rankings of outcomes require
than continuous. Dagsvik and Karlstrom ( )
restrictions on preferences that are the same as for
describe the compensating variation in
consumer surplus.
the context of a random utility model defined as:
While the simple static formulation of consumer
surplus is the most frequent application, the (pj> Ak^ + £jk(j — 1 , 2 , . . . , J ) ,
conceptual framework can be extended to analyse Ujk = V
the effects of changes in utility in more general
settings. For example, intertemporal welfare effects where Ujk is the utility of individual k in alternative
are often represented as the discounted sum of the j, V(.) is a deterministic indirect utility function, and
within-period equivalent or compensating Ejk are random variables. There are a total of J
variations. choices available to the consumer and, for
Keen ( ) has shown that this will differ simplicity, it is assumed that only prices vary across
from the lifetime equivalent variation to the extent alternatives.
that individuals are able to substitute Consider the welfare effect of a change in the set
intertemporally. As an alternative approach, he of prices and income facing individual k from
defines Vi to be the maximum level of lifetime (p?. p°. • • • - p“. Y k ) to ( P 1 - pi • • • - pj’ Y \ )
■ If the
utility of an individual who lives T periods when the
consumer chooses the alternative that maximizes
profiles of prices and interest rates are {pt} and \r,}
Ujk, the compensating variation is defined implicitly
respectively. If the (optimal) time path of utility
as that value CV k that satisfies the following
corresponding to VL at these prices and interest rates
equality:
is {Vk, (, the lifetime expenditure function can be
represented as:
max/ y(p?,Tk,Ak) +£jk
= * 5 2 g , £(p t ,V*„A kt ), = maxjV(p|, Y\ - CVk, A k ) + 8jk.
t
From Demand Functions to Welfare Closed form solutions to the partial differential
Measurement equation implied by Roy’s Identity can be obtained
for only a limited class of demand functions. An
While it was understood that the equivalent varia- alternative approach is to begin with an assumed
tion resolved the conceptual problem of welfare form of the indirect utility function and use Roy’s
measurement, it had little influence on applied Identity to obtain a system of demand equations.
welfare economics because compensated demand Since the form of the utility function is assumed
functions were presumed to be unobservable. Willig from the outset, it is unnecessary to solve a
( ) made the first attempt to bridge the complex system of partial differential equations.
gap between theory and application by showing Muellbauer ( ) provided an early example
that, for a single price change, consumer surplus of this approach. He assumed that demands were
can provide an approximation to the equivalent or consistent with a Stone-Geary utility function given
compensating variation. However, with multiple by:
price and income changes, consumer surplus is not
single-valued and is of no use in approximating
changes in economic welfare (McKenzie ). np.n) Jn ~ n f. pA) W
Shortly after the publication of Willig’s paper,
however, empirical procedures were developed to where 8 = (8i, 82, .. .,8„) and a = (ai, a2, .. ,,a„) are
estimate the equivalent or compensating variation. unknown parameters. The corresponding
Each method begins with the specification of a expenditure function is:
demand function and, under the assumption of
1 V U
integrability, is used to recover the utility or E ( p , V ) Y,P‘8‘ ( P?)-
expenditure functions. The complexity of this
procedure diminishes if demand functions are lin- The unknown parameters can be estimated by
ear, and consideration is restricted to changes in the fitting the linear expenditure system to household
price of a single good. budget data:
Hausman ( ) provided an analytic solution
to this problem for a demand function given by: ptXi = p^i
x
i = JpPi + JYY/C + YAAk, + a i( Y k - (j = 1,2,
GO)
where yp, yY, and yA are unknown parameters to be
estimated econometrically. Roy’s Identity provides Given estimates of a and S, the expenditure func-
a partial differential equation that can be solved to tion can be used to compute the equivalent or
obtain an expenditure function of the form: compensating variation as in ( ) and ( ).
While this is more general than Hausman’s
E(puV, Ak) =Ve™ approach, it has its own disadvantages. For an
- (1/Yy) [ i p P i + (J p / l r ) + YAAk] • (8) assumed form of the utility function, the functional
forms of the demands are the same for every good,
The expenditure function allows the equivalent which may hinder the ability of the model to fit the
variation to be computed exactly as in ( ) and data. Is it possible to start with an arbitrary demand
Willig-type approximations are unnecessary. system (rather than a utility function) and measure
Hausman’s method has the same data requirements the welfare effects of multiple price changes? Two
as consumer surplus, and only linear regression elegant procedures were proposed that required
methods are needed to estimate the unknown more complicated calculations to recover the
parameters. expenditure function, but did not impose
restrictions on the form of the demand
2126 Consumer Surplus
functions other than the standard integrability 0 < t < 1. As prices and income change, the
conditions. movements of demands along an indifference curve
The first method is based on an approximation can be represented implicitly by the differential
to McKenzie’s ( ) indirect money metric util equation:
ity function defined as:
dYk(i) E*i(p(0,r*(0.Ak)^.
l * ( p , Y k , A k ;p°) = £(p°,V(p,r*,A k ),A k ). dt
McKenzie and Pearce ( ) showed that A/t can Integrating over t yields an expression that can, in
be approximated by a Taylor’s series expansion principle, be solved to obtain £(p(/), V°, A k ) which
about the initial equilibrium: is the centrepiece of the welfare calculations:
A^ = ^Ap(l/2) A
A
' A
£(p(r),y°,A k ) -£(p°,V°,A k )
P 7TTT7AP
t
dpdp'
dp d2p = } Xi
(p(0. E ( P (0> v°, A k ,), A k ) d t .
r 7Ap+ 1/2^4AT)AT
dY dpdY 1 ' dY1 ) 1=1
o
(13)
R
(11)
where R represents higher order terms in the series. Vartia described several algorithms that can be
The expression in ( ) can be represented as a used to solve this equation numerically over the
function of uncompensated demand functions when price path p(t) so that, when evaluated at / 1, we
p is evaluated at the reference prices (this follows obtain £(p1, V°, Ak). As long as the demands satisfy
from Roy’s Identity and from the fact that at these the integrability conditions, the solution to ( ) will
prices the marginal utility of income is equal to one be independent of the price path used in the
and all higher income derivatives are zero - see algorithm. This method is valid for multiple price
McKenzie and Pearce , for details): and expenditure changes and, because a closed-form
solution is unnecessary, facilitates flexibility in
Ap = —X'Ap estimating demand patterns.
Aggregation
Dupuit, J. 1844. On the measurement of the utility of public goods’ (Sraffa , p. 93). This stands in striking
works. Trans R.H. Barback. In R e a d i n g s i n contrast to the approach of classical political econ-
W e l f a r e E c o n o m i c s , ed. K.J. Arrow and T.
Scitovsky. Home- wood: Richard D. Irwin.
omy which views the system of consumption and
Harberger, A.C. 1971. Three basic postulates for applied production as a circular process. This perspective
welfare economics. J o u r n a l of Economic was first developed by Quesnay ( ) and elab-
L i t e r a t u r e 9: 785-797. orated by Marx ( ) in his
Hausman, J.A. 1981. Exact consumer’s surplus and dead-
weight loss. A m e r i c a n E c o n o m i c R e v i e w 71:
analysis of the econ
662-676. omy in general. Marx developed a distinction
Hicks, J.R. 1942. Consumer’s surplus and index numbers. between ‘production and productive consumption’
R e v i e w o f E c o n o m i c S t u d i e s 9 (2): 126-137. and ‘consumption and consumptive production’ and
Hurwicz, L., and H. Uzawa. 1971. On the integrability of
demand functions. In P r e f e r e n c e s , u t i l i t y a n d
related this to the concepts of exchange and
d e m a n d , ed. J. Chipman et al. New York: Harcourt, distribution. This distinction fell into diuse with the
Brace and Jovanovich. rise of neoclassical economics but has been
Keen, M. 1990. Welfare analysis and intertemporal substi- rehabilitated by Sraffa ( ) in his famous
tution. J o u r n a l o f P u b l i c E c o n o m i c s 42: 47-
critique of modem economics. The concept ‘pro-
66.
Kirman, A.P. 1992. Whom or what does the representative duction and productive consumption’ provides the
individual represent? J o u r n a l o f E c o n o m i c general conceptual framework within which his
P e r s p e c t i v e s 6 (2): 117-136. particular theory of commodities is elaborated (see
Marshall, A. 1890. P r i n c i p l e s o f e c o n o m i c s .
Sraffa , p. 3). Sraffa’s exposition not only advances
London: Macmillan.
McKenzie, L. 1957. Demand theory without a utility index. our understanding of the theory of commodities, it
R e v i e w o f E c o n o m i c S t u d i e s 24 (65): 185-189. also enables us to grasp the essence of Marx’s
McKenzie, G.W. 1979. Consumer’s surplus without apology: important distinction between consumption and
Comment. A m e r i c a n E c o n o m i c R e v i e w 69:
production. Marx expressed himself in rather
465-468.
McKenzie, G.W., and I. Pearce. 1976. Exact measures of obscure Hegelian terms and Sraffa’s simple
welfare and the cost of living. R e v i e w o f E c o n o m i c numerical examples clarify much of Marx’s
S t u d i e s 43: 465^168. argument.
Muellbauer, J. 1974. Prices and inequality: The
Following Sraffa, let us suppose that an
U.K. experience. E c o n o m i c J o u r n a l 84 (333): 32-
55. extremely simple society is producing just enough
Poliak, R.A. 1981. The social cost of living index. J o u r n a l wheat and iron to maintain itself. If 400 quarters of
o f P u b l i c E c o n o m i c s 15: 311-336. wheat (hereafter 400 W) were produced using 280
Sen, A.K. 1970. C o l l e c t i v e c h o i c e a n d s o c i a l
W and 12 tons of iron (121) and 1/2 of the annual
w e l f a r e . San Francisco: Holden Day.
Sonnenschein, H. 1972. Market excess demand functions. labour supply (1/2 L) as inputs, while 201 were
E c o n o m e t r i c a 40: 549-563. produced using 120 W, 81 and 1/2 L, then the
Taylor, A.E., and R. Mann. 1972. A d v a n c e d c a l c u l u s . methods of production and productive consumption
2nd ed. Lexington: Xerox College Publishing. can be tabulated as follows:
Vartia, Y.O. 1983. Efficient methods of measuring welfare
change and compensated income in terms of ordinary
1 /2L + 280W + 121 > 400W
1/2L+120W + 81 —» 201
Consumption and Production In order for the process to be repeated the wheat
industry must exchange 120 W for 121. This
C. A. Gregory restores the original distribution of products and
enables the process to be repeated.
A three-product model takes us from barter to
triangular trade: an n-product model to more com-
Neoclassical economic analysis is carried out within plex forms of exchange and distribution.
a conceptual framework that views the economic The general formulation of the concept ‘pro-
process as a ‘one way avenue’ leading from ‘factors duction and productive consumption’ implicit in
of production’ to ‘consumption Sraffa’s analysis is:
Consumption and Production 2129
JEL Classifications
Dll
Bibliography
Gregory, C.A. 1982. G i f t s a n d c o m m o d i t i e s . London: Consumption externalities occur when consumption
Academic. by some creates external costs or benefits for others.
Marx, K. 1859. A contribution to the critique of political Their recognition by economists dates at least as far
economy. London: Lawrence & Wishart 1971.
back as Adam Smith’s discussion of how local
Marx, K. 1867. C a p i t a l , vol. I. Moscow: Progress.
Polanyi, K. 1944. T h e g r e a t t r a n s f o r m a t i o n . New consumption standards influence the goods that
York: Rinehart. people consider essential (or ‘necessaries’, as Smith
Quesnay, F. 1759. The ‘Tableau Economique’. In T h e called them). In the following passage, for example,
e c o n o m i c s o f p h y s i o c r a c y , ed. R. L. Meek. he described the factors that influence the amount
London: George Allen, 1962.
Sraffa, P. 1960. P r o d u c t i o n o f c o m m o d i t i e s b y
someone must spend on clothing in order to be able
m e a n s o f c o m m o d i t i e s . Cambridge: Cambridge appear in public ‘without shame’:
University Press.
By necessaries I understand, not only the commod-
ities which are indispensably necessary for the sup-
port of life, but whatever the custom of the country
renders it indecent for creditable people, even of the
Consumption Externalities lowest order, to be without. A linen shirt, for exam-
ple, is, strictly speaking, not a necessary of life. The
Greeks and Romans lived, I suppose, very comfort-
Robert H. Frank ably, though they had no linen. But in the present
times, through the greater part of Europe, a credit-
able day-labourer would be ashamed to appear in
publick without a linen shirt, the want of which
would be supposed to denote that disgraceful degree
Abstract of poverty which, it is presumed, no body can well
Consumption externalities occur when con- fall into without extreme bad conduct. (Smith , pp.
869-70)
sumption by some creates costs or benefits for
others. According to Duesenberry’s ‘relative Consumption externalities received only limited
income hypothesis’, spending is influenced by attention in Smith’s Wealth of Nations and only
the individual’s own standard of living in the occasional mention by economists during the
recent past and the living standards of others in century that followed its publication. Karl Marx (
the present. This hypothesis tracks observed ), for example, noted that ‘A house may be
behaviour more closely than Friedman’s ‘per- large or small; as long as the neighboring houses are
manent income hypothesis’, which assumes that likewise small, it satisfies all social requirement for
context has no influence on spending. When a residence. But let there arise next to the little
context is more important for some goods house a palace, and the little house shrinks to a hut.’
(positional goods) than for others (non-positional It was not until Thorstein Veblen’s The Theory
goods), positional goods crowd out non- of the Leisure Class appeared in 1899 that
positional goods, causing welfare losses like
those that occur when bombs crowd out
consumption in military arms races.
Consumption Externalities 2131
consumption externalities received their first seri- frequently for the poor - and hence the absence of
ous, book-length treatment in economics. Veblen’s any tendency for savings rates to rise with income
thesis was that much of consumption is undertaken in the long ran.
to signal social position. But although his book is To explain why consumption is more stable than
still widely read and cited by scholars in numerous income in the short ran, Duesenberry argued that
disciplines, its general theme was largely ignored families compare their living standards not only to
by economists during the 50 years following its those of others around them but also to their own
publication. standards from the past. The high consumption
level once enjoyed by a formerly prosperous family
thus constitutes a frame of reference that makes
Duesenberry's Relative Income cutbacks difficult when income falls.
Hypothesis Despite Duesenberry’s success in fracking the
data, many economists felt uncomfortable with his
Interest in this theme was rekindled with the pub- relative income hypothesis, which to them seemed
lication of James Duesenberry’s Income, Saving, more like sociology or psychology than economics.
and the Theory of Consumer Behavior in 1949. In The profession was therefore immediately receptive
this volume, Duesenberry offered his ‘relative to alternative theories that purported to explain the
income hypothesis’, in which he argued that an data without reference to softer disciplines. The
individual’s spending behavioiu is influenced by most important among these theories was Milton
two important frames of reference - the individual’s Friedman’s permanent income hypothesis, variants
own standard of living in the recent past and the of which still dominate today’s research on
living standards of others in the present. Thus, in spending.
Duesenberry’s account, people are subject to both In hindsight, however, there remain grounds for
intrapersonal and interpersonal consumption scepticism about whether Friedman’s theory was a
externalities. real step forward. For example, its fundamental
His theory attempted to explain three important premise - that savings rates are independent of
empirical regularities: (a) long- ran aggregate sav- permanent income - has been refuted by numerous
ings rates remain roughly constant over time, even careful studies (see, for example, Car- roll ). Some
in the face of substantial income growth; (b) modem consumption theorists have responded by
aggregate consumption is much more stable than positing a bequest motive for rich consumers, a
aggregate income in the short ran; and (c) indi- move that begs the question of why leaving
vidual savings rates rise substantially with income bequests should entail greater satisfaction for the
in cross-section data. When Duesenberry’s book rich than for the poor.
was first published, individual consumption was Another problem is that, contrary to Friedman’s
generally modelled by economists as a linear assertion that the marginal propensity to consume
function of income with a positive intercept term. out of windfall income should be nearly zero,
This model could accommodate rising savings rates people actually consume such income at almost the
in cross-section data and the stability of same rate as permanent income (Bodkin ). To this
consumption over the business cycle, but not the observation, Friedman ( ) himself responded
long-run stability of aggregate savings rates. that consumers appear
Duesenberry’s hypothesis was hailed as an to have unexpectedly short planning horizons. But
advance because of its ability to track all three if so, then consumption does not really depend
stylized fact patterns. The poor save at lower rates, primarily on permanent income.
he argued, because they are more likely to Abundant evidence suggests that context influ-
encounter others with desirable goods that are ences evaluations of living standards (see, for
difficult to afford. Moreover, since this will be true example, Veenhoven ; Easterlin ; Luttmer ). In the
no matter how much national income grows, light of this evidence, it seems fair to say that
unfavorable comparisons will always occur more Duesenberry’s hypothesis
2132 Consumption Externalities
not only has been more successful than Friedman’s weeks a year of vacation time and others would
in tracking how people actually spend but also rests have six weeks; and world D, in which you would
on a more realistic model of human nature. And yet have two weeks of vacation, others one week. This
the relative income hypothesis is no longer even time most people pick C, choosing greater absolute
mentioned in most leading economics textbooks. Its vacation time at the expense of lower relative
absence appears to signal the profession’s con- vacation time.
tinuing reluctance to acknowledge concerns about The modal responses in these two thought
relative consumption. experiments suggest that housing is a positional
good and vacation time a non-positional good. The
point is not that absolute house size and relative
Welfare Implications vacation time are of no concern. Rather, it is that
positional concerns weigh more heavily in the first
In traditional economic models, individual utility domain than in the second.
depends only on absolute consumption. These When the strength of positional concerns differs
models lie at the heart of claims that pursuit of across domains, the resulting conflict between
individual selfinterest promotes aggregate welfare. individual and social welfare is structurally iden-
In contrast, models that include concerns about tical to the one inherent in a military arms race.
relative consumption identify a fundamental When deciding how to apportion available
conflict between individual and social welfare. This resources between domestic consumption and
conflict stems from the fact that concerns about military armaments, each country’s valuations are
relative consumption are stronger in some domains typically more context-dependent in the armaments
than in others. The disparity gives rise to domain than in the domain of domestic
expenditure arms races focused on ‘positional consumption. After all, being less well armed than a
goods’ - those for which relative position matters rival nation could spell the end of political
most. The result is to divert resources from ‘non- independence. The familiar result is a mutual
positional goods’, causing welfare losses. (The late escalation of expenditure on armaments that does
Fred Hirsch , coined these terms.) not enhance security for either nation. Because the
The nature of the misallocation can be made extra spending comes at the expense of domestic
clear with the help of two simple thought experi- consumption, its overall effect is to reduce welfare.
ments. In each, you must choose between two Note, however, that if each country’s valuations
worlds that are identical in every respect except were equally context- sensitive in the two domains,
one. The first choice is between world A, in which there would be no arms race, for in that case the
you will live in a 4,000-square-foot house and attraction of having more arms than one’s rival
others will live in 6,000-square-foot houses; and would be exactly offset by the penalties of having
world B, in which you will live in a 3,000-square- lower relative consumption.
foot house, others in 2,000-square-foot houses. For parallel reasons, the modal responses to the
Once you choose, your position on the local hous- two thought experiments suggest an equilibrium in
ing scale will persist. which people consume too much housing and too
If only absolute consumption mattered, A would little leisure (for a formal demonstration of this
be clearly better. Yet most people say they would result, see Frank ). In contrast, conventional welfare
pick B, where their absolute house size is smaller theorems, which assume that individual valuations
but their relative house size is larger. Even those depend only on absolute consumption, imply
who say they would pick A seem to recognize why optimal allocations of housing and leisure.
someone might be more satisfied with a 3,000- In addition to leisure, goods that have been
square-foot house in B than with a substantially classified as non-positional by various authors
larger house in A. include workplace safety, workplace democracy,
In the second thought experiment, your choice is savings and insurance. And since public goods
between world C, in which you would have four
Consumption Function 2133
are, by definition, available in equal quantities to all Frank, R. 1985a. The demand for unobservable and other
consumers, they, too, are inherently non-positional. nonpositional goods. American Economic
R e v i e w 75: 101-116.
The general claim is that unregulated market Frank, R. 1985b. C h o o s i n g t h e r i g h t p o n d . New
exchange will tend to emphasize the production of York: Oxford University Press.
positional goods at the expense of these and other Friedman, M. 1957. A t h e o r y o f t h e c o n s u m p t i o n
non-positional goods (Frank ). Among the policies f u n c t i o n . Princeton, NJ: Princeton University Press.
Friedman, M. 1963. Windfalls, the horizon, and related
suggested as remedies for this imbalance have been concepts in the permanent income hypothesis. In M e a -
income and consumption taxes, overtime laws, s u r e m e n t i n e c o n o m i c s , ed. C. Christ. Stanford:
hours laws for commercial establishments, legal Stanford University Press.
holidays, workplace safety and health regulation, Hemenway, D., and S. Solnick. 1998. Is more always better?
Journal of Economic Behavior and
non-waivable workers’ rights, and tax-financed O r g a n i z a t i o n 37: 373-383.
savings accounts. Hirsch, F. 1976. S o c i a l l i m i t s t o g r o w t h . Cambridge,
Consumption externalities also have implica- MA: Harvard University Press.
tions for the theory of revealed preference, which Luttmer, E. 2005. Neighbors as negatives: Relative earnings
and well-being. Quarterly Journal of
says that, if a well-informed individual chooses a
E c o n o m i c s 120: 936-1002.
risky job that pays $600 a week rather than a safer Marx, K. 1847. Relation of wage-labour to capital. In W a g e
one that pays only $500, he reveals that the safety L a b o u r a n d C a p i t a l . Marx/Engels Internet
increment is worth less than $100 to him. If safety Archive, 1999. Online. Available at
is a non-positional good, however, this inference
. Accessed 16 August 2005.
does not follow, for it ignores the fact that, if all
Smith, A. 1776. A n i n q u i r y i n t o t h e n a t u r e a n d
workers exchange safety for increased income, the c a u s e s o f t h e w e a l t h o f n a t i o n s , ed. R.
anticipated increase in relative consumption does Campbell and A. Skinner. Oxford: Oxford University
not occur. The value that workers assign to safety Press, 1976.
Veblen, T. 1899. T h e t h e o r y o f t h e l e i s u r e c l a s s .
may thus be revealed as much in the patterns of
New York: Modem Library.
safety regulation they favour as in the nature of the Veenhoven, R. 1993. Happiness in nations: Subjective
jobs they choose. appreciation of life in 56 nations. Rotterdam: Erasmus
University:
Michael R. Darby
The Keynesian Conception consumption) over time and viewed saving in terms
of a bequest motive. The life-cycle hypothesis
Keynes conceived of the consumption function as (LCH) stressed predictable variations in income
relating consumption to disposable income as these (and consumption) over the life cycle and viewed
are now conventionally measured in the national saving as resulting from the greater wealth and
income accounts. These concepts were basic to the numbers of younger savers in comparison to older
model of The General Theory and Keynes was dissavers.
doubtless pedagogically correct to posit a simple The original published references are to
relationship which could be refined by future Friedman ( ) for the PIH and Modigliani
research. and Brumberg ( ) for the LCH. Given the
The need for refinement became apparent delays in NBER publication of Friedman’s work
shortly. In longer time series, consumption seemed which was widely circulated in manuscript form,
to vary around a constant fraction of disposable the two hypotheses are generally regarded as
income. In contrast, consumption functions fitted to distinct, contemporaneous responses to the
depression-era or cross-section data seemed to described conflict between earlier studies and
indicate that this ratio (which Keynes called the Simon Kuznets’ data on the national income
average propensity to consume or APC) declined as accounts for the 20th century. From the perspective
disposable income rose. In other words, these of the monumental careers of the two principal
studies estimated that the derivative of consumption proponents, priority does not seem an issue that
with respect to disposable income (the marginal need be resolved here.
propensity to consume or MPC) was less than the The PIH relates (pure) consumption to the per-
APC. petuity stream that could be consumed forever. The
Alvin Hansen ( ) among others predicted agent is typically regarded as an infinitely lived
that a secular stagnation would result unless gov- individual. This represents the underlying notion of
ernment spending filled this growing gap between a family whose generations are linked by operative
output and consumption. When the gap failed to transfers from parent to child or vice- versa. Saving
appear, the time was ripe for more sophisticated arises to equate the ratio of marginal utility of
theories of the relationship between consumption present and future consumption to the marginal rate
and income. These theories were the earliest and of transformation implicit in market (real) interest
perhaps still most successful resorts to microeco- rates. In this way the PIH is said to emphasize the
nomic foundations for macroeconomics. bequest motive for saving.
In contrast, the strict LCH had individuals con-
suming their entire endowments over their lifetime.
Permanent Life-Cycles Saving was supposed to arise because young
workers were more numerous and wealthy (due to
In the early 1950s our two dominant models of
technological progress) than the older generation
consumption developed: the permanent-income and
who were dissaving to finance retirement. This
life-cycle hypotheses. While these models were
provides an avenue by which faster growth can
once viewed as competing, they can now be seen as
increase saving. Alternatively, as discussed below,
complementary with differences in emphasis which
factors such as social security which change the
serve to illuminate different significant problems.
extent of mismatch between lifetime consumption
Both models emphasized the distinction between
and income patterns are predicted to have profound
consumer expenditures measured by the national
effects on aggregate saving.
income accounts and pure consumption which was
These approaches - and their synthesis with
to be explained by optimal allocation of present and
inter-generationally linked utility functions - have
future resources over time. The permanent income
led to a rich literature quite apart from the con-
hypothesis (PIH) stressed stochastic variations in
sumption function, but those developments are
income (and
beyond the scope of this essay.
Consumption Function 2135
From the point of view of the consumption as Friedman’s 0.35 per annum. These higher values
function per se, the PIH and LCH imply that pure were explained by biases that arise as data deviate
consumption is a fraction (variable in principle but from pure consumption toward expenditures by
rarely in practice) of wealth or permanent income. consumers.
Here wealth is inclusive of human as well as non- Empirical work on consumption functions has
human capital and permanent income is a frequently floundered on the use of theories of pure
(conventionally constant) long-term ex ante real consumption to explain data which are in whole or
interest rate times this wealth. (Note that, contrary part consumer expenditures. Both the PIH and LCH
to Sargent ( ) and others this wealth is not the were theories of pure consumption. Modigliani and
discounted present value of expected future income Ando provided one link to consumer expenditures
to the extent, as in the PIH, that future income is in their MPC model by modelling household
expected to rise through planned saving.) The investment in durable goods analogously to firm’s
empirical estimation of wealth or permanent investment behaviour. Operating in the PIH
income became a central issue in the specification tradition, Darby ( , ) argued that aggregate
of the consumption function. transitory
Friedman proposed a computationally simple income represented a change in wealth, part of
estimator of permanent income as a geometrically which change would be invested in consumers’
weighted average of past income. Since on this durable goods. (Darby ( ) in particular argued
scheme, permanent income changes - besides that because transitory income is received in non-
normal growth - by a fraction, say [1, of the dif- human form, a disproportionate effect on durable-
ference between current income and permanent goods purchases may arise during the adjustment
income, Friedman related this scheme to the process, a result which explains the results of
adaptive-expectations approach recently introduced Hayashi .) Darby ( , -8)
by his student Phillip Cagan ( ). later combined pure consumption and durable
Modigliani and his associates proxied normal investment equations to obtain a unified consumer
labour income by current income and the product of expenditure function which avoided some of the
this variable and the unemployment rate and inherent difficulties in dividing consumer expen-
attempted to measure non-human wealth by ditures into durable and nondurable portions.
collecting estimates of the national balance sheet at The PIH and LCH thus evolved to explain
market values. In principle, this method seemed aggregate consumer expenditures by wealth as a
more clearly related to the underlying framework determinant of pure consumption and by changes in
than Friedman’s permanent-income proxy, but in wealth and other variables which determine
practice it suffered several comparative disadvan- household investment in durable goods. The cor-
tages: (1) major components of non-human wealth relation of the determinants of this household
had no market valuation; (2) the wealth estimates investment with short-run (transitory) fluctuations
were not part of the national income accounts and in income explain a MPC which is substantial in
competing variants were available with substantial magnitude even though substantially below the
delay and at irregular intervals; (3) for forecasting APC.
purposes, substantial additional equations were This brief development has omitted discussion
required to forecast (often poorly) future of alternative views of the consumption function.
movements in wealth. Perhaps the most notable of these is the view that
Darby ( ) reconciled these empirical mea the substantial value of the MPC reflects liquidity
sures of wealth by demonstrating that under the constraints which prevent a substantial share of
PIH, Friedman’s geometrically weighted measure consumers (measured by wealth and consumption)
could be derived as the constant real interest rate fj from following their optimal intertemporal
times a (backward-looking) perpetual inventory of consumption plan. The author of this essay regards
wealth. This [1 value was estimated as about these alternative views as providing qualification of
1. 10 per annum in contrast to higher values the dominant wealthbased view.
such
2136 Consumption Function
and do individuals effectively see through gov- Friedman, M. 1957. A theory of the consumption
ernment? This author’s reading of the evidence function, NBER general series. Vol. 63. Princeton:
Princeton University Press.
suggests answers of no and maybe, but it is hard to Hall, R.E. 1978. Stochastic implications of the life cycle -
put much certainty in any answer unless one starts permanent income hypotheses: Theory and evidence.
with dogmatic priors. Journal of Political Economy 86(6): 971-987.
Hansen, A.H. 1939. Economic progress and declining
The consumption function has faded as a topic
population growth. American Economic Review 29:
of intense research largely because of the success of 1-15.
previous work in achieving a workable consensus. Hayashi, F. 1982. The permanent income hypothesis: esti-
The unsettled issues, however, have crucial policy mation and testing by instrumental variables. Journal
of Political Economy 90(5): 895-916.
implications and there is much value yet to be
Keynes, J.M. 1936. The general theory of employment,
added. interest, and money. New York: Harcourt, Brace, and
Co.
Kochin, L.A. 1974. Are future taxes anticipated by con-
sumers? Journal of Money’, Credit, and
See Also Banking 6(3): 385-394.
Komiendi, R.C. 1983. Government debt, government
► Consult spending, and private sector behavior. American Eco-
► .eyries’ nomic Review 73(5): 994-1010.
Kotlikoff, L.J., and L.H. Summers. 1981. The role of
► life Cyc intergenerational transfers in aggregate capital accumu-
► .eal lation. Journal of Political Economy 89(4): 706-
► Relative 732.
► Wealth Modigliani, F., and R. Brumberg. 1954. Utility analysis and
the consumption function: An interpretation of cross-
section data. In Post Keynesian economics, ed. K.E.
Kurihara. New Brunswick: Rutgers University Press.
Bibliography Sargent, T.J. 1978. Rational expectations, econometric
exogeneity, and consumption. Journal of Political
Barro, R.J. 1974. Are government bonds net wealth? Jour- Economy 86(4): 673-670.
nal of Political Economy 82(6): 1095-1117. Seater, J.J. 1982. Are future taxes discounted? Journal of
Barro, R.J. 1978. Tile impact of social security on Money, Credit, and Banking 14(3): 376-389.
private saving: Evidence from the U.S. time White, B.B. 1978. Empirical tests of the life cycle hypothesis.
series. Washington, DC: American Enterprise Institute. American Economic Review 68(4): 647-660.
Cagan, R 1956. The monetary dynamics of hyperinflation. In
Studies in the quantity theory of money, ed. M.
Friedman. Chicago: University of Chicago Press.
Darby, M.R. 1972. The allocation of transitory income among
consumers’ assets. American Economic Review
62(5): 928-941.
Darby, M.R. 1974. The permanent income theory of con- Consumption Sets
sumption: A restatement. Quarterly Journal of Eco-
nomics 88(2): 228-250. Peter Newman
Darby, M.R. 1975. Postwar U.S. consumption, consumer
expenditures, and saving. American Economic
Review 65(2): 217-222.
Darby, M.R. 1977-8. The consumer expenditure function.
Explorations in Economic Research 4(5): 645-
JEL Classifications
674.
E2
Darby, M.R. 1979. The effects of social security on
income and the capital stock. Washington, DC:
American Enterprise Institute. The idea of consumption sets was introduced into
Feldstein, M. 1974. Social security, induced retirement, and general equilibrium theory in July 1954 in Arrow
aggregate capital accumulation. Journal of Political and Debreu ( , pp. 268-9) and Debreu ( ,
Economy 82(5): 905-926.
Flavin, M.A. 1981. The adjustment of consumption to
p.588), the name itself appearing only in the latter
changing expectations about future income. Journal of paper. Later expositions were given by Debreu (
Political Economy 89(5): 974-1009. ) and Arrow and Hahn ( ) and a more
2138 Consumption Sets
general discussion by Koopmans ( , Essay 1). consumption of which would result in her death by
Although there have been several articles concerned starvation. However, she might well prefer the
with nonconvex consumption sets (e.g. Yamazaki ), second bundle to a third, whose consumption would
in more recent years their role in general cause her to die from thirst (the representation of
equilibrium theory has been muted, especially in such preferences by a real-valued utility function
approaches that use global analysis (see for might pose problems, but that is another matter). On
example, Mas-Colell , p. 69). Such sets play no role the other hand, the same individual might not be
in partial equilibrium theories of consumer’s able to rank in order of preference two bundles each
demand, even in such modem treatments as Deaton of which contains exotic food and drink, even
and Muellbauer ( ). Since though fully assured that the consumption of either
general equilibrium theory prides itself on precision bundle would allow her to survive.
and rigour (e.g. Debreu , p. x), it is odd that on close More importantly, M3 implicitly introduces
examination the meaning of consumption sets consumption activities, the actual eating and
becomes unclear. Indeed, three quite different drinking and sheltering that are essential to survival.
meanings can be distinguished within the various Such activities constitute what are sometimes
definitions presented in the literature. These are called, by analogy with production, the
given below (in each case the containing set is the consumption technology. Some partial equilibrium
commodity space, usually Rn): Ml The consumption models, such as ‘the new home economics’ and the
set Cl is that subset on which the individual’s theory of characteristics, have treated aspects of
preferences are defined. M2 The consumption set such technologies but so far general equilibrium
C2 is that subset delimited by a natural bound on theory has not. In particular, Arrow-Debreu theory
the individual’s supply of labour services, i.e. 24 has not done so. As a consequence (and unlike some
horns a day. M3 The consumption set C3 is the forms of the classical ‘com model’) it does not give
subset of all those bundles, the consumption of any a coherent account of the birth and death of
one of which would permit the individual to individual persons, any more than it does of the
survive. Each definition in the literature can (but birth and death of individual firms (see general
here will not) be classified according to which of equilibrium). Hence the third meaning M3, which in
these meanings it includes. In probably the best effect presumes that the model contains such an
known of them (Debreu , eh. 4), the consumption account when it does not, is hard to interpret. One
set appears to be the intersection of all three subsets major difficulty of interpretation arises with the
C1-C3. Ml is plain. After all, preferences have to be Slater-like condition that each individual’s
defined on some proper subset of the commodity endowment of goods and services, valued at the
space, since the whole space includes bundles with competitive prices p , should be strictly greater than
some inadmissibly negative coordinates. M2 is also inf {(x, p ): x € C}, where < .,. > denotes inner
reasonable, although a full treatment of product and C is ‘the’ consumption set (see cost
heterogeneous labour services does raise problems minirnization and utility maximization). This
for what is meant by an Arrow-Debreu condition is important in proofs of existence of
‘commodity’ (see for example, that of Arrow- competitive equilibrium, to ensure for example that
Hahn , pp. 75-6). It is M3 that gives real difficulty, the budget correspondence is continuous, or that a
both in itself and in relation to the others. compensated equilibrium is a competitive
First, there is little reason to expect either Cl or equilibrium. It is itself guaranteed by assumptions
C3 to be a subset of the other, and so still less to (discussed by McKenzie , pp. 821-5) on the
expect Ml and M3 to define the same set. No relations between ‘individual’ consumption sets and
individual would have any problem in preferring the aggregate production set.
one bundle, the consumption of which would ensure If C is taken to contain C3 then the assumptions
her survival, to a second bundle, the just referred to imply that every consumer
Consumption Sets 2139
survives in every competitive equilibrium, not finite partitions of the time interval, not all of which
merely for one period but over the whole (finite) need to be ranked equally by the individual. In
Arrow-Debreu span. This is a breathtaking assertion effect, convexity of the consumption set comes
of fact which recalls irresistibly Hicks’s wry down to the divisibility of consumer goods, an
observation: ‘Pure economics has a remarkable way assumption which in the past has proved not such a
of producing rabbits out of a hat - apparently a bad approximation if one is interested mainly in
priori propositions which apparently refer to general equilibrium aspects of market demand, and
reality. It is fascinating to try to discover how the representative rather than actual consumers.
rabbits got in’( , p. 23). Indivisibilities of producer goods are of course
On the other hand if C is taken to be Cl, then the much more serious.
assumptions take on a purely technical (and so less
objectionable) aspect, whose role is essentially to
ensure that the system stays within the (relative)
interior of the sets concerned and so displays See Also
appropriate continuity. But then there is no
presumption that individual agents survive in a ► Arrow-Debreu Model of General Equilibrium
competitive equilibrium, even for one period (cf. ► Cost Minimization and Utility Maximization
Robinson , p. 3). The multi-period versions of the ► General Equilibrium
Arrow-Debreu model are then at risk, since ► Divisibilities
individuals disappear and take their labour service
endowments with them. This should not come as a Bibliography
surprise - the problems of time in economics are
really too complicated to be overcome simply by Arrow, K.J., and G. Debreu. 1954. Existence of an equi-
adding more dimensions to the one-period model. librium for a competitive economy. Econometrica 22:
265-290.
Some models that include C3 in C attempt to Arrow, K.J., and F.H. Hahn. 1971. General competitive
justify Slater-like conditions directly, on the analysis. San Francisco: Holden-Day.
grounds that ‘ Not many economies in the present Deaton, A., and J. Muellbauer. 1980. Economics and con-
day are so extremely laissez faire as to permit sumer behaviour. Cambridge: Cambridge University
Press.
people to starve’ (Gale and Mas- Colell , p. 12). Debreu, G. 1954. Valuation equilibrium and Pareto optimum.
This justification clearly fails as long as the Proceedings of the National Academy of Sci-
behaviour of the public agency whose actions ences 40 (7): 588-592.
allegedly prevent such starvation is not modelled Debreu, G. 1959. Theory of value, Cowles commission
monograph no. 17. New York: Wiley.
explicitly, like that of the private agents. Gale, D., and A. Mas-Colell. 1975. An equilibrium existence
It is usually assumed that consumption sets are theorem for a general model without ordered preferences.
bounded below, closed and convex. The first two Journal of Mathematical Economics 2: 9-15.
assumptions are innocuous but the third poses Hicks, J.R. 1939. Value and capital. Oxford: Clarendon
Press.
issues of a conceptual kind, which spring from
Koopmans, T.C. 1957. Three essays on the state of eco-
difficulties in interpreting the idea of a convex nomic science. New York: McGraw-Hill.
combination x' = tx 1 + (1 - t)x2 of two bundles A 1 McKenzie, L.W. 1981. The classical theorem on existence of
and x2, where fc[0, 1], Consider the example, competitive equilibrium. Econometrica 49: 819-841.
Mas-Colell, A. 1985. The theory of general economic
sometimes used, in which A 1 is a house in London
equilibrium. A differentiable approach.
and x2 a house in Paris. We cannot take seriously Cambridge: Cambridge University Press.
the claim that A-' is a house in the Channel, so t Robinson, J.V. 1962. The basic theory of normal prices.
cannot refer to distance. An alternative claim that t Quarterly Journal of Economics 16 (1): 1-20.
Yamazaki, A. 1978. An equilibrium existence theorem without
refers to the proportion of the period that is spent in
convexity assumptions. Econometrica 46: 541-555.
London could arise from many different
2140 Consumption Taxation
tax-consumption tax hybrid. In the United States, experience with RSTs above ten per cent. One
for example, capital income that accrues in unresolved question with regard to proposals that
employer-provided pension plans and in a variety of call for significantly higher RSTs is whether the
taxpayer-directed retirement saving accounts, such difficulty of monitoring all points of purchase
as Individual Retirement Accounts (IRAs), is would lead to substantial problems of tax evasion.
excluded from income taxation. Some types of A value added tax (VAT) is a very common
capital income are taxed at rates below the top form of consumption tax. Virtually all developed
statutory tax rates on wage income. Realized capital nations with the exception of the United States levy
gains have often been taxed at preferential rates, some form of VAT, with rates ranging up to 25 per
and in some cases dividend income to households is cent in Denmark, Norway and Sweden. The VAT is
also subject to reduced rates of tax. There is collected from businesses on the difference between
substantial variation in tax structures across nations, the gross value of their sales and the cost of any
but the principle of allowing some tax reduction on inputs that they purchase from other entities that
capital income is widespread. This makes it difficult have already paid VAT.
to assess where any particular nation’s tax system To illustrate the operation of VAT, consider a
falls on the spectrum between an income tax and a bakery that produces and sells bread for $ 100. The
consumption tax. baker’s input costs are $30 for flour and $65 for an
employee. The bakery earns a $5 profit. If flour is
purchased from another firm that has already paid
Types of Consumption Taxes VAT, then the bakery’s VAT liability equals $70
times the VAT tax rate, since its value added equals
In practice, there are many ways to implement a
its sales of $100 minus input purchases that have
consumption tax. Two, the retail sales tax and the
already paid VAT, or $30. Wages are not deducted
value added tax, are widely used in practice. Both
from sales when computing value added. Although
are examples of indirect consumption taxes,
the VAT is collected in stages from all firms in a
because they are levied without any reference to the
production chain, it is equivalent to an RST at the
consumer’s identity. Direct consumption taxes, in
same rate. One attractive feature of the VAT is that
contrast, are levied on households by computing
downstream firms, such as the baker in this
their total consumption. In contrast to indirect
example, help ensure VAT compliance by upstream
consumption taxes, direct consumption taxes can be
firms that supply intermediate goods. In this
levied at progressive rates. While direct
example if the flour seller cannot provide
consumption taxes have never been used as the
documentation for its VAT payment, the baker will
primary revenue source in any nation, they have
face tax on value added of $100. Thus the baker has
been actively debated in the policy reform literature.
an incentive, all else equal, to purchase inputs from
Tax structures that closely resemble direct
suppliers who pay VAT.
consumption taxes have been adopted as
Ebrill et al. ( ) offer a comprehensive dis
components of existing tax systems. The two most
cussion of VAT implementation issues and sum-
widely discussed direct consumption tax options are
marize experience with the VAT in both developed
the savings-exempt income tax and the ‘X-tax,’ a
and developing nations. The VAT accounts for a
combination of a cash-flow tax on business income
substantial share of revenue in most industrialized
and a household wage tax.
nations. The treatment of international transactions
A retail sales tax (RST) is the simplest con-
has proven a source of difficulty in some nations,
sumption tax. It is collected by retailers at the point
since exporting firms are typically granted a rebate
of final sale, and it corresponds directly to the tax
for their VAT payments. Some tax evasion schemes
on consumption spending described in Eq. above. In
involve exporting goods to qualify for the rebate
2006,44 of the 50 US states levied some form of
and re-importing the same goods without paying
sales tax, with rates typically between four and
VAT on the import. The
seven per cent. There is little
Consumption Taxation 2143
taxation of financial services also proves challeng- existing wealth into ‘qualified accounts’ at the time
ing under the VAT. the SEIT is adopted. Such transfers could sharply
A savings-exempt income tax (SEIT) is a con- reduce tax collections, but, since they involve
sumption tax that is built on an income tax model. previously accumulated assets, they would not
For those who are familiar with an income tax translate into marginal incentives for new saving. If
system, it provides a way of shifting to a con- it were possible to inventory the assets of each
sumption tax without drastic administrative changes taxpayer when the SEIT was implemented, this
in the tax system. The Nunn-Domenici ‘USA Tax’, would make it possible to design regulations to limit
introduced in the US Senate in the mid-1990s and the transfer problem. Absent such information on
analysed in Ginsburg ( ), was previously accumulated wealth, however, transfers
a based on this type of consumption tax. of pre-existing wealth into qualified accounts are
Under the SEIT, the tax base is income less likely to prove a difficult implementation issue for
saving. To prevent taxpayers from simply claiming the savings-exempt income tax.
high levels of saving and thereby avoiding tax An X-tax combines a cash flow tax on busi-
liability, saving must be documented in the form of nesses, much like a VAT with a deduction for
a contribution to a ‘qualified account’. Income wages, with a household-level tax on wage income.
earned on assets held in the qualified account is not The X-tax and its relatives are descended from
taxed, but withdrawals from the qualified account proposals in the US Treasury Department’s ( )
are included in the tax base. Thus a taxpayer who report on fundamental tax reform. Bradford ( )
earns $50,000 and contributes $5,000 would be discusses several plans of this type,
taxed on $45,000 in the contribution period. If, and one widely discussed variant was developed by
some years later, when earnings equal $25,000, the Hall and Rabushka ( ). The X-tax has
taxpayer withdraws $10,000 from the qualified greater flexibility than a VAT for achieving distri-
account, she would be taxed on $35,000. butional goals, since the household level tax can
Even though the SEIT taxes the earnings that include progressive rates or transfers to low-
have accrued on the contributions to the qualified earning households. This illustrates the distribu-
account when the fluids are withdrawn from this tional flexibility of direct rather than indirect con-
account, the return on capital is untaxed in this sumption taxes. If the household tax is a flat rate tax
setting. Taxing accumulated capital income when on wages at the same rate as the corporate cash flow
the proceeds are withdrawn is not equivalent to tax, then the X-tax is equivalent to a VAT or an
taxing capital income as it accrues: this is the reason RST. When the rates are different, then the X-tax
Individual Retirement Accounts, 401(A) plans and becomes a combination of a VAT and an additional
other tax-deferred saving programmes provide an tax or subsidy on labour income. The cash flow
incentive for personal saving. When capital income nature of the business tax eliminates the need to
is taxed as it accrues, the value of earning one measure depreciation, since firms can claim an
dollar, paying tax on it at rate T, and then investing immediate deduction - expensing - for purchases of
it for T periods at a pretax rate of return r but with capital goods.
an accrual tax rate T, is (1 — T) (1 + (1 - T)rf. In In practice, neither the RST nor the VAT is
contrast, if the initial earnings are excluded from implemented strictly along the principles described
taxation, there is no taxation of accruing capital above. Proposals for both the SEIT and the X-tax
income, and withdrawals are taxed at 100t per cent, also include additional features that often introduce
then the value after T periods is (1 - T)(1 + r)T. The efficiency costs that would not arise in ‘textbook’
qualified account approach eliminates the tax versions of these taxes. The RST, for example,
burden on the ‘inside build up’ of capital assets. typically exempts some goods and services.
One of the key challenges in implementing a SEIT Expenditures on food, medical care and clothing are
is avoiding the wholesale reallocation of often excluded from the tax base, thereby achieving
a more progressive distribution of tax burdens while
creating
2144 Consumption Taxation
distortions between various classes of consumption from replacing an income tax with a consumption
goods. The VAT is often implemented at different tax. Their results suggest that for a given revenue
rates on different goods, with exemptions for some requirement, the steady-state capital stock is larger
goods, creating the same distortionary effects. with a consumption tax than with an income tax.
Because both the savings- exempt income tax and This translates into higher steady-state per capita
the X-tax require households to file tax returns, they utility under the consumption tax than the income
are prone to modification to allow deductions for tax.
some expenditure categories, such as mortgage The steady-state comparison is not the only
interest or health insurance premiums. While consideration when evaluating two alternative tax
neither of these consumption tax plans has been systems, however. It is possible to design tax
tried in practice, they probably would be influenced reforms that raise steady-state welfare but cause
by the same political pressures that have generated a welfare losses in the transition from an initial
wide array of tax expenditures in the current income equilibrium to the new steady state. The tradeoff
tax code. between short-run and long-run policy effects
depends on the policymaker’s discount rate and in
calibrated general equilibrium models it is possible
Efficiency Gains from Replacing an to compute the present discounted value of the gains
Income Tax with a Consumption Tax and losses to the cohorts alive at different dates.
Sharpe ratio (that is, the equity premium per unit of because these households are extremely unwilling
risk) can be decomposed as: to substitute consumption over time. As a result,
they desire a flat consumption profile and, therefore,
would like to transfer resources from the future to
\ max std(Ac) x corr(Ac,R ), (1)
e
e
std(R ) y y
’ today. But since this is not feasible in an
endowment economy, the equilibrium risk-free rate
where R is the excess return on stocks over bonds, needs to be very high to discourage this type of
e
a is the relative risk aversion (RRA) parameter, and consumption smoothing and make individuals
Ac denotes log consumption growth. The equity willing to consume their endowment every period.
premium is about 6 per cent per year in the US data The last term in ( ) captures the precautionary
with a standard deviation of 15 per cent, producing savings motive, which becomes active in the pres-
e e
a Sharpe ratio (E(R )/ std(R )) of 0.4. Mehra and ence of uncertainty. For very high levels of risk
Prescott ( ) used aversion, this effect dominates the intertemporal
the construct of a representative agent who substitution effect, and an increase in the RRA
consumes the aggregate endowment stream. coefficient reduces the risk-free rate. Epstein and
Constantinides ( ), Rubinstein ( ) and Zin ( ) developed a class of reclusive prefer
Wilson ( ) derived aggregation results that ences that disentangles the inverse of the elasticity
rely on either complete markets or the absence of of intertemporal substitution from the coefficient of
idiosyncratic income risk. By appealing to these risk aversion. As discussed below, these preferences
aggregation results, Mehra and Prescott could allow one to make progress on the equity premium
substitute per-capita consumption growth into ( ). puzzle without running into the risk-free rate
This series has a standard deviation of less than 2 puzzle.
per cent in the post-war US data, and a low Against the backdrop of Mehra and Prescott’s
correlation with stock returns - less than 0.25 by benchmark model, subsequent papers that attempt to
most estimates. Substituting these values into the resolve these puzzles can be categorized according
expression above implies a lower bound for the to whether they modify (i) the preferences, (ii) the
relative risk aversion coefficient of 80, which is endowment process, or (iii) the market and asset
implausibly high judging by its implications for an structure. We discuss each of these approaches in
individual’s choices in other settings. In other turn.
words, we need extremely high risk aversion to
rationalize the observed equity premium, and that is
the puzzle. Furthermore, even if one is willing to
The Utility Function
accept such a high coefficient of risk aversion, this
Recursive Preferences
choice creates different puzzles itself - a point first
In the case of CRRA utility, the stochastic discount
noted by Weil ( ).
factor (SDF) has the following form: M, ,+i =
To understand Weil’s ‘risk-free rate puzzle’,
P(Ct+ilCt)~a, where C denotes the level of
first note that the Euler equation for the risk-free
consumption. A drawback of this specification is
asset choice can be linearized to obtain:
that it restricts the elasticity of intertemporal
substitution (EIS) to be the reciprocal of the RRA
a-
E[R^] m —In [i + aE(Ac) — —var(Ac). (2 ) parameter when in fact these two parameters
capture conceptually distinct aspects of individuals’
preferences. Building on work by Kreps and Porteus
Let us assume a positive time discount rate (ft < 1 ),
( ), Epstein and
and an average consumption growth rate of 1.5 per
Zin ( ) and Weil ( ) introduced ‘recur
cent per year. Let us also abstract from uncertainty
sive preferences’ (also called ‘non-expected
for the moment. Then a risk aversion of 40 would
utility’):
imply an implausibly high interest rate of nearly 60
per cent per year simply
Consumption-Based Asset Pricing Models (Empirical Performance) 2147
while the asset pricing implications of his model are the i.i.d. assumption by introducing a small pre-
similar to those in CC, the implications for dictable component in consumption growth that is
macroeconomic questions (such as policy analysis, statistically hard to detect. This long-run component
and so on) are quite different. increases the market price of consumption risk. In
addition, they add some time variation in the size of
Additional Arguments in the Utility Function the long-run risk component. Colacito and Croce (
The models discussed so far assume that investors ) show these long-run risk
only derive utility from non-durable consumption. models can reconcile the low volatility of exchange
In exchange economy models (in which the con- rate changes with the large market price of risk.
sumption process is exogenous) this is equivalent to Finally, Longstaff and Piazzesi ( ) argue that
assuming that non-durable consumption enters the corporate earnings are much
utility function in a separable manner. Some recent more risky than aggregate consumption growth, and
papers explicitly model the utility flow from that this can account for a large share of the equity
housing consumption (in a non-separable manner), premium puzzle.
and find that such an extension improves the asset
pricing performance (see Grossman and Laroque ;
Piazzesi et al. ; Flavin and Yamashita ). Similarly, a Production Economy Models
labour-leisure choice was introduced by Boldrin et
al. ( ) These asset pricing puzzles have also attracted a lot
and Danthine and Donaldson ( ), in a repre- attention from macroeconomists because the same
sentative agent framework, and by Uhlig ( ) basic framework used in Mehra and Prescott ( )
in an incomplete markets framework. However, also forms the backbone of the Kydland
these authors find that this extension negatively and Prescott ( ) model and the subsequent real
affects the performance of asset pricing models, business cycle literature. Therefore, understanding
because it allows households to smooth their mar- why individuals dislike risk in financial markets
ginal utility by adjusting on the labour-leisure could help shed light on individuals’ perceptions of
margin. As a result, one needs to introduce macro risk and consumption fluctuations, which are
additional - typically labour market - frictions to key issues for macroeconomic policy. However,
counteract this new smoothing opportunity'. macroeconomists are also interested in the
determination of quantities, such as output,
investment and consumption, making the exchange
Consumption Dynamics economy framework unsuitable for their purposes.
Therefore, macroeconomists replace the exogenous
In consumption-based asset pricing models, it is endowment stream with the endogenous
common to assume that aggregate consumption equilibrium consumption process generated by a
growth is i.i.d. overtime, because the evidence for standard neoclassical production economy that
consumption growth predictability in the data is faces technology shocks. One of the first findings of
weak. In the i.i.d. case, the conditional market price this approach, summarized in Rouwenhorst ( ), is
of risk, which can be approximated by the that resolving the equity
conditional standard deviation of the log SDF, premium puzzle in a production economy is far
at(logMt t-i) = a x <7,(Ac), is constant. Therefore, more challenging than in an exchange economy,
these models cannot generate any time variation in because this endogenous consumption process
risk premia on equity or any other asset. becomes too smooth if one increases risk aversion.
In the context of a standard representative agent As a result, one needs to resort to real frictions such
model, Kandel and Stambaugh ( ) gen as large adjustment costs in Jermann’s ( )
erate time-variation in risk premia by introducing model. Furthermore, and as noted above,
heteroskedasticity in aggregate consumption allowing for an endogenous labour supply choice,
growth. Bansal and Yaron ( ) deviate from as is common in macroeconomic analysis, gives
Consumption-Based Asset Pricing Models (Empirical Performance) 2149
Campbell, J.Y.. and J.H. Cochrane. 1999. By force of habit: A Krusell. P. and J.A. Smith. 1997. Income and wealth
consumption-based explanation of aggregate stock market heterogeneity, portfolio selection, and equilibrium asset
behavior. J o u r n a l o f P o l i t i c a l E c o n o m y 107: returns. M a c r o e c o n o m i c D y n a m i c s 1: 387 422.
205-251. Kydland, Finn E.. and Edward C. Prescott. 1982. Time to
Colacito. R.. and M. Croce. 2005. Risks for the long-run and build and aggregate fluctuations. E c o n o m e t r i c a 50:
the real exchange rate. Working paper, New York 1350-1370.
University. Levine, D.. and W. Zame. 2002. Does market incompleteness
Constantinides. G.M. 1982. Intertemporal asset pricing with matter1? E c o n o m e t r i c a 70: 1805 1839.
heterogeneous consumers and without demand Longstaff. F.. and M. Piazzesi. 2002. Corporate earnings and
aggregation. J o u r n a l o f B u s i n e s s 55: 253 267. the equity premium. Working paper, UCLA Anderson
Constantinides. G.M. 1990. Habit-formation: A resolution of School.
the equity premium puzzle. J o u r n a l o f P o l i t i c a l Lucas, D. 1994. Asset pricing with unidiversifiable income
E c o n o m y 98: 519-543. risk and short sales constraints: Deepening the equity
Constantinides. G.M., and D. Duffie. 1996. Asset pricing with premium puzzle. J o u r n a l of Monetary
heterogeneous consumers. J o u r n a l o f P o l i t i c a l E c o n o m i c s 34: 325 341.
E c o n o m y 104: 219-240. Mankiw, G.N. 1986. The equity premium and the concen-
Danthine, J.-P, and J.B. Donaldson. 2002. Labour relations tration of aggregate shocks. J o u r n a l o f F i n a n c i a l
and asset returns. Review of Economic Studies 69: 41-64. E c o n o m i c s 17: 2 11 219.
Dunn, K.., and K. Singleton. 1986. Modeling the term Marcel, A., and K.. Singleton. 1999. Equilibrium asset prices
structure of interest rates under nonseparable utility and and savings of heterogeneous agents in the presence of
durability of goods. Journal of Financial Economics 17: incomplete markets and portfolio constraints.
769-799/ M a c r o e c o n o m i c D y n a m i c s 3: 243-277.
Eichenbaum, M., and L.P. Hansen. 1990. Estimating models Mehra, R., and E. Prescott. 1985. The equity premium: A
with intertemporal substitution using aggregate time puzzle. J o u r n a l o f M o n e t a r y E c o n o m i c s 15:
series data. Journal of Business and Economic Statistics 8: 145 161.
53-69. Mcnzly, 1.., T. Santos, and P. Veronesi. 2004. Understanding
Epstein, L.G., and S. Zin. 1989. Substitution, risk aversion predictability. J o u r n a l o f P o l i t i c a l E c o n o m y
and the temporal behavior of consumption and asset 112: 1 47.
returns: A theoretical framework. Econometrica 57: 937 Piazzesi, M., M. Schneider, and S. Tuzel. 2007. Housing,
969. consumption, and asset pricing. J o u r n a l of
Flavin, M., and T. Yamashita. 2002. Owner-occupied housing F i n a n c i a l E c o n o m i c s 83: 531-569.
and the composition of the house-hold portfolio. Rietz. T.A. 1988. The equity risk premium: A solution 1?
A m e r i c a n E c o n o m i c R e v i e w 79: 345 362. J o u r n a l o f M o n e t a r y E c o n o m i c s 22: 117-
Grossman, S.. and G. Laroque. 1990. Asset pricing and 131.
optimal portfolio choice in the presence of illiquid durable Rouwenhorst, G. 1995. Asset pricing implications of equi-
consumption goods. E c o n o m e t r i c a 58:25 51. librium business cycle models. In F r o n t i e r s o f
Grossman, S.. and R. Shiller. 1982. Consumption b u s i n e s s c y c l e r e s e a r c h , ed. T.F. Cooley.
correlatedness and risk measurement in economies with Princeton: Princeton University Press.
non-traded assets and heterogeneous information. Rubinstein, M. 1974. An aggregation theorem for security
J o u r n a l o f F i n a n c i a l E c o n o m i c s 10: 195 210. markets. J o u r n a l o f F i n a n c i a l E c o n o m i c s 1:
Guvenen, F. 2005. A parsimonious macroeconomic model for 225-244.
asset pricing: Habit formation or cross-sectional Santos, .1., and P. Veronesi. 2005. Labor income and pre-
heterogeneity? Working paper, University of Texas at dictable stock returns. R e v i e w o f F i n a n c i a l
Austin. S t u d i e s 19: 1 43.
1 lansen. L.P.. and K. Singleton. 1983. Stochastic consump- Sundaresan, S. 1989. Intcrtemporally dependent preferences
tion. risk aversion, and the temporal behavior of asset and the volatility of consumption and wealth. R e v i e w
returns. J o u r n a l o f P o l i t i c a l E c o n o m y 91: 249 o f F i n a n c i a l S t u d i e s 2 : 73 89.
265. Telmer, C. 1993. Asset-pricing puzzles and incomplete
Heaton, .1.. and D. Lucas. 1996. Evaluating the effects of markets. J o u r n a l o f F i n a n c e 48: 1803 1832.
incomplete markets on risk sharing and asset pricing. Uhlig, II. 2006. Macroeconomics and asset prices: Some
J o u r n a l o f P o l i t i c a l E c o n o m y 104: 668 712. mutual implications. Working paper, Humboldt
Jermann, U. 1998. Asset pricing in production economies. University.
J o u r n a l o f M o n e t a r y E c o n o m i c s ' . 257 275. Verdelhan, A. 2004. Habit-based explanation of the exchange
Kandel. S.. and R.F. Stambaugh. 1990. Expectations and rate risk premium. Working paper, Boston University.
volatility of consumption and asset returns. R e v i e w o f Wachter, .1. 2002. 1 labit formation and returns on bonds and
F i n a n c i a l S t u d i e s 3: 207-232. stocks. Unpublished paper, Stern School of Business,
Kreps, D.. and E.L. Porteus. 1978. Temporal resolution of New- York University.
uncertainty and dynamic choice theory. Weil, P. 1989. The equity premium puzzle and the risk-free
E c o n o m e t r i c a 46: 185-200. rate puzzle. J o u r n a l o f M o n e t a r y E c o n o m i c s
24: 401 424.
Consumption-Based Asset Pricing Models (Theory) 2151
Wilson, R. 1968. The theory of syndicates. E c o n o m e t r i c a by {C'}^ () where T is possibly infinite. Suppose
36: 119-132.
Yogo, M. 2006. A consumption-based explanation of the cross- further that an asset j is available with a return R\ t+l
section of expected stock returns. J o u r n a l o f between periods / and t + 1 , and the investor is not
F i n a n c e 61: 539-580. facing a binding portfolio constraint with respect to
this asset Then a feasible strategy is to reduce con-
sumption at time l by a small amount e, invest it in
asset j, and consume the proceeds, C'f j + , in
the next period. Assuming a time-separable utility
Consumption-Based Asset Pricing function, with the one-period felicity function
Models (Theory) denoted by U and a time discount factor of ff this
strategy changes the investors’ expected lifetime
Fatih Guvenen and Hanno Lustig Uc (CrX,)e + E, \pUc ( C ' + 1 £^ + 1 )] ,
utility by
where Et is the mathematical conditional expectation
operator; X represents the arguments of the utility
function other than consumption; and XJC denotes
Abstract
the partial derivative with respect to consumption.
The essential element in modem asset pricing
The optimality of the original sequence implies that
theory is a positive random variable called ‘the
this strategy cannot be profitable for any amount e
stochastic discount factor’ (SDF). This object
and any asset available. Setting this gain to zero and
allows one to price any payoff stream. Its exis-
rearranging yields the Euler equation:
tence is implied by the absence of arbitrage
opportunities. Consumption-based asset pricing
models link the SDF to the marginal utility M/,t+1^,,+! where
growth of investors - and in turn to observable
Uc(Cl+1,X'+1) (1)
economic variables - and in doing so they Mm + 1 = p
provide empirical content to asset pricing theory. UciCrX,)
This article discusses this class of models.
This Euler equation was first derived by Rubin-
Keywords stein ( ) and Lucas ( ) in discrete time,
Consumption-based asset pricing models; Equity and by Breeden ( ) in continuous time. While
premium puzzle; Euler equations; Sharpe ratio; this class of models can in principle be used to study
Stochastic discount factor a broad variety of assets, this article will focus on
stocks and short-term bonds, which have received
the greatest attention in the consumption- based
JEL Classifications
asset pricing literature.
D4; DIO; G12 In the case of a one-period discount bond with
gross return pft , + 1 = 1 //f — a bond that costs P1,
Consumption-based asset pricing models study the dollars today and pays off 1 dollar tomorrow - the
pricing of payoff streams using the covariance of Euler equation can be rewritten as
these payoffs with the marginal utility growth of
investors.
The central component of a consumption-based Pf, =E,[MI,I+1\. (2 )
asset pricing model is die Euler equation, which
imposes restrictions on die covariance between asset Similarly, when the asset is a stock with ex-dividend
returns and the marginal utility growth of investors. price Pst and dividend payment Dt, the Euler
An easy and intuitive way to derive this equation is equation can be rearranged to read P\ = E, [/W,., I
by using a variational argument. Suppose that the (F) , + D,+i)]. By forward substitution this equation
optimal consumption path of investor i is given yields:
2152 Consumption-Based Asset Pricing Models (Theory)
impose additional structure on Mt t, |, which requires This expression shows that the P/D ratio moves
being more specific about (1 ) the functional form only because it predicts future returns on stocks or
and the arguments of the utility function; (2 ) the because it predicts future dividend growth. In the
stochastic properties of variables affecting marginal data, most of the volatility in P/D ratio is due to
utility (that is, consumption, leisure, and so on); and news about future expected returns (‘discount
(3) the market structure. The latter determines rates’), not due to future dividend growth (‘cash
whether an appropriate aggregation theorem holds flows’) (Campbell ; Cochrane ). There is a large
(which happens for example when markets are literature that documents the predictability of stock
complete), in which case C, can be replaced with returns over longer holding periods, starting with
aggregate consumption. Therefore, consumption- work by Campbell and Shiller ( ,
based models can be broadly categorized based on ), Poterba and Summers ( ) and Fama
the assumptions they make along these three and French ( , ). Other variables that
dimensions. These different models are discussed in predict returns include the spread between long and
consumption-based asset pricing models (empirical short bonds (Fama and French ) and the T-bill rate
performance). (Lamont ). More recently, more attention has been
Another feature of asset markets that has paid to macroeconomic variables that predict
received much attention in the literature concerns returns, most notably in the work by Lettau and
the high volatility of stock prices. For example, the Ludvigson ( ) who doc
standard deviation of the log price/dividend (P/D) ument that the consumption/wealth ratio is a pow-
ratio of stocks is about 40 per cent per annum in the erful predictor of stock returns.
US data. In a world with a constant SDF (as would So, the volatility of P/D ratio implies that excess
be the case with risk-neutral investors), it is returns on stocks are highly predictable. In other
impossible to rationalize this high volatility with words, expected excess returns change a lot over
the relatively low variability of the underlying time, even per unit of risk. We use the conditional
dividend stream (LeRoy and Porter ; Shiller ). Let version of the expression in ( ) to understand the
p, denote the log price, d, denote the log dividend, implications of this finding:
and r, denote the log stock return. Using a first-
order approximation, Campbell and Shiller ( )
show that the log e,(rsi+i -#)
P/D ratio can be decomposed as follows:
*(*;+!-*()
OO I) (6)
= —7
p, — d, = constant + El'^pi~1 [ A — rt+j\ --------------; rCorr (M,,,+URt-\-1
dl+j s
j=i t
with p = exp(prf)/l (l + exp(pd)) and pd denotes the where a, denotes the conditional standard deviation.
average log P/D ratio. The first term in the square Good models need to produce a lot of time variation
brackets is referred to as the cash flow component, in the right-hand side of ( ) and this happens mostly
and the second part is referred to as the discount through variation in the conditional market price of
rate component. This decomposition implies that risk (first term). This is an upper bound on the
the variance of the log P/D ratio can be stated as: conditional Sharpe ratio. (See also Lettau and
Ludvigson , on how to measure variation in the
conditional Sharpe ratio.) Another test of
var(p, - d,)cov ^p, - d„ ^ Ad,+;^ consumptionbased asset pricing models is whether
they are able to generate as much predictability as
found in the data. Examples of early models that
- cov ^p, - dh p'~1rl+^j . match the variation in the conditional market price
of risk include Kandel and Stambaugh ( ),
Campbell and
2154 Consumption-Based Asset Pricing Models (Theory)
Cochrane ( ) and Barberis et al. ( ). More Hansen, L.P., and R. Jagannathan. 1991. Implications of
recent work includes the work by Santos and security markets data for models of dynamic economies.
J o u r n a l o f P o l i t i c a l E c o n o m y 99: 252-262.
Veronesi ( ), Menzly et al. ( ), Piazzesi Hansen, L.R, and K. Singleton. 1983. Stochastic consumption,
et al. ( ), Guvenen ( ), Lustig and Van risk aversion, and the temporal behavior of asset returns.
Nieuwerburgh ( , ) and Bansal and J o u r n a l o f P o l i t i c a l E c o n o m y 91: 249-265.
Kandel, S., and R.F. Stambaugh. 1990. Expectations and
Yaron ( ). These models are discussed in
volatility of consumption and asset returns. R e v i e w o f
detail in consumption-based asset pricing models F i n a n c i a l S t u d i e s 3: 207-232.
(empirical performance). Lamont, O. 1998. Earnings and expected returns. J o u r n a l
o f F i n a n c e 53: 1563-1587.
LeRoy, S.F., and R.D. Porter. 1981. The present-value
relation: Tests based on implied variance bounds.
See Also E c o n o m e t r i c a 49: 555-574.
Lettau, M., and S.C. Ludvigson. 2001a. Consumption,
aggregate wealth and expected stock returns. J o u r n a l
o f F i n a n c e 56: 815-849.
Lettau, M., and S.C. Ludvigson. 2001b. Measuring and
modeling variation in the risk-return tradeoff. In H a n d -
b o o k o f F i n a n c i a l E c o n o m e t r i c s , ed. Y. Ait-
Sahalia and L.P Hansen. Amsterdam: North-Holland.
Lucas, R. 1978. Asset prices in an exchange economy.
Bibliography E c o n o m e t r i c a 46: 1429-1454.
Lustig, H. and S.V. Nieuwerburgh. 2006. C a n h o u s i n g
Bansal, R., and A. Yaron. 2004. Risks for the long-run: A collateral explain long-run swings in
potential resolution of asset pricing puzzles. J o u r n a l a s s e t r e t u r n s ? Working Paper NYU Stem and
o f F i n a n c e 59: 1481-1509. UCLA.
Barberis, N., M. Huang, and T. Santos. 2001. Prospect theory Lustig, H., and S. van Nieuwerburgh. 2005. Housing col-
and asset prices. Q u a r t e r l y Journal of lateral, consumption insurance and risk premia: An
E c o n o m i c s 116: 1-53. empirical perspective. J o u r n a l o f F i n a n c e 60:
Breeden, D.T. 1979. An intertemporal asset pricing model 1167-1219.
with stochastic consumption and investment opportu- Luttmer, E. 1996. Asset pricing in economies with frictions.
nities. J o u r n a l o f F i n a n c i a l E c o n o m i c s 7: E c o n o m e t r i c a 64: 1439-1467.
265-296. Mehra, R., and E. Prescott. 1985. The equity premium: A
Campbell, J.Y. 1991. A variance decomposition for stock puzzle. J o u r n a l o f M o n e t a r y E c o n o m i c s 15:
returns. E c o n o m i c J o u r n a l 101: 157-179. 145-161.
Campbell, J.Y., and J.H. Cochrane. 2000. Explaining the poor Menzly, L., T. Santos, and P Veronesi. 2004. Understanding
performance of consumption-based asset pricing models. predictability. J o u r n a l o f P o l i t i c a l E c o n o m y
J o u r n a l o f F i n a n c e 55: 2863-2878. 112: 1-47.
Campbell, J.Y., and R.J. Shiller. 1988. The dividend-price Piazzesi, M., M. Schneider, and S. Tuzel. 2007. Housing,
ratio and expectations of future dividends and discount consumption, and asset pricing. J o u r n a l of
factors. R e v i e w o f F i n a n c i a l S t u d i e s 1: 195- Financial
227. E c o n o m i c s 83: 531-569.
Campbell, J.Y., and R.J. Shiller. 1998. Stock prices, earnings Poterba, J.M., and L.H. Summers. 1986. The persistence of
and expected dividends. J o u r n a l o f F i n a n c e 43: volatility and stock market fluctuations. A m e r i c a n
661-676. E c o n o m i c R e v i e w 76: 1142-1151.
Cochrane, J.H. 1991. Explaining the variance of price- Rubinstein, M. 1976. The valuation of uncertain income
dividend ratios. R e v i e w o f F i n a n c i a l S t u d i e s 5: streams and the pricing of options. B e l l J o u r n a l o f
243-280. E c o n o m i c s 7: 407-425.
Fama, E.F., and K.R. French. 1988. Dividend yields and Santos, J., and P. Veronesi. 2005. Labor income and pre-
expected stock returns. J o u r n a l o f F i n a n c i a l dictable stock returns. R e v i e w o f F i n a n c i a l
E c o n o m i c s 22: 3-27. S t u d i e s 19: 1-43.
Fama, E.F., and K.R. French. 1989. Business conditions and Shiller, R.J. 1981. The use of volatility measures in assessing
expected returns on stocks and bonds. J o u r n a l o f market efficiency. J o u r n a l o f F i n a n c e 36: 291-
F i n a n c i a l E c o n o m i c s 25: 23^49. 304.
Guvenen, F. 2005. A parsimonious macroeconomic model for Shiller, R.J. 1982. Consumption, assest markets and mac-
asset pricing: Habit formation or cross-sectional roeconomic fluctuations. C a r n e g i e - R o c h e s t e r
heterogeneity? Working paper, University of Texas at S e r i e s o n P u b l i c P o l i c y 17: 203 238. North-
Contemporary Capitalism 2155
William Lazonick
JEL Classifications
PI
Abstract
The key to understanding ‘capitalism’ as a mode
of resource allocation that generates economic What Is 'Capitalism'?
growth is the organization and performance of
its most innovative business enterprises. The At the beginning of the 21st century, ‘capitalism’
‘Old Economy business model’ that made the has triumphed as the dominant system for allocat-
United States the world’s most powerful nation ing a society’s economic resources. The last time in
in the post-Second World War decades came history in which the persistence of capitalism in the
under challenge in the 1970s and 1980s, and the world’s most advanced economies was seriously
ideology of ‘maximizing shareholder value’ called into question was the Great Depression of the
arose to legitimize a redistribution of income 1930s - a decade during which the unemployment
from labour interests to financial interests. The rate in the United States remained at 15 per cent or
‘New Economy business model’ emerged in the higher, notwithstanding unprecedented state
1980s and 1990s to drive the innovation process, intervention under the New Deal. It took the Second
contributing, however, to unstable and World War to pull the United States and the world
inequitable economic growth. economy out of depression, and in the subsequent
decades it took substantial and sustained
government spending in the rich economies of
North America and western Europe to hold
Keywords
unemployment to acceptable levels.
Acquisitions; Business enterprises; Capitalism;
In the post-war era, the Soviet Union’s highly
Collective capitalism; Competitive advantage;
planned economy posed as a possible alternative to
Conglomerate movement; Contemporary
capitalism. The purported strength of the Soviet
capitalism; Corporations; Creative destruction;
challenge, however, turned out to be based at least
Defined-benefit pensions; Developmental state;
as much on Cold War ideology emanating from the
Dividend yield.; Division of labour; Economic
United States as on the actual productive power of
development; Entrepreneurship; Financial
the Soviet Union and its satellites. By the 1990s the
commitment; Foreign direct investment;
Soviet model had virtually vanished, as Russia
Globalization; Great Depression; Hostile take-
itself sought to make the transition to a ‘market
overs; Information and communications tech-
economy’, guided, tragically, by a mythical
nology; Innovation; Japan, economics in; Junk
ideology of how capitalism is supposed to operate,
bonds; Leveraged buyouts; Lifelong employ-
imported from the United States.
ment; Mergers; Milken, M.; NASDAQ; New
Over the same period capitalism entrenched
Deal; New Economy business model; New York
itself in East Asia. During the 1970s and 1980s
Stock Exchange; Old Economy business model;
Japan became a rich economy on the basis of a
Organization man; Organizational integration;
distinctive model of ‘collective capitalism’, and in
Outsourcing; Patents; Research and
the 1980s and 1990s the East Asian ‘Tigers’ - Hong
development; Schumpeter, J. A.; Separation of
Kong, Singapore, South Korea
ownership and control; Shareholder value;
Social inclusion; Stock options; Stock price
2156 Contemporary Capitalism
and Taiwan - closed the gap, each with its own (Lazonick ). Can a theory of capitalism as a market
variant of the Japanese model. More recently China economy comprehend the innovation process?
and India, with one-third of the world’s population, In the early 20th century a young Joseph
have experienced rapid economic growth, driven by Schumpeter asked this question. As a Viennese
what many would call ‘capitalist’ institutions. Yet, economics student, Schumpeter was versed in the
even as firms cross the globe to access Indian relatively recent, and increasingly influential,
software engineers, and vice versa, India remains a Austrian and Walrasian theories of how, through the
nation with one-third of the world’s illiterates. equilibrating mechanism of the market, the
Meanwhile the fact that China, the world’s second economy could achieve an ‘optimal’allocation of
largest economy since the early 1990s, continues to resources across productive uses. Schumpeter’s
be guided by an avowedly Communist government insight was to recognize that such a view of the
raises the question of what ‘capitalism’ really is. economic world could not explain economic
Defining contemporary capitalism is not merely development. In 1911 Schumpeter wrote The The-
a question of semantics. If, as has been ory of Economic Development (first translated into
demonstrated since the mid-20 th century, ‘capi- English in 1934) to argue that entrepreneurial
talism’ is a powerful engine of economic growth, activity that results in innovation - what he called
we want to know how it functions as a mode of the ‘Fundamental Phenomenon of Economic
resource allocation and the social conditions under Development’ - can disrupt the ‘Circular Flow of
which capitalist growth is not only strong but also Economic Life as Conditioned by Given Cir-
stable, and equitable. We also want to know how cumstances’ to change the ways in which the
the institutions of contemporary capitalism that economy operates and performs. Without such
generate growth might be transferred to those parts disruption of equilibrium conditions, the economy
of the world - first and foremost Africa but also would not develop. Over the next four decades
eastern Europe and Latin America as well as parts Schumpeter sought to elaborate a theory of
of the Middle East - that have economically been economic development informed by his own,
left behind. Given its pervasiveness and dominance, evolving, understanding of the changing reality of
a depiction of the institutions that define the most advanced capitalist economies.
contemporary capitalism is tantamount to a In particular, Schumpeter sought to understand
description of the economic world in which perhaps the role of the business enterprise in advanced
one-half of the world’s population now lives and to capitalist development. By the 1940s he had taken
which much of the other half now aspires. definitive leave of his youthful conceptions of the
There is no consensus among economists on the innovative entrepreneur as an individual actor and
definition of contemporary capitalism. The innovation as simply ‘new combinations’ of
dominant approach to analysing resource allocation existing resources. Rather, he saw that powerful
and the economic performance of an advanced business organizations both developed and utilized
economy rests on the notion that a capitalist productive resources to create new technologies and
economy is essentially a market economy that access new markets. The creation of new tech-
allocates resources to their most productive uses. nologies, moreover, destroyed the commercial via-
But what at any time and in any place, the student bility of old technologies. In Capitalism, Socialism,
of economic development asks, explains how those and Democracy, first published in 1942,
most productive uses come to exist? And why in Schumpeter argued that the process of ‘creative
certain times and places? Fundamental to capitalist destruction’ had become embodied in established
growth is ‘innovation’, the process that generates corporations as ‘technological “progress” tends,
goods and services that, even with factor prices held through systematization and rationalization of
constant, are of higher quality and lower cost than research and of management, to become more
those previously available effective and surefooted’, being ‘the business of
teams of trained specialists who turn out what is
Contemporary Capitalism 2157
required and make it work in predictable ways’ is replete with examples of the critical role of the
(Schumpeter , pp. 118, 132). developmental state in allocating resources to the
This article takes as its point of departure the processes of knowledge creation that then provided
proposition, suggested by Schumpeter, that the key the foundations for US industrial leadership. Yet, as
to understanding ‘capitalism’ as a mode of resource important as the developmental state has been even
allocation that generates economic growth is the in a so-called ‘market economy’ such as the United
organization and performance of its most innovative States, the allocation of resources to knowledge
business enterprises. That is not to say that markets creation would have been wasted, and would
and states are unimportant to the operation, and probably never have been made, had it not been for
hence definition, of capitalism. Historically, the presence and influence of innovative enterprises
however, well-functioning markets are outcomes of that have made use of this knowledge to generate
successful capitalist development. For the higherquality, lower-cost products than had
individual, markets create the possibility of previously been available.
choosing what to consume and for whom to work, In this article I focus on the changing role of
including the prospect of working for oneself. But innovative enterprise in determining resource
markets cannot explain the development of the new allocation and economic performance in contem-
products and processes that drive the growth of the porary capitalism. Space constraints dictate that I
capitalist economy. The innovation process is confine the analysis of contemporary capitalism to
uncertain, collective and cumulative (see O’sullivan the case of the United States, with the caveat that,
). The uncertain character of innovation means that even in a highly globalized economy in which one
investments in innovation require strategic control might expect convergence to a common business
over resource allocation by individuals who have model, there are almost as many distinctive
intimate knowledge of the technologies, markets ‘varieties of capitalism’ in terms of governance,
and competitors that an innovative strategy must employment and investment institutions, as there
confront. The collective character of innovation are advanced capitalist nations. The US economy is,
means that the implementation of an innovation however, the world’s largest and richest economy.
strategy requires the organizational integration of a It is also the one in which market ideology is most
hierarchical and functional division of labour into a virulent and the actual mode of resource allocation
process of organizational learning. The cumulative most misunderstood. Section “ ” of
character of innovation means that the process this article provides historical background to
requires financial commitment until it can generate understanding contemporary US capitalism by
financial returns. Enterprises, not markets, engage describing the key characteristics of the ‘Old
in strategic control, organizational integration and Economy business model’ (OEBM) that made the
financial commitment (Lazonick ). United States the world’s most powerful nation in
Nor can one explain innovation by appealing to the decades after the Second World War. Section “
the notion of the developmental state as its driving analyses the challenges that confronted OEBM in
force, as has often been done for the East Asian the 1970s and 1980s, and how the ideology of
economies. Implicit, and at times explicit, in this ‘maximizing shareholder value’ arose to legitimize
view is an acceptance of the ideology that the a redistribution of income from labour interests to
economic development of the United States is an financial interests. Section “
exemplar of the workings of the market economy. ” shows how the ‘New
Yet from gun manufacture and interchangeable Economy business model’ (NEBM) emerged in the
parts in the first half of the 19th century to the 1980s and 1990s to drive the innovation process,
computer revolution and Internet in the late 20th but in ways that have contributed to unstable and
century, as well as railroads, aviation and the life inequitable economic growth. Section “
sciences in between, the history of US capitalism
2158 Contemporary Capitalism
contracts, programmes for technology development the rapid increase in Japanese exports to the United
in sectors such as aerospace, computers and life States in the 1970s to Japan’s lower wages and
sciences. The development of the productive longer working hours. By the early 1980s, however,
potential of these government investments relied in with real wages in Japan continuing to rise, it
turn on corporate research capabilities. Retained became clear that Japanese advantage was based on
earnings formed the foundation of committed the superior organization of their enterprises, and in
finance for new corporate investments in particular on a more thoroughgoing integration of
innovation. When corporations needed additional participants in the functional and hierarchical
investment financing, they issued corporate bonds divisions of labour for the dual purposes of
at favourable rates that reflected the established transforming technologies and accessing new
position of the company as well as its conservative markets. Indeed, during the 1980s Japan exported
debt-equity ratios. Companies used bank loans management practices as well as material goods to
almost exclusively for working capital, and made the West. From the second half of the 1980s, with
only limited use of the stock market as a source of the yen strengthening and trade surpluses generating
investment funds. political backlash, Japanese companies made a
These social conditions enabled US corporations transition to direct investment in the United States
to grow very large in the post-war decades. The 50 and other advanced economies.
largest US industrial corporations by revenues on A growing financial orientation of US business
the Fortune 500 list averaged 87,070 employees in that had surfaced in the conglomerate movement of
1957, 117,393 in 1967, and 119,093 in 1977. These the 1960s undermined the abilities and incentives of
figures do not include employment at AT&T, the established US corporations to respond to the
regulated telephone monopoly, which in 1971 Japanese challenge. To some extent the growth of
employed 1,015,000 people, of whom 700,000 were the US industrial corporation in the post-war
union members with good wages, stable decades had been based on strategic investments in
employment and excellent benefits. By the late new product lines and geographic areas that built on
1960s and early 1970s increasing numbers of blacks the corporation’s existing productive capabilities,
were moving into union jobs in the steel, and yielded economies of scale and scope. The
automobile, electrical equipment, consumer durable conglomerate movement, however, saw major
and telecommunications industries. The growth of corporations invest in scores of unrelated
established corporations in these industries in the businesses, often through mergers and acquisitions,
three decades after the Second World War based on the prevailing, but erroneous, ideology
contributed to a more equal distribution of family that a good corporate executive could manage any
income in the US economy. type of business, and that conglomeration offered
the synergies of superior corporate management.
The conglomerate movement failed because it
'Maximizing Shareholder Value' segmented top executives, in positions of strategic
control, from the rest of the managerial organization
During the 1970s the US model faltered in the face
that had to develop and utilize productive resources
of Japanese competition. Building on innovative
to sustain the firm’s competitive advantage
capabilities developed for their home markets
(Lazonick ).
during the 1950s and 1960s, Japanese companies In the late 1970s and early 1980s the conglom-
gained competitive advantage over US companies erates unraveled. In the mid-1970s Michael Milken,
in industries such as steel, memory chips, machine a Drexel Burnham investment banker, had created
tools, electrical machinery, consumer electronics the junk bond market by convincing institutional
and automobiles. US companies had entered the investors, in search of higher yields in an
1970s as world leaders in these industries. Many inflationary era, to hold downgraded corporate
US observers attributed securities, many of them ‘fallen angels’ from
1960-9 1970-9 1980-9 1990-9 2000-5
Source'. Council of Economic Advisers ( , Tables B-62, B-73, B-95 and B-96)
Notes: Stock yields are for Standard and Poor’s composite index of 500 US corporate stocks (424 of which are, as of 28 March
2006, NYSE). Bond yields are for Moody’s Aaa-rated US corporate bonds
unsuccessful conglomeration. By the late 1970s, States option awards did not require that the
with the junk-bond market well developed, it company’s stock price outperform the stock market
became possible to issue new junk bonds to finance or even the stock prices of a group of competitors,
leveraged buyouts (LBOs) in which the top those who received these awards could only gain
managers of a conglomerate division turned it into from what, from July 1982 to August 2000, turned
an independent company to recapture strategic out to be the longest stock market boom in US
control over resource allocation. By the late 1980s, history, with the Dow Jones Industrial Average and
however, the junk bond had become an instrument the S&P500 Index both rising about 1,300 per cent.
for the hostile takeover of entire companies, with As Table shows, stock-price appreciation drove
KKR’s 1989 LBO of RJR Nabisco for $24.5 billion the extraordinary real stock yields that were
marking the height of what became known as ‘the sustained over the 1980s and 1990s. The relatively
deal decade’. low dividend yields in the 1990s did not reflect
The ideology that justified hostile takeovers was stinginess on the part of US corporations; the US
that the corporation should be run to ‘maximize corporate payout ratio - the amount of dividends as
shareholder value’ (see Lazonick and O’sullivan ). a percentage of after-tax corporate profits (with
Proponents of shareholder value charged that, either inventory evaluation and capital consumption
because of opportunism or incompetence, many adjustments) - averaged 48 per cent in the 1980s
incumbent corporate managers were making poor and 57 per cent in the 1990s compared with 3 9 per
allocative decisions. By exercising their influence cent in the 1960s and 41 per cent in the 1970s. It
through the market for corporate control, was just that the rate of increase of stock prices
shareholders could force incumbents to alter their outstripped the rate of increase of dividend
allocative decisions, replace them with those who payments, thus depressing the dividend yield. The
would maximize shareholder value, or distribute form that the stock yield takes is of significance
cash to shareholders in the forms of dividends and because investors can capture the dividend yield by
stock repurchases so that shareholders themselves holding stocks, whereas they can capture the price
could, so the argument goes, reallocate the yield only by buying and selling stocks. Inherent in
economy’s resources to their best alternative uses. high-price yields, therefore, is a volatile stock
While the hostile takeover movement did not market.
directly threaten high-tech companies (in which the A volatile stock market benefits those who are
most valuable assets could walk out the door), by compensated in stock options on an annual basis,
the end of the 1980s the top executives of virtually especially when, as is the case in the United States,
all US industrial corporations had embraced the options vest as quickly as one year from the date of
ideology of maximizing shareholder value and grant and can be exercised for up to ten years. It has
made it their own. By the 1980s executive stock been estimated that, largely because of the gains
option compensation was a well-established from exercising stock options, on average the ratio
practice. Since in the United of CEO pay of an S&P500
Contemporary Capitalism 2161
company to that of a production worker was 42 in period 1991-4, 450,000 per year in 1995-7, and
1985, 107 in 1990, 525 in 2000, and 411 in 2005. 656,000 per year during the boom years 1998-2000.
Top executives took a keen interest in their Meanwhile, from the mid-1980s US corpora-
company’s stock price, and in the 1980s and 1990s, tions began to actively support their stock prices
in the name of ‘maximizing shareholder value’, they through large-scale stock repurchases. Companies
found ways in which they could use their positions included in the S&P500 in March 2006 distributed
of strategic control over corporate resource more cash to shareholders in repurchases than in
allocation to influence it. They could cook the dividends in 1997 through 2000 and again in 2004,
corporate books to boost current earnings, a practice and just slightly less in 2001 through 2003. Since
that became widespread in these decades and one 1978 net equity issues by US non-financial
for which a few executives have been fined or even corporations has been positive in only six of 28
jailed. The American Competitiveness and years (1980, 1982, 1983, 1991, 1992, 1993); since
Corporate Accountability Act of 2002, better known the early 1980s US industrial corporations have in
as Sarbanes-Oxley, has sought to stem this practice. aggregate been supplying capital to the stock market
But quite apart from artificially inflating corporate rather than vice versa. In 2005 the net flow of cash
earnings, top corporate executives also found that from non-financial corporations to the stock market
downsizing the labour force and repurchasing was a record $366 billion, 1.42 times in real dollars
corporate stock helped to boost a company’s stock the previous high in 1998 (Lazonick ).
price, even though these resource allocations did not
necessarily improve the company’s competitive
performance. The New Economy Business Model
The era of corporate downsizing took hold in the
recession of 1980-2 when hundreds of thousands of On 29 December 1995, AT&T announced that, as
stable, well-paid blue-collar jobs were lost that were part of the process of breaking itself up into three
never subsequently restored (see Lazonick ). It separate companies, it would be cutting 40,000 jobs.
would appear that the blacks who had relatively AT&T was a company that could trace its origins
recently moved into these types of jobs were back to the 1870s, had created the world’s most
particularly hard hit; last hired, they tended to be the advanced telephone system, was the home of the
first fired. The subsequent ‘boom’ years of the mid- famous Bell Labs that among many other
1980s witnessed hundreds of plant closings. In the accomplishments invented the transistor in 1947,
‘white-collar’ recession of the early 1990s tens of and, despite having lost its status as a regulated
thousands of professional, administrative and monopoly in 1984, still employed 308,700 people.
technical employees found that their jobs had been Now, however, AT&T became emblematic of the
eliminated, although once again it was blue-collar failure of US Old Economy corporations to continue
workers who bore the brunt of the downturn. In to provide employment opportunities. With
1980 manufacturing employment was 22 per cent of campaigning for the 1996 presidential election
the labour force; by 1990 it had fallen to 17 per cent picking up steam, Patrick Buchanan, a right- wing
and by 2001 to 14 per cent. While the employment politician, caught the attention of the media by
picture generally became much better during the denouncing the highly paid executives of AT&T
Internet boom of the last half of the 1990s, job and other downsizing corporations as ‘corporate hit
cutting remained a way of life for many major US men’. Fuel was added to the fire by the revelation
corporations. According to data on layoff that, in the name of ‘creating shareholder value’, A1
announcements by companies in the United States Dunlap, whom the American public came to know
collected by the recruitment firm, Challenger, Gray as ‘Chainsaw AT, had in 20 months as CEO of Scott
and Christmas, announced job cuts averaged just Paper devastated the 115-year old company while
under 550,000 per year for the putting an estimated
Old economy business model (OEBM) New economy business model (NEBM)
Strategy, Firm growth based on multidivisional structure: multi- New firm entry into specialized ICT markets; accumulate
product product firm new capabilities by acquiring (other) young technology
firms
Strategy, Vertical integration of the value chain; in-house standards Vertical specialization of the value chain; industry
process and proprietary R&D technology standards; R&D for cross-licensing and
alliances; outsourcing routine work to specialist contract
manufacturers and/or offshoring routine work to low-
wage nations
Finance Venture finance from savings, family and business
associates; NYSE listing, growth finance from retentions, Organized venture capital; early IPO on NASDAQ;
after dividends, and bond issues retentions with zero dividends; use of own stock as a
compensation and combination currency; systematic
stock repurchases to support stock price
Organization
Insecure employment; interftrm mobility of labour,
Secure employment; ‘organization man’ (career with one broad-based stock options, nonunion, defined
company), industrial union; defined- benefit pension, contribution pension, employees bear more burden of
good medical coverage in employment and retirement medical insurance
$100 million in his own pocket. In March 1996, the continued to dominate the microprocessor and logic
New York Times ran a seven-part series, later chip markets that drove product innovation in the
released as a paperback, on ‘the downsizing of microelectronics industry (Lazonick ). While
America’ (Lazonick ). Silicon Valley was not the only US location for
By the spring of 1996, however, the fbror over innovation in this industry, the concentration of
corporate downsizing had disappeared. In its place, semiconductor start-ups in the region from the late
Americans became enthralled by the prosperity 1950s resulted in the emergence by the 1980s of a
promised by what in the second half of the 1990s distinctive mode of combining strategy, finance and
came to be called the ‘New Economy’. In the organization: the ‘New Economy business model’
United States the previous half-century had seen a (NEBM) (see Table ). During the 1990s NEBM
massive accumulation of information and spread beyond Silicon Valley start-ups and was
communications technology (ICT) capabilities. The adopted successfully by leading Old Economy ICT
development of computer chips from the late 1950s companies such as Hewlett-Packard and IBM. In
had provided the technological foundation for the the Internet boom of the late 1990s elements of
microcomputer revolution from the late 1970s, NEBM diffused to other ICT companies, including
which in turn had provided the technological an Old Economy company such as Lucent
infrastructure for the Internet boom of the second Technologies, spun off from AT&T in its 1996
half of the 1990s. The research funding for this trivestiture, which almost destroyed itself in
accumulation of ICT capabilities had come mainly attempting to adopt the business model. In the
from the US government and the research 2000s NEBM characterizes the most innovative
laboratories of established Old Economy high-
sectors of the US economy (for the case of
technology corporations. Each wave of techno-
biotechnology, see Pisano )•
logical innovation, however, created opportunities
The founders of New Economy firms have
for the emergence of start-up companies that were
typically been scientists and engineers who have
to become central to the commercialization of the
gained specialized experience in existing firms,
new technologies.
although in some cases they have been university
Although by the mid-1980s the Japanese had
faculty members intent on commercializing their
outcompeted even the leading US semiconductor
academic knowledge. Some of these entrepreneurs
firms in the memory chip market, US companies
have come from existing Old Economy
such as Intel, Motorola and Texas Instmments
Contemporary Capitalism 2163
companies, where it was often difficult for their these highly mobile people and retain their services,
new ideas to get internal backing. But New Econ- Silicon Valley firms increasingly adopted ‘broad-
omy companies themselves have become increas- based’ employee stock option plans that extended
ingly important as sources of new entrepreneurs this form of compensation to a large proportion,
who left their current employers to start new firms. sometimes all, of the firm’s non-executive
Large numbers of high-tech entrepreneurs in the employees rather than just to top executives. In
United States have been foreign-bom, coming start-ups, stock options usually served as a partial
mainly from India and China (Saxenian ). substitute for cash salaries, and the eventual gains
Typically, the founding entrepreneurs of a New from exercising options were viewed as a substitute
Economy start-up seek committed finance from for a company-funded pension (Lazonick ).
venture capitalists with whom they share not only Again, the underlying stock would become
ownership of the company but also strategic control. valuable if and when the start-up did an IPO or a
In the 2000s Silicon Valley remains by far the private sale to a publicly listed company. Short-
leading location in the United States and the world ening the expected period between the launch of a
for venture-backed high-tech start-ups. The region company and an IPO was the practice of most
acquired this position from the 1960s as a venture-backed high-tech start-ups of going public
distinctive venture capital industry emerged out of on NASDAQ, created in 1971 as an electronic
the opportunities for start-ups created by the exchange for the over-the-counter markets with less
microelectronics revolution. Besides sitting on the stringent listing requirements than the ‘Old
board of directors of the new company, the venture Economy’ New York Stock Exchange (NYSE). The
capitalist generally recruits professional managers, 1978 cut in the capital gains tax rate to 28 per cent,
who are given company stock along with stock after it had been raised to 49 per cent just two years
options, to lead the transformation of the firm from before, provided further encouragement to
a new venture to a going concern. This stock-based entrepreneurs and venture capitalists to found new
compensation gives these managers a powerful companies, and for employees of these companies,
financial incentive to develop the innovative rewarded with stock options, to provide the skills
capabilities of the company to the point where it can and efforts needed to transform new ventures into
do an initial public offering (IPO) or private sale to going concerns. In 1979 the clarification of the
a listed company, thus enabling the start-up’s ‘prudent man’ rule as applied to the Employee
privately held shares to be transformed into publicly Retirement Income Security Act (ERISA) of 1974
traded shares. Both before and after making this gave asset managers the green light to allocate a
transition, their tenure with, and value to, the portion of their portfolios to riskier stocks and
company depends on their managerial capabilities, venture capital funds, and resulted in a flood of new
not their fractional ownership stakes (Lazonick ). money, especially from pension funds, into the
The stock market speculation of the ‘dotcom’ venture capital industry. The American Electronics
era made it all too easy to cash out of a start-up, as Association and the National Venture Capital
many high-tech firms that had not engaged in Association, with their strongest and deepest roots
innovation did IPOs or were sold to established in Silicon Valley, were the frontline Washington
companies. When start-ups do innovate, the key to lobbyists for these regulatory changes.
making the transition from new venture to going While institutional money provided capital to
concern has been the organizational integration of NEBM, high-tech labour became more mobile from
an expanding body of technical and administrative one firm to another than it had been in the Old
‘talent’. As Silicon Valley developed from the Economy. Employee stock options induced this
1960s, this educated and experienced labour had to mobility, but what made it possible in terms of the
be induced to trade secure employment with an Old knowledge bases that managers and engineers
Economy company for insecure employment with a possessed were industry standards, as distinct
start-up. To attract
2164 Contemporary Capitalism
from in-house standards, that emerged in the var- careers and to increase the value of younger
ious sectors of ICT. In the Old Economy in-house employees who may have had experience working
standards promoted the growth of large vertically for other ICT companies. Explicitly reflecting this
integrated firms on the basis of proprietary tech- change in employment policy, in 1999 IBM
nologies, whereas in the New Economy industry announced that it would replace its traditional
standards encouraged new entry. Nevertheless, as defined-benefit pension plan, which favoured long-
demonstrated by the important cases of Intel and term employees, with a portable ‘cash- balances’
Microsoft in the development of the microcomputer plan that would be much more attractive to younger
industry, those New Economy firms that dominated employees who did not envisage a lifelong career
in the setting of the industry standards could also with IBM. In December 2004, as its employment
grow very large (at the end of fiscal 2005 Intel reached 329,000, IBM announced that new
employed 99,900 and Microsoft 61,000). By employees would no longer be eligible for the cash-
establishing industry standards, their growth balances pension fund. Instead the company would
encouraged rather than discouraged startups, which offer them a defined-contribution pension, with the
in turn depended on the availability of not only company matching the employee contribution up to
venture capital (which came from many sources six per cent of his or her compensation.
besides the formal venture capital industry) but also From the mid-1990s, with the Old Economy
mobile labour whose knowledge and experience commitment to its employees out of the way, IBM
could be easily integrated into the start-up’s adopted all of the elements of NEBM. It shifted out
learning processes. of hardware into services, and outsourced its
manufacturing. It became by far the leading
Of critical importance in setting industry stan-
patenter in the United States, even as it cut R&D
dards in microelectronics was IBM’s decision in
from the ten per cent of sales that prevailed in the
1980 to enter what became known as the personal
1980s to six per cent of sales since the mid-1990s,
computer (PC) industry with Intel supplying the
this change reflecting an expressed shift to product
microprocessor and Microsoft the operating system.
development and away from basic research. Since
At the time IBM controlled about 80 per cent of the
the early 1990s IBM has engaged in patenting much
computer market, had over 341,000 employees, and,
less to control proprietary technologies, as had been
with an explicit system of ‘lifelong employment’,
the case in the past, and much more to gain access
trumpeted the fact that since 1921 it had not
through cross-licensing to technologies controlled
terminated an employee involuntarily. Yet between
by other companies and to generate intellectual
1990 and 1994 IBM slashed its employment from
property revenue ($1.3 billion per year in the
374,000 to 220,000. In 1991-3, the company had 2000 s).
losses of $16 billion (including more than $8 billion As it rid itself of lifelong employment in the
in 1993, at the time the largest annual loss in US early 1990s IBM began to extend stock options,
corporate history) on total revenues of $192 billion, previously reserved for top executives, to a broad
and encouraged the media to believe that the mass base of employees. In 1990 options outstanding
layoffs were necessary to avoid bankruptcy. Yet were only four per cent of all shares outstanding; in
virtually all of the losses came from ‘restructuring’ 2005, 15.2 per cent. As for distributions to
charges, that is, the cost of terminating employees shareholders, in New Economy fashion, subsequent
(Lazonick )• to its early 1990s restructuring IBM has favoured
In retrospect, it is clear that these charges were repurchases over dividends. In 1981-90 IBM’s
the cost of ridding the company of its 70-year-old dividends were 48 per cent and repurchases 12 per
system of lifelong employment. The industry cent of net income; in 1993-2005 dividends were 15
standards in ICT, which IBM had played a leading per cent and repurchases 91 per cent. In an effort to
role in establishing, served to reduce the value to offset dilution of shareholdings as employees
the company of older employees with experience exercise
accumulated at IBM over the course of their
Contemporary Capitalism 2165
stock options, and more generally to boost its stock gain access to new technologies and markets. Cisco
price, in 1995-2005 IBM has spent $62.6 billion on mastered this growth-through-acquisition strategy.
stock repurchases. Over the same period the From 1993 through 2005 Cisco made 106
company has spent $56.6 billion on R&D. acquisitions valued in nominal terms at $46.9
As for a New Economy company that, unlike billion, over 80 per cent of which was paid in the
IBM, started out that way, Cisco Systems, which company’s stock rather than cash. In 1999 and 2000
since the late 1990s has controlled about 75 per cent alone, Cisco did 41 acquisitions at a cost of $26.7
of the Internet router market, is a prime example of billion with over 99 per cent paid in stock
the importance, and implications, of broad-based (Carpenter et al. ).
stock options in NEBM compensation. Founded in At the same time, like many if not most New
Silicon Valley in 1984, Cisco grew from about 200 Economy companies, Cisco conserved cash by
employees at the time of its IPO in 1990 to 40,000 paying no dividends. Along with its use of stock as
employees during 2000. Throughout its history a combination currency, this payout policy enabled
Cisco has awarded stock options to virtually all of Cisco to become a giant company in the 1990s
its employees. By the end of fiscal 2000 stock without taking on any long-term debt. Since the
options outstanding accounted for 14 per cent of the bursting of the Internet bubble from mid-2000
company’s total stock outstanding; by 2005 that through 2005, however, Cisco has spent $27.2
number was 23 per cent. In March 2000, at the peak billion repurchasing its own stock to support its
of the Internet boom, Cisco had the highest market sagging stock price. In 2004-5, as it spent $19.3
capitalization of any company in the world. Under billion on stock repurchases, Cisco used $8.3 billion
such conditions its stock options were very lucra- in cash - including $6.5 billion of it raised through
tive. I have estimated that over the 11 years 1995- its first-ever bond issue - to do 24 acquisitions
2005 (all years for which data are reported refer to rather than continue to use its stock as an
fiscal year’s end, the last week in July), Cisco acquisition currency that it would then feel
employees, totaling about 256,000 employee-years, compelled to offset with repurchases. (Cisco’s
shared $21.5 billion in gains from exercising stock decision to use cash rather than stock for
options, for an average of $84,000 per employee- acquisitions was helped by the Financial
year. The annual averages per employee ranged Accounting Standards Board’s 2001 closing of the
from less then $9,000 in 2003 to more than ‘pooling-of-interests’ loophole that enabled
$281,000 in 2000. Of the total amount, the highest companies like Cisco that did all-stock acquisitions
paid executives, totaling 57 executive-years, shared to record them on their balance sheets at book
$893 million, for an average of $15.7 million per values, which were generally a small fraction of
executive-year, with annual averages ranging from market values, and thus inflated future earnings.
$1.3 million in 2003 to $51.3 million in 2000. The Nevertheless, in 2002 and 2003, with pooling-of-
annual ratios of average top-executive to average interests accounting outlawed, Cisco still used stock
employee gains from exercising stock options for payment of over 97 per cent of the price of its
ranged from 36:1 in 1997 to 594:1 in 2005 nine acquisitions.)
(Lazonick ). Cisco employees have a clear financial The corporate obsession with supporting its
interest in the company’s stock price, and the stock price through massive stock repurchases has
company’s top executives even more so. therefore taken hold of companies in the most
Besides using their own stock as a compensation innovative sectors of the US economy. As further
currency, during the 1990s some New Economy notable examples, for the years 1995-2005 Intel
companies grew large by using their stock, instead distributed $51.3 billion in repurchases along with
of cash, to acquire other, smaller and typically $6.0 billion in dividends compared with R&D
younger, New Economy firms in order to spending of $38.0 billion, while Microsoft
distributed $45.4 billion in repurchases and $38.7
billion in dividends compared with R&D spending
of $40.8 billion.
2166 Contemporary Capitalism
Microsoft’s dividends included a one-time payment better employment opportunities for the indigenous
of $36.1 billion in November 2004. well-educated labour force. As a striking example,
These companies would argue that R&D in 1984 Intel claimed that, of its 8,500 employees
spending and stock repurchases are both working outside of the United States (of26,000 employees
toward the same end: to enhance the company’s worldwide), only 60 were US citizens. This
innovative capabilities by, in the case of R&D, indigenous employment through foreign direct
generating new knowledge, and in the case of investment encouraged the national governments to
repurchases, competing for high-tech labour capable increase the level of investment in their already
of transforming that knowledge into innovative well-developed systems of higher education, thus
products and processes. By boosting stock prices, it augmenting the future high-tech labour supply
is argued, repurchases help to attract, retain and (Lazonick ).
motivate people who choose to work for companies In the 1990s established US ICT companies, led
in which they are partially compensated with stock by IBM and Hewlett-Packard, dramatically reduced
options. In the case of Microsoft the argument has their employment of production workers by
had less weight since July 2003 when the company outsourcing manufacturing operations to electronic
ended its option programme (although many manufacturing service providers, also known as
Microsoft employees still have unexercised options contract manufacturers (Lazonick ). Indeed,
awarded prior to that date). In the 2000s, moreover, younger companies like Cisco grew rapidly without
the extent and location of the talented labour doing any in-house manufacturing. Initially the
supplies for which companies like Cisco, IBM, Intel contract manufacturers would set up operations or
and Microsoft compete have changed dramatically take over existing plants of their customers in the
with the rise of India and China (Lazonick ). These United States. But a key capability of the leading
dramatically changed labour market conditions for contract manufacturers is to shift production that
high-tech labour raise serious questions concerning has become more routine and cost-sensitive to
which employees benefit from a company’s stock lower wage areas of the world. In the late 1990s and
price performance and for how long, and indeed early 2000 s the leading contract manufacturers grew
whether established high-tech companies even need at a rapid pace; at the end of 2005 employment at
to use employee stock options to compete the five largest - Flextronics, Solectron, Sanmina-
successfully for high-tech labour. SCI, Celestica and Jabil Circuit - totalled 260,000.
The offshoring to India and China in the 2000s While we do not know the global distribution of
of high value-added jobs of software engineers and this labour force, North America accounts for only
computer programmers that it was previously an estimated 25 per cent of the sales of these five
thought could not go abroad represents the latest companies.
stage in four decades of the globalization of NEBM. Meanwhile, in the 1990s and 2000s hundreds of
Beginning in the early 1960s Silicon Valley thousands of foreigners, especially Indians, with
semiconductor companies were among the first to college degrees in science and engineering have
offshore assembly to East Asia, and by the early migrated to the United States for graduate education
1970s virtually every US semiconductor and work experience (Lazonick ). Many acquired
manufacturer had followed suit. When these com- permanent resident (immigrant) status in the United
panies set up plants in places like South Korea, States, as the US government expanded
Taiwan, Hong Kong, Singapore and Malaysia, they employment-based preferences in the issuance of
employed, alongside unskilled and predominantly immigrant visas. For access to US work experience,
female assembly labour, indigenous university however, the most important mode of entry for
graduates as managers and engineers. Over time the high-tech employees has been on non-immigrant H-
US companies upgraded their facilities in these 1B and L-l visas. The H-1B programme enables
locations, and offered more and
Contemporary Capitalism 2167
non-immigrants, the vast majority of whom have at is that America must decide to compete. If we don’t
least a bachelor’s degree and whose skills are compete and win, there will be very serious conse-
quences for our standard of living and national
purportedly unavailable in the United States, to security in the future.. .U.S. leadership in the nano-
work in the United States for up to six years. In the electronics era is not guaranteed. It will take a
first half of the 2000s about 70 per cent of H-1B massive, coordinated U.S. research effort involving
visa holders had science or technology degrees, and academia, industry, and state and federal govern-
ments to ensure that America continues to be the
40-50 per cent came from India (the next largest world leader in information technology.
national group is from China, at less than ten per
cent). The L-l visa programme permits a company At the time Barrett was a member of the US
with operations in the United States to transfer National Academy of Sciences Committee on
foreign employees to the United States to acquire Prospering in the Global Economy in the 21st
work experience, with no limitation of time. In Century, which delved into deficiencies in the
2001, there were an estimated 810,000 people on H- development of science and engineering capabilities
1B visas in the United States, and possibly as many in the United States. Notwithstanding his obvious
on L-l visas. concern about these problems from a public policy
Many of these non-immigrant visa holders have perspective, on a radio talk show in February 2006
continued to work in the United States by obtaining Barrett (by this time Chairman of Intel) remarked:
permanent resident status. But most have returned ‘Companies like Intel can do perfectly well in the
to their native countries with valuable industrial global marketplace without hiring a single US
experience that can be used to start new firms and, employee’ (wbur.org ).
more typically, to work as technical specialists for The problem with this statement is not that US
indigenous or foreign companies. As a result of workers should have privileged access to jobs at a
both the migration of US companies abroad in US-based company like Intel (which still employs
search of high-tech labour as well as the migration half of its almost 100,000 employees in the United
of foreign high-tech labour to the United States, and States). The problem is that, if a powerful company
then back to their home countries, in the 2000 s, to like Intel is not dependent on US high-tech
an extent never before imagined, even the best- employees for its future labour force, why should it
educated US high-tech employees compete with a be concerned about supporting the mass educational
truly global labour supply for jobs. infrastructure in the United States needed to
develop this future labour force? And what does it
mean to say that ‘America must decide to compete’
Stable and Equitable Growth? if, as I would argue is the case (Lazonick ), the most
innovative US corporations have more of an interest
On 16 March 2005 the Semiconductor Industry in the Malaysian or Indian system of mass
Association (SIA) organized a Washington, DC education than in the US system?
press conference in which it exhorted the US Since the mid-1970s the US mass education
government to step up support for research in the system has been performing poorly in science and
physical sciences, including nanotechnology, to mathematics by the standards of both the advanced
assure the continued technological leadership of the and many developing economies. Such was much
United States. Intel CEO Craig Barrett was there as less the case in the three decades or so after the
a SIA spokesperson to warn: ‘U.S. leadership in Second World War, when the Old Economy
technology is under assault’ corporation was more dependent upon a labour
(Electronic News ): force that was well-educated at the primary and
The challenge we face is global in nature and secondary levels in the United States. This shift in
broader in scope than any we have faced in the
past. The initial step in responding to this challenge
the performance of the mass education
2168 Contemporary Capitalism
parallels the reversal of post-war progress towards a corporations have been in the forefront of innova-
more equal distribution of income that began about tion. Given the unchallenged sway that the ideology
three decades ago. The much less secure of ‘maximizing shareholder value’ has over the
employment of most US corporate employees in the governance of these corporations, I have no doubt
shift from OEBM to NEBM would seem to have that instability, as reflected in the boom and bust of
contributed to this reversal. the stock market in the late 1990s and early 2000 s,
Meanwhile in the 2000s the compensation of the and inequity, as reflected in the worsening of the
CEOs of US corporations has long since passed distribution of income, will continue to beset the US
levels that are at a minimum unseemly and some economy.
would say obscene. The ‘explosion in CEO pay’, Whether US corporations will remain in the
which has been discussed in the United States since forefront of innovation that, by necessity, must
the mid-1980s, seems to have no limits, especially underpin long-term economic growth is another
if, when the corporate stock price falls, it can be matter. Notwithstanding globalization, the US
once again pumped up or boards of directors can model of contemporary capitalism is not a global
replace the ‘lost’ stock option income by other model. No other contemporary capitalist economy
forms of remuneration such as salaries, bonuses or has made the commitment to ‘shareholder value’
restricted stock. The seemingly endless explosions that is the most distinctive feature of the US model.
in top-executive pay reflect the obsession of US Japan has come through the stagnation of the 1990s
corporate executives with ‘maximizing shareholder as a highly innovative economy, while eschewing
value’ and, cash flow permitting, disgorging shareholder value ideology and practices
billions upon billions of corporate cash to (Lazonick ). In western European nations the
shareholders in the forms of repurchases and ideology has been tempered by a commitment to
dividends to try to make it happen. ‘social inclusion’; the question is whether the equity
In terms of public policy initiatives, virtually and stability that social inclusion brings can be
nothing has been done to control top executive pay harnessed to support innovative enterprise. In the
in the United States. One well-known attempt was emerging giants, India and China, the stock market
misguided. In 1993 President Clinton carried out a has come to play a more important, and possibly
campaign promise to control CEO pay by dangerous, role. In all of these economies, the
legislating a cap of $1 million on the amount of success of innovative companies has been based,
‘non-performance-related’ top executive however, not on the stock market, but on the
compensation - salary and bonus - that a principles of strategic control, organizational
corporation could claim as a tax deduction. One integration, and financial commitment. Historically
perverse result of this law was that companies that these principles also underpinned innovative
were paying CEOs less than $1 million in salary enterprise in the United States. Many corporate
and bonus raised these components of CEO pay executives who exercise control over resource
towards $1 million, which executives now viewed as allocation in the US economy may, however, have
the government-approved CEO ‘minimum wage’. forgotten these principles, or worse yet, while they
The other perverse result was that companies have been busy enriching themselves, they may
increased CEO option compensation, for which tax have never bothered to leam them.
deductions were not in any case being claimed, as
an alternative to exceeding the $1 million salary-
and-bonus cap.
That having been said, the limits to the gains
from stock options, not just for top executives but
Bibliography
also for broad bases of the employees of US high-
tech corporations, would long ago have been To conserve both the word-count and flow of this essay, I have
reached if not for the fact that many of these kept bibliographic references to a minimum, indicating
instead works of mine in which these references can be
found.
Contestable Markets 2169
best-practice production technology, the same input of industry whether or not scale economies are
markets and the same input prices as those available prevalent. Where they are not, perfectly competitive
to the incumbents. There are no legal restrictions on behaviour is necessary for equilibrium in PCMs,
market entry and exit, and there are no special costs and, where scale economics do prevail, equilibrium
that must be borne by an entrant that do not fall on in PCMs entails behaviour different from that found
incumbent firms as well. Consumers have no in perfectly competitive markets but which none the
preferences among firms except those arising less tends to exhibit desirable welfare properties. In
directly from price or quality differences in firms’ other words, perfect contestability is a
offerings. generalization of perfect competition that has strong
Potential entrants into perfectly contestable implications in significant circumstances where the
markets are profit-seekers who respond with pro- latter is inapplicable.
duction to profitable opportunities for entiy. They In order to clarify and expand on these ideas,
assess the profitability of their marketing plans by subsequent sections offer analytic outlines of the
making use of the current prices of incumbent theory of perfectly contestable markets and appli-
firms. Thus, for example, an entrepreneur will enter cations of the theory to single-product and multi-
a market if he anticipates positive profit from product industries. Finally, observations are offered
undercutting the incumbent's price and solving the on the implications of this theory for the
entire market demand at the new lower price. formulation of government policy towards industry.
Potential entrants are undeterred by prospects of
retaliatory price cuts by incumbents and, instead,
are deterred only when the existing market prices Perfectly Contestable Markets: Definitions and
leave them no room for profitable entry. Basic Properties
These features of the behaviour of potential
entrants are key to the workings of perfectly con- The theory presented here lies in the realm of partial
testable markets, and they are fully rational only equilibrium. It deals with the provision of
where entry faces no disadvantages and is costlessly the set of products N = {1.........n}, some of which
reversible. Hence, the benchmark case of perfect may not actually be produced, and which is a proper
contestability excludes the sunk costs, pre- subset of all the goods in the economy. The prices
commitments, asymmetric information and strategic of these products are represented by vectors p t R+
behaviour that characterize many real markets and +, and other prices are assumed to be exogenous and
that are the focus of much current research attention are suppressed in the notation. Qip) c R"+ is the
in the field of industrial organization. With vector-valued market demand function for the
irreversibilities and the inducements for strategic products in N, and it suppresses consumers’
behaviour absent, industry structure in PCMs is incomes which are assumed to be exogenous. For
determined by the fundamental forces of demand any output vectory e R", C(v) is the cost at
and production technology. exogenously fixed factor prices when production is
Of course, this is also true of perfectly compet- efficient, the underlying technology is assumed to
itive markets. However, this most familiar idealized be freely available to all incumbent firms and all
limiting case is not a satisfactory benchmark for the potential entrants. Where necessary, C(y) and Q(p)
study of industry structure in general, because it is will be assumed to be differentiable.
intrinsically inapplicable to a variety of significant
Definition 1 A feasible industry configuration is
cases. In particular, where increasing returns to
composed of in firms producing output vectors i
scale are present, perfectly competitive behaviour is
c Rfi, at prices p c R" such that the
logically inconsistent with the long-run financial
markets clear, W™ , y‘ = Q(p), and that each firm at
viability of unsubsidized firms.
least breaks even,/) . / - C(y') > 0,i = 1, ..., in.
Perfectly contestable markets can serve in place
of perfectly competitive markets as the general
standard of comparison for the organization
Contestable Markets 2171
following propositions summarize some charac- be dispensed with, almost entirely, with exclusive
teristics of equilibria in PCMs. reliance on the freedom of entry that characterizes
PCMs.
Proposition 3 Let /?; y1,..ym be a sustainable
industry configuration. Then each firm must (i) earn Proposition 5 Let /?; y 1 ,..., ym be a sustainable
zero profit by operating efficiently, p . y C(y‘) = 0; configuration. Then, for any y1, ... ,yk with
(ii) avoid cross-subsidization, Ps ■y‘s > C(y‘) — k mk m
C(y‘N s),MSCN (where the vector xT agrees with the E ^' = Ey> Ec (^)> Ec (y).
i= i i= i i= i i i
vector x in components j € T and has zeros for its
other components); (iii) price at or above marginal That is, a sustainable configuration minimizes
cost,/?, > dC(y')ldyj. the total cost to the industry of producing the total
The interpretation of condition (ii) is that the industry output.
revenues earned from the sales of any subset of the This proposition is a generalization to PCMs of
goods must not fall short of the incremental costs of a well-known result for perfect competition. It can
producing that subset. Otherwise, in view of the be interpreted as a manifestation of the power of
equality of total revenues and costs, the revenues unimpeded potential entry to impose efficiency
collected from the sales of the other goods must upon the industry. For example, the proposition
exceed their total stand-alone production cost. In implies that if a monopoly occupies a PCM it must
be a natural monopoly - production by a single firm
PCMs, such pricing invites entry into the markets
must minimize industry cost for the given output
for the goods providing the subsidy.
vector. Thus, Propositions , and are powerful tools
Proposition 4 Let p; y1,..., ym be a sustainable for the analysis of industry structure in PCMs.
configuration with yj < ELi yj ■ Thcn Pj = dCiy^/dyj. Proposition permits information on the properties of
That is, if two or more firms produce a given good production costs to be used to assess the scale and
in a PCM, they must select input-output vectors at scope of firms’ activities in PCMs. Then,
which their marginal costs of producing it are equal Propositions and permit inferences to be drawn
to the good’s market price. about the corresponding equilibrium prices.
The implications of this result are surprisingly
strong. The discipline of sustainability in perfectly PCMS with a Single Product
contestable markets forces firms to adopt prices just
equal to marginal costs, provided only that they are This analytic approach leads to very strong results
not monopolists of the products in qst. Conventional in the single-product case. Propositions - show that
wisdom implies that, generally, only perfect there are only two possible types of sustainable
competition involving a multitude of firms, each configurations in single-product industries. The first
small in its output markets, can be relied upon to type involves a single firm which charges the lowest
provide marginal-cost prices. Here we see that price that is consistent with nonnegative profit. The
potential competition by prospective entrants, rather firm must be a natural monopoly when it produces
than rivalry among incumbent firms, suffices to the quantity that is demanded at this price. And, in
make marginal-cost pricing a requirement of this circumstance, the result maximizes welfare
equilibrium in PCMs, even those containing as few subject to the constraint that all firms in question be
as two active producers of each product. The viable financially without subsidies. Such a second-
conventional view holds that the enforcement best maximum is referred to as a ‘Ramsey
mechanism of full competitive equilibrium requires optimum’.
the smallness of each active firm in its product The second type of sustainable configuration
market, in addition to freedom of entry. We see that involves production by one or more firms of
the smallness requirement can
Contestable Markets 2173
outputs at which both marginal cost and average Proposition 6 In a single-product industry in
cost are equal to price. Here, in the long run, all which the firm’s average cost curve has a flat-
active firms exhibit the behaviour that characterizes bottom between minimum efficient scale and twice
perfectly competitive equilibrium. And, of course, minimum efficient scale, a configuration is
the result involves both (first-best) welfare sustainable if and only if it is Ramsey optimal.
optimality and financial viability. Hence, in this This result shows that, under the conditions
case, Ramsey optimality and the first-best coincide. described, there is equivalence between welfare
This establishes the result that in a singleproduct optimality and equilibrium in PCMs. This extends
industry any sustainable configuration is Ramsey the corresponding result for perfectly competitive
optimal. equilibria to cases of increasing returns to scale.
However, in general, because of the ‘integer Moreover, since the behavioural assumptions
problem’, sustainable configurations may generally required for a PCM are weaker than those under-
not exist. This problem arises, for example, where lying perfectly competitive markets, the equiva-
there is only one output at which a firm’s marginal lence result is more sweeping. In particular,
and average costs coincide, and where the quantity Proposition implies that PCMs can be expected to
of output demanded by the market at the perform well, whatever the number of firms
competitive price is greater than this, but is not an participating in equilibrium. It is the potential
integer multiple of that amount. Then, no sus- competition of potential entrants, rather than the
tainable configurations exist. active competition of existing rivals, that drives
There is, however, a plausible assumption, equilibrium in PCMs with a single product to
supported by empirical evidence, at least to some welfare optimality.
degree, that eliminates the integer problem.
Suppose that a firm’s average cost curve has a flat-
bottom rather than being ‘U’-shaped. In particular, Multi-Product Perfectly Contestable
suppose that the minimum level of average cost is Markets
attained not only at one output, but (at least) at all
outputs between the minimum efficient scale, ym, In industries that produce two or more goods, a rich
and twice the minimum efficient scale. Then any variety of industry structures become possible, even
industry output, y1, that is at least equal to ym can be in PCMs. Here, while the constraints imposed upon
apportioned among an integer number of firms, incumbents by perfect contestability are not nearly
each of which achieves minimum average cost. as effective in limiting the range of possible
Specifically, y1 can be divided evenly among outcomes as they are in single product industries,
L);1/y"J firms (where |vj is the largest integer not they nevertheless provide a helpful basis for
greater than x) and each firm’s output, y1/ L);1/y'"J analysis. In particular, Propositions - indicate
must lie in the (half-open) interval between y,„ and connections among various qualitative properties of
2y,„. Hence, in this case, the Ramsey optimum can multi-product cost functions and various elements
either be a sustainable configuration of two or more of industry structure in PCMs. These connections
firms performing competitively, or a sustainable constitute one theme of this section. The other
natural monopoly. Such a monopoly may either theme is the normative evaluation of the industry
produce an output at which there are increasing structures that arise in multi-product PCMs.
returns to scale and it will then price at average Before proceeding, it may be useful to provide
cost, or it may produce an output between y,„ and definitions of some of the multiproduct cost prop-
2y,„ with locally constant returns to scale and adopt erties that are used in the analysis.
a price equal both to average and marginal cost.
Definition 5 Let P = [ 7',, ... , Tk) be a nontrivial
This, together with the preceding argument,
partition of S C N. There are (weak)
establishes the following result.
2174 Contestable Markets
economies of scope at ys with respect to the partition over the set of goods it produces. When strict
P if ffki=i C(yTi) > (>)C'(js). If no partition is economies of scope are present, there must be at
mentioned explicitly, then it is presumed that least one multi-product firm in any PCM that
T, = {/}• supplies more than one good.
The second basic question that arises is whether
Definition 6 The degree of scale economies there can be two or more firms actively producing a
defined over the entire product set, N {1,.. n), at}’, particular good in a PCM. If there are, then, by
is given by = C(y)/y.VC(y). Proposition , marginal cost pricing must result. The
Returns to scale are said to be increasing, con- answer depends upon the availability of product-
specific scale economies.
stant or decreasing as SA- is greater than, equal to or
less than unity. This occurs as the elasticity of ray Proposition 8 Any product with average incre-
average cost with respect to t is negative, positive or mental costs that decline throughout the relevant
zero; where ray average cost is RAC (O'0) - range (that is, that offers product-specific increasing
C{ty°)/t. returns to scale) must be produced by only a single
Definition 7 The incremental cost of file product firm (if it is produced at all) in a PCM. Further,
set T C Naty is given by IC j{y)= Ciy) C (yy _ f). such a product must be priced above marginal cost,
The average incremented cost of T is AICr (y) unless the degree of product- specific scale
=ICifyVEy e 7}’j. economies is exactly one.
The average incremental cost of T is decreasing, Thus, regardless of file presence or absence of
increasing or constant at y if MCftyT-yN T) is a economies of scope, globally declining average
decreasing, increasing or locally constant function incremental costs imply that a product must be
of t at t = 1. These cases are labelled respectively, monopolized in a PCM. It is an immediate corollary
increasing, decreasing or constant returns to the that if all goods in the set N exhibit product-
scale of the product line T. The degree of scale specific scale economies, and if there are economies
economics specific to T is of scope among them all, then the industry is a
natural monopoly that must be monopolized in a
PCM.
dC(y)
ICr(y)/$>f Another route to this result is provided by the
it T c l - ‘weak invisible hand theorem of natural monopoly’.
two firms actively producing each good. For in this laissez-faire rather than active regulation by
case Propositions and guarantee industrywide cost administrative or antitrust means. Small numbers of
efficiency and marginal-cost pricing of all products. large firms, vertical and even horizontal mergers
Here, two issues must be resolved: Does industry- and other arrangements which have traditionally
wide cost minimization require at least two been objects of suspicion of monopolistic power,
producers of each good? And if so, do sustainable are rendered harmless and perhaps even beneficial
configurations exist? by the presence of contestability.
The existence problem can be solved in a man- On the other hand, contestability theory does not
ner analogous to its solution in the case of single- lend support to the proposition that the unrestrained
product industries: by assuming that ray average market automatically solves all economic problems
costs remain at their minimum levels for output and that virtually all regulation and antitrust activity
vectors that lie (on each ray) between minimum entails unwarranted and costly intervention. The
efficient scale and twice minimum efficient scale. economy of reality is composed of industries which
And the presence of at least two producers (or one vary widely in the degree to which they
operating in the region where constant returns approximate the attributes of perfect contestability.
prevail) of each good is assured if the quantities Before the theory of contestability can be
demanded by the market at the relevant marginal- legitimately applied to reach a conclusion that
cost prices are no smaller than minimum efficient intervention is unwarranted in a specific sector, it
scale (along the relevant ray) and if the cost func- must first be shown that the sector ties unprotected
tion exhibits trans-ray convexity. by entry barriers and that the force of potential entry
therefore actively constrains the behaviour of
incumbent firms. This then becomes the appropriate
Policy Implications of PCMS first stage in an analysis of efficient government
policy towards an industry. Only where the
One of the principal lessons of the analysis of PCMs conditions of contestability are found to
is that monopoly does not necessarily entai I characterize the reality of an industry can there be
welfare losses. Rather, the ‘weak invisible hand th’ validity in applying the normative conclusions of
shows that under certain conditions sustainability contestability theory concerning the power of
and Ramsey optimality are consistent, so that the potential competition actually to enforce efficient
total of consumers’ and producers’ surpluses may behaviour on incumbents.
well be maximized (subject to the constraint that Even where contestability is absent in reality,
firms be self-supporting) in the equilibrium of a the formulation on efficient regulation can be use-
monopoly which operates in contestable markets. fully guided by the theory of contestable markets
Even stronger results follow from the discussed instead of the theory of perfectly competitive mar-
results that under certain conditions sustainability kets. The first-best lesson of the perfect competition
and a first-best solution are consistent in an model, calling for prices to be set equal to marginal
oligopoly with a small number of firms. When costs, has no doubt contributed to the common
minimization of industry cost requires that each regulatory ethos which equates price to some
good be produced by at least two firms, measure of cost. This doctrine has been used
sustainability requires any equilibrium to satisfy the frequently where it is completely inappropriate and
necessary conditions for a first-best allocation of without logical foundation, that is, in cases where
resources. Thus, in these cases, the invisible hand prices should be based on demand as well as cost
has the same power over oligopoly in perfectly considerations, because of the presence of
contestable markets that it exercises over a perfectly economies of scale and scope. Such arbitrary
competitive industry. measures as fully distributed costs cannot substitute
This theory suggests that in a market that for marginal cost measures as decision rales for
approximates perfect contestability, the general proper pricing, and the search for a substitute is a
public interest is well-served by a policy of remnant of inappropriate reliance on the
2176 Contingent Commodities
model of perfect competition for guidance in 8 plus the common cost of 4, less the contribution
regulation. towards that common cost from service A.
In contrast, contestability theory suggests cost Thus, the forces of idealized potential competi-
measures that are appropriate guideposts for reg- tion in perfectly contestable markets enforce cost
ulated pricing - incremental and stand-alone costs. constraints on prices, but prices remain sensitive to
The incremental cost of a given service is, of demands as well. Actual competition and potential
course, the increment in the total costs of the competition are effective if they constrain rates in
supplying firm when that service is added to its this way, and in such circumstances regulatory
product line. In perfectly contestable markets, the intervention is completely unwarranted. But if, in
price of a product will lie somewhere between its fact, market forces are not sufficiently strong, then
incremental and its stand-alone cost, just where it there may be a proper role for regulation of natural
falls in that range depending on the state of demand. monopoly, and the theoretical guidelines derived
One cannot legitimately infer that monopoly power from the workings of contestable markets are the
is exercised from data showing that prices do not appropriate ones to apply. That is, prices must be
exceed stand-alone costs, and stand-alone costs constrained to lie between incremental and stand-
constitute the proper cost- based ceilings upon alone costs. (This is the approach recently adopted
prices, preventing both cross-subsidization and the by the Interstate Commerce Commission to
exercise of monopoly power. A simple example determine maximum rates for US railroad services,
will show why this is so. and the method has already withstood appeals to the
First, suppose that a firm supplies two services, federal courts.)
A and B, which share no costs and that each costs
10 units a year to supply. The availability of See Also
effective potential competition would force reve-
nues from each service to equal 10 units a year. For ► F....
higher earnings would attract (profitable) entry, and
lower revenues would drive the supplier out of Bibliography
business. In this case, in which common costs are
absent, incremental and stand-alone costs are equal Baumol, W..L and R.D. Willig. 1986. Contestability:
to each other and to revenues, and the competitive Developments since the book. Oxford Economic Papers 38:
9-36.
and contestable benchmarks yield the same results.
Baumol, W.J., J.C. Panzar, and R.D. Willig. 1982. Contestable
Next, suppose instead that of the 20 unit total markets and the theory of industry structure. New York:
cost 4 are fixed and common to A and B, while 16 Harcourt Brace Jovanovich.
are variable, 8 of the 16 being attributable to A and Baumol, W.J., J.C. Panzar, and R.D. Willig. 1985. On the theory
of perfectly contestable markets. In New developments in
8 to B. If, because of demand conditions, at most
the analysis of market structure, ed. J. Stightz and F.
only a bit more than 8 can be generated from Mathewson. New York: Harcourt Brace.
consumers of A, then a firm operating and surviving
in contestable markets will earn a bit less than 12
from B. These prices lie between incremental costs
(8) and standalone costs (12), are mutually
advantageous to consumers of both services, and
will attract no entrants, even in the absence of any Contingent Commodities
entry barriers. In contrast, should the firm attempt
Zvi Safra
to raise the revenues obtained from B above the 12
unit stand-alone cost, it would lose its business to
competitors willing to charge less. Similarly, the
same fate would befall it in contestable markets if it
priced B in a way that earned more than Keywords
Contingent commodities; Convexity; General
equilibrium; Risk; Risk aversion; Uncertainty
Contingent Commodities 2177
JEL Classification
similar way. Note that, although this model deals
D5 with uncertainty, no concept of probabilities is
needed for its formal description.
The theory of general competitive equilibrium was To make things more explicit we look at a
originally developed for environments where no simple model with contingent commodities.
uncertainty prevailed. Everything was certain and Assume that, without referring to uncertain events,
phrases like ‘it might rain’ or ‘the weather might be there arc k > 1 commodities, indexed by i, and that
hot’ were outside the scope of the theory. The idea there are n > 1 mutually exclusive and jointly
of contingent commodity, that was introduced by exhaustive events (or states of nature ), indexed by
Arrow ( ) and further s, where k and n are finite. Thus a contingent
developed by Debreu ( ), was an ingenious commodity is denoted by x i s and the total number of
device that enabled the theory to be interpreted to these commodities is hi, which is greater than k but
cover the case of uncertainty about the availability still finite. Consumption and production sets are
of resources and about consumption and production thus defined as subsets of the foi-dimensional
possibilities. Basically, the idea of contingent Euclidean space, and the economic behaviour of
commodity is to add the environmental event in firms and consumers naturally follows from profit
which the commodity is made available to the other maximization (by firms) and utility maximization
specifications of the commodity. With no (by consumers). The price pIS of the contingent
uncertainty every commodity is specified by its commodity xis is the number of units of account that
physical characteristics and by the location and date have to be paid in order to have the zth commodity
of its availability. It is fairly clear, however, that being delivered at the vth event. It is assumed that
such a commodity can be considered to be quite the market is organized before the realization of the
different where two different environmental events possible events. Thus payment for the contingent
have been realized. The following examples clarify commodity xiy is done at the beginning while
this: an umbrella at a particular location and at a delivery takes place after the realization of events
given date in case of rain is clearly different from and only in case event s has occurred. Note that the
the same umbrella at the same location and date price of the (certain) rth commodity, that is, the
when there is no rain; some ice cream when the number of units of account that have to be paid in
weather is hot is clearly different from the same ice order to have the zth commodity for sure, is the sum
cream (and at the same location and date) when the over 5 of the prices pis. For example, assume that
weather is cold; finally, the economic role of wheat the price of one quart of ice cream if the weather is
with specified physical characteristics available at hot is $2.00, the price of one quart of the same ice
some location and date clearly depends on the cream if the weather is cold is $1.00 and that n = 2
precipitation during its growing season. Thus, (either it is hot or cold). Thus the price of having
specifying commodities by both the standard one quart of that ice cream for sure is $2.00 + $1.00
characteristics and the environmental events seems = $3.00.
very natural, whereas the role of the adjective in It should be noted that, although the probabili-
‘contingent commodities’ is simply to make it clear ties of the possible events do not explicitly enter the
that one is dealing with commodities the availability model, the attitude towards risk of both consumers
of which is contingent on the occurrence of some and producers is of interest and does play a signif-
environmental event. With this specification the icant role in this framework. The preference rela-
model with contingent commodities is very similar tions of consumers defined on subsets of the An-
to the classical model of general competitive dimensional Euclidean space reflect not only their
equilibrium and thus questions like the existence of ‘tastes’ but also their subjective beliefs about the
equilibrium and its optimality (with the additional likelihoods of different events as well as their
aspect of efficient allocation of risk bearing) are attitude towards risk. Convexity of consumers’
answered in a preferences, for example, is interpreted as risk aver-
sion while, in the same spirit, profit maximization
2178 Contingent Valuation
of firms is interpreted as risk neutrality. It should be reliable revealed preference data about demands
mentioned that both Arrow and Debreu basically for certain types of goods. A number of
assume expected utility maximizing behaviour, in significant lawsuits have promoted their use in
the sense of the Savage ( ) framework. A more estimating demand for environmental goods.
general approach to such preference relations can They are also used by transportation economists,
be found in Yaari ( ), where, again, convexity is health economists and market researchers.
taken to mean risk aversion. Although the degree of acceptance of these
A unified and more formal treatment of time methods varies, many economists agree that a
and uncertainty using contingent commodities can value based on stated preferences derived from
be found in Debreu ( , ch. 7). Radner carefully conceived and executed research is
( ) presents an extension of the above model almost certainly preferable to no number at all.
to the case in which different economic agents have
different information.
Keywords
See Also Conjoint analysis; Construct validity assess-
ments; Contingent valuation; Discrete- choice
► Arrow-Debreu Model of General Equ
►J models; Dummy variables; Environmental
economics; Expectations; Health economics;
Household surveys; Indirect utility functions;
► Uncertainty and General Equ
Logit models; Marginal utility of income;
Maximum likelihood; Neuroeconomics; Probit
Bibliography models; Random utility models; Revealed
Arrow, K.J. 1953. Le role de valeurs boursieres pour la repartition preference; Scope tests; Stated preference;
la meilleure des risques. Econometrie. Paris: CNRS. English Willingness to pay
translation ‘The role of securities in the optimal allocation of risk-
bearing’ in Review of Economic Studies (1964); reprinted in
K.J. Arrow, Essays in the theory of risk-bearing. Chicago:
Markham, 1971. Debreu, G. 1953. Une economic de Tincertain. JEL Classifications
Mimeo, Paris: Electricite de France. Q51
Debreu, G. 1959. Theory of value. New York: Wiley. Radner, R.
Most economists would agree that no researcher
1968. Competitive equilibrium under uncertainty. Econometrica
36: 31-58.
should prefer demand data from hypothetical mar-
Savage, L. J. 1954. Foundations of statistics. New York: Wiley.
kets if data concerning the identical goods or
Yaari, M.E. 1969. Some remarks on measures of risk aversion
services, based on real markets, are readily avail-
and their uses. Journal of Economic Theory 1:315-329.
able. However, there are many situations when even
the cleverest empirical economist cannot come up
with revealed preference data from actual markets
that can be relied upon for information about
household demands for some types of goods.
Environmental goods are one class of goods where
Contingent Valuation real-market demands sometimes cannot be
measured adequately. In the 1980s, environmental
Trudy Ann Cameron
economists began in earnest to exploit stated-
preference demand information, usually collected
using household surveys. This demand information
is used primarily to produce utility-theoretic
Abstract measures of the social benefits of environmental
‘Contingent valuation’ methods are used to protection measures for benefit-cost analyses.
generate demand data, usually from household
surveys, when real markets do not supply
Contingent Valuation 2179
Environmental economists called these methods least needed. The challenge for researchers is to
‘contingent valuation methods’ (CVM) because the ensure that demand information gathered using
valuations were elicited ‘contingent upon the CVM, in less-than-ideal contexts, is as valid and
conditions described in the survey’. Research that useful as possible.
focused on the development and assessment of Myriad biases and qualifications may afflict
CVM in environmental economics was well under poorly executed CVM studies. A partial list
way by the mid-1980s. However, two events in includes incentive compatibility, hypothetical bias
1989 thrust the method to the forefront of the field. (if the choice is perceived to have absolutely no real
First, the Exxon Valdez, an ocean tanker, ran consequences), strategic bias (when people try to
aground in Prince William Sound in Alaska, spilling manipulate the outcome by misrepresenting their
11 million gallons of oil in an environmental preferences), non-response bias (since people
disaster that attracted a huge amount of media cannot be compelled to participate), starting-point
attention worldwide. Second, just a few months bias (for surveys with iterative bids), interviewer
later, the US Court of Appeals held that the bias (for in-person surveys), and information bias
economic damages assessed for spills of oil or other (when some portion of the value is constructed
hazardous substances could include ‘lost passive use during the survey where it did not exist before).
values’, and that these values could legally be Other problems include yea-saying, part-whole bias
measured via CVM. or embedding, scenario rejection, and the potential
Plaintiffs and defendants in the Exxon Valdez for respondents not to pay sufficient attention to
case thus had a big incentive to advocate and their real budget constraints.
derogate CVM, respectively. For at least a time, the Choice formats have been an important issue in
discussion in the literature teetered on the brink of the development of CVM. For example, in some
losing its polite academic tone. Given the escalation early applications of CVM survey respondents were
of the controversy over CVM, the US National asked directly to identify the single highest dollar
Oceanic and Atmospheric Administration (NOAA) amount that they would pay to obtain some change
convened a panel of experts (untainted by any active in conditions. These were called open- ended CV
role in the Exxon Valdez litigation) to assess CVM. questions. Researchers quickly realized that such a
This exercise, by Arrow et al. ( ), produced a task was difficult for consumers who were
set of pronounce unfamiliar with naming their own price, especially
ments concerning best practices for the conduct of for goods they may never before have thought much
CVM studies. While the 1993 NOAA Panel report about having to pay for. CVM elicitation techniques
cannot be considered the last word on CVM, it was evolved fairly quickly to a dichotomous-choice
very influential, and there has since been strong format, where respondents are given a choice
pressure on researchers either to conform to the between two states of the world. One state is
NOAA best practices or to fully justify any typically the status quo, while the other involves a
departures from them. specified change or set of changes (such as an
As a result of the Exxon Valdez case, much improvement in environmental quality, or some
doubt about the reliability of stated preference data other rationed public good) that come at a price
led to numerous comparisons of the implications of (typically a lump-sum payment). This binary choice
stated and revealed preference data (for example, format was found to fit naturally into a random
Carson et al. ). CVM works best when respondents utility model (RUM) framework that had also
have a clear sense of the consequences of their become a popular approach to consumer choice
choices - in terms of both their own budgets and the problems, both real and hypothetical, in the
exact nature of the good that they are being asked to transportation mode- choice literature and
consider paying for - and when they are reasonably elsewhere in economics.
familiar with market transactions involving that Respondents’ preferences, based on then-
good. This means that CVM is, unfortunately, most answers to dichotomous-choice CV questions,
successful when it is
2180 Contingent Valuation
can be characterized either in an ad hoc fashion or additively separable in net income under each
in a more formal utility-theoretic framework. One alternative.
standard approach is to specify an indirect utility Linearity and additive separability' in income is
function shared by all respondents. In its simplest convenient (w'hen warranted) because the willing-
form, the level of indirect utility' is assumed to ness to pay (WTP) function associated with the
depend on the individual’s net income under each of fitted model is given by the marginal rate of sub-
the two alternatives, and upon a discrete indicator of stitution (MRS) between the programme and
whether there is a change in the rationed public income. This MRS is given by the ratio of the
good, or no change, under each alternative. marginal utility of the programme to the marginal
Respondents can choose the environmental utility of income, producing a result that can be
improvement programme along with its associated expressed in dollars per 'unit' of the programme,
cost (implying lower net income), or decline the where the program indicator is either zero or 1 in
environmental improvement programme in order to the simple binary case. In the non-stochastic case,
avoid the cost (preserving their net income). If a for a simple dichotomous choice CVM model, this
respondent prefers the programme with its is a single number - 'WTP for the program’ - if the
associated cost, the researcher assumes that the researcher has assumed homogeneous preferences
respondent's utility' level is higher under that throughout the sample.
alternative. Equivalently, this means that the net Some extra empirical housekeeping is necessary
indirect utility' associated with the programme w'hen it is acknowledged that this point estimate is
alternative is positive. constructed as the ratio of two estimated quantities,
A discrete-choice econometric model, typically each of which (due to the use of maximum
involving a binary logit estimator, is used to likelihood estimation methods for the logit or probit
estimate the sample average marginal utilities of model) is an asymptotically normally distributed
(</) net income and (b) the discrete bundle of random variable. In theoretical terns, the ratio of
changes represented by the programme in question. two normally distributed random variables has an
It is of course possible to allow for heterogeneity undefined mean, because zero is a possible value
across the sample in these marginal utilities. Most for the denominator. As a practical matter, some
often, heterogeneity is introduced by allowing the researchers use simulation methods to build up a
otherwise scalar marginal utility associated with sampling distribution for the estimated WTP. It is
going from ‘no programme’ to ‘programme’ to possible to use packaged soft- ware to make a large
become a systematically varying parameter. Of number of random draws from the joint distribution
course, if the identical programme is offered to all of the logit or probit parameters (based on the
respondents, it is not possible to allow this marginal estimated parameter point estimates and the
utility to vary with attributes of the programme. parameter variance- covariance matrix). One can
However, it can easily be allowed to vary with then build up a sampling distribution for the needed
characteristics of the respondent. ratio. Other strategies for dealing with this
Less commonly, the marginal utility of income inconvenience involve estimating the model in
is also allowed to vary across respondents, either ‘WTP-space’ rather than ‘utility-space’ or
with the respondent’s income (to allow' for employing the new'er mixed logit (random-
diminishing marginal utility' of income) or with parameters logit models) and stipulating that the
other respondent characteristics. However, there is a marginal utility of income parameter be distributed
premium on simplicity for the marginal utility of lognormal (since it should be strictly positive, on
income, stemming from the need to use the average), rather than normal, so that the potential
estimated marginal utility' of income parameter (s) divide-by-zero problem goes away.
to recover demand information. For this reason, Over the 1990s contingent valuation researchers
many researchers will, if it is justified by the data, in environmental economics gradually made better
prefer a choice model that is linear and contact with their counterparts working in other
literatures who w'ere confronted
Contingent Valuation 2181
by similar problems where there is a lack of market CVM with the types of choices that are most
data for products or public goods that need to be common in health economics contexts.
valued. In the transportation literature, researchers In the transportation and marketing literatures,
had grappled early on with the problem of the desired demand information often spanned a
forecasting demand for public transportation number of possible alternative products or services.
projects, or new types of vehicles, that did not yet Stated preference studies were often conducted not
exist. Researchers began to introduce hypothetical just to determine respondents’ willingness to pay
new transportation options which could be for a single well-defined good but to understand
characterized in the same terms as existing options how willingness to pay might be affected by
(in ‘attribute space’) but which had some attributes variations in the mix of attributes making up a
that lay well outside the set of existing options on prospective good. It was often necessary to
some dimensions, or which involved attributes that anticipate demands for differentiated products,
were not relevant for existing options (such as travel where each product could be characterized as a
range or recharge time for prospective electric bundle of attributes and the levels of these attributes
vehicles). One key difference from contingent differed across alternatives.
valuation was the practice of asking survey In contrast, more of the impetus for CVM non-
respondents to consider more than just ‘the status market valuation research in environmental
quo versus a single alternative’. Furthermore, the economics stemmed from a number of significant
alternatives were more richly specified. Instead of lawsuits. In the legal context, there is a premium on
using simply a dummy variable to indicate whether simplicity in economic modelling so as not to
the policy, programme or public good was present confuse the jury or the judge. It is often best to
or absent, each alternative was characterized in produce one value for one clearly defined com-
terms of an array of attributes. modity. (Providing a judge or jury with a function
Similar problems were also being addressed in that describes demand, where WTP depends upon a
the marketing literature, particularly in the context wide array of attributes, conditions or respondent
of ‘pre-test’ marketing. Companies considering attributes, can actually be a liability when attorneys
whether to develop and introduce new products are trying to make a simple, clear and persuasive
needed to know in advance about the likely demand case. In a legal context, it is most incisive to value
for these products, perhaps as a function of one thing, and to value it as precisely as possible.)
alternative possible product configurations. Market Eventually, however, environmental economists
researchers developed a set of techniques they began to acknowledge the value of understanding
called ‘conjoint analysis’. In the marketing the heterogeneity in demands for environmental
literature, the specifications used for the choice goods, since this knowledge can be very helpful to
models were initially very ad hoc. Little attention policymakers who wish to consider how different
was paid to the interpretation of the estimated versions of a policy might affect different
coefficients as marginal utilities, and simple linear constituencies.
and additively separable specifications were very There are many commonalities between the
common. The slope coefficients were known as tasks faced by environmental economists and those
‘part-worths’ rather than marginal utilities. How- faced by transportation economists and market
ever, much was learned about the degree of con- researchers, but there is one key difference. In
sistency between planned purchase behaviour and transportation economics and market research, it is
actual purchase behaviour. often the case that the public transit system in
CVM has also recently grown in popularity in question will actually be built, or the new product
other sub-disciplines, notably health economics. will actually be developed and put on the market.
However, Smith ( ) surveys that literature and There is an opportunity to go back and see whether
suggests that researchers in that field have not yet the level of demand predicted by the stated
developed a set of best practices for the use of preference study actually materializes when there is
a real market. In the environmental
2182 Contingent Valuation
economics literature, there are typically fewer that can be collected. In fact, the need for econo-
opportunities to ‘validate’ the stated preference mists to rely upon survey data (what people say as
demand information with revealed preferences for opposed to what they actually do in markets) is now
the same product. being acknowledged in the other contexts in
One common expectation for a good CVM study economics. For example, expectations about future
is now a demonstration that the demand function income or life expectancy figure prominently in a
that has been estimated should ‘walk and talk’ like a number of economic theories. These expectations
demand function. For example, is willingness to pay typically cannot be measured directly, but can
to preserve big-game hunting opportunities lower, sometimes be elicited using surveys and put to good
on average, for elderly women than for middle-aged use empirically (see Manski )•
males? Is willingness to pay to preserve air quality It is worth noting that not just stated preference
higher for people with lung disease or asthma, or data but also revealed preference data can be highly
for people who have family members with these variable in its quality. Much revealed preference
illnesses? These tests are commonly called demand data is also drawn from consumer surveys.
‘construct validity’ assessments. Contingent It is not always clear that the individual respondent
valuation studies that pass a battery of plausibility sees the need for accuracy and completeness to be
tests such as these can generally be viewed as more as critical as researchers using the data might hope.
reliable. In consumer expenditure surveys, for example,
Another common test of contingent valuation interviewers prompt subjects for different types of
estimates that these stated preference demand expenditures, but the enthusiasm and engagement of
functions are typically expected to satisfy is some- the survey subject often determines the acciuacy
thing called a ‘scope test’. This means that, on and completeness of the data. Rather than viewing
average, respondents’ willingness to pay for an revealed preference data as of unambiguously high
alternative that involves more of the ‘good’ in quality and stated preference data (such as that
question should be greater than that for an alter- produced by CVM) as of unambiguously low
native that involves less of the ‘good’. It is of quality, it may be prudent simply to acknowledge
course possible that marginal utility may be positive that both types of data can have their problems.
(as the scope test implies) at low levels of the good, A partial list of current frontiers in CVM-related
but also that it may go to zero if the quantity of the research is possible. These frontiers include
good is high enough for satiation to set in, and there continuing assessment of (a) alternative elicitation
is no theoretical basis for expecting willingness to formats (there are many candidates beyond the
pay to be proportional to the amount of the good in simple NOAA-recommended binary choice format),
question. (b) the choice contexts presented to subjects, (c) the
CVM data can also sometimes be pooled with effects of allowing subjects to express uncertainty
actual choice data. This can allow portions of the about their choices, (d) the effects of practice and
underlying indirect utility function to be anchored fatigue when several CVM questions are presented
upon real choices, even though the variability in to each respondent, (<?) integrating stated choices
attributes in the real alternatives may not span the with additional types of real market information, (/)
full domain relevant to pending policy decisions. how the degree of complexity of the CVM choice
The stated choice questions can be used to extend scenarios interacts with the cognitive capacity of the
the domain of the estimated demand function. subject and/or the subject’s inclination to pay
While economists will remain uncomfortable attention, and (g) the neuroeconomics of real as
about reliance upon stated preference information, opposed to stated choices.
many now acknowledge that there are cir-
cumstances where stated preference data are all
Contingent Valuation 2183
Two of the classic books on CVM are Cum- zero would otherwise be imputed, by default, for
mings et al. ( ) and Mitchell and Carson use in policy decisions.
( ). Following the Exxon Valdez case, a pro
vocative debate was featured in the Journal of
Economic Perspectives (Diamond and Hausman ; See Also
Hanemann ; Portney ). McFad- den ( ) raised some
specific concerns about ► Environmen
the reliability of CVM data in the context of an
empirical application to the existence value of
wilderness areas in the western United States. In the Bibliography
intervening years, however, research concerning
Arrow, K.., R. Solow, P.R. Portney, E.E. Learner, R. Radner, and
CVM has continued apace. Helpfiil expositions and
H. Schuman. 1993. Report of the NOAA panel on contingent
discussions of recent innovations have made their valuation. Federal Register 58:4601-4614.
way into textbook form, with one particularly useful Boyle, K..J. 2003. Contingent valuation in practice. In A primer
summary being provided in Chapter 6 of Freeman ( on nonmarket valuation, ed. P.A. Champ, K..I. Boyle,
andT.C. Brown. Boston: KJuwer Academic Publishers.
). A brief, accessible
Carson, R.T. 2000. Contingent valuation: A user’s guide.
and very helpful introduction to CVM for non- Environmental Science & Technology 34: 1413-1418.
specialists is contained in Carson ( ). Carson, R.T., N.E. Flores, K.M. Martin, and J.L. Wright. 1996.
Louviere et al. ( ) offer a comprehensive dis Contingent valuation and revealed preference methodologies:
Comparing the estimates for quasipublic goods. Land
cussion of stated choice methods broadly defined,
Economics 72: 80-99.
including experimental design, modelling, esti- Cummings, R.G., D.S. Brookshire, and W.D. Schulze, eds. 1986.
mation and combining revealed and stated prefer- Valuing environmental goods: An assessment of the
ence data, with illustrations in marketing, contingent valuation method. Totowa: Rowman and
transportation and environmental economics. An Allanheld.
Diamond, P.A., and J.A. Hausman. 1994. Contingent valuation -
inventory of the wide range of practical issues to Is some number better than no number? Journal of
consider in actually implementing a C VM study is Economic Perspectives 8(4): 45-64.
provided by Boyle ( ), while Holmes and Freeman, A.M.I. 2003. The Measurement of Environmental
Adamowicz ( ) update the state of the art for and Resource Values: Theory and Methods. Washington,
DC: Resources for the Future.
attribute-based (conjoint choice) methods. Hanemann, W.M. 1994. Valuing the environment through
There is still considerable variation in individual contingent valuation. Journal of Economic Perspectives
researchers’ levels of comfort with CVM and stated 8(4): 19-43.
preference data more generally. We might Holmes, T.P., and W.L. Adamowicz. 2003. Attribute-based
methods. In A primer on nonmarket valuation, ed. P.A.
reconsider the question posed by Diamond and Champ, K.J. Boyle, and T.C. Brown. Boston: Kluwer
Hausman ( ): ‘ Contingent valuation - is some Academic Publishers.
number better than no number?’ There are now Louviere, J.J., D.A. Hensher, and J.D. Swait 2000. Stated choice
many economists who would agree that a value methods. New York: Cambridge University Press.
Manski, C.F. 2004. Measuring expectations. Econometrica 72:
based on stated preferences - from a study that is 1329-1376.
carefully conceived and executed, based on a suf- McFadden, D. 1994. Contingent valuation and social choice.
ficiently large sample that is representative of its American Journal of Agricultural Economics 76: 689-708.
intended population, that has been put through a Mitchell, R.C., and R.T. Carson. 1989. Using surveys to value
public goods: The contingent valuation method.
battery of consistency and validity assessments, and
Washington, DC: Resources for the Future.
that produces an implied demand function that Portney, P.R. 1994. The contingent valuation debate-Why
behaves the way we would expect a ‘real’ demand economists should care. Journal of Economic Perspectives
function to behave - is almost certainly better than U4): 3 17.
no number. This is especially true when ‘no
number’ creates the risk that a value of
2184 Continuity in Economic History
Smith, R.D. 2003. Construction ofthe contingent valuation market from which the great oak of the industrial revolution
in health care: A critical assessment. Health Economics 12:
grew ‘is a bit like studying the history of Jewish
609-628.
dissenters between 50 BC and 50 AD. What we are
looking at is the inception of something which was
at first insignificant and even bizarre’, though
‘destined to change the life of every man and
Continuity in Economic History woman in the West’.
What is destined or not destined to change our
Donald N. McCloskey fives will look rather different to each of us. Each
historian therefore has his or her own dating of the
industrial revolution. Each sees another disconti-
nuity. E.M. Carus-Wilson ( , p. 41) spoke of
Continuity and discontinuity are devices of story- ‘an industrial revolution of the 13th century’: she
telling, telling the story of monetary policy over the found that the fulling mill was ‘due to scientific
past few months or the story of modem economic discoveries and changes in technique’ and ‘was
growth. They raise certain questions in philosophy destined to alter the face of medieval England’.
and lesser matters, such as precedence and politics. A.C. Bridbury ( , p. xix-xx) found in the late
It is well to have a case in mind. The most middle ages ‘a country travelling slowly along the
important is that of the British industrial road ... that [it] travelled so very much more quickly
revolution. in Adam Smith’s day’. In the eyes of Marxist
If it was a ‘revolution’, as it surely was, it writers the 16th century was the century of
happened sometime. There was a discontinuity, a discontinuity, when capitalism set off into the world
before and after. When? Various dates have been to seek its fortune. John U. Nef, no Marxist,
proposed, down to the day and year: 9 March 1776, believed he saw an industrial revolution in the 16th
when the Wealth of Nations provided an ideology century, centred on coal ( ), though
for the age; the five months in 1769 when Watt admittedly slowed in the 17th century. A student of
took out a patent on the high pressure steam engine the 17th century itself, such as D.C. Coleman ( ),
and Arkwright on the cottonspinning water frame; finds glimmerings of economic growth
or 1 January 1760, when the furnaces at Carron even in that disordered age. The most widely
Ironworks, Stirlingshire, were lit. accepted period for the industrial revolution is the
Such dating has of course an amateur air. A late 18th century, especially the 1760s and 1770s
definite date looks handsome on a plaque or scroll (Mantoux ; Landes ), but recent students of the
but the precision does not fit well with sophisticated matter (Harley ; Crafts ) have found much to admire
story-telling. The discontinuity is implausibly in the accomplishments of the early 18th century.
sharp, drawing attention to minor details. The Great W.W. Rostow ( ) placed the ‘takeoff into self-
Depression did not start on 24 October 1929; the sustained
deregulation of American banking was not growth’ in the last two decades of the 18th century,
completed with the fall of Regulation Q. Nicholas but others have observed that even by 1850 the
Crafts ( ) has pointed out majority of British people remained in traditional
that the detailed timing of the industrial revolution sectors of the economy. And later still there was a
should not anyway be the thing to be studied, second industrial revolution (of chemicals,
because small beginnings do not come labelled with electricity, and internal combustion) and a third (of
their probabilities of developing into great electronics and biology).
revolutions. He is identifying a pitfall in story- Wider perspectives are possible, encouraging
telling. Joel Mokyr identifies another ( p. 44): the observer to see continuity instead. Looking at
rummaging among the possible acorns the matter from 1907, the American historian Henry
Adams could see a ‘movement from unity into
multiplicity, between 1200 and 1900, ...
Continuity in Economic History 2185
unbroken in sequence, and rapid in acceleration’ (p. change to an absence of change. Tme, if you squint
498). The principal modem student of the industrial and fit a curve then no economic change looks
revolution, R.M. Hartwell, appealed for continuity discontinuous in the mathematical sense; but it is
against the jostling throng of dates ( , p. 78): ‘Do wrong then to deduce that ‘really’ there is no
we need an explanation of the change at all, or that the industrial revolution is a
industrial revolution? Could it not be the culmi- mirage. ‘Continuity’ in the strict mathematical
nation of a most unspectacular process, the con- sense must be kept distinct from ‘continuity’ in the
sequence of a long period of economic growth?’ story-telling sense.
Such questions of continuity and discontinuity Economists have often been muddled about this
are asked widely in economics, though sometimes philosophical distinction, drawing surprising
half consciously. They should not be left to histo- ideological implications from it. Alfred Marshall
rians. Economics is mainly contemporary history, enshrined on the title page of his Principles the
and faces the problem of deciding when a piece of motto ‘natura non facit saltum’ (nature does not
history has been continuous or not. For instance the make a jump; Leibnitz had invented it as ‘la nature
cmcial discontinuity in the growth of big gov- ne fait jamais des sauts’). Marshall himself perhaps
ernment, as Robert Higgs ( ) points out, might believed that the ability to represent behaviour with
be placed when the institutions of centralized inter- differentiable functions implies that marginalism is
vention were conceived (1900-1918) or made a good description of human behaviour. It is less
(1930^45) or expanded (1960-70). Even recent sure that he believed that the lack of jumps in nature
history faces this narrative problem. When, if ever, (this on the eve of quantum physics) implies people
did purchasing power parity break down in the should not jump either, and should change society
1970s? When did policy on antitrust alter to favour only gradually. Anyway, both implications are non
mergers? When did monetary policy last become sequiturs. Though both have been attributed to
expansionary? Where is the break? neoclassical economics, neither is necessary for it.
The difficulty in answering the question has Much bitter controversy has assumed that
often been misconstrued as philosophical. The neoclassical economics depends on smooth curves
philosophical difficulty was first articulated in the and in consequence must advocate smooth social
5th century BC by Parmenides and his student policies. The peculiar alliance between discrete
Zeno: that if everything is perfectly continuous, mathematics and Marxian economics has this
change is impossible (Komer ). Everything is so to origin, as does the enthusiasm of some conservative
speak packed too tightly to move. The economist writers for continuities in economic history.
will recogn ize the poi nt as analogous to an Gerschenkron cursed both their houses; the social
extreme form of economic equilibrium, or to the scientist should study change and continuity
physicist’s maximum entropy. If human nature ‘unbothered by the lovers and haters of revolutions
doesn’t ‘really’ change, then history will be a string who must find themselves playgrounds and
of weary announcements that the more things battlegrounds outside the area of serious
change the more they stay the same. If the economy scholarship’ (p. 39).
is ‘really’ in equilibrium all the time, then nothing In one sense of ‘continuity’ it is trivial that
remains to be done. economic history is continuous. History has causes
Alexander Gerschenkron, the economic histo- (the fourth of five historically relevant definitions
rian who has contributed most to the understanding that Gerschenkron distinguishes). Continuity, then,
of continuity and discontinuity in economics, noted can be viewed as being merely an impressively long
that such a metaphysics would close the book of causal chain. The exploitation of Scottish iron
history ( , p. 12). A history of eco deposits in the 18th century was caused by bold
nomics that began with the Parmenidean continuum investments, but these depended on a reliable law of
would never speak. property and commerce, which depended on certain
For purposes of social science Gerschenkron legal developments in the 16th century, and on the
rejects the transition from the connectedness of all growth of
2186 Continuity in Economic History
political stability in the early 18th century, which in which in principle is closely related to the concept
turn depended on all manner of earlier events. of the great spurt as developed by this writer'.
Establishing continuities, as Gerschenkron remarks, The point is a good one, and applies to all
is the historian’s purpose - or, one might add, the questions of continuity in aggregate economics.
economist’s, who is doing historian’s work when he Small (and exciting) beginnings will be hidden by
is not doing philosopher’s. The purpose might be to the mass until well after they have become routine.
find a cause of, say, the Great Depression. It would Joel Mokyr has put it as a matter of arithmetic: if
be to find a chain of events the absence of which the traditional sector of an economy is growing at a
would have made a difference: the international slow one per cent per annum, and starts with 90 per
irresponsibility of the United States, for instance, as cent of output, the modem sector growing at four
Kindleberger argued; or the domestic per cent per annum will take three-quarters of a
irresponsibility of the Federal Reserve, as Friedman century to account for as much as half of output (
and Schwartz argued. Finding such chains has its , p. 5). We may call
own philosophical difficulties (see the article in this it the Weighting Theorem (or the Waiting Theorem,
Dictionary on “► )”. for the wait is long when the weight is small to
The main problems of continuity and disconti- begin with). There are parallel points to be made
nuity, however, are not solvable in seminars on elsewhere in economics and in social science
philosophy. They are practical problems in the uses generally. In growth theory, for instance, as was
of measiuement, and must be solved in the noticed shortly after its birth, a century of
economic or historical workshop. When shall we theoretical time is needed in most models for a shift
say that the industrial revolution happened? to yield growth as much as 90 per cent of its steady
Gerschenkron gives an answer confined to industry, state. More generally, economists have long
for in common with most economic historians he recognized the tension between microeconomic
regards agriculture and services as laggards in explanations and the macroeconomic things to be
economic growth. explained. And sociologists have been quarrelling
In a number of major countries of Europe ... after a along similar lines for a century, using even the
lengthy period of fairly low rates of growth came a
same jargon of micro and macro.
moment of more or less sudden increase in the rates,
which then remained at the accelerated level for a In other words, the search for discontinuity in an
considerable period. That was the period of the great aggregate time series raises the question of the level
spurt in the respective countries’ industrial at which we should do our social thinking, the
development The rates and the margin between
aggregation problem. Yet Gerschenkron himself did
them in the ‘pre-kink’ and the ‘post-kink’ periods appear
to vary depending on the degree of relative not answer the question well, and was hoist by his
backwardness of the country at the time of the own petard. Calculating Italian industrial output he
acceleration, (pp. 33-4) placed his ‘big spurt’ in 1896-1908, and wished to
The level at which such discontinuity is to be explain it with big banks founded in the 1890s.
observed is at choice. As Gerschenkron remarks, Stefano Fenoaltea, once his student, applied the
Weighting Theorem to the case (Fenoaltea ). Surely,
If the seat of the great spurt lies in the area of
Fenoaltea reasoned, the components of the
manufacturing, it would be inept to try to locate the
discontinuity by scrutinizing data on large industrial index - the steel output and the chemical
aggregate magnitudes such as national income_________ output - are the ‘real’ units of economic analysis
By the time industry has become bulky enough to affect (note the similarity of this rhetoric to that advo-
the larger aggregate, the exciting period of the great spurt cating a micro foundation for macroeconomics). If
may well be over. (pp. 34—5)
the components started accelerating before the new
In a footnote to these sentences he remarks that banks appeared, becoming bulky only later, then the
‘Walt Rostow’s failure to appreciate this point has new banks could not have been the initiating force.
detracted greatly from his concept of the take-off, Alas, the components did just this. They spoil
Gerschenkron’s bank-led story: the
Continuous and Discrete Time Models 2187
components accelerated not in the 1890s but in the 39-60. Reprinted in Essays in economic history, vol. I, ed.
1880s, not after but before the banks. To paraphrase E.M. Cams-Wilson. London: Edward Arnold, 1954.
Coleman, D.C. 1977. The economy of England 1450-1750.
Gerschenkron on Rostow, by the time the Oxford: Oxford University Press.
progressive components of industry had become Crafts, N.F.R. 1977. Industrial revolution in England and France:
bulky enough to affect the larger aggregate, the Some thoughts on the question ‘Why was England first?’.
exciting period was well over. Economic History Review, 2nd series 30(3): 429-441.
Crafts, N.F.R. 1984. Economic growth during the British
Yet the moral is still Gerschenkron’s: that con- industrial revolution. Oxford: Oxford University Press.
tinuity and discontinuity are tools ‘forged by the Fenoaltea, S. 1987. Italian industrial production, 1861-1913: A
historian rather than something inherently and statistical reconstruction. Cambridge: Cambridge
invariantly contained in the historical matter .... [A]t University Press.
Gerschenkron, A. 1962. On the concept of continuity in history.
all times it is the ordering hand of the historian that Proceedings of the American Philosophical Society June.
creates continuities or discontinuities’ (p. 38). Reprinted in A. Gerschenkron, Continuity in history and
Gerschenkron nodded, but in nodding made the other essays. Cambridge, MA: Harvard University Press,
point. The multiple datings of the industrial 1968.
Harley, C.K. 1982. British industrialization before 1841:
revolution make it, too. So does any choice of
Evidence of slower growth during the industrial revolution.
smoothness or suddenness in economic storytelling. Journal of Economic Histoiy 42(2): 267-290.
The point is that history, like economics, is a Hartwell, R.M. 1965. The causes of the industrial revolution: an
story we tell. Continuity and discontinuity are essay in methodology. Economic History Review, 2nd
series 18: 164-182. Reprinted in The causes of the
narrative devices, to be chosen for their storytelling
industrial revolution in England, ed. R.M. Hartwell.
virtues. Niels Bohr said once that ‘It is wrong to London: Methuen, 1967.
think that the task of physics is to find out how Higgs, R. 1987. Crisis and Leviathan: Critical episodes in the
nature is. Physics concerns what we can say about growth of American government. New York: Oxford
University Press.
nature’. It is our say. We can choose to emphasize
Komer, S. 1967. Continuity. In The encyclopedia of philosophy.
the continuous: ‘Abraham begat Isaac; New York: Macmillan and Free Press.
... begat... begat... and Jacob begat Joseph the Landes, D.S. 1969. The unboundprometheus. Cambridge:
husband of Mary, of whom was bom Jesus’. Or the Cambridge University Press.
Mantoux, P. 1928. The industrial revolution in the eighteenth
discontinuous: ‘There was in the days of Herod, the
century. New York: Harper, 1961.
king of Judea, a certain priest named Zacharias.’ It Mokyr, J. (ed.). 1985. The economics of the industrial
is the same story, but its continuity or discontinuity revolution. Totowa: Rowman and Allanheld.
is our creation, not God’s. That it is out of God’s Nef, J.U. 1932. The rise of the British coal industry, 2 vols.
London: Routledge.
hands does not make it arbitrary. Scholars speak of
Rostow, W.W. 1960. The stages of economic growth.
the industrial revolution as early or late, gradual or Cambridge: Cambridge University Press.
sudden. Other scholars believe or disbelieve their
stories on the usual grounds.
See Also
connections exist between results with discrete to justify economic theories built on a discrete time
time data and the underlying continuous time scale. But to say that there are elements of discrete
dynamics, it is possible for discretization to delay or time discontinuity in behavioiu does not
create bias or have unintuitive effects. Some imply that discrete time models are appropriate. A
economics literature investigates such distor- model built in continuous time can include discrete
tions. It is also possible to estimate explicitly delays and discontinuities. Only if all delays were
continuous-time models, using discrete data. discrete multiples of a single underlying time unit,
This approach raises its own difficulties, but has and synchronized across agents in the economy,
become more usable as computing power and would modelling with a discrete time unit be
the techniques to exploit it have improved. appropriate.
Nonetheless, sometimes discrete models can
Keywords avoid extraneous mathematical complexity at little
Approximation theory; Continuous and discrete cost in approximation error. It is easy enough to
time models; Distributed lags; Dynamic argue that time is in fact continuous and to show
stochastic general equilibrium models; Granger that there are in principle cases where use of
causal priority; Markov chain Monte Carlo discrete time models can lead to error. But it is also
methods; Martingales; No-arbitrage models; true in practice that more often than not discrete
Stochastic differential equations; Vector time models, translated intuitively and informally to
autoregressions; Wiener process give implications for the real continuous time world,
are not seriously misleading. The analytical task,
still not fully executed in the literature, is to
JEL Classifications understand why discrete modelling usually is
C3 adequate and thereby to understand the special
circumstances under which it can be misleading.
Discrete time models are generally only an The basis for the usual presumption is that,
approximation, and the error induced by this when the time unit is small relative to the rate at
approximation can under some conditions be which variables in a model vary, discrete time
important. models can ordinarily provide good approximations
Most economists recognize that the use of dis- to continuous time models. Consider the case,
crete time is only as an approximation, but assume examined in detail in Geweke ( ), of a
(usually implicitly) that the error of approximation dynamic multivariate distributed leg regression
involved is trivially small relative to the other sorts model, in discrete time.
of simplification and approximation inherent in
economic theorizing. We consider below first the Y(t)=A*X(t) + U(t), (1 )
conditions under which this convenient assumption
may be seriously misleading. We discuss briefly
how to proceed when the assumption fails and the where * stands for convolution, so that
state of continuous time economic theory. OO
A*X{t)= Y, A(s)X(t-s). (2 )
s=—OO
Approximation Theory
We specify that the disturbances are
Some economic behavioiu does involve discrete uncorrelated with the independent variable vector X,
delays, and most calculated adjustments in indi- that is, cov[AT/j, U(s)] = 0, all t, s. The natural
vidual patterns of behavioiu seem to occiu fol- assumption is that, if approximation error from use
lowing isolated periods of reflection, rather than of discrete time is to be small, A(s) must be
continually. These notions are sometimes invoked
Continuous and Discrete Time Models 2189
smooth as a function of s, and that in this case ( ) is This is the precise sense in which the intuition
a good approximation to a model of the form that discrete approximation does not matter much is
correct. But there are important limitations on the
y(t) = a* x(t) + u{t) (3) result. Most obviously, the result depends on a in ( )
being an ordinary function. In continuous time,
where well-behaved distributed lag relations like ( ) are not
the only possible dynamic relation between two
»oo
series. For example, if one replaces ( ) by
a* x(t) a(s)x(t
—OO
— s)ds (4)
higher than firstorder derivatives (for example, if Granger causal priority in large samples, if signif-
they are generated by a third-order stochastic dif- icance tests were applied naively.
ferential equation) the lag distributions in ( ) and ( ) Geweke’s exploration of multivariate models
are replaced by still higherorder limiting forms. shows that the possibilities for confusing results are
Thus, different continuous time processes for x more numerous and subtle in that case. In
which all imply differentiable time paths produce particular, there are ways by which poor approx-
different limiting discrete A. Here the fact that the imation of otj{s) by Aj(s/T) in some s interval (for
time unit becomes small relative to the rate of example, around 5 = 0) can lead to contamination of
variation in x does not justify the assumption that the estimates of other elements of the A matrix,
approximation of continuous by discrete models is even though they correspond to xjs and QjS that in a
innocuous. In particular, the notion that discrete univariate model would not raise difficulties.
differencing can approximate derivatives is In estimation of a dynamic prediction model for
potentially misleading. a single vector y, such as a vector autoregression
It should not be surprising that the discrete time (VAR) or dynamic stochastic general equilibrium
models may not do well in approximating a con- model (DSGE), the question for approximation
tinuous time model in which derivatives appear. theory becomes whether the continuous time
Nonetheless, empirical and theoretical work which dynamics for y, summarized in a Wold moving
ignores this point is surprisingly common. average representation
If a is an ordinary function, there is still chance
for error despite Geweke’s result. His result implies y(t) = a*u(t) ( 9)
only that the mean square deviation of a from A is
small. This does not require that individual A(t/x)’s has an intuitively transparented connection to the
converge to the corresponding aft) values. For corresponding discrete time Wold representation
example, in a model where x is univariate and a(/) =
0,t < 0, a(0) = 1, a(s) continuous on [0,oo], the Y(t)=A*U(t). ( 10 )
limiting value for A(0) is 0.5, not 1.0. Thus, \fa(t) e
"" on [0, oo), making a monotone decreasing over In discrete time the U(t) of the Wold repre-
that range, A(t) will not be monotone decreasing. It sentation is the one-step-ahead prediction error, and
will instead rise between i 0 and / I. This is not in continuous time u(t) also represents new
unreasonable on reflection: the discrete lag information about y arriving at t. There are two
distribution gives a value at i 0 which averages the related sub-questions. Is the A function the same
continuous time distribution’s behaviour on either shape as the a function; and is the U vector related
side of t 0. It should therefore not be surprising that in a natural way to the u vector? The u vector is a
monotonicity of a does not necessarily imply continuous time white noise, so that U cannot
monotonicity of A, but the point is ignored in some possibly be a simple discrete sampling of u.
economic research. If y is stationary and has an autoregressive
Another example of possible confusion arises representation, then U(t) A 1 with the expression
from the fact that, if the .r process has differentiable interpreted as convolution in continuous time, but
paths, aft) 0 for / < 0 does not imply Ad) 0 for / < 0. with A 1 putting discrete weight on integers. The
The mean-square approximation result implies that operator connecting U and u is then A '*a. There are
when the time unit is small the sum of squares of cases where the connection between continuous and
coefficients on Xft s) for negative s must be small discrete time representations is intuitive. For
relative to the sum of squares on X(t — s) for example, if a(s) = exp(- Bs) (with the
positive s, but the first few lead coefficients will exponentiation interpreted as a matrix exponential
generally be non-zero and will not go to zero as the in a multivariate case), then
time interval goes to zero. This would lead to
mistaken conclusions about
*
w c(s w(t)
>2
■
.HO. 0 b(s)
[HO J
least in parameters and therefore to be more diffi- numerical approximations to the distribution of yt +
cult to handle than a more naive discrete model. s conditional on data through time t. Ait- Sahalia (
However with modem computing power, such ) surveys methods based on this
models are usable. Bergstrom ( ) provides a approach.
discussion of estimating continuous time constant Modelling in continuous time does not avoid the
coefficient linear stochastic differential equation complexities of connecting discrete time data to
systems from discrete data, the papers in the book ( continuous time reality - it only allows us to
) he edited provide related discussions, and confront them directly. One reason this is so seldom
Hansen and Sargent ( ), in some of their own done despite its technical feasibility is that it forces
chapters of that book, discuss estimation of con- us to confront the weakness of economic theory in
tinuous time rational expectations models from continuous time. A model like ( )-( )
discrete data. makes an assertion about how many times y and x
Estimating stochastic differential equation are differentiable, and a mistake in that assertion can
models from discrete data has recently become result in error as bad as the mistake of ignoring the
easier with the development of Bayesian Markov time aggregation problem. Economic theory does
chain Monte Carlo (MCMC) methods. Though not have much to say about the degree of
implementation details vary across models, the differentiability of most aggregate macroeconomic
basic idea is to approximate the diffusion equation time series. When the theory underlying the model
has no believable restrictions to place on fine-
dyt = a(yt)dt + b(yt)dWh (17) where Wt grained dynamics, it may be better to begin the
modelling effort in discrete time. As is often true
is a Wiener process, by when models are in some respect underidentified, it
yl = e-a^)y,-5 + b{yl^l. (1 8) is likely to be easier to begin from a normalized
reduced form (in the case the discrete time model)
Such an approximation can be quite inaccurate in exploring the range of possible interpretations
unless 6 is very small. But one can in fact choose generated by different potential identifying
<5 very small, much smaller than the time interval assumptions.
at which data are observed. The values ofyt at times Recent developments in financial economics
between observations are of course unknown, but if have produced one area where there are continuous
they are simply treated as unknown ‘parameters it time economic theories with a solid foundation.
may be straightforward to sample from the joint Stochastic differential equations (SDEs) provide a
posterior distribution of the y’s at non-observation convenient and practically useful framework for
times and the unknown parameters of the model. modelling asset prices. These SDE models imply
The Gibbs sampling version of MCMC samples non-differentiable time paths for prices, and it is
alternately from conditional posterior distributions known (Harrison et al. ) that differentiable time
of blocks of parameters. Here, sampling from the paths for asset prices would imply arbitrage
distribution of y at non-observation dates opportunities, if there were no transactions costs or
conditioning on the values of model parameters is bounds on the frequency of transactions.
likely to be easy. If the model has a tractable form, However, there are in fact transactions costs and
it will also be easy to sample from the posterior bounds on transactions frequencies, and no-
distribution of the parameters conditional on all the arbitrage models for asset prices break down at very
y values, both observed and unobserved. fine, minute-by-minute, time scales. Successful
Application of these general ideas to a variety of behavioural modelling of these fine time scales
financial models is discussed in Johannes and requires a good theory of micro-market structure,
Poison ( ). which is still work in progress.
Another approach that has become feasible with It is worthwile noting that a process can have
increased computing power is to develop non-differentiable paths without producing white
Continuous-Time Stochastic Models 2193
noise residuals at any integer order of differentia- Sims, C.A. 1971. Approximate specifications in distributed
lag models. In Proceedings of the 38th Session, Bulletin of the
tion: for example, a model satisfying ( ) with a(s) =
International Statistical Institute 44, Book 1.
s'^e Such a process has continuous paths with Sims, C.A., and S. Maheswaran. 1993. Empirical implications
unbounded variation and is not a semimartingale. of arbitrage-free asset markets. In Models, methods and
That is, it is not the sum of a martingale and a applications of econometrics, ed. P.C.B. Phillips. Oxford:
See Also
Models in which agents can revise their decisions
► Time Se continuously in time have proved fruitful in the
analysis of economic problems involving
intertemporal choice under uncertainty (cf.
Malliaris and Brock ). These models frequently
Bibliography
produce significantly sharper results than can be
A'it-Sahalia, Y. 2007. Estimating continuous-time models derived from their discrete-time counterparts. In the
using discretely sampled data. In Advances in economics and majority of such cases, the dynamics of the
econometrics, theory and applications. Ninth World Congress , ed. R. underlying system are described by diffusion
Blundell, T. Persson, and W.K. Newey, vol. 3.
processes, whose continuous sample paths can be
Cambridge: Cambridge University Press.
Bergstrom, A.R., ed. 1976. Statistical inference in continuous time economic represented by Ito integrals. However, in selected
models. Amsterdam: North- Holland. applications, this assumption can be relaxed to
Bergstrom, A.R. 1983. Gaussian estimation of structural include both non-Markov path- dependent
parameters in higher order continuous time dynamic
processes and Poisson-directed jump processes.
models. Econometrica 51: 117-152.
Geweke, J. 1978. Temporal aggregation in the multiple An early application of this mode of analysis
regression model. Econometrica 46: 643-662. was the lifetime consumption-portfolio selection
Hansen, L.P., and T.J. Sargent, eds. 1991. Rational expectations problem (Merton , ). Under the assump
econometrics. Boulder and Oxford: Westview Press.
tions of continuous trading and asset returns gen-
Harrison, J.M., R. Pitbladdo, and S.M. Schaefer. 1984.
Continuous price processes in frictionless markets have erated by diffusion processes, the derived structure
infinite variation. Journal of Business 57: 353-365. of optimal portfolio demands produce portfolio-
Johannes, M., and N. Poison. 2006. MCMC methods for separation or mutual fund theorems like those
continuous-time financial econometrics. In Handbook of derived in the static Markowitz-Tobin mean-
financial econometrics, ed. Y. A'it-Sahalia and L.P. Hansen.
Amsterdam: North-Holland.
variance model, but without the objectionable
Marcet, A. 1991. Temporal aggregation of economic time assumption of either quadratic preferences or
series. In Rational expectations econometrics, ed. L.P. Hansen and Gaussian-distributed asset prices. Indeed, in the
T.J. Sargent. Boulder and Oxford: Westview Press. special, but prototypical, case of lognormally-
Rozanov, Yu. A. 1967. Stationary random processes, trans A.
Feinstein. San Francisco/Cambridge/London/
distributed asset prices, the intertemporal optimal
Amsterdam: Holden-Day. rales are identical to those of the mean-variance
model. The continuous-time analysis thus provides
a reconciliation of this classic model, with models
of general expected utility maximization
2194 Continuous-Time Stochastic Models
in an environment where asset ownership has strategies which replicate the payoff structure of the
limited liability. missing pure securities. There is much analysis to
Using these same assumptions of continuous suggest that continuous-trading opportunities
trading and lognormality of security prices, Black together with diffusion representations sentations
and Scholes ( ) derived a formula for pricing for the evolution of the economy provide a
options that provided the foundation for subsequent particularly fertile environment for fulfilling the
development, of a unified theory of corporation Radner conditions.
liability evaluation and general contingent- claim Under reasonably general assumptions about
pricing. Cox et al. ( ) use the agents’ preferences and endowments Breeden ()
continuoustrading methodology with diffusion among others has shown that the
processes to derive a general theory for the term intertemporal equilibrium allocations generated in
structure of interest rates. economies with continuous trading in a finite
Building on the continuous-time model of indi- number of securities can be Pareto efficient. In the
vidual choice, Merton ( ), Breeden ( ), analysis of the individual portfolio selection prob-
and Cox et al. ( ) develop intertemporal lem underlying these equilibrium models, the
models of equilibrium asset prices. Huang ( ) derived portfolio-separation theorems show that the
provides a stronger foundation for these set of individually-optimal portfolios can be
models by showing that if information in an econ- generated by combinations of relatively few com-
omy with continuous-trading opportunities evolves posite securities or mutual funds.
according to diffusion processes, then equilibrium The extensive literature on options and
security prices will also evolve according to contingent-claims pricing provides further evidence
diffusion processes. that continuous-trading opportunities make possible
In the intertemporal version of the Arrow- a large reduction in the number of securities
Debreu general equilibrium model with complete markets without loss of efficiency. Although
markets, the markets need only to be open ‘once’ typically partial equilibrium in nature, these
because agents will have no need for further trade. analyses show that continuous-trading dynamic
The continuous-trading model is in this respect at portfolio strategies using as few as two securities
the opposite extreme. Economies in which the can replicate a wide range of state and time-
dynamics of the system are described by diffusion dependent payoff structures.
processes will have a continuum of possible states In perhaps the most general analysis to date,
over any finite interval of time. Thus, in the strict Duffie and Huang ( ) study the role continu
sense, to have complete markets in the continuous- ous trading plays in successfully implementing
time diffusion model requires an uncountable Arrow-Debreu equilibria with infinite dimensional
number of pure Arrow-Debreu securities. The commodity spaces, using only a finite number of
continuoustrading model with diffusions, securities. In particular, they derive necessary and
nevertheless, appears to have many of the important sufficient conditions for continuous-trading
properties of the Arrow-Debreu model, but without portfolio strategies with a finite number of
nearly so many securities. securities to effectively complete markets in a
As is well known, in the absence of complete Radner economy. By working with martingale
Arrow-Debreu markets, a competitive equilibrium representation theorems, Duffie and Huang show
does not in general produce Pareto optimal that the class of dynamics for which these results
allocations. However, Radner ( ) has shown obtain extends beyond vector diffusion processes to
that an Arrow-Debreu equilibrium allocation can be include some non-Markov path dependent
achieved without a full set of pure time-state processes. They also show that having
contingent securities if agents can use the available heterogeneous probability assessments among
securities to implement dynamic trading agents provides no important difficulties with the
results, provided all agents’ subjective
Continuous-Time Stochastic Models 2195
probability measures are uniformly absolutely return per unit time is the interest rate r(t). The rate
continuous. Although there remain further technical of return dynamics on risky security j can be written
issues to be resolved, it is evident that the as
continuous-trading models provide a strong foun-
dation for the belief that a good substitute for dp p
i/ j = Xj(S, t)dt + (Tj(S, t)dzj,
having many markets and securities is to have fewer 7= 1, 2, ...,n
markets which are open for trade more frequently.
A sketch of the derivation of the portfolio where a; is the instantaneous conditional expected
separation theorem along the lines of Merton ( , rate of return per unit time; af is the conditional
, ) and Breeden ( ) is a variance per unit time; and dzy is a Wiener process.
follows: Denote by pJk(S, t) the instantaneous correlation
coefficient per unit time between dzj and dzk, and
At each time t, each consumer-investor acts so
denote by p,/S, t) the instantaneous correlation
as to
coefficient between dq, and dzj, i = 1 , 2 , . . . , m
a n d j = 1 , 2 , . . . , n.
The accumulation equation for the consumer’s
U[c{i:),S{x),x\dx + B[W(T),S(T), T] j
Max E, wealth can be written as
(l)
d W = [rW + y — c\dt
where E, is the conditional expectation operator,
conditional on information available at time t. S (f) + Y/wjW[dPj/Pj-r dt] (4 )
= [Si (f), . . . , Sm(t)] is a finite-TM vector set of 7=1
state variables which together with the consumer’s
current wealth W(t) is sufficient to describe the state where y = y(S,!) is the consumer’s wage income;
of the economy at time t. c(r) denotes the Wj is the fraction of his wealth allocated to risky
instantaneous consumption flow selected at time t.
security j at time t, and is the
U is a strictly concave, statedependent utility
function for consumption and B represents utility fraction allocated to the riskless asset.
from bequests at date T. The optimal consumption and portfolio mles,
The evolution of the state variables S is c*(W, S, t) and w*(W, S, t), are derived by the
described by a Markov system of fto stochastic technique of stochastic dynamic programming.
differential equations Among the first-order conditions to be satisfied by
these optimal rules are the n conditions for the
d.S’,(7j = Gi(S, i)dt + ll,(S, t)dq„ i optimal portfolio holdings at time t, which can be
= 1,2, . . . , m (2) expressed as j = 1, 2,..n
Consider the continuous-trading portfolio structure, it is evident that this security is the natural
strategy which allocates fraction x; (?) = (dF/dSj) generalization of Arrow-Debreu pure state
Hjl\ajV{i)\ to security j, j = 1 , . . . , m a n d securities to an environment where there is a
continuum of states defined by St and x. By
1 to the riskless security at time t,
changing the time and state parameters x and St, one
where V(t) denotes the value of die portfolio. It can generate all of the uncountable number of pure
follows from Eq. and the prescribed allocation that securities. Moreover, F, the solution to Eq. used to
the dynamics of the portfolio value can be written as implement each strategy, w i l l also be the
equilibrium price for the corresponding pure Arrow-
Debreu security.
J2xj(aJ ~ r ) + r At + '^jXjOjAzj Continuous trading, like any other continuous-
dV = V
revision process, is of course an abstraction from
f-irp m
dF physical reality. If, however, the length of time
Y.Q^ H j( a J- r )/^J + rV At +
Y.FFHjAclj
YdSj between revisions is very short, then the
8
( )
continuous-trading optimal solutions will be a
reasonable approximation to their discrete-time
because dZj = dq/J = 1, 2 , . . . , m. counterparts (see Samuelson and Merton , ).
As a solution to Eq. , F is twice-continuously From the work of Magill and
differentiable. Thus, Ito’s Lemma can be used to Constantinides ( ), this conclusion appears
describe the stochastic process for F as to be robust even in the presence of transactions
costs, which cause trading to be discrete almost
1 „dF certainly.
dF'
2
length of time between revisions in the capital Merton, R.C. 1977. On the pricing of contingent claims and
stock, although hardly instantaneous, may well be the Modigliani-Miller theorem. J o u r n a l o f F i n a n -
c i a l E c o n o m i c s 5: 241-249.
quite short, relative to the time scale of the Merton, R.C. 1982a. On the microeconomic theory of
exogenous processes. investment under uncertainty. In H a n d b o o k o f
m a t h e m a t i c a l e c o n o m i c s , vol. 2, ed. K.J. Arrow
and M.D. Intriligator. Amsterdam: North-Holland.
See Also Merton, R.C. 1982b. On the mathematics and economics
assumptions of continuous-time models. In F i n a n c i a l
economics: Essays in honor of Paul
► Continuous-Tiny Stochastic Processes C o o t n e r , ed. W.F. Sharpe and C.M. Cootner.
► 'inance Englewood Cliffs: Prentice-Hall.
► Options Radner, R. 1972. Existence of plants, prices, and price
expectations in a sequence of markets. E c o n o m e t r i c a
40: 289-303.
Bibliography Samuelson, P.A. 1970. The fundamental approximation
theorem of portfolio analysis in terms of means, vari-
Bismut, J.M. 1975. Growth and optimal intertemporal ances, and higher moments. R e v i e w o f E c o n o m i c
allocation of risks. J o u r n a l of Economic S t u d i e s 32: 537-542.
T l i e o n ’ 10: 239-257.
Black, F., andM. Scholes. 1973. The pricing of options and
corporate liabilities. J o u r n a l of Political
E c o n o m y 81: 637-654.
Bourguignon, F. 1974. A particular class of continuoustime
stochastic growth models. J o u r n a l o f E c o n o m i c
T h e o r y 9: 141-158. Continuous-Time Stochastic Processes
Breeden, D.T. 1979. An intertemporal asset pricing model
with stochastic consumption and investment opportunities.
Chi-Fu Huang
J o u r n a l o f F i n a n c i a l E c o n o m i c s 7: 265-296.
Cox, J.C., J.E. Ingersoll Jr., and S.A. Ross. 1985a. Atheory of
the term structure of interest rates. E c o n o m e t r i c a 53:
385-408.
Cox, J.C., J.E. Ingersoll Jr., and S.A. Ross. 1985b. An
intertemporal general equilibrium model of asset prices.
E c o n o m e t r i c a 53: 363-384. Applications of continuous-time stochastic pro-
Duffie, D., and C. Huang. 1985. Implementing Arrow-Debreu cesses to economic modelling are largely focused
equilibria by continuous trading of a few long-lived securities.
E c o n o m e t r i c a 35: 1337-1356. Huang, C. 1985. on the areas of capital theory and financial markets.
Information structure and equilibrium asset prices. In these applications as in mathematics generally,
J o u r n a l o f E c o n o m i c T h e o t y 35: 33-71. Magill, the most widely studied continuous time process is
M.J.P., and G.M. Constantinides. 1976. Portfolio selection a Brownian motion - so named for its early appli-
with transactions costs. J o u r n a l o f E c o n o m i c
T h e o i y 13: 245-263.
cation as a model of the seemingly random move-
Malliaris, A.G., and W.A. Brock. 1982. S t o c h a s t i c ments of particles which were first observed by the
methods in economics and finance. English botanist Robert Brown in the 19th century.
Amsterdam: North-Holland. Merton, R.C. 1969. Lifetime Einstein ( ), in the context of statistical
portfolio selection under uncertainty: The continuous-time
case. R e v i e w o f E c o n o m i c s a n d S t a t i s t i c s 51:
mechanics, is generally given credit for the first
247-257. mathematical formulation of a Brownian motion
Merton, R.C. 1971. Optimum consumption and portfolio process. However, an earlier development of an
rules in a continuous-time model. J o u r n a l o f equivalent continuous-time process is provided by
E c o n o m i c T h e o r y 3: 373-413.
Merton, R.C. 1973. An intertemporal capital asset pricing
Louis Bachelier ( ) in his theory of stock
model. E c o n o m e t r i c a 41: 867-887. option pricing. Framed as an abstract mathematical
Merton, R.C. 1975a. An asymptotic theory of growth under process, a Brownian motion {B(t); t e R+} is
uncertainty. R e v i e w o f E c o n o m i c S t u d i e s 42: described by the following properties: (1) for 0 : (s
375-393.
Merton, R.C. 1975b. Theory of finance from the perspective
< t < oo,B(t) — B(s) is a normally distributed
of continuous time. J o u r n a l o f F i n a n c i a l a n d random variable with mean zero and variance t - s;
Q u a n t i t a t i v e A n a l y s i s 10: 659-674. (2) for 0 < to < t\ < • • • < t\ < oo,
Continuous-Time Stochastic Processes 2199
invented for the study of financial markets; for information revelation over time. For simplicity, we
references, see Dellacherie and Meyer ( , take the time span of the economy to be [0,1], We
1982), Jacod ( ), and Meyer ( , ). assume that agents are endowed with the same
Although making no explicit use of French information structure F. Readers can think of a
probability theory, the seminal paper of Black and filtration to be like an event tree in a discrete time
Scholes ( ) in the pricing of stock options finite state setting. We also assume that agents at
nevertheless opens up the possibility of its appli- time zero knows that the frue state of the nature is
cation in financial economics. This work was sub- an element co e Q, which they will learn at time
sequently generalized and formalized by Merton ( one.
, ). The idea is that the payoff of a Agents can only consume at time one. For
stock option can be replicated by continuous trad- simplicity again, we take the commodity space to be
ing in its underlying stock and a riskless asset. The the space of square-integrable random variables
replicating strategy is self-financing in that after the defined on (Q, fty P), denoted by L2(P).
beginning of this strategy there are neither There are N + 1 long-lived securities traded
additional funds invested into it nor funds with- indexed by n 0 , 1 , . . . , N. A long-lived security is a
drawn out of it. Thus, to rule out arbitrage oppor- security available for trading all the time in [0,1]
tunities, the stock option must sell for the exact and is represented by a price process. {>S„(0} It
value of the replicating portfolio at any point in pays a dividend only at time one and is equal to
time. S„( 1) almost surely. Price processes are semi-
Black and Scholes’s theory provided a strong martingales (adapted to F) and S„( 1) C_ L2(P). In
incentive for financial economists to study contin- modelling a dynamic asset trading economy, before
uous time stochastic processes. The key observation anything interesting can be said, one has to
in this literature was made by Cox and Ross ( ). formulate a budget constraint. That naturally
They noted that since the expected rate of involves stochastic integrals. Jacod ( ) has
return of the stock does not enter the Black and shown that for stochastic integrals to have desired
Scholes pricing formula for a stock option, the price properties, it is necessary that integrators be semi-
of an option must be determined as if investors were martingales. Thus, semimartingale price processes
risk neutral and had probability beliefs such that the can be assumed without loss of generality.
stock earns an expected rate of return equal to the In a Walrasian economy, only relative prices are
riskless rate. Harrison and Kreps ( ) formalized determined. Thus we can assume that the price
this observation in showing system has been normalized such that So(t) = 1 V/
that any arbitrage-free price system can be C [0, 1], We will call the 0th security the riskless
converted into a martingale through a change of an security and the rest risky seciuities.
equivalent probability after a suitable normal- A trading strategy is an (N - 1 (-dimensional
ization. Note that this martingale connection of an predictable process 0 = {0„ (t); n = 0, 1, - N}, where
arbitrage-free price system was vaguely we interpret 0„ (/) to be the number of shares of
foreshadowed in Samuelson and Merton ( ). security n held from t - to t before trading at time t.
Harrison and Kreps ( ) and Harrison and A process is predictable if its values at time t
Pliska ( ) make clear that the answer to depend only upon the information available strictly
whether a contingent claim can be replicated by before time t. Given the interpretation of 0,
dynamic trading is intimately related to the mar- predictability is a natural information constraint.
tingale representation theorem. A sketch of their A trading strategy is said to be simple if it is
arguments will be given. bounded and changes its value at most at a finite
Taken as primitive is a complete separable number of time points in [0,1].
probability space (Q, W- P) and a filtration F {7Ft; i A trading strategy 0; is said to be self-financing
C [0, 1]}. A filtration is an increasing family of sub- if the stochastic integral
sigma-algebras of W representing
Continuous-Time Stochastic Processes 2201
[ 6(s)TdS = N rt
Jo 51 en(s) dS„ (i). (2)
is well-defined and if
Now here is the key observation. Let x be a
T r
f
d(t) S ( t ) = 0(O) S(O) + 6 ( s ) r d S ( s ) Vt e [0, 1] a.s.,
JO
contingent claim. We know from relation ( ) that if
it is marketed, its value over time is equal to its
(l) initial value at time zero plus a stochastic integral
r
where denotes transpose. That is, the value of the with respect to N g-martingalcs. Conversely, a
portfolio 9 at time / is equal to its initial value plus contingent claim x marketed if the conditional
accumulated capital gains or losses from time zero expectation E [x | y(], which is a g-martingale, can
to time t. There are neither new investments into nor be represented by a stochastic integral with respect
withdrawals of funds out of the portfolio. This is to the N g-martingales {Sn(t); n = 1, 2, ..., N}. [Here
just a natural budget constraint. we should remark that any x e L2(P) has a finite
Harrison and Kreps ( ) and Kreps ( ) expectation under g by the Cauchy-Schwarz
show that if all the simple of self-financing trading inequality.] This observation turns on the
strategies are allowed and if arbitrage opportunities machinery of the martingale representation theorem
are absent, then there exists a probability measure g in the study of market completeness.
equivalent to P such that the Radon- Nikodym The security markets are said to be dynamically
derivative E, = dQ/dP is an element of L2(P) and complete if all contingent claims are marketed.
that S is a martingale under g, or a g-martingale. Fix From the above discussion, it follows that markets
g and note that since P and g are equivalent, all the are dynamically complete if all g-martingales are
a.s. statements to follow apply to both. representable as stochastic integrals with respect to
Now we can specify the space of admissible the N risky g-martingale prices. In such event, the
strategies 0 [5]. A self-financing trading strategy 9 N g-martingales are said to have the martingale
is admissible if representation property. Readers might be curious
by now that the riskless asset seems to disappear
from the story. Indeed, whether a contingent claim
is generated by a (not necessarily self-financing)
trading strategy does not depend upon the riskless
d (s ) T dS (s ) t e [0,1] asset after time zero. The riskless asset, however, is
Jo
a vehicle through which the budget is balanced over
is a g-martingale and 9 (1 ) 7 .ST 1) C L2(P). [See time.
Jacod ( ) for sufficient conditions for this to be The contribution made by Harrison, Kreps, and
true.] Then one can show that given 0 [5] indeed Pliska is methodological. They make available a
there are no arbitrage opportunities. powerful machinery for the study of financial/cap-
A contingent claim is an element of L2{P). A ital markets: the theory of martingales. Now we
contingent claim .r is said to be marketed if it can be shall present some consequences of their work.
dynamically manufactured by an admissible trading Since Merton’s ( , , ) analyses
strategy. Formally, .r is marketed if there exists 9 e 0 of optimal intertemporal consumption-portfolio
[*S] such that policies and their implications on equilibrium asset
prices, the conditions under which a price system is
1 representable is an Ito process had been an open
x = d(0)T S(0) + d ( t ) T d S( t ) a . s. question for more than a decade. A short answer
JO
found in Huang ( ) is as follows:
Take the set-up of the economy as above and
The value of .r at time t is 9 (t)T S(t). By the
definition of admissibility, we have
2202 Continuous-Time Stochastic Processes
assume henceforth that there are no arbitrage does there exist an equilibrium where equilibrium
opportunities. Moreover, assume that the informa- price processes are ltd processes? More impor-
tion structure F is a Brownian motion filtration. We tantly, does there exist an equilibrium where
know S is a 0-martingalc, so we can write although there are only a finite number of long-
lived securities traded, the markets are dynamically
S„(f) = £*[S„(l)|r] a.s. complete and thus the equilibrium allocation is
Pareto optimal? Note that in the Arrow-Debreu
_E[S,{1)Zfr]
equilibrium theory, markets for all contingent
E[^] a.s.,
claims are available at time zero. Agents trade to a
Pareto optimal allocation. There is no need and no
where the second equality follows from the Bayes’ incentive for the markets to reopen after time zero.
rule, and where we recall that c = d0/dP, which is Of course, this does not conform with actual market
strictly positive by the fact that Q and P are equiv- structures. We do not have a complete set of
alent. The numerator and the denominator of the contingent markets. What we do have are
above relation are both /’-square integrable martin- constantly-open financial markets where a finite
gales. By the martingale representation theorem of number of long-lived assets are traded. Thus it is
Kunita and Watanabe ( ), we know that any important to know whether there exists an equi-
P-square integrable martingale is representable as librium in such a world and to know the efficiency
an Ito integral. Then S„ is an ltd process by ltd of the resulting allocation.
lemma. Hence any arbitrage-free price system is an It follows from the earlier discussion on the
ltd process when the information structure is a martingale representation property of risky price
Brownian filtration. processes that what is needed for an affirmative
We can also study the sample path properties of answer to the above questions: is that there be a
a price system, which relates to examining empir- riskless security with unit price throughout and a
ically the so-called efficient market hypothesis. finite number of risky long-lived securities that
Much of the empirical work in financial economics have the martingale representation property. What
and accounting concerns the response of cap- complicates the story, however, is that the demand
ital/financial asset prices to information. The null and supply of the long-lived securities must be
hypothesis in this work is typically that the capi- equal in equilibrium. Thus those securities must be
tal/financial markets are efficient in the sense that picked carefully. Moreover, it is not true that a
prices rapidly adjust to new information. But is it finite number of martingales having the martingale
true that prices only make large adjustments at representation property can always be found. Duffie
surprises and what exactly is a surprise, mathe- and Huang ( , ) and Dufifie
matically? Here we turn to the classification of ( ), in exchange as well as production econo
stopping times in the general theory of processes. In mies, demonstrated a procedure to select long-lived
this context, a surprise is a non-predictable stopping securities having the desired properties and condi-
time. We also know that a martingale must be tions under which the number is finite.
continuous at predictable stopping times (provided The martingale connection of an arbitrage-free
that a minor technical condition is satisfied). Thus, system has been generalized to economies where
S can make discrete changes only at nonpredictable securities can pay dividends and agents can con-
stopping times or at surprises. This and other related sume at any time in [0,1], After a suitable normal-
issues can be found in Huang ( , ). A ization, a price system plus the accumulated
reference for the classification of dividends form a martingale under an equivalent
stopping times is Dellacherie and Meyer ( ). probability measure. This is done in Huang ( ).
So we discover that a price system must be ltd Similar theory is also valid in economies
process when the information is a Brownian motion where agents have differential information. Inter-
filtration and when there are no arbitrage ested readers are referred to Duffie and Huang (
opportunities. There still remain further questions: ) for details.
Continuous-Time Stochastic Processes 2203
Although the focus of research has been on Duffie, D. 1986b. Price operators: Extensions, potentials, and
capital theory and financial markets, applications of the Markov valuation of securities. Research Paper No.
813. Graduate School of Business, Stanford University.
the theory of continuous time stochastic processes Duffie, D., and C. Huang. 1985. Implementing Arrow-Debreu
to economic problems outside these areas can be equilibria by continuous trading of few long-lived
found. For example, Dufifie ( ) securities. Econometrica 53: 1337-1356.
applies classical potential theory as in the context of Duffie, D., and C. Huang. 1986a. Multiperiod securities
markets with differential information: Martingales and
Markov processes to valuation of securities, and Li resolution times. Journal of Mathematical
( ) examines the stochastic theory of Economics 15: 283.
the firm in continuous time. He uses point processes Duffie, D., and C. Huang. 1986b. Stochastic production-
to model stochastic demands for commodities and exchange equilibria. Stanford: Graduate School of
Business, Stanford University
endogenizes a firm’s demand for inventories, Duffie, C., and P. Meyer. 1982. Probabilities and
among other things. For applications of the theory potential B: Theory of martingales. New York:
of optimal stopping to game theory, see Hugues ( North-Holland.
) for zero-sum stopping games, Einstein, A. 1905. On the movement of small particles
suspended in a stationary liquid demanded by the
and Huang and Li ( ) for nonzero-sum stop molecular kinetic theory of heat. Annals of Physics
ping games. 17: 549.
Harrison, M., and D. Kreps. 1979. Martingales and arbitrage
in multiperiod securities markets. Journal of Eco-
See Also nomic Theory 20: 381—408.
Harrison, M., and S. Pliska. 1981. Martingales and stochastic
integrals in the theory of continuous trading. Stochas-
► Capital Asset Pricing Model
tic Processes and their Applications 11: 215-260.
► Continuous-Time Stochastic Models Huang, C. 1985a. Information structure and equilibrium asset
► Options prices. Journal of Economic Theory 35: 33-71.
Huang, C. 1985b. Information structure and viable price
systems. Journal of Mathematical Economics 14:
Bibliography 215-240.
Huang, C., and L. Li. 1986. Continuous time stopping games,
Bachelier, L. 1900. Theorie de la speculation. Paris: Gauthier- Working Paper No. 1796-86, Sloan School of
Villars. Management, MIT.
Bismut, J. 1975. Growth and optimal intertemporal allocation Hugues, C. 1974. Markov games. Technical Report No.33,
of risks. Journal of Economic Theory 10: 239-257. Black, F., Department of Operations Research, Stanford University.
andM. Scholes. 1973. The pricing of options and corporate Ito, K. 1944. Stochastic integrals. Proceedings of the
liabilities. Journal of'Political Economy 81: 637-654. Imperial Academy 22: 519-524. Tokyo.
Brock, W., and M. Magill. 1979. Dynamics under uncertainty. Ito, K. 1951. On stochastic differential equations ,
Econometrica 47: 843-868. Memoirs of the American Mathematical Society. Rhode
Chung, K., and R. Williams. 1983. An introduction to Island: The American Mathematical Society.
stochastic integration. Boston: Birkhauser. Ito, K., and H. McKean. 1965. Diffusion processes and
Cox, .1., and S. Ross. 1976. The valuation of options for their sample paths. New York: Springer.
alternative stochastic processes. Journal of Jacod, J. 1979. Calcul stochastique etproblemes de
Financial Economics 3: 145-166. martingales, Lecture Notes in Mathematics, vol. 714.
Cox, .1., .1. Ingersoll, and S. Ross. 1985a. An intertemporal New York: Springer.
general equilibrium model of asset prices. Kreps, D. 1981. Arbitrage and equilibrium in economies with
Econometrica 53: 363-384. infinitely many commodities. Journal of Mathe-
Cox, I, I Ingersoll, and S. Ross. 1985b. A theory of the term matical Economics 8: 15-35.
structure of interest rates. Econometrica 53: 385-408. Kunita, H., and S. Watanabe. 1967. On square-integrable
Dellacherie, C., and P. Meyer. 1978. Probabilities and martingales. Nagoya Mathematics Journal 30:
potential A: General theoiy of process. New York: North-Holland. 209-245.
Duffie, D. 1986a. Stochastic equilibria: Existence, spanning Li, L. 1984. A stochastic theory of the firm. Unpublished PhD
number, and the ‘no expected gains for trade’ hypothesis. thesis, Northwestern University
Econometrica. McKean, H. 1965. Appendix: A free boundary problem for
the heat equation arising from a problem in mathemat-
ical economics. Industrial Management Review 6:
32-39.
2204 Contract Theory
McKean, H. 1969. Stochastic integrals. New York: Academic. problem; Incentive compatibility; Incentive
Merton, R. 1969. Lifetime portfolio selection under uncer- constraints; Incomplete contracts; Informative-
tainty: The continuous case. Review of Economic and Statistics 51:
247-257.
ness principle; Insurance; Laffont, J.-J.; Limited
Merton, R. 1971. Optimum consumption and portfolio rules liability; Monotonicity; Moral hazard; Multi-
in a continuous time model. Journal of Economic Theory 3: 373- agent organizations; Non-verifiability; Optimal
413. contract; Pontryagyn principle of optimality;
Merton, R. 1973a. An intertemporal capital asset pricing
model. Econometrica 41: 867-888.
Principal and agent; Revelation principle; Risk
Merton, R. 1973b. Theory of rational option pricing. Bell Journal aversion; Risk neutrality; Sharecropping;
of Economics and Management Science 4: 141-183. Spence-Mirrlees condition; Tournaments
Merton, R. 1975. An asymptotic theory of growth under
uncertainty. Review of Economic Studies 42: 375-393.
Merton, R. 1977. On the pricing of contingent claims and the
Modigliani-Miller theorem. Journal of Financial Economics 5:
241-249. JEL Classifications
Meyer, P. 1966. Probability and potentials. Waltham: Blaisdell DO
Publishing Company.
Meyer, P. 1976. Un corns sur les integrales stochastiques. In
Seminaires de Probability X . Lecture Notes in Mathematics 511.
As with so many major concepts in economics,
New York: Springer. contract theory was introduced by Adam Smith
Samuelson, P. 1965. Rational theory of warrant pricing. who, in his monumental Wealth of Nations ( ,
Industrial Management Review 6: 13-32.
book ID, ch. 2), considered the relationship between
Samuelson, P., and R. Merton. 1969. A complete model of
warrant pricing that maximizes utility. Industrial Management
peasants and farmers through this lens. For instance,
Review 10: 17-46. he pointed out the perverse incentives provided by
Stratonovich, R. 1966. A new representation for stochastic sharecropping contracts, widespread in 18th-century
integrals and equations. SIAM Journal of Control 4: 362-371. Europe. However, it is fair to say that the issues of
incentives and contract theory were largely ignored
by economists until the end of the 20th century. By
then, the focus of economic theory was on the
working of markets and price formation. Firms were
viewed only as production technologies, and the
issue of the separation between ownership and
Contract Theory control was most often put aside. This black-box
approach was, of course, quite unsatisfactory. At the
David Martimort
turn ofthe 1970s, with the methodological
revolution of game theory, more emphasis was
placed on strategic interactions between a small
number of players in a world where informational
Abstract problems matter. From this new perspective, the
This article offers a brief overview of contract. It allocation of resources is no longer ruled by the
focuses on the theory of complete contracts and price system but by contracts between
the three associated paradigms of adverse selec- asymmetrically informed partners. Contract theory
tion, moral hazard and non-verifiability. By has deeply changed our view of the functioning of
showing difficulties in allocating resources organizations and markets.
between asymmetrically informed partners, This article aims to provide a brief overview of
contract theory has deeply changed our view contract theory, stressing a few major insights and
ofthe functioning of organizations and markets. illustrating them with useful applications. Due to
space constraints, it does not do justice to several
Keywords aspects of contract theory, and will mostly reflect
Adverse selection; Asymmetrical information; my own tastes in the field. In particular, I focus on
Bayesian-Nash equilibrium; Collusion; Contract the so-called theory of complete contracts, leaving
theory; Cost observability; Free-rider
Contract Theory 2205
aside the burgeoning theory of incomplete contracts Many mechanisms or institutions lead to this
which is covered elsewhere in this dictionary. outcome. Both the price mechanism and a take-it-
Successive sections deal respectively, with adverse or-leave-it offer by one party to the other would
selection, moral hazard and non-verifiability: the achieve the same allocation, although with different
three different paradigms which have been used in distributions of the surplus between the traders. If
the field of complete contract theory. Since the the principal retains all bargaining power (for
distinction between complete and incomplete instance, because there is a competitive fringe of
contracts is easier to draw once these notions have potential sellers), he could offer a forcing contract
already been explained, I will postpone such stipulating an output q (0) and a transfer /(()) which
discussion to the end of the article. just covers the seller’s cost. This forcing contract
maximizes the buyer’s net gains from trade and
leaves the seller just indifferent between
participating or not.
In what follows, we mostly focus on the case
Adverse Selection
where the uninformed principal has full bargaining
Consider the following buyer-seller relationship as power in contracting. In this framework, the con-
the archetypical example of contractual relationship tract between the buyer and the seller does not only
between a principal (the buyer) and his agent (the have the allocative and distributive roles it has
seller) who produces some good or service on his under complete information. It also has the role of
behalf. The mere delegation of this task to the agent communicating information from the informed party
gives the agent access to private information about to the uninformed party. This communication role
the technology. This adverse selection environment suggests that the informed party should be given a
is captured by assuming that a technological choice among different options and that this choice
parameter 9 is known only by the agent. It is drawn should reveal information about the adverse
from a distribution in an exogenous type space © selection parameter.
which is common knowledge. Neither the principal A first step in the analysis consists of describing
nor a court of law observes this parameter. the set of allocations which are feasible under
Contracts cannot specify outputs and prices as a asymmetric information. The basic tool for doing so
function of the realized state of nature. is the revelation principle (see Gibbard ; Green and
The buyer enjoys a net benefit S(9, q) -1 when Laffont ; Dasgupta et al. ; Myerson , among others),
buying q units of output at a price t. The seller which states that there is no loss of generality in
enjoys a profit t —C{9, q) from producing that restricting the analysis to revelation mechanisms
good. We will assume that these functions are that are direct, that is, of the form Jv & with 9 a
concave in q. Notice that the state of nature 9 might message (‘report’) sent by the informed seller to the
affect both the agent’s and the principal’s utility uninformed buyer, and truthful, that is, such that the
functions. This can, for instance, be the case if this agent finds it optimal to report his true type.
parameter also determines the quality of the good to Therefore, incentive feasible contracts satisfy
be traded. the following incentive constraints
Under complete information, efficiency requires
that the buyer and the seller trade the first-best
t { 9) - C ( 9, q( 9 )) > t ( e )
quantity q (0) such that the buyer’s marginal benefit
from consumption equals the seller’s marginal cost -c[9,q(9^v(9,9^je& 2 . (2)
of production:
To be acceptable, a contract must also satisfy
^ (9 ,q f 9 ) ) = ^( 9 ,q *( 9) ) . ( 1) the seller’s participation constraints
2206 Contract Theory
one needs also to impose a property on the type organizations. Such complex organizations emerge
distribution, the so-called monotonicity of the haz- because of the need to share common resources,
ard rate (see Bagnoli and Bergstrom ). Otherwise, produce public goods, internalize production
the optimal contract may entail some area of externalities or enjoy information economies of
pooling such that all types belonging to a set with scale. Although any such reason calls for a specific
positive measure produce the same amount and are analysis, a few common themes of the literature can
paid the same price. The optimal solution may then be highlighted by remaining at a rather general
level.
be obtained using ‘ironing techniques’ (see for
Regarding the implementation concept, different
instance Guesnerie and Laffont )•
notions of incentive compatibility may be used
depending on the context. First, agents may know
Direct Extensions
each other’s types and play a Nash equilibrium of
Adverse-selection methodology has been success-
the direct revelation mechanism offered by the
fully extended in various directions allowing for
principal (see Maskin , and the discussion of the
multidimensional types (Armstrong and Rochet ),
non-verifiability paradigm below). Second, agents
and/or multiple outputs (Laffont and Tirole , ch. 3),
may only know their own type, form beliefs on each
and type-dependent reservation utilities (Lewis and
others’ types and play a Bayesian-Nash equilibrium
Sappington ; Jullien ). There, the analysis is
(see D’Aspremont and Gerard-Varet ). Third, one
substantially more complex as types can no longer
may also
be ranked as easily as in the model sketched above.
insist on dominant strategy implementation because
The Spence-Mirrlees condition might fail to hold
it does not depend on the specification of beliefs
and global incentive constraints may bind, leading
(see Gibbard ; Groves ; Green and Laffont ). To
to pooling allocations being optimal. Another
each implementation concept corresponds a notion
interesting extension is the case of hidden
of incentive feasibility. Once the set of incentive
knowledge, in which contracting takes place before
feasible contracts is defined, one can proceed to
the agent becomes informed. The logic of such
optimization. It is a trivial observation that, the
models is very close to that we discuss below in the
more restrictive the implementation concept, the
section on moral hazard. In a nutshell, the trade-off
lower is the principal’s payoff at the optimum.
between allocative efficiency and rent extraction is
In some cases, such as the provision of public
now replaced by the trade-off between insuring the
goods within a society of privately informed agents
agent against shocks on costs and inducing him to
or in bargaining models between a buyer and a
reveal his cost once it is known. Output distortions
seller with equal bargaining power, the goal is no
still arise (see Laffont and Martimort ch. 2, for
longer to design a multilateral contract which would
details). Others have endogenized the asymmetric
extract the rents of all agents but, instead, to
information structure and examined the incentives
maximize some ex ante efficiency criterion under
to leam about the unknown parameter (see, for
incentive constraints. Groves ( ) showed that
instance, Cremer et al. ). Finally, there exists a
dominant strategy mechanisms suffice to implement
literature that considers the case where the principal
the first-best decision in a public good context. One
is the informed party (Maskin and Tirole , ).
caveat is that the budget generally fails to be
New difficulties
balanced. D’Aspremont and Gerard-Varet ( )
arise from the fact that the mere offer of the
proposed a Bayesian incentive-compatible
contract may signal information.
mechanism which implements the first-best and still
satisfies budget balance. As argued by Laffont and
Multi-agent Organizations
Maskin ( ), such a mechanism may con
The most important extensions of the adverse
flict with the agents’ participation constraint. In a
selection paradigm certainly concern multi-agent
bargaining environment, Myerson and Satterthwaite
( ) showed in a similar vein
2208 Contract Theory
that there exists no Bayesian bargaining mechanism When competition between agents or between
that is efficient, budget-balance and individually agents and the supervisors supposed to monitor
rational. them would benefit the principal, one must consider
The optimal multilateral contract can be very the possibility of collusion aimed at securing more
sensitive to the information structure. In environ- rent. Reducing the scope for collusion requires
ments where risk-neutral agents have correlated using mechanisms that are less sensitive to
types but know only their own type, the principal information and reducing supervisory discretion.
can condition one agent’s compensation on Incentive contracts look more like inflexible
another’s report. By doing so, the principal can bureaucratic rules (see Tirole ; Laffont and
fully extract the rent from both agents in a Martimort ). The optimal response to collusion may
Bayesian-Nash equilibrium. One may view this also entail more delegation to lower levels of the
result as a strong rationale for relative performance hierarchy, as in Laffont and Martimort ( ) and
evaluation, yardstick competition, benchmarking Faure-Grimaud et al. ( ).
and internalization of similar activities within the
same organization. This puzzling insight of Cremer Dynamics
and McLean ( ) no longer Different extensions of the static framework cor-
holds when one introduces risk-aversion, ex post respond to different abilities of the contractual
participation constraints or limited liability con- partners to commit themselves inter-temporally
straints. These assumptions reintroduce information and/or different ways for the cost parameters to vary
rents in the multi-agent organization, and the over time. Under full commitment, the lessons of
standard trade-off between efficiency and rent the static rent-efficiency trade-off can be easily
extraction reappears. extended, although the precise features of the
When the agents’ types are independently dis- optimal contract depend on how types evolve over
tributed, yardstick competition is ineffective and the time (see, for instance, Baron and Besanko , for the
agents derive information rents. However, the case of persistent types). The case of limited
externality that one agent’s task may exert on commitment is more interesting. Longterm
another can shape the distribution of these rents. In contracts may either be renegotiated (Dewatripont ;
competitive environments, such as procurement Hart and Tirole ; Laffont and Tirole ) or even are
auctions among sellers, it is no longer the distribu- not feasible, in which cases the parties resort to spot
tion of the agents’ marginal costs but the distribu- contracts (Laffont and Tirole ). The rent-efficiency
tion of their virtual marginal costs (see Myerson ) trade-off must be adapted to take into account how
which determines who should produce and how information is revealed progressively over time.
much. Because virtual costs may be ranked However, the basic idea still holds. As past per-
differently from true costs, inefficiencies arise formances reveal information about the agent’s
under asymmetric information. Moreover, compe- type, the optimal contract trades off ex post effi-
tition may help reduce rents by putting each agent ciency gains in contracting against the agent’s
under the threat of being excluded from production desire to hide information in the earlier periods of
if he overstates his cost too much. There is then a the relationship so as to secure more rent in the later
positive externality among competing agents. periods.
Instead, more cooperative environments, such as
Applications
public good problems or procurement of com-
Since the mid- 1980s, models of optimal contracting
plementary inputs by several suppliers, involve
under adverse selection have spanned the economic
negative externalities between agents. Given that
literature. Let us quote only a few major applica-
each agent has a limited impact on the organiza-
tions. Mirrlees ( ) analysed optimal taxation
tion’s overall production, the incentives to overstate
schemes when the agent’s productivity is privately
costs and thereby receive greater transfers are
observed. He introduced the Spence-Mirrlees
exacerbated. ‘Free riding’ arises in such orga-
nizations (see Mailath and Postlewaite ).
Contract Theory 2209
condition and derived die implementability condi- is affected by an effort <? undertaken by the agent
tions. He also used optimal control techniques at a cost ij/(e) (where i// > 0 and ij/" > 0). The
(Pontryagyn Principle) to compute the optimal tax- cumulative distribution is F(q\e) (with
ation scheme. (The taxation problem differs from density/(q\ej) on a support Q \q, q\ independent of
our buyer-seller example because participation in the agent’s effort. To simplify, the agent’s
the mechanism is mandatory and the state’s budget preferences are separable in money and effort: U =
constraint must be added to the characterization of u(t) — ij/(e) where w(-) is increasing and concave
feasible allocations. ) (u' > 0, u" < 0). The agent’s outside option is not to
Mussa and Rosen ( ) studied the problem produce, which gives him a payoff normalized to
of a monopolist selling one unit of a good to a zero.
continuum of consumers vertically differentiated The agent’s effort is observable neither by the
with respect to their willingness to pay for the principal nor by a court of law. This is a moral
quality of this good. This was the first model using hazard setting. Contracts stipulate the agent’s
adverse selection techniques in a framework payment as a function of the realized quality
without income effect. Maskin and Riley ( ) assumed to be observable and verifiable
were interested in characterizing the optimal non- (contractible) by a court of law. Therefore, con-
linear price used by a monopolist in a second- tracts are of the form {t(q)}-, n.
degree price discrimination context. If the effort were observable, its value could
Baron and Myerson ( ) applied the meth also be specified by contract. Therefore, the seller
odology to the regulation of natural monopolies can at the same time be forced to exert the first-
privately informed about their marginal costs of best level of effort and be frilly insured against
production. Laffont and Tirole ( ) extended uncertainty on realized quality with a flat payment
this analysis to allow for cost observability but also independent of his performance:
introduced moral hazard elements (the possibility
for the regulated firm to reduce its costs by u(t*) = i// (e*).
undertaking some non-observable effort). They
derived cost-reimbursement rules and pricing pol- This is no longer the case when the agent’s effort is
icies. They showed that menus of linear contracts non-verifiable. The first step of the analysis is to
might implement the optimal contract. describe the set of feasible incentive contracts
Green and Kahn ( ) and Hart ( ) implementing a given level of effort e.
studied labour market contracts and discussed In amoral hazard setting, incentive constraints
distortions towards overemployment or underem- write as:
ployment that may arise depending on the con-
tractual environment considered.
Finally, Townsend ( ) and Gale and “W?))/(<7\ e)dq - He)
1
Hellwig ( ) analysed optimal financial con
tracts in a framework where the borrower’s income > f u(t (q))f (q\ ^)dq- i l / (e') V(e,e' ). (6)
is observable only ex post and at a cost. Optimal Jq
contracts may look like debt in such environments.
The agent’s participation constraint is:
Moral Hazard
-H ) > °-
e
(7)
To return to our buyer-seller example, we now 2
assume that there is only one unit of a good to be
Risk Neutrality
traded whose quality q is random and which yields
A first case of interest is when the agent is risk-
a surplus S(q) to the buyer. The distribution of
neutral (u(t) = t). The simple ‘sell-out’ contract,
quality
t(q) = S(q) — C where C is a constant, implements
2210 Contract Theory
the first-best level of effort e . Provided that 5 exert effort, which requires that payments be linked
to performance. There is now a trade-off between
(q)f(qI e*) ~ ^is scheme also extracts all insurance and incentives.
the surplus from the agent who is just indifferent Second, the monotonicify of the agent’s com-
between producing or not.
pensation with respect to the qualify level (a priori a
Intuitively, with such a ‘sell-out’ contract, the
quite intuitive property) is obtained only when the
agent’s private incentives to exert effort are aligned
monotone likelihood ratio property holds,
with the social incentives. This efficient outcome is
namely, when fe{Ct\e)
obtained by, first, having the agent pay a bond f(q\e)
> 0 . This property
worth C for the right to serve the principal, and _
second, having the principal pay an amount S(q) means that higher levels of performance are
contingent on the quality realized. more informative about the agent’s effort.
Such a ‘sell-out’ contract requires that the agent Finally, the optimal contract must use all sig-
bears the full consequences of a bad performance. It nals which are informative about the agent’s effort
might not be feasible when the agent has limited but no uninformative signals. Using them would
liability and cannot be punished for bad only let the agent bear more risk without any
performances. (For details, see Laffont and beneficial impact on incentives. This is the
Martimort , ch. 4). The conjunction of moral hazard so-called informativeness principle of
and limited liability allows the agent to derive a Holmstrom ( ).
limited liability rent. Intuitively, only rewards, not
punishments, can be used to provide incentives, and Extensions
this restriction on instruments is costly for the In a model with a finite number of qualify and effort
principal. This rent creates a trade-off between levels, Grossman and Hart ( ) offered a
efficiency and rent extraction, as in the adverse careful study of the set of incentive constraints and
selection framework. Effort is distorted below the its consequences for the shape of optimal contracts.
first-best level. There is no general result on the ranking between the
first-best and the second-best effort levels in such
environments. The discrete version of the first-order
Risk Aversion approach requires that only nearby constraints matter
Let us turn to the more complex case of risk in the agent’s problem. This concavity of the agent’s
aversion. A first concern of the literature has been problem is ensured when F(q\e) is itself convex in q.
to ‘simplify’ the set of incentive constraints ( ) by In models with a continuum of effort levels and
replacing it with a first-order condition: outcomes, this first- order approach was suggested in
Mirrlees ( ),
more rigorously justified in Rogerson ( ) and
u{t{q))fM\e)dq = i//'(e). (8)
Jewitt ( ) and applied in Holmstrom ( )
and Shavell ( ).
Denoting by X (resp. /i) the positive multiplier of The moral hazard methodology has been used to
the incentive (resp. participation) constraint ( ) justify the optimality of linear incentive schemes in
(resp. ( )), the optimal second-best schedule tsil(q) well-structured environments (Holmstrom and
satisfies Milgrom ); an often found feature of real world
contracts. Equipped with this tool, Holmstrom and
1 Milgrom ( , ) investi
„ | JM\e) (9.
u'i^iq)) f{q\e) ' gated how multiple tasks and jobs should be
arranged in an organization.
This condition yields two important insights. First, To avoid the complexity of models with a
the contract must simultaneously provide the risk- continuum of effort levels, modellers have found it
averse agent with insurance, which requires a fixed useful to focus on simplified environments with two
payment, and with incentives to levels of effort. This approach was
Contract Theory 2211
instrumental in the work on corporate finance of of his performances. This may eliminate any agency
Holmstrom and Tirole ( ). problem, at least when parties do not discount too
much the future (see Laffont and Martimort , ch. 8,
Multi-agent Organizations for an example). More generally, the insurance-
When applied to multi-agent organizations, the incentives trade-off may be relaxed when the risk-
‘informativeness principle’ suggests that an agent’s averse agent’s rewards and punishments can be
compensation should be linked to another’s smoothed over the whole relationship, as shown in
performance if it is informative about his own effort Spear and Srivastava ( ). A direct consequence
(see Mookherjee ). Relative performance evaluation of inter-temporal
and benchmarking can help eliminate common smoothing is that the optimal dynamic contract
shocks affecting all agents’ performances. Of exhibits memory, good (resp. bad) performance
particular importance in this respect are today will also affect positively (resp. negatively)
tournaments which use only the ranking of the future compensations. This insight has been used to
agents’ performances to determinate their formalize a theory of the wage dynamics inside the
compensations. Tournaments provide agents with firm (Harris and Holmstrom ).
insurance against common shocks, which has a Fama ( ) argued that reputation in the
positive incentive effect. More generally, the labour market exerts enough discipline on managers
properties of tournaments and how they compare to alleviate moral hazard even in the absence of
with (a priori suboptimal) linear schemes have been explicit contracts. Holmstrom ( ) built a
investigated in Nalebuff and Stiglitz () and Green model of career concerns where the manager’s
and Stokey ( ). interest in influencing the labour market’s beliefs
In more cooperative environments where dif- concerning his or her quality provides incentives to
ferent agents contribute to a joint project, the exert effort. Career concerns are nevertheless in
fundamental difficulty is how to share the proceeds general not enough to induce first-best effort levels,
of production among agents of the team and still and some inefficiencies remain.
provide some incentives. Since each agent enjoys
only a fraction of those proceeds but bears the full
cost of his effort, he reduces his effort supply. This Non-verifiability
leads to a free-rider problem within teams, which is
analysed in Holmstrom ( ). Let us return to the buyer-seller model above.
If we remain in cooperative environments but Although we now assume that it is observable by
allow now for a principal acting as a budget both the principal and the agent, the state of nature
breaker, this principal may find it worthwhile to 6 may still not be verifiable by a court of law, in
reduce the agency cost of implementing a given which case it cannot be part of the contract. This
effort profile by having agents behave cooperatively shared knowledge stands in sharp contrast with the
(Itoh ). Even when agents do not cooperate, mutual asymmetric information structures examined in
observability of effort levels can also help to previous sections.
eliminate agency cost, as in Ma ( ). The first difficulty consists of building a mech-
This last argument relies on the logic of non- anism based only on verifiable variables (namely,
verifiability models, developed below. the quantities traded and corresponding payments)
which implements the first-best quantity q*(0) and
Dynamics
transfers f*(0). This problem was addressed by
The basic issue investigated by dynamic models of
Maskin ( ). He demonstrated
moral hazard is the extent to which repeated
that the first-best quantities and transfers can easily
relationships alleviate the moral hazard problem.
be implemented with a direct revelation mechanism
The intuition is that the principal should filter out
| t(6a,6b^,q^6a,6b^ g) e 0 2 where
the agent’s effort by looking at the whole history
both the buyer and the seller report
2212 Contract Theory
simultaneously the state of nature they commonly Alternatively, this threat point may be deter-
know. Truth-telling is obviously a Nash equilibrium mined by the allocation of ownership rights where
of this mechanism provided that both traders are such a right gives the owner the opportunity to use
severely punished when making different reports, assets as he prefers in case bargaining fails (see
since such cases would be inconsistent with the Grossman and Hart ; Hart and Moore ). The issue is
underlying information structure. then to derive from those exogenous constraints
A more subtle issue is how to design a mech- distortions of investments and optimal
anism such that this truthful Nash equilibrium is organizations which may mitigate those distortions.
unique. Maskin ( ) proposed a condition for The incomplete contracts paradigm is similar to
players’ preferences such that this is the case. the complete contracts one (adverse selection,
Moore and Repullo ( ) significantly extended moral hazard and non-verifiability) in the sense that
the domain of preferences by hardening the imple- it also imposes limits on what a court may verify. It
mentation concept, replacing Nash behaviour by differs from it because it also imposes exogenous
subgame-perfection in a sequential moves mech- restrictions on the set of mechanisms available to
anism (see Laffont and Martimort , ch. 6, for an the parties. The justification for these restrictions is
example, and Moore , for an exhaustive survey of found either in the bounded rationality of players or
the literature). the difficulties in describing or foreseeing
The basic thrust of the non-verifiability para- contingencies, all theoretical issues which remain
digm is that a court of law can get around non- high on the agenda of economic theorists and are
verifiability by building such revelation still unsettled. The relevant literature on incomplete
mechanisms, at least as long as the non-verifiable contracts is too large to be summarized in this short
state is payoff-relevant. If one sticks to that inter- article. The interested reader may refer to Tirole (
pretation, non-verifiability does not present a ) for an
significant limit on contracting. overview or to the entry for this term in this
A second issue of the literature is the impact of Dictionary.
non-verifiability on the incentives of traders to
perform specific and non-verifiable investments.
Given our previous claim that non-verifiability is
generally not a constraint, the model resembles the
standard moral hazard model. Providing incentives See Also
for investments meets the same difficulties as in the
previous section. ► p 2..
► Ag»,::r;j '-'ms
Extensions
► Incomplete Contracts
In practice, revelation mechanisms have been
► Mechanism Design
criticized as overly complex, as relying on threats
► i/X Design. J Wtr'l e v * . n t s )
which may either be non-credible or violate limited
► i/'orc.CFaza: d
liability constraints. The so-called incomplete
contracts literature has thus focused on cases where Acknowledgment I thank D. Gromb and J. Pouyet for
helpful comments on an earlier version
such revelation mechanisms are not feasible. In
such environments, either no contract at all or only
a very rough one can be written ex ante. For Bibliography
instance, parties can agree ex ante on a simple
fixed-price/fixed quantity contract which serves as a Armstrong, M., and J. Rochet. 1999. Multidimensional
threat point for the bargaining which takes place ex screening: A user’s guide. European Economic Review
43: 959-979.
post when the state of nature is realized (see Ed I in
Bagnoli, M., and T. Bergstrom. 2005. Log-concave probability
and Reichelstein , among others). and its applications. Economic Theory 26: 445-469.
Contract Theory 2213
Baron, D., and D. Besanko. 1984. Regulation and information Hart, O., and J. Moore. 1988. Property rights and the nature
in a continuing relationship. Information Economics and of the firm. Journal of Political Economy 98: 1119-1158.
Policy 1: 447-470. Hart, O., and J. Tirole. 1988. Contract renegotiation and
Baron, D., andR. Myerson. 1982. Regulating a monopolist Coasian dynamics. Review of Economic Studies 55: 509-
with unknown costs. Econometrica 50: 911-930. 540.
Cremer, J., andR. McLean. 1988. Full extraction of surplus in Holmstrom, B. 1979. Moral hazard and observability. Bell
Bayesian and dominant strategy auctions. Econometrica Journal of Economics 10: 74-91.
56: 1247-1257. Holmstrom, B. 1982. Moral hazard in teams. Bell Journal of
Cremer, J., F. Khalil, and J. Rochet. 1998. Strategic infor- Economics 13: 324-340.
mation gathering before a contract is offered. Journal of Holmstrom, B. 1999. Managerial incentive problems: A
Economic Theory 81: 163-200. dynamic perspective. Review of Economic Studies 66: 169-
D’Aspremont, C., and L. Gerard-Varet. 1979. Incentives and 182.
incomplete information. Journal of Public Economics 11: Holmstrom, B., and R Milgrom. 1987. Aggregation and
25-45. linearity in the provision of intertemporal incentives.
Dasgupta, R, R Hammond, and E. Maskin. 1979. The Econometrica 55: 303-328.
implementation of social choice rules. Review of Economic Holmstrom, B., andR Milgrom. 1991. Multi-task principal
Studies 46: 185-216. agent analysis. Journal of Law, Economics, and Orga-
Dewatripont, M. 1989. Renegotiation and revelation infor- nization 7: 24-52.
mation over time: The case of optimal labour contracts. Holmstrom, B., and R Milgrom. 1994. The firm as an
Quarterly Journal of Economics 104: 489-520. incentive system. American Economic Review 84: 972-991.
Edlin, A., and S. Reichelstein. 1996. Hold-ups, standard Holmstrom, B., and R. Myerson. 1983. Efficient and durable
breach remedies, and optimal investments. American decision rales with incomplete information. Econometrica
Economic Review 86: 478-501. 51: 1799-1819.
Fama, E. 1980. Agency problem and the theory of the firm. Holmstrom, B., and J. Tirole. 1997. Financial intermediation,
Journal of Political Economy 88: 288-307. loanable funds, and the real sector. Quarterly Journal of
Faure-Grimaud, A., J.-J. Laffont, and D. Martimort. 2003. Economics 112: 663-691.
Collusion, delegation and supervision with soft Itoh, H. 1993. Coalition incentives and risk-sharing. Journal
information. Review of Economic Studies 70: 253-280. of Economic Theory 60: 416-427.
Gale, D., and M. Hellwig. 1985. Incentive-compatible debt Jewitt, I. 1988. Justifying the first-order approach to
contracts: The one-period problem. Review of Economic principal-agent problems. Econometrica 56: 1177-1190.
Studies 52: 647-663. Jullien, B. 2000. Participation constraints in adverse-
Gibbard, A. 1973. Manipulations of voting schemes: A selection models. Journal of Economic Theory 93: 1-47.
generalized result. Econometrica 41: 587-601. Laffont, J.-J., and D. Martimort 1998. Collusion and del-
Green, J., and C. Kahn. 1983. Wage-employment contracts. egation. Rand Journal of Economics 29: 280-305.
Quarterly Journal of Economics 98: 173-188. Laffont, J.-J., and D. Martimort 2000. Mechanism design
Green, J., and J.-J. Laffont. 1977. Characterization of with collusion and correlation. Econometrica 68: 309-342.
satisfactory mechanisms for the revelation of preferences Laffont, J.-J., andD. Martimort 2002. The theory of incen-
for public goods. Econometrica 45: 427-438. tives: The principal-agent model. Princeton: Princeton
Green, J., and N. Stokey. 1983. A comparison of tournaments University Press.
and contracts. Journal of Political Economy 91: 349-364. Laffont, J.-J., and E. Maskin. 1979. A differentiable approach
Grossman, S., and O. Hart 1983. An analysis of the principal- to expected utility maximizing mechanisms. In
agent problem. Econometrica 51: 7-45. Aggregation and revelation of preferences, ed. J.-J.
Grossman, S., and O. Hart. 1986. The costs and benefits of Laffont. Amsterdam: North-Holland.
ownership: A theory of lateral and vertical integration. Laffont, J.-J., and J. Tirole. 1986. Using cost observation to
Journal of Political Economy 94: 691 -719. regulate firms. Journal of Political Economy 94: 614-641.
Groves, T. 1973. Incentives in teams. Econometrica 41: 617- Laffont, J.-J., and J. Tirole. 1988. The dynamics of incentive
631. contracts. Econometrica 56: 1153-1175.
Guesnerie, R., and J.-J. Laffont 1984. A complete solution to a Laffont, J.-J., and J. Tirole. 1990. Adverse selection and
class of principal-agent problems with an application to renegotiation in procurement. Review of Economic
the control of a self-managed firm. Journal of Public Studies 57: 597-626.
Economics 25: 329-369. Laffont, J.-J., and J. Tirole. 1993. A theoty of incentives in
Harris, M., and B. Holmstrom. 1982. A theory of wage procurement and regulation. Cambridge, MA: MTT
dynamics. Review of Economic Studies 49: 315-333. Press.
Hart, O. 1983. Optimal labour contracts under asymmetric
information: An introduction. Review of Economic Studies
50: 3-35.
2214 Contracting in Firms
Lewis, T., and D. Sappington. 1 989. Countervailing incen- Tirole, J. 1986. I lierarchies and bureaucracies: On the role of
tives in agency problems. Journal of Economic Theory 49: collusion in organizations. Journal of Law, Economics and
294 313. Organization 2 : 1 8 1 2 1 4 .
Ma, C. 1988. Unique implementation of incentive contracts Tirole, J. 1999. Incomplete contracts: Where do we stand?
witli many agents. Review of Economic Studies 55: 555 Econometrica 67: 741 782.
572. Townsend, R. 1979. Optimal contracts and competitive
Mailath, G., and A. Postlcwaitc. 1690. Asymmetric infor- markets w i t h costly state verification. Journal of Eco-
mation bargaining problems witlr many agents. Review of nomic Theory 2 1 : 4 1 7 425.
Economic Studies 57: 35 1 638.
Maskin, E. 1999. Nasli equilibrium and welfare optimality.
Review of Economic Studies 66: 23-38.
Maskin, E.,and J. Riley. 1684. Monopoly with incomplete
information. Rand Journal of Economics 15: 171 196.
Contracting in Firms
Maskin, E., and J. Tirole. 1990. The principal-agent rela-
tionship with an informed principal. I: Private values.
Econometrica 58: 37l> 410. Canice Prendergast
Maskin, E., and J. Tirolc. 1992. The principal-agent rela-
tionship with an informed principal. II: Common values.
Econometrica 60: 1 42.
Mirrlees, .1. 1071. An exploration in the theory of optimum
income taxation. Review of Economic Studies 38: 175 208. Abstract
Mirrlees, .1. 1999. The theory of moral hazard with This article provides an overview of recent
unobservable behaviour. Part I. Review of Economic
advances in theoretical and empirical work on
Studies 66: 3 22.
Mookheijee, D. 1684. Optimal incentive schemes with many incentive contracting in firms. The specific focus
agents. Review of Economic Studies 51: 4.33 446. is on a variety of reasons why the prediction of
Moore, .1. 1992. Implementation in environments with the early literature on contracting -suggesting a
complete information. In Advances in economic theory, ed.
strong relationship between performance and
J.-J. Laffont. Cambridge: Cambridge University Press.
Moore, .1., and R. Repullo. 1988. Subgame-perfect imple- pay - has not been borne out.
mentation. Econometrica 56: 1191-1120.
Mussa, M., and S. Rosen. 1978. Monopoly and product
Keywords
quality. Journal of Economic Theory 18: 301-317.
Myerson, R. 1979. Incentive compatibility and the bargaining Agency theory; Contracting in firms; External-
problem. Econometrica 47: 61-73. ities; Free rider problem; Incentive contracts;
Myerson, R. 1981. Optimal auction design. Mathematics of Input monitoring; Measurement error; Multi-
Operations Research 6: 58-63.
tasking; Performance-related pay; Principal and
Myerson, R., and M. Satterthwaite. 1983. Efficient mech-
anisms for bilateral trading. Journal of Economic Theory agent; Private information; Risk
28: 61 7.3.
Nalebuff, B., and .1. Slight/. 1983. Prizes and incentives:
Towards a general theory of compensation. Bell Journal
JEL Classifications
of Economics 14:21 4.3.
J410
Rogerson, W. 1985. The first-order approach to principal-
agent problems. Econometrica 5.3: 1357 1368.
ShavcM, S. ll>71). On moral hazard and insurance. Quarterly
Journal of Economics 93: 541 562. In many realms of economic life, the actions of
Smith, A. 1776. The wealth of nations, 1991. New York: individuals affect the welfare of others. Nowhere is
Prometheus Books. this more relevant than in films, where employees
Spear, S., and S. Srivastava. 11>87. On repeated moral hazard act on behalf of owners or shareholders to provide
witli discounting. Review of Economic Studies 54:599-617.
Spence, M. 1673. Job market signaling. Quarterly Journal of
services for customers and clients. This separation
Economics 87: 355 .374. of the interests of employees from those whose
Spence, M. 1974. Market signalling: Informational transfer in actions they benefit has generated a large literature
hiring and related processes. Cambridge, MA: Harvard on incentive contracting, where the overarching
University Press.
objective is the alignment of such interests. The
early literature on agency theory, described in the
first edition
Contracting in Firms 2215
of this volume by Lazear ( ), conceptually the benefit to others by setting [>v = 1. In words,
mimics that on externalities - the other area of efficiency arises when the agent is residual claimant
economics that deals with welfare consequences of on the benefit of others.
actions on others - by showing a variety of ways in This solution, providing a simple prescription
which the compensation of agents can be for how compensation contacts should be designed,
constructed to internalize the effects on one’s is both simple and intuitive. And empirically false.
actions on others. There are two ways of doing this. There are, of course, some occupations where one
First, one could simply tell employees what to do can find evidence of such ‘high- powered
and to penalize them if they fail to do so. In the incentives’, where agents are essentially residual
literature, this is referred to as input monitoring. claimants on output. Indeed, the literature on
While this can sometimes help, it is often hard to agency theory is replete with references to such
monitor either what workers do, or the intensity occupations - taxi cab drivers, franchisees, share-
with which they do so - a salesman on the road cropper farmers and the self-employed. Yet these
would be a good example. Similarly, while over- are exceptions; instead, ‘low-powered incentives’ in
seers can sometimes identify what it is that agents firms are more the norm (see Prendergast for
are doing, they may not know what they should be details). Consequently, one of the quandaries of the
doing - a board of directors monitoring a CEO literature has become why so few workers seem to
would be apposite here. Accordingly, the second have contracts where their pay is strongly linked to
solution to misaligned incentives is to design their performance, and much of the subsequent
literature to that outlined in the first edition of the
compensation plans such that the agent’s pay
New Palgrave has identified relevant constraints on
depends on her contribution - ‘output’ - so that the
incentive contracting.
concerns of other parties are internalized.
The earliest candidate to explain why high-
A simple model can illustrate this point, and is
powered incentives are rare is that high-powered
useful to describe other complications that can arise.
contracts impose risk on workers (Holmstrom ).
The agent is assumed to take some action (‘effort’)
Consider the contract that induces efficient effort
e > 0, which is unobserved by the principal. She is
above: /?„ = 1. The objective of the firm is to
averse to exerting effort. Consider a simple
maximize profits subject to the worker’s willingness
parameterization of the agent’s utility function,
to take the position. This implies that the fixed
where the agent cares about wages w and effort;
component, /i0, is changed to guarantee that agents
assume that the agent has exponential utility V =
earn their reservation utility, so the principal’s
cxp[ r(w — C(e))], where w is the worker’s wage, r objective becomes a surplus maximization exercise.
> 0 is the constant rate of absolute risk aversion, the When the worker is risk neutral, the fixed
worker’s cost of supplying effort is C(e) = and her component is reduced sufficiently such that the total
reservation utility is U*. To focus attention on the compensation cost is U * + %. In words, the only
role of output contacting, assume that the principal cost that the employer incurs in addition to U* is the
cannot observe effort e (so monitoring of inputs is effort cost. This is not true when the worker is risk
not possible), but instead only observes a signal on averse. In the context of the preferences V above,
effort y = e + £, where e ~ N(0, a2), with a2 compensation costs increase when incentive
representing measurement error. Assume also that contracts are used for two reasons - the cost of
the principal chooses to reward the agent in a linear increased effort as above, but also a risk cost
fashion on output - a piece rate: w = fj0 + ft vv. imposed on workers. Both costs are increasing in
(There is a large literature on the optimal shape of [y. With exponential preferences and linear
compensation contracts - see Prendergast ; contracts, this trade-off results in the optimal
Gibbons , for an overview). Then there is a simple contract being /?* = l+lrcai ■ This approach to
solution to attaining efficient effort: choose the studying incentive contracting has become knows as
contact to internalize the ‘trade-off of risk and
2216 Contracting in Firms
incentives’, where firms trade off the benefits of well they do relative to each other. Yet empirically
great effort with higher compensation costs induced there is relatively little evidence of such
by a risk premium, such that the chosen level of benchmarking (for example, see Janakiraman et
effort falls below the level that internalizes benefits al. ).
to others. Only in the case where there is either no A second limitation on incentive contracting
measurement error (a2 = 0) or risk neutrality (r = 0) arises when measures do not reflect the objectives
does efficient effort arise. of the principal. Workers often carry out a host of
At its most general, this costliness of exposing a activities in their jobs, yet measures of performance
worker to large degrees of risk (or its analogue, may not reflect all these aspects. A good example of
liquidity constraints) surely explains some part of this would be measuring the performance of a
the absence of high-powered incentives. In much teacher. While measures may be available on some
the same way as financial assets with higher component of what they do - such as test scores for
undiversifiable risk require higher expected returns, a teacher - many important aspects may remain
so also are risky jobs likely to demand higher unmeasured. When contracts are designed on the
compensation. Despite this, the empirical literature subset of things that can be measured, there is a
on how compensation contracts trade off such risk danger that they ignore the unmeasured aspects. For
issues against higher effort has shown little instance, there is evidence of teachers ‘teaching for
evidence in its favour. There are two principal the test’ or cheating to achieve higher test scores
empirical implications of the theory. First, riskier (Jacob and Levitt ). This phenomenon has become
environments should have lower incentives /;,* known as multitasking (Holmstrom and Milgrom ),
which becomes potentially important when there is
declines with a2. There have been many studies of
no single measure that reflects the contribution of
the relationship between risk and the strength of
an agent. Accordingly, it is not surprising that a
incentives in a variety of occupations. If anything,
consistent empirical finding is that jobs which are
this literature suggests that the relationship between
described by firms as complex tend not to offer
risk and the provision of incentives is positive rather
significant incentive pay (see Prendergast , for
than the negative relationship posited by this theory.
details).
See Prendergast ( ) for details and an explanation
Another limitation on the ability of firms to
as to why
provide incentives to workers comes from team
this may be. Second, the trade-off of risk and
production. Measures of performance for most
incentives implies that compensation should not
workers reflect not only what they do but also the
depend on measures that workers cannot control.
contributions of others. In itself, this does not
Again, this has found little support in the data. For change the calculus above in any conceptual sense,
example, Bertrand and Mullainathan ( ), have other than that the measurement error now includes
examined executive contracts in the United States, the actions of others. As an example, assume that
and found little evidence that contracts reward two agents (1 and 2) work on a team and that output
executives any less for measures that they cannot measures the true contributions of both plus an error
control (say, where an oil company’s profits change term y = ei + e2 + e. Efficient effort arises as before
simply because the price of crude changed) than for by setting [iv = 1 for each worker. However, there is
those that they can (such as a merger). More now a potential problem of budget breaking, where
evidence on this failure to filter out uncontrollable marginal payments exceed marginal output. In this
factors concerns the infrequency of relative example, when total output rises by one dollar,
performance evaluation. Consider two sales-force compensation costs increase by two dollars. In
workers (or executives) who carry out a similar job. many firms - for instance, partnerships - such
If demand for the products that they sell varies for budget breaking is not possible. If instead the
common reasons beyond their control, an efficient principal
way of limiting risk exposure is to (at least
partially) reward the workers on how
Contracting in Firms 2217
can pay out no more than one dollar for every dollar an incentive not to implement the (ex ante) efficient
extra on output, this naturally places an upper contract by reneging on a promised payment to save
bound of /(,. ^ on average for the agents. Hence, costs. Thus, even though an agent exerts effort and
budget balancing places a natural limitation on firm performs well, the supervisor claims otherwise to
incentives. This also leads to a free rider problem in keep costs down. This can arise either by outright
teams, where maximum incentive compensation in lying or perhaps by manipulating whatever
an N member team mechanically declines as N measures are available. A relevant example here is
increases (This is known as the ‘^problem’). There the movie industry, where actors are sometimes
is also considerable empirical evidence (such as paid on the ‘net profits’ of a film. As a result, there
Gaynor and Pauly ) on such free riding - mostly have been numerous court cases regarding firms
from legal and medical partnerships - illustrating using creative transfer pricing arrangements to
how various measures of performance disimprove reduce profits for very successful movies. See
as the size of the team being rewarded increases. Cheatham et al. ( ) for more
Many measures of output are not denominated details on this. Such incentives to renege are likely
in dollar terms, but instead come in the form of even worse when there are no objective measures of
evaluations by others. For instance, it would be performance. Because the desire to renege is greater
difficult to measure the contribution of a social when discretionary incentive payments are higher, it
worker or a customer service representative without follows that the only credible contracts often
using feedback from supervisors or clients. Another involve few incentives (Clive Bull considers a role
limitation on contracts arises when such subjective for repeated interaction between the principal and
measures can be corrupted by evaluators with agent as a means of reducing incentives to renege.
vested interests. Two particular sources of such While repeating the relationship can result in
vested interests have been considered in the sufficient incentives for complete honesty by the
literature. First, information on performance often principal, it remains the case that, if the
originates with clients as they are the only ones with relationship’s value is not sufficiently great,
first-hand experience of the agent’s efforts. For incentives must be muted to reduce incentives for
instance, compensation for many customer service cheating).
representatives depends on client evaluations. When It is incorrect to assume that the ability to
clients have relatively similar preferences to the manipulate measures of performance always mute
principal - such as that the agent should be incentives - sometimes it can result in incentive pay
courteous and efficient - contracts based on being inefficiently high. Consider again two
evaluations can mimic the objective contracts occupations where agents are typically residual
above. Yet in other instances, the vested interest of claimants - taxi drivers and sharecroppers. At first
clients can render incentives difficult to implement. blush, it would seem odd that they have such
A good example of this arises in occupations such extreme incentives. Aren’t these as likely candi-
as police or immigration control, whose objective is dates for trading off risk against incentives as any?
not necessarily to make their clients happy. Making However, one characteristic of each of these occu-
pay depend on evaluations in these instances can be pations is that they have opportunities for hiding
harmful as it gives agents incentives to keep clients output, either by taking fares without using the
happy when they should not, such as a police metre (in the case of cab drivers) or selling crops
officer not arresting a suspect. In these cases, privately (in the case of farmers). In both cases, the
incentive contracting on evaluations typically needs only outcome that makes this incentive irrelevant is
to be curtailed to avoid such incentives (see to render them residual claimants, even if risk
Prendergast , for details). considerations would suggest otherwise.
The second example of vested interests with Another issue which can constrain efficient
subjective evaluations is where the principal has incentives, yet which has received almost no
attention in the empirical literature, is where
2218 Contracting in Firms
agents hold private information. Take a specific that in many occupations agents enjoy carrying out
instance - real estate agents. In Chicago, real estate the activity or care about the outcomes of their
contracts take a simple form - agents make three per actions. As a result, they will exert effort beyond
cent of the sales price of the house. Assume that my that which they can get away with even in the
home is worth $500. This linear contract not only absence of incentive contracts. This, in itself, is not
offers only three per cent on the relevant margin for enough to limit incentive contracting. However,
improving the selling price of the house, but there is some psychological evidence that agents
predominantly rewards the agent for selling the enjoy their jobs less when incentive contracting is
house for say $450. Yet anyone could sell the house used. In effect, they feel that they are only doing it
for $450 and it seems highly inefficient to reward in ‘for the money’ and hence lose interest. A
this way. So why not renegotiate to something commonly cited example is the willingness of
better? An example of such an improvement people to donate blood, where the warm feeling
(subject to risk issues) would be to offer nothing on from donating declines when payments are made. In
the first $450, but to pay a piece rate of 30 per cent some instances, this can imply that incentive
on anything over $450. In this way, the agent has contracting can reduce effort if these crowding out
more incentives on margin, yet breaks even relative effects are strong enough. As a result, it can be
to the original contract if the house sells for its optimal to provide no incentives even when effort is
original price. one-dimensional. This area of research, whose
One reason why such renegotiation does not empirical testing has largely been restricted to the
arise is that the agent may privately know the true laboratory, is still in its early stages and is likely to
value of the home, while the owner believes it to be see much refining over the coming years. See Frey
worth $500 on average. Consider a homeowner who and Jegen ( ) for a
offers the new contract above to the agent. It is clear survey.
that the agent rejects the new contract if it is truly Another likely fruitful area of future research
worth less than expected, and accepts it if worth concerns non-monetary ways of motivating
more. But this implies that the agent earns workers. This literature largely began as an exercise
information rents on average. As a result, on aver- in how workers could be motivated to internalize
age the homeowner loses money from the renego- the benefits of others, yet has almost exclusively
tiation unless effort increases enough. This option become an exercise in how to motivate through
available to the agent limits the ability of contracts monetary contracting. Yet it is clear that there are a
to attain efficiency. Instead, in the usual monopoly myriad of means of motivating workers - sense of
fashion, the homeowner would offer a contract to achievement, ‘doing good’, status, and so on - that
trade off the efficiency gains of increased effort firms tap into. How such mechanisms operate, and
with infra-marginal losses of the type described the way in which they interact with monetary
above, resulting in lower-powered incentives (There contracts, remain an unstudied topic of research,
is a large mechanism design literature on this topic though see Besley and Ghatak ( ), for some
that has largely been ignored by the empirical theoretical work on
literature on incentives; see Laffont and Tirole , for this issue.
example. This is surely partly because of the It is worthwhile to note a caveat before con-
empirical conundrum as to why mechanisms are so cluding. The discussion above concerns the absence
rare in reality). of observed incentive contracts. Yet workers often
Much of the recent literature has been focused have unobserved carrots and sticks that can
on how incentive contracts can cause adverse motivate them. For instance, many workers exert
behavioural responses. Another possible mechanism effort in the hope of attaining a promotion (Lazear
for such responses is where intrinsic motivation can and Rosen ), or a better job offer (Holmstrom ).
be crowded out by the use of incentive contracts. Many ofthese mechanisms for inducing desired
The premise of this literature has been behaviour are dynamic, where good performance
today results
Contradiction 2219
in a greater likelihood of promotion, or better job Frey, B., and R. Jegen. 2001. Motivation crowding theory: A
offers in future. Such incentives are clearly impor- survey of empirical evidence. J o u r n a l of
E c o n o m i c S u r v e y s 15: 589-611.
tant for workers. However, it remains the case that Gaynor, M., and M. Pauly. 1990. Compensation and pro-
explicit incentive payments remain limited even in ductive efficiency in partnerships. Evidence from medical
those cases where the above types of career concern group practice. J o u r n a l o f P o l i t i c a l E c o n o m y
are negligible (For example, it is well known that 98: 544-574.
Gibbons, R. 1996. Incentives and careers in organizations. In
promotion prospects become very limited for Advances in economics and
workers who remain in a job grade for a long econometrics: Theory and applications,
period. Yet explicit incentives are no more common ed. D. Kreps and K. Wallis. Cambridge: Cambridge
for those workers than for any other). The University Press.
Holmstrom, B. 1979. Moral hazard and observability. B e l l
interaction of unobserved (typically career) J o u r n a l o f E c o n o m i c s 10: 74—91.
incentives with the more exphcit set of piece rates Holmstrom, B. 1999. Managerial incentive problems: A
and bonuses that have been considered above is dynamic perspective. R e v i e w of Economic
surely of first-order importance to firms, though it S t u d i e s 66: 169-182.
Holmstrom, B., andP. Milgrom. 1992. Multi-task principal
remains surprisingly unexplored in the literature
agent analyses: Linear contracts, asset ownership and job
(see Baker et al. , for an exception). design. J o u r n a l o f L a w , E c o n o m i c s , a n d
To conclude, perhaps the central foundation of O r g a n i z a t i o n 7: 24-52.
modem economics is the idea that appropriate Jacob, B.A., and S.D. Levitt. 2003. Rotten apples: An
investigation of the prevalence and predictors of teacher
prices guide behaviour in efficient ways. Despite
cheating. Q u a r t e r l y J o u r n a l o f E c o n o m i c s
this, one of the defining characteristics of the 118: 843-877.
employment relationship in many firms is the Janakiraman, S.N., R.A. Lambert, and D.F. Larker. 1992. An
absence of the kind of exphcit prices whereby empirical investigation of the relative performance
evaluation hypothesis. J o u r n a l o f A c c o u n t i n g
wages depend in a clear way on observed outcomes.
R e s e a r c h 30: 53-69.
The early incarnations of agency theory were Laffont, J.-J., and J. Tirole. 1986. Using cost observation to
concerned with designing prices in a way that could regulate firms. J o u r n a l o f P o l i t i c a l E c o n o m y
serve to fully internalize the effects of agents’ 94: 614-641.
Lazear, E.P. 1987. Incentive contracts. In T h e N e w P a l -
actions on the welfare of their employers. Yet this
g r a v e : A D i c t i o n a r y o f E c o n o m i c s , ed. J.
initial optimism has now been tempered with a Eatwell, M. Milgate, and P. Newman, vol. 2. London:
somewhat more nuanced view that shows trade-offs Macmillan.
that will ultimately help in defining more precisely Lazear, E., and S. Rosen. 1981. Rank order tournaments as
optimal labor contracts. J o u r n a l o f P o l i t i c a l
the nature of labour market relationships.
E c o n o m y 89: 841-864.
Prendergast, C. 1999. The provision of incentives in firms.
Journal of Economic Literature 37: 7-63.
Bibliography Prendergast, C. 2002. The tenuous trade-off between risk and
incentives? J o u r n a l o f P o l i t i c a l E c o n o m y
Baker, G., R. Gibbons, and K.J. Murphy. 1994. Subjective 110: 1071-1102.
performance measures in optimal incentive contracts. Prendergast, C. 2003. The limits ofbureaucratic efficiency.
Q u a r t e r l y J o u r n a l o f E c o n o m i c s 109: 1125-1156.
Bertrand, M., and S. Mullainathan. 2001. Are CEOs
rewarded for luck? The ones without principals are.
Q u a r t e r l y J o u r n a l o f E c o n o m i c s 116: 901-932.
Besley, T., and M. Ghatak. 2005. Competition and incentives Contradiction
with motivated agents. A m e r i c a n E c o n o m i c
R e v i e w 95: 616-636.
Michael Dummett
Bull, C. 1987. The existence of self-enforcing wage contracts.
Q u a r t e r l y J o u r n a l o f E c o n o m i c s 102: 147-
159.
Cheatham, C., D. Davis, and L. Cheatham. 1996. Hollywood
profits: Gone with the wind? C P A J o u r n a l 12: 32-34.
The fundamental form of a contradiction is a pair of
propositions, ‘A’ and ‘Not A’, one the negation of
the other. If such an explicit contradiction is
2220 Contradiction
part of a body of propositions asserted by some absurdum. In conjunction with premisses, ‘BC and
individual or group at a given time, it follows that ' B 2 ‘ , say asserted outright, a proposition ‘C’ is
not all those propositions can be true: by thus presented as a hypothesis: not as a conjecture or
impairing the reliability of the proponent, the supposition to be seriously entertained, but with an
occurrence of the contradiction throws doubt upon eye to proving its negation ‘Not C’. From the
the truth of all the other propositions asserted. premisses ' B ^ and ‘ B 2 ’ and the hypothesis ' C \
Far more frequent than an overt or explicit two contradictory propositions 'A' and 'Not A' are
contradiction is a hidden contradiction. A hidden then derived: on the basis of the two premisses, the
contradiction is contained in a body of propositions hypothesis now being dropped, its negation ‘Not C’
when, by logically valid deductive reasoning, an can now be definitely asserted. Two special cases
explicit contradiction can be derived from them. fall under this general description. One is that under
Two quite different cases arise, according to the which ‘A’ is actually one of the two premisses, say
stafris of the original propositions from which the ‘B] If from premisses ‘ B 2 ’ and the hypothesis ‘C\
contradiction was derived. The first is that in which, the conclusion 'Not B \ can be derived, one may
as before, those propositions were all asserted by an assert 'Not C ' on the strength of the two premisses.
individual or body of people. In this case, the hidden The other special case is that in which 'A' is the
contradiction is as fatal to the joint correctness of hypothesis ‘C\ itself. If from premisses ‘ B i ’ and
those assertions as is the explicit one, although, of ' B 2 ‘ and the hypothesis ‘C\ the negation 'Not C’ of
course, considerable work may have had to be the hypothesis can be derived, ‘Not C ' may be
expended in bringing it to light. asserted outright on the strength of the two
The second ease is that in which at least some of premisses.
the original propositions were not asserted, but As is well known, reductio ad absurdum argu-
merely advanced as suppositions to be considered. ments are frequent in mathematics. One of the
The formal conclusion now remains exactly the simplest, as well as historically most important, is
same: given that the contradiction was validly the proof that 2 has no rational square root. We
derived, not all the original propositions can be true. may take as the premisses: (Bi) any rational number
The effect, of course, is very different. Suppose that may be represented as a fraction whose numerator
the original propositions - the premisses from which and denominator have no common factor; and (B2)
the contradictory pair ‘A’ and ‘Not A' have been the square of an odd number is odd. As the
derived - were four in number: ‘Bf, ‘B 2\ ‘CI’, and hypothesis we may take: ( C ) 2 has a rational
‘C2’; and suppose that ‘BC and ‘B 2’ were asserted square root. From (BO and ( C ) it follows that, for
outright, but that ‘Q ’ and ‘C 2’ were merely some integers m and n, m and n have no common
advanced for consideration. The contradiction factor and (m/n)2 2; hence
shows that not all four can be true: without having m2 = 2rf. Hence, by the definition of ‘even’, m2 is
to withdraw anything that he asserted, the even, and from (B2) it can be inferred that m is not
proponent, still maintaining ‘Bi’ and ‘B2’, is now in odd, and hence is even. Thus m = 2k for some k,
a position to assert 'If Ci, then not C2’ (or its and we have: m2 = 4Ir - In2, and so 2k2 = n2.
equivalent, ‘If C2, then not Cf). Applying (B2) once more, wc infer that n is also
For this reason, derivation of a contradiction can even. Since m and n arc both even, they have 2 as a
be employed, not merely as a means of refuting the common factor, which contradicts the earlier
assertions of another, but as a method of stipulation that they have no common factor. By
demonstrating negative propositions: this is the arriving at this contradiction, wc have achieved a
celebrated mode of argument reductio ad demonstration that 2 has no rational square root,
that is, a proof of the negation of our hypothesis
(C).
Contradiction 2221
The derivation of a hidden contradiction from foundations for the theories of natural numbers and
premisses all definitely asserted may also be illus- of real and complex numbers, just when he believed
trated from mathematics. The most celebrated that he had accomplished it; and it cast the
examples are the set-theoretic paradoxes which foundations of mathematics into confusion for some
provoked the first ‘crisis in the foundations’ of years. By showing that assumptions (1) and (2)
mathematics in the early years of this century. Some cannot both be consistently made, it challenged
of these, such as Burali-Forti’s paradox and mathematicians and logicians to find weaker
Cantor’s paradox, demand some technical assumptions that would be consistent: for which
background; but Russell’s paradox has the advan- properties F do assumptions (1) and (2 ) hold good?
tage of being statable in very simple terms. The This question proved exceedingly difficult to
propositions leading to this contradiction appear at answer: hence the ‘crisis in the foundations’.
first sight entirely harmless. They are: (1) to every Why is the discovery of a hidden contradiction
property there corresponds a class, the class of so devastating? One answer is that, if a pair of
things having that property; (2) the class of things contradictory propositions are both consequences of
having a given property F contains as members all a given set of assumptions, then every proposition is
and only those things that have the property F. a consequence of them. This holds good in virtue of
Russell’s contradiction arises from considering the the logical law, known to the medievals as ex fa Iso
property G of being a class that does not contain quodlibet, that, from a pair ofpremisses ‘A’ and ‘Not
itself as a member. By assumption (1), there exists a A’, any proposition ‘B’ may be inferred. This law
class, which we may call W, of things having the appears at first glance both useless and implausible;
property G. We now ask whether W contains itself a natural first reaction is therefore to suggest
as a member. It may then be argued by reductio ad repudiating it. That, however, is not easily done,
absurdum that it does not: we have first to suppose, since it is a consequence of two other, seemingly
as a hypothesis, that it does. By assumption (2), it inescapable, laws concerning the logical constant
therefore has property G. But properly G is the ‘or’. The first is the law now usually known as ‘or’-
properly of being a class that does not contain itself: introduction or disjunction-introduction, that, for
it follows, contrary to the hypothesis, that W does any propositions ‘A’ and ‘B’, ‘A or B’ follows from
not contain itself. We may thus discard the hypothe- ‘A’. The second is that known in the traditional
sis, and assert outright that W does not contain logic as modus tollendo ponens, that ‘B’ follows
itself. Applying assumption (2) once more, we may from the premisses ‘A or B’ and ‘Not A’. It is quite
then conclude that W lacks property G. It therefore obvious that, from the premisses ‘A’ and ‘Not A’ of
is either not a class or does contain itself as a the ex falso quodlibet, its conclusion ‘B’ can be
member; since it is by definition a class, it contains reached by first applying ‘or’-introduction and then
itself as a member. This now contradicts our earlier modus tollendo ponens: but it is very hard to see
conclusion that it does not contain itself as a how we could possibly reject either of the two latter
member: the two intuitively reasonable assumptions laws, for they appear absolutely constitutive of the
(1) and (2) have led to a contradiction. meaning of the connective ‘or’. The best possible
Russell’s contradiction is so simple to state and way to establish the truth of a disjunction - a
so easy to derive that those who are unaware of the statement of the form ‘A or B’ - is to establish the
importance in mathematics of the notion of a class truth of one or other of the disjuncts ‘A’ and ‘B ’: if
or set are liable to think it no more than a trivial the truth of ‘A’ does not guarantee the truth of ‘A or
verbal puzzle. It is very far from trivial. It over- B ’, what does ‘or’ mean? Conversely, the truth of a
threw the life’s work of the great logician Gottlob disjunction appears to demand that at least one of
Frege, namely to provide unquestionably firm the disjuncts be true: so, if ‘A or B’ is