Professional Documents
Culture Documents
The Continuous Prisoners Dilemma. Linear Reactive Strategies
The Continuous Prisoners Dilemma. Linear Reactive Strategies
The Continuous Prisoners Dilemma. Linear Reactive Strategies
We present a general model for the Prisoner's Dilemma in which variable degrees of coopera-
tion are possible, and payo!s are scaled accordingly. We describe a continuous strategy space,
and divide this space into strategy families. We derive the payo! function for these families
analytically, and study the evolutionary outcome when a wide range of strategies play against
each other. Our results show that the initial degree of cooperation o!ered by a strategy is
a decisive factor for evolutionary robustness: the most successful strategies in our model o!er
full cooperation as an initial move, but thereafter cooperate fully only if their opponent does
the same. These strategies gradually raise the stakes when playing a strategy which is initially
reticent to cooperate, but di!er from the strategies predicted by other continuous models in
that they are not only generous, but are also consistently optimistic and uncompromising.
1999 Academic Press
We de"ne a payo! function, F(S, S) which a game between two members of this strategy
corresponds to the mean payo! per round that family move the play towards the value given by
strategy S receives when playing against S. The the superscript. For example, P is the set of all
payo! is clearly a function of the slopes, inter- cooperators, as described above. In a game be-
cepts and starting moves of the two strategies, of tween any two players in the family P, the play
the cost c and bene"t b, and of the total number will move towards either zero or one, depending
of rounds in the game. In this paper, we discuss on the starting move. For N, the play will move
two treatments of the number of rounds: the case towards a repeated sequence of alternating zero
when the total number of rounds, n, is "xed; and and one. In the central region of the plane, de-
the case when nN denotes the average number of noted M, are those strategies which move the
rounds per game, such that the probability of play towards some intermediate value between
a further move after each round of the game is zero and one. Thus, if two strategies M and
IB
given by (1!1/nN ). M play against each other, the moves played
IYBY
Because the payo! will also depend on whether by strategy M will approach x"kx#d, while
IB
S or S moves "rst, we de"ne the payo! function the moves played by M approach x"
IYBY
as the average payo! between these two cases. As kx#d. Solving, we "nd x"(kd#d)/(1!kk)
a limiting case, we sometimes consider the payo! and x"k(kd#d)/(1!kk)#d (We will return
of an in"nitely iterated game (payo! averaged to this interesting result in Section 3.2.).
over n rounds as nPR). For example, in the In analogy with the cooperators, we de"ne
in"nite game F(AllC, AllC)"(b!c), and like- &&defective'' strategies as those strategies which lie
wise F(AllD, AllD)"0. We also "nd that F(I, I)" completely below the identity line, for which
(b!c), because the starting move for I is de"ned S(x)(x for all x. We see that after some "nite
to be 1. For other strategies which lie on the number of moves both players will play zero
identity line, the payo! when S plays consistently, and that the mean payo! when two
V
S will be equal to 1/2(b!c) (x #x ). such strategies play each other will approach
VY
zero as nPR. The set of defectors is the family
P in Fig. 1. Indiscriminate cooperators, and
2.1. STRATEGY FAMILIES
indiscriminate defectors, lie along the boundaries
We de"ne &&cooperative'' strategies as those of the regions above and below the unit square,
which lie entirely on or above the line of identity, respectively.
i.e. those strategies for which S(x)*x for all x. Figure 2 shows the payo! function for each of
When two such strategies play against each these strategy families. We show here the payo!
other, each successive move is greater than the received in the limit when a strategy plays an
opponent's previous move, and after some in"nite game against itself, determined analyti-
"nite number of moves both players will co- cally. For those regions of the plane in which the
operate fully. The mean payo! over an in"nite payo! in the in"nite game depends on x , we
game between cooperative players is thus (b!c). plot the mean payo! for a mix of strategies with
For cooperators, k*0; the slope of the line is x uniformly distributed on [0, 1]. We have not
positive, and repeated rounds move the play been able to determine an analytical expression
closer to one. for the payo! between two arbitrary linear react-
The space of all possible strategies is a three- ive strategies in either the "nitely or in"nitely
dimensional polyhydron in k, d and x . We can iterated game.
subdivide this space based on the qualitative fea-
tures of the strategies; in particular, we "nd that 2.2. THE ROBUSTNESS OF COOPERATION
the most important factors are whether k is
greater than zero, and which sides of the unit We are interested in which cooperative strat-
egies, P can resist invasion by AllD, that is,
square the strategy intersects. This classi"cation IBV
on the plane of k and d is shown in Fig. 1. Here, under whatconditions is
strategy families denoted P or N have positive or
negative slopes, respectively. Repeated rounds of F(AllD, P )(F(P , P ). (1)
IBV IBV IBV
310 L. M. WAHL AND M. A. NOWAK
c
d(1! . (2)
b
x #(2n!1)d
b 'b!c. (3)
2n
c
x '2n 1! !(2n!1)d. (4)
b
payo!, F(S, S) is shown for the limiting case of an in"nite
game, for b"2 and c"1. In those regions of the plane in 2n c
d' 1! . (5)
which the payo! in the in"nite game depends on x , the
2n!1 b
mean payo! for a mix of strategies with x uniformly distrib-
uted on [0, 1] is shown. Note that the maximum payo! is
achieved by the family of cooperators, P. Thus, for the "nite game with n moves, a coop-
erator P can be invaded by AllD if d is
IBV
For cooperative strategies, the response to an larger than (1!c/b) by a su$cient margin, and
opponent's defection, y"S(0), is given by the may be invaded by AllD if d is less than the same
intercept, d, and thus the payo! received by AllD value but x is above the theshold given in
when playing a given cooperator will depend on eqn (4).
CONTINUOUS PRISONER'S DILEMMA 311
h(b!c)!(1!h)cd'hbd, (6)
which reduces to
cd
h' . (7)
(1!d) (b!c)
3. Adaptive Dynamics
3.1. THE EVOLUTION OF x
We are interested in which starting move, or
initial investment, is &&best'' for a given strategy.
To determine this, we held k and d "xed and
allowed x to evolve (Maynard Smith, 1982),
i.e. we start with an initial strategy S
IBV
and consider a small perturbation in the start- FIG. 3. Steady-state values of x . The surfaces show the
ing move, x "x #dx. If F(S ,S )' steady-state values of x (x' ) when both k and d are "xed. In
IBVY IBV
these simulations, strategy S played a single game
F(S ,S ) and F(S ,S )'F(S , IBV
IBV IBV IBVY IBVY IBV against S where x "x #dx, and b"5, c"1. We set
IBVY
S ), it is clear that S will invade and dx to 0.02 or !0.02 when x was initially set to 0 or 1,
IBVY IBVY
takeover a population of S : we let x respectively. Games which were 20 rounds long on average
IBV (nN "20) were modelled by simulating 100 rounds and
replace x and continue.
weighting the payo! in round i by the probability that
The results of these simulations are shown in a game would last to round i, (1!1/nN )G\. We note that
Fig. 3. These surfaces plot the steady-state values xL "1 is robust for a signi"cant fraction of the strategy
of x (x' ) when x is originally set to be 0 or 1 in space, and in particular we "nd a region of strategy space in
which x evolves towards one, regardless of its initial value.
the simulation. We note that x' "1 is robust for
a signi"cant fraction of the strategy space, and in initial value. This region appears in the upper
particular we "nd a region of strategy space in corner (k is high) of strategy family M. The
which x evolves towards one, regardless of its steady-state values of x for strategy families
312 L. M. WAHL AND M. A. NOWAK
N, N and N were zero, regardless of the the condition k!(b/c)k#1'0 is met (see
starting value. Appendix A).
We are particulary interested in the steady-
3.2. TRAJECTORIES THROUGH STRATEGY SPACE:
state x value for the cooperative strategies, fam-
THE EVOLUTION OF k, d AND x
ily P. We have found (refer to Appendix A) that
for most strategies in this family, the surface We repeated these simulations, allowing k, d
shown in Fig. 3 is described by the equation and x to vary simultaneously. For each strategy
x' "(1!d )/k, that is, x evolves from any initial S we considered strategies distributed in
IBV
value to the lowest x which elicits full coopera- a sphere around that strategy, and accepted the
tion in a single move (S(x' )"1 for x' " parameter values of the strategy S with the high-
(1!d )/k). The best starting move for cooper- est payo! against S , under the conditions
IBV
ators is not full cooperation but just enough to that F(S, S)'F(S, S) and F(S, S)'F(S, S)
move the play to full cooperation in the next (and (strategy S invades and takes over a population
consequently every subsequent) move. Among of S).
cooperators, full cooperation is the best initial Figure 4 illustrates two examples of such
move (x' "1) only for those strategies along the evolution through strategy space, where a given
boundary of P and M (these strategies intersect starting strategy has been followed for 1000
the unit square at [1, 1] ). successful mutations. The circle on each graph
We have also observed in some cases that shows the value of x for the strategy illustrated.
when the initial value of x is below some critical In the "rst example, the simulation was seeded
value, it may reach a stable equilibrium in region with a fairly uncooperative strategy, for which
P for intermediate values of x (0(x' ((1!d )/k). k"0.51, d"0.2 and x "0.35. Despite the low
This occurs when full cooperation is elicited not values of x and d, however, the strategy evolves
in the partner's "rst but in a subsequent move of towards a cooperative steady state [S(x)'x, ∀x],
the game, when for example S(S(x' ))"(1!d )/k. and x evolves towards one. Note that in this
For instance, in the "nite game with n moves with example the steady-state strategy is along the
a starting x of zero, the evolution of x may halt boundary of strategy families M and P, since the
when S(S(x' ))"(1!d)/k because the cost of strategy intersects [1, 1] on the unit square.
a further increase in x outweighs the bene"t; In contrast, when seeded with a highly cooper-
we can show that this situation occurs when ative strategy (k"0.3, d"0.8 and x "1) which
FIG. 4. Examples of strategy evolution. Two examples of the evolution of a given starting strategy are shown; all three
parameters of the starting strategy were allowed to evolve. Each panel plots the value of S(x) for 0)x)1; the circle on each
graph shows the value of x for the strategy illustrated. Leftmost panels plot the starting strategies and subsequent panels
show the strategy after 25 through 1000 successful mutations (indicated by the bold numbers above each column). We used
b"2, c"1, and nN "20 for these simulations. In the "rst example, the simulation was seeded with a k"0.51, d"0.2 and
x "0.35. Despite the low values of x and d, this strategy evolved towards a cooperative steady state [S(x)'x, ∀x], and
x evolves towards one. The second example was seeded with a highly cooperative strategy (k"0.3, d"0.8 and x "1), but
evolved towards AllD.
CONTINUOUS PRISONER'S DILEMMA 313
o!ers full cooperation as an initial move, the Lines indicate the direction of the evolution
strategy evolves initially to an even more co- through strategy space. Where open circles ap-
operative state (dP1), but x gradually de- pear without visible evolutionary trajectories
creases, and eventually x , k and d all evolve to in this plane, the trajectory is into the page;
zero; the steady state is AllD. x evolves towards smaller values in regions
These examples are illustrative of two broad P, P and N.
classes of steady-state strategies that we ob- This "gure reveals a fairly complex set of tra-
served: (i) cooperative strategies that intersect jectories on the x "1 plane. For the majority of
[1, 1] on the unit square and have fairly low strategies within P and M, and for all strategies
intercepts; and (ii) indiscriminate defectors. To in N and N, strategies evolve towards the lower
examine the evolutionary trajectories through left boundary of the space, or towards indiscri-
strategy space more generally, we seeded strat- minate defection. For sections of P and M
egies at uniform intervals in k, d and x , and where k is su$ciently large, however, strategies
followed the evolution of each strategy for a small evolve towards the upper left; they become
number of mutations. Figure 5 shows one plane more cooperative and move towards the line
through the results of this simulation, the d}k k#d"1.
plane at the level x "1. Open circles show the The crosses in this "gure indicate strategies
initial positions of the strategies investigated. that are not invaded by their neighbors in strat-
egy space. We are especially interested in the set
of crosses which lies along the boundary between
M and P, since these represent cooperative strat-
egies which are stable with respect to the adap-
tive dynamics. For this set of strategies kx#d"1
when x"1; as in the "rst example of Fig. 4, they
intersect the unit square at [1, 1].
Some insight into this set of strategies can be
obtained analytically, particulary for the limiting
case of an in"nite game. Consider a set of strat-
egies on the d}k plane (at x "1) with the prop-
erty that k#d"1. We denote one such strategy,
S , and a near neighbor S . If the neigh-
IBV IYBYVY
bor is a member of M, then (as demonstrated
previously) the play moves towards x"(kd#d )/
(1!kk) and x"k(kd#d )/(1!kk)#d. In
FIG. 5. Strategy trajectories, x "1. Strategies were the limit of an in"nitely repeated game, the mean
payo! F(S ,S ) is equal to bx!cx. For
seeded at uniform intervals in k, d and x and were allowed
to evolve through two successful mutations; all three para-
IYBYVY IBV
S to be invaded by its neighbor, we "nd
meters of the starting strategies were allowed to evolve. For IBV
this "gure we used b"2, c"1 and n"20, the results for
nN "20 were qualitatively identical. The "gure illustrates the F(S ,S ))F(S ,S ), (8)
d}k plane at the level x "1. Open circles show the initial IBV IBV IYBYVY IBV
positions of the strategies investigated, and these are connec-
ted by solid lines to the "nal position of the strategy after b!c)bx!cx (9)
two mutations. Crosses indicate strategies that are not in-
vaded by their neighbors in strategy space. Where open
circles appear without visible evolutionary trajectories in which, substituting, reduces to
this plane, the trajectory is into the page; x evolves towards
smaller values in these regions. For the majority of strategies
within P and M, and for all strategies in N and N, c
k) (10)
strategies evolve towards the lower left boundary of the b
space, or towards indiscriminate defection. For sections of
P and M where k is su$ciently large, however, strategies
evolve towards the upper left; they become more co- under the conditions that k(1 and k#e(1,
operative and move towards the line k#d"1. which are true for members of M.
314 L. M. WAHL AND M. A. NOWAK
FIG. 8. Evolution of F(S, S), x , k and d for 2500 successive strategies. The value of the payo! function and the three
parameters of the strategy are shown, for a typical segment of 2500 strategies from the stochastic adaptive sequence described
for Fig. 7; for the sequence shown here b"2. The payo! illustrated is the payo! recieved when the strategy plays itself, F(S, S),
and is highly correlated to x .
We repeated this simulation with b"2, and (1998) found that the initial o!ers of both host
found only 7% cooperators, and 35% defectors and symbiont evolve towards zero unless spatial
[Fig. 7(b)]. This illustrates the sensitivity of this e!ects are included in the model (but see Killing-
distribution in strategy space to the cost-to- back et al., 1999). We concur that many strategies
bene"t ratio. in our strategy space evolve towards total defec-
Figure 8 shows the evolution of F(S, S), x , k tion, but have found some notable exceptions to
and d over a subset of successive strategies, for this trend. In particular, strategies that respond
the initial simulation with b"2. Note the strik- sensitively to cooperation (k su$ciently high),
ing correlation between the payo! and x . For but are not suckers (d su$ciently low), have the
100 000 strategies, the correlation coe$cient potential to evolve towards cooperative Nash
between F(S, S) and x was 0.86 (compared to equilibria.
!0.02 and 0.02 for k and d, respectively). These Nash equilibria are a set of coopera-
tive strategies which receive the maximum
payo! when playing against themselves, and
6. Discussion yet are neither susceptible to invasion by
By simulating the evolutionary dynamics of defectors nor by more generous strategies.
interspeci"c mutualisms, Doebeli and Knowlton We note three de"ning characteristics of these
318 L. M. WAHL AND M. A. NOWAK
the overall success and robustness of every type HOFBAUER, J. & SIGMUND, K. (1998). Evolutionary Games
of strategy. We "nd that this distribution is ex- and Population Dynamics. Cambridge: Cambridge Univer-
sity Press.
tremely sensitive to the cost-to-bene"t ratio. KILLINGBACK, T., DOEBELI, M. & KNOWLTON, N. (1999).
When the bene"ts of cooperation are su$ciently Variable investment and the origin of cooperation. Proc.
high, cooperators will outnumber defectors. Natl. Acad. Sci. ;.S.A. (submitted).
KRAINES, D. & KRAINES, V. (1989). Pavlov and the
prisoner's dilemma. ¹heory Decision 26, 47}79.
KREBS, J. R. & DAVIES, N. B. (eds) (1991). Behavioural
We thank the two referees for a number of insightful Ecology: An Evolutionary Approach. Oxford: Blackwell
comments on the paper, and gratefully acknowledge Scienti"c.
the support of The Florence Gould Foundation; KUG LLING, D. & MILINSKI, M. (1992). Size-dependent pred-
J. Seward Johnson, Sr. Charitable Trusts; The ation risk and partner quality in predator inspection of
Ambrose Monell Foundation; and The Alfred P. sticklebacks. Anim. Behav. 44, 949}955.
Sloan Foundation. LINDGREN, K. (1991). Evolutionary phenomena in simple
dynamics. In: Arti,cial ¸ife II (Langton, C. G., Taylor, C.,
Farmer, J. D. & Rasmussen, S., eds), Vol. X, pp. 295}312.
REFERENCES Santa Fe: Santa Fe Institute for Studies in the Sciences of
Complexity.
ALEXANDER, R. D. (1987). ¹he Biology of Moral Systems. MAY, R. M. (1987). More evolution of cooperation. Nature
New York: Aldine de Gruyter. 327, 15}17.
AXELROD, R. (1984). ¹he Evolution of Cooperation. New MAYNARD SMITH, J. (1982). Evolution and the ¹heory of
York: Basic Books (reprinted 1989 in Penguin, Harmonds- Games. Cambridge: Cambridge University Press.
worth). MESTERTON-GIBBONS, M. & DUGATKIN, L. A. (1992).
AXELROD, R. & DION, D. (1988). The further evolution of Cooperation among unrelated individuals: evolutionary
cooperation. Science 242, 1385}1390. factors. Q. Rev. Biol. 67, 267}281.
AXELROD, R. & HAMILTON, W. D. (1981). The evolution of METZ, J. A. J., GERITZ, S. A. H., MESZENA, G., JACOBS,
cooperation. Science 211, 1390}1396. F. J. A. & VAN HEERWAARDEN, J. S. (1996). Adaptive
BOYD, R. (1988). Is the repeated Prisoner's Dilemma a good dynamics: a geometrical study of the consequences of near-
model of reciprocal altruism? Ethol. Sociobiol. 9, 278}305. ly faithful reproduction. In: Stochastic and Spatial Struc-
BOYD, R. & LORBERBAUM, J. P. (1987). No pure strategy is tures of Dynamical Systems (Van Strien, S. J. & Verduyn
evolutionarily stable in the repeated Prisoner's Dilemma. Lunel S. M., eds), pp. 183}231. Amsterdam: North Hol-
Nature 327, 58}59. land.
BULL, J. J. & RICE, W. R. (1991). Distinguishing mechanisms MILINSKI, M. (1987). Tit for tat in sticklebacks and the
for the evolution of cooperation. J. theor. Biol. 149, 63}74. evolution of cooperation. Nature 325, 433}435.
CROWLEY, P. H. (1996). Evolving cooperation: Strategies as MILINSKI, M., KUG LLING, D. & KETTLER, R. (1990). Tit for
hierarchies of rules. Biosystems 37, 67}80. tat: sticklebacks (Gasterosteus aculeatus) &&trusting'' a co-
CROWLEY, P. H., PROVENCHER, L., SLOANE, S., DUGATKIN, operating partner. Behav. Ecol. 1, 7}11.
L. A., SPOHN, B., ROGERS, B. L. & ALFIERI, M. (1996). MOLANDER, P. (1985). The optimal level of generosity in
Evolving cooperation: the role of individual recognition. a sel"sh, uncertain environment. J. Con-ict Resolution 29,
Biosystems 37, 49}66. 611}618.
DOEBELI, M. & KNOWLTON, N. (1998). The evolution of NOWAK, M. A. & MAY, R. M. (1992). Evolutionary games
interspeci"c mutualisms. Proc. Natl. Acad. Sci. ;.S.A. 95, and spatial chaos. Nature 359, 826}829.
8676}8680. NOWAK, M. A. & SIGMUND, K. (1990). The evolution of
DUGATKIN, L. A. (1997). Cooperation Among Animals: An reactive strategies in iterated games. Acta Applicandae
Evolutionary Perspective. New York: Oxford University Math. 20, 247}265.
Press. NOWAK, M. A. & SIGMUND, K. (1992). Tit for tat in hetero-
DUGATKIN, L. A. & MESTERTON-GIBBONS, M. (1996). Co- geneous populations. Nature 355, 250}252.
operation among unrelated individuals: reciprocal altru- NOWAK, M. A. & SIGMUND, K. (1993). Win-stay, lose-shift
ism, by-product mutualism and group selection in "shes. outperforms tit-for-tat. Nature 364, 56}58.
Biosystems 37, 19}30. NOWAK, M. A. & SIGMUND, K. (1994). The alternating
DUGATKIN, L. A., MESTERTON-GIBBONS, M. & HOUSTON, Prisoner's Dilemma. J. theor. Biol. 168, 219}226.
A. I. (1992). Beyond the Prisoner's Dilemma: toward mod- NOWAK, M. A. & SIGMUND, K. (1998). The dynamics of
els to discriminate among mechanisms of cooperation in indirect reciprocity. J. theor. Biol. 194, 561}574.
nature. ¹REE 7, 202}205. PACKER, C. (1977). Reciprocal altruism in Papio anubis.
FRANK, S. A. (1995). The origin of synergistic symbiosis. Nature 265, 441}443.
J. theor. Biol. 176, 403}410. PECK, J. & FELDMAN, M. (1985). The evolution of helping
FREAN, M. (1994). The Prisoner's Dilemma without syn- in large, randomly mixed populations. Am. Nat. 127,
chrony. Proc. R. Soc. ¸ond. B 257, 75}79. 209}221.
FREAN, M. R. (1996). The evolution of degrees of coopera- RAPOPORT, A. & CHAMMAH, A. M. (1965). Prisoner1s Dilem-
tion. J. theor. Biol. 182, 549}559. ma. Ann Arbor: University of Michigan Press.
HAMILTON, W. D. (1963). The evolution of altruistic behav- ROBERTS, G. & SHERRATT, T. N. (1998). Development of
iour. Am. Nat. 97, 354}356. cooperative relationships through increasing investment.
HAMILTON, W. D. (1964). The genetical evolution of social Nature 394, 175}179.
behaviour. J. theor. Biol. 7, 1}16.
320 L. M. WAHL AND M. A. NOWAK
SELTEN, R. & HAMMERSTEIN, P. (1984). Gaps and condition (A.1) is not met if
Harley argument on evolutionary stable learning rules
and in the logic of tit for tat. ¹h. Behav. Brain Sci. 7,
115}116. bkdx!cdx
SMALE, S. (1980). The Prisoner's Dilemma and dynamical
F(S, S)!F(S, S)" '0 (A.3)
2n
systems associated to non-cooperative games. Econo-
metrica 48, 1617}1634.
TRIVERS, R. L. (1971). The evolution of reciprocal altruism. This equation reduces to k'c/b. Thus, as
Q. Rev. Biol. 46, 35}57. long as the slope of the cooperative strategy is
VERHOEFF, T. (1993). A continuous version of the Prisoner1s
Dilemma. Computing Science Note 93/02. Eindhoven: Eind- higher than the cost-to-bene"t ratio, the
hoven University of Technology (also available at http: downwards evolution of x will stop at xL "
//www.win.tue.nl/wstomv/publications/).
(1!d )/k. The sequence of moves at this
WILKINSON, G. S. (1984). Reciprocal food-sharing in the
vampire bat. Nature 308, 181}184. point is:
First consider the case when x is initially set to 1, Player 1: kxJ #kd#d"(1!d )/k, Player 2: 1,
and x "1!dx. As long and dx is small enough,
S(x )"S(x )"1 (i.e. kx #d'1). In this situ-
Player 1: 12 , Player 2: 12 .
ation, strategy S receives complete cooperation
(full bene"t) when playing strategy S, but pays
Consider the next possible step in the evolu-
a slightly smaller cost; condition (A.1) is clearly
tion of x , x "xJ #dx, with dx'0. The se-
met. By similar arguments, it is clear that condi-
quence of moves would then be:
tion (A.2) will be met, and x will evolve towards
smaller values.
Player 1: xJ #dx, Player 2: kxJ #kdx#d,
At some point in this downwards evolution,
x will cross a threshold xL such that S(xL )"kxL
#d"1 but S(xL !dx)"kxL #d!kdx(1. Player 1: kxJ #kdx#kd#d, Player 2: 1,
We see that xL "(1!d )/k and the sequence of
moves, when player S plays "rst, will be Player 1: 12 , Player 2: 12 ,
Player S: xL !dx, Player S: kxL !kdx#d We will accept the new strategy, with
x "xJ #dx, only if the bene"t of this increase
Player S: 12, Player S: 12 in x outweighs the additional cost. Thus, for
condition (A.1) to be met, we "nd that
Summing the costs and bene"ts over an iter-
ated game with exactly n rounds, we "nd that bkdx'c(dx#kdx) (A.4)
CONTINUOUS PRISONER'S DILEMMA 321
which can be rewritten, for dxO0, as The payo! an indiscriminate defector receives
when playing this strategy is clearly
b
k! k#1(0, (A.5) x
c F(AllD, S )"b . (B.2)
IV 2n
This quadratic equation in k has two real roots Similarly, the payo! an indiscriminate cooper-
for b/c'2. When the bene"t-to-cost ratio ator receives against S is
exceeds this limit, k must be between these IV
roots in order for the bene"t of increasing x
x #(2n!1)k
past xJ to outweigh the additional cost involved. F(AllC, S )"b !c. (B.3)
IV 2n
Thus, the upwards evolution of x will stop at
xJ (xL for cooperators with slopes outside this
Using these payo! values, we solve to de-
range.
We have therefore demonstrated that the steady- termine the conditions under which strategy
state value of x is x' "(1!d)/k when x is S is stable against invasion by both AllC and
IV
initially set to 1 and k'c/b, but that inter-
AllD, and "nd that this is true for a small region
mediate steady states are possible if x is initially de"ned approximately by
set to zero.
c c 2n(1!k)!x
)k) . (B.4)
b b (2n!1)(1!k)!x
APPENDIX B
Stability of Sk, 0, x AllC and AllD are representative of extremes in
the spectrum of all strategies, and so we have
It is simple to show that for a game of exactly some insight as to why strategies near S ap-
n moves, the payo! a strategy along the line A@V
pear to be stable against invasion by any strategy
segment +d"0; 0(k(1, receives against itself in the space. Also note that x drops out of the
is given by
equation, such that S is stable for any value
A@V
of x , and further that 2n!1P2n as nPR, so
x (1!kL) that S only resists invasion when k is exactly
F(S ,S )"(b!c) . (B.1) IV
IV IV 2n (1!k) equal to c/b for large values of n.