Professional Documents
Culture Documents
Lecture 5
Lecture 5
Lecture I-5
© WU IMS
Organization
© WU IMS 2
Organization
1000
E$
900
800
700
600
500
400
300
200
100
0
1 2 3 4 5 6 7 8 9 10 11 12 13 14
© WU IMS 3
Organization
▪ Homework assignment 1 results are out.
© WU IMS 4
Recap
▪ What is a Nash equilibrium?
▪ How can we find Nash equilibria?
© WU IMS 5
Organization
▪ A game is defined by: Players, Strategies, Payoffs
▪ Game payoffs are utility payoffs. They depend on the preferences of
players, and are not necessarily equal to $$$ payoffs.
▪ → We have to know the preferences of players!
Otherwise we solve the wrong game.
• Experiments:
– Experiment 8: Dictator game
– Experiment 9: Ultimatum game
– Experiment 10: Repeated Ultimatum game
– Experiment 11: Market game
▪ → And we assume rationality of all players. But what if they aren’t?
• Experiment 12: Centipede Game
© WU IMS 6
Experiment 12
▪ Centipede game
a) Derive the subgame perfect Nash equilibrium for the one-shot
game (game played only for one round). Prove that the
equilibrium you found is indeed one. Are there more Nash
equilibria, which are not subgame perfect?
© WU IMS 7
Experiment 12
▪ Centipede game
1 2 3 4 5 6 7 8 9 10
A: 96
A B A B A B A B A B B: 32
© WU IMS 9
Experiment 12
▪ When do participants choose down?
1 2 3 4 5 6 7 8 9 10
A: 96
A B A B A B A B A B B: 32
© WU IMS 10
Limited depth of reasoning
▪ How far can you think?
▪ Cognitive psychology:
• short term memory capacity limit of 7±2 “chunks”
• Embedding creates cognitive problems
Four levels: “The movie that the script that the novel
that the producer whom she thanked discovered
became was made into was applauded by the critics.”
▪ k-level thinking models
• Players have different levels of rationality, which can be expressed
by a level k. Думаю, що всі інші теж 0
• Level-0 players play some simple strategy which is not dependent
on other players’ behavior. Examples: random, just cooperate, ...
• Level-k>0-players play a best response to the distribution of
players of levels 0 to k-1. варіанту,
Думають, що всі інші k-1 або нижчі. Але вони не розглядають
що суперник теж k
© WU IMS 11
Limited depth of reasoning
▪ Example1 2 3 4 5 6 7 8 9 10
1 2 3 4 5 6 7 8 9 10
A: 96
A B A B A B A B A B B: 32
Якщо ти 1, то ти вирахуюш
стратегію 0 і обереш "вниз" на 7
▪ Level 0: go down at last move, i.e. 9/10 чи 8
Якщо ти рівень 2, то ти
▪ Level 1: best response (BR) against level 0: go down at 7/8 думатимеш, що інші або 1, або
0. Треба змішана стратегія, аби
▪ Level 2: BR against 0 and 1: mix of down at 7/8 or 5/6 вирахувати, ким є інший (1 чи 0)
і базуючись на цьому обрати
▪ Level 3: BR against 0, 1, 2: mix of down at 7/8, 5/6, 3/4 свою стратегію: це або вниз на
5-6, або вниз на 7-8
▪ … more levels …
© WU IMS 12
Limited depth of reasoning
▪ What are the issues with this level-k reasoning approach?
• All depends on choice of level-0 behavior. (Is it random? Is it an
„intuitive“ choice? What does „intuition“ mean?)
© WU IMS 13
Take-aways
▪ A game is defined by: Players, Strategies, Payoffs
▪ → We have to know the preferences of players! Otherwise we
may solve the wrong game.
▪ Utilities over game outcomes might depend on other’s payoffs or
intentions.
© WU IMS 14
Experiment 13
© WU IMS 15
Experiment 13
Player 2
0.08 0.12
▪ The Nash
equilibrium of the
12* 0 one-shot game is:
0.08
• Player 1: Choose
Player 1
12* 24 *
price of 0.08.
24* 17.5 • Player 2: Choose
0.12
0 17.5 price of 0.08.
© WU IMS 16
Experiment 13
▪ The equilibrium of the repeated game (repeated with the same
players) corresponds to playing the one-shot equilibrium in each
repetition.
▪ Why? Backward induction!
• In the very last repetition, what is the equilibrium of the game?
• The unique one-shot equilibrium! That means, behavior in the
very last round is completely independent of any behavior
before.
• Thus, in the second last round, as behavior has no impact on
how players will behave in the last round, the game is played as
if it is the last game.
• The same is true for the second-to-last round, etc.etc., until we
arrive at the first round.
© WU IMS 17
Recap: Finitely repeated games theorem
▪ Theorem:
• If a game G has a unique subgame perfect Nash equilibrium,
then there exists a unique subgame perfect Nash equilibrium in
the finitely repeated game G(T) in which the subgame perfect
Nash equilibrium of the game G is played in each repetition.
© WU IMS 18
Experiment 13
▪ The subgame perfect Nash equilibrium of the repeated game is:
• Player 1: Choose price of 0.08 in each round.
• Player 2: Choose price of 0.08 in each round.
© WU IMS 19
Experiment 13
▪ What about punishing strategies. E.g., both play: “I start with a high
price, but I will select a low price whenever I see the other selecting a
low price?” Is this an equilibrium?
▪ No:
• A best response to someone playing this strategy would be: I do
the same, only that in the very last period I choose a low price.
• A best response against that would be to defect in the second-to-
last period …. etc. etc.
▪ Even if the other guy believes my punishing strategy, and plays
cooperatively, I myself would have an incentive to choose another
strategy.
→ Even in the repeated game, there is only one unique Nash
equilibrium. There are no other Nash equilibria
(for example, involving not credible threats or similar).
© WU IMS 20
Experiment 13
▪ Data
c) Analyze the data set of experiment 13. What do people do? Can
you observe specific patterns of behavior over time?
© WU IMS 21
Experiment 13
© WU IMS 22
Experiment 13
© WU IMS 23
Experiment 13
© WU IMS 24
Experiment 13
▪ Data
d) Compare the results of theoretical and data analysis to
experiment 2 (simultaneous price competition, conducted in
tutorial 1 and analyzed in lecture 2). What are the most
important differences between experiment 13 and experiment
2?
© WU IMS 25
Experiment 13
▪ For the competition game of experiment 2, we had a similar prediction
of behavior as for the game of this experiment: all firms set the lowest
price.
▪ In experiment 2 we observed that this was indeed the case: already in
the 1st round, the equilibrium price was reached, and most individual
price offers were at the equilibrium in the 2nd round, with very few
deviations.
▪ In experiment 13, we observe more cooperation but still not full
cooperation.
▪ Generally speaking:
• Groups of 2: better coordination on collusion.
• Same partner in all rounds: potential of punishment!
© WU IMS 26
Dilemma games
▪ A dilemma game is a game where
• individual rationality and
• group rationality
suggest different ways of behavior
▪ In other words: given behavior of other group members, what’s best
for the individual is not best for the group.
▪ General problem:
• negative external effects of my behavior on others
are not “internalized“
• Others’ profit reduction when I undercut
• Team’s loss in marks when I do not contribute = don’t
• Friends’ loss when I take too much for myself affect or
bother me
• Partner’s loss if I do not cooperate
© WU IMS 27
Dilemma games: Examples
▪ Collusion of firms (also experiments 2, 3, 7)
• “Group” rationality: all choose monopoly price and share the
resulting monopoly profits
• Individual rationality: undercutting others’ prices yields high profits
• Outcome: competitive prices, all firms make less profits than when
cooperating
▪ Note: this outcome is good for a larger group: the society
© WU IMS 28
Dilemma games: Examples
Provision of public goods
• non-excludable: nobody can be excluded
• non-rival: one’s consumption does not hurt others
• Public transport, streets and sidewalks, …
• Group rationality: everybody benefits if provided
• Individual rationality: free-ride on others’ contributions
• Outcome: public goods are not (or under-) provided
© WU IMS 29
Dilemma games: Examples
▪ Common pool resources (very similar to public goods)
• Imagine a pond with a number of fishermen. Each fisherman can fish
how many he likes, but if too many fish are caught, the fish will die out
• Group rationality: fish as much as optimal long term
• Individual rationality: given other fisherman reduce, just fish a little
more which does not hurt myself
• Outcome: everybody fishes more
Confess Deny
10 years 25 years
Confess
10 years 1 year
HUSBAND
1 years 3 years
Deny
25 years 3 years
© WU IMS 32
Infinity
▪ We started with most restrictive assumptions: complete information, full
rationality, selfish players
© WU IMS 34
Experiment 14
▪ Dilemma of unknown (probabilistic) length
Player 2 ▪ Backward induction?
0.08 0.12 → No defined end, so no rollback
possible!
0.08
12* 0 → Any round is equal, as the expected
number of rounds to play stays the same
Player 1
12* 24*
in each round.
24* 17.5
0.12 ▪ What now?
0 17.5 ▪ Is 0.08/0.08, played in
all rounds, still an
δ=0.9 equilibrium?
▪ Yes.
But there are more!
© WU IMS 35
Experiment 14
Player 2
0.08 0.12
12* 0
Player 1
0.08
12* 24*
24* 17.5
0.12
0 17.5
© WU IMS 36
Experiment 14
▪ Subgames in infinite games
• A subgame is a game which starts at some decision node after
the root node of the original game.
• In an infinite game with a fixed discount factor/continuation
probability per repetition,
• every subgame looks exactly like the original game.
© WU IMS 37
Experiment 14
▪ Assume the following grim trigger strategies: “I start
Player 2
0.08 0.12 with cooperating. If you cooperate, then I will
12* 0
Player 1
0.08
12* 24*
continue cooperating. If you defect once, from the
next round on I will defect forever.”
24* 17.5
0.12
0 17.5 t =1
t
=
1−
▪ Is playing these strategies an equilibrium?
▪ Say I comply with the strategy, then my expected profit is:
0.9
17.5 + 0.9 17.5 + 0.9 17.5 + ... = 17.5 + 0.9t 17.5 = 17.5 +
2
17.5 = 175
t =1 1 − 0.9
▪ If I deviate once, and then always have no cooperation:
0.9
24 + 0.9 12 + 0.9 12 + ... = 24 + 0.9t 12 = 24 +
2
12 = 132
t =1 1 − 0.9
▪ Is the punishment credible? Yes: It’s the one-shot Nash!!!
© WU IMS 38
Experiment 14
▪ Data
c) Analyze the data set of experiment 14. Do you see any pattern
in the interactions of the two players?
© WU IMS 39
Experiment 14
© WU IMS 40
Experiment 14
© WU IMS 41
Experiment 14
© WU IMS 42
Infinitely repeated games
▪ Some players might have limited foresight such that they see a game as
infinite.
▪ A game might actually be infinite, or has a certain continuation
probability after each round.
▪ Does this change rational behavior in such games?
▪ Yes:
• Backward induction is not feasible anymore, thus arguments relying
on rollback cannot be applied.
• Rather, each repetition of the game can be seen as having the same
future prospects → solving for one round (under the shadow of the
future!) should yield a solution for all rounds.
▪ Note: Playing the one-shot equilibrium in each round is still an
equilibrium in the infinitely repeated game. But there are others now …
© WU IMS 43
Infinitely Repeated games theorem
▪ If a game G has a unique Nash equilibrium, then there exists a unique
subgame perfect Nash equilibrium in the infinitely repeated game
G(T∞) in which the Nash equilibrium of the game G is played in each
round t.
▪ So, playing the Nash equilibrium of the one-shot game in each round
is always one Nash equilibrium of the infinitely repeated game.
▪ But there are likely more Nash equilibria in the infinitely repeated
game.
© WU IMS 44
Infinitely repeated games
Player 2 ▪ How to interpret δ ?
D C
2 1
D
2 6
6 5
C
1 5
δ
© WU IMS 45
Infinitely repeated games
und 1
Player 2 ▪ How to interpret δ ?
D C
2 1 a) Game continuation probability:
D
2 6 with a probability of δ the game is
6 5 continued. Thus, in expected terms,
C seen from today, the next game is
1 5
Player 2 δ played for amounts of δ · X, etc.
und 2 D C b) Discount factor: right now, money
δ2 δ1
D
δ2 δ6
earned next year is less valuable to
δ6 δ5 us than money earned today. So
C
δ1 δ5 when a game is played infinitely,
Player 2 income from later rounds is less
und 3 D C
δ valuable than today’s income. 1 – δ
δ2 2 δ2 1 is this loss of value for each round in
Player 1
D 2
δ 2 δ2 6 the future, seen from today.
δ2 6 δ2 5
C
δ 2 1 δ2 5 … c) Product of both
© WU IMS 46
Infinitely repeated games
Let C, D, H, and L be the utility payoffs Player 2
from a dilemma game, with H > C > D > L. D C
Let 0 p 1 be the probability that the
D L
game continues after a round. D
Player 1
D H
Let 0 d 1 be the discount factor per
round. Let δ = p · d. H C
C
L C
© WU IMS 47
Infinitely repeated games
▪ Is one-time defection worthwhile? Player 2
▪ If a player complies, then her net present D C
¥
value is
å
C + dC + d t ×C D L
t=2 D
Player 1
D H
▪ If a player defects once against
T4T her net present value is ¥ H C
H + d L + åd t ×C
C
L C
t=2
▪ So she should one-time defect if
¥ ¥
H -C
H + d L + åd ×C > C + dC + åd ×C Þ
t t
>d
t=2 t=2
C- L
▪ If loss from punishment (C-L) is large or benefit from defecting (H-
C) is small or the next round has relatively high value, then one-
time defection does not pay.
© WU IMS 48
Infinitely repeated games
Is all-time defection worthwhile?
Player 2
=
t
1−
If a player complies, then his net t =1 D C
present value is
C + C = C +
t
C D L
t =1 1− D
Player 1
D H
If a player defects forever against T4T then
his net present value is H C
H + t D = H +
C
D L C
t =1
1−
So he should defect if
H+
1−
DC+
1−
C H −C
C − D 1−
H −C
1
C − D 1−
−1
If loss from continued punishment (C-D) is high or one-time benefit
from defection (H-C) is low or the future has a high value, then all-
time defection does not pay.
© WU IMS 49
Infinitely repeated games
▪ Folk Theorem (Friedman 1971):
• Let G be a static n-person game with complete information. Let
e = (e1, ..., en) be the payoff vector in a Nash equilibrium of G and
let x = (x1, ..., xn) be a feasible payoff vector in G, and let xi > ei
for all players i .
• If close enough to 1, then there exists a subgame perfect
Nash equilibrium of the infinitely repeated game G(, ) which
implements the payoff vector x als average payoff.
▪ In other words: any feasible payoff combination (x1, ..., xn) (with xi >
ei for all players i) can be implemented by some trigger strategies in
an infinite game.
© WU IMS 50
Infinitely repeated games Player 2
▪ What does that mean? D C
• Payoff vectors 2 1
Player 1
D
• Feasible payoff vectors 2 6
• with xi > ei for all players i 6 5
C
1 5
▪ Folk theorem: Each point in the green
area can be implemented as average (1,6)
payoff, by a) playing a combination plan 6 (5,5)
over the 4 cells and b) threatening to
always play (2,2) if somebody deviates
Player 2
from this plan
▪ Note: (2,2) is a credible threat, as it is a
stage game equilibrium, and worse (2,2)
than any payoff in the green area. (6,1)
0 Player 1 6
© WU IMS 51
Take-aways
© WU IMS 52
Experiments
© WU IMS 53