Professional Documents
Culture Documents
Teoria de Juego
Teoria de Juego
GAME THEORY
Lecture 1: Intro and Motivational Examples
January 8, 2024
Course Information
1 / 339
Course Information
Course Information
2 / 339
Course Information
Course Information
Course Information
4 / 339
Course Information
5 / 339
Course Information - Schedule
Game theory
Éin a nutshell:
6 / 339
Game theory
Éin a nutshell:
7 / 339
John von Neumann
Quiz:
Read John von NeumannÕs Wikipedia page and answer:
9 / 339
John von Neumann? É no!
10 / 339
The essential ingredients
The essential ingredients defining a
game are:
players N = {1, 2, É, n}
and for each i in N:
actions Ai = {a1, a2, É, am}
payoffs ui : A1 x É x An -> R
We will study this in more detail in a few weeks.
Note: Many types of interactions are games!
Applications
Game theory started out as a mathematical
discipline and has eventually been applied to
various fields such as:
Ñ Economics, Business, Strategy
Ñ Political Science, Law, Political Philosophy
Ñ International Relations, Military Strategy
Ñ Environmental Sciences, Climate
Ñ Biology, Medicine, Neuroscience
Ñ Computer Science, Logic
Ñ etc.
We will see various applications in class.
11 / 339
Application: Telecom market
¥ We want to understand strategic interactions.
x*
x*
x*
x**
x**
x**
16 / 339
Edward O. WilsonÕs Half-Earth Project
30% by 2030.
17 / 339
Progress at the UN: High Seas Treaty
18 / 339
Some examples
Some examples
Some examples
Some examples
Fabrizio Germano
Winter 2024
Uncertainty
· So far (in Intro to Micro and Micro I), consumers and firms knew
the exact consequences/outcomes of their actions.
Example A firm knows its profits given its level of production.
· But often there can be UNCERTAINTY about the payoffs
associated to different actions (e.g., due to unknown future prices).
Example
- A farmer might have to choose whether to cultivate carrots or potatoes,
- but when he makes the decision, he’s uncertain about the profits that
he’ll obtain,
- since the best choice depends on weather conditions, future prices,
demand conditions... none of which he controls.
· Gap between actions and consequences.
· Decision theory shows us how to deal with this gap.
23 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 2
Example
Uncertainty
Example
Risky alternatives
sun rain
ice-cream shop 6 0
bookshop 2 2
24 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 4
Example
Strategic interaction
tobacco ice-cream
ice-cream shop 6 0
bookshop 2 2
25 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 6
Where are we going?
The key concepts we need to know are:
· Certainty equivalent
· Attitudes towards risk
· risk averse – risk neutral – risk loving
· Applications: asset purchase – insurance – portfolio choice
26 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 8
· θ can take any of m different values in Θ = {θ1 , θ2 , ..., θm }.
· Realizations of θ follow a probability distribution.
· Value θi occurs with probability pi .
m
· We assume that for all i, pi ~ 0 and ∑ pi = 1.
i =1
Example We don’t know what the weather will be, but we know it
can be sunny or rainy and we know the probabilities.
Decision Theory
exploits the information given by probabilities and the information
about agents’ preferences in order to make recommendations for
making “good decisions” j help to choose the “favorite lottery”.
27 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 10
The farmer’s example
set of actions A = { }
set of consequences C = { }
28 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 12
Lotteries
· A formal representation of each risky action/alternative:
· a set of consequences (in this case, monetary payoffs), each
with an associated probability pi :
L = ïp1 , ..., pm | x1 , ..., xm ð .
· L is called a lottery.
· The set of all such lotteries is denoted by L.
Rational behavior
29 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 14
The Expected Value
30 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 16
Back to the farmer’s example.
31 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 18
An experiment
An experiment
32 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 20
The Saint Petersburg Game
· Consider the game in which you pay a fixed fee and then you
toss a coin repeatedly until it turns up heads in which case the
game stops and you get:
· 2e if the coin comes up heads in the first toss
· 4e if heads in the second toss
· 8e if heads in the third toss
· 16e if heads in the fourth toss
· etc.
· How much would you be willing to pay to play this game?
· You can think of it as a lottery: LSP = .
· How much would you be willing to pay to participate in the
lottery LSP ?
50 50
Frequency in group T3
Frequency in group T1
40 40
30 30
20 20
10 10
0 0
0 20 40 60 80 100 0 20 40 60 80 100
Amount of money stated (100 means '100 or more') Amount of money stated (100 means '100 or more')
1 1 1
The lottery LSP = 2, 4, 8, . . . | 2, 4, 8, . . . has expected value:
1 1 1 1
E (LSP ) = (2) + (4) + (8) + (16) · · ·
2 4 8 16
= 1+1+1+1+···
= ∞.
· Certainty equivalent
· Attitudes towards risk
· risk averse – risk neutral – risk loving
· Applications: asset purchase – insurance – portfolio choice
34 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 24
Expected Utility Theory
In this case u (·) is linear in money outcomes (xi ’s) and U (·) is
linear in both probabilities and money outcomes (pi ’s and xi ’s).
36 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 28
Example
· Suppose an economic agent has preferences that can be
represented by an expected utility function U (·) over lotteries
:
with von Neuman-Morgenstern utility function u (x ) = x.
· Which lottery will this agent choose?
E ( L1 ) =
E ( L2 ) =
37 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 30
Example
Compute the expected utility of the St Petersburg lottery LSP for
two individuals with:expected utility preferences with vNM utility
functions u1 (x ) = x and u2 (x ) = log x.
· U2 (LSP ) =
· U2 ( L 5 ) =
· Individual 2 prefers
Example
Compute the expected utility of the St Petersburg lottery LSP for
two individuals with:expected utility preferences with vNM utility
functions u1 (x ) = x and u2 (x ) = log x.
· U2 (LSP ) j 1.39
· U2 (L5 ) j 1.61
· Individual 2 prefers 5e for sure.
Who seems to be more risk averse?
38 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 32
Example
Compute the expected utility of the St Petersburg lottery LSP for
two individuals with:expected utility preferences with vNM utility
functions u1 (x ) = x and u2 (x ) = log x.
· U2 (LSP ) j 1.39
· U2 (L5 ) j 1.61
· Individual 2 prefers 5e for sure.
Who seems to be more risk averse? Individual 2.
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 32
Preference representations
39 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 33
A few more details about preferences and utility functions
41 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 37
Proof.
L L2 ñó U (L) g U (L2 )
õ
m n
∑ p i u ( xi ) g ∑ pi2 u (xi2 )
i =1 i =1
õ ( α > 0)
m n
α ∑ pi u (xi ) + β g α ∑ pi2 u (xi2 ) + β
i =1 i =1
õ
m n
∑ p i v ( xi ) g ∑ pi2 v (xi2 )
i =1 i =1
õ
V (L ) g V (L2 )
1
u (0) = 10 2 = 9.
0+1
³ u isn’t satisfactory.
42 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 39
· Construct v subtracting 9 from u:
1 x
v (x ) = u (x ) 2 9 = 1 2 = .
x +1 x +1
11 11x
w (x ) = v (x ) = .
10 10(x + 1)
43 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 41
Method 2:
Let us write w (x ) = αu (x ) + β so that w (to be determined) will
represent the same preferences as u (assuming that α > 0).
Now find α and β such that
0 = w (0) = αu (0) + β = 9α + β = 0
1
1 = w (10) = αu (10) + β = α(10 2 ) + β
11
This system of 2 linear equations with two variables α and β has a
11
unique solution: α = 10 and β = 2 99
10 . Hence,
11 99 11x
w (x ) = u (x ) 2 = .
10 10 10(x + 1)
Since α > 0, w is a positive affine transformation of u. So, w
represents the same preferences as u. And of course, by
construction, it satisfies the two conditions w (0) = 0 and
w (10) = 1.
44 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 43
Should the expected profits be used?
We have to choose between two lotteries
The oil well L2 should be chosen if the investor is risk neutral. Yet,
you should check his attitude toward risk before giving any advice.
u (60) = 0
u (200) = 1
45 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 45
Assessing the utility of 160e
· Ask your client: what is the probability p that makes 160e
with certainty indifferent to the reference lottery?
46 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 47
The estimated vNM utility function
Euros 60 130 160 200
Utility 0 0.85 0.95 1
47 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 49
Summary of decision-making “recipe”
48 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 51
Expected Utility in a Graph
L = ïα, 1 2 α | x1 , x2 ð
remember: 0 f α f 1 (α is a probability)
Important note: If you have two points, A and B, say, A = (1, 1) and B = (3, 4), then
C = 13 A + 23 B = ( 37 , 39 ) j (2.33, 3). If you draw A, B, C , you can see that C is on the
line between A and B, from A it’s 2/3 of the way towards B. Draw this!
49 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 53
Example of a graphical analysis
Initial Wealth and Investment
· an investment project can yield a profit of 40 if things go well
and a loss of 20 if things go wrong
· a long experience with projects of this type show that they fail
one out of three times
· initial wealth of the investor ω = 30
· utility function u is
Should he invest?
· compare 2 lotteries: invest (L) and not invest (H )
· remember to add the initial wealth to the outcomes of each
lottery (level of final wealth may be important to investor)
50 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 55
Investor should
51 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 57
Investor should
Certainty Equivalent
u [C (L)] =p u (x ) + (1 2 p ) u (x 2 )
ô C (L) =u 21 [p u (x ) + (1 2 p ) u (x 2 )]
53 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 61
Graphical representation of the certainty equivalent
Take p = 1/2.
Take p = 1/2.
54 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 63
Example of computation of a certainty equivalent
Suppose your vNM utility function is u (x ) = x 0.62 .
· Consider the lottery L = ï 12 , 21 | 20, 280ð.
· What is your certainty equivalent C (L) for this lottery?
u [C (L)] = p u (x ) + (1 2 p ) u (x 2 )
55 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 64
Example of computation of a certainty equivalent
Suppose your vNM utility function is u (x ) = x 0.62 .
· Consider the lottery L = ï 12 , 21 | 20, 280ð.
· What is your certainty equivalent C (L) for this lottery?
Recall the formula for the certainty equivalent:
u [C (L)] = p u (x ) + (1 2 p ) u (x 2 )
C (L) = u 21 [U (L))] = u 21 [p u (x ) + (1 2 p ) u (x 2 )]
56 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 65
Attitudes towards risk
57 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 66
Actuarially equivalent lotteries
· Two lotteries are called actuarially equivalent if they have
the same expected value.
· Given a lottery L, let L7 denote the degenerate lottery that
gives with certainty (probability 1) the expected value of L.
E (L) = pb + (1 2 p )d = E (L7 )
Risk Neutral
· A decision maker is said to be risk neutral if he is always
indifferent between the uncertain lottery L and the actuarially
equivalent certain lottery L7 .
· A decision maker is risk neutral if and only if his vNM utility
function u is linear.
Proof.
Compare the expected utilities of L and L7 :
U (L7 ) = u (pb + (1 2 p ) d ) = pu (b ) + (1 2 p ) u (d ) = U (L)
58 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 68
Risk Averse
· A decision maker is said to be risk averse if he always prefers
the certain lottery L7 to the actuarially equivalent uncertain
lottery L.
· A decision maker is risk averse if and only if his vNM utility
function u is strictly concave.
Proof.
Compare the expected utilities of L and L7 :
U (L7 ) = u (pb + (1 2 p ) d ) > pu (b ) + (1 2 p ) u (d ) = U (L)
Risk Loving
Proof.
Homework.
59 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 70
Summary of ttitudes towards risk
With expected utility there are three types of risk attitudes.
· Risk neutral (linear u (·))
· Risk averse (strictly concave u (·))
· Risk loving (strictly convex u (·)).
60 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 72
Certainty equivalent and risk aversion
The relation between the certainty equivalent and the expected
value of a lottery characterizes the attitude towards risk:
Proof.
Risk aversion means that
2 2
p u (x ) + (1 2 p ) u (x ) < u p x + (1 2 p ) x
or, equivalently,
u 21 [pu (x ) + (1 2 p )u (x 2 )] < px + (1 2 p )x 2 .
62 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 76
Experiment on Risk Attitudes
And graphically, for both groups the outcomes were:
· Certainty equivalent
· Attitudes towards risk: risk averse / neutral / loving
Applications
64 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 80
Application 1. Buying and selling risky assets
65 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 82
Determining the selling price
1 3 1: 3: 5
U ( L1 ) = u (16) + u (4) = 16 + 4= .
4 4 4 4 2
66 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 84
Determining the selling price
4
p s =2.25, p b =1.98 L 1 =+0.25,0.75|4,16+
3
U(L 1 ) u(x)=x 0.5
2
u(4+ps )
U(L 3 ) u(4)
1 L 3 =+0.25,0.75|4-p b ,16-p b +
5 10 15 20
Note that ps = 2.25 > 1.98 = pb and both are < E (L) = 3.
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 87
15 L 1 =+0.25,0.75|4,16+
p s =3, p b =3
10 u(x)=x
u(4+ps )
U(L 1 )
u(4)
5
U(L 3 )
L 3 =+0.25,0.75|4-p b ,16-p b +
0 5 10 15 20
30 p s =3.33, p b =3.68
25 L 1 =+0.25,0.75|4,16+
20 u(x)=x 1.2
u(4+ps )
15
U(L 1 ) u(4)
10
U(L 3 )
5
L 3 =+0.25,0.75|4-p b ,16-p b +
5 10 15 20
Note that now ps = 3.33 < 3.68 = pb and both are > E (L) = 3.
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 89
69 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 90
Experiment on buying and selling a risky asset
70 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 92
Application 2. Insurance contracts
· An insurance contract is a contract in which the insured pays
a relatively small and certain fee (premium) to the insurer in
exchange for the insurer’s promise to compensate the insured
in case of an uncertain event that causes financial/personal
loss (accident, robbery, fire, death, etc.).
· The contract establishes the premium the insurer has to pay
and the compensation x paid by the insurer.
· The insurance contract is a contract of transfer of risk: the
insurance company accepts the insured’s risk in exchange for
the premium. When the contract includes an excess
(franquicia), part of the risk remains in the hands of the
insured individual.
· The insurance mechanism doesn’t change the risk probabilities
or consequences.
Insurance: Notation
ωg0 initial wealth
0fd fω loss
p probability of fire
x fd compensation in case of fire
s price of a unit of insurance
Insurance as a lottery:
x
Note: When using Lz where z = d * [0, 1], we have L0 and L1 .
Given all the other variables, the economic agent has to choose the
level of insurance x that maximizes expected utility
max f (x ) = p u (ω 2 d 2 sx + x ) + (1 2 p ) u (ω 2 sx )
x
s.t. 0fx fd
72 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 96
Insurance contracts: a numerical example
73 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide 98
Full insurance contract example: solution
Assume the insurance company is risk neutral so that its expected
utility equals expected value. It decides between B0 and B1 .
E ( B0 ) = 0
E (B1 ) = 600e
³ The expected utility for the plant owner is higher with the
insurance contract (L1 { L0 ).
80 u(x)=100- 10.000
x
u(1000-pI )=U(L1 )
70
U(L0 )
60
U(L 1 ) > U(L 0 )
f(x) = 0.001u(400+(1-s)x)+0.999u(1000-sx)
j
88.8
U(L 1 ) -- with full insurance 1.8033*10^10+4.4965*10^7 x-100 x 2
2.0040*10^8+499599.1984 x-x 2
88.7
U(L1/2 )
U(L1 )
88.6
U(L0 )
U(L 0 ) -- without insurance
88.5
x = amount insured
0 100 200 300 400 500 600
u(1000-pI )=U(L1 )
70 U(L0 )
U(L1/2 )
60
U(L 1/2 ) > U(L 1 ) > U(L 0 )
75 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide102
Partial insurance contract: plant owner’s decision
Recall:
· No insurance corresponds to the risky asset:
L0 = ï0.001, 0.999 | 400, 1000ð (in 1,000s of euros).
· The optimal partial insurance contract is at z 7 = 1/2 and yields
expected utility U (L1/2 ) > U (L1 ) > U (L0 ), where
· L1/2 = ï0.001, 0.999 | 700 2 0.6, 1000 2 0.6ð.
100
u(1000-pI )=U(L1 )
70 U(L0 )
U(L1/2 )
60
U(L 1/2 ) > U(L 1 ) > U(L 0 )
86 u(1000-pI )=U(L1 )
U(L0 )
84
U(L1/2 )
82
76 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide104
Insurance markets
240
u 22 (x ) = 2 <0
(x + 4)3
so that the vNM utility function is strictly concave. Hence,
both hunters are risk averse.
· When an individual goes hunting he gets 2 units of deer.
From time to time (with probability 20%) a hunter gets ill and
cannot go hunting. In that case he gets nothing.
77 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide106
Solving the hunters’ insurance problem
The lottery L0 represents the situation of a hunter without any
insurance contract.
Suppose now that they decide to insure themselves and share the
risk: if one of them gets ill, they will share what the other gets.
The lottery L1 represents this situation: there are 3 possible
outcomes:
1. With probability 0.64 both are healthy and each gets 2 deer.
2. With probability 0.04 both are ill and both get 0 deer.
3. With probability 0.32 one of them is ill and the other is
healthy. In that case they consume 1 deer each.
78 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide108
Application 3. Optimal portfolio
79 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide110
Comparing the two assets
Let H and L be the two lotteries that result from investing all the
money in one asset.
Optimal portfolio
Suppose we could also decide to invest only a fraction of our wealth
in the risky asset. There is a whole range of possible portfolios.
· Let z denote the fraction of our wealth that we decide to
invest in the risky asset, and denoting by Lz the lottery we are
confronted with, we get:
82 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide116
Preferences over a finite set of alternatives
preference relation
83 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide118
Preferences over a finite set of alternatives
Example. You won a weekend trip to a Northern European city!
Survey questions:
· Amsterdam vs Berlin?
· Berlin vs Copenhagen?
· Copenhagen vs Dublin?
· Dublin vs Amsterdam?
· Copenhagen vs Amsterdam?
· Berlin vs Dublin?
Possible answers for each question were:
· 1: X { Y , 2: Y { X , 3: X > Y .
Which of the following satisfy transitivity and completeness?
(1, 2, 1, 2, 1, 1), (1, 1, 1, 1, 2, 1), (1, 1, 1, 2, 2, 1), (1, 3, 1, 2, 2, 2), (3, 1, 1, 2, 1, 1).
In a class survey at least 90% were complete and transitive, and less than
10% violated only transitivity.
(1, 2, 1, 2, 1, 1) ñó Copenhagen { Amsterdam { Berlin { Dublin.
UPF, Introduction to Game Theory, Winter 2024 Topic 1, Slide119
Fabrizio Germano
Winter 2024
85 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 1
Test 1
Seminars 1-6 cover problem sets 1-6 and in seminar 7 you will present
your final projects. Also see the syllabus for more info.
86 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 3
Preliminaries
where:
· N is the set of players
· Si is the set of pure strategies of player i
· ui are utilities of player i.
We assume that N and each Si are finite and the players know G .
Players are utility maximizers with utilities ui .
Preliminaries
where:
· N is the set of players
· Si is the set of pure strategies of player i
· ui are utilities of player i.
We assume that N and each Si are finite and the players know G .
Players are utility maximizers with utilities ui .
Note: Here, in Topics 2, 3, we only consider pure strategies.
87 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 4
Preliminaries
where:
· N is the set of players
· Si is the set of pure strategies of player i
· ui are utilities of player i.
We assume that N and each Si are finite and the players know G .
Players are utility maximizers with utilities ui .
Note: Here, in Topics 2, 3, we only consider pure strategies.
Later, in Topics 4, 5, we also consider so-called mixed strategies, where
players can randomize over pure strategies.
Preliminaries
G= s11 4, 2 0, 0
s12 0, 0 2, 4
u1 (s ) = u1 (s11 , s21 ) = 4
u2 (s ) = u2 (s11 , s21 ) = 2.
88 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 5
Preliminaries
Sometimes it’s easier or more natural to label strategies by A, B, or
X , Y or other labels, similarly for players.
Example. Consider the 2-player game G (battle of the sexes):
W \H Ballet Boxing
G= Ballet 4, 2 0, 0
Boxing 0, 0 2, 4
uW (s ) = uW (Ballet, Boxing ) = 0
uH (s ) = uH (Ballet, Boxing ) = 0.
89 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 7
Writing down a game...
What is the Split or Steal game Sarah and Steven are playing?
How can you analyze this game? What do you expect people to
play? What are the “equilibria”?
How can you analyze this game? What do you expect people to
play? What are the “equilibria”?
90 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 8
Where are we going?
After introducing the setup of simultaneous games, we study how
players might behave, and how we can rationalize their choices.
Key concepts we will learn are:
· Iterated elimination of strictly dominated strategies
· Strictly dominant and strictly dominated strategy
· Iterated elimination of weakly dominated strategies
· Weakly dominant and weakly dominated strategy
· Best response and rationalizable strategies
The concepts learned here will be used and revisited in the rest of
this class and in other classes in microeconomics, IO and elsewhere.
91 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 10
Strict Dominance
Definition
The strategy si * Si strictly dominates ti ;= si if it gives a strictly
higher payoff than ti against every strategy profile of the opponents:
Strict Dominance
Definition
The strategy si * Si strictly dominates ti ;= si if it gives a strictly
higher payoff than ti against every strategy profile of the opponents:
1\2 L M R
T 2, 3 3, 0 0, 1
B 0, 3 1, 5 6, 2
92 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 11
Strict Dominance
Definition
The strategy si * Si strictly dominates ti ;= si if it gives a strictly
higher payoff than ti against every strategy profile of the opponents:
1\2 L M R
T 2, 3 3, 0 0, 1
B 0, 3 1, 5 6, 2
Strict Dominance
1\2 L M R
T 2, 3 3, 0 0, 1
B 0, 0 1, 5 6, 2
93 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 13
Strict Dominance
1\2 L M R
T 2, 3 3, 0 0, 1
B 0, 0 1, 5 6, 2
Strict Dominance
Definition
The strategy si * Si is strictly dominant if it strictly dominates every
strategy ti * Si with ti ;= si :
Note that si gives player i a strictly higher payoff than any other strategy
of player i no matter what the opponents are playing.
94 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 15
Strict Dominance
Definition
The strategy si * Si is strictly dominant if it strictly dominates every
strategy ti * Si with ti ;= si :
Note that si gives player i a strictly higher payoff than any other strategy
of player i no matter what the opponents are playing.
Strict Dominance
Definition
The strategy si * Si is strictly dominant if it strictly dominates every
strategy ti * Si with ti ;= si :
Note that si gives player i a strictly higher payoff than any other strategy
of player i no matter what the are opponents playing.
96 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 17
Iterated Elimination of Strictly Dominated Strategies
What is your “predicted” outcome for this game? (T , L)
Start with : 1\2 L M R
T 2, 3 3, 0 0, 1
B 0, 3 1, 5 6, 2
Eliminate R: 1\2 L M R
T 2, 3 3, 0 0, 1
B 0, 3 1, 5 6, 2
Definition
The set Si∞ is the set of (pure) strategies of player i that survive
iterated elimination of strictly dominated strategies (IESDS).
Sometimes we also refer to Si∞ as the set surviving iterated strict
dominance (ISD). In the previous example S1∞ = T and S2∞ = L.
Note: Si∞ does not depend on the order of elimination.
98 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 21
S ∞ and Common Knowledge of Rationality
To predict that only strategies in S ∞ will be played, in two-player
games, we need to assume that:
(1)
Here Si∞ = Si = {D }. (Why?)
Note: Here the strategies D are strictly dominant for both players, so that the
profile (D, D ) is also what is called a dominant strategy equilibrium (DSE ).
99 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 23
Iterated Elimination of Strictly Dominated Strategies
(1)
Here Si∞ = Si = {D }. (Why?)
Note: Here the strategies D are strictly dominant for both players, so that the
profile (D, D ) is also what is called a dominant strategy equilibrium (DSE ).
Here you played: 64% A and 36% B (T1) and 81% A and 19% B (T3).
Here you played: 60% A and 40% B (T1) and 76% A and 24% B (T3).
100 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 25
Prisoner’s Dilemmas – Class Experiment (2023)
You played the same two prisoner’s dilemma type games:
Here you played: 67% A and 33% B (T1) and 88% A and 12% B (T3).
Here you played: 80% A and 20% B (T1) and 82% A and 18% B (T3).
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 26
Dominance Solvability
101 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 27
Iterated Elimination of Weakly Dominated
Strategies
Weak Dominance
Here’s an alternative, weaker notion of dominated strategy.
Definition
The strategy si * Si weakly dominates ti ;= si if it gives a weakly
higher payoff than ti against every strategy profile of the opponents:
u i ( s i , s 2 i ) > u i ( ti , t 2 i ) .
102 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 29
Iterated Elimination of Weakly Dominated Strategies
103 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 31
Iterated Elimination of Weakly Dominated Strategies
The notion of weak dominance leads to a procedure analogous to
iterated elimination of strictly dominated strategies (IEDS or S ∞ ),
where strategies that are weakly dominated are eliminated
sequentially.
· The corresponding concept is referred to as Iterated
elimination of weakly dominated strategies (IEWDS or SW ∞ ).
104 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 32
Iterated Elimination of Weakly Dominated Strategies
Consider the same game and start eliminating T :
1\2 L R 1\2 L R
T 1, 1 0, 0 T 1, 1 0, 0
Eliminate T : Eliminate L:
M 1, 1 2, 1 M 1, 1 2, 1
B 0, 0 2, 1 B 0, 0 2, 1
1\2 L R 1\2 L R
T 1, 1 0, 0 T 1, 1 0, 0
Eliminate T : Eliminate L:
M 1, 1 2, 1 M 1, 1 2, 1
B 0, 0 2, 1 B 0, 0 2, 1
105 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 33
Iterated Elimination of Weakly Dominated Strategies
Consider the same game and start eliminating T :
1\2 L R 1\2 L R
T 1, 1 0, 0 T 1, 1 0, 0
Eliminate T : Eliminate L:
M 1, 1 2, 1 M 1, 1 2, 1
B 0, 0 2, 1 B 0, 0 2, 1
1\2 L R 1\2 L R
T 1, 1 0, 0 T 1, 1 0, 0
Eliminate B: Eliminate R:
M 1, 1 2, 1 M 1, 1 2, 1
B 0, 0 2, 1 B 0, 0 2, 1
106 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 34
Iterated Elimination of Weakly Dominated Strategies
Recall:
Definition
The pure strategy si * Si is a weakly dominant strategy if
Weak dominance (or also IEWDS) may eliminate the unique Pareto
efficient Nash equilibrium.
1\2 A B 1\2 A B
A 2, 2 0, 2 A 2, 2 0, 2
B 2, 0 1, 1 B 2, 0 1, 1
107 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 35
Application: Duel
108 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 37
Where are we going?
Key concepts we learned are:
· Iterated elimination of strictly dominated strategies
· Strictly dominated and strictly dominant strategy
· Iterated elimination of weakly dominated strategies
· Weakly dominated and weakly dominant strategy
Next we study:
· Best response and rationalizable strategies
Then we study:
· Nash equilibrium in pure strategies (Topic 3).
IMPORTANT:
· Test 1 will be next Tuesday, February 6, in class.
· It will cover Topics 1 and 2.
· We will do it online through Aula Global.
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 38
Rationalizability
109 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 39
Rationalizability
Best Response
Definition
The pure strategy si * Si is a best response to the strategy
profile of the opponents s2i * S2i if:
110 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 41
Best Response
Definition
The pure strategy si * Si is a best response to the strategy
profile of the opponents s2i * S2i if:
Best Response
Definition
The pure strategy si * Si is a best response to the strategy
profile of the opponents s2i * S2i if:
111 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 41
Rationalizability
Rationalizability is a recursive concept based on three ideas:
· Rational players form beliefs about what other players play.
· They best respond to those beliefs.
· There is common knowledge of rationality.
Rationalizability
Rationalizability is a recursive concept based on three ideas:
· Rational players form beliefs about what other players play.
· They best respond to those beliefs.
· There is common knowledge of rationality.
Definition
The (pure) strategy si * Si is “a best response” if there exists a
strategy profile of the opponents s2i * S2i such that:
112 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 42
Rationalizability
Example 1. Consider the game:
1\2 a b c
x 1, 3 1, 0 2, 3
y 6, 1 0, 4 1, 0
Rationalizability
Example 1. Consider the game:
1\2 a b c
x 1, 3* 1*, 0 2*, 3*
y 6*, 1 0, 4* 1, 0
Rationalizability
Example 2. Hence if player 2’s strategy a is not rationalizable, then
we can eliminate it and obtain a smaller subgame, in which we can
further check for non-rationalizable strategies:
1\2 a b c
x 1, 1 1, 0 2, 3
y 6, 1 0, 4 1, 0
Now player 1’s strategy y is not rationalizable so we can eliminate it, and
then we can further eliminate player 2’s strategy b.
1\2 a b c 1\2 a b c
x 1, 1 1, 0 2, 3 x 1, 1 1, 0 2, 3
y 6, 1 0, 4 1, 0 y 6, 1 0, 4 1, 0
114 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 46
Rationalizability
Example 2. Hence if player 2’s strategy a is not rationalizable, then
we can eliminate it and obtain a smaller subgame, in which we can
further check for non-rationalizable strategies:
1\2 a b c
x 1, 1 1, 0 2, 3
y 6, 1 0, 4 1, 0
Now player 1’s strategy y is not rationalizable so we can eliminate it, and
then we can further eliminate player 2’s strategy b.
1\2 a b c 1\2 a b c
x 1, 1 1, 0 2, 3 x 1, 1 1, 0 2, 3
y 6, 1 0, 4 1, 0 y 6, 1 0, 4 1, 0
Rationalizability
Start again with:
Define recursively:
(1)
G (1) = (N, (Si )i *N , (ui )i *N )
where now
· Si(1) = set of strategies that are best responses for i in G .
Definition
The set Ri is set of rationalizable strategies of player i and
R = ×i *N Ri is the set of rationalizable strategy profiles of the
game G .
Note: Again Ri∞ does not depend on the order of elimination.
But does this differ and how does this differ from the set obtained with
iterated strict dominance? In general, Ri ;= Si∞ , because rationalizability
can eliminate strategies that are not strictly dominated (see Example 2).
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 48
Limits of Rationalizability
116 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 49
Application: Guessing Game or Beauty Contest Game
In the 1950s in the US, beauty contests were popular in the Sunday
newspapers. Given a series of photos of women, readers were asked to
cast a vote on the most beautiful. Prizes were distributed among the
readers that had voted for the woman that received most votes.
117 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 51
Why (Keynesian) beauty contest?
”It is not a case of choosing those [faces] that, to the best of one’s
judgment, are really the prettiest, nor even those that average opin-
ion genuinely thinks the prettiest. We have reached the third degree
where we devote our intelligences to anticipating what average opin-
ion expects the average opinion to be. And there are some, I believe,
who practice the fourth, fifth and higher degrees.”
Keynes, General Theory of Employment, Interest and Money, 1936.
See also:
· John Kay, “The parable of the ox” Financial Times, 24 July, 2012.
Note: The version in PS#2 is a discrete version with S = {1, 2, . . . , 100} and with a
different multiplicative factor defining the target average, and so is somewhat different
from the one on these slides (with S = [0, 100]).
118 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 53
Application: Guessing Game and Iterated Dominance
119 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 54
Some Behavioral Game Theory
Experimental Evidence
121 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 58
Application: Guessing Game and Level-k Reasoning
In general, the model of k-level reasoning assumes some 0-level for the irrational
players, also referred to as L0-players.
· L0 or level-0 players: these are “random”, “intuitive” or “spontaneous” players.
Then it considers various hierarchical types of players:
· L1 or level-1 players: these best respond to L0.
· L2 or level-2 players: these best respond to L1.
· L3 or level-3 players: these best respond to L2.
· etc.
The theory has been used to explain behavior in various lab experiments.
Recall:
· L0 chooses 50 (L0), L1 — 33.33 (A), L2 — 22.22 (B), L3 — 14.81 (C)
Applying iterated weak dominance gives:
· One round: 66.67 (D1), Two rounds: 44.44 (D2), Three rounds, four rounds, ...
∞ = S ∞ = 0 (EQ).
· The unique equilibrium prediction is SW
The winner of the actual game said 19 (target was 18.92).
122 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 59
Application: Guessing Game – Class Experiment (T3-2022)
Recall:
· L0 chooses 50 (L0), L1 — 33.33 (A), L2 — 22.22 (B), L3 — 14.81 (C)
Applying iterated weak dominance gives:
· One round: 66.67 (D1), Two rounds: 44.44 (D2), Three rounds, four rounds, ...
∞ = S ∞ = 0 (EQ).
· The unique equilibrium prediction is SW
The winner of the actual game said 23 (target was 22.99).
Recall:
· L0 chooses 50, L1 — 33.33, L2 — 22.22, L3 — 14.81
Applying iterated weak dominance gives:
· One round: 66.67 (D1), Two rounds: 44.44 (D2), Three rounds, four rounds, ...
∞ = S ∞ = 0 (EQ).
· The unique equilibrium prediction is SW
The winner of the actual game said 27.3 (target was 26.19).
123 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 2, Slide 61
Application: Guessing Game – Class Experiment (T3-2023)
In the guessing games done in class we have:
Recall:
· L0 chooses 50, L1 — 33.33, L2 — 22.22, L3 — 14.81
Applying iterated weak dominance gives:
· One round: 66.67 (D1), Two rounds: 44.44 (D2), Three rounds, four rounds, ...
∞ = S ∞ = 0 (EQ).
· The unique equilibrium prediction is SW
Fabrizio Germano
Winter 2024
124 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 1
Test 1
Preliminaries
We consider simultaneous games defined by the list:
G = (N, (Si )i *N , (ui )i *N )
where:
· N is the set of players
· Si is the set of pure strategies of player i
· A strategy profile s * S = S1 × · · · Sn is usually written as
s = ( s 1 , s 2 , . . . , sn ) .
· ui : S ³ R are utilities of player i.
We assume that N and each Si are finite and the players know G .
Players are utility maximizers with utilities ui .
Note: Here (in Topics 2, 3), we only consider pure strategies.
Later (in Topics 4, 5) we also consider so-called mixed strategies, where
players can choose randomizations over pure strategies.
125 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 3
Where are we going?
Key concepts we learned were:
· Iterated elimination of strictly dominated strategies
· Strictly dominant and strictly dominated strategy
· Iterated elimination of weakly dominated strategies
· Weakly dominant and weakly dominated strategy
· Best reponse and rationalizable strategies
Now, we’ll study:
· Nash equilibrium in pure strategies (Topic 3) and later in mixed
strategies (Topic 5)
· Strict Nash equilibrium
We’ll also briefly talk about:
· Risk dominance and strategic uncertainty
The concepts learned here will be used and revisited in the rest of this class and
in other classes in microeconomics, IO and elsewhere.
Nash Equilibrium
126 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 5
Example
Consider the following game:
1\2 D E F
A 2, 0 0, 3 0, 4
B 0, 0 1, 1 0, 0
C 0, 4 0, 3 2, 0
Example
Consider the following game:
1\2 D E F
A 2, 0 0, 3 0, 4
B 0, 0 1, 1 0, 0
C 0, 4 0, 3 2, 0
127 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 6
Example
Consider the following game:
1\2 D E F
A 2, 0 0, 3 0, 4
B 0, 0 1, 1 0, 0
C 0, 4 0, 3 2, 0
Example
Consider the following game:
1\2 D E F
A 2, 0 0, 3 0, 4
B 0, 0 1, 1 0, 0
C 0, 4 0, 3 2, 0
128 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 6
Best Response
· The basic ingredient in the concept of Nash equilibrium is that
of a best response to the opponents’ strategy profile.
Definition
The pure strategy si * Si is a best response to the strategy profile
of the opponents s2i * S2i if:
Best Response
· The basic ingredient in the concept of Nash equilibrium is that
of a best response to the opponents’ strategy profile.
Definition
The pure strategy si * Si is a best response to the strategy profile
of the opponents s2i * S2i if:
Formally:
· For player 1: u1 (B, E ) g u1 (A, E ) and u1 (B, E ) g u1 (C , E ).
· For player 2: u2 (B, E ) g u2 (B, C ) and u2 (B, E ) g u1 (B, F ).
130 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 8
Mutual Best Response – Example
1\2 D E F
A 2, 0 0, 3 0, 4
B 0, 0 1, 1 0, 0
C 0, 4 0, 3 2, 0
Formally:
· For player 1: u1 (B, E ) g u1 (A, E ) and u1 (B, E ) g u1 (C , E ).
· For player 2: u2 (B, E ) g u2 (B, C ) and u2 (B, E ) g u1 (B, F ).
Each player best responds to what the opponent actually plays!
Are there other such profiles of mutual best-responses?
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 8
Formally:
· For player 1: u1 (B, E ) g u1 (A, E ) and u1 (B, E ) g u1 (C , E ).
· For player 2: u2 (B, E ) g u2 (B, C ) and u2 (B, E ) g u1 (B, F ).
Each player best responds to what the opponent actually plays!
Are there other such profiles of mutual best-responses? No.
131 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 9
Best Response Correspondence
Nash Equilibrium
Definition
A strategy profile s = (s1 , . . . , sN ) * S is a Nash equilibrium if,
for every player i * N, si is a best response to s2i (si * BRi (s2i )),
that is, if, for every i * N,
132 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 11
Nash Equilibrium
Definition
A strategy profile s = (s1 , . . . , sN ) * S is a Nash equilibrium if,
for every player i * N, si is a best response to s2i (si * BRi (s2i )),
that is, if, for every i * N,
Nash Equilibrium
Definition
A strategy profile s = (s1 , . . . , sN ) * S is a Nash equilibrium if,
for every player i * N, si is a best response to s2i (si * BRi (s2i )),
that is, if, for every i * N,
Examples
134 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 13
Example: Pure Coordination Game
1\2 X Y
X 3, 3 0, 0
Y 0, 0 1, 1
1\2 X Y
X 3, 3 0, 0
Y 0, 0 1, 1
135 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 14
Example: Battle of the Sexes
Husband
Ballet Boxing
Wife Ballet 4*, 2* 0, 0
Boxing 0, 0 2*, 4*
136 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 16
Example: Bar scene from the film A Beautiful Mind
The payoffs are described in the following matrix for the two-player case:
1\2 Blonde Brunette A Brunette B
Blonde 0, 0 10, 5 10, 5
Brunette A 5, 10 0, 0 5, 5
Brunette B 5, 10 5, 5 0, 0
138 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 19
Example: Battle of the Sexes Revisited
Consider this version of the Battle of the Sexes game:
1\2 X Y
X 4*, 2* 0, 0
Y 0, 0 2*, 4*
1\2 X Y
X 4*, 2* 0, 0
Y 0, 0 2*, 4*
u1 ( X , X ) > u1 ( Y , X ) , 4 > 0
u2 (X , X ) > u2 (X , Y ), 2 > 0.
Both players strictly better off playing si = X against s2i = X .
139 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 20
Example: Battle of the Sexes Revisited
Consider this version of the Battle of the Sexes game:
1\2 X Y
X 4*, 2* 0, 0
Y 0, 0 2*, 4*
u1 ( X , X ) > u1 ( Y , X ) , 4 > 0
u2 (X , X ) > u2 (X , Y ), 2 > 0.
Both players strictly better off playing si = X against s2i = X .
· 2. Check that (Y , Y ) is strict:
u1 ( Y , Y ) > u1 ( X , Y ) , 2 > 0
u2 (Y , Y ) > u2 (Y , X ), 4 > 0.
Both players strictly better off playing si = Y against s2i = Y .
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 20
Weak dominance
· If the iterated elimination of weakly dominated strategies leads to a
unique profile surviving (SW∞ = {s }), then that profile is a Nash
141 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 22
Example: Split or Steal Revisited
The payoffs are described in the following matrix (in £1000’s):
1\2 Split Steal
Split 50, 50 0, 100
Steal 100, 0 0, 0
1. Check that (Split, Split ) is not a Nash equilibrium:
· si = Split is not a best response to s2i = Split for both players:
u1 (Split, Split ) s u1 (Steal, Split ), 50 s 100
u2 (Split, Split ) s u2 (Split, Steal ), 50 s 100.
· Both players are strictly better off deviating to Steal if s2i = Split.
2. Check that (Split, Steal ) is a Nash equilibrium:
143 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 23
Example: Split or Steal Revisited
The payoffs are described in the following matrix (in £1000’s):
1\2 Split Steal
Split 50, 50 0, 100
Steal 100, 0 0, 0
145 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 24
Example: Split or Steal Revisited
The payoffs are described in the following matrix (in £1000’s):
146 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 25
More Examples
1\2 X Z
X 3, 3 1, 4
Z 4, 1 2, 2
147 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 27
Example: Fishermen’s Dilemma
narrative from environmental evonomics
Consider two fishermen fishing in the same small pond.
They can choose to fish moderately (X ) or to fish a lot (Z ).
The payoffs are described in the following matrix:
1\2 X Z
X 3, 3 1, 4
Z 4, 1 2, 2
149 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 30
Example: Dove-Hawk Game II – Chicken Game
narrative from biology
Consider two predators competing or fighting for a prey and incurring a
cost from playing a hawkish strategy (Hawk) but facing an extra cost
when both play Hawk.
· If the both play Dove they share the prey.
· If one plays Hawk and the other Dove, then Hawk gets the prey
and incurs a cost of 1.
· If both play Hawk they each end up with 0.
1\2 Dove Hawk
Dove 3, 3 1, 4
Hawk 4, 1 0, 0
What are the Nash equilibrium profiles? Are they strict?
· There are two Nash equilibria: (Dove, Hawk ), (Hawk, Dove ).
· Both are strict.
Note: This version of the Dove-Hawk game is also called chicken game:
· (Hawk, Hawk ) is not a Nash equilibrium here.
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 30
150 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 31
Example: Dove-Hawk Game III – Chicken Game
narrative from biology
Consider two predators competing or fighting for a prey and incurring a
cost from playing a hawkish strategy (Hawk) but facing an even higher
cost when both play Hawk.
· If the both play Dove they share the prey.
· If one plays Hawk and the other Dove, then Hawk gets the prey
and incurs a cost of 1.
· If both play Hawk they each end up with 21 (serious injuries).
1\2 Dove Hawk
Dove 3, 3 1, 4
Hawk 4, 1 21, 21
What are the Nash equilibrium profiles? Are they strict?
· There are still two Nash equilibria: (Dove, Hawk ), (Hawk, Dove ).
· Both are strict.
Note: This version of the Dove-Hawk game is still a chicken game:
· (Hawk, Hawk ) is definitely not a Nash equilibrium here!
¡
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 31
151 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 32
Example: Dove-Hawk Game III – Chicken Game
narrative from (intl.) politics: Catalonia 2017, Cuba 1962 ... Ukraine 2022, 2023
Suppose they face even higher costs when the both play Escalate:
152 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 32
Example: Dove-Hawk Game III – Chicken Game
narrative from (intl.) politics: Catalonia 2017, Cuba 1962 ... Ukraine 2022, 2023
Suppose they face even higher costs when the both play Escalate:
1\2 S H
S 5, 5 0, 3
H 3, 0 3, 3
Suppose you are considering whether to play S, but you are unsure of
whether you opponent will also play S (strategic uncertainty):
· How confident should you be that your opponent will also play S?
Suppose you are considering whether to play S, but you are unsure of
whether you opponent will also play S (strategic uncertainty):
· How confident should you be that your opponent will also play S?
We look for the smallest p * [0, 1] such that, if you believe your opponent
will play S with at least probability p, then you also want to play S.
· Find smallest p such that:
p · 5 + (1 2 p ) · 0 = p · 3 + (1 2 p ) · 3 ó p = 3/5.
Suppose you are considering whether to play S, but you are unsure of
whether you opponent will also play S (strategic uncertainty):
· How confident should you be that your opponent will also play S?
We look for the smallest p * [0, 1] such that, if you believe your opponent
will play S with at least probability p, then you also want to play S.
· Find smallest p such that:
p · 5 + (1 2 p ) · 0 = p · 3 + (1 2 p ) · 3 ó p = 3/5.
1\2 A B
A 2, 2 0, 2
B 2, 0 1, 1
· What kind of (p, q )-dominance does the Nash eq. (A, A) have?
· What about (B, B )? (Find smallest p and q.)
155 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 35
Strategic Uncertainty and (p, q )-Dominance
Example 1. Consider the following game:
1\2 A B
A 2, 2 0, 2
B 2, 0 1, 1
· What kind of (p, q )-dominance does the Nash eq. (A, A) have?
· What about (B, B )? (Find smallest p and q.)
1\2 S H
S 6, 5 0, 2
H 4, 0 4, 2
1\2 A B
A 2, 2 0, 2
B 2, 0 1, 1
· What kind of (p, q )-dominance does the Nash eq. (A, A) have? (1,1)
1\2 S H
S 6, 5 0, 2
H 4, 0 4, 2
· What kind of (p, q )-dominance does (S, S ) have here? (2/3, 2/5)
· What about (H, H )? (Always find the smallest p’s and q’s.) (1/3, 3/5)
156 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 36
Application: Multi-Person Coordination Games
Consider the following coordination game:
· N players have to choose a number from the set {1, 2, . . . , 7}.
· The payoff to a player depends on his number and on the
lowest number chosen as follows:
157 / 339
UPF, Introduction to Game Theory, Winter 2024 Topic 3, Slide 38
Multi-Person Coordination Games – Experiments
Some statistics the two coordination games (A and B) above.
35 35
30 30
25 25
Choices in T1
Choices in T3
20 20
A A
15 15
B B
10 10
5 5
0 0
0 1 2 3 4 5 6 7 0 1 2 3 4 5 6 7
X X
· T1:
· Choosing 1: 6 in A and 35 in B
· Choosing 7: 5 in A and 36 in B
· Average payoff: 0.29 in A and 0.70 in B
· T3:
· Choosing 1: 20 in A and 18 in B
· Choosing 7: 20 in A and 34 in B
· Average payoff: 0.42 in A and 0.70 in B
Fabrizio Germano
Winter 2024
158 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 1
What would you play?
1\2 A B C
A 2, 2 3, 1 0, 2
B 1, 3 2, 2 3, 2
C 2, 0 2, 3 2, 2
1\2 A B C
A 2, 2 3, 1 0, 2
B 1, 3 2, 2 3, 2
C 2, 0 2, 3 2, 2
159 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 2
Nash equilibrium vs secure payoff
Example 1.
1\2 A B C
A 2, 2 3, 1 0, 2
B 1, 3 2, 2 3, 2
C 2, 0 2, 3 2, 2
Example 2.
1\2 A B
A 9, 10 8, 9.9
B 10, 10 -1000, 9.9
Example 2.
1\2 A B
A 9, 10 8, 9.9
B 10, 10 -1000, 9.9
161 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 6
Some objectives of Topic 4
162 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 8
Maxminimization in pure strategies
Recall:
Wi (si ) = min ui (si , sj )
sj * Sj
max Wi (si ).
si * Si
Player i’s pure security (or maxmin) payoff level (i.e., with pure
strategies) is maxsi *Si Wi (si ).
The security payoff level can also be written as
Example 3.
1\2 A B
A 4, 2 0, 6
B 2, 4 6, 0
163 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 10
Maxminimization in pure strategies
Example 3.
1\2 A B
A 4, 2 0, 6
B 2, 4 6, 0
164 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 12
Maxminimization in pure strategies
Mixed strategies
1\2 A B
G = A 4, 2 0, 6
B 2, 4 6, 0
165 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 14
Mixed strategies
1\2 A B
G = A 4, 2 0, 6
B 2, 4 6, 0
Player 1 In this game player 1 could flip a coin and play A half of
the time and B the other half.
· IF player 2 could somehow exactly predict the outcome of 1’s
coin tosses, then 1’s expected payoff is 1. (Why 1?)
· You might be better off with the pure security strategy
calculated above as it would guarantee a payoff of 2.
· BUT if, 1’s randomized strategy is unpredictable, then:
1 1 1 1 1 1 1 1
u1 , ,A = 4 + 2 = 3 and u1 , ,B = 0 + 6 = 3.
2 2 2 2 2 2 2 2
Player 1 can guarantee a higher expected payoff than the one he
could guarantee with his pure security strategy (which was 2).
· Can he do better? No. (Why?)
· What about player 2?
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 15
Mixed strategies
1\2 A B
G = A 4, 2 0, 6
B 2, 4 6, 0
1\2 A B
G = A 4, 2 0, 6
B 2, 4 6, 0
1 1
Could the randomized or mixed strategies above ( 2 2 for player
,
1 and 34 , 41 for player 2) be some kind of Nash equilibrium? Yes!
Mixed strategies
Definition
· Si = set of pure strategies of player i (with #Si = Ki ).
· si = (si1 , . . . , siKi ) is a pure strategy of i.
· Σi = set of mixed strategies of player i.
· σi = (σi1 , . . . , σiKi ) is a mixed strategy of i.
· σik * [0, 1] is the probability that σi assigns to sik * Si .
· Σi = is the set of probability distributions over Si .
Formally, for i = 1, 2:
n o
Σi = σi = (σi1 , . . . , σiKi ) * [0, 1]Ki σi1 +···+ σiKi =1 .
In the example above, since for both players we only have two
strategies (Ki = 2), we can write:
Σi = σi = (σi1 , σi2 ) * [0, 1]2 | σi2 = 1 2 σi1 , i = 1, 2.
168 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 20
Maxminimization in mixed strategies
Fix a two player game G = ({1, 2}, Σ1 × Σ2 , (u1 , u2 )).
Players are allowed to use mixed strategies σi * Σi , i = 1, 2.
· The worst expected payoff that player i can get is:
Wi (σi ) = min ui (σi , sj )
s j * Sj
W2 (σ2 ) = min{u2 (A, (σ21 , 1 2 σ21 )), u2 (B, (σ21 , 1 2 σ21 ))}
= min{2σ21 + 6(1 2 σ21 ), 4σ21 + 0(1 2 σ21 )}
3
4σ21 if σ21 f
1 1 4
= min{6 2 4σ2 , 4σ2 } =
6 2 4σ21 else
Example (cont.)
u1 u2
6 6
5 5
4 u1(Ã1,A) 4 u2(A,Ã2)
3 u1(Ã1,B) 3 u2(B,Ã2)
W1(Ã1) W2(Ã2)
2 2
1 1
Ã11 Ã21
0.2 0.4 0.6 0.8 1.0 0.2 0.4 0.6 0.8 1.0
Figure: Game G : The functions W1 (left) and W2 (right) (these are the
minimum of the red and blue lines) and player 1 and 2’s security strategies
(achieved at the maximum of W1 at σ11 = 21 at which player 1’s payoff is
v 1 = 3 for player 1 and achieved at the maximum of W2 at σ12 = 34 at which
player 2’s payoff is v 2 = 3 for player 2.
Figure: Game G : The saddle point of the game G with the plane u1 = 3.
Player 1’s payoffs are in green (player 2’s payoffs are u2 = 6 2 u1 ). The plane
at u1 = 3 shows the saddle point. Both player 1 and player 2 can guarantee a
payoff of 3. (Player 1 by playing σ1 = ( 12 , 21 ) and player 2 by playing
σ2 = ( 34 , 41 ).)
Example (cont.)
171 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 26
Two player zero-sum and strictly competitive games
A class of games where the maxminimization procedure makes a
lot of sense are the so-called strictly competitive games, formally:
Definition A strictly competitive game is a two-player game
where the following holds for any two pure strategy profiles
s, s 2 * S = S1 × S2 ,
u1 (s ) g u1 (s 2 ) if and only if u2 (s ) f u2 (s 2 ).
In other words, if one player is better off, then the other is worse
off.
· Notice that an implication is that in a strictly competitive
game we have for any two pure strategy profiles s, s 2 * S,
u1 (s ) = u1 (s 2 ) if and only if u2 (s ) = u2 (s 2 )
and
u1 (s ) > u1 (s 2 ) if and only if u2 (s ) < u2 (s 2 ).
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 27
1\2 A B C
1\2 A B
A 4, 4 3, 1 0, 2
A 9, 10 8, 9.9
B 1, 3 3, 3 3, 2
B 10, 10 -1000, 9.9
C 2, 0 2, 3 2, 2
1\2 A B 1\2 H T
A 4, 2 0, 6 H 1, -1 -1, 1
B 2, 4 6, 0 T -1, 1 1,-1
172 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 28
Two player zero-sum and strictly competitive games
A more narrow class, but one that shares a lot of properties in
common with the strictly competitive games are the zero-sum
games.
Definition A two player zero-sum game is a two-player game
where the following holds at any pure strategy profile s * S,
u1 ( s ) = 2 u2 ( s ) .
Or alternatively, u1 (s ) + u2 (s ) = 0; in other words, whatever one
player earns, the other loses.
· A slight generalization are the two player constant-sum games,
where the following holds at any pure strategy profile s * S,
u1 ( s ) + u2 ( s ) = C ,
for some constant C * R. When C = 0 you get back the
zero-sum games.
· Every constant-sum game is strictly competitive but not all
strictly competitive games are constant-sum.
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 29
· “As far as I can see, there could be no theory of games [...] without
that theorem [...] I thought there was nothing worth publishing until
the Minmax Theorem was proved.”
173 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 30
Minmaximization in Mixed Strategies
where
Bi (σj ) = max ui (si , σj ).
si * Si
Note:
u1 u2
6 6
5 5
4 u1(A,Ã2) 4 u2(Ã1,X)
3 u1(B,Ã2) 3 u2(Ã1,Y)
B1(Ã2) B2(Ã1)
2 2
1 1
Ã2X Ã1A
0.2 0.4 0.6 0.8 1.0 0.2 0.4 0.6 0.8 1.0
Figure: Game G : The functions B1 (left) and B2 (right) (these are the
maximum of the red and blue lines) and player 2 and 1’s strategies of forcing
the lowest acceptable payoffs (achieved at the minimum of B1 at σ12 = 43 at
which player 1’s payoff is v 1 = 3 for player 1 and achieved at the minimum of
B2 at σ11 = 21 at which player 2’s payoff is v 2 = 3 for player 2.
Or, alternatively stated, there exists a value v * R and mixed strategies, σ1 and σ2 ,
such that,
(i) given player 2’s strategy σ2 , the best possible payoff for player 1 is v
(= v 1 = u1 (σ1 , σ2 ) = maxσ1 *Σ1 u1 (σ1 , σ2 )),
(ii) given player 1’s strategy σ1 , the best possible payoff for player 2 is 2v
(= v 2 = u2 (σ1 , σ2 ) = maxσ2 *Σ2 u2 (σ1 , σ2 )).
The game does not have an equilibrium in pure strategies but since
it is a zero-sum game it does have a value v .
· What is the value of the game?
· How do yo compute it?
· From the theorem above it suffices to solve player 1’s
maxminimization problem.
· We compute player 1’s worst payoffs:
1
2σ11 2 1 if σ11 f
W1 (σ1 ) = min{1σ11 2 1(1 2 σ11 ), 21σ11 + 1(1 2 σ11 )} = 2
1 2 2σ11 else
· It is easy to check that this function is maximized at σ11 = 12 ,
which leads to an expected payoff for player 1 of 0.
· This is the value of the game.
176 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 36
Matching Pennies (cont.)
Consider the following game
1\2 H T
GMP = H 1, -1 -1, 1
T -1, 1 1,-1
Figure: Game GMP : The saddle point of the matching pennies game GMP with
the plane u1 = 0. Player 1’s payoffs are in green (player 2’s payoffs are
u2 = 2u1 ). The plane at u1 = 0 shows the saddle point. Both player 1 and
player 2 can guarantee a payoff of 0.
177 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 38
Matching Pennies (cont.)
Figure: Game GMP : The saddle point of the matching pennies game GMP with
the plane u2 = 0. Player 2’s payoffs are in red (player 1’s payoffs are
u1 = 2u2 ). The plane at u2 = 0 shows the saddle point. Again, both player 1
and player 2 can guarantee a payoff of 0.
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 39
u1(Ã1,Ã2)
u2(Ã1,Ã2)
v
Figure: Game GMP : The saddle point of the matching pennies game GMP with
both players’ payoffs. Player 1’s payoffs are in green and 2’s in red. The plane
at v = 0 shows the value. Both players guarantee a payoff of 0 and are playing
optimally.
178 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 40
Application: Dominance Solvable Game
Consider the following game
1\2 A B
GDS = A 2, -2 1, -1
B -2, 2 0, 0
2 2
1 1
u1(Ã1,A) u2(A,Ã2)
Ã11 Ã21
u1(Ã1,B) u2(B,Ã2)
0.2 0.4 0.6 0.8 1.0 W1(Ã1) 0.2 0.4 0.6 0.8 1.0 W2(Ã2)
-1 -1
-2 -2
Figure: Game GDS : The functions W1 (left)and W2 (right) which are the
minimum of the red and blue lines, and player 1’s security strategy achieved at
the maximum of W1 at σ11 = 1 at which player 1’s payoff is v 1 = 1; and player
2’s security strategy (achieved at the maximum of W2 at σ12 = 0 at which
player 2’s payoff is v 2 = 21.
180 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 43
Application: Military Attack – see also PS #3
Consider the following situation:
1\2 A B C
A 0 4 4
GMA =
B 3 0 3
C 2 2 0
σ11 = prob. 1 plays A, σ12 = prob. 1 plays B, σ13 = prob. 1 plays C = 1 2 σ11 2 σ12
σ21 = prob. 2 plays A, σ22 = prob. 2 plays B, σ23 = prob. 2 plays C = 1 2 σ21 2 σ22
181 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 45
Military Attack (cont.)
From the theorem above it suffices to solve player 1’s
maxminimization problem.
· As we did with the previous examples, we compute player 1’s
worst payoffs (where here: σ1 = (σ11 , σ12 , 1 2 σ11 2 σ12 )):
W1 (σ1 ) = min{u1 (σ1 , A), u1 (σ1 , B ), u1 (σ1 , C )}
= min{0σ11 + 3σ12 + 2(1 2 σ11 2 σ12 ), 4σ11 + 0σ12 + 2(1 2 σ11 2 σ12 ),
4σ11 + 3σ12 + 0(1 2 σ11 2 σ12 )}
= min{2 2 2σ11 + σ12 , 2 + 2σ11 2 2σ12 , 4σ11 + 3σ12 }
Figure: Game GMA : The three planes 2 2 2σ11 + σ12 , 2 + 2σ11 2 2σ12 , and
3 4
4σ11 + 3σ12 defining the function W1 . These intersect at (σ11 , σ12 ) = ( 13 , 13 ).
182 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 47
Military Attack (cont.)
· It can be checked that the security strategies for the two
players are:
3 4 6 7 5 1
( σ1 , σ2 ) = , , , , , .
13 13 13 13 13 13
· These imply that player 1 can guarantee himself a payoff of
24
v = v 1 = 13 j 1.8 and player 2 a payoff of v 2 = 2v .
England vs. Italy, penalty shootout, UEFA Eurocup final, July 2021
Proof To see the first statement, notice that from the definition of a mixed
Nash equilibrium,
and in particular,
Will the worker always work? Does the employer have to monitor? This game
does not have a Nash equilibrium in pure strategies and is not zero-sum.
185 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 53
Inspection Game (cont.)
The security strategies of the players are easily computed as:
σ1 = (1, 0) and σ2 = (1, 0).
The employer always monitors and the worker always works.
Player 2’s maxmin payoff (= 210) does not coincide with his payoff at the
security strategy profile (= 90).
Moreover, given 1’s security strategy to work, 2 prefers not to monitor.
· The game is not zero-sum and does not satisfy the Minmax Theorem.
· The profile ((1, 0),(1, 0)) or (Work, Monitor ) leads to payoffs of (50, 90)
but is not optimal for both players and hence not Nash.
· There is no pure Nash equilibrium.
· As you show in PS #4, the unique Nash equilibrium is in mixed strategies:
9 1
σNE = (( 10 , 10 ), ( 12 , 21 )) with payoffs (50,80).
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 54
80
50
u1(Ã1,Monitor) u2(Work,Ã2)
60
Ã21
u1(Ã1,Not monitor) u2(Not work,Ã2)
40 W1(Ã1) 0.2 0.4 0.6 0.8 1.0 W2(Ã2)
20 -50
Ã11 -100
0.2 0.4 0.6 0.8 1.0
Figure: Game GI : Player 1 and 2’s maxmin strategies: σ1 = (1, 0)(= Work ) and
σ2 = (1, 0)(= Monitor ) with maxmin payoffs (v 1 , v 2 ) = (50, 210).
Figure: Game GI : There is no saddle point. Maxmin strategies differ from Nash
9 1 1 1
equilibrium, where σ1 = ( 10 , 10 ) and σ2 186
= (/ 339
2 , 2 ) yielding payoffs (50, 80).
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 55
Application: Another Non-Zero-Sum Game
1\2 C D
A 1, 0 1, 100
B 0, 0 100, 100
1\2 C D
A 1, 0 1, 100
B 0, 0 100, 100
187 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 4, Slide 56
Introduction to Game Theory
Topic 5. Nash Equilibrium II
Fabrizio Germano
Winter 2024
Test 2
It will be done in class online through Aula Global and will cover:
188 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 2
Preparing the Game Projects
189 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 4
Game Project: Identify an interaction as a game!
Most important for the project:
· Find an interaction that you are interested in analyzing!
· Remember it can be taken from any field or area of life.
· It can but does not have to be taken from economics or business.
190 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 4
Game Project: Identify an interaction as a game!
Most important for the project:
· Find an interaction that you are interested in analyzing!
· Remember it can be taken from any field or area of life.
· It can but does not have to be taken from economics or business.
Assuming you have identified an interaction you are interested in, what next?
· You need to specify the game.
191 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 4
Game Project: Identify an interaction as a game!
Most important for the project:
· Find an interaction that you are interested in analyzing!
· Remember it can be taken from any field or area of life.
· It can but does not have to be taken from economics or business.
Assuming you have identified an interaction you are interested in, what next?
· You need to specify the game.
192 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 5
Game Project: What are the basic elements of a game?
More specifically...
193 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 5
Game Project: What are the basic elements of a game?
More specifically...
194 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 5
Game Project: What is (often) assumed about behavior?
195 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 6
Game Project: What is (often) assumed about behavior?
Rationality
· Players have well-defined preferences over all possible outcomes.
· Every player calculates a strategy that best serves (or maximizes) those
preferences.
Information, Beliefs, Knowledge and Common Knowledge
· All players have a “common understanding” of the rules of the game.
This includes:
· the list of players;
· the actions and information available to every player;
· the utilities/payoffs of all players for every possible combination of
actions; and
· that every player is rational.
197 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 8
Nash Equilibrium with a Continuum of Strategies
We now allow strategy sets to be continuous, like Xi = [0, 1] or Xi = R + .
· The basic ingredient to the concept of Nash equilibrium remains that of a
best response to the opponents’ strategy.
But with a continuum of strategies:
· We do not consider mixed strategies.
· Player’s best responses will typically be unique (“well-behaved” games) so
we can define best response functions (also called reaction functions).
Definition
The strategy xi * Xi is a best response to the strategy profile of the
opponents x2i * X2i if:
ui (xi , x2i ) g ui (xi2 , x2i ), for every xi2 * Xi .
In “well-behaved” games, this defines a best response function:
fi : X2i ³ Xi , x2i 7³ fi (x2i ) = argmaxxi2 *Xi u (xi2 , x2i ),
also called player i’s reaction function.
198 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 10
Application: Duopoly with differentiated products
Consider the following situation.
· Two producers, Apple and Samsung, produce similar but different
products, also called differentiated products.
· The costs for a unit of their product are cA = 20 and cS = 10.
· If they charge prices pA , pS * R + respectively, then:
· Apple faces the following demand for its product:
x A (pA , pS ) = 240 2 2pA + pS .
· Samsung faces the following demand for its product:
x S (pA , pS ) = 120 2 2pS + pA .
· Suppose they both maximize profits and the strategic variables are
their own price (pA for Apple and pS for Samsung).
· What is the game that this defines?
· What is the Nash equilibrium?
199 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 11
Application: Duopoly with differentiated products
To visualize the Nash equilibrium, we can plot the firms’ reaction functions:
· fA (pS ) = 70 + pS /4
62
58
52
· This gives pS = 4pA 2 280.
80 82 84 86 88 90
p A (Price charged by Apple)
4500
£S (Profits of Samsung)
8000
£A (Profits of Apple)
4000
1500
2000
30 40 50 60 70 80 90 100 20 30 40 50 60 70 80
p A (Price charged by Apple) p S (Price charged by Samsung)
200 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 13
Preliminaries
We consider simultaneous games defined by the list:
We assume that N and each Si are finite and the players know G .
Players are expected utility maximizers with vNM utilities ui .
The mixed extension of G , where each player can choose a mixed
strategy σi * Σi , is well-defined. Recall:
n o
· Σi = σi = (σi1 , . . . , σiKi ) * RKi ∑K k k
k =1 σi = 1, σi g 0, k = 1, . . . , Ki
i
Preliminaries
In the mixed extension of a game, player i’s expected payoff at a given strategy
profile σ * Σ is given by:
Ui (σ) = ∑ ui (s )Πj *N σj (sj ),
s *S
where s = (s1 , . . . , sn ) is a pure strategy profile and σj (sj ) is the probability
that player j’s mixed strategy σj assigns to his pure strategy sj .
Example. Consider the following game G (battle of the sexes) with the
distribution over outcomes π induced by the mixed strategy profile
σ = (σ1 , σ2 ) = (( 32 , 13 ), ( 41 , 43 )) (also called joint frequencies of play):
Definition
The mixed strategy σi * Σi is a best response to the mixed strategy
profile of the opponents σ2i * Σ2i if:
ui (σi , σ2i ) g ui (τi , σ2i ), for all τi * Σi .
· BRi (σ2i ) = is the set of all mixed strategy best responses of player i to
the profile σ2i .
· It defines a correspondence, BRi : Σ2i ó Σi , σ2i 7³ BRi (σ2i ), where
BRi (σ2i ) is a nonempty, compact and convex valued set.
Definition
A strategy profile σ = (σ1 , . . . , σN ) * Σ is a Nash equilibrium if, for
every player i * N, σi is a best response to σ2i .
· σ * Σ is a strict Nash equilibrium if, moreover, for every i * N,
ui (σi , σ2i ) > ui (τi , σ2i ), for all τi ;= σi .
0.8
n 0.6
0.2
0.0
0.0 0.2 0.4 0.6 0.8 1.0
0.8
n 0.6
0.2
0.0
0.0 0.2 0.4 0.6 0.8 1.0
where σ1 = (σ1X , σ1Y ) = (σ1X , 1 2 σ1X ) and σ2 = (σ2U , σ2V ) = (σ2U , 1 2 σ2U ).
202 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 17
Example 1: Best Response and Nash Equilibrium
Consider the prisoner’s dilemma game:
1\2 U V
X 3, 3 1, 4
Z 4, 1 2, 2
0.8
0.6
BR1 (Ã2 U )
0.4
BR2 (Ã1 X )
0.2
NE
0.0
0.0 0.2 0.4 0.6 0.8 1.0
They are: A: ((0, 1), (0, 1)), that is, 1 plays Z and 2 plays V .
1\2 X Y
A 1, 1 0, 0
B 0, 0 2, 3
203 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 19
Example 2: Best Response and Nash Equilibrium
Consider the coordination game:
1\2 X Y
A 1, 1 0, 0
B 0, 0 2, 3
σ2X <
ù
0 if 3
Ã2 X (Player 2's probability of X)
ü
ú 0.8
0.0
0.0 0.2 0.4 0.6 0.8 1.0
204 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 20
Example 2: Best Response and Nash Equilibrium
Consider the coordination game:
1\2 X Y
A 1, 1 0, 0
B 0, 0 2, 3
σ2X <
ù
0 if 3
0.0
0.0 0.2 0.4 0.6 0.8 1.0
σ1A <
ù
0 if 4
0.0
0.0 0.2 0.4 0.6 0.8 1.0
1\2 X Y
A 1, 1 0, 0
B 0, 0 2, 3
0.8
0.6 NE3
BR1 (Ã2 X )
0.4
BR2 (Ã1 A )
0.2
NE1
0.0
0.0 0.2 0.4 0.6 0.8 1.0
205 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 21
Example 2: Best Response and Nash Equilibrium
Consider the coordination game:
1\2 X Y
A 1, 1 0, 0
B 0, 0 2, 3
0.8
0.6 NE3
BR1 (Ã2 X )
0.4
BR2 (Ã1 A )
0.2
NE1
0.0
0.0 0.2 0.4 0.6 0.8 1.0
They are: NE1: ((0, 1), (0, 1)), NE2: ((1, 0), (1, 0)), NE3: (( 34 , 41 ), ( 32 , 31 ))
with payoffs, respectively: NE1: (2, 3), NE2: (1, 1), NE3: ( 23 , 43 ).
206 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 22
Example 3 (α = 0): Best Response and Nash Equilibrium
Consider the game:
1\2 s21 s22
s11 1, 1 0, 0
s12 0, 0 0, 0
σ21 = 0 σ11 = 0
( (
[0, 1] if [0, 1] if
σ11 (σ21 ) = σ21 (σ11 ) =
1 if σ21 > 0 1 if σ11 > 0
1.0
B
Ã21 (Player 2's probability of s2 1 )
0.8
0.6
BR1(Ã2)
0.4
BR2(Ã1)
0.2
A
0.0
0.0 0.2 0.4 0.6 0.8 1.0
· What are the Nash equilibria? A: ((0, 1), (0, 1)), B: ((1, 0), (1, 0)).
· What are the players’ best response correspondences?
1 and 2’s best response correspondences BR1 (σ2 ), BR2 (σ1 ):
σ21 = 0 σ11 = 0
( (
[0, 1] if [0, 1] if
σ11 (σ21 ) = σ21 (σ11 ) =
1 if σ21 > 0 1 if σ11 > 0
1.0
B
Ã21 (Player 2's probability of s2 1 )
0.8
0.6
BR1(Ã2)
0.4
BR2(Ã1)
0.2
A
0.0
0.0 0.2 0.4 0.6 0.8 1.0
207 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 23
Example 3 (α): Best Response and Nash Equilibrium
Consider the following variation of the previous game:
1\2 s21 s22
s11 1, 1 0, 0 for α * R.
s12 0, 0 α, α
208 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 24
Example 3 (α): Best Response and Nash Equilibrium
Consider the following variation of the previous game:
1\2 s21 s22
s11 1, 1 0, 0 for α * R.
s12 0, 0 α, α
1.0
0.8
0.6
0.4
0.2
0.0
209 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 25
Properties of Nash Equilibrium
Lemma
If σ * Σ is a mixed Nash equilibrium profile, then for all i * N:
2 2
ui (sik , σ2i ) = ui (sik , σ2i ), for all sik , sik * supp(σi ).
210 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 27
Properties of Nash Equilibrium
Lemma
If σ * Σ is a mixed Nash equilibrium profile, then for all i * N:
2 2
ui (sik , σ2i ) = ui (sik , σ2i ), for all sik , sik * supp(σi ).
In words:
· Any two pure strategies of a player (sik , sik 2 * Si ), used with
strictly positive probability at a Nash equilibrium (σik , σik 2 > 0),
must yield the same expected payoff (ui (sik , σ2i ) = ui (sik 2 , σ2i )).
211 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 28
Example 4: Best Response and Nash Equilibrium
Consider the matching pennies game:
1\2 s21 s22
s11 1, 21 21, 1
s12 21, 1 1, 21
1
σ21
ü
1 if
û
> 2 0.2
0.0
0.0 0.2 0.4 0.6 0.8 1.0
ü
ú 2 0.8
1
σ21
ü
1 if
û
> 2 0.2
0.0
0.0 0.2 0.4 0.6 0.8 1.0
ü
ú 2 0.8
1
σ11
ü
0 if
û
> 2 0.2
0.0
0.0 0.2 0.4 0.6 0.8 1.0
0.6
BR1 (Ã2 1 )
0.4 NE
BR2 (Ã1 1 )
0.2
0.0
0.0 0.2 0.4 0.6 0.8 1.0
0.8
0.6
BR1 (Ã2 1 )
0.4 NE
BR2 (Ã1 1 )
0.2
0.0
0.0 0.2 0.4 0.6 0.8 1.0
1.0 1.0
0.5 0.5
u1(Ã1,s21 ) u2(s11 ,Ã2)
Ã11 Ã21
u1(Ã1,s22 ) u2(s12 ,Ã2)
0.2 0.4 0.6 0.8 1.0 0.2 0.4 0.6 0.8 1.0
W1(Ã1) W2(Ã2)
-0.5 -0.5
-1.0 -1.0
214 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 31
Example 4.1: Variations on Matching Pennies (see PS#5)
Consider this variation of the matching pennies game:
1\2 s21 s22
s11 2, 21 21, 1
s12 21, 1 1, 21
0.5
0.5
u1(Ã1,s21 ) u2(s11 ,Ã2)
Ã11
Ã21
2
u1(Ã1,s2 ) u2(s12 ,Ã2)
0.2 0.4 0.6 0.8 1.0
W1(Ã1) 0.2 0.4 0.6 0.8 1.0
W2(Ã2)
-0.5 -0.5
-1.0
-1.0
σ21 <
ù
0 if
Ã21 (Player 2's probability of s2 1 )
ü
ú 5 0.8
0.4 BR1(Ã2)
2
σ21
ü
1 if
û
> 5 0.2
0.0
0.0 0.2 0.4 0.6 0.8 1.0
σ11 <
ù
1 if
Ã2 1 (Player 2's probability of s2 1 )
ü
ú 2 0.8
0.0
0.0 0.2 0.4 0.6 0.8 1.0
Player 1’s best response correspondence changes while 2’s does not.
215 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 32
Example 4.1: Variations on Matching Pennies
There is a unique fully mixed Nash equilibrium:
· A’: (( 21 , 12 ), ( 25 , 35 )) with expected payoffs of (1/5, 0).
· Note that only player 2’s strategies change (slightly less weight on s21 to
make 1 indifferent).
1.0
BR1(Ã2)
s12
A'
21, 1 1, 21 0.4
BR2(Ã1)
0.2
0.0
0.0 0.2 0.4 0.6 0.8 1.0
s21 s22
0.8
1\2
s11 2, 21 21, 1 0.6
BR1(Ã2)
s12
A'
21, 1 1, 21 0.4
BR2(Ã1)
0.2
216 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 33
Example 4.1: Variations on Matching Pennies
There is a unique fully mixed Nash equilibrium:
· A’: (( 21 , 12 ), ( 25 , 35 )) with expected payoffs of (1/5, 0).
· Note that only player 2’s strategies change (slightly less weight on s21 to
make 1 indifferent).
1.0
BR1(Ã2)
s12
A'
21, 1 1, 21 0.4
BR2(Ã1)
0.2
217 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 34
Example 4.2: Variations on Matching Pennies
Consider this further variation of the matching pennies game:
1.0
0.2
A''
0.0
0.0 0.2 0.4 0.6 0.8 1.0
0.8
0.2
A''
0.0
0.0 0.2 0.4 0.6 0.8 1.0
218 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 34
Example 4.2: Variations on Matching Pennies
Consider this further variation of the matching pennies game:
1.0
0.2
A''
0.0
0.0 0.2 0.4 0.6 0.8 1.0
Ã11
0.2 0.4 0.6 0.8 1.0
219 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 35
Example 4.2: Variations on Matching Pennies
Consider this further variation of the matching pennies game:
u1
Ã11
0.2 0.4 0.6 0.8 1.0
Ã11
0.2 0.4 0.6 0.8 1.0
Ã11
0.2 0.4 0.6 0.8 1.0
-15
-20
221 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 38
Example 4.3: Variations on Matching Pennies
Consider this final(!) zero sum variation of the matching pennies game:
u2
-15
-20
-15
-20
222 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 5, Slide 38
Example 4.3: Variations on Matching Pennies
Consider this final(!) zero-sum variation of the matching pennies game:
1.0
0.2
A'''
0.0
0.0 0.2 0.4 0.6 0.8 1.0
0.8
0.2
A'''
0.0
0.0 0.2 0.4 0.6 0.8 1.0
Fabrizio Germano
Winter 2024
225 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 1
Test 2
It will be done in class online through Aula Global and will cover:
226 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 3
Game Project: Identify an interaction as a game!
Most important for the project:
· Find an interaction that you are interested in analyzing!
· Remember it can be taken from any field or area of life.
· It can but does not have to be taken from economics.
Assuming you have identified an interaction you are interested in, what next?
· You need to specify the game.
227 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 5
Game Project: What is (often) assumed about behavior?
Rationality
· Players have well-defined preferences over all possible outcomes.
· Every player calculates a strategy that best serves (or maximizes) those
preferences.
Information, Beliefs, Knowledge and Common Knowledge
· All players have a “common understanding” of the rules of the game.
This includes:
· the list of players;
· the actions and information available to every player;
· the utilities/payoffs of all players for every possible combination of
actions; and
· that every player is rational.
Keep in mind that there are important branches of game theory (such as
behavioral game theory) that study bounded rationality in games. This includes
studying limitations on individuals’ formation of beliefs.
228 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 7
Where are we going?
229 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 9
Sequential Games with Perfect Information
Sequential Games...
230 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 10
Sequential Games with Perfect Information
... with Perfect Information
231 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 10
Strategy = Complete Contingent Plan!
Very important:
· A strategy is a complete contingent plan for a given player.
· This means that it needs to specify an action at every decision
node of that player.
So in general, a strategy ;= (is not):
· the moves of a player (individual actions taken by a player) at decision nodes
along a game path;
And a strategy profile ;= (is not):
· a game path (sequence of moves taken in a game);
· the final outcome (obtained vector of payoffs of a game) of a game path.
232 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 12
Writing down a sequential game...
What is the game the Good, the Bad and the Ugly are playing?
Assuming they all have perfect aim and exactly one bullet each!
233 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 14
Writing down a sequential game...
What is the game the Good, the Bad and the Ugly are playing?
234 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 14
Writing down a sequential game...
Some different two-player duels:
1\2 2± 2³1
1. Simutaneous: 1± [12] [2]
1³2 [1] [∅]
1\2 2± 2³1
1. Simutaneous: 1± [12] [2]
1³2 [1] [∅]
235 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 15
Writing down a sequential game...
Some different two-player duels:
We will compute and describe the equilibria later (see also Problem Set #5).
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 18
237 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 19
Example 1: The Senate Race Game
The Game Tree is a graphical description of the game.
· It is useful for analyzing sequential games.
· It is also referred to as the extensive form of the game.
· It has decision nodes (associated with a player), branches (associated with
an action) and terminal nodes (associated with a payoff for each player).
enter 1, 1
bob
advertise
retire 3, 3
alice
enter 2, 4
do not advertise
bob
retire 4, 2
Model:
Players Alice and Bob.
Actions Alice’s action set is {a, d }. Bob’s action set is {e, r }.
Strategies Alice’s strategy set is {a, d }. Bob’s strategy set is
{e, r } × {e, r } = {(e, e ), (e, r ), (r , e ), (r , r )}.
Information There is perfect information: all players see previous
actions chosen by other players (no hidden actions).
Solution:
Equilibrium Each player is using a strategy that is a best response
to the other players’ strategies (next slide).
Equilibrium path The actual sequence of actions taken when all
players play their equilibrium strategies (next slide).
238 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 21
Backward Induction Equilibrium
enter 1, 1
bob
advertise
retire 3, 3
alice
enter 2, 4
do not advertise
bob
retire 4, 2
239 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 23
Example 1: The Senate Race Game
· Firm B is the only grocery store in the city and makes profits of 20e.
· Firm A has an option to make a profit of 5e by investing in an
independent project, but is also considering to enter the grocery business
and to compete for a share of the profits of B.
· Entry requires an investment (sunk cost) of 3e.
· When firm B learns about A’s intentions, B can threaten with a price war
and sell everything at cost price, making zero profits. In this case firm A
would also makes zero profits and lose 3e (the fixed costs).
But firm B can also decide to accommodate and share the profits
peacefully with A.
· The incumbent firm B makes a decision knowing what A has decided.
240 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 25
Example 2: An Entry Game
241 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 27
Example 2: An Entry Game
enter 1, 1
bob
advertise
retire 3, 3
alice
enter 2, 4
do not advertise
bob
retire 4, 2
244 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 33
Example 3: The Centipede Game
Players 1 and 2 choose whether to go down (D) or across (A).
245 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 34
Example 3: The Centipede Game
Players 1 and 2 choose whether to go down (D) or across (A).
247 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 37
Second mover advantage
Here [ij ] means players i and j survive, and ∅ means none survive.
248 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 39
Example 4: Threeway Duel Game
Game tree: Assume that:
· player 1 moves first, then 2, then 3
· they have perfect aim and one bullet each!
Preferences: Players prefer to survive and that as few of their opponents
survive. To get uniqueness, suppose also that 3 really dislikes 2.
Efficiency
249 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 41
Exercise: Determine the BIE and the efficient outcomes
Consider the game
250 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 43
Example 5: The Borrower-Lender Game
· The bank, anticipating that the firm will default (d), does not
give a loan (n).
· If instead of playing (n, d ), they would have played (y , p ) they
would have obtained 5 each, instead of 0 each.
· The bank, anticipating that the firm will default (d), does not
give a loan (n).
· If instead of playing (n, d ), they would have played (y , p ) they
would have obtained 5 each, instead of 0 each.
Basic Lesson
Rational behavior can lead to socially inefficient outcomes.
251 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 44
Example 5.1: The Borrower-Lender Game – variant 1
Suppose that now the lender can sue the debtor (s) if he refuses to
pay back the loan. Very often the case is won and the debtor is
forced to return the loan and also pay a penalty of 20e to the
government. However, in 40% of the cases the case is lost. The
litigation costs (attorney fees, delays, etc.) for the lender are 75e.
252 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 46
Example 5.1: The Borrower-Lender Game – variant 1
Suppose that now the lender can sue the debtor (s) if he refuses to
pay back the loan. Very often the case is won and the debtor is
forced to return the loan and also pay a penalty of 20e to the
government. However, in 40% of the cases the case is lost. The
litigation costs (attorney fees, delays, etc.) for the lender are 75e.
No, not really, the bank is not in a better position and will still
prefer not to give the loan (n, f ).
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 46
253 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 47
Example 5.2: The Borrower-Lender Game – variant 2
Consider another mechanism: if the debtor loses the case, instead of
paying a fine to the government, he has to pay all litigation costs.
Yes, now bank has an incentive to sue, and gives the loan (y , s ).
· Slightly changing the game can lead to better outcomes.
Yes, now bank has an incentive to sue, and gives the loan (y , s ).
· Slightly changing the game can lead to better outcomes.
Basic Lesson
Appropriately designed legal institutions to enforce contracts can prevent
bad outcomes.
254 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 47
Credible Commitments
· A party in a conflict can strengthen its position by cutting off some of its
options to make its threats more credible (e.g., an army that burns the
bridge behind it making retreat impossible).
· A famous example of this tactic is when Hernán Cortés had his men burn
and sink their ships.
· With a force of fewer than six hundred men, Hernán Cortés invaded and
conquered Moctezuma’s Aztec empire.
· The natives of the East Coast told Cortés about the great wealth of the
Aztec Empire, farther into the interior. To prevent his own small force
from deserting because many feared venturing inland, Cortés burned his
ships, leaving no means of escape.
· Let’s see if we can give a game-theoretic explanation of Cortés’ decision...
255 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 49
Hernán Cortés’ original game
Payoffs are listed in alphabetic order (Cortés first and Moctezuma second),
where f denotes “fight” and r “run away”.
· What would happen if Cortés threatens to fight back if Moctezuma
decides to fight?
Payoffs are listed in alphabetic order (Cortés first and Moctezuma second),
where f denotes “fight” and r “run away”.
· What would happen if Cortés threatens to fight back if Moctezuma
decides to fight?
· Moctezuma would be better off running away.
· Is Cortés’ threat to fight credible?
256 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 50
Hernán Cortés’ original game
Payoffs are listed in alphabetic order (Cortés first and Moctezuma second),
where f denotes “fight” and r “run away”.
· What would happen if Cortés threatens to fight back if Moctezuma
decides to fight?
· Moctezuma would be better off running away.
· Is Cortés’ threat to fight credible?
No, but if Cortés actually commits and burns and sinks his ships, so that
running away is no longer an option or him, then it is credible.
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 50
257 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 51
Hernán Cortés’ game after burning the ships
· Suppose now that Cortés burns his ships, which means that running away
becomes impossible for him.
· If the ships are burned, then running away is not an option, which means
that Cortés’ only remaining option is to fight (f ).
258 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 52
Hernán Cortés’ complete game
In the complete game, Hernán Cortés moves first: he decides whether to burn
the ships, B, or whether not to burn them, NB.
260 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 55
Senate Race Game Revisited
· Consider the senate race game analyzed analyzed earlier: an
incumbent senator (Alice) decides whether to use a preemptive
campaign against a potential challenger (Bob), who has to choose
whether to enter the race.
enter 1, 1
bob
advertise
retire 3, 3
alice
enter 2, 4
do not advertise
bob
retire 4, 2
· This strategy profile (do not adv., enter) gives rise to a path (or
sequence of actions) leading to a single terminal node.
· The payoffs associated to that terminal node are (2, 4) and have to
be put in the cell at row do not adv. and column (retire, enter).
261 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 57
The Senate Race Game – normal form
alice
enter 2, 4
do not advertise
bob
retire 4, 2
alice
enter 2, 4
do not advertise
bob
retire 4, 2
262 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 59
The Senate Race Game – normal form
Senate race game in extensive form:
enter 1, 1
bob
advertise
retire 3, 3
alice
enter 2, 4
do not advertise
bob
retire 4, 2
263 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 60
Application: Ultimatum Game – extensive form
264 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 62
Application: Ultimatum Game – extensive form
· Offers below 10% of the total sum at stake are very rare.
· The median offer (half of the proposers offer less than that
and the other half more) is in the 40-50% range.
265 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 64
Experimental results ECO/ADE – 2023
Groups T1 and T3 together:
Probabilities:
T1+T3: 0–10 11–20 21–30 31–40 41–50 51–60 61–70 71–80 81–90 91–100
Prop. 5.5 3.1 13.2 24.3 32.0 14.1 5.5 2.3 0.0 0.0
Resp. 17.9 4.9 10.6 17.9 34.1 8.1 2.4 1.6 1.6 0.1
Probabilities:
T1+T3: 0–10 11–20 21–30 31–40 41–50 51–60 61–70 71–80 81–90 91–100
Prop. 7.0 5.4 8.5 17.1 43.4 11.6 3.1 3.9 0.8 3.1
Resp. 16.8 6.2 13.3 17.7 27.4 11.5 4.4 1.8 1.6 0.8
266 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 66
Application: Ultimatum Game – experimental evidence
Experimental results ECO/ADE (previous years knowing BIE)
267 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 68
Application: Ultimatum Game – experimental evidence
Some possible explanations
268 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 69
Application: Ultimatum Game – experimental evidence
Some possible explanations
269 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 70
Dictator Games and Revealed Social Preferences
Giving according to GARP (Andreoni and Miller, 2002)
From Problem Set #3, Question 5 in Micro 1:
In their paper, Andreoni and Miller (2002) test whether subjects behave
rationally when deciding over opportunities of sharing a surplus with another
(anonymous) individual. Subjects are offered various bundles of the form
(x1 , x2 ) where:
· x1 is payment to the subject (payment to self) and
· x2 is payment to an anonymous other (payment to other).
Subjects are presented different budget lines corresponding to incomes and
prices of sharing the surplus. For each of the presented sets of bundles (each
on a different budget line), subjects are asked to select their preferred one.
The study finds three main types of individuals:
· selfish types (47.2%);
· perfect complements types, with fixed preference for fairness (30.4%);
· perfect substitutes types, showing willingness to substitute allocation to
self with allocation to other depending on relative price (22.4%).
Which of the following figures corresponds to which type of subjects?
100 100
50 50
0 0
0 50 100 150 0 50 100 150
Payment to self (x1 ) Payment to self (x1 )
100
50
0
0 50 100 150
Payment to self (x1 )
Figure: From social anthropology book on the origin and history of inequality
Also recommended: Bowles and Gintis (2012) A Cooperative Species: Human
Reciprocity and its Evolution, Princeton University Press.
UPF – Introduction to Game Theory, Winter 2024 Topic 6, Slide 72
Fabrizio Germano
Winter 2024
271 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 1
Sequential Games with Imperfect Information
We used
· Game trees (extensive forms) to represent sequential move games.
· Game matrices (normal forms) to represent simultaneous move games.
· However, many strategic situations contain elements of both types of
interactions, so we need to combine both approaches.
272 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 3
From Extensive to Normal Form Games
273 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 5
The Entry Game Revisited
Consider the simple entry game in extensive form we analyzed in Topic 6:
7, 10
a
e
B
r -3, 0
A
n
5, 20
274 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 6
The Entry Game – normal form
A\B a r
e 7, 10 −3, 0
n 5, 20 5, 20
A\B a r
e 7, 10 −3, 0
n 5, 20 5, 20
275 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 7
The Entry Game – normal form
Make sure you understand why (n, r ) is a Nash equilibrium.
7, 10
a
e
B
r -3, 0
A
n
5, 20
276 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 9
From normal to extensive form: Matching Pennies revisited
Consider again the Matching Pennies game:
· Two players, A and B, decide secretly (or simultaneously) whether to
Heads (H) or Tails (T ).
· If players’ decisions are the same, then A wins (payoffs are (1, −1)).
· Otherwise, B wins (payoffs are (−1, 1)).
A\B H T
H 1,-1 -1, 1
T -1, 1 1,-1
Does the following extensive form game describe the Matching Pennies game?
Since the game is played simultaneously, player B does not observe A’s action
and therefore cannot identify from which decision node he is moving.
· To indicate this, we draw a dotted line between B’s two decision nodes:
278 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 13
A Cournot Entry Game
(r , (n, a)), (r , (n, r )) and (a, (e, a)) are pure strategy Nash equilibria.
279 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 15
A Cournot Entry Game
5, 0
n
a 3, 1
B A
a r 0,-3
e
B
a 2, 0
r
A
r -2,-5
(r , (n, a)), (r , (n, r )) and (a, (e, a)) are pure strategy Nash equilibria.
280 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 17
Nash equilibria of the Cournot Entry Game
· Consider now the second stage of the game.
a 3, 1
A
a r 0,-3
B
a 2, 0
r
A
r -2, -5
· In short, in the first equilibrium of the ‘complete game’ (r , (n, a)), the
incumbent A is threatening with retaliation, but this threat is a
dominated strategy in the subgame!
· In other words, the first Nash equilibrium of the ‘complete game’ says
that the players make irrational decisions in a subgame (since they do not
play according to a NE of that subgame)!
· So, it can happen that some NE of the ‘complete game’ are not
reasonable.
We look for a procedure that can find more ’reasonable’ Nash equilibira...
281 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 19
Subgame Perfect Nash Equilibrium
Subgames
282 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 21
Examples of subgames
The following game has 4 subgames:
s ( -3, 1)
a B
t ( 5, 0)
A
n ( 4,-7)
r
a ( 3, 1)
B A
a r ( 1,-2)
e
B
a (-2, -1)
r
A
r (-3, -1)
Examples of subgames?
Are these subgames?
s ( -3, 1)
a B
t ( 5, 0)
A
n r ( 4,-7)
a ( 3, 1)
B A
a r ( 1,-2)
e
B
a (-2, -1)
r
A
r (-3, -1)
s ( -3, 1)
a B
t ( 5, 0)
A
n r ( 4,-7)
a ( 3, 1)
B A
a r ( 1,-2)
e
B
a (-2, -1)
r
A
283 / 339 r (-3, -1)
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 23
Subgame Perfect Nash Equilibrium
We extend the concept of backward induction equilibrium to games of imperfect
information to rule out non-credible threats as in the Cournot Entry Game.
· Instead of “choosing optimally at each decision node,” now we require
“choosing optimally at each information set,” and, as before,
“anticipating that others will be playing optimally too”.
284 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 25
Finding SPNE: An example
s -3, 1
a B
t 5, 0
A
n
r 4,-7 a 3, 1
B A
a r 1, 0
e
B
a -2, -1
r
A
r -1, -1
· Consider first the simultaneous move subgame and put it in normal form:
A\B a r
a 3, 1 −2, −1
r 1, 0 −1, −1
· The subgame has a unique Nash equilibrium with profile (a, a).
(Why is there no other equilibrium in mixed strategies?)
· So, we know that if the simultaneous move game is played, then the
players will obtain payoffs (3, 1).
· We indicate this by a small payoff vector just above player B’s second
decision node in the complete game.
285 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 27
Finding SPNE: An example
or equivalently
· We now continue the calculation of the SPNE by using the new tree:
-3, 1 s -3, 1
a B
4, -7
t 5, 0
A
n
3, 1 r 4,-7
B
e
3, 1
· From which we compute the rest of the SPNE strategies of the players.
286 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 29
Finding SPNE: An example
· Overall:
-3, 1 s -3, 1
a B
4, -7
t 5, 0
A
n r
3, 1 4,-7 a 3, 1
B A
3, 1
a r 1, 0
e
B
a -2, -1
r
A
r -1, -1
La Vanguardia, March 1, 2022, Eusebio Val, Putin olvida la doctrina MAD (loco)
https://www.lavanguardia.com/internacional/20220301/8089950/putin-olvida-doctrina-mad-loco.html
287 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 31
Mutually Assured Destruction (MAD)
Joint Strategic Capabilities Plan (US, 1961)
Answer to the question to the Joint Chiefs of Staff: “If your plans for general [nuclear]
war are carried out as planned, how many people will be killed in the Soviet Union and
China?” The answer was in the form of a graph.
from: The Prologue, p. 2, in: The Doomsday Machine: Confessions of a Nuclear War
Planner, 2017, by Daniel Ellsberg.
Four scenes:
· The war room – Russian ambassador – Hi Dimitri – The Doomsday Machine
from: Dr. Strangelove, or: How I learned to stop worrying and love the bomb (1964)
288 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 33
Mutually Assured Destruction (MAD)
Cuba 1962... Ukraine 2022, 2023
Note: The two choice nodes joined by the dashed line are referred to as an information set of 2 and are to be
treated as a single choice node for 2, denoting that 2 does not know what 1 chose (as in a simultaneous game).
289 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 35
Mutually Assured Destruction (MAD)
The atomic confrontation
· The simultaneous move game starting at player’s 1 second node can be
written as follows:
1\2 D R
D ∗
−K , −K ∗ −K , −K ∗
R −K ∗ , −K −1∗ , −1∗
· It has two pure strategy Nash equilibria:
· (D, D ) and (R, R ) with payoffs (−K , −K ) and (−1, −1).
· Now we have two “reduced games” (one for each Nash equilibrium):
B 2,-2 B 2,-2
E 2 E 2
1 1 A -K,-K
A -1,-1
I 0, 0 I 0, 0
· We see there are two subgame perfect equilibria:
· ((I , R ), (A, R )) with payoffs (0, 0)
· ((E , D ), (B, D )) with payoffs (2, −2).
1 1 A -K,-K
A -1,-1
I 0, 0 I 0, 0
· We see there are two subgame perfect equilibria:
· ((I , R ), (A, R )) with payoffs (0, 0)
· ((E , D ), (B, D )) with payoffs (2, −2).
· Are doomsday actions played?
290 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 36
Mutually Assured Destruction (MAD)
The atomic confrontation
· The simultaneous move game starting at player’s 1 second node can be
written as follows:
1\2 D R
D ∗
−K , −K ∗ −K , −K ∗
R −K ∗ , −K −1∗ , −1∗
· It has two pure strategy Nash equilibria:
· (D, D ) and (R, R ) with payoffs (−K , −K ) and (−1, −1).
· Now we have two “reduced games” (one for each Nash equilibrium):
B 2,-2 B 2,-2
E 2 E 2
1 1 A -K,-K
A -1,-1
I 0, 0 I 0, 0
· We see there are two subgame perfect equilibria:
· ((I , R ), (A, R )) with payoffs (0, 0)
· ((E , D ), (B, D )) with payoffs (2, −2).
· None of the equilibrium paths involve the doomsday action!
“In the real world of real political leaders—whether here or in the Soviet
Union—a decision that would bring even one hydrogen bomb on one city
of one’s own country would be recognized as a catastrophic blunder; ten
bombs on ten cities would be a disaster beyond human history; and a
hundred bombs on a hundred cities unthinkable.”
291 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 38
Mutually Assured Destruction (MAD)
How large should K be?
· After leaving the White House, McGeorge Bundy [ex-security advisor to
presidents J.F. Kennedy and L.B. Johnson] wrote in Foreign Affairs:
“In the real world of real political leaders—whether here or in the Soviet
Union—a decision that would bring even one hydrogen bomb on one city
of one’s own country would be recognized as a catastrophic blunder; ten
bombs on ten cities would be a disaster beyond human history; and a
hundred bombs on a hundred cities unthinkable.”
· In the last year of the Cold War, in a famous talk, Herbert York [nuclear
physicist] posed the question:
“How many nuclear weapons are needed to deter an adversary rational
enough to be deterred?”.
“In the real world of real political leaders—whether here or in the Soviet
Union—a decision that would bring even one hydrogen bomb on one city
of one’s own country would be recognized as a catastrophic blunder; ten
bombs on ten cities would be a disaster beyond human history; and a
hundred bombs on a hundred cities unthinkable.”
· In the last year of the Cold War, in a famous talk, Herbert York [nuclear
physicist] posed the question:
“How many nuclear weapons are needed to deter an adversary rational
enough to be deterred?”.
· In 1986, the US had 23,317 nuclear warheads and Russia had 40,159, for
a total of 63,836 weapons.
from: Chapter 9, p. 144 in: The Doomsday Machine: Confessions of a Nuclear War
Planner, 2017, by Daniel Ellsberg.
292 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 38
Mutually Assured Destruction (MAD)
How large should K be?
“Somewhere in the range of 1, 10, or 100... closer to 1 than to 100.”
294 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 42
Liar’s Poker with Perfect Information – open card poker
Player 1 is dealt the card face up so that everybody can see it.
o (20,-20)
2
A (10,-10)
b
1
2
1/
=
K (-10, 10)
A)
P(
o (-40, 40)
2
P(
A
K)
b (10,-10)
=
1/
2
1
K (-10, 10)
o (20,-20)
2
A (10,-10)
b
1
2
1/
)=
K (-10, 10)
A
P(
o (-40, 40)
2
P(
A
K)
b (10,-10)
=
1/
2
1
K (-10, 10)
295 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 44
Liar’s Poker with Imperfect Information – blind poker
Player 1 is dealt the card face down so that no player can see it.
o (20,-20)
2
A (10,-10)
b
1
2
1/
=
K (-10, 10)
A)
P(
o (-40, 40)
2
P(
A
K
(10,-10)
)=
b
1/
2
1
K (-10, 10)
Player 2 will:
· believe (b) if an Ace is dealt, object (o) if a King is dealt.
Player 1 will:
· bet (A) if he gets an Ace, pass (K ) if he gets a King.
The backward induction equilibrium (BIE) is:
· ((A, K ), (b, o )) with (expected) payoffs: (0, 0).
296 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 46
Solution to Blind Poker
Player 1 is dealt the card face down so that no player can see it. We cannot
solve it by backward induction since there are hidden actions.
o (20,-20)
2
A (10,-10)
b
1
2
1/
=
K (-10, 10)
A)
P(
o (-40, 40)
2
P(
A
K)
b (10,-10)
=
1/
2
1
K (-10, 10)
1\2 o b
A −10, 10 10, −10
K −10, 10 −10, 10
297 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 48
Solution to Blind Poker
1\2 o b
A −10, 10 10, −10
K −10, 10 −10, 10
Note: Here the Nash equilibria are also subgame perfect equilibria. But because the
game is zero-sum they are also part of the players’ security strategies. Hence −10 is
also the value of the game Blind Poker to player 1.
Solution to Poker
o (20,-20)
2
A (10,-10)
b
1
2
1/
=
K (-10, 10)
A)
P(
o (-40, 40)
2
P(
A
K)
b (10,-10)
=
1/
2
1
K (-10, 10)
Since the game is zero sum, it suffices to compute the payoff for one agent.
For example:
1 1
u1 ((A, A), o ) = 20 + (−40) = −10
2 2
1 1
u1 ((A, K ), o ) = 20 + (−10) = 5
2 2
1 1
u1 ((K , A), o ) = (−10) + (−40) = −25.
2 2
Solution to Poker
The game in normal form is:
1\2 o b
(A,A) −10, 10 10, −10
(A,K) 5, −5 0, 0
(K,A) −25, 25 0, 0
(K,K) −10, 10 −10, 10
Eliminate the strictly dominated strategies:
· (K , A) is strictly dominated by (A, A); and (K , K ) by (A, K ).
299 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 51
Solution to Poker
The game in normal form is:
1\2 o b
(A,A) −10, 10 10, −10
(A,K) 5, −5 0, 0
(K,A) −25, 25 0, 0
(K,K) −10, 10 −10, 10
Eliminate the strictly dominated strategies:
· (K , A) is strictly dominated by (A, A); and (K , K ) by (A, K ).
We are left with the 2 × 2 subgame: 1\2 o b
(A,A) −10, 10 10, −10
(A,K) 5, −5 0, 0
Solution to Poker
The game in normal form is:
1\2 o b
(A,A) −10, 10 10, −10
(A,K) 5, −5 0, 0
(K,A) −25, 25 0, 0
(K,K) −10, 10 −10, 10
Eliminate the strictly dominated strategies:
· (K , A) is strictly dominated by (A, A); and (K , K ) by (A, K ).
We are left with the 2 × 2 subgame: 1\2 o b
(A,A) −10, 10 10, −10
(A,K) 5, −5 0, 0
300 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 51
Solution to Poker
The game in normal form is:
1\2 o b
(A,A) −10, 10 10, −10
(A,K) 5, −5 0, 0
(K,A) −25, 25 0, 0
(K,K) −10, 10 −10, 10
Eliminate the strictly dominated strategies:
· (K , A) is strictly dominated by (A, A); and (K , K ) by (A, K ).
We are left with the 2 × 2 subgame: 1\2 o b
(A,A) −10, 10 10, −10
(A,K) 5, −5 0, 0
Important Lesson
301 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 52
Commitment
302 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 54
The entry investment game in extensive form
e -2,-2
2
e
n 3, 0
1
e 0, 3
n
2
n 0, 0
In0
e -2,-2
2
e
In1 n 3, 0
1 1
e -1, 3
n
2
n -1, 0
In3
e -2,-2
2
e
n 3, 0
1
e -3, 3
n
2
n -3, 0
1\2 e n
e −2, −2 3, 0
n 0, 3 0, 0
303 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 56
In1: Involvement and the entry game
1\2 e n
e −2, −2 3, 0
n −1, 3 −1, 0
1\2 e n
e 2, −2 3, 0
n −3, 3 −3, 0
304 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 7, Slide 58
Introduction to Game Theory
Topic 8. Repeated Games
-- not exam relevant --
Fabrizio Germano
Winter 2024
Figure: From social anthropology book on the origin and history of inequality
Recommended book: Bowles and Gintis (2012) A Cooperative Species: Human
Reciprocity and its Evolution, Princeton University Press.
But see also: Jean-Jacques Rousseau, David Hume,..., Friedrich Nietzsche, ..., Michel
Foucault etc., etc.
305 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 2
Key Terms in Repeated Games
Some general terms:
· Reciprocity – Cooperation
· Punishment
· Penal code – optimal penal code
· Norm
· Reputation
Suppose player 1 and 2 interact over two periods and play in both
periods the stage game:
1\2 X Y Z
G = A 4, 3 0, 0 1, 4
B 0, 0 2, 1 0, 0
Suppose the payoffs of the overall two-period game are the sums of
the stage game payoffs.
· For example, if players play (A, X ) in period 1 and (B, Y ) in
period 2, then payoffs are:
First Example
1\2 X Y Z
G = A 4, 3 0, 0 1, 4
B 0, 0 2, 1 0, 0
10
8
Payoff of player 2
0
0 2 4 6 8 10
Payoff of player 1
307 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 5
First Example
1\2 X Y Z
G = A 4, 3 0, 0 1, 4
B 0, 0 2, 1 0, 0
10
Payoff of player 2
6
0
0 2 4 6 8 10
Payoff of player 1
The above graph shows all the possible payoffs that can be reached with pure
actions in the two-period game.
· The red points can be supported as Nash equilibria of the stage game in
both periods. Nash equilibria of G are (A, Z ), (B, Y ).
First Example
1\2 X Y Z
G = A 4, 3 0, 0 1, 4
B 0, 0 2, 1 0, 0
10
8
Payoff of player 2
0
0 2 4 6 8 10
Payoff of player 1
The above graph shows all the possible payoffs that can be reached with pure
actions in the two-period game.
· The red points can be supported as Nash equilibria of the stage game in
both periods. Nash equilibria of G are (A, Z ), (B, Y ).
· The blue point does not involve playing a Nash equilibrium in both
periods but is nonetheless a subgame perfect equilibrium outcome.
308 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 5
First Example
Fix a finite stage game G that is repeated finitely many times.
Then any sequence of Nash equilibrium profiles of the stage game G can
be supported as the outcome of a Nash equilibrium and in fact of a
subgame perfect equilibrium of the repeated game.
· The red points of the example can be supported in this way, as
Nash equilibria of the stage game in both periods, thus forming a
subgame perfect equilibrium.
But are there Nash equilibria or even subgame perfect equilibria of the
repeated game that do not consist of playing Nash equilibrium profiles of
the stage game in each period?
· YES! This is true even with finite repetitions.
· The blue point (5, 7) in the previous figure is such an example.
· How can we support it as a subgame perfect equilibrium?
First Example
1\2 X Y Z
G = A 4, 3 0, 0 1, 4
B 0, 0 2, 1 0, 0
Consider the following repeated game strategy profile for both players:
· Period 1: Play (A, X )
· Period 2:
Player 1: As long as player 2 did not deviate from X in period 1, play A;
otherwise, play B.
Player 2: As long as player 2 did not deviate from X in period 1, play Z ;
otherwise, play Y .
In equilibrium this leads to the following actions chosen: (A, X ) in period
1 and (A, Z ) in period 2 with payoffs of
Both are Nash equilibria of the last period stage game and so no
player has an incentive to deviate from the period 2 action.
First Example
1\2 X Y Z
G = A 4, 3 0, 0 1, 4
B 0, 0 2, 1 0, 0
Given the above strategy, the period 2 continuation strategies are given
by the following matrix (as a function of what was played in period 1):
1\2 X Y Z
A (A, Z ) (B, Y ) (B, Y )
B (A, Z ) (B, Y ) (B, Y )
from which we can compute the period 2 continuation payoffs (again as
a function of what was played in period 1):
1\2 X Y Z
A 5, 7 2, 1 3, 5
B 1, 4 4, 2 2, 1
1\2 C D
G = C 3, 3 0, 4
D 4, 0 1, 1
311 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 11
Second Example: Finitely Repeated Prisoner’s Dilemma
Consider a repeated prisoner’s dilemma game with stage game:
1\2 C D
G = C 3, 3 0, 4
D 4, 0 1, 1
Proposition
If the stage game of a finitely repeated game has a unique Nash
equilibrium, then the finitely repeated game has a unique subgame
perfect equilibrium where each player always plans to use the
unique Nash equilibrium action.
312 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 12
Finitely Repeated Games
Proposition
If the stage game of a finitely repeated game has a unique Nash
equilibrium, then the finitely repeated game has a unique subgame
perfect equilibrium where each player always plans to use the
unique Nash equilibrium action.
Preliminaries
GT ,δ = (G , T , δ)
where:
· G is a the stage game defined by:
· N the set of players
· Ai the set of actions of player i
· ui the utility or utility function of player i.
· T * N is the number of consecutive repetitions of G .
· δ * (0, 1] is a common discount factor.
We assume N and each Ai are finite and players know G , T , δ.
When δ < 1 players maximize discounted payoffs: Ui = ∑T t 21 u (at ).
t =1 δ i
1
When δ = 1 players maximize average payoffs: Ui = T ∑T t
t = 1 ui ( a ) .
313 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 13
Preliminaries
As in extensive form games, pure strategies in repeated games
are contingent plans that specify, for each player, an action
ai,t * Ai , for every period of repetition, and for every possible
history of actions up to t 2 1, for every t = 1, 2, . . . , T .
Definition
A pure strategy of player i in the game GT ,δ , si = (si,t )T
t =1 , is
defined, for every t = 1, 2, . . . , T , by a map:
si,t : H t 21 ³ Ai ,
where
Preliminaries
δ + δ2 + δ3 + . . . = δ(1 + δ + δ2 + . . .) = δv
1
ó v = 1 + δv ñó v = .
12δ
Hence, for any a * R:
a
a + a · δ + a · δ2 + . . . = a · v = .
12δ
We use this repeatedly when evaluating deviations and continuation
payoffs in infinitely repeated games.
314 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 15
Infinitely Repeated Prisoner’s Dilemma
1\2 C D
G = C 3, 3 0, 4
D 4, 0 1, 1
315 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 17
Infinitely Repeated Prisoner’s Dilemma
So consider the infinitely repeated prisoner’s dilemma G∞,δ with stage
game G as above.
In general, a player’s strategy is a full description of an action to take at
every information of the player.
In an infinitely repeated game there is an information set for every period
t and for every different history ht 21 = (a1 , a2 , . . . , at 21 ), for t ³ ∞.
Fortunately, many interesting equilibria can be described by a few types
of simple strategies.
· One example is a strategy that prescribes a Nash equilibrium action
of the stage game for each contingency.
· This also constitutes a subgame perfect equilibrium of the game.
· In the repeated prisoner’s dilemma it yields the profile (D, D ) being
played in each period.
· But can the players do better?
1\2 C D
G = C 3, 3 0, 4
D 4, 0 1, 1
3
C : 3 + 3 · δ + 3 · δ2 + . . . =
12δ
δ
D: 4 + 1 · δ + 1 · δ2 + . . . = 4 +
12δ
Note: 4 is the period t deviation payoff of playing D instead of C .
Hence, to sustain players playing the cooperative profile (C , C ), we need
that:
3 δ 1
g 4+ ñó δ g .
12δ 12δ 3
317 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 20
Infinitely Repeated Prisoner’s Dilemma – automata
An automaton is an idealized computing machine. A finite automaton can
keep track of a finite number of states.
Each circle represents a possible state of the machine. The letter inside
represents the action the machine will take in that state. The arrows represent
transitions pointing to the states or actions to be taken in the next period in
response to actions taken by the opponent in the current period. Each machine
has an arrow that comes from nowhere to indicate the machine’s initial state.
Machines that start with C are said to be nice, and nasty otherwise.
318 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 22
Infinitely Repeated Prisoner’s Dilemma – automata
All 1-state and 2-state automata for the infinitely repeated prisoner’s dilemma.
319 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 24
Infinitely Repeated Prisoner’s Dilemma – automata
To see this, consider the following version of the Prisoner’s Dilemma:
1\2 C D
G = C 2, 2 21, 3
D 3, 21 0, 0
And compute payoff histories of two automata playing agains each other.
Period 1 2 3 4 5 6 7 8 9 ···
Payoff 21 0 3 21 0 3 21 0 3 ···
Tit-for-Tat C D D C D D C D D ···
Tat-for-Tit D D C D D C D D C ···
Payoff 3 0 21 3 0 21 3 0 21 ···
320 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 26
Infinitely Repeated Prisoner’s Dilemma – automata
Table: Tit-for-Tat vs. Tat-for-Tit
Period 1 2 3 4 5 6 7 8 9 ···
Payoff 21 0 3 21 0 3 21 0 3 ···
Tit-for-Tat C D D C D D C D D ···
Tat-for-Tit D D C D D C D D C ···
Payoff 3 0 21 3 0 21 3 0 21 ···
Period 1 2 3 4 5 6 7 8 9 ···
Payoff 0 2 2 2 2 2 2 2 2 ···
Tat-for-Tit D C C C C C C C C ···
Tat-for-Tit D C C C C C C C C ···
Payoff 0 2 2 2 2 2 2 2 2 ···
321 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 27
Infinitely Repeated Prisoner’s Dilemma – automata
Period 1 2 3 4 5 6 7 8 9 ···
Payoff 0 2 2 2 2 2 2 2 2 ···
Tat-for-Tit D C C C C C C C C ···
Tat-for-Tit D C C C C C C C C ···
Payoff 0 2 2 2 2 2 2 2 2 ···
Folk Theorems
323 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 30
Folk Theorems
Note: Game theorists refer to these results as folk theorems because they were
thought to be part of the profession’s wisdom before various versions were
formally proved.
Folk Theorem A
What stage game payoffs can be sustained as equilibria of infinitely
repeated games when players are sufficiently patient (δ ³ 1)?
324 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 31
Folk Theorem A
What stage game payoffs can be sustained as equilibria of infinitely
repeated games when players are sufficiently patient (δ ³ 1)?
Definition
A payoff profile u = (u1 , u2 , . . . , un ) * Rn is feasible for G if there
exist nonnegative weights:
(νa )a*A g 0 with ∑ νa = 1
a *A
Folk Theorem A
325 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 32
Folk Theorem A – infinitely repeated prisoner’s dilemma
Consider again the infinitely repeated prisoner’s dilemma game G∞,δ with
stage game:
1\2 C D
G = C 3, 3 0, 4
D 4, 0 1, 1
4
Payoff of player 2
0
0 1 2 3 4 5
Payoff of player 1
1\2 X Y Z
G = A 4, 3 0, 0 1, 4
B 0, 0 2, 1 0, 0
4
Payoff of player 2
0
0 1 2 3 4 5
Payoff of player 1
1\2 X Y Z
G = A 4, 3 0, 0 1, 4
B 0, 0 2, 1 0, 0
Payoff of player 2 4
0
0 1 2 3 4 5
Payoff of player 1
327 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 36
Folk Theorem A – infinitely repeated trust game
Consider the following version of the trust game:
3
Payoff of player 2
0
0.0 0.5 1.0 1.5 2.0
Payoff of player 1
328 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 36
Folk Theorem B
Recall: v i = min max ui (αi , α2i )(g max min ui (αi , α2i ) = v i ).
α 2i αi αi α 2i
Note: the minmax value (v i ) is the worst payoff the opponents can
force i to accept (that is, allowing i to react to the opponents’ actions.)
Definition
A payoff profile u = (u1 , u2 , . . . , un ) * Rn is enforceable (or
individually rational) for G if for all i * N:
ui g v i .
It is strictly enforceable if ui > v i for all i.
Folk Theorem B
329 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 38
Folk Theorem B – infinitely repeated prisoner’s dilemma
Consider again the infinitely repeated prisoner’s dilemma game G∞,δ with stage
game:
1\2 C D
G = C 3, 3 0, 4
D 4, 0 1, 1
Payoff of player 2
3
0
0 1 2 3 4 5
Payoff of player 1
4
Payoff of player 2
0
0 1 2 3 4 5
Payoff of player 1
330 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 40
Folk Theorem B – infinitely repeated introductory example
Consider the infinitely repeated game G∞,δ with the stage game of the
introductory example:
1\2 X Y Z
G = A 4, 3 0, 0 1, 4
B 0, 0 2, 1 0, 0
Payoff of player 2 3
0
0 1 2 3 4 5
Payoff of player 1
W2 (σ2 ) = min{u2 (A, (σ21 , 1 2 σ21 )), u2 (B, (σ21 , 1 2 σ21 ))}
= min{2σ21 + 6(1 2 σ21 ), 4σ21 + 0(1 2 σ21 )}
3
4σ21 if σ21 f
1 1 4
= min{6 2 4σ2 , 4σ2 } =
6 2 4σ21 else
· What is the best guaranteed payoff player i can obtain?
v i = max Wi (σi ) = max min ui (σi , sj ) = max min ui (σi , σj ).
σi *Σi σi *Σi sj *Sj σi *Σi σj *Σj
331 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 41
Recall Topic 1: Zero-Sum Games – maxminimization
u1 u2
6 6
5 5
4 u1(Ã1,X) 4 u2(A,Ã2)
3
u1(Ã1,Y) 3
u2(B,Ã2)
W1(Ã1) W2(Ã2)
2 2
1 1
Ã1A Ã2X
0.2 0.4 0.6 0.8 1.0 0.2 0.4 0.6 0.8 1.0
Figure: Game G : The functions W1 (left) and W2 (right) (these are the
minimum of the red and blue lines) and player 1 and 2’s security strategies
(achieved at the maximum of W1 at σ11 = 21 at which player 1’s payoff is
v 1 = 3 for player 1 and achieved at the maximum of W2 at σ12 = 34 at which
player 2’s payoff is v 2 = 3 for player 2.
where
Bi (σj ) = max ui (si , σj ).
si * Si
Note:
332 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 43
Recall Topic 1: Zero-Sum Games – minmaximization
1\2 X Y
σ11 = prob. 1 plays A, σ12 = 1 2 σ11 = prob. 1 plays B
A 4, 2 0, 6 σ21 = prob. 2 plays X , σ22 = 1 2 σ21 = prob. 2 plays Y
B 2, 4 6, 0
· The worst payoffs for players 1 and 2 are given by:
B1 (σ2 ) = max{u1 (A, (σ21 , 1 2 σ21 )), u2 (B, (σ21 , 1 2 σ21 ))}
= max{4σ21 + 0(1 2 σ21 ), 2σ21 + 6(1 2 σ21 )}
3
6 2 4σ21 if σ21 f
= max{4σ21 , 6 2 4σ21 } = 4
4σ21 else
u1 u2
6 6
5 5
4 u1(A,Ã2) 4 u2(Ã1,X)
3 u1(B,Ã2) 3 u2(Ã1,Y)
B1(Ã2) B2(Ã1)
2 2
1 1
Ã2X Ã1A
0.2 0.4 0.6 0.8 1.0 0.2 0.4 0.6 0.8 1.0
Figure: Game G : The functions B1 (left) and B2 (right) (these are the
maximum of the red and blue lines) and player 2 and 1’s strategies of forcing
the lowest acceptable payoffs (achieved at the minimum of B1 at σ12 = 43 at
which player 1’s payoff is v 1 = 3 for player 1 and achieved at the minimum of
B2 at σ11 = 21 at which player 2’s payoff is v 2 = 3 for player 2.
333 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 45
Theorem B – infinitely repeated entry game
Chain store paradox
1\2 X Y
A 2, 2 0, 0
B 1, 5 1, 5
· The best payoffs for players 1 and 2 are given by B1 (σ2 ) and B2 (σ1 ).
· What is the lowest payoff player i can be forced to accept?
u1 u2
2.0 5
4
1.5
u1(A,Ã2) u2(Ã1,X)
3
u1(B,Ã2) u2(Ã1,Y)
1.0
B1(Ã2) 2 B2(Ã1)
0.5
1
Ã2X Ã1A
0.2 0.4 0.6 0.8 1.0 0.2 0.4 0.6 0.8 1.0
Figure: Entry game: The functions B1 (left) and B2 (right) (these are the maximum
of the red and blue lines) and player 2 and 1’s strategies of forcing the lowest
acceptable payoffs (achieved at the minimum of B1 at 0 f σ12 f 21 (maximum
probability on Acquiesce is 1/2) at which player 1’s payoff is v 1 = 1 for player 1 and
achieved at the minimum of B2 at σ11 = 1 (probability 1 on In) at which player 2’s
payoff is v 2 = 2 for player 2.
334 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 47
Theorem B – infinitely repeated entry game
Chain store paradox
1\2 X Y
A 2, 2 0, 0
B 1, 5 1, 5
1\2 X Y
A 2, 2 0, 0
B 1, 5 1, 5
0
0.0 0.5 1.0 1.5 2.0
Payoff of player 1 (Entrant)
1. Trust game with existing legal institution that enforces payment (at cost to 2).
336 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 50
Reputation and infinitely repeated trust game(s)
Institutions, brand names, long-run and short-run players
2. Trust game with a law merchant (player 3) who provides a service at fee 0.1 by
collecting a bond from player 2 and using it to pay player 1 if player 2 does not pay.
The SPE does not involve using the law merchant and yields the outcome (1, 0, 0).
· At the SPE, the law merchant keeps the bond (does not fulfil agreement).
· No exchange takes place in the one shot game.
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 51
337 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 52
Reputation and infinitely repeated trust game(s)
Institutions, brand names, long-run and short-run players
3. Trust game with a law merchant (player 3) who provides a service at fee 0.1 by
collecting a bond from player 2 and using it to pay player 1 if player 2 does not pay.
The infinitely repeated version delivers (2, 1.9, 0.1) as a SPE outcome, provided the
law merchant’s discount rate satisfies δ3 > 0.96. (Why?)
· This works even if players 1 and 2 are short-run players (P1t , P2t ).
· Law merchant is long-run player (P3 ) who has incentive to build a reputation.
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 52
The infinitely repeated variant delivers (2, 1.9, 0.1) as a SPE outcome, provided the
law merchant’s discount rate satisfies δ3 > 0.96. (Why?) Compute 10.1 2δ3 > 2.5.
t t
· This works even if players 1 and 2 are short-run players (P1 , P2 ).
· Law merchant is long-run player (P3 ) who has incentive to build a reputation.
338 / 339
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 53
Reputation and infinitely repeated trust game(s)
Institutions, brand names, long-run and short-run players
4. Trust game with brand names. Both players 1 and 2 are short-run players (P1t , P2t ).
At end of each period t, player P2t has option of selling a brand name to player P2t +1 .
The price of the brand name in t is 3 (any price > 2 works) and maintains its value as
long as the player P2t pays. Otherwise, the brand name is worth 0.
Again, there is a SPE of the infinitely repeated variant that yields the outcome (2, 2).
· The possibility of selling the brand name allows the exchange to take place.
· The brand name allows to establish a reputation that payment will be made.
UPF – Introduction to Game Theory, Winter 2024 Topic 8, Slide 54
339 / 339