Professional Documents
Culture Documents
Michael Johanson - Evaluating State-Space Abstractions in Extensive-Form Games
Michael Johanson - Evaluating State-Space Abstractions in Extensive-Form Games
Extensive-Form Games
Michael Johanson and Neil Burch and Richard Valenzano and Michael Bowling
University of Alberta
Edmonton, Alberta
{johanson,nburch,valenzan,mbowling}@ualberta.ca
Unsuited
7 1 1 1 2 3 7 3 3 5 5 5 6 7
height of each bar indicates the probability of the remaining 8 1 1 2 2 3 3 8 3 5 5 5 6 7
9 2 2 2 2 3 3 3 8 5 5 5 7 7
public cards resulting in that hand strength, and the vertical T 2 2 2 2 3 3 3 5 8 5 7 7 7
black line and label shows E[HS]. J
Q
2
4
2
4
4
4
4
4
4
4
4
4
5
5
5
5
5
5
8
5
7
8
7
7
7
7
Note that the top and bottom histograms have different K 4 4 4 6 6 6 6 6 7 7 7 8 7
A 6 6 6 6 6 6 6 7 7 7 7 7 8
distribution shapes: 4♠4♥ and 6♠6♥ have most of their
weight near their E[HS] values, while T ♠J♠ and Q♠K♠ Table 1: Eight hand clusters used for the OCHS features.
have almost no weight near E[HS] as the unrevealed cards
will make this hand either strong or weak. This difference
1.0
is an indication that the top and bottom rows are strategi- 4s4h 6s6h
0.702
0.635
0.496
0.614
0.482
0.558
0.476
0.180
0.799
0.691
0.576
0.669
0.498
0.639
0.498
0.183
0.2
umn, whereas merging along each row may be better.
0
This suggests the use of a distribution-aware similar- TsJs QsKs
ity metric such as earth mover’s distance [20] to compare 0.8
0.634
two hand strength distributions. Earth mover’s distance 0.575
0.6
measures the minimum work required to change one dis- 0.4
tribution into another by moving probability mass. In
0.659
0.712
0.674
0.549
0.548
0.465
0.387
0.318
0.646
0.676
0.638
0.724
0.682
0.549
0.497
0.389
0.2
one-dimensional discrete distributions such as these hand
strength distributions, it can be efficiently computed with 0
1 2 3 4 5 6 7 8 1 2 3 4 5 6 7 8
Opponent hand clusters
a single pass over the histogram bars. Unlike alternative
distance metrics such as L2 or Kolmogorov-Smirnov, earth OCHS L2 Distance E[HS] Distance
4s4h 6s6h TsJs QsKs 4s4h 6s6h TsJs QsKs
mover’s distance measures not only the difference in prob- 4s4h 0.066 0.095 0.104 0.063 0.005 0.064
6s6h 0.066 0.117 0.105 0.063 0.015 0.001
ability mass, but also how far that mass was moved. In TsJs 0.095 0.117 0.098 0.005 0.015 0.059
Figure 2(bottom), the earth mover’s distance and difference QsKs 0.104 0.105 0.098 0.064 0.001 0.059
Table 2: Computing the number of information sets in three nearly equally sized Texas hold’em abstractions.
Table 3: Average performance in games between abstract strategies generated by Public Chance Sampled CFR. Results are
in milli-big-blinds/game (mbb/g) over a 10 million hand duplicate match with a 95% confidence interval of 1.1 mbb/g.
CFR CFR-BR
PR IR PR IR PR-PHS-PHS
PHS-PHS 288.942 358.188 89.632 94.841 IR-PHS-PHS
Exploitability (mbb/g)
PHS-KO 289.152 318.197 90.371 85.275 PR-KE-KO
KE-PHS 281.63 339.872 90.720 80.557 IR-KE-KO
KE-KO 282.856 282.395 84.039 64.820
102
KO-PHS 335.902 355.881 105.389 88.546
KO-KO 330.319 291.574 105.523 73.091