can play perfectly by looking up the right move in this table. The table is constructed by
retrograde minimax search: start by considering all ways to place the KBNK pieces on the
board. Some of the positions are wins for White; mark them as such. Then reverse the rules of
chess to make reverse moves rather than forward moves. Any move by White that, no matter what
move Black responds with, ends up in a position marked as a win, must also be a win. Continue
this search until all possible positions are resolved as win, loss, or draw, and you have an
infallible lookup table for all endgames with those pieces. This has been done not only for
KBNK endings, but for all endings with seven or fewer pieces. The tables contain 400 trillion
positions. An eight-piece table would require 40 quadrillion positions.
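
As a rough illustration, here is a minimal Python sketch of retrograde analysis. The helper names (all_positions, successors, terminal_result) and the white_to_move attribute are hypothetical placeholders for game-specific code, not part of any published tablebase generator, and the sketch only propagates wins for White, treating everything else as a draw.

    def build_tablebase(all_positions, successors, terminal_result):
        """Label positions WIN, LOSS, or DRAW from White's point of view."""
        value = {}  # position -> "WIN", "LOSS", or "DRAW"

        # Step 1: mark terminal positions directly from the rules of the game.
        for p in all_positions:
            r = terminal_result(p)   # "WIN", "LOSS", "DRAW", or None if non-terminal
            if r is not None:
                value[p] = r

        # Step 2: sweep backward, labeling positions whose successors are already
        # labeled, until no new labels appear (the retrograde step).
        changed = True
        while changed:
            changed = False
            for p in all_positions:
                if p in value:
                    continue
                succ = [value.get(s) for s in successors(p)]
                if p.white_to_move and "WIN" in succ:
                    value[p] = "WIN"       # White can move to a marked win
                elif not p.white_to_move and succ and all(v == "WIN" for v in succ):
                    value[p] = "WIN"       # every Black reply leads to a marked win
                else:
                    continue
                changed = True

        # Simplification: anything never resolved as a win is recorded as a draw.
        for p in all_positions:
            value.setdefault(p, "DRAW")
        return value
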

5.4 Monte Carlo Tree Search


The game of Go illustrates two major weaknesses of heuristic alpha–beta tree search: First,
Go has a branching factor that starts at 361, which means alpha–beta search would be limited
to only 4 or 5 ply. Second, it is difficult to define a good evaluation function for Go because
material value is not a strong indicator and most positions are in flux until the endgame. In
response to these two challenges, modern Go programs have abandoned alpha–beta search
and instead use a strategy called Monte Carlo tree search (MCTS).3
The basic MCTS strategy does not use a heuristic evaluation function. Instead, the value
of a state is estimated as the average utility over a number of simulations of complete games
starting from the state. A simulation (also called a playout or rollout) chooses moves first for
one player, then for the other, repeating until a terminal position is reached. At that point the
rules of the game (not fallible heuristics) determine who has won or lost, and by what score.
For games in which the only outcomes are a win or a loss, “average utility” is the same as
“win percentage.”
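
The following is a minimal Python sketch of a single playout and the resulting value estimate. The game interface assumed here (legal_moves, result, is_terminal, utility) is hypothetical and stands in for whatever game representation is actually used.

    import random

    def playout(game, state, player, policy=None):
        """Simulate one complete game from state; return its utility for player."""
        while not game.is_terminal(state):
            moves = game.legal_moves(state)
            # With no playout policy, fall back to uniformly random moves.
            move = policy(state, moves) if policy else random.choice(moves)
            state = game.result(state, move)
        # The rules of the game, not a heuristic, decide the outcome.
        return game.utility(state, player)

    def estimate_value(game, state, player, n=100):
        """Average utility over n playouts; equals win percentage for win/loss games."""
        return sum(playout(game, state, player) for _ in range(n)) / n
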
How do we choose what moves to make during the playout? If we just choose randomly,
then after multiple simulations we get an answer to the question “what is the best move if
both players play randomly?” For some simple games, that happens to be the same answer
as “what is the best move if both players play well?,” but for most games it is not. To get
useful information from the playout we need a playout policy that biases the moves towards
good ones. For Go and other games, playout policies have been successfully learned from
self-play by using neural networks. Sometimes game-specific heuristics are used, such as
“consider capture moves” in chess or “take the corner square” in Othello.
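
As one way such a bias could be implemented, the sketch below turns any game-specific scoring heuristic into a playout policy by sampling moves in proportion to their scores. The heuristic_score function is a hypothetical placeholder for logic such as preferring captures or corner squares.

    import random

    def biased_playout_policy(heuristic_score, temperature=1.0):
        def policy(state, moves):
            # Weight each legal move by its heuristic score and sample, so
            # better-looking moves are chosen more often but not exclusively.
            weights = [max(heuristic_score(state, m), 1e-6) ** (1.0 / temperature)
                       for m in moves]
            return random.choices(moves, weights=weights, k=1)[0]
        return policy
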
Given a playout policy, we next need to decide two things: from what positions do we
start the playouts, and how many playouts do we allocate to each position? The simplest
answer, called pure Monte Carlo search, is to do N simulations starting from the current
state of the game, and track which of the possible moves from the current position has the
highest win percentage.
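
A minimal sketch of pure Monte Carlo search, reusing the hypothetical game interface and the playout function sketched above: the N simulations are spread across the legal moves, and the move with the highest average result is returned.

    from collections import defaultdict

    def pure_monte_carlo_search(game, state, player, N=1000, policy=None):
        moves = game.legal_moves(state)
        total = defaultdict(float)
        plays = defaultdict(int)
        for i in range(N):
            move = moves[i % len(moves)]          # spread playouts evenly over moves
            next_state = game.result(state, move)
            total[move] += playout(game, next_state, player, policy)
            plays[move] += 1
        # Pick the move with the best average utility (win percentage).
        return max(moves, key=lambda m: total[m] / plays[m])
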
For some stochastic games this converges to optimal play as N increases, but for most
games it is not sufficient—we need a selection policy that selectively focuses the computational
resources on the important parts of the game tree. It balances two factors: exploration
of states that have had few playouts, and exploitation of states that have done well in past
playouts, to get a more accurate estimate of their value. (See Section 17.3 for more on the
exploration/exploitation tradeoff.)
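
One standard way to balance these two factors is the UCB1 formula, which adds an exploration bonus to each move's average utility. The sketch below assumes hypothetical node fields (children, playouts, total_utility); they are illustrative, not a prescribed data structure.

    import math

    def ucb1_select(node, C=math.sqrt(2)):
        """Choose the child maximizing average utility plus an exploration bonus."""
        def ucb1(child):
            if child.playouts == 0:
                return float("inf")               # always try unexplored children first
            exploit = child.total_utility / child.playouts
            explore = C * math.sqrt(math.log(node.playouts) / child.playouts)
            return exploit + explore
        return max(node.children, key=ucb1)
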
3 “Monte Carlo” algorithms are randomized algorithms named after the Casino de Monte-Carlo in Monaco.
