AlphaGo Tutorial Slides

Uploaded by

Daniel Andrés Crespo

Why is Go hard for computers to play?

Game tree complexity = b^d

Brute force search intractable:

1. Search space is huge

2. Impossible for computers to evaluate who is winning
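
To give a feel for the size of b^d, the arithmetic can be sketched as below. The breadth and depth figures for chess and Go are commonly quoted approximations and are an assumption here; they do not appear on the slide.

```python
import math

def log10_tree_size(breadth: int, depth: int) -> float:
    """Return log10(b^d), the order of magnitude of the game tree size."""
    return depth * math.log10(breadth)

# Approximate breadth (moves per position) and depth (moves per game).
# These numbers are assumptions for illustration, not taken from the slides.
games = {"chess": (35, 80), "Go": (250, 150)}

for name, (b, d) in games.items():
    exponent = log10_tree_size(b, d)
    print(f"{name}: b={b}, d={d}, b^d is roughly 10^{exponent:.1f}")
```

Under these assumptions the Go tree is larger than the chess tree by more than two hundred orders of magnitude, which is why enumerating it is not an option.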

Convolutional neural network

Value network: Position s -> Evaluation v(s)

Policy network: Position s -> Move probabilities p(a|s)

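
The two interfaces can be sketched as plain functions. This is only an illustration of the inputs and outputs: the weights `W_policy` and `w_value`, the board encoding, and the tanh range of the value are all assumptions, and a single linear layer stands in for the convolutional networks.

```python
import numpy as np

BOARD = 19
N_MOVES = BOARD * BOARD  # one candidate move per intersection (passing ignored)

rng = np.random.default_rng(0)
# Hypothetical stand-in weights; the real networks are deep convolutional nets.
W_policy = rng.normal(scale=0.01, size=(N_MOVES, N_MOVES))
w_value = rng.normal(scale=0.01, size=N_MOVES)

def policy(position: np.ndarray) -> np.ndarray:
    """p(a|s): a probability for every move, zero on occupied points."""
    s = position.reshape(-1)
    logits = W_policy @ s
    logits = np.where(s == 0, logits, -np.inf)  # mask occupied intersections
    logits = logits - logits.max()              # stabilise the softmax
    probs = np.exp(logits)
    return probs / probs.sum()

def value(position: np.ndarray) -> float:
    """v(s): a single scalar evaluation, squashed into [-1, 1]."""
    return float(np.tanh(w_value @ position.reshape(-1)))

# Assumed encoding: +1 black stone, -1 white stone, 0 empty.
position = np.zeros((BOARD, BOARD))
position[3, 3], position[15, 15] = 1.0, -1.0

p = policy(position)
v = value(position)
print(p.shape, round(float(p.sum()), 6), v)
```

The point of the sketch is the shape of the outputs: the policy returns one number per move that sums to one, while the value returns a single number per position.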

Neural network training pipeline

Human expert positions -> Supervised Learning policy network -> Reinforcement Learning policy network -> Self-play data -> Value network

Supervised learning of policy networks
Policy network: 12-layer convolutional neural network

Training data: 30M positions from human expert games (KGS 5+ dan)

Training algorithm: maximise likelihood by stochastic gradient descent

Training time: 4 weeks on 50 GPUs using Google Cloud

Results: 57% accuracy on held-out test data (state of the art was 44%)
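
A minimal sketch of "maximise likelihood by stochastic gradient descent" follows. It assumes a toy linear softmax policy and synthetic expert moves in place of the 12-layer network and the KGS positions; all names and sizes are hypothetical.

```python
import numpy as np

rng = np.random.default_rng(0)
N_FEATURES, N_MOVES = 8, 5  # toy sizes, far smaller than a Go board

def softmax(z):
    z = z - z.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

# Synthetic "expert" data: a hidden linear rule picks the move for each state.
hidden_W = rng.normal(size=(N_MOVES, N_FEATURES))
states = rng.normal(size=(2000, N_FEATURES))
expert_moves = (states @ hidden_W.T).argmax(axis=1)
train_s, train_a = states[:1500], expert_moves[:1500]
test_s, test_a = states[1500:], expert_moves[1500:]  # held-out positions

def mean_log_likelihood(W, s, a):
    probs = softmax(s @ W.T)
    return float(np.log(probs[np.arange(len(s)), a]).mean())

W = np.zeros((N_MOVES, N_FEATURES))
ll_before = mean_log_likelihood(W, train_s, train_a)

learning_rate, batch_size = 0.5, 32
for step in range(1000):
    idx = rng.integers(0, len(train_s), size=batch_size)
    s, a = train_s[idx], train_a[idx]
    probs = softmax(s @ W.T)                     # p(a|s) for the minibatch
    one_hot = np.eye(N_MOVES)[a]
    grad = (one_hot - probs).T @ s / batch_size  # gradient of mean log p(a|s)
    W += learning_rate * grad                    # step uphill on the likelihood

ll_after = mean_log_likelihood(W, train_s, train_a)
test_accuracy = float((softmax(test_s @ W.T).argmax(axis=1) == test_a).mean())
print(ll_before, ll_after, test_accuracy)
```

The accuracy printed here is the fraction of held-out positions where the most probable move matches the expert move, which is the same kind of measurement as the 57% figure above.
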
Reinforcement learning of policy networks
Policy network: 12-layer convolutional neural network

Training data: games of self-play between policy networks

Training algorithm: maximise wins z by policy gradient reinforcement learning

Training time: 1 week on 50 GPUs using Google Cloud

Results: wins 80% of games against the supervised learning policy network. Raw network plays at roughly amateur 3 dan.
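
The policy gradient idea can be sketched with a REINFORCE-style update on a toy one-move game. This is an assumption-laden illustration: the win probabilities, the three-move action space and the learning rate are hypothetical, and no opponent or board is modelled.

```python
import numpy as np

rng = np.random.default_rng(0)
win_prob = np.array([0.2, 0.5, 0.8])  # hypothetical chance that each move wins
theta = np.zeros(3)                   # policy parameters (one logit per move)

def policy(theta: np.ndarray) -> np.ndarray:
    e = np.exp(theta - theta.max())
    return e / e.sum()

learning_rate = 0.1
for game in range(5000):
    p = policy(theta)
    a = rng.choice(3, p=p)                           # sample a move from p(a|s)
    z = 1.0 if rng.random() < win_prob[a] else -1.0  # outcome: +1 win, -1 loss
    grad_log_p = -p                                  # d log p(a) / d theta ...
    grad_log_p[a] += 1.0                             # ... equals one_hot(a) - p
    theta += learning_rate * z * grad_log_p          # reinforce moves that won

final = policy(theta)
print(final)
```

Moves that lead to wins have their log-probability pushed up and moves that lead to losses have it pushed down, so over many games the probability mass should shift toward the move with the best win rate.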

Reinforcement learning of value networks
Value network: 12-layer convolutional neural network

Training data: 30 million games of self-play

Training algorithm: minimise MSE by stochastic gradient descent

Training time: 1 week on 50 GPUs using Google Cloud

Results: First strong position evaluation function - previously thought impossible
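
A minimal sketch of "minimise MSE by stochastic gradient descent" for a value function is shown below. It assumes a toy model v(s) = tanh(w · s) and synthetic game outcomes; the feature size, data generator and step size are hypothetical stand-ins for the network and the self-play games.

```python
import numpy as np

rng = np.random.default_rng(0)
N_FEATURES = 8

# Synthetic self-play data: each state has a hidden win probability,
# and the recorded outcome z is +1 (win) or -1 (loss).
hidden = rng.normal(size=N_FEATURES)
states = rng.normal(size=(5000, N_FEATURES))
p_win = 1.0 / (1.0 + np.exp(-(states @ hidden)))
outcomes = np.where(rng.random(len(states)) < p_win, 1.0, -1.0)

def value(w, s):
    return np.tanh(s @ w)

def mse(w):
    return float(np.mean((value(w, states) - outcomes) ** 2))

w = np.zeros(N_FEATURES)
mse_before = mse(w)

learning_rate, batch_size = 0.05, 32
for step in range(2000):
    idx = rng.integers(0, len(states), size=batch_size)
    s, z = states[idx], outcomes[idx]
    v = value(w, s)
    # Gradient of mean (v - z)^2 with v = tanh(s @ w); d tanh(x)/dx = 1 - v^2.
    grad = 2.0 * ((v - z) * (1.0 - v ** 2)) @ s / batch_size
    w -= learning_rate * grad

mse_after = mse(w)
print(mse_before, mse_after)
```

Because the targets are noisy win/loss outcomes, the error cannot reach zero; the regression instead drives v(s) toward the expected outcome from each state.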

Exhaustive search
Reducing depth with value network
Reducing breadth with policy network
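
The two reductions can be sketched on a toy counting game (take 1 to 3 counters, whoever takes the last one wins). This uses a depth-limited negamax, which is a swapped-in simplification and not AlphaGo's actual search; `value_fn` and `policy_fn` are hand-written stand-ins for the two networks.

```python
from typing import Dict, List

def legal_moves(n: int) -> List[int]:
    return [m for m in (1, 2, 3) if m <= n]

def value_fn(n: int) -> float:
    """Stand-in for v(s), from the point of view of the player to move."""
    return -1.0 if n % 4 == 0 else 1.0

def policy_fn(n: int) -> Dict[int, float]:
    """Stand-in for p(a|s): favours moves that leave a multiple of 4."""
    moves = legal_moves(n)
    scores = [3.0 if (n - m) % 4 == 0 else 1.0 for m in moves]
    total = sum(scores)
    return {m: sc / total for m, sc in zip(moves, scores)}

def top_moves(n: int, breadth: int) -> List[int]:
    priors = policy_fn(n)
    # Reduce breadth: keep only the most probable moves.
    return sorted(priors, key=priors.get, reverse=True)[:breadth]

def search(n: int, depth: int, breadth: int) -> float:
    if n == 0:
        return -1.0         # opponent took the last counter: mover has lost
    if depth == 0:
        return value_fn(n)  # reduce depth: evaluate instead of searching on
    return max(-search(n - m, depth - 1, breadth) for m in top_moves(n, breadth))

def best_move(n: int, depth: int, breadth: int) -> int:
    return max(top_moves(n, breadth),
               key=lambda m: -search(n - m, depth - 1, breadth))

print(best_move(10, depth=2, breadth=1))
```

With `depth=2` and `breadth=1` the search looks at a single line two moves deep, yet still finds the winning move from 10 counters because the stand-in evaluation and prior are accurate for this game.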

Go ranks: beginner kyu (k), amateur dan (d), professional dan (p)

Evaluating AlphaGo against computers

[Figure: rating chart with a scale from 0 to 4500 and equivalent ranks from 7k through 1d up to 9p, comparing GnuGo, Fuego, Pachi, Zen, Crazy Stone, AlphaGo (Nature v13) and AlphaGo (Seoul v18)]

Computer Programs      Calibration                                  Human Players
AlphaGo (Mar 2016)     DeepMind challenge match, AlphaGo won 4-1    Lee Sedol (9p), top player of past decade
AlphaGo (Oct 2015)     Nature match, AlphaGo won 5-0                Fan Hui (2p), 3-time reigning European Champion
Crazy Stone and Zen    KGS                                          Amateur humans

In the Computer Programs and Human Players columns, each entry beats the one below it.
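
Win rates and rating gaps can be related with the standard Elo formula. The sketch below assumes the ratings in the chart above are on an Elo-style scale, which the extracted slide text does not state; the function names are hypothetical.

```python
import math

def expected_score(rating_diff: float) -> float:
    """Expected win rate for a player rated rating_diff points higher."""
    return 1.0 / (1.0 + 10.0 ** (-rating_diff / 400.0))

def rating_diff_from_win_rate(win_rate: float) -> float:
    """Inverse of expected_score; win_rate must be strictly between 0 and 1."""
    return 400.0 * math.log10(win_rate / (1.0 - win_rate))

# An 80% win rate (the figure quoted for the reinforcement learning policy
# network against the supervised one) expressed as a rating gap.
gap = rating_diff_from_win_rate(0.8)
print(round(gap, 1), round(expected_score(gap), 3))
```

A perfect score such as 5-0 has no finite rating gap under this formula, which is one reason short matches are a coarse way to calibrate strength.
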
What's Next?
Demis Hassabis
