2016

Mastering the Game of Go With Deep Neural Networks and Tree Search

D. Silver, A. Huang, C. J. Maddison, A. Guez, L. Sifre, G. V. D. Driessche, J. Schrittwieser, I. Antonoglou, V. Panneershelvam, M. Lanctot, S. Dieleman, D. Grewe, J. Nham, N. Kalchbrenner, Ilya Sutskever, T. Lillicrap, M. Leach, Koray Kavukcuoglu, T. Graepel, Demis Hassabis

citations

Cite Score

92

AI summary

This paper introduces AlphaGo, a Go program combining deep neural networks and Monte Carlo tree search (MCTS). AlphaGo uses policy networks, value networks and a novel search algorithm, achieving a 99.8% winning rate against other Go programs and defeating the human European Go champion.

Main Contributions

  • Introduces a novel approach to computer Go using value networks and policy networks.
  • Presents a new search algorithm combining Monte Carlo simulation with value and policy networks.
  • Achieves a 99.8% winning rate against other Go programs.
  • Defeats the human European Go champion by 5 games to 0.
  • Demonstrates the first time a computer program has defeated a human professional player in the full-sized game of Go.

Abstract

The game of Go has long been viewed as the most challenging of classic games for artificial intelligence owing to its enormous search space and the difficulty of evaluating board positions and moves. Here we introduce a new approach to computer Go that uses 'value networks' to evaluate board positions and ‘policy networks’ to select moves. These deep neural networks are trained by a novel combination of supervised learning from human expert games, and reinforcement learning from games of self-play. Without any lookahead search, the neural networks play Go at the level of state-of-the-art Monte Carlo tree search programs that simulate thousands of random games of self-play. We also introduce a new search algorithm that combines Monte Carlo simulation with value and policy networks. Using this search algorithm, our program AlphaGo achieved a 99.8% winning rate against other Go programs, and defeated the human European Go champion by 5 games to 0. This is the first time that a computer program has defeated a human professional player in the full-sized game of Go, a feat previously thought to be at least a decade away.

Citation Graph

Loading graph...

References [37]

Sort:
Filter:

Alex Krizhevsky, Ilya Sutskever, Geoffrey E. Hinton - 2012

71 papers in library cite

I. Goodfellow, Yoshua Bengio, Y. A. Courville, A. Aaron - 2016

5 papers in library cite

V. Mnih - 2015

9 papers in library cite

R. Williams - 1992

11 papers in library cite

Richard S. Sutton, A. Barto - 1998

5 papers in library cite

C. Maddison, A. Huang, Ilya Sutskever, D. Silver - 2015

2 papers in library cite

L. Kocsis, C. Szepesvari - 2006

3 papers in library cite

S. Lawrence, C. Giles, A. Tsoi, A. Back - 1997

3 papers in library cite

R. Coulom - 2006

2 papers in library cite

Richard S. Sutton, D. Mcallester, Shivalika Singh, Y. Mansour - 2000

2 papers in library cite

H. Berliner - 1978

1 paper in library cites

C. Browne, E. Powley, D. Whitehouse, S. Lucas, P. Cowling, P. Rohlfshagen, S. Tavener, D. Perez, S. Samothrakis, S. Colton - 2012

1 paper in library cites

J. Schaeffer, R. Lake, P. Lu, D. Szafron - 1992

1 paper in library cites

D. Mechner - 1998

1 paper in library cites

D. Stern, R. Herbrich, T. Graepel - 2006

1 paper in library cites

Sylvain Gelly, D. Silver - 2007

1 paper in library cites

J. Mandziuk - 2007

1 paper in library cites

M. Muller - 2002

1 paper in library cites

R. Coulom - 2007

1 paper in library cites

M. Campbell, A. Hoane, F. Hsu - 2002

1 paper in library cites

M. Enzenberger - 2003

1 paper in library cites

M. Buro - 1999

1 paper in library cites

M. Muller, M. Enzenberger, B. Arneson, R. Segal - 2010

1 paper in library cites

H. V. D. Herik, J. Uiterwijk, J. V. Rijswijck - 2002

1 paper in library cites

Ilya Sutskever, V. Nair - 2008

1 paper in library cites

B. Bouzy, B. Helmstetter - 2003

1 paper in library cites

Gerald Tesauro, G. Galperin - 1996

1 paper in library cites

P. Baudis, J. Gailly - 2012

1 paper in library cites

L. Allis - 1994

1 paper in library cites

N. Schraudolph, Peter Dayan, T. Sejnowski - 1994

1 paper in library cites

D. Silver, Richard S. Sutton, M. Muller - 2012

1 paper in library cites

J. Schaeffer - 2000

1 paper in library cites

Sylvain Gelly, L. Kocsis, M. Lanctot, C. Stepane, O. Teytaud, M. Winands - 2012

1 paper in library cites

Missing year

A. Levinovitz

1 paper in library cites

C. Clark, A. Storkey - 2015

1 paper in library cites

R. Coulom - 2008

1 paper in library cites

B. Sheppard - 2002

1 paper in library cites

Cited by

5

papers in your library

Cites

6

papers in your library

Read

on November 22, 2025

Your review

Tags

Paper Aliases

No aliases