HOME

TheInfoList



OR:

AlphaGo is a
computer program A computer program is a sequence or set of instructions in a programming language for a computer to Execution (computing), execute. Computer programs are one component of software, which also includes software documentation, documentation and oth ...
that plays the
board game Board games are tabletop games that typically use . These pieces are moved or placed on a pre-marked board (playing surface) and often include elements of table, card, role-playing, and miniatures games as well. Many board games feature a co ...
Go. It was developed by
DeepMind DeepMind Technologies is a British artificial intelligence subsidiary of Alphabet Inc. and research laboratory founded in 2010. DeepMind was acquired by Google in 2014 and became a wholly owned subsidiary of Alphabet Inc, after Google's restru ...
Technologies a subsidiary of
Google Google LLC () is an American Multinational corporation, multinational technology company focusing on Search Engine, search engine technology, online advertising, cloud computing, software, computer software, quantum computing, e-commerce, ar ...
(now
Alphabet Inc. Alphabet Inc. is an American multinational technology conglomerate holding company headquartered in Mountain View, California. It was created through a restructuring of Google on October 2, 2015, and became the parent company of Google and se ...
). Subsequent versions of AlphaGo became increasingly powerful, including a version that competed under the name
Master Master or masters may refer to: Ranks or titles * Ascended master, a term used in the Theosophical religious tradition to refer to spiritually enlightened beings who in past incarnations were ordinary humans *Grandmaster (chess), National Master ...
. After retiring from competitive play, AlphaGo Master was succeeded by an even more powerful version known as AlphaGo Zero, which was completely self-taught without learning from human games. AlphaGo Zero was then generalized into a program known as AlphaZero, which played additional games, including
chess Chess is a board game for two players, called White and Black, each controlling an army of chess pieces in their color, with the objective to checkmate the opponent's king. It is sometimes called international chess or Western chess to dist ...
and
shogi , also known as Japanese chess, is a strategy board game for two players. It is one of the most popular board games in Japan and is in the same family of games as Western chess, '' chaturanga, Xiangqi'', Indian chess, and ''janggi''. ''Shōgi ...
. AlphaZero has in turn been succeeded by a program known as MuZero which learns without being taught the rules. AlphaGo and its successors use a
Monte Carlo tree search In computer science, Monte Carlo tree search (MCTS) is a heuristic search algorithm for some kinds of decision processes, most notably those employed in software that plays board games. In that context MCTS is used to solve the game tree. ...
algorithm to find its moves based on knowledge previously acquired by
machine learning Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence. Machine ...
, specifically by an
artificial neural network Artificial neural networks (ANNs), usually simply called neural networks (NNs) or neural nets, are computing systems inspired by the biological neural networks that constitute animal brains. An ANN is based on a collection of connected unit ...
(a
deep learning Deep learning (also known as deep structured learning) is part of a broader family of machine learning methods based on artificial neural networks with representation learning. Learning can be supervised, semi-supervised or unsupervised. ...
method) by extensive training, both from human and computer play. A neural network is trained to identify the best moves and the winning percentages of these moves. This neural network improves the strength of the tree search, resulting in stronger move selection in the next iteration. In October 2015, in a match against Fan Hui, the original AlphaGo became the first
computer Go Computer Go is the field of artificial intelligence (AI) dedicated to creating a computer program that plays the traditional board game Go. The field is sharply divided into two eras. Before 2015, the programs of the era were weak. The best ...
program to beat a human professional Go player without handicap on a full-sized 19×19 board. In March 2016, it beat
Lee Sedol Lee Sedol ( ko, 이세돌; born 2 March 1983), or Lee Se-dol, is a former South Korean professional Go player of 9 dan rank. As of February 2016, he ranked second in international titles (18), behind only Lee Chang-ho (21). He is the f ...
in a five-game match, the first time a computer Go program has beaten a
9-dan There are various systems of Go ranks and ratings that measure the skill in the traditional board game Go. Traditionally, Go rankings have been measured using a system of dan and kyu ranks. Especially in amateur play, these ranks facilitate th ...
professional without handicap. Although it lost to Lee Sedol in the fourth game, Lee resigned in the final game, giving a final score of 4 games to 1 in favour of AlphaGo. In recognition of the victory, AlphaGo was awarded an honorary 9-dan by the
Korea Baduk Association The Korea Baduk Association, also known as Hanguk Kiwon (), was founded in November 1945 by Cho Namchul. Baduk is a game which was present in Korea by the 5th century. It originated in China, but the West is more familiar with the Japanese name ...
. The lead up and the challenge match with Lee Sedol were documented in a documentary film also titled '' AlphaGo'', directed by Greg Kohs. The win by AlphaGo was chosen by ''
Science Science is a systematic endeavor that builds and organizes knowledge in the form of testable explanations and predictions about the universe. Science may be as old as the human species, and some of the earliest archeological evidence ...
'' as one of the
Breakthrough of the Year The Breakthrough of the Year is an annual award for the most significant development in scientific research made by the AAAS journal '' Science,'' an academic journal covering all branches of science. Originating in 1989 as the ''Molecule of the Y ...
runners-up on 22 December 2016. At the 2017
Future of Go Summit The Future of Go Summit () was held in May 2017 by the Chinese Go Association, Sport Bureau of Zhejiang Province and Google in Wuzhen, Zhejiang, the permanent host of the World Internet Conference. It featured five Go games involving AlphaGo and t ...
, the
Master Master or masters may refer to: Ranks or titles * Ascended master, a term used in the Theosophical religious tradition to refer to spiritually enlightened beings who in past incarnations were ordinary humans *Grandmaster (chess), National Master ...
version of AlphaGo beat Ke Jie, the number one ranked player in the world at the time, in a three-game match, after which AlphaGo was awarded professional 9-dan by the
Chinese Weiqi Association Chinese Weiqi Association (), or Chinese Go Association, founded in Hefei, Anhui in 1962, is the major go organization in China. As a branch of the Zhongguo Qiyuan, it oversees professional players as well as strong amateurs, functioning in the s ...
. After the match between AlphaGo and Ke Jie, DeepMind retired AlphaGo, while continuing AI research in other areas. The self-taught AlphaGo Zero achieved a 100–0 victory against the early competitive version of AlphaGo, and its successor AlphaZero is currently perceived as the world's top player in Go.


History

Go is considered much more difficult for computers to win than other games such as
chess Chess is a board game for two players, called White and Black, each controlling an army of chess pieces in their color, with the objective to checkmate the opponent's king. It is sometimes called international chess or Western chess to dist ...
, because its strategic and aesthetic nature makes it hard to directly construct an evaluation function, and its much larger branching factor makes it prohibitively difficult to use traditional AI methods such as
alpha–beta pruning Alpha–beta pruning is a search algorithm that seeks to decrease the number of nodes that are evaluated by the minimax algorithm in its search tree. It is an adversarial search algorithm used commonly for machine playing of two-player games ...
,
tree traversal In computer science, tree traversal (also known as tree search and walking the tree) is a form of graph traversal and refers to the process of visiting (e.g. retrieving, updating, or deleting) each node in a tree data structure, exactly once. ...
and
heuristic A heuristic (; ), or heuristic technique, is any approach to problem solving or self-discovery that employs a practical method that is not guaranteed to be optimal, perfect, or rational, but is nevertheless sufficient for reaching an immediate ...
search. Almost two decades after IBM's computer Deep Blue beat world chess champion
Garry Kasparov Garry Kimovich Kasparov (born 13 April 1963) is a Russian chess grandmaster, former World Chess Champion, writer, political activist and commentator. His peak rating of 2851, achieved in 1999, was the highest recorded until being surpassed by ...
in the 1997 match, the strongest Go programs using
artificial intelligence Artificial intelligence (AI) is intelligence—perceiving, synthesizing, and inferring information—demonstrated by machines, as opposed to intelligence displayed by animals and humans. Example tasks in which this is done include speech ...
techniques only reached about amateur 5-dan level, and still could not beat a professional Go player without a handicap. In 2012, the software program
Zen Zen ( zh, t=禪, p=Chán; ja, text= 禅, translit=zen; ko, text=선, translit=Seon; vi, text=Thiền) is a school of Mahayana Buddhism that originated in China during the Tang dynasty, known as the Chan School (''Chánzong'' 禪宗), and ...
, running on a four PC cluster, beat Masaki Takemiya ( 9p) twice at five- and four-stone handicaps. In 2013, Crazy Stone beat Yoshio Ishida (9p) at a four-stone handicap. According to DeepMind's David Silver, the AlphaGo research project was formed around 2014 to test how well a neural network using
deep learning Deep learning (also known as deep structured learning) is part of a broader family of machine learning methods based on artificial neural networks with representation learning. Learning can be supervised, semi-supervised or unsupervised. ...
can compete at Go. AlphaGo represents a significant improvement over previous Go programs. In 500 games against other available Go programs, including Crazy Stone and Zen, AlphaGo running on a single computer won all but one. In a similar matchup, AlphaGo running on multiple computers won all 500 games played against other Go programs, and 77% of games played against AlphaGo running on a single computer. The distributed version in October 2015 was using 1,202
CPU A central processing unit (CPU), also called a central processor, main processor or just processor, is the electronic circuitry that executes instructions comprising a computer program. The CPU performs basic arithmetic, logic, controlling, a ...
s and 176 GPUs.


Match against Fan Hui

In October 2015, the distributed version of AlphaGo defeated the European Go champion Fan Hui, a 2-dan (out of 9 dan possible) professional, five to zero. This was the first time a computer Go program had beaten a professional human player on a full-sized board without handicap. The announcement of the news was delayed until 27 January 2016 to coincide with the publication of a paper in the journal ''
Nature Nature, in the broadest sense, is the physical world or universe. "Nature" can refer to the phenomena of the physical world, and also to life in general. The study of nature is a large, if not the only, part of science. Although humans are ...
'' describing the algorithms used.


Match against Lee Sedol

AlphaGo played South Korean professional Go player
Lee Sedol Lee Sedol ( ko, 이세돌; born 2 March 1983), or Lee Se-dol, is a former South Korean professional Go player of 9 dan rank. As of February 2016, he ranked second in international titles (18), behind only Lee Chang-ho (21). He is the f ...
, ranked 9-dan, one of the best players at Go, with five games taking place at the
Four Seasons Hotel Four Seasons Hotels Limited, trading as Four Seasons Hotels and Resorts, is an international luxury hotel and resort company headquartered in Toronto, Ontario, Canada. Four Seasons currently operates more than 100 hotels and resorts worldwide.D ...
in
Seoul Seoul (; ; ), officially known as the Seoul Special City, is the Capital city, capital and largest metropolis of South Korea.Before 1972, Seoul was the ''de jure'' capital of the North Korea, Democratic People's Republic of Korea (North Korea ...
, South Korea on 9, 10, 12, 13, and 15 March 2016, which were video-streamed live. Out of five games, AlphaGo won four games and Lee won the fourth game which made him recorded as the only human player who beat AlphaGo in all of its 74 official games. AlphaGo ran on Google's cloud computing with its servers located in the United States. The match used Chinese rules with a 7.5-point komi, and each side had two hours of thinking time plus three 60-second
byoyomi A time control is a mechanism in the tournament play of almost all two-player board games so that each round of the match can finish in a timely way and the tournament can proceed. Time controls are typically enforced by means of a game clock, ...
periods. The version of AlphaGo playing against Lee used a similar amount of computing power as was used in the Fan Hui match. ''
The Economist ''The Economist'' is a British weekly newspaper printed in demitab format and published digitally. It focuses on current affairs, international business, politics, technology, and culture. Based in London, the newspaper is owned by The Eco ...
'' reported that it used 1,920 CPUs and 280 GPUs. At the time of play, Lee Sedol had the second-highest number of Go international championship victories in the world after South Korean player Lee Changho who kept the world championship title for 16 years. Since there is no single official method of ranking in international Go, the rankings may vary among the sources. While he was ranked top sometimes, some sources ranked Lee Sedol as the fourth-best player in the world at the time. AlphaGo was not specifically trained to face Lee nor was designed to compete with any specific human players. The first three games were won by AlphaGo following resignations by Lee. However, Lee beat AlphaGo in the fourth game, winning by resignation at move 180. AlphaGo then continued to achieve a fourth win, winning the fifth game by resignation. The prize was US$1 million. Since AlphaGo won four out of five and thus the series, the prize will be donated to charities, including
UNICEF UNICEF (), originally called the United Nations International Children's Emergency Fund in full, now officially United Nations Children's Fund, is an agency of the United Nations responsible for providing humanitarian and developmental aid to c ...
. Lee Sedol received $150,000 for participating in all five games and an additional $20,000 for his win in Game 4. In June 2016, at a presentation held at a university in the Netherlands, Aja Huang, one of the Deep Mind team, revealed that they had patched the logical weakness that occurred during the 4th game of the match between AlphaGo and Lee, and that after move 78 (which was dubbed the " divine move" by many professionals), it would play as intended and maintain Black's advantage. Before move 78, AlphaGo was leading throughout the game, but Lee's move caused the program's computing powers to be diverted and confused. Huang explained that AlphaGo's policy network of finding the most accurate move order and continuation did not precisely guide AlphaGo to make the correct continuation after move 78, since its value network did not determine Lee's 78th move as being the most likely, and therefore when the move was made AlphaGo could not make the right adjustment to the logical continuation.


Sixty online games

On 29 December 2016, a new account on the
Tygem Tygem ( ko, 타이젬바둑) is an internet go server owned by South Korean company TongYang Online. Popular in Asia, their website states that over 500 professional Go players use their service. Tygem was founded as ICBL by Cho Hun-hyun in 200 ...
server named "Magister" (shown as 'Magist' at the server's Chinese version) from South Korea began to play games with professional players. It changed its account name to "Master" on 30 December, then moved to the FoxGo server on 1 January 2017. On 4 January, DeepMind confirmed that the "Magister" and the "Master" were both played by an updated version of AlphaGo, called
AlphaGo Master Master is a version of DeepMind's Go software AlphaGo, named after the account name (originally Magister/Magist) used online, which won 60 straight online games against human professional Go players from 29 December 2016 to 4 January 2017. This ver ...
. As of 5 January 2017, AlphaGo Master's online record was 60 wins and 0 losses, including three victories over Go's top-ranked player, Ke Jie, who had been quietly briefed in advance that Master was a version of AlphaGo. After losing to Master, Gu Li offered a bounty of 100,000 yuan (US$14,400) to the first human player who could defeat Master. Master played at the pace of 10 games per day. Many quickly suspected it to be an AI player due to little or no resting between games. Its adversaries included many world champions such as Ke Jie, Park Jeong-hwan,
Yuta Iyama is a Japanese professional Go player. In April 2016, he became the first player in Japanese history to hold all seven major titles simultaneously. In January 2018, Iyama became the first professional Go player to be awarded Japan's People' ...
, Tuo Jiaxi,
Mi Yuting Mi Yuting (; born 8 January 1996) is a Chinese professional go player. As of Oct 2018, he is ranking 1st in Go ratings with an Elo rating of 3645. Promotion record Titles and runners-up Head-to-head record vs selected players ''Players ...
, Shi Yue,
Chen Yaoye Chen Yaoye (Traditional: 陳耀燁; Simplified: 陈耀烨; Pinyin: Chén Yàoyè; born on December 16, 1989) is a Chinese professional Go player. Biography Chen Yaoye was born in Beijing, China. He is a young Go player who, at the age of ...
, Li Qincheng, Gu Li, Chang Hao, Tang Weixing,
Fan Tingyu Fan Tingyu (born 6 August 1996) is a Chinese professional Go player. He won the 17th Xinren Wang and 18th Xinren Wang. Fan defeated Park Junghwan (b. 1993) -1in the final of the 7th (2012/13) Ing Cup The Ing Cup () is an internation ...
,
Zhou Ruiyang Zhou Ruiyang (; born March 8, 1991) is a Chinese professional Go player. Biography Zhou began playing Go at the age of 7. He won the biggest amateur tournament in China, the Wanbao Cup, both the same year before he became a professiona ...
,
Jiang Weijie Jiang Weijie (born 17 October 1991) is a Chinese professional Go player. Jiang became a professional in 2005. He won his first title, the RICOH Xinxiu Cup, in 2008. Jiang ended Gu Li's six-year reign over the Mingren The Mingren () is a Go ...
,
Chou Chun-hsun Chou Chun-hsun ( Taiwanese POJ: Chiu Chùn-hun; born February 23, 1980) is a Go player."Youngsters dominate Ing Cup". Xinhua News Agency. April 22, 2004 (via Lexis-Nexis). Retrieved February 13, 2010. Biography Chou was born in Taipei, Taiwa ...
, Kim Ji-seok, Kang Dong-yun,
Park Yeong-hun Park Yeong-hun ( ko, 박영훈, born April 1, 1985), also known as Park Young-hoon and Pak Yeong-hoon, is a South Korean professional Go player. Biography Park Yeong-hun was born in Seoul. He is a professional Go player in the Hanguk Kiwon. He ...
, and
Won Seong-jin Won Seong-jin (born 15 July 1985) is a professional Go player. Biography Won became a professional in 1998. In 2011, Won entered his first World Championship Final, the 16th Samsung Cup, by defeating Pak Yeong-hun in the quarter-finals, and ...
; national champions or world championship runners-up such as Lian Xiao,
Tan Xiao Tan Xiao (; born 10 March 1993) is a Chinese professional Go player. He won the 11th RICOH Cup. In 2017, he defeated Park Yeong-hun to win the 11th Chunlan Cup The Chunlan Cup, officially the Chunlan Cup World Professional Weiqi Championship ...
, Meng Tailing, Dang Yifei, Huang Yunsong,
Yang Dingxin Yang Dingxin (; born 19 October 1998) is a Chinese professional Go player. Biography Yang Dingxin was born in 1998 in Zhengzhou, Henan. As a young child, he lived in Bingcha in Rudong County, Nantong, Jiangsu and in Zhengzhou, and moved to B ...
, Gu Zihao, Shin Jinseo, Cho Han-seung, and An Sungjoon. All 60 games except one were fast-paced games with three 20 or 30 seconds byo-yomi. Master offered to extend the byo-yomi to one minute when playing with
Nie Weiping Nie Weiping (; born 17 August 1952) is a professional Go player. Biography Born in Shenzhou, Nie began learning Go at the age of nine and won the inaugural World Amateur Go Championship in 1979. Nie was given 9 dan rank in 1982. He became ...
in consideration of his age. After winning its 59th game Master revealed itself in the chatroom to be controlled by Dr.
Aja Huang Aja Huang (; born 1978) is a Taiwanese computer scientist and expert on artificial intelligence. He works for DeepMind and was a member of the AlphaGo project. Born in 1978, Huang received a bachelor's degree from National Chiao Tung University i ...
of the DeepMind team, then changed its nationality to the United Kingdom. After these games were completed, the co-founder of
DeepMind DeepMind Technologies is a British artificial intelligence subsidiary of Alphabet Inc. and research laboratory founded in 2010. DeepMind was acquired by Google in 2014 and became a wholly owned subsidiary of Alphabet Inc, after Google's restru ...
, Demis Hassabis, said in a tweet, "we're looking forward to playing some official, full-length games later
017 Seventeen or 17 may refer to: *17 (number), the natural number following 16 and preceding 18 * one of the years 17 BC, AD 17, 1917, 2017 Literature Magazines * ''Seventeen'' (American magazine), an American magazine * ''Seventeen'' (Japanese ...
in collaboration with Go organizations and experts". Go experts were impressed by the program's performance and its nonhuman play style; Ke Jie stated that "After humanity spent thousands of years improving our tactics, computers tell us that humans are completely wrong... I would go as far as to say not a single human has touched the edge of the truth of Go."


Future of Go Summit

In the Future of Go Summit held in
Wuzhen Wuzhen (, Wu: Whu-tsen lit. "Wu Town") is a historic scenic town, part of Tongxiang, located in the north of Zhejiang Province, China. It lies within the triangle formed by Hangzhou, Suzhou and Shanghai. Covering an area of , Wuzhen has a tota ...
in May 2017,
AlphaGo Master Master is a version of DeepMind's Go software AlphaGo, named after the account name (originally Magister/Magist) used online, which won 60 straight online games against human professional Go players from 29 December 2016 to 4 January 2017. This ver ...
played three games with Ke Jie, the world No.1 ranked player, as well as two games with several top Chinese professionals, one pair Go game and one against a collaborating team of five human players. Google DeepMind offered 1.5 million dollar winner prizes for the three-game match between Ke Jie and Master while the losing side took 300,000 dollars. Master won all three games against Ke Jie, after which AlphaGo was awarded professional 9-dan by the Chinese Weiqi Association. After winning its three-game match against Ke Jie, the top-rated world Go player, AlphaGo retired. DeepMind also disbanded the team that worked on the game to focus on AI research in other areas. After the Summit, Deepmind published 50 full length AlphaGo vs AlphaGo matches, as a gift to the Go community.


AlphaGo Zero and AlphaZero

AlphaGo's team published an article in the journal ''
Nature Nature, in the broadest sense, is the physical world or universe. "Nature" can refer to the phenomena of the physical world, and also to life in general. The study of nature is a large, if not the only, part of science. Although humans are ...
'' on 19 October 2017, introducing AlphaGo Zero, a version without human data and stronger than any previous human-champion-defeating version. By playing games against itself, AlphaGo Zero surpassed the strength of AlphaGo Lee in three days by winning 100 games to 0, reached the level of
AlphaGo Master Master is a version of DeepMind's Go software AlphaGo, named after the account name (originally Magister/Magist) used online, which won 60 straight online games against human professional Go players from 29 December 2016 to 4 January 2017. This ver ...
in 21 days, and exceeded all the old versions in 40 days. In a paper released on
arXiv arXiv (pronounced "archive"—the X represents the Greek letter chi ⟨χ⟩) is an open-access repository of electronic preprints and postprints (known as e-prints) approved for posting after moderation, but not peer review. It consists of ...
on 5 December 2017, DeepMind claimed that it generalized AlphaGo Zero's approach into a single AlphaZero algorithm, which achieved within 24 hours a superhuman level of play in the games of
chess Chess is a board game for two players, called White and Black, each controlling an army of chess pieces in their color, with the objective to checkmate the opponent's king. It is sometimes called international chess or Western chess to dist ...
,
shogi , also known as Japanese chess, is a strategy board game for two players. It is one of the most popular board games in Japan and is in the same family of games as Western chess, '' chaturanga, Xiangqi'', Indian chess, and ''janggi''. ''Shōgi ...
, and Go by defeating world-champion programs,
Stockfish Stockfish is unsalted fish, especially cod, dried by cold air and wind on wooden racks (which are called "hjell" in Norway) on the foreshore. The drying of food is the world's oldest known preservation method, and dried fish has a storage lif ...
, Elmo, and 3-day version of AlphaGo Zero in each case.


Teaching tool

On 11 December 2017, DeepMind released AlphaGo teaching tool on its website to analyze winning rates of different Go openings as calculated by
AlphaGo Master Master is a version of DeepMind's Go software AlphaGo, named after the account name (originally Magister/Magist) used online, which won 60 straight online games against human professional Go players from 29 December 2016 to 4 January 2017. This ver ...
. The teaching tool collects 6,000 Go openings from 230,000 human games each analyzed with 10,000,000 simulations by AlphaGo Master. Many of the openings include human move suggestions.


Versions

An early version of AlphaGo was tested on hardware with various numbers of
CPU A central processing unit (CPU), also called a central processor, main processor or just processor, is the electronic circuitry that executes instructions comprising a computer program. The CPU performs basic arithmetic, logic, controlling, a ...
s and GPUs, running in asynchronous or distributed mode. Two seconds of thinking time was given to each move. The resulting
Elo rating The Elo rating system is a method for calculating the relative skill levels of players in zero-sum games such as chess. It is named after its creator Arpad Elo, a Hungarian-American physics professor. The Elo system was invented as an improved c ...
s are listed below. In the matches with more time per move higher ratings are achieved. In May 2016, Google unveiled its own proprietary hardware "
tensor processing unit Tensor Processing Unit (TPU) is an AI accelerator application-specific integrated circuit (ASIC) developed by Google for neural network machine learning, using Google's own TensorFlow software. Google began using TPUs internally in 2015, and ...
s", which it stated had already been deployed in multiple internal projects at Google, including the AlphaGo match against Lee Sedol. In the
Future of Go Summit The Future of Go Summit () was held in May 2017 by the Chinese Go Association, Sport Bureau of Zhejiang Province and Google in Wuzhen, Zhejiang, the permanent host of the World Internet Conference. It featured five Go games involving AlphaGo and t ...
in May 2017, DeepMind disclosed that the version of AlphaGo used in this Summit was
AlphaGo Master Master is a version of DeepMind's Go software AlphaGo, named after the account name (originally Magister/Magist) used online, which won 60 straight online games against human professional Go players from 29 December 2016 to 4 January 2017. This ver ...
, and revealed that it had measured the strength of different versions of the software. AlphaGo Lee, the version used against Lee, could give AlphaGo Fan, the version used in AlphaGo vs. Fan Hui, three stones, and AlphaGo Master was even three stones stronger.


Algorithm

As of 2016, AlphaGo's algorithm uses a combination of
machine learning Machine learning (ML) is a field of inquiry devoted to understanding and building methods that 'learn', that is, methods that leverage data to improve performance on some set of tasks. It is seen as a part of artificial intelligence. Machine ...
and
tree search In computer science, tree traversal (also known as tree search and walking the tree) is a form of graph traversal and refers to the process of visiting (e.g. retrieving, updating, or deleting) each node in a tree data structure, exactly once. ...
techniques, combined with extensive training, both from human and computer play. It uses
Monte Carlo tree search In computer science, Monte Carlo tree search (MCTS) is a heuristic search algorithm for some kinds of decision processes, most notably those employed in software that plays board games. In that context MCTS is used to solve the game tree. ...
, guided by a "value network" and a "policy network," both implemented using
deep neural network Deep learning (also known as deep structured learning) is part of a broader family of machine learning methods based on artificial neural networks with representation learning. Learning can be supervised, semi-supervised or unsupervised. D ...
technology. A limited amount of game-specific feature detection pre-processing (for example, to highlight whether a move matches a nakade pattern) is applied to the input before it is sent to the neural networks. The networks are convolutional neural networks with 12 layers, trained by reinforcement learning. The system's neural networks were initially bootstrapped from human gameplay expertise. AlphaGo was initially trained to mimic human play by attempting to match the moves of expert players from recorded historical games, using a database of around 30 million moves. Once it had reached a certain degree of proficiency, it was trained further by being set to play large numbers of games against other instances of itself, using reinforcement learning to improve its play. To avoid "disrespectfully" wasting its opponent's time, the program is specifically programmed to resign if its assessment of win probability falls beneath a certain threshold; for the match against Lee, the resignation threshold was set to 20%.


Style of play

Toby Manning, the match referee for AlphaGo vs. Fan Hui, has described the program's style as "conservative". AlphaGo's playing style strongly favours greater probability of winning by fewer points over lesser probability of winning by more points. Its strategy of maximising its probability of winning is distinct from what human players tend to do which is to maximise territorial gains, and explains some of its odd-looking moves. It makes a lot of opening moves that have never or seldom been made by humans. It likes to use shoulder hits, especially if the opponent is over concentrated.


Responses to 2016 victory


AI community

AlphaGo's March 2016 victory was a major milestone in artificial intelligence research. Go had previously been regarded as a hard problem in machine learning that was expected to be out of reach for the technology of the time. Most experts thought a Go program as powerful as AlphaGo was at least five years away; some experts thought that it would take at least another decade before computers would beat Go champions. Most observers at the beginning of the 2016 matches expected Lee to beat AlphaGo. With games such as checkers (that has been " solved" by the Chinook draughts player team), chess, and now Go won by computers, victories at popular board games can no longer serve as major milestones for artificial intelligence in the way that they used to. Deep Blue's
Murray Campbell Murray Campbell is a Canadian computer scientist known for being part of the team that created Deep Blue; the first computer to defeat a world chess champion. Biography Campbell was involved in surveillance projects related to petroleum produc ...
called AlphaGo's victory "the end of an era... board games are more or less done and it's time to move on." When compared with Deep Blue or
Watson Watson may refer to: Companies * Actavis, a pharmaceutical company formerly known as Watson Pharmaceuticals * A.S. Watson Group, retail division of Hutchison Whampoa * Thomas J. Watson Research Center, IBM research center * Watson Systems, make ...
, AlphaGo's underlying algorithms are potentially more general-purpose and may be evidence that the scientific community is making progress towards
artificial general intelligence Artificial general intelligence (AGI) is the ability of an intelligent agent to understand or learn any intellectual task that a human being can. It is a primary goal of some artificial intelligence research and a common topic in science fictio ...
. Some commentators believe AlphaGo's victory makes for a good opportunity for society to start preparing for the possible future impact of machines with general purpose intelligence. As noted by entrepreneur Guy Suter, AlphaGo only knows how to play Go and doesn't possess general-purpose intelligence; " tcouldn't just wake up one morning and decide it wants to learn how to use firearms." AI researcher Stuart Russell said that AI systems such as AlphaGo have progressed quicker and become more powerful than expected, and we must therefore develop methods to ensure they "remain under human control". Some scholars, such as Stephen Hawking, warned (in May 2015 before the matches) that some future self-improving AI could gain actual general intelligence, leading to an unexpected AI takeover; other scholars disagree: AI expert Jean-Gabriel Ganascia believes that "Things like '
common sense ''Common Sense'' is a 47-page pamphlet written by Thomas Paine in 1775–1776 advocating independence from Great Britain to people in the Thirteen Colonies. Writing in clear and persuasive prose, Paine collected various moral and political arg ...
'... may never be reproducible", and says "I don't see why we would speak about fears. On the contrary, this raises hopes in many domains such as health and space exploration." Computer scientist Richard Sutton said "I don't think people should be scared... but I do think people should be paying attention." In China, AlphaGo was a " Sputnik moment" which helped convince the Chinese government to prioritize and dramatically increase funding for artificial intelligence. In 2017, the DeepMind AlphaGo team received the inaugural
IJCAI The International Joint Conference on Artificial Intelligence (IJCAI) is the leading conference in the field of Artificial Intelligence. The conference series has been organized by the nonprofit IJCAI Organization since 1969, making it the oldest pr ...
Marvin Minsky Marvin Lee Minsky (August 9, 1927 – January 24, 2016) was an American cognitive and computer scientist concerned largely with research of artificial intelligence (AI), co-founder of the Massachusetts Institute of Technology's AI laboratory ...
medal for Outstanding Achievements in AI. "AlphaGo is a wonderful achievement, and a perfect example of what the Minsky Medal was initiated to recognise", said Professor
Michael Wooldridge Michael Richard Lewis Wooldridge (born 7 November 1956) is an Australian doctor, company director, and former politician. He served as deputy leader of the Liberal Party from 1993 to 1994, under John Hewson. In the Howard Government he held m ...
, Chair of the IJCAI Awards Committee. "What particularly impressed IJCAI was that AlphaGo achieves what it does through a brilliant combination of classic AI techniques as well as the state-of-the-art machine learning techniques that DeepMind is so closely associated with. It's a breathtaking demonstration of contemporary AI, and we are delighted to be able to recognise it with this award."


Go community

Go is a popular game in China, Japan and Korea, and the 2016 matches were watched by perhaps a hundred million people worldwide. Many top Go players characterized AlphaGo's unorthodox plays as seemingly-questionable moves that initially befuddled onlookers, but made sense in hindsight: "All but the very best Go players craft their style by imitating top players. AlphaGo seems to have totally original moves it creates itself." AlphaGo appeared to have unexpectedly become much stronger, even when compared with its October 2015 match where a computer had beaten a Go professional for the first time ever without the advantage of a handicap. The day after Lee's first defeat, Jeong Ahram, the lead Go correspondent for one of South Korea's biggest daily newspapers, said "Last night was very gloomy... Many people drank alcohol." The
Korea Baduk Association The Korea Baduk Association, also known as Hanguk Kiwon (), was founded in November 1945 by Cho Namchul. Baduk is a game which was present in Korea by the 5th century. It originated in China, but the West is more familiar with the Japanese name ...
, the organization that oversees Go professionals in South Korea, awarded AlphaGo an honorary 9-dan title for exhibiting creative skills and pushing forward the game's progress. China's Ke Jie, an 18-year-old generally recognized as the world's best Go player at the time, initially claimed that he would be able to beat AlphaGo, but declined to play against it for fear that it would "copy my style". As the matches progressed, Ke Jie went back and forth, stating that "it is highly likely that I (could) lose" after analysing the first three matches, but regaining confidence after AlphaGo displayed flaws in the fourth match. Toby Manning, the referee of AlphaGo's match against Fan Hui, and Hajin Lee, secretary general of the
International Go Federation The International Go Federation (IGF) is an international organization that connects the various national Go federations around the world. Role The role of the IGF is to promote the sport of Go throughout the world, promote amicable relations ...
, both reason that in the future, Go players will get help from computers to learn what they have done wrong in games and improve their skills. After game two, Lee said he felt "speechless": "From the very beginning of the match, I could never manage an upper hand for one single move. It was AlphaGo's total victory." Lee apologized for his losses, stating after game three that "I misjudged the capabilities of AlphaGo and felt powerless." He emphasized that the defeat was "Lee Se-dol's defeat" and "not a defeat of mankind". Lee said his eventual loss to a machine was "inevitable" but stated that "robots will never understand the beauty of the game the same way that we humans do." Lee called his game four victory a "priceless win that I (would) not exchange for anything."


Similar systems

Facebook Facebook is an online social media and social networking service owned by American company Meta Platforms. Founded in 2004 by Mark Zuckerberg with fellow Harvard College students and roommates Eduardo Saverin, Andrew McCollum, Dust ...
has also been working on its own Go-playing system '' darkforest'', also based on combining machine learning and
Monte Carlo tree search In computer science, Monte Carlo tree search (MCTS) is a heuristic search algorithm for some kinds of decision processes, most notably those employed in software that plays board games. In that context MCTS is used to solve the game tree. ...
. Although a strong player against other computer Go programs, as of early 2016, it had not yet defeated a professional human player. Darkforest has lost to CrazyStone and Zen and is estimated to be of similar strength to CrazyStone and Zen. DeepZenGo, a system developed with support from video-sharing website
Dwango The Dial-up Wide-Area Network Game Operation, better known by the acronym DWANGO, was an early online gaming service based in the United States. Launched in 1994, it was originally known for its compatibility with '' Doom'', for which it functio ...
and the
University of Tokyo , abbreviated as or UTokyo, is a public research university located in Bunkyō, Tokyo, Japan. Established in 1877, the university was the first Imperial University and is currently a Top Type university of the Top Global University Project b ...
, lost 2–1 in November 2016 to Go master
Cho Chikun Cho Chikun ''25th Honinbo'' ''Honorary Meijin'' ( ko, 조치훈; born June 20, 1956) is a professional Go player and a nephew of Cho Namchul. Born in Busan, South Korea, he is affiliated to Nihon Ki-in. His total title tally of 75 titles is t ...
, who holds the record for the largest number of Go title wins in Japan. A 2018 paper in
Nature Nature, in the broadest sense, is the physical world or universe. "Nature" can refer to the phenomena of the physical world, and also to life in general. The study of nature is a large, if not the only, part of science. Although humans are ...
cited AlphaGo's approach as the basis for a new means of computing potential pharmaceutical drug molecules.


Example game

AlphaGo Master Master is a version of DeepMind's Go software AlphaGo, named after the account name (originally Magister/Magist) used online, which won 60 straight online games against human professional Go players from 29 December 2016 to 4 January 2017. This ver ...
(white) v.
Tang Weixing Tang Weixing (; born 15 January 1993) is a Chinese professional go player. He has won three international titles, with two championships in the Samsung Cup (2013, 2019) and one in the Ing Cup (2016). Early life Tang Weixing was born in Guiyan ...
(31 December 2016), AlphaGo won by resignation. White 36 was widely praised.


Impacts on Go

The documentary film '' AlphaGo'' raised hopes that
Lee Sedol Lee Sedol ( ko, 이세돌; born 2 March 1983), or Lee Se-dol, is a former South Korean professional Go player of 9 dan rank. As of February 2016, he ranked second in international titles (18), behind only Lee Chang-ho (21). He is the f ...
and Fan Hui would have benefitted from their experience of playing AlphaGo, but as of May 2018 their ratings were little changed;
Lee Sedol Lee Sedol ( ko, 이세돌; born 2 March 1983), or Lee Se-dol, is a former South Korean professional Go player of 9 dan rank. As of February 2016, he ranked second in international titles (18), behind only Lee Chang-ho (21). He is the f ...
was ranked 11th in the world, and Fan Hui 545th. On 19 November 2019, Lee announced his retirement from professional play, arguing that he could never be the top overall player of Go due to the increasing dominance of AI. Lee referred to them as being "an entity that cannot be defeated".


See also

* Albert Lindsey Zobrist, wrote first Go program in 1968 * Chinook (draughts player),
draughts Checkers (American English), also known as draughts (; British English), is a group of strategy board games for two players which involve diagonal moves of uniform game pieces and mandatory captures by jumping over opponent pieces. Checker ...
playing program *
Glossary of artificial intelligence This glossary of artificial intelligence is a list of definitions of terms and concepts relevant to the study of artificial intelligence, its sub-disciplines, and related fields. Related glossaries include Glossary of computer science, Glossary o ...
* Go and mathematics *
Leela (software) Leela is a computer Go software developed by Belgian programmer Gian-Carlo Pascutto, the author of chess engine Sjeng. It won the third place for 19x19 board Go and the second place for 9x9 board Go at the Computer Olympiad in 2008, and won the e ...
* Leela Zero, open-source learning Go program *
Matchbox Educable Noughts and Crosses Engine The Matchbox Educable Noughts and Crosses Engine (sometimes called the Machine Educable Noughts and Crosses Engine or MENACE) was a mechanical computer made from 304 matchboxes designed and built by artificial intelligence researcher Donald Mi ...
* Samuel's learning computer checkers (draughts) * TD-Gammon,
backgammon Backgammon is a two-player board game played with counters and dice on tables boards. It is the most widespread Western member of the large family of tables games, whose ancestors date back nearly 5,000 years to the regions of Mesopotamia and Pe ...
neural network * Pluribus (poker bot) * AlphaZero * AlphaFold


References


External links

* * *
AlphaGo wiki
at
Sensei's Library Sensei's Library (commonly referred to as SL among Go-players) is an Internet website and wiki A wiki ( ) is an online hypertext publication collaboratively edited and managed by its own audience, using a web browser. A typical wiki contai ...
, including links to AlphaGo games
AlphaGo page
with archive and games
Estimated 2017 rating of Alpha Go
* {{Go (game) 2015 software Applications of artificial intelligence Go engines Google Applied machine learning