Evaluation Function Based Monte-Carlo LOA

Mark H. M. Winands; Yngvi Bjornsson

doi:10.1007/978-3-642-12993-3_4

Mark H. M. Winands^*, Yngvi Bjornsson

^*Corresponding author for this work

Networks and Strategic Optimization

Research output: Chapter in Book/Report/Conference proceeding › Chapter › Academic

Abstract

Recently, Monte-Carlo Tree Search (MCTS) has advanced the field of computer Go substantially. Also in the game of Lines of Action (LOA), which has been dominated so far by αβ, MCTS is making an inroad. In this paper we investigate how to use a positional evaluation function in a Monte-Carlo simulation-based LOA program (MC-LOA). Four different simulation strategies are designed, called Evaluation Cut-Off, Corrective, Greedy, and Mixed. They use an evaluation function in several ways. Experimental results reveal that the Mixed strategy is the best among them. This strategy draws the moves randomly based on their transition probabilities in the first part of a simulation, but selects them based on their evaluation score in the second part of a simulation. Using this simulation strategy the MC-LOA program plays at the same level as the αβ program MIA, the best LOA-playing entity in the world.
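
The two-phase idea behind the mixed strategy can be illustrated with a minimal Python sketch. It is not the authors' implementation: the game interface (`legal_moves`, `apply_move`, `is_terminal`), the `move_weight` stand-in for transition probabilities, the `switch_ply` parameter, and the convention that `evaluate` scores a position from the viewpoint of the player who just moved are all assumptions made for illustration; the abstract does not specify them.

```python
import random


def mixed_playout(state, legal_moves, apply_move, is_terminal,
                  move_weight, evaluate, switch_ply=5, max_plies=200,
                  rng=None):
    """Play one simulation and return the final state.

    Hypothetical interface (not from the paper):
      legal_moves(state)       -> list of moves
      apply_move(state, move)  -> successor state
      is_terminal(state)       -> bool
      move_weight(state, move) -> non-negative weight, standing in for
                                  a transition probability
      evaluate(state)          -> score for the player who just moved
    """
    rng = rng or random.Random()
    for ply in range(max_plies):
        if is_terminal(state):
            break
        moves = legal_moves(state)
        if not moves:
            break
        if ply < switch_ply:
            # First part: sample a move in proportion to its weight.
            weights = [move_weight(state, m) for m in moves]
            move = rng.choices(moves, weights=weights, k=1)[0]
        else:
            # Second part: pick the move with the best evaluation score.
            move = max(moves, key=lambda m: evaluate(apply_move(state, m)))
        state = apply_move(state, move)
    return state
```

Under these assumptions, setting `switch_ply=0` gives a purely evaluation-driven playout, while a very large `switch_ply` gives a purely sampled one.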

Original language	English
Title of host publication	Advances in Computer Games
Subtitle of host publication	12th International Conference, ACG 2009, Pamplona, Spain, May 11-13, 2009. Revised Papers
Editors	H. Jaap van den Herik, Pieter Spronck
Place of Publication	Berlin, Heidelberg
Publisher	Springer
Pages	33-44
Number of pages	12
ISBN (Print)	978-3-642-12993-3
DOIs	https://doi.org/10.1007/978-3-642-12993-3_4
Publication status	Published - 2010

Publication series

Series	Lecture Notes in Computer Science
Volume	6048

Cite this

@inbook{63f60e8a9cf4487c8220bf296a3d7c05,

title = "Evaluation Function Based Monte-Carlo LOA",

abstract = "Recently, Monte-Carlo Tree Search (MCTS) has advanced the field of computer Go substantially. Also in the game of Lines of Action (LOA), which has been dominated so far by $\alpha\beta$, MCTS is making an inroad. In this paper we investigate how to use a positional evaluation function in a Monte-Carlo simulation-based LOA program (MC-LOA). Four different simulation strategies are designed, called Evaluation Cut-Off, Corrective, Greedy, and Mixed. They use an evaluation function in several ways. Experimental results reveal that the Mixed strategy is the best among them. This strategy draws the moves randomly based on their transition probabilities in the first part of a simulation, but selects them based on their evaluation score in the second part of a simulation. Using this simulation strategy the MC-LOA program plays at the same level as the $\alpha\beta$ program MIA, the best LOA-playing entity in the world.",

author = "Winands, {Mark H. M.} and Yngvi Bjornsson",

year = "2010",

doi = "10.1007/978-3-642-12993-3_4",

language = "English",

isbn = "978-3-642-12993-3",

series = "Lecture Notes in Computer Science",

volume = "6048",

publisher = "Springer",

pages = "33--44",

editor = "{van den Herik}, {H. Jaap} and Pieter Spronck",

booktitle = "Advances in Computer Games",

address = "Berlin, Heidelberg",

}

Evaluation Function Based Monte-Carlo LOA. / Winands, Mark H. M.; Bjornsson, Yngvi.
Advances in Computer Games: 12th International Conference, ACG 2009, Pamplona, Spain, May 11-13, 2009. Revised Papers. ed. / H. Jaap van den Herik; Pieter Spronck. Berlin, Heidelberg: Springer, 2010. p. 33-44 (Lecture Notes in Computer Science, Vol. 6048).

TY - CHAP

T1 - Evaluation Function Based Monte-Carlo LOA

AU - Winands, Mark H. M.

AU - Bjornsson, Yngvi

PY - 2010

Y1 - 2010

N2 - Recently, Monte-Carlo Tree Search (MCTS) has advanced the field of computer Go substantially. Also in the game of Lines of Action (LOA), which has been dominated so far by αβ, MCTS is making an inroad. In this paper we investigate how to use a positional evaluation function in a Monte-Carlo simulation-based LOA program (MC-LOA). Four different simulation strategies are designed, called Evaluation Cut-Off, Corrective, Greedy, and Mixed. They use an evaluation function in several ways. Experimental results reveal that the Mixed strategy is the best among them. This strategy draws the moves randomly based on their transition probabilities in the first part of a simulation, but selects them based on their evaluation score in the second part of a simulation. Using this simulation strategy the MC-LOA program plays at the same level as the αβ program MIA, the best LOA-playing entity in the world.

AB - Recently, Monte-Carlo Tree Search (MCTS) has advanced the field of computer Go substantially. Also in the game of Lines of Action (LOA), which has been dominated so far by αβ, MCTS is making an inroad. In this paper we investigate how to use a positional evaluation function in a Monte-Carlo simulation-based LOA program (MC-LOA). Four different simulation strategies are designed, called Evaluation Cut-Off, Corrective, Greedy, and Mixed. They use an evaluation function in several ways. Experimental results reveal that the Mixed strategy is the best among them. This strategy draws the moves randomly based on their transition probabilities in the first part of a simulation, but selects them based on their evaluation score in the second part of a simulation. Using this simulation strategy the MC-LOA program plays at the same level as the αβ program MIA, the best LOA-playing entity in the world.

U2 - 10.1007/978-3-642-12993-3_4

DO - 10.1007/978-3-642-12993-3_4

M3 - Chapter

SN - 978-3-642-12993-3

T3 - Lecture Notes in Computer Science

SP - 33

EP - 44

BT - Advances in Computer Games

A2 - van den Herik, H. Jaap

A2 - Spronck, Pieter

PB - Springer

CY - Berlin, Heidelberg

ER -