On-Line Parameter Tuning for Monte-Carlo Tree Search in General Game Playing

Chiara F. Sironi*, Mark H. M. Winands

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingChapterAcademic

36 Downloads (Pure)

Abstract

Many enhancements have been proposed for Monte-Carlo Tree Search (MCTS). Some of them have been applied successfully in the context of General Game Playing (GGP). MCTS and its enhancements are usually controlled by multiple parameters that require extensive and time-consuming computation to be tuned in advance. Moreover, in GGP optimal parameter values may vary depending on the considered game. This paper proposes a method to automatically tune search-control parameters on-line for GGP. This method considers the tuning problem as a Combinatorial Multi-Armed Bandit (CMAB). Four strategies designed to deal with CMABs are evaluated for this particular problem. Experiments show that on-line tuning in GGP almost reaches the same performance as off-line tuning. It can be considered as a valid alternative for domains where off-line parameter tuning is costly or infeasible.
Original languageEnglish
Title of host publicationComputer Games
Subtitle of host publication6th Workshop, CGW 2017, Held in Conjunction with the 26th International Conference on Artificial Intelligence, IJCAI 2017, Melbourne, VIC, Australia, August, 20, 2017, Revised Selected Papers
EditorsTristan Cazenave, Mark H.M. Winands, Abdallah Saffidine
Place of PublicationCham
PublisherSpringer
Pages75-95
Number of pages21
ISBN (Print)978-3-319-75931-9
DOIs
Publication statusPublished - 2018

Publication series

SeriesCommunications in Computer and Information Science
Volume818
ISSN1865-0929

Cite this