Cross-Entropy for Monte-Carlo Tree Search

Guillaume M. J. B. Chaslot; Mark H. M. Winands; Istvan Szita; H. Jaap van den Herik

doi:10.3233/ICG-2008-31303

Cross-Entropy for Monte-Carlo Tree Search

Guillaume M. J. B. Chaslot^*, Mark H. M. Winands, Istvan Szita, H. Jaap van den Herik

^*Corresponding author for this work

Networks and Strategic Optimization

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

Recently, Monte-Carlo Tree Search (MCTS) has become a Popular approach for intelligent play in games. Amongst others, it is successfully used in most state-of-the-art Go programs. To improve the playing strength of these Go programs any further, many parameters dealing with MCTS Should be fine-tuned.In this paper, we propose to apply the Cross-Entropy Method (CEM) for this task. The method is comparable to Estimation-of-Distribution Algorithms (EDAs), a new area of evolutionary computation. We tested CEM by tuning various types of parameters in our Go program MANGO. The experiments were performed in matches against the open-source program GNU Go. They revealed that a program with the CEM tuned parameters played better than without. Moreover, MANGO plus CEM outperformed the regular MANGO for various time settings and board sizes. From the results we may conclude that parameter tuning by CEM genuinely improved the playing strength of MANGO, for various time settings. This result may be generalized to other game engines using MCTS.

Original language	English
Pages (from-to)	145-156
Journal	ICGA Journal
Volume	31
Issue number	3
DOIs	https://doi.org/10.3233/ICG-2008-31303
Publication status	Published - Sept 2008

Access to Document

10.3233/ICG-2008-31303

Cite this

@article{5c461561dde247fd81abb002ea59f9a3,

title = "Cross-Entropy for Monte-Carlo Tree Search",

abstract = "Recently, Monte-Carlo Tree Search (MCTS) has become a Popular approach for intelligent play in games. Amongst others, it is successfully used in most state-of-the-art Go programs. To improve the playing strength of these Go programs any further, many parameters dealing with MCTS Should be fine-tuned.In this paper, we propose to apply the Cross-Entropy Method (CEM) for this task. The method is comparable to Estimation-of-Distribution Algorithms (EDAs), a new area of evolutionary computation. We tested CEM by tuning various types of parameters in our Go program MANGO. The experiments were performed in matches against the open-source program GNU Go. They revealed that a program with the CEM tuned parameters played better than without. Moreover, MANGO plus CEM outperformed the regular MANGO for various time settings and board sizes. From the results we may conclude that parameter tuning by CEM genuinely improved the playing strength of MANGO, for various time settings. This result may be generalized to other game engines using MCTS.",

author = "Chaslot, {Guillaume M. J. B.} and Winands, {Mark H. M.} and Istvan Szita and {van den Herik}, {H. Jaap}",

year = "2008",

month = sep,

doi = "10.3233/ICG-2008-31303",

language = "English",

volume = "31",

pages = "145--156",

journal = "ICGA Journal",

issn = "1389-6911",

publisher = "IOS Press",

number = "3",

}

TY - JOUR

T1 - Cross-Entropy for Monte-Carlo Tree Search

AU - Chaslot, Guillaume M. J. B.

AU - Winands, Mark H. M.

AU - Szita, Istvan

AU - van den Herik, H. Jaap

PY - 2008/9

Y1 - 2008/9

N2 - Recently, Monte-Carlo Tree Search (MCTS) has become a Popular approach for intelligent play in games. Amongst others, it is successfully used in most state-of-the-art Go programs. To improve the playing strength of these Go programs any further, many parameters dealing with MCTS Should be fine-tuned.In this paper, we propose to apply the Cross-Entropy Method (CEM) for this task. The method is comparable to Estimation-of-Distribution Algorithms (EDAs), a new area of evolutionary computation. We tested CEM by tuning various types of parameters in our Go program MANGO. The experiments were performed in matches against the open-source program GNU Go. They revealed that a program with the CEM tuned parameters played better than without. Moreover, MANGO plus CEM outperformed the regular MANGO for various time settings and board sizes. From the results we may conclude that parameter tuning by CEM genuinely improved the playing strength of MANGO, for various time settings. This result may be generalized to other game engines using MCTS.

AB - Recently, Monte-Carlo Tree Search (MCTS) has become a Popular approach for intelligent play in games. Amongst others, it is successfully used in most state-of-the-art Go programs. To improve the playing strength of these Go programs any further, many parameters dealing with MCTS Should be fine-tuned.In this paper, we propose to apply the Cross-Entropy Method (CEM) for this task. The method is comparable to Estimation-of-Distribution Algorithms (EDAs), a new area of evolutionary computation. We tested CEM by tuning various types of parameters in our Go program MANGO. The experiments were performed in matches against the open-source program GNU Go. They revealed that a program with the CEM tuned parameters played better than without. Moreover, MANGO plus CEM outperformed the regular MANGO for various time settings and board sizes. From the results we may conclude that parameter tuning by CEM genuinely improved the playing strength of MANGO, for various time settings. This result may be generalized to other game engines using MCTS.

U2 - 10.3233/ICG-2008-31303

DO - 10.3233/ICG-2008-31303

M3 - Article

SN - 1389-6911

VL - 31

SP - 145

EP - 156

JO - ICGA Journal

JF - ICGA Journal

IS - 3

ER -