General Game Playing with Stochastic CSP

Eric Piette; Frédéric Koriche; Sylvain Lagrue; Sébastien Tabary

General Game Playing with Stochastic CSP

Eric Piette, Frédéric Koriche, Sylvain Lagrue, Sébastien Tabary

Research output: Contribution to journal › Article › Academic › peer-review

57 Downloads (Pure)

Abstract

The challenge of General Game Playing (GGP) is to devise artificial game playing algorithms that take as input the rules of any strategic game, described in the Game Description Language (GDL), and that effectively play without human intervention. The aim of this paper is to address the GGP challenge by casting GDL games (potentially with chance events) into the Stochastic Constraint Satisfaction Problem (SCSP). The stochastic constraint network of a game is decomposed into a sequence of μSCSPs (a.k.a. one-stage SCSP), each of them being associated to a game round. Winning strategies are searched by coupling the MAC algorithm, used to solve each μSCSP in turn, together with the UCB (Upper Confidence Bound) policy for approximating the values of those strategies obtained by the last μSCSP in the sequence. Extensive experiments conducted on various GDL games with different deliberation times per round, demonstrate that the MAC-UCB algorithm significantly outperforms the state-of-the-art UCT (upper confidence bounds for trees) algorithm.

Original language	English
Journal	Constraints
Publication status	Published - 1 Jan 2016
Externally published	Yes

Access to Document

constraints15

Cite this

@article{aae7857448fc4000870a2de506a14f18,

title = "General Game Playing with Stochastic CSP",

abstract = "The challenge of General Game Playing (GGP) is to devise artificial game playing algorithms that take as input the rules of any strategic game, described in the Game Description Language (GDL), and that effectively play without human intervention. The aim of this paper is to address the GGP challenge by casting GDL games (potentially with chance events) into the Stochastic Constraint Satisfaction Problem (SCSP). The stochastic constraint network of a game is decomposed into a sequence of μSCSPs (a.k.a. one-stage SCSP), each of them being associated to a game round. Winning strategies are searched by coupling the MAC algorithm, used to solve each μSCSP in turn, together with the UCB (Upper Confidence Bound) policy for approximating the values of those strategies obtained by the last μSCSP in the sequence. Extensive experiments conducted on various GDL games with different deliberation times per round, demonstrate that the MAC-UCB algorithm significantly outperforms the state-of-the-art UCT (upper confidence bounds for trees) algorithm.",

author = "Eric Piette and Fr{\'e}d{\'e}ric Koriche and Sylvain Lagrue and S{\'e}bastien Tabary",

year = "2016",

month = jan,

day = "1",

language = "English",

journal = "Constraints",

issn = "1383-7133",

publisher = "Springer",

}

TY - JOUR

T1 - General Game Playing with Stochastic CSP

AU - Piette, Eric

AU - Koriche, Frédéric

AU - Lagrue, Sylvain

AU - Tabary, Sébastien

PY - 2016/1/1

Y1 - 2016/1/1

N2 - The challenge of General Game Playing (GGP) is to devise artificial game playing algorithms that take as input the rules of any strategic game, described in the Game Description Language (GDL), and that effectively play without human intervention. The aim of this paper is to address the GGP challenge by casting GDL games (potentially with chance events) into the Stochastic Constraint Satisfaction Problem (SCSP). The stochastic constraint network of a game is decomposed into a sequence of μSCSPs (a.k.a. one-stage SCSP), each of them being associated to a game round. Winning strategies are searched by coupling the MAC algorithm, used to solve each μSCSP in turn, together with the UCB (Upper Confidence Bound) policy for approximating the values of those strategies obtained by the last μSCSP in the sequence. Extensive experiments conducted on various GDL games with different deliberation times per round, demonstrate that the MAC-UCB algorithm significantly outperforms the state-of-the-art UCT (upper confidence bounds for trees) algorithm.

AB - The challenge of General Game Playing (GGP) is to devise artificial game playing algorithms that take as input the rules of any strategic game, described in the Game Description Language (GDL), and that effectively play without human intervention. The aim of this paper is to address the GGP challenge by casting GDL games (potentially with chance events) into the Stochastic Constraint Satisfaction Problem (SCSP). The stochastic constraint network of a game is decomposed into a sequence of μSCSPs (a.k.a. one-stage SCSP), each of them being associated to a game round. Winning strategies are searched by coupling the MAC algorithm, used to solve each μSCSP in turn, together with the UCB (Upper Confidence Bound) policy for approximating the values of those strategies obtained by the last μSCSP in the sequence. Extensive experiments conducted on various GDL games with different deliberation times per round, demonstrate that the MAC-UCB algorithm significantly outperforms the state-of-the-art UCT (upper confidence bounds for trees) algorithm.

M3 - Article

SN - 1383-7133

JO - Constraints

JF - Constraints

ER -