TY - CHAP
T1 - Improving strategies in stochastic games
AU - Flesch, J.
AU - Thuijsman, F.
AU - Vrieze, O.J.
PY - 1998
Y1 - 1998
N2 - In a zero-sum limiting average stochastic game, we evaluate a strategy π for the maximizing player, player 1, by the reward φ_s(π) that π guarantees to him when starting in state s. A strategy π is called non-improving if φ_s(π) ⩾ φ_s(π[h]) for any state s and for any finite history h, where π[h] is the strategy π conditional on the history h; otherwise the strategy is called improving. We investigate the use of improving and non-improving strategies, and explore the relation between (non-)improvingness and (ε-)optimality. Improving strategies appear to play a very important role for obtaining ε-optimality, while 0-optimal strategies are always non-improving. Several examples are given to clarify all these issues.
AB - In a zero-sum limiting average stochastic game, we evaluate a strategy π for the maximizing player, player 1, by the reward φ_s(π) that π guarantees to him when starting in state s. A strategy π is called non-improving if φ_s(π) ⩾ φ_s(π[h]) for any state s and for any finite history h, where π[h] is the strategy π conditional on the history h; otherwise the strategy is called improving. We investigate the use of improving and non-improving strategies, and explore the relation between (non-)improvingness and (ε-)optimality. Improving strategies appear to play a very important role for obtaining ε-optimality, while 0-optimal strategies are always non-improving. Several examples are given to clarify all these issues.
U2 - 10.1109/CDC.1998.757857
DO - 10.1109/CDC.1998.757857
M3 - Chapter
SN - 0-7803-4394-8
T3 - Proceedings of the IEEE Conference on Decision and Control
SP - 2674
EP - 2679
BT - Proceedings of the 37th IEEE Conference on Decision and Control (Cat. No.98CH36171)
ER -