Average-discounted equilibria in stochastic games

J Flesch; F Thuijsman; OJ Vrieze

doi:10.1016/S0377-2217(97)00384-6

Average-discounted equilibria in stochastic games

J Flesch^*, F Thuijsman, OJ Vrieze

^*Corresponding author for this work

Networks and Strategic Optimization

Research output: Contribution to journal › Article › Academic › peer-review

66 Downloads (Pure)

Abstract

In stochastic games with finite state and action spaces, we examine existence of equilibria where player 1 uses the limiting average reward and player 2 a discounted reward for the evaluations of the respective payoff sequences. By the nature of these rewards, the far future determines player 1's reward, while player 2 is rather interested in the near future. This gives rise to a natural cooperation between the players along the course of the play. First we show the existence of stationary epsilon-equilibria, for all epsilon > 0, in these games. However, besides these stationary epsilon-equilibria, there also exist epsilon-equilibria, in terms of only slightly more complex ultimately stationary strategies, which are rather in the spirit of these games because, after a large stage when the discounted game is not interesting any longer, the players cooperate to guarantee the highest feasible reward to player I. Moreover, we analyze an interesting example demonstrating that 0-equilibria do not necessarily exist in these games, not even in terms of history dependent strategies. Finally, we examine special classes of stochastic games with specific conditions on the transition and payoff structures. Several examples are given to clarify all these issues.

Original language	English
Pages (from-to)	187-195
Journal	European Journal of Operational Research
Volume	112
Issue number	1
DOIs	https://doi.org/10.1016/S0377-2217(97)00384-6
Publication status	Published - 1 Jan 1999

Keywords

game theory
stochastic games
equilibria
discounted reward
limiting average reward

Access to Document

10.1016/S0377-2217(97)00384-6

Full TextFinal published version, 149 KBLicence: Taverne

Cite this

@article{f35e13acf47446e7a0f3d7c9a71aee4b,

title = "Average-discounted equilibria in stochastic games",

abstract = "In stochastic games with finite state and action spaces, we examine existence of equilibria where player 1 uses the limiting average reward and player 2 a discounted reward for the evaluations of the respective payoff sequences. By the nature of these rewards, the far future determines player 1's reward, while player 2 is rather interested in the near future. This gives rise to a natural cooperation between the players along the course of the play. First we show the existence of stationary epsilon-equilibria, for all epsilon > 0, in these games. However, besides these stationary epsilon-equilibria, there also exist epsilon-equilibria, in terms of only slightly more complex ultimately stationary strategies, which are rather in the spirit of these games because, after a large stage when the discounted game is not interesting any longer, the players cooperate to guarantee the highest feasible reward to player I. Moreover, we analyze an interesting example demonstrating that 0-equilibria do not necessarily exist in these games, not even in terms of history dependent strategies. Finally, we examine special classes of stochastic games with specific conditions on the transition and payoff structures. Several examples are given to clarify all these issues. ",

keywords = "game theory, stochastic games, equilibria, discounted reward, limiting average reward",

author = "J Flesch and F Thuijsman and OJ Vrieze",

year = "1999",

month = jan,

day = "1",

doi = "10.1016/S0377-2217(97)00384-6",

language = "English",

volume = "112",

pages = "187--195",

journal = "European Journal of Operational Research",

issn = "0377-2217",

publisher = "Elsevier",

number = "1",

}

TY - JOUR

T1 - Average-discounted equilibria in stochastic games

AU - Flesch, J

AU - Thuijsman, F

AU - Vrieze, OJ

PY - 1999/1/1

Y1 - 1999/1/1

N2 - In stochastic games with finite state and action spaces, we examine existence of equilibria where player 1 uses the limiting average reward and player 2 a discounted reward for the evaluations of the respective payoff sequences. By the nature of these rewards, the far future determines player 1's reward, while player 2 is rather interested in the near future. This gives rise to a natural cooperation between the players along the course of the play. First we show the existence of stationary epsilon-equilibria, for all epsilon > 0, in these games. However, besides these stationary epsilon-equilibria, there also exist epsilon-equilibria, in terms of only slightly more complex ultimately stationary strategies, which are rather in the spirit of these games because, after a large stage when the discounted game is not interesting any longer, the players cooperate to guarantee the highest feasible reward to player I. Moreover, we analyze an interesting example demonstrating that 0-equilibria do not necessarily exist in these games, not even in terms of history dependent strategies. Finally, we examine special classes of stochastic games with specific conditions on the transition and payoff structures. Several examples are given to clarify all these issues.

AB - In stochastic games with finite state and action spaces, we examine existence of equilibria where player 1 uses the limiting average reward and player 2 a discounted reward for the evaluations of the respective payoff sequences. By the nature of these rewards, the far future determines player 1's reward, while player 2 is rather interested in the near future. This gives rise to a natural cooperation between the players along the course of the play. First we show the existence of stationary epsilon-equilibria, for all epsilon > 0, in these games. However, besides these stationary epsilon-equilibria, there also exist epsilon-equilibria, in terms of only slightly more complex ultimately stationary strategies, which are rather in the spirit of these games because, after a large stage when the discounted game is not interesting any longer, the players cooperate to guarantee the highest feasible reward to player I. Moreover, we analyze an interesting example demonstrating that 0-equilibria do not necessarily exist in these games, not even in terms of history dependent strategies. Finally, we examine special classes of stochastic games with specific conditions on the transition and payoff structures. Several examples are given to clarify all these issues.

KW - game theory

KW - stochastic games

KW - equilibria

KW - discounted reward

KW - limiting average reward

U2 - 10.1016/S0377-2217(97)00384-6

DO - 10.1016/S0377-2217(97)00384-6

M3 - Article

SN - 0377-2217

VL - 112

SP - 187

EP - 195

JO - European Journal of Operational Research

JF - European Journal of Operational Research

IS - 1

ER -