Reachability and Safety Objectives in Markov Decision Processes on Long but Finite Horizons

Galit Ashkenazi-Golan; Janos Flesch; Arkadi Predtetchinski; Eilon Solan

doi:10.1007/s10957-020-01681-2

Galit Ashkenazi-Golan, Janos Flesch^*, Arkadi Predtetchinski, Eilon Solan

^*Corresponding author for this work

Research output: Contribution to journal › Article › Academic › peer-review

Abstract

We consider discrete-time Markov decision processes in which the decision maker is interested in long but finite horizons. First, we consider the reachability objective: the decision maker's goal is to reach a specific target state with the highest possible probability. A strategy is said to overtake another strategy if it gives a strictly higher probability of reaching the target state on all sufficiently large but finite horizons. We prove that there exists a pure stationary strategy that is not overtaken by any pure strategy or by any stationary strategy, under a condition on the transition structure and under genericity, respectively. A strategy that is not overtaken by any other strategy, called an overtaking optimal strategy, does not always exist; we provide sufficient conditions for its existence. Next, we consider the safety objective: the decision maker's goal is to avoid a specific state with the highest possible probability. We argue that the results proven for the reachability objective extend to this model.
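
The overtaking relation defined in the abstract can be illustrated numerically. The sketch below is not taken from the paper: the three-state example (state 0 = start, 1 = target, 2 = trap), the two transition matrices, and the helper names `reach_probabilities` and `first_horizon_ahead` are all assumptions made for illustration. Each matrix is the Markov chain induced by one hypothetical stationary strategy. A finite check like this can only suggest overtaking, since the definition quantifies over all sufficiently large horizons.

```python
def reach_probabilities(P, start, target, horizon):
    """Return [p_0, ..., p_horizon], where p_t is the probability that the
    chain with transition matrix P, started at `start`, has visited `target`
    within t steps."""
    n = len(P)
    # Make the target absorbing so that "being there at time t" equals
    # "having reached it by time t".
    Q = [row[:] for row in P]
    Q[target] = [1.0 if j == target else 0.0 for j in range(n)]
    dist = [1.0 if i == start else 0.0 for i in range(n)]
    probs = [dist[target]]
    for _ in range(horizon):
        dist = [sum(dist[i] * Q[i][j] for i in range(n)) for j in range(n)]
        probs.append(dist[target])
    return probs


def first_horizon_ahead(p_a, p_b):
    """Smallest t such that p_a[s] > p_b[s] for every checked horizon s >= t,
    or None if p_a is not strictly ahead at the last checked horizon."""
    t0 = None
    for t in range(len(p_a) - 1, -1, -1):
        if p_a[t] > p_b[t]:
            t0 = t
        else:
            break
    return t0


# Hypothetical strategy A: reach the target w.p. 0.3 per step, else stay put.
P_A = [[0.7, 0.3, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
# Hypothetical strategy B: gamble once, target w.p. 0.5, trap w.p. 0.5.
P_B = [[0.0, 0.5, 0.5], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

p_a = reach_probabilities(P_A, start=0, target=1, horizon=20)
p_b = reach_probabilities(P_B, start=0, target=1, horizon=20)
print(first_horizon_ahead(p_a, p_b))
```

In this toy example B is better on the one-step horizon (0.5 versus 0.3), but A's probability is 1 - 0.7^t, which exceeds 0.5 from horizon 2 onward, so A overtakes B in the sense of the abstract.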

Original language	English
Pages (from-to)	945-965
Number of pages	21
Journal	Journal of Optimization Theory and Applications
Volume	185
Issue number	3
Early online date	18 May 2020
DOIs	https://doi.org/10.1007/s10957-020-01681-2
Publication status	Published - Jun 2020

Keywords

Markov decision process
Reachability objective
Safety objective
Overtaking optimality
Perron-Frobenius eigenvalue
OPTIMALITY
OVERTAKING

Access to Document

10.1007/s10957-020-01681-2
Licence: CC BY

Cite this

@article{f3a9e7afc30d4d318577cb748e5eb750,

title = "Reachability and Safety Objectives in Markov Decision Processes on Long but Finite Horizons",

abstract = "We consider discrete-time Markov decision processes in which the decision maker is interested in long but finite horizons. First we consider reachability objective: the decision maker's goal is to reach a specific target state with the highest possible probability. A strategy is said to overtake another strategy, if it gives a strictly higher probability of reaching the target state on all sufficiently large but finite horizons. We prove that there exists a pure stationary strategy that is not overtaken by any pure strategy nor by any stationary strategy, under some condition on the transition structure and respectively under genericity. A strategy that is not overtaken by any other strategy, called an overtaking optimal strategy, does not always exist. We provide sufficient conditions for its existence. Next we consider safety objective: the decision maker's goal is to avoid a specific state with the highest possible probability. We argue that the results proven for reachability objective extend to this model.",

keywords = "Markov decision process, Reachability objective, Safety objective, Overtaking optimality, Perron-Frobenius eigenvalue, OPTIMALITY, OVERTAKING",

author = "Galit Ashkenazi-Golan and Janos Flesch and Arkadi Predtetchinski and Eilon Solan",

note = "data source: no data used",

year = "2020",

month = jun,

doi = "10.1007/s10957-020-01681-2",

language = "English",

volume = "185",

pages = "945--965",

journal = "Journal of Optimization Theory and Applications",

issn = "0022-3239",

publisher = "Springer Verlag",

number = "3",

}

TY - JOUR

T1 - Reachability and Safety Objectives in Markov Decision Processes on Long but Finite Horizons

AU - Ashkenazi-Golan, Galit

AU - Flesch, Janos

AU - Predtetchinski, Arkadi

AU - Solan, Eilon

N1 - data source: no data used

PY - 2020/6

Y1 - 2020/6

N2 - We consider discrete-time Markov decision processes in which the decision maker is interested in long but finite horizons. First we consider reachability objective: the decision maker's goal is to reach a specific target state with the highest possible probability. A strategy is said to overtake another strategy, if it gives a strictly higher probability of reaching the target state on all sufficiently large but finite horizons. We prove that there exists a pure stationary strategy that is not overtaken by any pure strategy nor by any stationary strategy, under some condition on the transition structure and respectively under genericity. A strategy that is not overtaken by any other strategy, called an overtaking optimal strategy, does not always exist. We provide sufficient conditions for its existence. Next we consider safety objective: the decision maker's goal is to avoid a specific state with the highest possible probability. We argue that the results proven for reachability objective extend to this model.

AB - We consider discrete-time Markov decision processes in which the decision maker is interested in long but finite horizons. First we consider reachability objective: the decision maker's goal is to reach a specific target state with the highest possible probability. A strategy is said to overtake another strategy, if it gives a strictly higher probability of reaching the target state on all sufficiently large but finite horizons. We prove that there exists a pure stationary strategy that is not overtaken by any pure strategy nor by any stationary strategy, under some condition on the transition structure and respectively under genericity. A strategy that is not overtaken by any other strategy, called an overtaking optimal strategy, does not always exist. We provide sufficient conditions for its existence. Next we consider safety objective: the decision maker's goal is to avoid a specific state with the highest possible probability. We argue that the results proven for reachability objective extend to this model.

KW - Markov decision process

KW - Reachability objective

KW - Safety objective

KW - Overtaking optimality

KW - Perron-Frobenius eigenvalue

KW - OPTIMALITY

KW - OVERTAKING

U2 - 10.1007/s10957-020-01681-2

DO - 10.1007/s10957-020-01681-2

M3 - Article

SN - 0022-3239

VL - 185

SP - 945

EP - 965

JO - Journal of Optimization Theory and Applications

JF - Journal of Optimization Theory and Applications

IS - 3

ER -