Achieving target state-action frequencies in multichain average-reward Markov decision processes

D Krass*, OJ Vrieze

*Corresponding author for this work

    Research output: Contribution to journalArticleAcademicpeer-review

    Original languageEnglish
    Pages (from-to)545-566
    Number of pages22
    JournalMathematics of Operations Research
    Volume27
    Issue number3
    DOIs
    Publication statusPublished - Aug 2002

    Keywords

    • Markov decision processes
    • average reward criterion
    • state-action frequencies
    • constrained Markov decision processes
    • Markov decision processes with nonstandard reward criteria
    • CHAINS
    • CONSTRAINTS
    • POLICIES

    Cite this