On theory and algorithms for Markov decision problems with the total reward criterion

J.A.E.E. van Nunen, J. Wessels

    Research output: Contribution to journalArticleAcademicpeer-review

    3 Citations (Scopus)

    Abstract

    The first part of this survey paper is devoted to derive under rather weak conditions, which don't guarantee contraction, a number of important existency and convergency results in Markov decision theory. In the second part of the paper conditions that guarantee that the contraction mapping approach can be used are analysed. These conditions are rather weak and allow for unbounded rewards. The generation of successive approximation methods for solving Markov decision processes by using action depending stopping times is described at the end of the paper.
    Original languageEnglish
    Pages (from-to)57-67
    Number of pages11
    JournalOR Spektrum
    Volume1
    Issue number1
    DOIs
    Publication statusPublished - 1979

    Fingerprint Dive into the research topics of 'On theory and algorithms for Markov decision problems with the total reward criterion'. Together they form a unique fingerprint.

    Cite this