On theory and algorithms for Markov decision problems with the total reward criterion

J.A.E.E. van Nunen, J. Wessels

Research output: Contribution to journal, Article, Academic, peer-reviewed

3 Citations (Scopus)

Abstract

The first part of this survey paper derives a number of important existence and convergence results in Markov decision theory under rather weak conditions that do not guarantee contraction. The second part analyses conditions that guarantee that the contraction mapping approach can be used. These conditions are rather weak and allow for unbounded rewards. The end of the paper describes how successive approximation methods for solving Markov decision processes can be generated by using action-dependent stopping times.
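
As a rough illustration of what a successive approximation method looks like, the sketch below iterates the total-reward optimality equation v(i) = max_a [ r(i, a) + sum_j p(i, j, a) v(j) ] on a small invented example. It is not taken from the paper and does not reproduce its action-dependent stopping times: it shows only the standard scheme and a Gauss-Seidel variant. The function name, the arrays `P` and `r`, and the parameters `tol` and `max_iter` are all hypothetical. The example also assumes substochastic transition matrices with row sums below 1, which makes the operator a contraction, a much stronger assumption than the weak conditions the abstract refers to.

```python
import numpy as np


def successive_approximation(P, r, tol=1e-10, max_iter=10_000, gauss_seidel=False):
    """Iterate v <- max_a [r(., a) + P_a v] until successive iterates differ by < tol.

    P: array of shape (A, S, S), P[a, i, j] = probability of moving i -> j under a.
       Rows may sum to less than 1 (the process can stop), an assumption made here
       so that total expected rewards stay finite.
    r: array of shape (S, A), r[i, a] = one-step reward in state i under action a.
    """
    n_actions, n_states, _ = P.shape
    v = np.zeros(n_states)
    for _ in range(max_iter):
        v_old = v.copy()
        if gauss_seidel:
            # Update states one at a time, reusing values already updated in this sweep.
            for i in range(n_states):
                v[i] = max(r[i, a] + P[a, i] @ v for a in range(n_actions))
        else:
            # Standard scheme: every state is updated from the previous iterate.
            v = np.max(r + np.einsum("aij,j->ia", P, v_old), axis=1)
        if np.max(np.abs(v - v_old)) < tol:
            break
    # Greedy policy with respect to the final value estimate.
    policy = np.argmax(r + np.einsum("aij,j->ia", P, v), axis=1)
    return v, policy


# Invented two-state, two-action example; every row sum is at most 0.8.
P = np.array([
    [[0.5, 0.3], [0.2, 0.4]],  # action 0
    [[0.1, 0.6], [0.0, 0.5]],  # action 1
])
r = np.array([
    [1.0, 2.0],  # state 0: reward under action 0, action 1
    [0.5, 1.5],  # state 1
])

v_std, policy_std = successive_approximation(P, r)
v_gs, policy_gs = successive_approximation(P, r, gauss_seidel=True)
print(v_std, policy_std)
print(v_gs, policy_gs)
```

Both variants should approach the same fixed point of the optimality equation on this example; they differ only in how much already-updated information each sweep uses.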
Original language: English
Pages (from-to): 57-67
Number of pages: 11
Journal: OR Spektrum
Volume: 1
Issue number: 1
Status: Published - 1979
