In this paper we consider some problems and results in the area of Markov decision processes with an incompletely known transition law. We concentrate here on the average return under the Bayes criterion. We discuss easy-to-handle strategies which are optimal under some conditions. For detailed proofs we refer to a monograph published by the present author.
|Place of Publication||Eindhoven|
|Publisher||Technische Hogeschool Eindhoven|
|Number of pages||13|
|Publication status||Published - 1979|