The method of successive approximations for the discounted Markov game

J. van der Wal

Research output: Book/Report › Report › Academic

Abstract

This paper presents a number of successive approximation algorithms for the repeated two-person zero-sum game known as the Markov game, using the criterion of total expected discounted rewards. As Wessels [12] did for Markov decision processes, stopping times are introduced in order to simplify the proofs. It is shown that each algorithm provides upper and lower bounds for the value of the game and nearly optimal stationary strategies for both players.
Original language: English
Place of publication: Eindhoven
Publisher: Technische Hogeschool Eindhoven
Number of pages: 14
Publication status: Published - 1975

Publication series

Name: Memorandum COSOR
Volume: 7502
ISSN (print): 0926-4493
