The method of successive approximations for the discounted Markov game

J. Wal, van der

Research output: Book/ReportReportAcademic

54 Downloads (Pure)

Abstract

This paper presents a number of successive approximation algorithms for the repeated two-person zero-sum game called Markov game using the criterion of total expected discounted rewards. As Wessels [12] did for Markov decision processes stopping times are introduced in order to simplify the proofs. It is shown that each algorithm provides upper and lower bounds for the value of the game and nearly optimal stationary strategies for both players.
Original languageEnglish
Place of PublicationEindhoven
PublisherTechnische Hogeschool Eindhoven
Number of pages14
Publication statusPublished - 1975

Publication series

NameMemorandum COSOR
Volume7502
ISSN (Print)0926-4493

Fingerprint

Dive into the research topics of 'The method of successive approximations for the discounted Markov game'. Together they form a unique fingerprint.

Cite this