Discounted Markov games : successive approximation and stopping times

J. Wal, van der

Research output: Contribution to journalArticleAcademicpeer-review

16 Citations (Scopus)
1 Downloads (Pure)

Abstract

This paper presents a number of successive approximation algorithms for the repeated two-person zero-sum game called Markov game using the criterion of total expected discounted rewards. AsWessels [1977] did for Markov decision processes stopping times are introduced in order to simplify the proofs. It is shown that each algorithm provides upper and lower bounds for the value of the game and nearly optimal stationary strategies for both players.
Original languageEnglish
Pages (from-to)11-22
Number of pages12
JournalInternational Journal of Game Theory
Volume6
Issue number1
DOIs
Publication statusPublished - 1977

Fingerprint

Dive into the research topics of 'Discounted Markov games : successive approximation and stopping times'. Together they form a unique fingerprint.

Cite this