Doorgaan naar hoofdnavigatie Doorgaan naar zoeken Ga verder naar hoofdinhoud

On the relation between optimality and saddle-conservation in Markov games

  • L.P.J. Groenewegen
  • , J. Wessels

Onderzoeksoutput: Boek/rapportRapportAcademic

65 Downloads (Pure)

Samenvatting

In this paper it will be investigated how the concept of value-conserving strategies can be generalized from Markov decision processes to Markov games. It will be proved that optimal Markov strategies are necessarily saddle conserving, which is the most straightforward generalization. Another generalization (called saddling) is shown to constitute a sufficient condition for optimality under relatively strong assumptions for the convergence of total expected rewards. Counterexamples show that saddle conserving is not sufficient for optimality (even under these strong convergence assumptions) and saddling is proved to be not necessary.
Originele taal-2Engels
Plaats van productieEindhoven
UitgeverijTechnische Hogeschool Eindhoven
Aantal pagina's13
StatusGepubliceerd - 1976

Publicatie series

NaamMemorandum COSOR
Volume7614
ISSN van geprinte versie0926-4493

Vingerafdruk

Duik in de onderzoeksthema's van 'On the relation between optimality and saddle-conservation in Markov games'. Samen vormen ze een unieke vingerafdruk.

Citeer dit