Geometric convergence in average reward Markov decision processes

W.H.M. Zijm

Research output: Book/ReportReportAcademic

96 Downloads (Pure)

Abstract

Recently, Federgruen and Schweitzer [3] proved that in undiscounted Markov decision problems the value iteration method for finding maximal gain policies converges geometrically fast, whenever convergence occurs. This result was obtained without any restriction on either the periodicity or chain structure of the problem. In this paper we establish the same result once again; the proof however, seems essentially simpler and, moreover, yields an upperbound for the convergence rate.
Original languageEnglish
Place of PublicationEindhoven
PublisherTechnische Hogeschool Eindhoven
Number of pages9
Publication statusPublished - 1980

Publication series

NameMemorandum COSOR
Volume8008
ISSN (Print)0926-4493

Fingerprint

Dive into the research topics of 'Geometric convergence in average reward Markov decision processes'. Together they form a unique fingerprint.

Cite this