A policy improvement-value approximation algorithm for the ergodic average reward Markov decision process

J. Wal, van der

Research output: Book/ReportReportAcademic

51 Downloads (Pure)

Fingerprint

Dive into the research topics of 'A policy improvement-value approximation algorithm for the ergodic average reward Markov decision process'. Together they form a unique fingerprint.

Mathematics

Engineering