A stopping time-based policy iteration algorithm for Markov decision processes with discountfactor tending to 1

J. Wal, van der

    Research output: Book/ReportReportAcademic

    27 Downloads (Pure)

    Fingerprint

    Dive into the research topics of 'A stopping time-based policy iteration algorithm for Markov decision processes with discountfactor tending to 1'. Together they form a unique fingerprint.

    Engineering & Materials Science