A policy improvement-value approximation algorithm for the ergodic average reward Markov decision process

J. van der Wal

Research output: Book/Report › Report › Academic


Fingerprint


Mathematics

Engineering