Foresighted policy gradient reinforcement learning: solving large-scale dilemmas with rational altruistic punishment

P.J. t Hoen, S.M. Bohté, J.A. Poutré, La

Research output: Book/ReportReportPopular

Original languageEnglish
Place of PublicationAmsterdam
PublisherCentrum voor Wiskunde en Informatica
Publication statusPublished - 2008

Publication series

NameCWI report. SEN-R : software engineering
Volume0804
ISSN (Print)1386-369X

Cite this