Neurogenetic programming framework for explainable reinforcement learning

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

2 Citations (Scopus)

Abstract

Automatic programming, the task of generating computer programs compliant with a specification without a human developer, is usually tackled either via genetic programming methods based on mutation and recombination of programs, or via neural language models. We propose a novel method that combines both approaches using a concept of a virtual neuro-genetic programmer, or scrum team. We demonstrate its ability to provide performant and explainable solutions for various OpenAI Gym tasks, as well as inject expert knowledge into the otherwise data-driven search for solutions.

Original languageEnglish
Title of host publicationGECCO 2021 Companion - Proceedings of the 2021 Genetic and Evolutionary Computation Conference Companion
PublisherAssociation for Computing Machinery, Inc
Pages329-330
Number of pages2
ISBN (Electronic)9781450383516
DOIs
Publication statusPublished - 7 Jul 2021
Event2021 Genetic and Evolutionary Computation Conference, GECCO 2021 - Virtual/Online, Lille, France
Duration: 10 Jul 202114 Jul 2021
https://gecco-2021.sigevo.org/HomePage

Conference

Conference2021 Genetic and Evolutionary Computation Conference, GECCO 2021
Abbreviated titleGECCO 2021
Country/TerritoryFrance
CityLille
Period10/07/2114/07/21
Internet address

Bibliographical note

Funding Information:
This work was funded by the European Union’s Horizon 2020 research and innovation programme under grant agreement 812882. This work is part of "Personal Health Interfaces Leveraging HUman-MAchine Natural interactionS" (PhilHumans) project

Publisher Copyright:
© 2021 Owner/Author.

Keywords

  • genetic programming
  • program synthesis
  • reinforcement learning

Fingerprint

Dive into the research topics of 'Neurogenetic programming framework for explainable reinforcement learning'. Together they form a unique fingerprint.

Cite this