Adversarial reinforcement learning in a cyber security simulation

Richard Elderman, Leon J.J. Pater, Albert S. Thie, Madalina M. Drugan, Marco A. Wiering

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

57 Citations (Scopus)
408 Downloads (Pure)

Abstract

This paper focuses on cyber-security simulations in networks modeled as a Markov game with incomplete information and stochastic elements. The resulting game is an adversarial sequential decision making problem played with two agents, the attacker and defender. The two agents pit one reinforcement learning technique, like neural networks, Monte Carlo learning and Q-learning, against each other and examine their effectiveness against learning opponents. The results showed that Monte Carlo learning with the Softmax exploration strategy is most effective in performing the defender role and also for learning attacking strategies.

Original languageEnglish
Title of host publicationICAART 2017 - Proceedings of the 9th International Conference on Agents and Artificial Intelligence
EditorsJoaquim Filipe, Jaap van den Herik, Ana Paula Rocha, Joaquim Filipe
Place of Publications.l.
PublisherSciTePress Digital Library
Pages559-566
Number of pages8
ISBN (Electronic)9789897582202
Publication statusPublished - 1 Jan 2017
Event9th International Conference on Agents and Artificial Intelligence (ICAART 2017) - Porto, Portugal
Duration: 24 Feb 201726 Feb 2017
Conference number: 9

Conference

Conference9th International Conference on Agents and Artificial Intelligence (ICAART 2017)
Abbreviated titleICAART 2017
Country/TerritoryPortugal
CityPorto
Period24/02/1726/02/17

Keywords

  • Adversarial setting
  • Cyber security in networks
  • Markov games
  • Reinforcement learning

Fingerprint

Dive into the research topics of 'Adversarial reinforcement learning in a cyber security simulation'. Together they form a unique fingerprint.

Cite this