Abstract
This paper focuses on cyber-security simulations in networks modeled as a Markov game with incomplete information and stochastic elements. The resulting game is an adversarial sequential decision making problem played with two agents, the attacker and defender. The two agents pit one reinforcement learning technique, like neural networks, Monte Carlo learning and Q-learning, against each other and examine their effectiveness against learning opponents. The results showed that Monte Carlo learning with the Softmax exploration strategy is most effective in performing the defender role and also for learning attacking strategies.
Original language | English |
---|---|
Title of host publication | ICAART 2017 - Proceedings of the 9th International Conference on Agents and Artificial Intelligence |
Editors | Joaquim Filipe, Jaap van den Herik, Ana Paula Rocha, Joaquim Filipe |
Place of Publication | s.l. |
Publisher | SciTePress Digital Library |
Pages | 559-566 |
Number of pages | 8 |
ISBN (Electronic) | 9789897582202 |
Publication status | Published - 1 Jan 2017 |
Event | 9th International Conference on Agents and Artificial Intelligence (ICAART 2017) - Porto, Portugal Duration: 24 Feb 2017 → 26 Feb 2017 Conference number: 9 |
Conference
Conference | 9th International Conference on Agents and Artificial Intelligence (ICAART 2017) |
---|---|
Abbreviated title | ICAART 2017 |
Country/Territory | Portugal |
City | Porto |
Period | 24/02/17 → 26/02/17 |
Keywords
- Adversarial setting
- Cyber security in networks
- Markov games
- Reinforcement learning