Abstract
Multi-agent reinforcement learning, especially learning in unknown complex environments, requires new algorithms. In this work, our focus is on adopting the concept of the quantum entanglement phenomena to the action selection procedure of multi-agent Q-learning, aiming to enhance the learning speed, collision avoidance, and also providing full coverage of the environment. The exploration procedure is exclusively induced by a memory-based probabilistic sequential action selection method acting as a knowledge hub, shared among the agents, which is the central pillar of this work. This causes enhancing the parallelism of the learning process, plus, building an effective yet simple communicating-bridge between the learning agents; that is, they can signal and guide one another through sharing their gained experience and knowledge in order not to repeat the same mistake that the other agents have already run into. The simulation results demonstrated the effectiveness of our proposed algorithm in terms of reducing the learning time, significant reduction of collision occurrence, and providing full coverage of big complex clutter environments.
Original language | English |
---|---|
Title of host publication | 2020 10th International Conference on Computer and Knowledge Engineering, ICCKE 2020 |
Publisher | Institute of Electrical and Electronics Engineers |
Pages | 617-622 |
Number of pages | 6 |
ISBN (Electronic) | 978-1-7281-8566-8 |
ISBN (Print) | 978-1-7281-8567-5 |
DOIs | |
Publication status | Published - 31 Dec 2020 |
Externally published | Yes |
Event | 10th International Conference on Computer and Knowledge Engineering, ICCKE 2020 - Mashhad, Iran, Islamic Republic of Duration: 29 Oct 2020 → 30 Oct 2020 Conference number: 10 |
Conference
Conference | 10th International Conference on Computer and Knowledge Engineering, ICCKE 2020 |
---|---|
Abbreviated title | ICCKE 2020 |
Country/Territory | Iran, Islamic Republic of |
City | Mashhad |
Period | 29/10/20 → 30/10/20 |
Keywords
- Quantum entanglement
- Quantum computing
- Synthetic aperture sonar
- Prediction algorithms
- Learning systems
- Games
- Convergence
- Q-learning
- entanglement phenomena
- Reinforcement learning
- probabilistic and sequential action selection