Abstract
Numerical optimization has been investigated for decades to solve complex problems in wireless communication systems. This has resulted in many effective methods, e.g., the weighted minimum mean square error (WMMSE) algorithm. However, these methods often incur a high computational cost, making their application to time-constrained problems difficult. Recently data-driven methods have attracted a lot of attention due to their near-optimal performance with affordable computational cost. Deep reinforcement learning (DRL) is one of the most promising optimization methods for future wireless communication systems. In this paper, we investigate the DRL method, using a deep Q-network (DQN), to allocate the downlink transmission power in cell-free (CF) mmWave massive multiple-input multiple-output (MIMO) systems. We consider the sum spectral efficiency (SE) optimization for systems with mobile user equipment (UEs). The DQN is trained by the rewards of trial-and-error interactions with the environment over time. It takes as input the long-term fading information and it outputs the downlink transmission power values. The numerical results, obtained for a particular 3GPP scenario, show that DQN outperforms WMMSE in terms of sum-SE and has a much lower computational complexity.
Original language | English |
---|---|
Title of host publication | Proceedings of the 18th International Conference on Wireless Networks and Mobile Systems, WINSYS 2021 |
Editors | Joel Rodrigues, Jaime Lloret Mauri |
Publisher | SciTePress Digital Library |
Pages | 33-45 |
Number of pages | 13 |
ISBN (Electronic) | 978-989-758-529-6 |
DOIs | |
Publication status | Published - 2021 |
Event | 18th International Conference on Wireless Networks and Mobile Systems, WINSYS 2021 - Virtual, Online Duration: 7 Jul 2021 → 9 Jul 2021 |
Conference
Conference | 18th International Conference on Wireless Networks and Mobile Systems, WINSYS 2021 |
---|---|
City | Virtual, Online |
Period | 7/07/21 → 9/07/21 |
Keywords
- Cell-free Massive MIMO
- Deep Q-Network
- Deep Reinforcement Learning
- Power Allocation