Abstract
In this paper, the problem of maximizing the sum rate of mobile users in a multi-base station (BS) cooperative millimeter-wave (mmWave) multicast communication system is studied. In the considered model, due to the real-time mobility of users, the users being served by a given BS and beamforming of BSs and users are dynamic. Multiple BSs must cooperate to serve dynamic requests of multiple mobile users. This problem is posed as an optimization framework whose goal is to maximize the sum rate of all mobile users by jointly optimizing the number of users served by all BSs and beamforming matrices of both BSs and users. To solve this non-convex optimization problem, we first introduce a value decomposition based reinforcement learning (VD- RL) algorithm to determine the users to be served by each BS. Then, we use the block diagonalization method to obtain the fully digital transmit beamforming matrices of all BSs as well as the receive beamforming matrices of the users. Finally, a fast optimization algorithm is used to optimize the hybrid beamforming matrices of both BSs and users. Simulation results show that, the proposed algorithm can achieve up to 51 % gain in terms of the sum rate of all mobile users compared to baseline multi-agent algorithms.