Profit Maximization for Multi-Time-Scale Hierarchical DRL-Based Joint Optimization in MEC-Enabled Air-Ground Integrated Networks

Jianbo Du; Jiao Xu; Aijing Sun; Jiawen Kang; Ye Hu; F. Richard Yu; Victor. C. M. Leung

doi:10.1109/TCOMM.2024.3454702

Back

Profit Maximization for Multi-Time-Scale Hierarchical DRL-Based Joint Optimization in MEC-Enabled Air-Ground Integrated Networks

Journal article

Peer reviewed

Profit Maximization for Multi-Time-Scale Hierarchical DRL-Based Joint Optimization in MEC-Enabled Air-Ground Integrated Networks

Jianbo Du, Jiao Xu, Aijing Sun, Jiawen Kang, Ye Hu, F. Richard Yu and Victor. C. M. Leung

IEEE transactions on communications, Vol.73(3), pp.1591-1606

2025-03-01

DOI: https://doi.org/10.1109/TCOMM.2024.3454702

Appears in College Of Engineering - Latest Publications

Abstract

Engineering

Engineering, Electrical & Electronic

Science & Technology

Technology

Telecommunications

In this paper, we address the problem of the operator's economic profit maximization in a multi-access edge computing (MEC)-enabled time division multiple access (TDMA)-based air-ground integrated networking (AGIN) network. We consider to optimize task placement and replacement, unmanned aerial vehicle (UAV) placement, UAV flight time, access control, and task offloading ratios in user devices (UDs) and the UAV. The optimization is constrained by storage capacity, task processing quality of service (QoS) requirements, and TDMA requirements, etc. Our optimization is conducted in two time scales. Task placement and replacement are performed in a coarse-grained time scale (frame), while other optimizations are conducted in a fine-grained time scale (time slot). Due to the high dynamics of the environment, finding a solution is challenging. To address this problem, we present a hierarchical deep reinforcement learning (DRL) algorithm. The high-level component is a deep Q network (DQN) agent responsible for obtaining task placement and replacement solutions within a frame. The low-level component is an improved deep deterministic policy gradient (IDDPG) agent, which is used to address task processing-related issues within a time slot. Our simulations illustrate that the proposed algorithm has good performance in economic profit maximization compared with other algorithms.

Metrics

1 Record Views

Details

Title: Profit Maximization for Multi-Time-Scale Hierarchical DRL-Based Joint Optimization in MEC-Enabled Air-Ground Integrated Networks
Creators: Jianbo Du - Xi’an University of Posts and Telecommunications
Jiao Xu - Xi’an University of Posts and Telecommunications
Aijing Sun - Xi’an University of Posts and Telecommunications
Jiawen Kang - Guangdong University of Technology
Ye Hu - University of Miami
F. Richard Yu - Carleton University
Victor. C. M. Leung - Shenzhen University
Publication Details: IEEE transactions on communications, Vol.73(3), pp.1591-1606
Publisher: IEEE
Number of pages: 16
Grant note: 21JC032 / Serving Local Special Scientific Research Project of Education Department of Shaanxi Province 62271391; 62471388 / Natural Science Foundation of China; National Natural Science Foundation of China (NSFC)
Academic Unit: CoE - Industrial Engineering; College of Engineering
Language: English
Resource Type: Journal article
Record Identifier: 991032795886402976