Skip to main navigation Skip to search Skip to main content

Deep Deterministic Policy Gradient-Based Edge Caching: An Inherent Performance Tradeoff

Research output: Chapter in Book/Report/Conference proceedingConference contributionAcademicpeer-review

Abstract

In this paper, edge caching is investigated subject to time-varying content popularity where no systematic distri-bution of content popularity can be known in advance. For achieving efficient caching updates sequentially, two optimization problems of maximizing the long-term accumulated-cache-hit-ratio (ACHR) and minimizing the long-term average-content-provision-cost (ACPC) are formulated. In order to solve these two problems, a deep deterministic policy gradient (DDPG)-based caching algorithm is proposed, which is capable of pro-cessing large-scale and continuous action space and adjusting the caching strategies based on the historical observations of users' requests. To evaluate the performance of the proposed DDPG-based caching algorithm, a real-world data set from MovieLens is adopted. Simulation results demonstrate that sig-nificant performance gains in terms of both ACHR and ACPC are achieved by the proposed algorithm over existing caching strategies. Furthermore, an inherent performance tradeoff exists between the ACHR and the ACPC, and the balance between these performance metrics requires careful system parameter selection.

Original languageEnglish
Title of host publication2021 IEEE Global Communications Conference, GLOBECOM 2021 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers
Number of pages7
ISBN (Electronic)978-1-7281-8104-2
DOIs
Publication statusPublished - 2 Feb 2022
Externally publishedYes
Event2021 IEEE Global Communications Conference, GLOBECOM 2021 - Madrid, Spain
Duration: 7 Dec 202111 Dec 2021

Conference

Conference2021 IEEE Global Communications Conference, GLOBECOM 2021
Country/TerritorySpain
CityMadrid
Period7/12/2111/12/21

Bibliographical note

Funding Information:
The authors would like to acknowledge the support from the Natural Science Foundation of China (NSFC) under grant 61971461, and the Hubei Provincial Key R&D Program under grant 2020BAA002.

Funding

The authors would like to acknowledge the support from the Natural Science Foundation of China (NSFC) under grant 61971461, and the Hubei Provincial Key R&D Program under grant 2020BAA002.

Keywords

  • accumulated-cache-hit-ratio
  • average-content-provision-cost
  • deep deterministic policy gradient
  • dynamic content popularity
  • Edge caching

Fingerprint

Dive into the research topics of 'Deep Deterministic Policy Gradient-Based Edge Caching: An Inherent Performance Tradeoff'. Together they form a unique fingerprint.

Cite this