IDT: Intelligent Data Placement for Multi-tiered Main Memory with Reinforcement Learning

  • Chang, Juneseo
  • Doh, Wanju
  • Moon, Yaebin
  • Lee, Eojin
  • Ahn, Jung Ho
Citations

WEB OF SCIENCE

5
Citations

SCOPUS

8

초록

To address the limitation of a DRAM-based single-tier in satisfying the comprehensive demands of main memory, multi-tiered memory systems are gaining widespread adoption. To support these systems, operating-system-level solutions that analyze the application's memory access patterns and ensure data placement in the appropriate memory tier have been vastly explored. In this paper, we identify reinforcement learning (RL) as an effective solution for tiered memory management, and its policy can be formulated in a solvable form using RL. We also demonstrate that an effective region-granularity memory access monitoring method is necessary to provide an accurate environment state to the RL model. Thus, we propose IDT, an intelligent data placement for multi-tiered main memory. IDT incorporates an RL-based demotion policy autotuning and a mechanism that efficiently demotes cold pages to lower-tier memory. IDT also promotes hot pages to upper-tier memory to minimize access on slow memory, featuring a lightweight machine learning algorithm. IDT employs region-granularity memory access monitoring with statistical-testing-based adjacent region merge and split to improve precision and mitigate ambiguity observed in priorworks. Experiments on an actual four-tiered memory system show that IDT achieves an average 2.08x speedup over the default Linux kernel and 11.2% performance improvement compared to the state-of-the-art solution.

키워드

Memory TieringEmerging Memory TechnologiesMemory ManagementReinforcement Learning
제목
IDT: Intelligent Data Placement for Multi-tiered Main Memory with Reinforcement Learning
저자
Chang, JuneseoDoh, WanjuMoon, YaebinLee, EojinAhn, Jung Ho
DOI
10.1145/3625549.3658659
발행일
2024
유형
Proceedings Paper
저널명
PROCEEDINGS OF THE 33RD INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE PARALLEL AND DISTRIBUTED COMPUTING, HPDC 2024