Market Mechanism-Based User-in-the-Loop Scalable Power Oversubscription for HPC Systems

Abstract

High power consumption is one of the major challenges for current and future high-performance computing (HPC) systems. Yet HPC systems generally underutilize their provisioned power, which makes them good candidates for power oversubscription to reclaim the unused capacity. An oversubscribed HPC system, however, may occasionally become overloaded. In this paper, we propose MPR (Market-based Power Reduction), a scalable market-based approach in which users actively participate in reducing the HPC system’s power consumption to mitigate overloads. In MPR, HPC users bid to supply, in exchange for incentives, the resource reduction required to handle the overloads. Using several real-world trace-based simulations, we extensively evaluate MPR and show that, by participating in MPR, users always receive more in rewards than they lose to performance degradation. At the same time, the HPC manager enjoys orders of magnitude more resource gain than her incentive payoff to the users. We also demonstrate the real-world effectiveness of MPR on a prototype system.

Publication
The 29th IEEE International Symposium on High-Performance Computer Architecture (IEEE HPCA 2023)
Md Rajib Hossen
PhD Candidate in Computer Science

My research interests include Microservices, HPC, Distributed Systems, Machine Learning, and Converged Computing

Kishwar Ahmed
Assistant Professor in Computer Science Department
Mohammad A. Islam
Assistant Professor in Computer Science Department