Date post: | 21-Jan-2019 |
Category: |
Documents |
Upload: | truongkien |
View: | 218 times |
Download: | 0 times |
DVFS-Aware Application Classification to improve
GPGPUs Energy Efficiency
PHD PROGRAM ELECTRICAL AND COMPUTER ENGINEERING
At Instituto Superior Técnico, Universidade de Lisboa
JOÃO GUERREIRO ([email protected])
Supervisors: Pedro Tomás and Nuno Roma
PhD Program Electrical and Computer Engineering
1. Motivation
Given the increasing usage of Graphics Processing Units (GPUs) in High
Performance Computing (HPC) systems, intensifies the importance of finding
reliable mechanisms that ensure the maximum efficiency of the computing
system (both in performance and energy-consumption).
The architecture of current GPU devices allows for the different components to
be clocked at distinct and independent frequencies. Dynamic Voltage and
Frequency Scaling (DVFS) techniques are one of the most promising power
management strategies, due to the inherent potential for significant power and
energy savings in many of the computer system components.
The resulting effects of changing the frequency of GPU subsystems largely
depends on the considered application and it is closely tied to each application's
characteristics (Figure 1). While it can be expected that a decrease of the core
frequency (𝐹𝐶𝑜𝑟𝑒) and voltage (𝑉𝐶𝑜𝑟𝑒) will cause the kernel's execution time to
increase (𝑇 ∝ ൗ1 𝐹𝐶𝑜𝑟𝑒 ) and the resulting power consumption to decrease
(𝑃𝐷𝑦𝑛𝑎𝑚𝑖𝑐 ∝ 𝐹𝐶𝑜𝑟𝑒𝑉𝐶𝑜𝑟𝑒2 and 𝑃𝑆𝑡𝑎𝑡𝑖𝑐∝ 𝑉𝐶𝑜𝑟𝑒𝑒
𝛾𝑉𝐶𝑜𝑟𝑒 ), the actual values for the
application's performance and power consumption over different frequencies
are highly dependent on the way each application exploits each of the GPU
subsystems.
2. Proposed Classification Method
3. Performance and Power Classification
By providing a systematic mechanism to characterize the impact of DVFS on
the energy-consumption of any GPU application, the proposed DVFS-aware
classification methodologies create many interesting opportunities to improve
the energy-efficiency of HPC systems.
All obtained energy-savings using the selected frequency levels are at most
3% distant from the optimal energy-savings, with average energy-savings of
14% versus the reference and 7% versus NVIDIA’s Auto-boost setup. In the
case where a 10% performance drop-off is allowed, some applications can
achieve up to 30% energy-savings.
Given the paradigm of GPU architectures, the execution of applications
generally consists on the execution of instructions on different GPU
components in partially or fully overlapped manner. However, the power
consumption of the several different components cannot be hidden or
masqueraded, and must be always combined together in order to obtain the
total power consumption of the GPU.
4. Energy-Savings
Encontro Ciência 2017