Demystifying Power and Performance Variations in GPU Systems through Microarchitectural Analysis
Computer Science and Information Systems, Tome 22 (2025) no. 2
Voir la notice de l'article provenant de la source Computer Science and Information Systems website
Graphics Processing Units (GPUs) serve efficient parallel execution for general-purpose computations at high-performance computing and embedded systems. While performance concerns guide the main optimization efforts, power issues become significant for energy-efficient and sustainable GPU executions. Profilers and simulators report statistics about the target execution; however, they either present only performance metrics in a coarse kernel function level or lack visualization support that can enable microarchitectural performance analysis or performance-power consumption comparison. Evaluating runtime performance and power consumption dynamically across GPU components enables a comprehensive tradeoff analysis for GPU architects and software developers. In this work, we present a novel memory performance and power monitoring tool for GPU programs, GPPRMon, which performs a systematic metric collection and provides useful visualization views to guide power and performance analysis for target executions. Our simulation-based framework dynamically gathers SM and memory-related microarchitectural metrics by monitoring individual instructions, and reports achieved performance and power consumption information at runtime. Our visualization interface presents spatial and temporal views of the execution. While the first demonstrates the performance and power metrics across GPU memory components, including global memory, cache, and SMs, the latter shows the corresponding information at the instruction granularity in a timeline. Based on our framework, we demonstrate performance and power analysis for memory-bound graph applications and resource-critical embedded programs from GPU benchmark suites. Our case studies reveal potential usages of our tool in memory-bound kernel identification, performance bottleneck analysis of a memory-intensive workload, performancepower evaluation of an embedded application, and the impact of input size on the memory structures of an embedded system.
Keywords:
GPU Computing, Performance monitoring, Power consumption
Burak Topcu; Deniz Karabacak; Işıl Öz. Demystifying Power and Performance Variations in GPU Systems through Microarchitectural Analysis. Computer Science and Information Systems, Tome 22 (2025) no. 2. http://geodesic.mathdoc.fr/item/CSIS_2025_22_2_a5/
@article{CSIS_2025_22_2_a5,
author = {Burak Topcu and Deniz Karabacak and I\c{s}{\i}l \"Oz},
title = {Demystifying {Power} and {Performance} {Variations} in {GPU} {Systems} through {Microarchitectural} {Analysis}},
journal = {Computer Science and Information Systems},
year = {2025},
volume = {22},
number = {2},
url = {http://geodesic.mathdoc.fr/item/CSIS_2025_22_2_a5/}
}
TY - JOUR AU - Burak Topcu AU - Deniz Karabacak AU - Işıl Öz TI - Demystifying Power and Performance Variations in GPU Systems through Microarchitectural Analysis JO - Computer Science and Information Systems PY - 2025 VL - 22 IS - 2 UR - http://geodesic.mathdoc.fr/item/CSIS_2025_22_2_a5/ ID - CSIS_2025_22_2_a5 ER -
%0 Journal Article %A Burak Topcu %A Deniz Karabacak %A Işıl Öz %T Demystifying Power and Performance Variations in GPU Systems through Microarchitectural Analysis %J Computer Science and Information Systems %D 2025 %V 22 %N 2 %U http://geodesic.mathdoc.fr/item/CSIS_2025_22_2_a5/ %F CSIS_2025_22_2_a5