Bottlenecks in organizing the workflows of large HPC centers
Numerical methods and programming, Tome 24 (2023) no. 1, pp. 1-9
Voir la notice de l'article provenant de la source Math-Net.Ru
Effective output from data centers are determined by many complementary factors. Often, attention is paid to only a few, at first glance, the most significant of them. For example, this is the efficiency of the scheduler, the efficiency of resource utilization by user tasks. At the same time, a more general view of the problem is often missed: the level at which the interconnection of work processes in the HPC center is determined, the organization of effective work as a whole. missions at this stage can negate any subtle optimizations at a low level. This paper provides a scheme for describing workflows in the supercomputer center and analyzes the experience of large HPC facilities in identifying the bottlenecks in this chain. A software implementation option that gives the possibility of optimizing the organization of work at all stages is also proposed in the form of a support system for the functioning of the HPC site.
Keywords:
supercomputing, provision of computing resources, use of computing resources, workflows at supercomputer center, shared research facilities, provision of computing services.
@article{VMP_2023_24_1_a2,
author = {Dmitry A. Nikitenko},
title = {Bottlenecks in organizing the workflows of large {HPC} centers},
journal = {Numerical methods and programming},
pages = {1--9},
publisher = {mathdoc},
volume = {24},
number = {1},
year = {2023},
language = {en},
url = {http://geodesic.mathdoc.fr/item/VMP_2023_24_1_a2/}
}
Dmitry A. Nikitenko. Bottlenecks in organizing the workflows of large HPC centers. Numerical methods and programming, Tome 24 (2023) no. 1, pp. 1-9. http://geodesic.mathdoc.fr/item/VMP_2023_24_1_a2/