Bottlenecks in organizing the workflows of large HPC centers
Numerical methods and programming, Tome 24 (2023) no. 1, pp. 1-9
Voir la notice de l'article provenant de la source Math-Net.Ru
Effective output from data centers are determined by many complementary factors. Often, attention is paid to only a few, at first glance, the most significant of them. For example, this is the efficiency of the scheduler, the efficiency of resource utilization by user tasks. At the same time, a more general view of the problem is often missed: the level at which the interconnection of work processes in the HPC center is determined, the organization of effective work as a whole. missions at this stage can negate any subtle optimizations at a low level. This paper provides a scheme for describing workflows in the supercomputer center and analyzes the experience of large HPC facilities in identifying the bottlenecks in this chain. A software implementation option that gives the possibility of optimizing the organization of work at all stages is also proposed in the form of a support system for the functioning of the HPC site.
Keywords:
supercomputing, provision of computing resources, use of computing resources, workflows at supercomputer center, shared research facilities, provision of computing services.
Dmitry A. Nikitenko. Bottlenecks in organizing the workflows of large HPC centers. Numerical methods and programming, Tome 24 (2023) no. 1, pp. 1-9. http://geodesic.mathdoc.fr/item/VMP_2023_24_1_a2/
@article{VMP_2023_24_1_a2,
author = {Dmitry A. Nikitenko},
title = {Bottlenecks in organizing the workflows of large {HPC} centers},
journal = {Numerical methods and programming},
pages = {1--9},
year = {2023},
volume = {24},
number = {1},
language = {en},
url = {http://geodesic.mathdoc.fr/item/VMP_2023_24_1_a2/}
}