Preprocessing of system monitoring data for workload analysis of HPC systems
Numerical methods and programming, Vol. 22 (2021) no. 3, pp. 230-238.

See the article record from the source Math-Net.Ru

HPC systems have complex architectures and contain millions of components. To ensure reliable operation and efficient output, the functioning of most subsystems should be supervised. This supervision relies on data collected from various logging and monitoring systems. Because the data come from different sources, their analysis can run into multiple processing issues. Some data subsets can be incorrect due to malfunctioning sensors, data aggregation errors in the monitoring system, and similar causes. It is therefore crucial to preprocess such monitoring data before analyzing it, taking the goals of the analysis into consideration. Drawing on monitoring data from the MSU HPC Center, this paper proposes an approach to preprocessing data from HPC monitoring systems, gives real-life examples of issues that may be encountered, and offers recommendations for further analysis of similar datasets.
Keywords: supercomputing, supercomputers, system monitoring data analysis, system monitoring data cleaning, system monitoring data reduction.
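To make the kind of cleaning described in the abstract concrete, here is a minimal Python sketch. It is not the authors' procedure, which the abstract does not detail. The `Sample` record layout, the `cpu_load` metric, its 0 to 100 percent validity range, and the `clean_samples` helper are all hypothetical choices made for illustration only.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Tuple


@dataclass(frozen=True)
class Sample:
    """One monitoring reading (hypothetical layout, not from the paper)."""
    timestamp: int              # seconds since epoch
    node: str                   # compute node identifier
    cpu_load: Optional[float]   # percent; None marks a missing reading


def clean_samples(
    samples: Iterable[Sample],
    lo: float = 0.0,
    hi: float = 100.0,
) -> Tuple[List[Sample], List[Sample]]:
    """Split readings into (valid, rejected).

    A reading is rejected if its value is missing, is NaN or falls
    outside [lo, hi] (e.g. a malfunctioning sensor), or repeats a
    (node, timestamp) pair that was already accepted (e.g. an
    aggregation error that duplicated a record).
    """
    seen = set()
    valid: List[Sample] = []
    rejected: List[Sample] = []
    for s in samples:
        key = (s.node, s.timestamp)
        missing = s.cpu_load is None
        # NaN fails every comparison, so it is caught by the range check.
        out_of_range = (not missing) and not (lo <= s.cpu_load <= hi)
        if missing or out_of_range or key in seen:
            rejected.append(s)
            continue
        seen.add(key)
        valid.append(s)
    return valid, rejected
```

Keeping the rejected readings instead of silently dropping them makes it possible to review what was filtered out, which matters when the acceptable range depends on the goal of the analysis.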
@article{VMP_2021_22_3_a3,
     author = {M. I. Martyshov and D. A. Nikitenko},
     title = {Preprocessing of system monitoring data for workload analysis of {HPC} systems},
     journal = {Numerical methods and programming},
     pages = {230--238},
     publisher = {mathdoc},
     volume = {22},
     number = {3},
     year = {2021},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/VMP_2021_22_3_a3/}
}
M. I. Martyshov; D. A. Nikitenko. Preprocessing of system monitoring data for workload analysis of HPC systems. Numerical methods and programming, Vol. 22 (2021) no. 3, pp. 230-238. http://geodesic.mathdoc.fr/item/VMP_2021_22_3_a3/