Octoshell: large supercomputer complex administration system
Vestnik Ûžno-Uralʹskogo gosudarstvennogo universiteta. Seriâ Vyčislitelʹnaâ matematika i informatika, Volume 5 (2016), no. 3, pp. 76-95. This article was harvested from the source Math-Net.Ru.

Managing and administering modern supercomputer centers, and the HPC systems within them, is a complex task. Relying on numerous traditional stand-alone tools for supercomputer administration and management becomes a bottleneck for efficient resource utilization as systems grow in scale. The Octoshell system, developed to support the operation of supercomputer centers, is aimed at solving this problem. It brings essential administration tools together in a single interface and allows significant automation of typical management tasks, thereby increasing the overall efficiency of a large supercomputer complex.
Keywords: supercomputer, monitoring, managing HPC center, administering supercomputers, user support.
@article{VYURV_2016_5_3_a5,
     author = {D. A. Nikitenko and V. V. Voevodin and S. A. Zhumatiy},
     title = {Octoshell: large supercomputer complex administration system},
     journal = {Vestnik \^U\v{z}no-Uralʹskogo gosudarstvennogo universiteta. Seri\^a Vy\v{c}islitelʹna\^a matematika i informatika},
     pages = {76--95},
     year = {2016},
     volume = {5},
     number = {3},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/VYURV_2016_5_3_a5/}
}
D. A. Nikitenko; V. V. Voevodin; S. A. Zhumatiy. Octoshell: large supercomputer complex administration system. Vestnik Ûžno-Uralʹskogo gosudarstvennogo universiteta. Seriâ Vyčislitelʹnaâ matematika i informatika, Volume 5 (2016), no. 3, pp. 76-95. http://geodesic.mathdoc.fr/item/VYURV_2016_5_3_a5/

[1] Top50 Supercomputers, (accessed: 02.08.2015) http://top50.supercomputers.ru

[2] Top500 Supercomputer Sites, (accessed: 02.08.2015) http://top500.org

[3] V.V. Voevodin, A.S. Antonov, P.A. Bryzgalov, D.A. Nikitenko, S.I. Sobolev, K.S. Stefanov, Vad.V. Voevodin, S.A. Zhumatij, “Practice of «Lomonosov» Supercomputer”, Open Systems, 2012, no. 7, 36–39

[4] S.A. Zhumatij, D.A. Nikitenko, “Flexible Approach to the Management of Supercomputers”, Scientific Service over Internet: All Shades of Parallelism: Proceedings of the International Supercomputing Conference (Novorossiisk, 23–28 September 2013), Publishing of the MSU, Moscow, 2013, 296–300

[5] S.A. Zhumatij, O.V. Datsyuk, Administering of Supercomputers and Cluster Systems, Publishing of the MSU, Moscow, 2014, 400 pp.

[6] Torque Batch System, (accessed: 02.08.2015) http://www.adaptivecomputing.com/products/open-source/torque/

[7] SLURM Workload Manager, (accessed: 02.08.2015) http://slurm.schedmd.com/

[8] OpenPBS, (accessed: 02.08.2015) http://www.mcs.anl.gov/research/projects/openpbs/

[9] Ganglia Monitoring System, (accessed: 02.08.2015) http://ganglia.sourceforge.net/

[10] Zabbix Monitoring, (accessed: 02.08.2015) http://www.zabbix.com/ru/

[11] Nagios Monitoring, (accessed: 02.08.2015) https://www.nagios.org/

[12] Open-source Ticket Request System, (accessed: 02.08.2015) http://www.otrs.org/

[13] S.N. Leonenkov, “Extending Functionality of SLURM Supercomputer Resource Manager”, Scientific Service over Internet: Diversity of Supercomputing Worlds: Proceedings of the International Supercomputing Conference (Novorossiisk, 22–27 September 2014), Publishing of the MSU, Moscow, 2014, 472–476

[14] D.A. Nikitenko, “Complex Analysis of Supercomputer Systems’ Performance Based on System Monitoring Data”, Computational Methods and Programming: New Computational Technologies, 15 (2014), 85–97

[15] A.S. Antonov, S.A. Zhumatij, D.A. Nikitenko, K.S. Stefanov, A.M. Teplov, P.A. Shvets, “Dynamic Characteristics Analysis of Jobs Sequence on Supercomputer System”, Computational Methods and Programming: New Computational Technologies, 14:2 (2013), 104–108

[16] K.S. Stefanov, “Supercomputer Performance Monitoring System”, Bulletin of Perm National Research Polytechnical University. Aerospace Technics, 2014, no. 39, 17–34

[17] V.V. Voevodin, “Situational Screen of Supercomputer”, Open Systems, 2014, no. 3, 36–39

[18] A.S. Antonov, V.V. Voevodin, A.A. Daugel-Dauge, S.A. Zhumatij, D.A. Nikitenko, S.I. Sobolev, K.S. Stefanov, P.A. Shvets, “Securing of Active Control and Efficient Autonomous Operating of MSU Supercomputer Center”, Bulletin of South Ural State University. Series: Computational Mathematics and Informatics, 4:2 (2015), 33–43