Parallelization of NAS parallel benchmarks for Intel Xeon Phi coprocessor in Fortran-DVMH language
Vestnik Ûžno-Uralʹskogo gosudarstvennogo universiteta. Seriâ Vyčislitelʹnaâ matematika i informatika, Tome 4 (2015) no. 4, pp. 48-63 Cet article a éte moissonné depuis la source Math-Net.Ru

Voir la notice de l'article

The article analyzes the effectiveness of the implementation of NAS benchmarks from NPB 3.3.1 package (EP, MG, BT, SP, LU) on cluster nodes with different architectures using multi-core processors, NVidia graphics accelerators and Intel coprocessors. Characteristics of tests de-veloped in high-level Fortran-DVMH language (hereafter referred to as FDVMH), and their im-plementation in other languages are compared. We research the effect of different optimization methods for FDVMH NAS benchmarks necessary for their effective work on Intel Xeon Phi co-processor. The results of the simultaneous using of all cores of CPU, GPU and Intel Xeon Phi co-processor are presented.
Mots-clés : DVMH, coprocessor, Fortran.
Keywords: high-level programming language, accelerator, GPU, NAS Parallel Benchmarks
@article{VYURV_2015_4_4_a2,
     author = {V. F. Aleksahin and V. A. Bakhtin and O. F. Zhukova and A. S. Kolganov and V. A. Krukov and I. P. Ostrovskaya and N. V. Podderugina and M. N. Pritula and O. A. Savitskaya},
     title = {Parallelization of {NAS} parallel benchmarks for {Intel} {Xeon} {Phi} coprocessor in {Fortran-DVMH} language},
     journal = {Vestnik \^U\v{z}no-Uralʹskogo gosudarstvennogo universiteta. Seri\^a Vy\v{c}islitelʹna\^a matematika i informatika},
     pages = {48--63},
     year = {2015},
     volume = {4},
     number = {4},
     language = {ru},
     url = {http://geodesic.mathdoc.fr/item/VYURV_2015_4_4_a2/}
}
TY  - JOUR
AU  - V. F. Aleksahin
AU  - V. A. Bakhtin
AU  - O. F. Zhukova
AU  - A. S. Kolganov
AU  - V. A. Krukov
AU  - I. P. Ostrovskaya
AU  - N. V. Podderugina
AU  - M. N. Pritula
AU  - O. A. Savitskaya
TI  - Parallelization of NAS parallel benchmarks for Intel Xeon Phi coprocessor in Fortran-DVMH language
JO  - Vestnik Ûžno-Uralʹskogo gosudarstvennogo universiteta. Seriâ Vyčislitelʹnaâ matematika i informatika
PY  - 2015
SP  - 48
EP  - 63
VL  - 4
IS  - 4
UR  - http://geodesic.mathdoc.fr/item/VYURV_2015_4_4_a2/
LA  - ru
ID  - VYURV_2015_4_4_a2
ER  - 
%0 Journal Article
%A V. F. Aleksahin
%A V. A. Bakhtin
%A O. F. Zhukova
%A A. S. Kolganov
%A V. A. Krukov
%A I. P. Ostrovskaya
%A N. V. Podderugina
%A M. N. Pritula
%A O. A. Savitskaya
%T Parallelization of NAS parallel benchmarks for Intel Xeon Phi coprocessor in Fortran-DVMH language
%J Vestnik Ûžno-Uralʹskogo gosudarstvennogo universiteta. Seriâ Vyčislitelʹnaâ matematika i informatika
%D 2015
%P 48-63
%V 4
%N 4
%U http://geodesic.mathdoc.fr/item/VYURV_2015_4_4_a2/
%G ru
%F VYURV_2015_4_4_a2
V. F. Aleksahin; V. A. Bakhtin; O. F. Zhukova; A. S. Kolganov; V. A. Krukov; I. P. Ostrovskaya; N. V. Podderugina; M. N. Pritula; O. A. Savitskaya. Parallelization of NAS parallel benchmarks for Intel Xeon Phi coprocessor in Fortran-DVMH language. Vestnik Ûžno-Uralʹskogo gosudarstvennogo universiteta. Seriâ Vyčislitelʹnaâ matematika i informatika, Tome 4 (2015) no. 4, pp. 48-63. http://geodesic.mathdoc.fr/item/VYURV_2015_4_4_a2/

[1] Top500 List - November 2014, (data obrascheniya: 01.04.2015) http://top500.org/list/2014/11/

[2] High Performance Fortran, (data obrascheniya: 01.04.2015) http://hpff.rice.edu

[3] Konovalov N.A., Kryukov V.A, Mikhajlov S.N., Pogrebtsov A.A., “Fortan DVM: a Language for Portable Parallel Program Development”, Programming and Computer Software, 1995, no. 1, 49–54 | MR | Zbl

[4] Konovalov N.A., Krukov V.A, Sazanov Y.L., “C-DVM – a Language for the Development of Portable Parallel Programs”, Programming and Computer Software, 1999, no. 1, 54–65 | Zbl

[5] OpenACC, (data obrascheniya: 01.04.2015) http://www.openacc-standard.org/

[6] OpenMP 4.0 Specifications, (data obrascheniya: 01.04.2015) http://openmp.org/wp/openmp-specifications/

[7] Intel Ivy Bridge-EP architecture, (accessed: 30.11.2014) http://www.intel.ru/content/www/ru/ru/secure/intelligent-systems/privileged/ivy-bridge-ep/xeon-e5-1600-2600-v2-bsdl.html

[8] Intel MIC architecture, (accessed: 01.04.2015) https://software.intel.com/mic-developer

[9] Nidia Kepler architecture, (accessed: 01.04.2015) http://www.nvidia.com/content/PDF/kepler/NVIDIA-kepler-GK110-Architecture-Whitepaper.pdf

[10] NAS Parallel Benchmarks, (data obrascheniya: 01.04.2015) http://www.nas.nasa.gov/publications/npb.html

[11] Bakhtin V.A, Klinov M.S., Krukov V.A., Podderugina N.V., Pritula M.N., Sazanov Yu.L., “Extension of the DVM-model of parallel programming for clusters with heterogeneous nodes”, Bulletin of South Ural State University. Series: Mathematical Modeling, Programming Computer Software, 2012, no. 18(277), 82–92

[12] Intel Xeon Phi programming environment, (data obrascheniya: 01.04.2015) https://software.intel.com/en-us/articles/intel-xeon-phi-programming-environment

[13] Aleksahin V.F, Bakhtin V.A, Zhukova O.F., Kolganov A.S., Krukov V.A., Podderugina N.V., Pritula M.N., Savitskaya O.A., Shubert A.V., “Parallelization on GPUs of NPB 3.3 NAS tests on Fortran DVMH language”, Bulletin of Ufa State Aviation Technical University, 19:1 (2015), 240–250

[14] A. Ramachandran, J. Vienne, R. Wijngaart, L. Koesterke, I. Sharapov, “Performance Evaluation of NAS Parallel Benchmarks on Intel Xeon Phi”, Proceedings of the 42nd International Conference on Parallel Processing, 2013, 736–743 | DOI