New features of parallel implementation of $N$-body problems on GPU
Vestnik Ûžno-Uralʹskogo gosudarstvennogo universiteta. Seriâ, Matematičeskoe modelirovanie i programmirovanie, Tome 11 (2018) no. 1, pp. 124-136 Cet article a éte moissonné depuis la source Math-Net.Ru

Voir la notice de l'article

This paper focuses on the parallel implementation of a direct $N$-body method (particle-particle algorithm) and the application of multiple GPUs for galactic dynamics simulations. Application of a hybrid OpenMP-CUDA technology is considered for models with a number of particles $N \sim 10^5 \div 10^7$. By means of $N$-body simulations of gravitationally unstable stellar galactic we have investigated the algorithms parallelization efficiency for various Nvidia Tesla graphics processors (K20, K40, K80). Particular attention was paid to the parallel performance of simulations and accuracy of the numerical solution by comparing single and double floating-point precisions (SP and DP). We showed that the double-precision simulations are slower by a factor of $1,7$ than the single-precision runs performed on Nvidia Tesla K-Series processors. We also claim that application of the single-precision operations leads to incorrect result in the evolution of the non-axisymmetric gravitating $N$-body systems. In particular, it leads to significant quantitative and even qualitative distortions in the galactic disk evolution. For instance, after $10^4$ integration time steps for the single-precision numbers the total energy, momentum, and angular momentum of a system with $N = 2^{20}$ conserve with accuracy of $10^{-3}$, $10^{-2}$ and $10^{-3}$ respectively, in comparison to the double-precision simulations these values are $10^{-5}$, $10^{-15}$ and $10^{-13}$, respectively. Our estimations evidence in favour of usage of the second-order accuracy schemes with double-precision numbers since it is more efficient than in the fourth-order schemes with single-precision numbers.
Keywords: Multi-GPU; OpenMP-CUDA; GPU-Direct; Nvidia Tesla; N-body; single and double precision numerical simulation; collisionless system; gravitational instability.
@article{VYURU_2018_11_1_a10,
     author = {S. S. Khrapov and S. A. Khoperskov and A. V. Khoperskov},
     title = {New features of parallel implementation of $N$-body problems on {GPU}},
     journal = {Vestnik \^U\v{z}no-Uralʹskogo gosudarstvennogo universiteta. Seri\^a, Matemati\v{c}eskoe modelirovanie i programmirovanie},
     pages = {124--136},
     year = {2018},
     volume = {11},
     number = {1},
     language = {en},
     url = {http://geodesic.mathdoc.fr/item/VYURU_2018_11_1_a10/}
}
TY  - JOUR
AU  - S. S. Khrapov
AU  - S. A. Khoperskov
AU  - A. V. Khoperskov
TI  - New features of parallel implementation of $N$-body problems on GPU
JO  - Vestnik Ûžno-Uralʹskogo gosudarstvennogo universiteta. Seriâ, Matematičeskoe modelirovanie i programmirovanie
PY  - 2018
SP  - 124
EP  - 136
VL  - 11
IS  - 1
UR  - http://geodesic.mathdoc.fr/item/VYURU_2018_11_1_a10/
LA  - en
ID  - VYURU_2018_11_1_a10
ER  - 
%0 Journal Article
%A S. S. Khrapov
%A S. A. Khoperskov
%A A. V. Khoperskov
%T New features of parallel implementation of $N$-body problems on GPU
%J Vestnik Ûžno-Uralʹskogo gosudarstvennogo universiteta. Seriâ, Matematičeskoe modelirovanie i programmirovanie
%D 2018
%P 124-136
%V 11
%N 1
%U http://geodesic.mathdoc.fr/item/VYURU_2018_11_1_a10/
%G en
%F VYURU_2018_11_1_a10
S. S. Khrapov; S. A. Khoperskov; A. V. Khoperskov. New features of parallel implementation of $N$-body problems on GPU. Vestnik Ûžno-Uralʹskogo gosudarstvennogo universiteta. Seriâ, Matematičeskoe modelirovanie i programmirovanie, Tome 11 (2018) no. 1, pp. 124-136. http://geodesic.mathdoc.fr/item/VYURU_2018_11_1_a10/

[1] Fridman A. M., Khoperskov A. V., Physics of Galactic Disks, Cambridge International Science Publishing, Cambridge, 2013

[2] Kennedy G. F., Meiron Y., Shukirgaliyev B., Panamarev T., Berczik P. et al., “The DRAGON Simulations: Globular Cluster Evolution with a Million Stars”, Monthly Notices of the Royal Astronomical Society, 458:2 (2016), 1450–1465 | DOI

[3] Khrapov S., Khoperskov A., “Smoothed-Particle Hydrodynamics Models: Implementation Features on GPUs”, Communications in Computer and Information Science, 793 (2017), 266–277 | DOI

[4] Smirnov A. A., Sotnikova N. Ya., Koshkin A. A., “Simulations of Slow Bars in Anisotropic Disk Systems”, Astronomy Letters, 43:2 (2017), 61–74 | DOI

[5] Comparat J., Prada F., Yepes G., Klypin A., “Accurate Mass and Velocity Functions of Dark Matter Haloes”, Monthly Notices of the Royal Astronomical Society, 469:4 (2017), 4157–4174 | DOI

[6] Knebe A., Stoppacher D., Prada F., Behrens C., Benson A. et al., “Multidark-Galaxies: Data Release and First Results”, Monthly Notices of the Royal Astronomical Society, 474:4 (2018), 5206–5231 | DOI

[7] Hwang J.-S., Park C., “Effects of Hot Halo Gas on Star Formation and Mass Transfer During Distant Galaxy-Galaxy Encounters”, The Astrophysical Journal, 805 (2015), 131–149 | DOI

[8] Portaluri E., Debattista V., Fabricius M., Cole D. R., Corsini E. et al., “The Kinematics of $\sigma$-drop Bulges from Spectral Synthesis Modelling of a Hydrodynamical Simulation”, Monthly Notices of the Royal Astronomical Society, 467:1 (2017), 1008–1015 | DOI

[9] Khoperskov A. V., Just A., Korchagin V. I., Jalali M. A., “High Resolution Simulations of Unstable Modes in a Collisionless Disc”, Astronomy and Astrophysics, 473 (2007), 31–40 | DOI

[10] Gelato S., Chernoff D. F., Wasserman I., “An Adaptive Hierarchical Particle-Mesh Code with Isolated Boundary Conditions”, The Astrophysical Journal, 480 (1997), 115–131 | DOI

[11] Barnes J., Hut P., “A Hierarchical $O(N\log N)$ Force-Calculation Algorithm”, Nature, 324 (1986), 446–449 | DOI

[12] Greengard L., “The Numerical Solution of the N-Body Problem”, Computers in physics, 4 (1990), 142–152 | DOI

[13] Huang S.-Y., Spurzem R., Berczik P., “Performance Analysis of Parallel Gravitational N-Body Codes on Large Gpu Clusters”, Research in Astronomy and Astrophysics, 16:1 (2016), 11 pp. | DOI

[14] Steinberg O. B., “Circular Shift of Loop Body – Programme Transformation, Promoting Parallelism”, Bulletin of the South Ural State University. Series: Mathematical Modelling, Programming and Computer Software, 10:3 (2017), 120–132 | DOI

[15] Khoperskov A., Bizyaev D., Tiurina N., Butenko M., “Numerical Modelling of the Vertical Structure and Dark Halo Parameters in Disc Galaxies”, Astronomische Nachrichten, 331 (2010), 731–745 | DOI

[16] Khoperskov A. V., Khoperskov S. A., Zasov A. V., Bizyaev D. V., Khrapov S. S., “Interaction between Collisionless Galactic Discs and Nonaxissymmetric Dark Matter Haloes”, Monthly Notices of the Royal Astronomical Society, 431 (2013), 1230–1239 | DOI

[17] Khoperskov S. A., Vasiliev E. O., Khoperskov A. V., Lubimov V. N., “Numerical Code for Multi-Component Galaxies: from N-Body to Chemistry and Magnetic Fields”, Journal of Physics: Conference Series, 510 (2014), 1–13 | DOI

[18] Rodionov S. A., Athanassoula E., Sotnikova N. Ya., “An Iterative Method for Constructing Equilibrium Phase Models of Stellar Systems”, Monthly Notices of the Royal Astronomical Society, 392:2 (2009), 904–916 | DOI | MR

[19] Bellemana R. G., Jeroen B., Simon F., Portegies Z., “High Performance Direct Gravitational N-Body Simulations on Graphics Processing Units: An Implementation in CUDA”, New Astronomy, 13 (2008), 103–112 | DOI

[20] Griv E., Wang H.-H., “Density Wave Formation in Differentially Rotating Disk Galaxies: Hydrodynamic Simulation of the Linear Regime”, New Astronomy, 30 (2014), 8–27 | DOI

[21] Romeo A., Falstad N., “A Simple and Accurate Approximation for the Q Stability Parameter in Multicomponent and Realistically Thick Discs”, Monthly Notices of the Royal Astronomical Society, 433:2 (2013), 1389–1397 | DOI