The peculiarities of the parallel implementation of Particle-In-Cell method
Vestnik Udmurtskogo universiteta. Matematika, mehanika, kompʹûternye nauki, Tome 28 (2018) no. 3, pp. 419-426
Cet article a éte moissonné depuis la source Math-Net.Ru

Voir la notice de l'article

Particle-In-Cell (PIC) method is widely used for plasma simulation and the GPUs appear to be the most efficient way to run this method. In this work we propose a technique that enables one to speed up one of the most time-consuming operations in the GPU implementation of the PIC method. The operation is particle reordering, or redistribution of particles between cells, which is performed after pushing. The reordering operation provides data locality which is the key performance issue of the PIC method. We propose to divide the reordering into two stages. First, gather the particles that are going to leave a particular cell into arrays, the number of arrays being equal to the number of neighbor cells (26 for 3D case). Second, each neighbor cell copies the particles from the necessary array to its own particle array. The second operation is done in 26 threads independently with no synchronization or waiting and involves no critical sections, semaphores, mutexes, atomic operations etc. It results in the more than 10 times reduction of the reordering time compared to the straightforward reordering algorithm.
Keywords: optimization, GPU, PIC.
Mots-clés : simulation
@article{VUU_2018_28_3_a10,
     author = {A. A. Romanenko and A. V. Snytnikov},
     title = {The peculiarities of the parallel implementation of {Particle-In-Cell} method},
     journal = {Vestnik Udmurtskogo universiteta. Matematika, mehanika, kompʹ\^uternye nauki},
     pages = {419--426},
     year = {2018},
     volume = {28},
     number = {3},
     language = {en},
     url = {http://geodesic.mathdoc.fr/item/VUU_2018_28_3_a10/}
}
TY  - JOUR
AU  - A. A. Romanenko
AU  - A. V. Snytnikov
TI  - The peculiarities of the parallel implementation of Particle-In-Cell method
JO  - Vestnik Udmurtskogo universiteta. Matematika, mehanika, kompʹûternye nauki
PY  - 2018
SP  - 419
EP  - 426
VL  - 28
IS  - 3
UR  - http://geodesic.mathdoc.fr/item/VUU_2018_28_3_a10/
LA  - en
ID  - VUU_2018_28_3_a10
ER  - 
%0 Journal Article
%A A. A. Romanenko
%A A. V. Snytnikov
%T The peculiarities of the parallel implementation of Particle-In-Cell method
%J Vestnik Udmurtskogo universiteta. Matematika, mehanika, kompʹûternye nauki
%D 2018
%P 419-426
%V 28
%N 3
%U http://geodesic.mathdoc.fr/item/VUU_2018_28_3_a10/
%G en
%F VUU_2018_28_3_a10
A. A. Romanenko; A. V. Snytnikov. The peculiarities of the parallel implementation of Particle-In-Cell method. Vestnik Udmurtskogo universiteta. Matematika, mehanika, kompʹûternye nauki, Tome 28 (2018) no. 3, pp. 419-426. http://geodesic.mathdoc.fr/item/VUU_2018_28_3_a10/

[1] Astrelin V. T., Burdakov A. V., Postupaev V. V., “Generation of ion-acoustic waves and suppression of heat transport during plasma heating by an electron beam”, Plasma Physics Reports, 24:5 (1998), 414–425 | MR

[2] Burau H., Widera R., Honig W., Juckeland G., Debus A., Kluge T., Schramm U., Cowan T. E., Sauerbrey R., Bussmann M., “PIConGPU: a fully relativistic particle-in-cell code for a GPU cluster”, IEEE Transactions on Plasma Science, 38:10 (2010), 2831–2839 | DOI

[3] Rossi F., Londrillo P., Sgattoni A., Sinigardi S., Turchetti G., “Towards robust algorithms for current deposition and dynamic load-balancing in a GPU particle in cell code”, AIP Conference Proceedings, 1507:1 (2012), 184–192 | DOI

[4] Kong X., Huang M., Ren Ch., Decyk V., “Particle-in-cell simulations with charge-conserving current deposition on graphic processing units”, Journal of Computational Physics, 230:4 (2011), 1676–1685 | DOI | Zbl

[5] Rieke M., Trost T., Grauer R., “Coupled Vlasov and two-fluid codes on GPUs”, Journal of Computational Physics, 283 (2015), 436–452 | DOI | MR | Zbl

[6] Lotov K. V., Timofeev I. V., Mesyats E. A., Snytnikov A. V., Vshivkov V. A., “Note on quantitatively correct simulations of the kinetic beam-plasma instability”, Physics of Plasmas, 22:2 (2015), 024502 | DOI

[7] Tskhakaya D., Schneider R., “Optimization of PIC codes by improved memory management”, Journal of Computational Physics, 225:1 (2007), 829–839 | DOI | Zbl

[8] Romanenko A. A., Snytnikov A. V., Timofeev I. V., “3D hybrid code for simulation of high-frequency electromagnetic radiation generation in turbulent plasma”, Vestn. Novosib. Gos. Univ. Ser. Inf. Tekhnol., 14:3 (2016), 81–90 (in Russian)

[9] Snytnikov A., Romanenko A., “Parallel template implementation of Particle-in-Cell method for hybrid supercomputers”, Bulletin of the Novosibirsk Computing Center. Series: Computer Science, 2014, vol. 36, pp. 79–88.

[10] Snytnikov A., Romanenko A., “The advantage of the GPU-based supercomputer simulation of plasma phenomena”, Bulletin of the Novosibirsk Computing Center. Series: Numerical Analysis, 16 (2013), 81–92 https://nccbulletin.ru/article/706

[11] Snytnikov A. V., Boronina M. A., Mesyats E. A., Romanenko A. A., “A technology for the design of hybrid supercomputer simulation codes for relativistic particle electrodynamics”, Proceedings of the 1st Russian Conference on Supercomputing, 2015, 17–25 http://ceur-ws.org/Vol-1482/017.pdf

[12] Grigoryev Yu.N., Vshivkov V. A., Fedoruk M. P., Numerical “Particle-in-Cell” Methods, VSP, Utrecht–Boston, 2002 | MR