Back to Search
Start Over
Hybrid Parallelization and Performance Optimization of the FLEUR Code: New Possibilities for All-Electron Density Functional Theory
- Source :
- Cham : Springer International Publishing, Lecture Notes in Computer Science 11014, 735-748 (2018)., Euro-Par 2018: Parallel Processing / Aldinucci, Marco (Editor), Euro-Par 2018: Parallel Processing / Aldinucci, Marco (Editor) 24th International Conference on Parallel and Distributed Computing, Turin, Italy, 2018-08-27-2018-08-31, Euro-Par 2018: Parallel Processing ISBN: 9783319969824, Euro-Par
- Publication Year :
- 2018
- Publisher :
- Springer International Publishing, 2018.
-
Abstract
- A hybrid MPI+OpenMP parallelization strategy has been implemented into the density functional theory code FLEUR. Based on the full-potential linearized augmented plane-wave (FLAPW) method, FLEUR is a well-established all-electron code specialized on the simulation of materials properties of crystalline bulk solids and surfaces with significant electronic and magnetic complexity. Developed in over 30 years the Fortran implementation included two layers of MPI-based distributed memory parallelization that serves as a reference for our work. The revised code version shows superior performance, improved scalability and thereby opens the path to exploit current and future high performance computing architectures efficiently. Multiple threads per MPI process can be utilized by interfacing with optimized linear algebra subroutines from the BLAS and LAPACK libraries as well as in code sections with explicit OpenMP statements. We demonstrate that the additional multithreading helps to avoid the communication induced scalability limit of the pure-MPI version and simultaneously boosts the single node-performance on current multi-core systems. This enables FLEUR calculations for unit cells with over 1000 atoms to simulate extended defects, surfaces and disordered solids.
- Subjects :
- Computer science
Fortran
Subroutine
Process (computing)
010103 numerical & computational mathematics
Parallel computing
Supercomputer
01 natural sciences
Multithreading
0103 physical sciences
Scalability
Distributed memory
0101 mathematics
010306 general physics
computer
computer.programming_language
Subjects
Details
- Language :
- English
- ISBN :
- 978-3-319-96982-4
- ISBNs :
- 9783319969824
- Database :
- OpenAIRE
- Journal :
- Cham : Springer International Publishing, Lecture Notes in Computer Science 11014, 735-748 (2018)., Euro-Par 2018: Parallel Processing / Aldinucci, Marco (Editor), Euro-Par 2018: Parallel Processing / Aldinucci, Marco (Editor) 24th International Conference on Parallel and Distributed Computing, Turin, Italy, 2018-08-27-2018-08-31, Euro-Par 2018: Parallel Processing ISBN: 9783319969824, Euro-Par
- Accession number :
- edsair.doi.dedup.....cf68664836dabeba5be97aaca683279f