Back to Search Start Over

Hybrid Parallelization and Performance Optimization of the FLEUR Code: New Possibilities for All-Electron Density Functional Theory

Authors :
Stefan Blügel
Daniel Wortmann
Gregor Michalicek
Uliana Alekseeva
Source :
Cham : Springer International Publishing, Lecture Notes in Computer Science 11014, 735-748 (2018)., Euro-Par 2018: Parallel Processing / Aldinucci, Marco (Editor), Euro-Par 2018: Parallel Processing / Aldinucci, Marco (Editor) 24th International Conference on Parallel and Distributed Computing, Turin, Italy, 2018-08-27-2018-08-31, Euro-Par 2018: Parallel Processing ISBN: 9783319969824, Euro-Par
Publication Year :
2018
Publisher :
Springer International Publishing, 2018.

Abstract

A hybrid MPI+OpenMP parallelization strategy has been implemented into the density functional theory code FLEUR. Based on the full-potential linearized augmented plane-wave (FLAPW) method, FLEUR is a well-established all-electron code specialized on the simulation of materials properties of crystalline bulk solids and surfaces with significant electronic and magnetic complexity. Developed in over 30 years the Fortran implementation included two layers of MPI-based distributed memory parallelization that serves as a reference for our work. The revised code version shows superior performance, improved scalability and thereby opens the path to exploit current and future high performance computing architectures efficiently. Multiple threads per MPI process can be utilized by interfacing with optimized linear algebra subroutines from the BLAS and LAPACK libraries as well as in code sections with explicit OpenMP statements. We demonstrate that the additional multithreading helps to avoid the communication induced scalability limit of the pure-MPI version and simultaneously boosts the single node-performance on current multi-core systems. This enables FLEUR calculations for unit cells with over 1000 atoms to simulate extended defects, surfaces and disordered solids.

Details

Language :
English
ISBN :
978-3-319-96982-4
ISBNs :
9783319969824
Database :
OpenAIRE
Journal :
Cham : Springer International Publishing, Lecture Notes in Computer Science 11014, 735-748 (2018)., Euro-Par 2018: Parallel Processing / Aldinucci, Marco (Editor), Euro-Par 2018: Parallel Processing / Aldinucci, Marco (Editor) 24th International Conference on Parallel and Distributed Computing, Turin, Italy, 2018-08-27-2018-08-31, Euro-Par 2018: Parallel Processing ISBN: 9783319969824, Euro-Par
Accession number :
edsair.doi.dedup.....cf68664836dabeba5be97aaca683279f