1. Methods to Load Balance a GCR Pressure Solver Using a Stencil Framework on Multi- and Many-Core Architectures.
- Author
-
Ciznicki, Milosz, Kulczewski, Michal, Kopta, Piotr, and Kurowski, Krzysztof
- Subjects
- *
LOAD balancing (Computer networks) , *DECOMPOSITION method , *GRAPHICS processing units , *COMPUTER architecture , *CONJUGATE gradient methods , *HIGH performance computing , *COMPUTER algorithms - Abstract
The recent advent of novel multi- and many-core architectures forces application programmers to deal with hardware-specific implementation details and to be familiar with software optimisation techniques to benefit from new high-performance computing machines. Extra care must be taken for communication-intensive algorithms, which may be a bottleneck for forthcoming era of exascale computing. This paper aims to present a high-level stencil framework implemented for the EULerian or LAGrangian model (EULAG) that efficiently utilises multi- and many-cores architectures. Only an efficient usage of both many-core processors (CPUs) and graphics processing units (GPUs) with the flexible data decomposition method can lead to the maximum performance that scales the communication-intensive Generalized Conjugate Residual (GCR) elliptic solver with preconditioner. [ABSTRACT FROM AUTHOR]
- Published
- 2015
- Full Text
- View/download PDF