1. Performance analysis of fully explicit and fully implicit solvers within a spectral element shallow-water atmosphere model
- Author
-
Patrick H. Worley, Mark A. Taylor, Carol S. Woodward, Katherine J. Evans, Rick Archibald, Matthew R. Norman, and David J. Gardner
- Subjects
020203 distributed computing ,Computer science ,Message Passing Interface ,02 engineering and technology ,Atmospheric model ,Solver ,Theoretical Computer Science ,Computational science ,law.invention ,Range (mathematics) ,Hardware and Architecture ,law ,0202 electrical engineering, electronic engineering, information engineering ,Code (cryptography) ,Climate model ,Hydrostatic equilibrium ,Scaling ,Software - Abstract
Explicit Runge–Kutta methods and implicit multistep methods utilizing a Newton–Krylov nonlinear solver are evaluated for a range of configurations of the shallow-water dynamical core of the spectral element community atmosphere model to evaluate their computational performance. These configurations are designed to explore the attributes of each method under different but relevant model usage scenarios including varied spectral order within an element, static regional refinement, and scaling to the largest problem sizes. This analysis is performed within the shallow-water dynamical core option of a full climate model code base to enable a wealth of simulations for study, with the aim of informing solver development within the more complete hydrostatic dynamical core used for climate research. The limitations and benefits to using explicit versus implicit methods, with different parameters and settings, are discussed in light of the trade-offs with Message Passing Interface (MPI) communication and memory and their inherent efficiency bottlenecks. Given the performance behavior across the configurations analyzed here, the recommendation for future work using the implicit solvers is conditional based on scale separation and the stiffness of the problem. For the regionally refined configurations, the implicit method has about the same efficiency as the explicit method, without considering efficiency gains from a preconditioner. The potential for improvement using a preconditioner is greatest for higher spectral order configurations, where more work is shifted to the linear solver. Initial simulations with OpenACC directives to utilize a Graphics Processing Unit (GPU) when performing function evaluations show improvements locally, and that overall gains are possible with adjustments to data exchanges.
- Published
- 2017
- Full Text
- View/download PDF