1. An analysis of the feasibility and benefits of GPU/multicore acceleration of the Weather Research and Forecasting model
- Author
- Tetsuya Takemi and Wim Vanderbauwhede
- Subjects
010504 meteorology & atmospheric sciences, Computer Networks and Communications, Computer science, Fortran, Weather forecasting, Climate change, Environmental pollution, 02 engineering and technology, Parallel computing, 01 natural sciences, Theoretical Computer Science, 0203 mechanical engineering, Fluid dynamics, Precipitation, 0105 earth and related environmental sciences, Severe weather, Advection, Numerical weather prediction, Computer Science Applications, 020303 mechanical engineering & transports, Computational Theory and Mathematics, Weather Research and Forecasting Model, Hardware acceleration, Central processing unit, Software
- Abstract
There is a growing need for ever more accurate climate and weather simulations to be delivered in shorter timescales, in particular, to guard against severe weather events such as hurricanes and heavy rainfall. Due to climate change, the severity and frequency of such events - and thus the economic impact - are set to rise dramatically. Hardware acceleration using graphics processing units (GPUs) or Field-Programmable Gate Arrays (FPGAs) could potentially result in much reduced run times or higher accuracy simulations. In this paper, we present the results of a study of the Weather Research and Forecasting (WRF) model undertaken in order to assess whether GPU and multicore acceleration of this type of numerical weather prediction (NWP) code is both feasible and worthwhile. The focus of this paper is on acceleration of code running on a single compute node through offloading of parts of the code to an accelerator such as a GPU. The governing equation set of the WRF model is based on compressible, non-hydrostatic atmospheric motion with multi-physics processes. We put this work into context by discussing its more general applicability to multi-physics fluid dynamics codes: in many fluid dynamics codes, the numerical schemes for the advection terms are based on finite differences between neighboring cells, similar to the WRF code. For fluid systems including multi-physics processes, there are many calls to these advection routines. This class of numerical codes will benefit from hardware acceleration. We studied the performance of the original code of the WRF model and proposed a simple model for comparing multicore CPU and GPU performance. Based on the results of extensive profiling of representative WRF runs, we focused on the acceleration of the scalar advection module. We discuss the implementation of this module as a data-parallel kernel in both OpenCL and OpenMP.
We show that our data-parallel kernel version of the scalar advection module runs up to seven times faster on the GPU compared with the original code on the CPU. However, because the data transfer cost between GPU and CPU is very high, as shown by our analysis, the fully integrated code achieves only a small speed-up (two times). We show that it would be possible to offset the data transfer cost through GPU acceleration of a larger portion of the dynamics code. In order to carry out this research, we also developed an extensible software system for integrating OpenCL code into large Fortran code bases such as WRF. This is one of the main contributions of our work. We discuss the system to show how it allows sections of the original code base to be replaced with their OpenCL counterparts with minimal changes - literally only a few lines - to the original code. Our final assessment is that, even with the current system architectures, accelerating WRF - and hence also other, similar types of multi-physics fluid dynamics codes - by a factor of up to five is definitely an achievable goal. Accelerating multi-physics fluid dynamics codes, including NWP codes, is vital for their application to weather forecasting, environmental pollution warning, and emergency response to the dispersion of hazardous materials. Implementing hardware acceleration capability for fluid dynamics and NWP codes is a prerequisite for up-to-date and future computer architectures. Copyright © 2015 John Wiley & Sons, Ltd.
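The gap the abstract describes between the kernel speed-up and the speed-up of the fully integrated code can be illustrated with a small back-of-the-envelope calculation. The sketch below is not the performance model proposed in the paper; it is a generic Amdahl-style estimate, and the function name, parameters, and all timings are hypothetical values chosen for illustration only.

```python
def integrated_speedup(t_rest, t_kernel, kernel_speedup, t_transfer):
    """Estimate the overall speed-up when one kernel is offloaded.

    t_rest         -- time spent in code that stays on the CPU (hypothetical)
    t_kernel       -- original CPU time of the offloaded kernel (hypothetical)
    kernel_speedup -- how much faster the kernel runs on the accelerator
    t_transfer     -- added host<->device data transfer time (hypothetical)

    All times must use the same unit.
    """
    t_original = t_rest + t_kernel
    t_accelerated = t_rest + t_kernel / kernel_speedup + t_transfer
    return t_original / t_accelerated


# A kernel that is seven times faster, with no transfer cost.
no_transfer = integrated_speedup(3.0, 7.0, 7.0, 0.0)

# The same kernel once a (made-up) transfer cost is included.
with_transfer = integrated_speedup(3.0, 7.0, 7.0, 4.0)

# Offloading more of the code for the same transfer cost raises the speed-up.
more_offloaded = integrated_speedup(0.0, 10.0, 7.0, 4.0)

print(no_transfer, with_transfer, more_offloaded)
```

Under these assumed numbers, the transfer term dominates the accelerated run time, and moving a larger share of the work onto the accelerator amortizes it, which is the qualitative argument the abstract makes for accelerating more of the dynamics code.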
- Published
- 2015