Back to Search Start Over

Evaluating performance portability of five shared-memory programming models using a high-order unstructured CFD solver.

Authors :
Dai, Zhe
Deng, Liang
Che, YongGang
Li, Ming
Zhang, Jian
Wang, Yueqing
Source :
Journal of Parallel & Distributed Computing. May2024, Vol. 187, pN.PAG-N.PAG. 1p.
Publication Year :
2024

Abstract

This paper presents implementing and optimizing a high-order unstructured computational fluid dynamics (CFD) solver using five shared-memory programming models: CUDA, OpenACC, OpenMP, Kokkos, and OP2. The study aims to evaluate the performance of these models on different hardware architectures, including NVIDIA GPUs, x86-based Intel/AMD, and Arm-based systems. The goal is to determine whether these models can provide developers with performance-portable solvers running efficiently on various architectures. The paper forms a more holistic view of a high-order solver across multiple platforms by visualizing performance portability (PP) and measuring productivity. It gives guidelines for translating existing codebases and their data structures to these models. • We port and optimize a high-order unstructured CFD application by using five shared-memory programming models. • We evaluate the performance portability of five programming models on diverse hardware. • We analyze the workload from the perspective of code volume and learning cost. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
07437315
Volume :
187
Database :
Academic Search Index
Journal :
Journal of Parallel & Distributed Computing
Publication Type :
Academic Journal
Accession number :
175393770
Full Text :
https://doi.org/10.1016/j.jpdc.2023.104831