Back to Search Start Over

Heterogeneous parallel algorithm design and performance optimization for WENO on the Sunway Taihulight supercomputer

Authors :
Jianqiang Huang
Xiaoying Wang
Wenguang Chen
Wentao Han
Source :
Tsinghua Science and Technology. 25:56-67
Publication Year :
2020
Publisher :
Tsinghua University Press, 2020.

Abstract

A Weighted Essentially Non-Oscillatory scheme (WENO) is a solution to hyperbolic conservation laws, suitable for solving high-density fluid interface instability with strong intermittency. These problems have a large and complex flow structure. To fully utilize the computing power of High Performance Computing (HPC) systems, it is necessary to develop specific methodologies to optimize the performance of applications based on the particular system's architecture. The Sunway TaihuLight supercomputer is currently ranked as the fastest supercomputer in the world. This article presents a heterogeneous parallel algorithm design and performance optimization of a high-order WENO on Sunway TaihuLight. We analyzed characteristics of kernel functions, and proposed an appropriate heterogeneous parallel model. We also figured out the best division strategy for computing tasks, and implemented the parallel algorithm on Sunway TaihuLight. By using access optimization, data dependency elimination, and vectorization optimization, our parallel algorithm can achieve up to 172X speedup on one single node, and additional 58X speedup on 64 nodes, with nearly linear scalability.

Details

ISSN :
10070214
Volume :
25
Database :
OpenAIRE
Journal :
Tsinghua Science and Technology
Accession number :
edsair.doi...........886e6d91557a33f75c8c894489a38631
Full Text :
https://doi.org/10.26599/tst.2018.9010112