Back to Search Start Over

Understanding the performance of stencil computations on Intel's Xeon Phi

Authors :
Michael A. Laurenzano
William A. Ward
Roy L. Campbell
Ananta Tiwari
Laura Carrington
Joshua Peraza
Source :
CLUSTER
Publication Year :
2013
Publisher :
IEEE, 2013.

Abstract

Accelerators are becoming prevalent in high performance computing as a way of achieving increased computational capacity within a smaller power budget. Effectively utilizing the raw compute capacity made available by these systems, however, remains a challenge because it can require a substantial investment of programmer time to port and optimize code to effectively use novel accelerator hardware. In this paper we present a methodology for isolating and modeling the performance of common performance-critical patterns of code (so-called idioms) and other relevant behavioral characteristics from large scale HPC applications which are likely to perform favorably on Intel Xeon Phi. The benefits of the methodology are twofold: (1) it directs programmer efforts toward the regions of code most likely to benefit from porting to the Xeon Phi and (2) provides speedup estimates for porting those regions of code. We then apply the methodology to the stencil idiom, showing performance improvements of up to a factor of 4.7× on stencil-based benchmark codes.

Details

Database :
OpenAIRE
Journal :
2013 IEEE International Conference on Cluster Computing (CLUSTER)
Accession number :
edsair.doi...........909b99d5cc247c39fbf8ae6968f669ac
Full Text :
https://doi.org/10.1109/cluster.2013.6702651