Back to Search Start Over

GPU Algorithms for Structured Sparse Matrix Multiplication with Diagonal Storage Schemes †.

Authors :
Haque, Sardar Anisul
Parvez, Mohammad Tanvir
Hossain, Shahadat
Source :
Algorithms. Jan2024, Vol. 17 Issue 1, p31. 18p.
Publication Year :
2024

Abstract

Matrix–matrix multiplication is of singular importance in linear algebra operations with a multitude of applications in scientific and engineering computing. Data structures for storing matrix elements are designed to minimize overhead information as well as to optimize the operation count. In this study, we utilize the notion of the compact diagonal storage method (CDM), which builds upon the previously developed diagonal storage—an orientation-independent uniform scheme to store the nonzero elements of a range of matrices. This study exploits both these storage schemes and presents efficient GPU-accelerated parallel implementations of matrix multiplication when the input matrices are banded and/or structured sparse. We exploit the data layouts in the diagonal storage schemes to expose a substantial amount of fine-grained parallelism and effectively utilize the GPU shared memory to improve the locality of data access for numerical calculations. Results from an extensive set of numerical experiments with the aforementioned types of matrices demonstrate orders-of-magnitude speedups compared with the sequential performance. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
19994893
Volume :
17
Issue :
1
Database :
Academic Search Index
Journal :
Algorithms
Publication Type :
Academic Journal
Accession number :
175057832
Full Text :
https://doi.org/10.3390/a17010031