High Performance RDMA Based All-to-All Broadcast for InfiniBand Clusters.

Authors :
Bader, David A.
Parashar, Manish
Sridhar, Varadarajan
Prasanna, Viktor K.
Sur, S.
Bondhugula, U. K. R.
Mamidala, A.
Jin, H.-W.
Panda, D. K.
Source :
High Performance Computing - HiPC 2005; 2005, p148-157, 10p
Publication Year :
2005

Abstract

The all-to-all broadcast collective operation is essential for many parallel scientific applications. In the context of MPI, this collective operation is called MPI_Allgather. Contemporary MPI software stacks implement this collective on top of MPI point-to-point calls, leading to several performance overheads. In this paper, we propose a design of all-to-all broadcast using the Remote Direct Memory Access (RDMA) feature offered by InfiniBand, an emerging high-performance interconnect. Our RDMA-based design eliminates the overheads associated with existing designs. Our results indicate that the latency of the all-to-all broadcast operation can be reduced by 30% for 32 processes and a message size of 32 KB. In addition, our design can improve the latency by a factor of 4.75 under no-buffer-reuse conditions for the same process count and message size. Further, our design can improve the performance of a parallel matrix multiplication algorithm by 37% on eight processes while multiplying a 256x256 matrix. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISBNs :
9783540309369
Database :
Complementary Index
Journal :
High Performance Computing - HiPC 2005
Publication Type :
Book
Accession number :
32701236
Full Text :
https://doi.org/10.1007/11602569_19