Back to Search Start Over

Two-tree algorithms for full bandwidth broadcast, reduction and scan

Authors :
Jesper Larsson Träff
Jochen Speck
Peter Sanders
Source :
Parallel Computing. 35:581-594
Publication Year :
2009
Publisher :
Elsevier BV, 2009.

Abstract

We present a new, simple algorithmic idea for the collective communication operations broadcast, reduction, and scan (prefix sums). The algorithms concurrently communicate over two binary trees which both span the entire network. By careful layout and communication scheduling, each tree communicates as efficiently as a single tree with exclusive use of the network. Our algorithms thus achieve up to twice the bandwidth of most previous algorithms. In particular, our approach beats all previous algorithms for reduction and scan. Experiments on clusters with Myrinet and InfiniBand interconnect show significant reductions in running time for all three operations sometimes even close to the best possible factor of two.

Details

ISSN :
01678191
Volume :
35
Database :
OpenAIRE
Journal :
Parallel Computing
Accession number :
edsair.doi...........81027fd2e801e0b6676c0025fd2b3fa8