Back to Search
Start Over
Mapping stream programs onto heterogeneous multiprocessor systems
- Source :
- Recercat. Dipósit de la Recerca de Catalunya, instname, UPCommons. Portal del coneixement obert de la UPC, Universitat Politècnica de Catalunya (UPC), CASES
- Publisher :
- ACM Press, NY
-
Abstract
- This paper presents a partitioning and allocation algorithm for an iterative stream compiler, targeting heterogeneous multiprocessors with constrained distributed memory and any communications topology. We introduce a novel definition of connectedness that enables the algorithm to model the capabilities of the compiler. The algorithm uses convexity and connectedness constraints to produce partitions that are easier to compile and require short pipelines. Software pipelining is an effective transformation, but it increases memory footprint and latency, and has a startup overhead. Our algorithm takes account of these downstream costs. We show results for the StreamIt 2.1.1 benchmarks for an SMP, 2 × 2 mesh, SMP plus accelerator, and IBM QS20 blade, which has two Cell processors. Our results show that the average performance is within 5% of the unrestricted optimum found using a brute force search, while seldom requiring software pipelining. The heuristic is robust, and fast enough to be inside the feedback loop of an iterative compiler.
- Subjects :
- Multi-core processor
Social connectedness
Computer science
Compiladors (Programes d'ordinador)
Brute-force search
Multiprocessing
Parallel computing
Multiprocessadors
computer.software_genre
Informàtica::Llenguatges de programació [Àrees temàtiques de la UPC]
Software pipelining
Memory footprint
Multiprocessors
Distributed memory
Compiler
computer
Compilers (Computer programs)
Subjects
Details
- Database :
- OpenAIRE
- Journal :
- Recercat. Dipósit de la Recerca de Catalunya, instname, UPCommons. Portal del coneixement obert de la UPC, Universitat Politècnica de Catalunya (UPC), CASES
- Accession number :
- edsair.doi.dedup.....fca64ee0b8c14ccaa70d1536946a5021