
MultiMaKe

Authors :
Nikil Dutt
Sudeep Pasricha
Luis Angel D. Bathen
Yongjin Ahn
Source :
ACM Transactions on Embedded Computing Systems. 12:1-25
Publication Year :
2013
Publisher :
Association for Computing Machinery (ACM), 2013.

Abstract

The increasing demand for low-power and high-performance multimedia embedded systems has motivated the need for effective solutions to satisfy application bandwidth and latency requirements under a tight power budget. As technology scales, it is imperative that applications are optimized to take full advantage of the underlying resources and meet both power and performance requirements. We propose MultiMaKe, an application mapping design flow capable of discovering and enabling parallelism opportunities via code transformations, efficiently distributing the computational load across resources, and minimizing unnecessary data transfers. Our approach decomposes the application's tasks into smaller units of computation called kernels, which are distributed and pipelined across the different processing resources. We exploit the ideas of inter-kernel data reuse to minimize unnecessary data transfers between kernels, early execution edges to drive performance, and kernel pipelining to increase system throughput. Our experimental results on JPEG and JPEG2000 show up to a 97% reduction in off-chip memory accesses and up to an 80% reduction in execution time over standard mapping and task-level pipelining approaches.
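
The throughput benefit of kernel pipelining mentioned in the abstract can be illustrated with a toy timing model. This is only a sketch under stated assumptions, not the MultiMaKe flow itself: the function names, the four kernel times, and the block count are hypothetical, each kernel is assumed to run on its own processing resource, and transfer costs are ignored.

```python
def sequential_time(kernel_times, num_blocks):
    """Total time when each data block runs through every kernel
    before the next block is allowed to start."""
    return num_blocks * sum(kernel_times)


def pipelined_time(kernel_times, num_blocks):
    """Total time for an idealized pipeline with one kernel per resource:
    the first block fills the pipeline, then one block completes every
    time the slowest kernel finishes."""
    if num_blocks == 0:
        return 0
    return sum(kernel_times) + (num_blocks - 1) * max(kernel_times)


# Hypothetical per-block kernel times (arbitrary units), not taken from the paper.
kernel_times = [4, 6, 3, 5]
num_blocks = 100

seq = sequential_time(kernel_times, num_blocks)
pipe = pipelined_time(kernel_times, num_blocks)
print(f"sequential: {seq}, pipelined: {pipe}")
```

In this model the pipelined total is bounded by the slowest kernel, which suggests why splitting tasks into finer, more evenly sized kernels may improve throughput.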

Details

ISSN :
1558-3465 and 1539-9087
Volume :
12
Database :
OpenAIRE
Journal :
ACM Transactions on Embedded Computing Systems
Accession number :
edsair.doi...........33674eee17a2f6edaa6cb094a9308d73
Full Text :
https://doi.org/10.1145/2435227.2435255