Back to Search
Start Over
MultiMaKe
- Source :
- ACM Transactions on Embedded Computing Systems. 12:1-25
- Publication Year :
- 2013
- Publisher :
- Association for Computing Machinery (ACM), 2013.
-
Abstract
- The increasing demand for low-power and high-performance multimedia embedded systems has motivated the need for effective solutions to satisfy application bandwidth and latency requirements under a tight power budget. As technology scales, it is imperative that applications are optimized to take full advantage of the underlying resources and meet both power and performance requirements. We propose MultiMaKe, an application mapping design flow capable of discovering and enabling parallelism opportunities via code transformations, efficiently distributing the computational load across resources, and minimizing unnecessary data transfers. Our approach decomposes the application's tasks into smaller units of computations called kernels, which are distributed and pipelined across the different processing resources. We exploit the ideas of inter-kernel data reuse to minimize unnecessary data transfers between kernels, early execution edges to drive performance, and kernel pipelining to increase system throughput. Our experimental results on JPEG and JPEG2000 show up to 97% off-chip memory access reduction, and up to 80% execution time reduction over standard mapping and task-level pipelining approaches.
Details
- ISSN :
- 15583465 and 15399087
- Volume :
- 12
- Database :
- OpenAIRE
- Journal :
- ACM Transactions on Embedded Computing Systems
- Accession number :
- edsair.doi...........33674eee17a2f6edaa6cb094a9308d73
- Full Text :
- https://doi.org/10.1145/2435227.2435255