Back to Search
Start Over
Exploiting Parallelism of Imperfect Nested Loops on Coarse-Grained Reconfigurable Architectures
- Source :
- IEEE Transactions on Parallel and Distributed Systems. 27:3199-3213
- Publication Year :
- 2016
- Publisher :
- Institute of Electrical and Electronics Engineers (IEEE), 2016.
-
Abstract
- Coarse-grained reconfigurable architecture (CGRA) is a promising parallel computing platform that provides high performance, high power efficiency and flexibility. However, for imperfect nested loops, the existing loop mapping methods often result in low execution performance and poor hardware utilization. To tackle this problem, this paper makes three contributions: 1) a highly effective and general approach to map imperfect loops on CGRA; 2) a global optimization strategy to search the optimal initiation intervals (IIs); 3) a powerful kernel compression method to reduce the oversized kernel. Experiment results show that our approach can reduce the total computing latency by 20.5, 58.5 and 73.2 percent compared to the state-of-the-art approaches on $2 \times 2$ , $4 \times 4$ and $8 \times 8$ CGRA respectively. Moreover, the compilation time and configuration context size is acceptable in practice.
- Subjects :
- Computer science
02 engineering and technology
Parallel computing
020202 computer hardware & architecture
Software pipelining
Computational Theory and Mathematics
Hardware and Architecture
Signal Processing
0202 electrical engineering, electronic engineering, information engineering
020201 artificial intelligence & image processing
Imperfect
Nested loop join
Field-programmable gate array
Global optimization
Subjects
Details
- ISSN :
- 10459219
- Volume :
- 27
- Database :
- OpenAIRE
- Journal :
- IEEE Transactions on Parallel and Distributed Systems
- Accession number :
- edsair.doi...........a23fd0c2f91a36663ae88c9753efc041
- Full Text :
- https://doi.org/10.1109/tpds.2016.2531678