Back to Search Start Over

High-performance Convolutional Neural Network Accelerator Based on Systolic Arrays and Quantization

Authors :
Shengli Lu
Wei Pang
Li Yufeng
Hao Liu
Luo Jihe
Source :
2019 IEEE 4th International Conference on Signal and Image Processing (ICSIP).
Publication Year :
2019
Publisher :
IEEE, 2019.

Abstract

In recent years, convolutional neural networks (CNN) has achieved great success in computer vision tasks, such as object detection, face recognition and image classification. With the diversification of CNN models, accelerators that only support a single network can no longer meet the needs of applications. Due to the computational intensiveness of convolution operations, the implementation of CNN on the FPGA platform faces many challenges. In this paper, a convolution unit based on systolic arrays is proposed in the design of CNN accelerator, and the fixed-point quantization method is adopted to save a large amount of storage resources and reduce the required transmission bandwidth, thus improving throughput and power efficiency. The performance density and power efficiency of our design can reach 0.165 GOPS/DSP and 36.3 GOPS/W under 100MHz clock frequency.

Details

Database :
OpenAIRE
Journal :
2019 IEEE 4th International Conference on Signal and Image Processing (ICSIP)
Accession number :
edsair.doi...........73e3a7d06dc6192d60c94f8cab280266
Full Text :
https://doi.org/10.1109/siprocess.2019.8868327