Back to Search Start Over

Conv-inheritance: A hardware-efficient method to compress convolutional neural networks for edge applications.

Authors :
Yang, Yang
Wang, Chao
Gong, Lei
Wu, Min
Zhou, Xuehai
Source :
Neurocomputing. May2022, Vol. 487, p172-180. 9p.
Publication Year :
2022

Abstract

Convolutional Neural Networks (CNNs) have won tremendous success in various applications, such as image recognition and natural language processing. Due to the computing-intensive and memory-intensive features of CNN models, it is challenging to deploy them on devices with limited resources and tight power budgets. To address this limitation, pruning, quantization, weights sharing, and other methods have been proposed as efficient solutions to compress CNN models. However, these works usually do not optimize the hardware design when compressing the CNNs. In this paper, we introduce Conv-Inheritance, a hardware-efficient compression method for CNNs by reducing the number of convolution operations to simultaneously reduce the inference time and hardware computing resources on chip of CNN edge devices. We also develop an extended version, called adaptive Conv-Inheritance, to further improve the inference performance by considering the similarity between pixels. We applied our Conv-Inheritance method to compressing various CNN models and performed extensive experiments on different image datasets. Experimental results demonstrate that Conv-Inheritance is both hardware-efficient and time-efficient on edge devices. Besides, Conv-Inheritance is a plug-and-play compression method for CNNs, and we can run Conv-Inheritance on top of other compression methods without any conflicts. [ABSTRACT FROM AUTHOR]

Details

Language :
English
ISSN :
09252312
Volume :
487
Database :
Academic Search Index
Journal :
Neurocomputing
Publication Type :
Academic Journal
Accession number :
156026499
Full Text :
https://doi.org/10.1016/j.neucom.2021.02.106