Power-law initialization algorithm for convolutional neural networks.

Authors :
Jiang, Kaiwen
Liu, Jian
Xing, Tongtong
Li, Shujing
Wu, Shunyao
Shao, Fengjing
Sun, Rencheng
Source :
Neural Computing & Applications. Oct 2023, Vol. 35 Issue 30, p22431-22447. 17p.
Publication Year :
2023

Abstract

Well-honed CNN architectures trained on massive labeled image datasets are the state-of-the-art solution in many fields. In this paper, the weights of five commonly used pre-trained models are analyzed to extract their numerical characteristics and spatial distribution law. The general characteristics are: (1) the weights of a single convolutional layer follow a symmetric power-law distribution; (2) the power exponent is relatively large at the center of a convolutional kernel and decreases radially from the center; (3) the power exponents across layers range continuously from -0.5 to -3.5. Based on these findings, a weight initialization method is proposed to speed up convergence and improve the performance of CNN models. The proposed method is compared with several commonly used initialization methods. Extensive experiments show that it improves the convergence speed of CNN models and improves model accuracy by 1–3%. [ABSTRACT FROM AUTHOR]
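
The abstract does not give the initialization procedure itself, so the following is only an illustrative sketch of how weights with a symmetric power-law magnitude and a radially varying exponent might be sampled. Everything in it is an assumption rather than the authors' method: the function names, the truncation bounds `lo` and `hi` (needed so the density can be normalized), the linear interpolation of the exponent with distance from the kernel center, and the example exponents, which are simply picked from inside the reported -0.5 to -3.5 range.

```python
import math
import random


def sample_symmetric_power_law(exponent, lo, hi, rng):
    """Draw one value whose magnitude has density proportional to
    x**exponent on [lo, hi], with a random sign (hypothetical helper)."""
    u = rng.random()
    k = exponent + 1.0
    if abs(k) < 1e-12:
        # exponent == -1: the CDF is logarithmic, so invert it separately
        magnitude = lo * (hi / lo) ** u
    else:
        # inverse-CDF sampling for a truncated power law
        a, b = lo ** k, hi ** k
        magnitude = (a + u * (b - a)) ** (1.0 / k)
    return magnitude if rng.random() < 0.5 else -magnitude


def radial_exponents(size, center_exp, edge_exp):
    """Per-position exponents for a size x size kernel. Linear
    interpolation by distance from the center is an assumption."""
    c = (size - 1) / 2.0
    max_r = math.hypot(c, c) or 1.0  # avoid division by zero for 1x1 kernels
    return [
        [center_exp + (edge_exp - center_exp) * math.hypot(i - c, j - c) / max_r
         for j in range(size)]
        for i in range(size)
    ]


def init_kernel(size, center_exp, edge_exp, lo=1e-3, hi=1.0, seed=0):
    """Sample one kernel; lo, hi, and seed are illustrative defaults."""
    rng = random.Random(seed)
    exps = radial_exponents(size, center_exp, edge_exp)
    return [
        [sample_symmetric_power_law(exps[i][j], lo, hi, rng) for j in range(size)]
        for i in range(size)
    ]


# Example: a 3x3 kernel with arbitrarily chosen exponents
kernel = init_kernel(3, center_exp=-1.5, edge_exp=-2.5)
```

A practical initializer would also need to rescale the sampled weights per layer (for example by fan-in), which this sketch leaves out; the full text linked under "Full Text" is the place to check the authors' actual procedure.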

Details

Language :
English
ISSN :
0941-0643
Volume :
35
Issue :
30
Database :
Academic Search Index
Journal :
Neural Computing & Applications
Publication Type :
Academic Journal
Accession number :
171995075
Full Text :
https://doi.org/10.1007/s00521-023-08881-7