Advances in the Neural Network Quantization: A Comprehensive Review.

Authors :
Wei, Lu
Ma, Zhong
Yang, Chaojie
Yao, Qin
Source :
Applied Sciences (2076-3417); Sep2024, Vol. 14 Issue 17, p7445, 14p
Publication Year :
2024

Abstract

Artificial intelligence technologies based on deep convolutional neural networks and large language models have made significant breakthroughs in many tasks, such as image recognition, target detection, semantic segmentation, and natural language processing, but also face a conflict between the high computational capacity of the algorithms and limited deployment resources. Quantization, which converts floating-point neural networks into low-bit-width integer networks, is an important and essential technique for efficient deployment and cost reduction in edge computing. This paper analyzes various existing quantization methods, showcases the deployment accuracy of advanced techniques, and discusses the future challenges and trends in this domain. [ABSTRACT FROM AUTHOR]
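
As a rough illustration of what the abstract means by converting floating-point values into low-bit-width integers, the following is a minimal sketch of uniform affine quantization in plain Python. It is a generic textbook-style scheme and is not taken from the paper; the function names `quantize` and `dequantize`, the 8-bit default, and the sample weights are assumptions made for illustration only.

```python
def quantize(values, num_bits=8):
    """Map floats to unsigned integers using an affine (scale, zero-point) scheme."""
    qmin, qmax = 0, 2 ** num_bits - 1
    # Extend the range to include 0.0 so that zero is exactly representable.
    lo = min(min(values), 0.0)
    hi = max(max(values), 0.0)
    # Step size between adjacent integer levels; fall back to 1.0 for all-zero input.
    scale = (hi - lo) / (qmax - qmin) or 1.0
    # Integer level that represents the real value 0.0.
    zero_point = round(qmin - lo / scale)
    # Round to the nearest level and clamp into the representable range.
    q = [min(qmax, max(qmin, round(v / scale) + zero_point)) for v in values]
    return q, scale, zero_point


def dequantize(q, scale, zero_point):
    """Recover approximate floats from the integer levels."""
    return [scale * (x - zero_point) for x in q]


# Hypothetical weights, chosen only to exercise the functions.
weights = [-1.0, -0.5, 0.0, 0.25, 1.0]
q, scale, zero_point = quantize(weights)
restored = dequantize(q, scale, zero_point)
max_error = max(abs(w - r) for w, r in zip(weights, restored))
```

In this sketch the reconstruction error of each value is bounded by roughly one quantization step (`scale`), which is the accuracy cost traded for storing each value in `num_bits` bits instead of a 32-bit float.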

Details

Language :
English
ISSN :
20763417
Volume :
14
Issue :
17
Database :
Complementary Index
Journal :
Applied Sciences (2076-3417)
Publication Type :
Academic Journal
Accession number :
179649968
Full Text :
https://doi.org/10.3390/app14177445