Back to Search Start Over

3PXNet

Authors :
Tianmu Li
Puneet Gupta
W. Romaszkan
Source :
ACM Transactions on Embedded Computing Systems. 19:1-23
Publication Year :
2020
Publisher :
Association for Computing Machinery (ACM), 2020.

Abstract

As the adoption of Neural Networks continues to proliferate different classes of applications and systems, edge devices have been left behind. Their strict energy and storage limitations make them unable to cope with the sizes of common network models. While many compression methods such as precision reduction and sparsity have been proposed to alleviate this, they don’t go quite far enough. To push size reduction to its absolute limits, we combine binarization with sparsity in Pruned-Permuted-Packed XNOR Networks (3PXNet), which can be efficiently implemented on even the smallest of embedded microcontrollers. 3PXNets can reduce model sizes by up to 38X and reduce runtime by up to 3X compared with already compact conventional binarized implementations with less than 3% accuracy reduction. We have created the first software implementation of sparse-binarized Neural Networks, released as open source library targeting edge devices. Our library is complete with training methodology and model generating scripts, making it easy and fast to deploy.

Details

ISSN :
15583465 and 15399087
Volume :
19
Database :
OpenAIRE
Journal :
ACM Transactions on Embedded Computing Systems
Accession number :
edsair.doi...........a1421cdbd495f0ddb93a58077a975759
Full Text :
https://doi.org/10.1145/3371157