Back to Search Start Over

BARM: A Batch-Aware Resource Manager for Boosting Multiple Neural Networks Inference on GPUs With Memory Oversubscription

Authors :
Zhao-Wei Qiu
Kun-Sheng Liu
Ya-Shu Chen
Source :
IEEE Transactions on Parallel and Distributed Systems. 33:4612-4624
Publication Year :
2022
Publisher :
Institute of Electrical and Electronics Engineers (IEEE), 2022.

Details

ISSN :
21619883 and 10459219
Volume :
33
Database :
OpenAIRE
Journal :
IEEE Transactions on Parallel and Distributed Systems
Accession number :
edsair.doi...........68728d4721d273065e4d2300aa613fde
Full Text :
https://doi.org/10.1109/tpds.2022.3199806