Back to Search Start Over

Easy, fast, and energy-efficient object detection on heterogeneous on-chip architectures

Authors :
Ehsan Totoni
María Jesús Garzarán
Mert Dikmen
Source :
ACM Transactions on Architecture and Code Optimization. 10:1-25
Publication Year :
2013
Publisher :
Association for Computing Machinery (ACM), 2013.

Abstract

We optimize a visual object detection application (that uses Vision Video Library kernels) and show that OpenCL is a unified programming paradigm that can provide high performance when running on the Ivy Bridge heterogeneous on-chip architecture. We evaluate different mapping techniques and show that running each kernel where it fits the best and using software pipelining can provide 1.91 times higher performance and 42% better energy efficiency. We also show how to trade accuracy for energy at runtime. Overall, our application can perform accurate object detection at 40 frames per second (fps) in an energy-efficient manner.

Details

ISSN :
15443973 and 15443566
Volume :
10
Database :
OpenAIRE
Journal :
ACM Transactions on Architecture and Code Optimization
Accession number :
edsair.doi...........15877c16761bd7307830176df57db9f2
Full Text :
https://doi.org/10.1145/2541228.2555302