1. Closed-Loop Region of Interest Enabling High Spatial and Temporal Resolutions in Object Detection and Tracking via Wireless Camera
- Author
-
Jack Chen, Hen-Wei Huang, Philipp Rupp, Anjali Sinha, Claas Ehmke, and Giovanni Traverso
- Subjects
General Computer Science ,Computer science ,business.industry ,Machine vision ,ComputingMethodologies_IMAGEPROCESSINGANDCOMPUTERVISION ,General Engineering ,object detection ,Field of view ,Image processing ,Object detection ,TK1-9971 ,real-time systems ,Region of interest ,Temporal resolution ,Video tracking ,General Materials Science ,Computer vision ,region of interest ,Electrical engineering. Electronics. Nuclear engineering ,Artificial intelligence ,business ,spatial resolution ,Image resolution ,object tracking - Abstract
The trade-off between spatial and temporal resolution remains a fundamental challenge in machine vision. A captured image often contains a significant amount of redundant information, and only a small region of interest (ROI) is necessary for object detection and tracking. In this paper, we first systematically characterize the effects of ROI on camera capturing, data transmission, and image processing. We then present the closed-loop ROI algorithm capable of high spatial and temporal resolution as well as wide scanning field of view (FOV) in single and multi-object detection and tracking via real-time wireless video streaming. With the feedback from real-time object tracking, the wireless camera is able to capture and transmit only the ROI which in turn enhances both the spatial and temporal resolution in object tracking. In addition, the proposed approach can still maintain a large FOV by processing regions outside of the ROI at lower spatial and temporal resolutions. When applied to a high spatial resolution wireless stream (5 MegaPixels), the closed-loop ROI algorithm improves the temporal resolution by up to $10\times $ (from 2.4FPS to 22.5FPS). Specifically, camera processing is improved by up to $4.7\times $ , data transmission is improved by up to $160\times $ , and PC processing is improved by up to $2.5\times $ . In a person tracking experiment, the closed-loop ROI algorithm enables a wide-angle camera to outperform both a normal wide-angle camera–which suffers from poor temporal resolution and motion blur–and a pan & tilt camera–which cannot automatically refresh tracking after the tracking is lost.
- Published
- 2021
- Full Text
- View/download PDF