Sorry, I don't understand your search. ×
Back to Search Start Over

Deep learning-guided video compression for machine vision tasks

Authors :
Aro Kim
Seung-taek Woo
Minho Park
Dong-hwi Kim
Hanshin Lim
Soon-heung Jung
Sangwoon Kwak
Sang-hyo Park
Source :
EURASIP Journal on Image and Video Processing, Vol 2024, Iss 1, Pp 1-20 (2024)
Publication Year :
2024
Publisher :
SpringerOpen, 2024.

Abstract

Abstract In the video compression industry, video compression tailored to machine vision tasks has recently emerged as a critical area of focus. Given the unique characteristics of machine vision, the current practice of directly employing conventional codecs reveals inefficiency, which requires compressing unnecessary regions. In this paper, we propose a framework that more aptly encodes video regions distinguished by machine vision to enhance coding efficiency. For that, the proposed framework consists of deep learning-based adaptive switch networks that guide the efficient coding tool for video encoding. Through the experiments, it is demonstrated that the proposed framework has superiority over the latest standardization project, video coding for machine benchmark, which achieves a Bjontegaard delta (BD)-rate gain of 5.91% on average and reaches up to a 19.51% BD-rate gain.

Details

Language :
English
ISSN :
16875281
Volume :
2024
Issue :
1
Database :
Directory of Open Access Journals
Journal :
EURASIP Journal on Image and Video Processing
Publication Type :
Academic Journal
Accession number :
edsdoj.5127105087e847289b927b5d68f733d9
Document Type :
article
Full Text :
https://doi.org/10.1186/s13640-024-00649-w