Back to Search Start Over

NR-CNN: Nested-Residual Guided CNN In-loop Filtering for Video Coding

Authors :
Kai Lin
Chuanmin Jia
Xinfeng Zhang
Shanshe Wang
Siwei Ma
Wen Gao
Source :
ACM Transactions on Multimedia Computing, Communications, and Applications. 18:1-22
Publication Year :
2022
Publisher :
Association for Computing Machinery (ACM), 2022.

Abstract

Recently, deep learning for video coding, such as deep predictive coding, deep transform coding, and deep in-loop filtering, has been an emerging research area. The coding gain of hybrid coding framework could be extensively promoted by the data-driven models. However, previous deep coding tools especially deep in-loop filtering mainly consider the performance improvement while pay less attention to the reliability, usability, and adaptivity of the networks. In this article, a nested-residual guided convolutional neural network (NR-CNN) structure with cascaded global shortcut and configurable residual blocks is proposed for in-loop filtering. By taking advantage of the correlation between different color components, we further extend the NR-CNN by utilizing luminance as textural and structural guidance for chrominance filtering, which significantly improves the filtering performance. To fully exploit the proposed network into codec integration, we subsequently introduce an efficient and adaptive framework consisting of an adaptive granularity optimization and a parallel inference pipeline for deep learning based filtering. The former contributes to the coding performance improvement through an adaptive decision-making based on rate-distortion analysis at various granularities. The latter reduces the running time of network inference. The extensive experimental results show the superiority of the proposed method, achieving 8.2%, 14.9%, and 13.2% BD-rate savings on average under random access (RA) configuration. Meanwhile, the proposed method also obtains better subjective quality.

Details

ISSN :
15516865 and 15516857
Volume :
18
Database :
OpenAIRE
Journal :
ACM Transactions on Multimedia Computing, Communications, and Applications
Accession number :
edsair.doi...........1a5fafa5c7db8333bb14dc24be3c5a71
Full Text :
https://doi.org/10.1145/3502723