Back to Search Start Over

Speeding up VP9 Intra Encoder with Hierarchical Deep Learning Based Partition Prediction.

Authors :
Paul S
Norkin A
Bovik AC
Source :
IEEE transactions on image processing : a publication of the IEEE Signal Processing Society [IEEE Trans Image Process] 2020 Jul 28; Vol. PP. Date of Electronic Publication: 2020 Jul 28.
Publication Year :
2020
Publisher :
Ahead of Print

Abstract

In VP9 video codec, the sizes of blocks are decided during encoding by recursively partitioning 64×64 superblocks using rate-distortion optimization (RDO). This process is computationally intensive because of the combinatorial search space of possible partitions of a superblock. Here, we propose a deep learning based alternative framework to predict the intra-mode superblock partitions in the form of a four-level partition tree, using a hierarchical fully convolutional network (H-FCN). We created a large database of VP9 superblocks and the corresponding partitions to train an H-FCN model, which was subsequently integrated with the VP9 encoder to reduce the intra-mode encoding time. The experimental results establish that our approach speeds up intra-mode encoding by 69.7% on average, at the expense of a 1.71% increase in the Bjøntegaard-Delta bitrate (BD-rate). While VP9 provides several built-in speed levels which are designed to provide faster encoding at the expense of decreased rate-distortion performance, we find that our model is able to outperform the fastest recommended speed level of the reference VP9 encoder for the good quality intra encoding configuration, in terms of both speedup and BD-rate.

Details

Language :
English
ISSN :
1941-0042
Volume :
PP
Database :
MEDLINE
Journal :
IEEE transactions on image processing : a publication of the IEEE Signal Processing Society
Publication Type :
Academic Journal
Accession number :
32746243
Full Text :
https://doi.org/10.1109/TIP.2020.3011270