Computer Science & Technology

A Multi-Feature Incremental Learning Neural Network for the Quality Enhancement of Video Reconstructed Pictures in H.265/HEVC

  • 1. School of Information Science and Engineering, Hangzhou Normal University, Hangzhou 311121, Zhejiang, China; 2. Guangzhou NINED LLC, Guangzhou 511400, Guangdong, China
DING Dandan (born 1983), female, lecturer; her research focuses on video and image processing and video coding.

Received date: 2018-08-25

  Online published: 2018-11-01

Supported by

Supported by the National Key R&D Program of China (Grant No. 2017YFB1002803) and the National-Level College Students' Innovative Entrepreneurial Training Plan Program (201810346015)

Abstract

The new-generation video coding standard H.265/HEVC employs an in-loop filter, consisting of a deblocking filter (DBF) and a sample adaptive offset (SAO) filter, to remove blocking artifacts and reduce the distortion of reconstructed video frames. Both DBF and SAO originate from signal processing theory, and their algorithms and parameters are designed and set manually. Although their computational complexity is relatively low, such filters may not handle all kinds of content well, because natural videos are far more complex. This paper formulates the loop-filter problem in video coding as an end-to-end regression problem that can be solved by a deep neural network. The mapping from reconstructed frames to original frames is learned automatically, so that the differences between them are minimized. The proposed Multi-Feature Incremental Learning Network (MFILNet) has 35 layers. The network adopts a global residual learning strategy and cascades several Feature Incremental Learning Blocks (FIBs) to extract features at different levels. Useful features are thereby extracted, selected, and enhanced, which improves the perceptual ability of the network. Within each FIB, convolutional kernels of variable size are adopted. Inspired by DenseNet, features from different layers are fused to facilitate information flow among layers. Experimental results show that, by combining density and sparsity, the learning capability and generalization capability of the proposed network are considerably improved, and both the objective and the subjective quality of the compressed video frames improve significantly. When the proposed model replaces the DBF and SAO in H.265/HEVC, a BD-rate reduction of up to 11.2% and 6.32% on average is obtained. When the model is instead applied after the DBF and SAO, an average BD-rate saving of 5.24% is obtained.
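The structure described in the abstract (global residual learning, cascaded blocks, kernels of more than one size, and DenseNet-style feature fusion) can be illustrated with a small toy model. The sketch below is not the authors' MFILNet: the channel counts, the choice of 3x3 and 5x5 kernels, the 1x1 fusion layer, the number of blocks, and the random untrained weights are all assumptions made for illustration, and every function name (`conv2d`, `fib`, `enhance`, `make_params`) is hypothetical. It uses only the Python standard library and operates on a single-channel frame stored as nested lists.

```python
import random

def conv2d(x, w):
    """Zero-padded 'same' convolution. x: [C_in][H][W], w: [C_out][C_in][k][k]."""
    h, wd = len(x[0]), len(x[0][0])
    k = len(w[0][0])
    p = k // 2
    out = []
    for w_out in w:
        plane = [[0.0] * wd for _ in range(h)]
        for ci, w_in in enumerate(w_out):
            for i in range(h):
                for j in range(wd):
                    s = 0.0
                    for a in range(k):
                        for b in range(k):
                            ii, jj = i + a - p, j + b - p
                            if 0 <= ii < h and 0 <= jj < wd:
                                s += w_in[a][b] * x[ci][ii][jj]
                    plane[i][j] += s
        out.append(plane)
    return out

def relu(x):
    return [[[max(0.0, v) for v in row] for row in plane] for plane in x]

def rand_w(rng, c_out, c_in, k, scale=0.05):
    """Random (untrained) weights; a real model would learn these."""
    return [[[[rng.uniform(-scale, scale) for _ in range(k)]
              for _ in range(k)] for _ in range(c_in)] for _ in range(c_out)]

def fib(x, params):
    """Toy feature block (assumed layout): parallel 3x3 and 5x5 branches,
    dense concatenation with the block input, then a 1x1 fusion layer."""
    w3, w5, w1 = params
    f3 = relu(conv2d(x, w3))
    f5 = relu(conv2d(x, w5))
    fused = x + f3 + f5  # list concatenation along the channel axis
    return relu(conv2d(fused, w1))

def enhance(frame, head, blocks, tail):
    """Global residual learning: output = input frame + predicted residual."""
    feat = relu(conv2d([frame], head))
    for params in blocks:  # cascaded blocks
        feat = fib(feat, params)
    residual = conv2d(feat, tail)[0]
    return [[f + r for f, r in zip(frow, rrow)]
            for frow, rrow in zip(frame, residual)]

def make_params(channels=4, growth=2, n_blocks=2, seed=0):
    """Assumed toy sizes; far smaller than the 35-layer network in the paper."""
    rng = random.Random(seed)
    head = rand_w(rng, channels, 1, 3)
    blocks = [(rand_w(rng, growth, channels, 3),
               rand_w(rng, growth, channels, 5),
               rand_w(rng, channels, channels + 2 * growth, 1))
              for _ in range(n_blocks)]
    tail = rand_w(rng, 1, channels, 3)
    return head, blocks, tail

# Usage: run the toy network on a random 6x5 "reconstructed frame".
rng = random.Random(1)
frame = [[rng.random() for _ in range(5)] for _ in range(6)]
head, blocks, tail = make_params()
enhanced = enhance(frame, head, blocks, tail)
print(len(enhanced), len(enhanced[0]))
```

Because the network only predicts a residual that is added back to its input, a tail layer with all-zero weights returns the input frame unchanged. This is one common motivation for global residual learning in restoration tasks: the network only has to model the (typically small) difference between the reconstructed and original frames.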

Cite this article

DING Dandan, CHEN Jingsen, FEI Jialuo, TONG Junchao, PAN Zhigeng, YAO Zhengwei. A Multi-Feature Incremental Learning Neural Network for the Quality Enhancement of Video Reconstructed Pictures in H.265/HEVC[J]. Journal of South China University of Technology (Natural Science), 2018, 46(12): 42-50. DOI: 10.3969/j.issn.1000-565X.2018.12.006
