Journal of South China University of Technology (Natural Science Edition)

• Computer Science & Technology • Previous Articles     Next Articles

A Multi-Feature Incremental Learning Neural Network for the Quality Enhancement of Video Reconstructed Pictures in H. 265/HEVC

DING Dandan1 CHEN Jingsen1 FEI Jialuo1 TONG Junchao1 PAN Zhigeng1,2 YAO Zhengwei1   

  1. 1. School of Information Science and Engineering,Hangzhou Normal University,Hangzhou 311121,Zhejiang,China; 2. Guangzhou NINED LLC,Guangzhou 511400,Guangdong,China
  • Received:2018-08-25 Online:2018-12-25 Published:2018-11-01
  • Contact: 丁丹丹(1983-),女,讲师,主要从事视频图像处理、视频编码研究. E-mail:DandanDing@hznu.edu.cn
  • About author:丁丹丹(1983-),女,讲师,主要从事视频图像处理、视频编码研究.
  • Supported by:
    Supported by the National Key R&D Program of China under Grant (2017YFB1002803) and the National-Level Collage Student’s Innovative Entrepreneurial Training Plan Program (201810346015)

Abstract: The new generation video coding standard H. 265/HEVC employs in-loop filter,which includes de-bloc- king (DBF) and sample adaptive offset filter (SAO),to remove the blocking artifacts and reduce the distortions of reconstructed video frames. Both of DBF and SAO originated from signal processing theory,and the corresponding algorithms and parameters are designed and set manually. Although the computational complexity is relatively low, such filters may not deal with different kinds of contents well enough as the natural videos are much more complex. This paper formulates the loop-filter problem in video coding as an end-to-end regression problem,which can be solved by deep neural network. The relationship between reconstructed frames and original frames are mapped au- tomatically and as a result,the differences between them are minimized. The proposed Multi-Feature based Incre- mental Learning Network (MFILNet) includes 35 layers. The integrated network adopts global residual learning strategy and cascades several Feature Incremental Learning Blocks (FIBs) to extract features of different levels. Consequently,useful features are finally extracted,selected and enhanced to improve the perceptual ability of the network. Within each FIB,variable convolutional kernels are adopted. Inspirited by DenseNet,features from dif- ferent layers are fused,thus to facilitate information flow among layers. Experimental results show that with the scheme of combining density and sparsity,learning capability and generalization capability of the proposed network are boosted tremendously. Both objective and subjective quality of the video compressed frames is improved signifi- cantly. Consequently,the proposed network model is used to substitute the DBF and SAO in H. 265/HEVC. Up to 11. 2% and averaged 6. 32% BD-rate reduction is obtained. The model is also used after the DBF and SAO, 5. 24% BD-rate saving can be obtained in average.

Key words:  H. 265/HEVC, in-loop filter, convolutional neural network, incremental learning