Journal of South China University of Technology(Natural Science Edition) ›› 2022, Vol. 50 ›› Issue (6): 80-90.doi: 10.12141/j.issn.1000-565X.210507

Special Issue: 2022年电子、通信与自动控制

• Electronics, Communication & Automation Technology • Previous Articles     Next Articles

Feature-domain Multi-Hypothesis Prediction Reconstruction Neural Network for Compressed Video Sensing

YANG Chunling  LING Xi  LÜ Zeyu#br#   

  1. School of Electronic and Information Engineering,South China University of Technology,Guangzhou 510640,Guangdong,China
  • Received:2021-08-12 Revised:2021-12-02 Online:2022-06-25 Published:2021-12-31
  • Contact: 杨春玲 (1970-),女,教授,主要从事图像/视频压缩编码、图像质量评价、图像/视频压缩感知重构研究。 E-mail:eeclyang@ scut. edu. cn
  • About author:杨春玲 (1970-),女,教授,主要从事图像/视频压缩编码、图像质量评价、图像/视频压缩感知重构研究。
  • Supported by:
    Supported by the Natural Science Foundation of Guangdong Province (2017A030311028,2019A1515011949)

Abstract: In the prediction-residual reconstruction framework, multi-hypothesis prediction based on temporal correlation is the key step of compressed video sensing reconstruction. This paper studies the accuracy prediction method by utilizing rich features based on deep learning, and a novel feature-domain multi-hypothesis reconstruction network for compressed video sensing (FMH_CVSNet) is proposed. In FMH_CVSNet, the feature domain multi-hypothesis prediction module (FMH_Module) is firstly proposed, which improves the prediction ability by reasonably constructing the motion estimation module and the hypothesis weight calculation module based on the characteristics of video signal. Secondly, the two-stage multi-reference motion compensation mode is proposed, which makes the constructed hypothesis sets much better for sequences with different motion and the further improves the prediction accuracy. Simulation results show that FMH_CVSNet achieves better reconstruction performance under various experimental conditions, improves by 4.76dB compared with the traditional multi-hypothesis algorithm 2sMHR and improves by 3.87dB compared with CNN based compressed video sensing reconstruction algorithm VCSNet-2.

Key words: compressed video sensing, deep learning, multi-hypothesis prediction, adaptive hypothesis weight, multiple reference frame, video motion feature

CLC Number: