Two-Stage Multi-Hypothesis Network for Compressed Video Sensing&nbsp;Reconstruction Algorithms Based on Deep Learning

YANG Chunling LING Xi

doi:10.12141/j.issn.1000-565X.200623

Journal of South China University of Technology(Natural Science) >

2021 , Vol. 49 >Issue 6: 88 - 99

DOI: https://doi.org/10.12141/j.issn.1000-565X.200623

Electronics, Communication & Automation Technology

Two-Stage Multi-Hypothesis Network for Compressed Video Sensing Reconstruction Algorithms Based on Deep Learning

Expand

School of Electronics and Information, South China University of Technology, Guangzhou 510640, Guangdong, China

杨春玲（1970-），女，教授，主要从事图像/视频压缩编码、图像质量评价研究。

Received date: 2020-10-19

Revised date: 2021-02-05

Online published: 2021-02-22

Supported by

Supported by the Key Program of Natural Science Foundation of Guangdong Province（2017A030311028）and the Natural Science Foundation of Guangdong Province(2019A1515011949)

Fold

Abstract

Traditional Compressed Video Sensing (CVS) reconstruction algorithm is highly time-consuming. Newly developed CVS neural networks can successfully deal with the speed problem, but it fails to make full use of the spatiotemporal correlation of video and leads to a poor performance. To solve this problem, a novel two-stage multi-hypothesis neural network (2sMHNet) was proposed. Firstly, the Temporal Deformable Alignment Network（TDAN）was used to realize pixel based multi-hypothesis prediction. While avoiding block effects, it improves the matching accuracy of the hypothesis set and obtains accurate multi-hypothesis weights by adaptively parameters learning. Then, the residual reconstruction module was constructed to reconstruct the prediction residual with measurements to further improve the reconstruction quality. Finally, in order to make full use of the inter-frame correlation, a two-stage serial reconstruction mode was proposed. In the first stage, as the reconstructed key frames have rich details, they are selected as the reference frame to improve the non-key frames quality. In the second stage, the more relevant adjacent frames are used for motion compensation, which is more conducive to fast and complex sequences. Experimental results demonstrate that the proposed 2sMHNet outperforms the existing good CVS reconstruction algorithms.

Key words： compressed video sensing reconstruction algorithm; deep learning; temporal deformable alignment network; reconstruction performance

Cite this article

YANG Chunling LING Xi . Two-Stage Multi-Hypothesis Network for Compressed Video Sensing Reconstruction Algorithms Based on Deep Learning[J]. Journal of South China University of Technology(Natural Science), 2021 , 49(6) : 88 -99 . DOI: 10.12141/j.issn.1000-565X.200623

Options

Outlines

模态框（Modal）标题

Abstract

Cite this article