Journal of South China University of Technology(Natural Science Edition) ›› 2023, Vol. 51 ›› Issue (9): 82-89.doi: 10.12141/j.issn.1000-565X.220825

• Computer Science & Technology • Previous Articles     Next Articles

AdfNet: An Adaptive Deep Forgery Detection Network Based on Diverse Features

LI Jiachun LI Bowen LIN Weiwei   

  1. School of Computer Science and Engineering,South China University of Technology,Guangzhou 510006,Guangdong,China
  • Received:2022-12-24 Online:2023-09-25 Published:2023-04-20
  • Contact: 李家春(1968-),女,博士,副教授,主要从事计算机网络与信息安全、隐私保护、人工智能安全、智慧教学等研究。 E-mail:jclee@scut.edu.cn
  • About author:李家春(1968-),女,博士,副教授,主要从事计算机网络与信息安全、隐私保护、人工智能安全、智慧教学等研究。
  • Supported by:
    Guangdong Provincial Key R&D Program(2021B0101420002);the Ministry of Education’s Cooperative Education Project(201902186007)

Abstract:

The harm caused by video tampering has been endangering people’s lives, which makes deep forgery detection technology gradually obtain widespread attention and development. However, current detection methods could not effectively capture noisy residuals due to the use of inflexible constraints. In addition, they ignore the correlation between texture and semantic features and the impact of temporal features on detection performance improvement. To solve these problems, this paper proposed an adaptive network (AdfNet) with diverse features for deep forgery detection. It helps the classifier to judge authenticity by extracting semantic features, texture features and temporal features. The paper explored the adaptive texture noise extraction (ATNEM) mechanism, and flexibly captured the noise residuals in non-fixed frequency bands through unpooled feature mapping and frequency-based channel attention mechanism. The deep semantic analysis guidance strategy (DSAGS) was designed to highlight the tampering traces through spatial attention mechanism, and guide the feature extractor to focus on the deep features of the focus region. The paper studied multi-scale temporal feature processing (MTFPM), and used temporal attention mechanism to assign weights to different video frames and capture the difference of time series in tampered videos. The experimental results show that the ACC score of the proposed network in the HQ mode of FaceForensics++(FF++) dataset is 97.41%, which is significantly better than that of the existing mainstream algorithms. Moreover, while maintaining the AUC value of 99.80% on the FF++ dataset, the AUC value can reach 76.41% on Celeb-DF, reflecting strong generalization.

Key words: deep learning, deep forgery detection, multi-scale temporal feature, attention mechanism, adaptive network

CLC Number: