华南理工大学学报(自然科学版) ›› 2014, Vol. 42 ›› Issue (8): 70-76.doi: 10.3969/j.issn.1000-565X.2014.08.012

• 电子、通信与自动控制 • 上一篇    下一篇

一种双重判断机制的音频篡改盲检测算法

吕志胜 胡永健 李晗 刘琲贝   

  1. 华南理工大学 电子与信息学院,广东 广州 510640
  • 收稿日期:2014-01-25 修回日期:2014-04-12 出版日期:2014-08-25 发布日期:2014-07-01
  • 通信作者: 吕志胜(1983-),男,博士生,主要从事数字音频篡改检测研究. E-mail:zhishenglu@163.com.cn
  • 作者简介:吕志胜(1983-),男,博士生,主要从事数字音频篡改检测研究.
  • 基金资助:

    国家“973”计划项目( 2011CB707003) ; 广东省自然科学基金团队项目( 9351064101000003) ; 华南理工大学中央高校基本科研业务费专项资金资助项目( 2014ZZ0036) .

A Blind Audio Forgery Detection Algorithm Based on Dual Judgment Mechanisms

Lü Zhi-sheng Hu Yong-jian Li Han Liu Bei-bei   

  1. School of Electronic and Information Engineering,South China University of Technology,Guangzhou 510640,Guangdong,China
  • Received:2014-01-25 Revised:2014-04-12 Online:2014-08-25 Published:2014-07-01
  • Contact: 吕志胜(1983-),男,博士生,主要从事数字音频篡改检测研究. E-mail:zhishenglu@163.com.cn
  • About author:吕志胜(1983-),男,博士生,主要从事数字音频篡改检测研究.
  • Supported by:

    国家“973”计划项目( 2011CB707003) ; 广东省自然科学基金团队项目( 9351064101000003) ; 华南理工大学中央高校基本科研业务费专项资金资助项目( 2014ZZ0036) .

摘要: 插入和删除是两种常见的音频篡改操作. 针对现有基于电网频率( ENF) 信号的音频篡改检测算法对插入和删除操作定位精度不高的问题,提出一种双重判断机制的篡改盲检测算法. 机制一利用最大相关偏移量曲线来确定篡改位置; 机制二根据最大相关偏移量曲线所对应的斜率曲线来确定篡改位置. 将机制一和机制二联合使用可获得更好的篡改定位精度. 为了简化算法实现,文中还提出一种不用引入额外ENF 参考信号计算最大相关偏移量的方法. 与现有文献中两种代表性算法相比,文中算法具有更高的篡改定位准确度,且对重采样、压缩及加噪这3 种常见的音频处理操作有一定的鲁棒性.

关键词: 音频篡改检测, 电网频率, 最大相关偏移, 斜率, 双重判断机制

Abstract:

Insertion and deletion are two commonly-used audio forgery operations,and the current Electric NetworkFrequency( ENF) -based audio forgery detection algorithms cannot accurately determine the location of insertion anddeletion operations.In order to solve this problem,a blind forgery detection algorithm based on dual judgmentmechanisms is proposed in this paper.In the investigation,the first mechanism utilizes the information from aMOCC ( max offset for cross correlation) curve to locate the forgery while the second mechanism estimates forgeryregions according to the slope curve which corresponds to the MOCC curve.The combined use of the two mechanismscan obtain a higher forgery localization precision.For simplicity of implementation,a method is put forward to calculatethe MOCC without an extra reference ENF signal.As compared with two current representative methods,the proposeddetection method can locate the deletion and insertion operations more accurately.Besides,it has reasonablerobustness when applied to such common audio operations as re-sampling,compression and noise addition.

Key words: audio forgery detection, electric network frequency, max offset for cross correlation, slope, dual judgment mechanisms