Lipreading Based on Lip Gray Energy Image

Journal of South China University of Technology (Natural Science Edition), 2011, Vol. 39, Issue 7: 88-94. doi: 10.3969/j.issn.1000-565X.2011.07.015

• Electronics, Communication and Automatic Control •

Liang Ya-ling, Du Ming-hui

School of Electronic and Information Engineering, South China University of Technology, Guangzhou 510640, Guangdong, China

Received: 2010-12-09 Revised: 2011-04-10 Online: 2011-07-25 Published: 2011-06-03
Contact: Liang Ya-ling (born 1977), female, Ph.D. candidate and lecturer, working mainly on image processing and image coding. E-mail: ylliang@scut.edu.cn
Supported by: the Joint Fund of the National Natural Science Foundation of China and Guangdong Province (U0735004)

Abstract

To address lip feature extraction in visual-only lipreading systems, this paper proposes a feature extraction method based on the lip gray energy image (LGEI). The method projects the image sequence of a character or word onto a 2D gray energy image, which unifies the dimension of the input data and preserves most of the motion information in the sequence. To reduce the dependence of template matching on the template, the LGEI template built from a single training sample is extended to multiple training samples. A lip location method based on the lip center is also proposed. Experimental results show that, with the same feature dimension per frame, the proposed method achieves a considerably higher recognition rate and a markedly shorter computation time than conventional methods that extract features from each frame separately; that using two training samples raises the recognition rate by 11.29% on average compared with a single training sample; and that accurate lip location raises the recognition rate by more than 2%, with the highest recognition rate of the system reaching 90.63%.

Key words: lipreading, lip gray energy image, gait energy image, feature extraction
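
The abstract does not state the LGEI formula, so the following is only a minimal sketch under stated assumptions rather than the authors' implementation. By analogy with the gait energy image named in the key words, which averages the frames of a sequence, it assumes the LGEI is the per-pixel mean of aligned grayscale lip frames. It further assumes that a multi-sample template is the mean of the per-sample LGEIs and that matching uses Euclidean distance to the nearest template. The function names `lip_gray_energy_image`, `build_template` and `classify` are hypothetical, and lip location is not covered.

```python
import numpy as np


def lip_gray_energy_image(frames):
    """Collapse a sequence of aligned grayscale lip frames into one 2D image.

    Assumption: the LGEI is the per-pixel mean over the sequence, by
    analogy with the gait energy image. Sequences of any length map to
    an image of the same size, which unifies the input dimension.
    """
    stack = np.stack([np.asarray(f, dtype=np.float64) for f in frames])
    if stack.ndim != 3:
        raise ValueError("expected a sequence of 2D grayscale frames")
    return stack.mean(axis=0)


def build_template(sequences):
    """Hypothetical multi-sample template: mean of the per-sequence LGEIs."""
    return np.mean([lip_gray_energy_image(s) for s in sequences], axis=0)


def classify(frames, templates):
    """Return the label whose template is nearest in Euclidean distance.

    Assumption: nearest-template matching; the paper's actual matching
    rule is not given in the abstract.
    """
    query = lip_gray_energy_image(frames)
    return min(templates, key=lambda label: np.linalg.norm(query - templates[label]))


if __name__ == "__main__":
    # Synthetic constant frames stand in for real lip images.
    bright = [np.full((4, 6), 200.0)] * 5   # 5-frame sequence
    dark = [np.full((4, 6), 50.0)] * 8      # 8-frame sequence
    templates = {
        "a": build_template([bright, bright]),  # two training samples
        "b": build_template([dark]),            # one training sample
    }
    query = [np.full((4, 6), 180.0)] * 3        # 3-frame test sequence
    print(classify(query, templates))           # prints "a"
```

Because every sequence is reduced to a single image of fixed size, words spoken over different numbers of frames can be compared directly, which is the dimension-unifying property the abstract describes.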

Liang Ya-ling, Du Ming-hui. Lipreading Based on Lip Gray Energy Image[J]. Journal of South China University of Technology (Natural Science Edition), 2011, 39(7): 88-94.

