Two-Stage Decision-Based Detection of Non-Lexical Audio Events in Spontaneous Vocalization
Received date: 2010-04-16
Revised date: 2010-06-17
Online published: 2011-01-02
Supported by
the National Natural Science Foundation of China (60972132); the Natural Science Foundation of Guangdong Province (10451064101004651, 9351064101000003)
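He Qianhua, Li Yanxiong, Li Tao, Zhang Hong, Yang Jichen. Two-Stage Decision-Based Detection of Non-Lexical Audio Events in Spontaneous Vocalization [J]. Journal of South China University of Technology (Natural Science Edition), 2011, 39(2): 20-25, 31. DOI: 10.3969/j.issn.1000-565X.2011.02.004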
To make effective use of non-lexical audio events in analyzing the semantics of conversational speech, this paper analyzes the characteristic differences among the audio events that frequently occur in spontaneous vocalization and proposes a two-stage decision-based method for detecting them. In this method, the characteristics of audio events are used to construct signal segments of audio events: a threshold decision detects longer applause (the first-stage decision), and statistical models detect the other audio events (the second-stage decision). Experimental results show that, for three non-lexical audio events (filled pause, laughter, and applause), the proposed method achieves an average precision of 87.3%, an average recall of 93.8%, and an average F1-measure of 90.4%. Compared with the existing method, the proposed method increases the average F1-measure by 7.5% and determines the boundaries of non-lexical audio events more accurately.
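As a quick consistency check on the reported figures, the sketch below computes the F1-measure as the harmonic mean of precision and recall. The function name `f1_measure` is illustrative and not taken from the paper. The abstract reports averages over three event types, and an average of per-event F1 values need not equal the F1 of the averaged precision and recall, so this is only an approximate cross-check.

```python
def f1_measure(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both in the same units)."""
    if precision + recall == 0:
        return 0.0  # avoid division by zero when nothing is detected
    return 2 * precision * recall / (precision + recall)


# Average precision and recall reported in the abstract, in percent.
avg_precision, avg_recall = 87.3, 93.8
print(round(f1_measure(avg_precision, avg_recall), 1))  # prints 90.4
```

To one decimal place, the result agrees with the average F1-measure stated in the abstract.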