Journal of South China University of Technology (Natural Science Edition) ›› 2015, Vol. 43 ›› Issue (1): 21-27,33.doi: 10.3969/j.issn.1000-565X.2015.01.004

• Electronics, Communication & Automation Technology • Previous Articles     Next Articles

A Clustering Method for Multiple Speaker Roles

Li Wei He Qian-hua Li Yan-xiong   

  1. School of Electronic and Information Engineering , South China University of Technology , Guangzhou 510640 , Guangdong , China 
  • Received:2014-09-15 Revised:2014-11-21 Online:2015-01-25 Published:2014-12-01
  • Contact: 李威(1979-),女,博士生,主要从事语音信号处理研究 . E-mail:livay_21@163.com
  • About author:李威(1979-),女,博士生,主要从事语音信号处理研究 .
  • Supported by:
    Supported by the National Natural Science Foundation of China ( 61101160 )

Abstract:

In order to find the number of speaker roles and the corresponding speakers ’ speech in meeting speeches , a clustering method for multiple speaker roles is proposed. Firstly , features for speaker role clustering are defined. Secondly , geodesic distance is used to measure the similarities among features. Then , inner-class distance is used to control inter-class mergence to form the clustering method. Finally , four different types of meeting speech corpora are used to validate the effectiveness of the proposed method. The results indicate that ,for the meeting speeches obtained by both manual and automatic segmentation , the clustering performance using geodesic distance is superior to that using traditional distance when the same clustering algorithm is used in all cases , and that the proposed method performs better than the traditional hierarchical clustering method when the same measuring distance is used.

Key words: speaker role, characteristic distance measure, role clustering, geodesic distance, unsupervised clustering

CLC Number: