收稿日期: 2010-10-29
修回日期: 2011-03-08
网络出版日期: 2011-06-03
基金资助
国家自然科学基金资助项目( 61003065)
Extraction of Domain-Specific Phenomenal Terms Based on Separator and Contextual Terms
Received date: 2010-10-29
Revised date: 2011-03-08
Online published: 2011-06-03
Supported by
国家自然科学基金资助项目( 61003065)
关键词: 术语抽取; 分隔符; 复合词; NC-value 算法
刘里 刘小明 . 基于分隔符和上下文术语的领域现象术语抽取[J]. 华南理工大学学报(自然科学版), 2011 , 39(7) : 146 -149,155 . DOI: 10.3969/j.issn.1000-565X.2011.07.024
As domain-specific phenomenal terms are usually compounds that are difficult to extract according to local context features via the traditional machine learning methods,a novel extraction method is proposed. In this method,first,the context-based method is employed to extract the separator set. Then,with the combination of the separator set and context terms,the improved NC-value algorithm is used to extract candidate phenomenal results. Finally,nominal terms are filtered out from the candidate phenomenal terms to obtain the final terms. Experimental results indicate that the proposed extraction method of domain-specific phenomenal terms performs better than the word frequency-based and the separator-based ones.
Key words: term extraction; separator; compound; NC-value algorithm
/
| 〈 |
|
〉 |