Journal of South China University of Technology (Natural Science Edition) ›› 2011, Vol. 39 ›› Issue (7): 146-149,155.doi: 10.3969/j.issn.1000-565X.2011.07.024

• Computer Science & Technology • Previous Articles     Next Articles

Extraction of Domain-Specific Phenomenal Terms Based on Separator and Contextual Terms

Liu Li  Liu Xiao-ming   

  1. School of Computer Science and Technology,Beijing Institute of Technology,Beijing 100081,China
  • Received:2010-10-29 Revised:2011-03-08 Online:2011-07-25 Published:2011-06-03
  • Contact: 刘里(1983-) ,男,博士生,主要从事自然语言处理研究. E-mail:niceliuli@sina.com
  • About author:刘里(1983-) ,男,博士生,主要从事自然语言处理研究.
  • Supported by:

    国家自然科学基金资助项目( 61003065)

Abstract:

As domain-specific phenomenal terms are usually compounds that are difficult to extract according to local context features via the traditional machine learning methods,a novel extraction method is proposed. In this method,first,the context-based method is employed to extract the separator set. Then,with the combination of the separator set and context terms,the improved NC-value algorithm is used to extract candidate phenomenal results. Finally,nominal terms are filtered out from the candidate phenomenal terms to obtain the final terms. Experimental results indicate that the proposed extraction method of domain-specific phenomenal terms performs better than the word frequency-based and the separator-based ones.

Key words: term extraction, separator, compound, NC-value algorithm