Journal of South China University of Technology (Natural Science Edition) ›› 2011, Vol. 39 ›› Issue (5): 102-107.doi: 10.3969/j.issn.1000-565X.2011.05.018

• Computer Science & Technology • Previous Articles     Next Articles

A Keyword-Based Method for Diversification of Web Search Results

Lin Gu-li  Peng Hong  Ma Qian-li  Wei Jia  Qin Jiang-wei   

  1. School of Computer Science and Engineering,South China University of Technology,Guangzhou 510006,Guangdong,China
  • Received:2010-06-21 Revised:2010-07-10 Online:2011-05-25 Published:2011-04-01
  • Contact: 林古立(1984-) ,男,博士生,主要从事信息检索、数据挖掘、机器学习研究. E-mail:lin.guli@mail.scut.edu.cn
  • About author:林古立(1984-) ,男,博士生,主要从事信息检索、数据挖掘、机器学习研究.
  • Supported by:

    广东省自然科学基金资助项目( 07006474, 9451064101003233) ; 广东省科技攻关项目( 2007B010200044) ; 华南理工大学中央高校基本科研业务费专项资金资助项目( 2009ZM0125, 2009ZM0189)

Abstract:

The diversification of Web search results has been known as an important factor of improving Web search efficiency and user satisfaction. In this paper,the diversification problem is formalized into a maximization problem of facet coverage,and a novel diversification method named KDM is proposed. In KDM,first,keywords representing document facets are extracted from the retrieved documents related to the query. Then,the document facet novelty is calculated according to the co-occurrence and description ability of the keywords. Finally,the documents are re-ranked by considering both the novelty and the relevance to provide diversified search results for users. Experimental results indicate that KDM outperforms other existing approaches in terms of diversification ability.

Key words: information retrieval, keyword, retrieval result, diversification, re-ranking