华南理工大学学报(自然科学版) ›› 2008, Vol. 36 ›› Issue (9): 48-51.

• 计算机科学与技术 • 上一篇    下一篇

一种改进的元搜索排序合成算法

李红梅丁振国周水生周利华1   

  1. 1. 西安电子科技大学 计算机学院, 陕西 西安 710071; 2. 西安电子科技大学 理学院, 陕西 西安 710071
  • 收稿日期:2008-03-18 修回日期:2008-05-16 出版日期:2008-09-25 发布日期:2008-09-25
  • 通信作者: 李红梅(1968-),女,博士生,讲师,主要从事信息处理和人工智能研究. E-mail:lhmbd@163.com
  • 作者简介:李红梅(1968-),女,博士生,讲师,主要从事信息处理和人工智能研究.
  • 基金资助:

    国家自然科学基金资助项目(60603098)

An Improved Rank Merging Algorithm for Meta Search

Li Hong-mei Ding Zhen-guo1  Zhou Shui-sheng2  Zhou Li-hua1   

  1. 1. School of Computer Science and Technology, Zhou Shui-sheng2 Zhou Li-hua1 Xidian University, Xi'an 710071, Shaanxi, China; 2. School of Science, Xidian University, Xi'an 710071, Shaanxi, China
  • Received:2008-03-18 Revised:2008-05-16 Online:2008-09-25 Published:2008-09-25
  • Contact: 李红梅(1968-),女,博士生,讲师,主要从事信息处理和人工智能研究. E-mail:lhmbd@163.com
  • About author:李红梅(1968-),女,博士生,讲师,主要从事信息处理和人工智能研究.
  • Supported by:

    国家自然科学基金资助项目(60603098)

摘要: 为了提高元搜索引擎的查询精度,提出了一种改进的元搜索结果合成算法.首先,通过分析搜索结果列表中包含的文本信息,综合考虑搜索结果与查询的匹配完全程度和相关程度,给出了文本分析的规范化方法;并结合搜索结果的排序信息计算文档的相关分值,据此实现对局部相似度的调整.然后,利用成员搜索引擎的性能评价,提出了改进的影子文档方法来估算非相关文档的相关分值.最后,采用基于群决策的合成方法对搜索结果进行一致性排序.实际Web环境中的测试表明,所提出的算法比现有合成算法具有更好的控索结果相关性.

关键词: 信息检索, 元搜索, 搜索结果合成, 文本分析, 群决策

Abstract:

In order to improve the precision of meta search engine, an improved merging algorithm of meta search results is proposed. In this algorithm, first, the text-based information obtained from search results is analyzed and both the query-match grade and the result relevancy are considered to give an approach on the text normalization for meta search. Next, the relevant scores of documents are normalized by incorporating text analysis with the ranks given by the search engines for the purpose of adjusting the local similarities. Then, based on the performance evaluation of underlying search engines, an improved shadow document method is presented to evaluate the scores of non-relevant documents. Finally, a merging method based on the group decision making is adopted to sort the search results. It is found from the tested results in an actual Web environment that the search results obtained by the proposed algorithm are of higher relativity than those by the existing merging algorithms.

Key words: information retrieval, meta search, search-resuh merging, text analysis, group decision making