Journal of South China University of Technology (Natural Science Edition) ›› 2006, Vol. 34 ›› Issue (6): 74-78,94.

• Computer Science & Technology • Previous Articles     Next Articles

Determination of Related Web Queries Using Support Vector Regression

Wang Ji-min1  Peng Bo2  Meng Tao2   

  1. 1.Dept.of Information Management,Peking Univ.,Beijing 100871,China;2.School of Electronics Engineering and Computer Science,Peking Univ.,Beijing 100871,China
  • Received:2005-07-15 Online:2006-06-25 Published:2006-06-25
  • Contact: 王继民(1966-),男,博士,副教授,主要从事搜索引擎与Web挖掘方面的研究 E-mail:wjm@pku.edu.cn
  • About author:王继民(1966-),男,博士,副教授,主要从事搜索引擎与Web挖掘方面的研究
  • Supported by:

    国家自然科学基金资助项目(60573166);国家自然科学基金重点资助项目(60435020)

Abstract:

When a user submits a Web query to a search engine,it is helpful for the user to modify the query and find the needed information if the system returns a list of related Web queries.This paper presents a new determ ina-tion method of related Web queries using support vector regression.In this method,five quantified indexes of a candidate query are extracted from the log files,including the submitted number of the candidate query ,the total numbers of submitting the candidate query and hitting the returned resuh,the number of common terms and the number of hitting common URL(Uniform Resource Locator)between the candidate query and the given query.The obtained candidate queries are then ranked based on support vector regression models learned from parts of human.1abeled training data.The related Web queries are finally determ ined according to the relevance.Experimental re-suits show that the proposed method is of high prediction precision.

Key words: search engine, user log, related Web query , support vector regression