Journal of South China University of Technology(Natural Science Edition) ›› 2012, Vol. 40 ›› Issue (8): 82-87.

• Computer Science & Technology • Previous Articles     Next Articles

Topic Tracking Based on Spatiotemporal Contextual Model

Zhou Yi-peng1  Du Jun-ping2   

  1. 1.School of Computer and Information Engineering,Beijing Technology and Business University,Beijing 100048,China; 2. Beijing Key Laboratory of Intelligent Telecommunications Software and Multimedia,Beijing University of Posts and Telecommunications,Beijing 100876,China
  • Received:2012-06-01 Revised:2012-07-23 Online:2012-08-25 Published:2012-07-01
  • Contact: 周亦鹏(1976-) ,男,博士,讲师,主要从事人工智能、Web 挖掘等研究. E-mail:yipengzhou@163.com
  • About author:周亦鹏(1976-) ,男,博士,讲师,主要从事人工智能、Web 挖掘等研究.
  • Supported by:

    国家自然科学基金资助项目( 91024001, 61070142) ; 北京市自然科学基金资助项目( 4111002)

Abstract:

As the existing topic model can not accurately reflect the periodic variation and spatial distribution of topics in spatiotemporal context,a spatiotemporal contextual topic model for topic tracking is proposed according to the fact that the Internet information often contains the publishing time and site. In the investigation,first,by associating the distribution of subtopics with spatiotemporal context,a model is established to describe the cycle and strength of topics. Then,the parameters of the proposed model are estimated through EM algorithm,and are employed to obtain the snapshot and cycle of topics. Finally,the time-based topic similarity is calculated to estimate the subsequent topic information,thus realizing the topic tracking. The tracking experiments of food safety events indicate that,as compared with the traditional topic tracking method only depending on the text features,the proposed method can obviously improve the tracking efficiency of the topic as well as the tracking accuracy of subtopics. It is thus concluded that the proposed method helps to achieve more accurate topic retrieval.

Key words: topic model, context, generation model, probability distributions, text processing