Journal of South China University of Technology (Natural Science Edition) ›› 2020, Vol. 48 ›› Issue (12): 125-134.doi: 10.12141/j.issn.1000-565X.200366

• Artificial Intelligence Special • Previous Articles     Next Articles

Approach for Spammer Detection in Weibo Based on Multi-View Fusion

YANG Xiaohui LIANG Xiao   

  1. School of Cyber Security and Computer,Hebei University,Baoding 071000,Hebei,China
  • Received:2020-06-28 Revised:2020-08-09 Online:2020-12-25 Published:2020-12-01
  • Contact: 杨 晓 晖 ( 1975-) ,男,博 士,教 授,主要从事分布计算、信息安全与可信计算研究。 E-mail:yxh@hbu.edu.cn
  • About author:杨 晓 晖 ( 1975-) ,男,博 士,教 授,主要从事分布计算、信息安全与可信计算研究。
  • Supported by:

    Supported by the National Key Research and Development Program of China ( 2017YFB0802300)

Abstract:

In order to detect spammers more effectively in Weibo,an approach based on multi-view fusion was proposed. First,a user representation strategy for integrating multi-view information was designed to characterize users from 3 views,namely,user behavior,social relationship and text content. In view of the deficiencies that the existing approaches do not fully consider the user's fans and user's environment in social networks,new features such as fan ratio,fan average bidirectional connection rate,community-based bidirectional connection rate,communitybased cluster coefficient,etc. were introduced. Then,a multi-view fusion decision model based on a linear weighting function was constructed. A linear weighting fusion was carried out based on the classification results from each view. The optimal fusion coefficient was obtained by minimizing the approximate error,and then the final classification result was obtained. The test result on the real data from Weibo show that this approach can not only effectively detect spammers,with significant improvement in precision and F1-sorce,but also exhibits greater stability especially when processing unbalanced data. It also analyzes the impact of different views on the final detection effect, and the results show that the user's social relationship view has the most significant effect.

Key words: Weibo, spammer detection, linear weighting function, multi-view fusion

CLC Number: