Journal of South China University of Technology (Natural Science Edition) ›› 2008, Vol. 36 ›› Issue (9): 31-36.

• Computer Science & Technology • Previous Articles     Next Articles

An Extended Knowledge Discovery Framework for Outlier Data Set

Jin Yi-fu  Zhu Qing-sheng   

  1. College of Computer, Chongqing University, Chongqing 400044, China
  • Received:2008-01-08 Revised:2008-06-06 Online:2008-09-25 Published:2008-09-25
  • Contact: 金义富(1969-),男,博士生,湛江师范学院副教授,主要从事智能信息处理与数据挖掘研究. E-mail:yfjin@tom.com
  • About author:金义富(1969-),男,博士生,湛江师范学院副教授,主要从事智能信息处理与数据挖掘研究.
  • Supported by:

    重庆市自然科学基金资助项目(2005BB2224);教育部高校博士点基金资助项目(20050611027)

Abstract:

The existing researches on outlier data mainly focus on the outlier detection. In order to completely analyze the origin, classification, meaning, behavior characteristics and outlying trend of outlier data, some concepts such as the nearest outlying neighbor, the atomic outlier class and the outlying mutation class are defined and the approaches to outlier clustering and outlying trend analyses are proposed based on the existing outlier mining techniques as well as a series of concepts and their searching algorithms including the outlying reduction and the key attribute subspace. Furthermore, an integrated framework of characteristic description and extended knowledge discovery of outlier data set is constructed, whose validity in practical applications is finally verified by the outlier analysis of mobile communication operation data.

Key words: data mining, outlier analysis, key attribute subspace, knowledge discovery framework