Journal of South China University of Technology (Natural Science Edition) ›› 2014, Vol. 42 ›› Issue (8): 122-128,135.doi: 10.3969/j.issn.1000-565X.2014.08.019

• Traffic & Transportation Engineering • Previous Articles     Next Articles

Data Clustering of Road Transportation Information System Based on Attribute Dimension Partition and MapReduce

Zheng Xiao-feng Xu Jian-min Lu Kai   

  1. School of Civil Engineering and Transportation,South China University of Technology,Guangzhou 510640,Guangdong,China
  • Received:2014-04-15 Revised:2014-07-15 Online:2014-08-25 Published:2014-07-01
  • Contact: 郑晓峰(1977-),男,在职博士生,广东省道路运输管理局工程师,主要从事智能交通和道路交通运输管理研究. E-mail:bobcraft@163.com
  • About author:郑晓峰(1977-),男,在职博士生,广东省道路运输管理局工程师,主要从事智能交通和道路交通运输管理研究.
  • Supported by:

    国家自然科学基金资助项目( 61174184) ; 广东省工业科技攻关计划项目( 2008B010200010) ; 广州市科技支撑项目( 2011J4300045)

Abstract:

Aiming at the shortcomings of DBSCAN ( Density-Based Spatial Clustering of Applications with Noise) ,this paper presents the concept of the attribute dimension partition by integrating the domain knowledge with thepartition idea.Then,the principles of the cluster merging and the pruning computation are demonstrated.Finally,an optimization method of DBSCAN is put forward based on the cloud computing programming model MapReduce,and the optimization method is verified through the data clustering of a real road transport information system.It isfound that the dataset partition helps to perform the concurrent computation,and the proposed optimization methodis superior to common statistical methods.

Key words: road transportation, DBSCAN, attribute dimension, partition, MapReduce, clustering

CLC Number: