Traffic & Transportation Engineering

Data Clustering of Road Transportation Information System Based on Attribute Dimension Partition and MapReduce

Expand
  • School of Civil Engineering and Transportation,South China University of Technology,Guangzhou 510640,Guangdong,China
郑晓峰(1977-),男,在职博士生,广东省道路运输管理局工程师,主要从事智能交通和道路交通运输管理研究.

Received date: 2014-04-15

  Revised date: 2014-07-15

  Online published: 2014-07-01

Supported by

国家自然科学基金资助项目( 61174184) ; 广东省工业科技攻关计划项目( 2008B010200010) ; 广州市科技支撑项目( 2011J4300045)

Abstract

Aiming at the shortcomings of DBSCAN ( Density-Based Spatial Clustering of Applications with Noise) ,this paper presents the concept of the attribute dimension partition by integrating the domain knowledge with thepartition idea.Then,the principles of the cluster merging and the pruning computation are demonstrated.Finally,an optimization method of DBSCAN is put forward based on the cloud computing programming model MapReduce,and the optimization method is verified through the data clustering of a real road transport information system.It isfound that the dataset partition helps to perform the concurrent computation,and the proposed optimization methodis superior to common statistical methods.

Cite this article

Zheng Xiao-feng Xu Jian-min Lu Kai . Data Clustering of Road Transportation Information System Based on Attribute Dimension Partition and MapReduce[J]. Journal of South China University of Technology(Natural Science), 2014 , 42(8) : 122 -128,135 . DOI: 10.3969/j.issn.1000-565X.2014.08.019

Outlines

/