Journal of South China University of Technology(Natural Science) >
FP- Growth Algorithm Based on Boolean Matrix and MapReduce
Received date: 2013-05-06
Revised date: 2013-10-22
Online published: 2013-12-01
Supported by
国家自然科学基金面上项目(61272447)
Association rules mining is an important part of data mining.In order to improve the efficiency of associ-ation rules mining,an FP- Growth algorithm based on Boolean matrix and MapReduce,which is marked as BPFP,is proposed,with its time and space complexity being also analyzed.In BPFP algorithm,Hadoop framework andBoolean matrix are used to reduce the number of scans of transaction data,and twice Map- Reduce is adopted tomine frequent item sets.Experimental results on multiple data sets show that the improved FP- Growth algorithm issuperior to the original one due to its high execution efficiency and speedup.
Key words: data mining; association rules; Boolean matrix; MapReduce; FP- Growth algorithm
Chen Xing- shu Zhang Shuai Tong Hao Cui Xiao- jing . FP- Growth Algorithm Based on Boolean Matrix and MapReduce[J]. Journal of South China University of Technology(Natural Science), 2014 , 42(1) : 135 -141 . DOI: 10.3969/j.issn.1000-565X.2014.01.023
/
| 〈 |
|
〉 |