收稿日期: 2013-05-06
修回日期: 2013-10-22
网络出版日期: 2013-12-01
基金资助
国家自然科学基金面上项目(61272447)
FP- Growth Algorithm Based on Boolean Matrix and MapReduce
Received date: 2013-05-06
Revised date: 2013-10-22
Online published: 2013-12-01
Supported by
国家自然科学基金面上项目(61272447)
关键词: 数据挖掘; 关联规则; 布尔矩阵; MapReduce; FP- Growth 算法
陈兴蜀 张帅 童浩 崔晓靖 . 基于布尔矩阵和 MapReduce 的 FP-Growth 算法[J]. 华南理工大学学报(自然科学版), 2014 , 42(1) : 135 -141 . DOI: 10.3969/j.issn.1000-565X.2014.01.023
Association rules mining is an important part of data mining.In order to improve the efficiency of associ-ation rules mining,an FP- Growth algorithm based on Boolean matrix and MapReduce,which is marked as BPFP,is proposed,with its time and space complexity being also analyzed.In BPFP algorithm,Hadoop framework andBoolean matrix are used to reduce the number of scans of transaction data,and twice Map- Reduce is adopted tomine frequent item sets.Experimental results on multiple data sets show that the improved FP- Growth algorithm issuperior to the original one due to its high execution efficiency and speedup.
Key words: data mining; association rules; Boolean matrix; MapReduce; FP- Growth algorithm
/
| 〈 |
|
〉 |