华南理工大学学报(自然科学版) ›› 2014, Vol. 42 ›› Issue (1): 123-127,134.doi: 10.3969/j.issn.1000-565X.2014.01.021

• 计算机科学与技术 • 上一篇    下一篇

图形处理器上CSB+- 树索引的并行构建算法

刘勇 奚建清 黄东平 贾连印 苗德成   

  1. 华南理工大学 计算机科学与工程学院,广东 广州 510006
  • 收稿日期:2013-04-15 修回日期:2013-06-12 出版日期:2014-01-25 发布日期:2013-12-01
  • 通信作者: 刘勇(1973-),男,在职博士生,广西民族大学高级工程师,主要从事高性能并行计算研究. E-mail:l.yong07@mail.scut.edu.cn
  • 作者简介:刘勇(1973-),男,在职博士生,广西民族大学高级工程师,主要从事高性能并行计算研究.
  • 基金资助:

    国家自然科学基金资助项目(61103038);云南省教育厅重点项目(2012Z008)

Parallel Algorithm to Construct CSB+- Tree Indexing on Graphic Processing Unit

Liu Yong Xi Jian- qing Huang Dong- ping Jia Lian- yin Miao De- cheng   

  1. School of Computer Science and Engineering,South China University of Technology,Guangzhou 510006,Guangdong,China
  • Received:2013-04-15 Revised:2013-06-12 Online:2014-01-25 Published:2013-12-01
  • Contact: 刘勇(1973-),男,在职博士生,广西民族大学高级工程师,主要从事高性能并行计算研究. E-mail:l.yong07@mail.scut.edu.cn
  • About author:刘勇(1973-),男,在职博士生,广西民族大学高级工程师,主要从事高性能并行计算研究.
  • Supported by:

    国家自然科学基金资助项目(61103038);云南省教育厅重点项目(2012Z008)

摘要: 为提高缓存敏感 CSB+- 树索引的操作效率,在图形处理器( GPU) 上研究 CSB+-树的并行构建和查询性能.通过分析索引树内部节点的每一键与对应叶子节点的映射关系,提出了一种一次性并行构建 CSB+- 树所有内部节点键值的无锁并行算法,以最大并行度来快速构建索引树.该算法通过设计 GPU 平台上支持 CSB+- 树的索引数据任意伸缩的动态数组来解决 GPU 上不能动态分配显存空间的问题,通过在索引内部节点的边界增加填充位来减少线程块的线程分支数,从而提高 CSB+- 树的查询效率.实验结果表明,文中所提算法的运行时间比基于单个节点和基于树层的并行算法分别提高了 31. 0 和 1.4 倍.

关键词: 并行算法, 图形处理器, CSB+- 树索引, 动态数组, 查询效率

Abstract:

In order to improve the operation efficiency of cache sensitive B+- tree (CSB+- tree) indexing,this pa-per deals with the parallel construction and query performance of CSB+- tree on graphic processing unit (GPU).Inthe investigation,first,the mapping relationship between each key in internal nodes and the corresponding leafnode of the index tree is analyzed, a lock- free parallel algorithm that once for all builds the CSB+- tree internal nodekeys is proposed,and the index tree is constructed at the maximum parallel speed.Moreover, dynamic arrays su-pporting the arbitrary expansion of CSB+- tree index data on GPU are designed to implement the dynamic allocationof memory space on GPU,and padding bits are added to the boundary of the internal nodes to reduce the number ofbranches,thus improving the query efficiency of CSB+- tree.Experimental results indicate that the proposed algo-rithm is 31. 0 and 1.4 times faster respectively than the parallel algorithms based on single node and tree layer.

Key words: parallel algorithms, graphic processing unit, CSB+- tree indexing, dynamic array, query efficiency

中图分类号: