基于多阶融合与循环聚合的立体匹配网络

doi:10.12141/j.issn.1000-565X.200430

华南理工大学学报（自然科学版） ›› 2021, Vol. 49 ›› Issue (6): 77-87,99.doi: 10.12141/j.issn.1000-565X.200430

所属专题： 2021年计算机科学与技术

基于多阶融合与循环聚合的立体匹配网络

张瑞峰¹任国明¹李锵¹段子阳^1,²

1．天津大学微电子学院,天津 300072；2.中国电子科技集团公司第五十三研究所,天津 300300

收稿日期:2020-07-24 修回日期:2020-11-16 出版日期:2021-06-25 发布日期:2021-06-01
通信作者: 张瑞峰（1974-），男，博士，副教授，主要从事机器视觉研究。 E-mail:zhangruifeng@tju.edu.cn
作者简介:张瑞峰（1974-），男，博士，副教授，主要从事机器视觉研究。
基金资助:
国家自然科学基金资助项目（61471263）；天津市自然科学基金资助项目（16JCZDJC31100）

Stereo Matching Network Based on Multi-Stage Fusion and Recurrent Aggregation

ZHANG Ruifeng REN Guoming LI Qiang DUAN Ziyang

1. School of Microelectronics, Tianjin University, Tianjin 300072, China；2. The 53th Research Institute of China
Electronics Technology Group Corporation, Tianjin 300300, China

Received:2020-07-24 Revised:2020-11-16 Online:2021-06-25 Published:2021-06-01
Contact: 张瑞峰（1974-），男，博士，副教授，主要从事机器视觉研究。 E-mail:zhangruifeng@tju.edu.cn
About author:张瑞峰（1974-），男，博士，副教授，主要从事机器视觉研究。
Supported by:
Supported by the National Natural Science Foundation of China(61471263) and the Tianjin Municipal Natural Science Foundation(16JCZDJC31100)

摘要/Abstract

摘要： 针对基于深度学习的立体匹配网络中病态区域匹配效果欠佳、模型参数量过大的问题，提出了一种基于多阶特征融合与循环代价聚合的端对端立体匹配网络—MFRANet。首先，为兼顾图像低层细节信息与高层语义信息，提出了多阶特征融合模块，采用分阶段、逐步式的特征融合策略对多层次、多尺度特征进行有效融合；其次，在代价聚合阶段提出循环聚合机制，以循环方式对匹配代价卷进行聚合优化，在改善聚合效果的同时不引入过多的参数量；最后，利用基于Soft Argmin算法的视差计算模块计算图像视差。并通过KITTI 2012/2015和SceneFlow两个公开数据集对网络进行训练和测试，与其他端对端立体匹配网络进行了对比研究。结果表明，在SceneFlow和KITTI 2015两个公开数据集上，相较于其他端对端立体匹配网络，MFRANet具有更为精准的匹配结果；对于SceneFlow数据集，终点误差降低至0.92Pixels；对于KITTI 2015数据集，误匹配率降低至2.21%。

关键词: 端对端立体匹配网络, 多阶特征融合, 循环代价聚合, 终点误差, 误匹配率

Abstract: Aiming at the poor matching effect of ill-conditioned regions and excessive model parameters in the stereo matching network based on deep learning, an end-to-end stereo matching network based on multi-level feature fusion and recurrent cost aggregation（MFRANet）was proposed. Firstly, in order to take into account both the low-level detail information and high-level semantic information of the image, a multi-stage feature fusion module, which uses a phased and step-by-step feature fusion strategy to effectively fuse multi-level and multi-scale features, was proposed. Secondly, a recurrent mechanism was proposed in the cost aggregation stage to optimize the aggregation of the matching cost volume in a recurrent manner, and it can improve the aggregation effect while avoid introducing too many parameters. Finally, the disparity calculation module based on the Soft Argmin algorithm was used to calculate the image disparity. And through the two public datasets of KITTI 2012/2015 and SceneFlow, the network was trained and tested, and a comparative study with other end-to-end stereo matching networks was caaried out. Experimental results show that, for the two public datasets of SceneFlow and KITTI 2015, MFRANet has more accurate matching results than other end-to-end stereo matching networks; for the SceneFlow dataset, the end-point error is reduced to 0.92 pixels; for the KITTI 2015 dataset, the mismatching rate is reduced to 2.21%.

Key words: end-to-end stereo matching network, multi-stage feature fusion, recurrent cost aggregation, end-point error, mismatching rate

中图分类号:

TP391

张瑞峰, 任国明, 李锵, 等. 基于多阶融合与循环聚合的立体匹配网络[J]. 华南理工大学学报（自然科学版）, 2021, 49(6): 77-87,99.

ZHANG Ruifeng, REN Guoming, LI Qiang, et al. Stereo Matching Network Based on Multi-Stage Fusion and Recurrent Aggregation[J]. Journal of South China University of Technology (Natural Science Edition), 2021, 49(6): 77-87,99.

[1]	李海燕, 尹浩林, 李鹏, 等. 基于密集特征推理及混合损失函数的修复算法[J]. 华南理工大学学报(自然科学版), 2023, 51(9): 99-109.
[2]	刘怡俊, 王嘉达, 钟仕杰, 等. 基于统一标签矩阵的快速多视图聚类[J]. 华南理工大学学报(自然科学版), 2023, 51(9): 110-119.
[3]	王世勇, 乾国康, 李迪, 等. 面向边缘特征的实时模板匹配方法[J]. 华南理工大学学报(自然科学版), 2023, 51(9): 1-10.
[4]	李家春, 李博文, 林伟伟. AdfNet：一种基于多样化特征的自适应深度伪造检测网络[J]. 华南理工大学学报(自然科学版), 2023, 51(9): 82-89.
[5]	马晓亮, 安玲玲, 邓从健, 等. 基于行业词表的自动语音转写后优化技术[J]. 华南理工大学学报(自然科学版), 2023, 51(8): 118-125.
[6]	林志坚, 黄萍, 郑明魁, 等. 基于FPGA的HEVC熵编码语法元素硬件加速设计[J]. 华南理工大学学报(自然科学版), 2023, 51(8): 110-117.
[7]	韩乐, 江怡华. 鲁棒截断L1-L2全变分稀疏恢复模型[J]. 华南理工大学学报(自然科学版), 2023, 51(5): 45-53,140.
[8]	朱铮宇, 罗超, 贺前华, 等. 基于唇重构与三维耦合CNN的多视角音唇一致性判别[J]. 华南理工大学学报(自然科学版), 2023, 51(5): 70-77.
[9]	陆璐, 赖锦雄. 基于胶囊网络和注意力机制的智能合约漏洞检测方法[J]. 华南理工大学学报(自然科学版), 2023, 51(5): 36-44.
[10]	林志坚, 丁永强, 杨秀芝, 等. HEVC帧内率失真优化预测模式的并行流水线硬件设计[J]. 华南理工大学学报(自然科学版), 2023, 51(5): 95-103.
[11]	叶峰, 陈彪, 赖乙宗. 基于特征空间嵌入的对比知识蒸馏算法[J]. 华南理工大学学报(自然科学版), 2023, 51(5): 13-23.
[12]	马碧云, 吴港, 刘娇蛟, 等. 基于稀疏脉冲采样的低复杂度血流速度估计算法[J]. 华南理工大学学报(自然科学版), 2023, 51(5): 63-69.
[13]	刘宇鹏, 张雷. 融合遗忘和知识点重要度的认知诊断模型[J]. 华南理工大学学报(自然科学版), 2023, 51(5): 54-62.
[14]	张艳, 许昌康, 曹丽青, 等. 基于互信息解耦表示的跨域压力足迹图像检索[J]. 华南理工大学学报(自然科学版), 2023, 51(5): 78-85.
[15]	田晟, 宋霖, 赵凯龙. 基于偏移注意力机制和多特征融合的点云分类[J]. 华南理工大学学报(自然科学版), 0, (): 0-.

基于多阶融合与循环聚合的立体匹配网络

Stereo Matching Network Based on Multi-Stage Fusion and Recurrent Aggregation

PDF

可视化

摘要/Abstract

引用本文

使用本文

参考文献

相关文章 15

编辑推荐

Metrics

本文评价