Journal of South China University of Technology (Natural Science Edition) ›› 2020, Vol. 48 ›› Issue (1): 60-69,83.doi: 10.12141/j.issn.1000-565X.190388

• Computer Science & Technology • Previous Articles     Next Articles

Improved Stereo Matching Algorithm Based on PSMNet

LIU Jianguo FENG Yunjian JI Guo YAN Fuwu ZHU Shizhuo   

  1. Hubei Key Laboratory of Advanced Technology for Automotive Components∥Hubei Collaborative Innovation Center for Automotive Components Technology∥Hubei Research Center for New Energy & Intelligent Connected Vehicle,Wuhan University of Technology,Wuhan 430070,Hubei,China
  • Received:2019-06-27 Revised:2019-08-06 Online:2020-01-25 Published:2019-12-01
  • Contact: 刘建国 (1971-),男,博士,副教授,主要从事机器视觉、智能驾驶研究。 E-mail:ljg424@163.com
  • About author:刘建国 (1971-),男,博士,副教授,主要从事机器视觉、智能驾驶研究。
  • Supported by:
    Supported by the National Natural Science Foundation of China (51975434)

Abstract: Based on PSMNet stereo matching network,an improved stereo matching algorithm with shallow struc-ture and wide receptive field -SWNet was proposed,in order to solve the stereo matching problem in binocular vi-sion,reduce the number of parameters of the stereo matching network,reduce the computational complexity of the algorithm,and improve the practicability of the algorithm. The shallow structure means fewer layers,fewer pa-rameters and faster processing speed,while wide receptive field means that the network is more receptive and can acquire and retain more spatial information. SWNet consists of three parts: feature extraction,3D convolution and disparity regression. In the aspect of feature extraction,Atrous Spatial Pyramid Pool (ASPP) was introduced,which was used to extract multi-scale feature information. Feature fusion module was designed to fuse multi-scale feature information and build matching cost volume. The 3D convolutional neural network use the stack encoding
and decoding structure to further regularize the matching cost volume and obtain the corresponding relationship be-tween the feature points under different disparity conditions. Finally,the disparity map was obtained by regres-sion. SWNet performed well on both SceneFlow and KITTI 2015 public datasets,with a 48. 9% reduction in the number of parameters and a 2. 24% mismatching rate compared to the reference algorithm PSMNet.

Key words: stereo matching, PSMNet stereo matching network, convolutional neural network, deep learning, At-rous Spatial Pyramid Pooling, spatial feature information, feature fusion module

CLC Number: