收稿日期: 2021-08-25
网络出版日期: 2022-07-21
基金资助
广东省重点领域研发计划项目(2018B010109001);广东省海洋经济发展专项(GDNRC[2020]018)
Object Tracking Algorithm Based on Multi-Stream Attention Siamese Network
Received date: 2021-08-25
Online published: 2022-07-21
Supported by
the Key-Area R&D Project of Guangdong Province(2018B010109001);the Guangdong Provincial Special Project for the Development of Ocean Economy(GDNRC[2020]018)
目标跟踪在计算机视觉任务中有重要的意义。近年来随着深度学习的发展,基于孪生网络的目标跟踪算法因其优异的性能而被广泛应用。然而,现有基于孪生网络的跟踪算法在目标发生较大形变、低分辨率、复杂背景等情况下的跟踪性能通常会显著下降。为此,文中提出了一种基于多分支注意力孪生网络的目标跟踪算法。该算法首先构建了超分辨率模块和数据增强模块,分别对目标模板进行超分辨率和数据增强,提升目标模板的特征表征能力;然后利用3个主干网络分别提取原始目标模板、超分辨率目标模板和数据增强目标模板的特征,并进行特征融合,同时在主干网络中应用了通道注意力模块和空间注意力模块,以提升特征提取能力;最后,将融合后的特征图与待搜索区域的特征图输入区域生成网络模块,得到目标跟踪信息。实验结果表明,该算法在OTB100数据集上的精确率为0.919、成功率为0.707,在VOT2018数据集上的准确率为0.642、鲁棒性为0.149,在实际场景中的运行速度每秒至少20次,说明该算法具有优异的跟踪性能,并且在各种复杂场景下都具有良好的鲁棒性。
余陆斌, 田联房, 杜启亮 . 基于多分支注意力孪生网络的目标跟踪算法[J]. 华南理工大学学报(自然科学版), 2022 , 50(12) : 30 -40 . DOI: 10.12141/j.issn.1000-565X.210541
Object tracking is of great significance in computer vision tasks. Recently, with the development of deep learning, the tracking algorithms based on Siamese networks have been extensively applied because of their excellent capabilities. However, the performance of the existing Siamese network modules degrades significantly when dealing with special situations such as large deformation of the target, low resolution, and complex background. To address these aforementioned issues, this paper proposed a tracking algorithm based on a multi-stream attention Siamese network. This algorithm first constructs super-resolution modules and data enhancement mo-dules, which performs super-resolution and data augmentation on the target templates, respectively, so as to improve the feature characterization ability of the target template. Then, the three backbone networks were used to extract the features of the original target template, the super-resolution target template, and the data augmentation target template, respectively, and their features were fused; simultaneously, the channel attention module and spatial attention module are applied in the backbone network to improve the feature extraction capability. Finally, the fused feature map and the feature map to be searched were input into the region proposal network module to obtain the target tracking information. The experimental results show that the algorithm achieved the precision of 0.919, the success of 0.707 on the OTB100 dataset and the accuracy of 0.642, the robustness of 0.149 on the VOT2018 dataset, with operation speed higher than 20 times per second in real scenarios, demonstrating the excellent tracking performance of the algorithm and excellent robustness in handling various complex scenarios.
| 1 | 尹宏鹏,陈波,柴毅,等 .基于视觉的目标检测与跟踪综述[J].自动化学报,2016,42(10):1466-1489. |
| 1 | YIN Hongpeng, CHEN Bo, CHAI Yi,et al .Vision-based object detection and tracking:a review [J].Acta Automatica Sinica,2016,42(10):1466-1489. |
| 2 | 郑运平,李睿君 .二叉树模型在目标跟踪中的应用[J].华南理工大学学报(自然科学版),2020,48(1):42-50. |
| 2 | ZHENG Yunping, LI Ruijun .Application of binary tree model in object tracking [J].Journal of South China University of Technology (Natural Science Edition),2020,48(1):42-50. |
| 3 | 孟琭,杨旭 .目标跟踪算法综述 [J].自动化学报,2019,45(7):1244-1260. |
| 3 | MENG Lu, YANG Xu .A survey of object tracking algorithms [J].Acta Automatica Sinica,2019,45(7):1244-1260. |
| 4 | BOLME D S, BEVERIDGE J R, DRAPER B A,et al .Visual object tracking using adaptive correlation filters [C]∥ Proceedings of 2010 IEEE Conference on Computer Vision and Pattern Recognition.San Francisco:IEEE,2010:2544-2550. |
| 5 | HENRIQUES J F, CASEIRO R, MARTINS P,et al .High-speed tracking with kernelized correlation filters [J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2015,37(3):583-596. |
| 6 | DANELLJAN M, BHAT G, KHAN F S,et al .ECO:efficient convolution operators for tracking [C]∥ Procee-dings of 2017 IEEE Conference on Computer Vision and Pattern Recognition.Honolulu:IEEE,2017:6931-6939. |
| 7 | DANELLJAN M, HAGER G, KHAN F S,et al .Discriminative scale space tracking [J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2017,39(8):1561-1575. |
| 8 | 李玺,查宇飞,张天柱,等 .深度学习的目标跟踪算法综述 [J].中国图象图形学报,2019,24(12):2057-2080. |
| 8 | LI Xi, ZHA Yufei, ZHANG Tianzhu,et al .Survey of visual object tracking algorithms based on deep learning [J].Journal of Image and Graphics,2019,24(12):2057-2080. |
| 9 | BERTINETTO L, VALMADRE J, HENRIQUES J F,et al .Fully-convolutional Siamese networks for object tracking [C]∥ Proceedings of the 14th European Conference on Computer Vision.Amsterdam:Springer,2016:850-865. |
| 10 | LI B, YAN J J, WU W,et al .High performance visual tracking with Siamese region proposal network [C]∥ Proceedings of 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Salt Lake City:IEEE,2018:8971-8980. |
| 11 | LI B, WU W, WANG Q,et al .SiamRPN++:evolution of Siamese visual tracking with very deep networks [C]∥ Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Long Beach:IEEE,2019:4277-4286. |
| 12 | SHEN J, TANG X, DONG X,et al .Visual object tracking by hierarchical attention Siamese network [J].IEEE Transactions on Cybernetics,2020,50(7):3068-3080. |
| 13 | WU Y,LIM J, YANG M H .Object tracking benchmark [J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2015,37(9):1834-1848. |
| 14 | KRISTAN M, LEONARDIS A, MATAS J,et al .The sixth visual object tracking VOT2018 challenge results [C]∥ Proceedings of 2018 European Conference on Computer Vision Workshops.Munich:Springer,2018:3-53 |
| 15 | LI P, CHEN B, OUYANG W,et al .GradNet:gradient-guided network for visual object tracking [C]∥ Proceedings of 2019 IEEE/CVF International Conference on Computer Vision.Seoul:IEEE,2019:6161-6170. |
| 16 | VOIGTLAENDER P, LUITEN J, TORR P H,et al .Siam R-CNN:visual tracking by re-detection [C]∥ Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Seattle:IEEE,2020:6577-6587. |
| 17 | DANELLJAN M, BHAT G, KHAN F S,et al .ATOM: accurate tracking by overlap maximization [C]∥ Proceedings of 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Long Beach:IEEE,2019:4655-4664. |
| 18 | XU T Y, FENG Z H, WU X J,et al .Learning adaptive discriminative correlation filters via temporal consistency preserving spatial feature selection for robust visual object tracking [J].IEEE Transactions on Image Processing,2019,28(11):5596-5609. |
| 19 | CHEN Z, ZHONG B, LI G,et al .Siamese box adaptive network for visual tracking [C]∥ Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Seattle:IEEE,2020:6667-6676. |
| 20 | YU Y, XIONG Y, HUANG W,et al .Deformable Siamese attention networks for visual object tracking [C]∥ Proceedings of 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition.Seattle:IEEE,2020:6727-6736. |
/
| 〈 |
|
〉 |