Object Tracking Algorithm Based on Multi-Stream Attention Siamese Network

doi:10.12141/j.issn.1000-565X.210541

Journal of South China University of Technology (Natural Science Edition), 2022, Vol. 50, Issue 12: 30-40.

Topic: Computer Science and Technology, 2022

YU Lubin^1,2 TIAN Lianfang^1,3,4 DU Qiliang^1,4,5

^1.School of Automation Science and Engineering, South China University of Technology, Guangzhou 510640, Guangdong, China
^2.The Fifth Electronics Research Institute of the Ministry of Industry and Information Technology, Guangzhou 511370, Guangdong, China
^3.Southern Marine Science and Engineering Guangdong Laboratory (Zhuhai), Zhuhai 519000, Guangdong, China
^4.Key Laboratory of Autonomous Systems and Network Control of the Ministry of Education, South China University of Technology, Guangzhou 510640, Guangdong, China
^5.China-Singapore International Joint Research Institute, South China University of Technology, Guangzhou 510555, Guangdong, China

Received: 2021-08-25 Online: 2022-12-25 Published: 2022-07-21
Contact: DU Qiliang (born 1980), male, Ph.D., associate research fellow; main research interests: pattern recognition and machine vision. E-mail: qldu@scut.edu.cn
About author: YU Lubin (born 1994), male, Ph.D.; main research interests: pattern recognition and machine vision. E-mail: yulubin94@qq.com
Supported by:
the Key-Area R&D Project of Guangdong Province (2018B010109001); the Guangdong Provincial Special Project for the Development of Ocean Economy (GDNRC[2020]018)

Abstract:

Object tracking is an important task in computer vision. With the development of deep learning in recent years, tracking algorithms based on Siamese networks have been widely adopted because of their excellent performance. However, the performance of existing Siamese-network-based trackers usually degrades significantly under large target deformation, low resolution, or complex backgrounds. To address these issues, this paper proposes a tracking algorithm based on a multi-stream attention Siamese network. The algorithm first constructs a super-resolution module and a data augmentation module, which apply super-resolution and data augmentation to the target template, respectively, to improve the feature representation ability of the target template. Three backbone networks then extract the features of the original target template, the super-resolution target template, and the augmented target template, respectively, and these features are fused; a channel attention module and a spatial attention module are also applied in the backbone networks to improve feature extraction. Finally, the fused feature map and the feature map of the search region are fed into a region proposal network module to obtain the target tracking information. Experimental results show that the algorithm achieves a precision of 0.919 and a success rate of 0.707 on the OTB100 dataset, an accuracy of 0.642 and a robustness of 0.149 on the VOT2018 dataset, and a running speed of at least 20 frames per second in real scenarios, indicating that the algorithm delivers excellent tracking performance and good robustness across a variety of complex scenarios.

Key words: object tracking, Siamese network, super-resolution, data augmentation, attention module
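
The pipeline described in the abstract can be illustrated with a small NumPy sketch. It is not the authors' implementation: the abstract does not specify the form of the attention modules or the fusion rule, so the parameter-free sigmoid gates, the averaging fusion, and the function names (`channel_attention`, `spatial_attention`, `fuse_templates`, `cross_correlate`) are all assumptions made for illustration. The region proposal network is replaced here by a single cross-correlation response map as a stand-in, and random arrays stand in for backbone features.

```python
import numpy as np


def sigmoid(x):
    """Logistic function, applied elementwise."""
    return 1.0 / (1.0 + np.exp(-x))


def channel_attention(feat):
    """Reweight the channels of a (C, H, W) map with a gate from global average pooling.

    Assumption: a parameter-free gate; a trained module would use learned layers.
    """
    gate = sigmoid(feat.mean(axis=(1, 2)))  # shape (C,)
    return feat * gate[:, None, None]


def spatial_attention(feat):
    """Reweight the locations of a (C, H, W) map with a gate from the channel-wise mean."""
    gate = sigmoid(feat.mean(axis=0))  # shape (H, W)
    return feat * gate[None, :, :]


def fuse_templates(raw, sr, aug):
    """Fuse three attended template feature maps. Assumption: plain averaging."""
    branches = [spatial_attention(channel_attention(f)) for f in (raw, sr, aug)]
    return np.mean(branches, axis=0)


def cross_correlate(template, search):
    """Slide a (C, h, w) template over a (C, H, W) search map (valid positions only)."""
    _, h, w = template.shape
    _, big_h, big_w = search.shape
    out = np.empty((big_h - h + 1, big_w - w + 1))
    for i in range(out.shape[0]):
        for j in range(out.shape[1]):
            out[i, j] = np.sum(template * search[:, i:i + h, j:j + w])
    return out


# Hypothetical feature maps for the original, super-resolved and augmented templates.
rng = np.random.default_rng(0)
raw, sr, aug = (rng.standard_normal((8, 6, 6)) for _ in range(3))
search = rng.standard_normal((8, 22, 22))

fused = fuse_templates(raw, sr, aug)
response = cross_correlate(fused, search)
print(fused.shape, response.shape)
```

In a full tracker the response would feed classification and box-regression heads rather than being read directly; the sketch only shows how three template branches can share one search-region comparison.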

CLC Number:

TP391.4

YU Lubin, TIAN Lianfang, DU Qiliang. Object Tracking Algorithm Based on Multi-Stream Attention Siamese Network[J]. Journal of South China University of Technology (Natural Science Edition), 2022, 50(12): 30-40.

Figures/Tables (16): Figures 1-11 and Tables 1-5

Algorithm	Precision	Success rate
Raw₁	0.838	0.627
SR	0.898	0.671
AUG	0.878	0.644
MSASiam	0.919	0.707

Algorithm	Precision	Success rate
Raw₂	0.850	0.642
Spatial	0.874	0.656
Channel	0.907	0.680
MSASiam	0.919	0.707

Algorithm	Precision	Success rate
ATOM	0.875	0.659
Siam R-CNN	0.894	0.704
ECO	0.911	0.703
SiamRPN++	0.914	0.691
GradNet	0.905	0.662
MSASiam	0.919	0.707
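
The precision and success-rate columns above are not defined on this page. The sketch below assumes the usual OTB convention: precision is the fraction of frames whose predicted box center lies within 20 pixels of the ground-truth center, and success is the area under the curve of the fraction of frames whose overlap (IoU) exceeds a threshold swept from 0 to 1. The `(x, y, w, h)` box format, the 21-step threshold grid, and the toy boxes are illustrative assumptions, not data from the paper.

```python
import math


def center_error(a, b):
    """Euclidean distance between the centers of two (x, y, w, h) boxes."""
    ax, ay = a[0] + a[2] / 2, a[1] + a[3] / 2
    bx, by = b[0] + b[2] / 2, b[1] + b[3] / 2
    return math.hypot(ax - bx, ay - by)


def iou(a, b):
    """Intersection over union of two (x, y, w, h) boxes."""
    iw = max(0.0, min(a[0] + a[2], b[0] + b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[1] + a[3], b[1] + b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = a[2] * a[3] + b[2] * b[3] - inter
    return inter / union if union > 0 else 0.0


def precision(pred, truth, threshold=20.0):
    """Fraction of frames whose center error is within `threshold` pixels."""
    hits = sum(center_error(p, t) <= threshold for p, t in zip(pred, truth))
    return hits / len(truth)


def success_auc(pred, truth, steps=21):
    """Mean success rate over overlap thresholds 0, 0.05, ..., 1 (for steps=21)."""
    overlaps = [iou(p, t) for p, t in zip(pred, truth)]
    rates = []
    for k in range(steps):
        thr = k / (steps - 1)
        rates.append(sum(o > thr for o in overlaps) / len(overlaps))
    return sum(rates) / steps


# Toy sequence: a perfect frame, a small drift, a large drift, and a lost target.
truth = [(10, 10, 40, 40)] * 4
pred = [(10, 10, 40, 40), (14, 10, 40, 40), (50, 10, 40, 40), (200, 200, 40, 40)]
print(precision(pred, truth), round(success_auc(pred, truth), 3))
```

On a benchmark, both numbers are computed per sequence over all frames and then aggregated across sequences.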

Algorithm	Accuracy	Robustness	EAO
SiamFC	0.503	0.585	0.188
ECO	0.484	0.276	0.280
SiamRPN	0.586	0.276	0.383
LADCF	0.503	0.159	0.389
ATOM	0.590	0.204	0.401
SiamRPN++	0.600	0.234	0.414
SiamBAN	0.597	0.178	0.440
SiamAttn	0.630	0.160	0.470
MSASiam	0.642	0.149	0.481

Algorithm	EAO	FPS
SiamFC	0.188	40
SiamRPN	0.383	76
SiamRPN++	0.414	35
MSASiam	0.481	21
