收稿日期: 2023-08-08
网络出版日期: 2023-12-26
基金资助
黑龙江省重点研发项目(JD22A014);国家车辆事故深度调查体系资助项目(NAIS-ZL-ZHGL-2020018);国家自然科学基金资助项目(62173107)
Intelligent Vehicle Object Detection Algorithm Based on Lightweight CenterNet
Received date: 2023-08-08
Online published: 2023-12-26
Supported by
the Key Research and Development Plan Projects of Heilongjiang Province(JD22A014);the National Automobile Accident In-Depth Investigation System Funding Project(NAIS-ZL-ZHGL-2020018);the National Natural Science Foundation of China(62173107)
针对当前目标检测算法参数较多,计算量较大,导致响应速度较慢,难以推广应用于智能车辆系统的问题,提出一种改进的CenterNet目标检测算法。即应用轻量化MobileNetV3网络替换原ResNet-50网络,降低计算量;应用深度可分离的PANet替换特征增强网络,获得多尺度特征信息融合后的特征,并引入SimAM注意力机制在特征融合前强化目标特征关注度,再用SiLU激活函数替换原目标检测网络中的ReLU激活函数,增强网络学习能力。最后提出CPAN-ASFF模块对深度可分离的PANet输出多尺度特征图进行融合,提高目标检测精度。应用优化后的KITTI数据集对改进后的CenterNet目标检测算法进行训练及检测验证,结果表明:其平均准确率为80.7%,比原始CenterNet目标检测算法提高了12个百分点,其检测速度为65 f/s,其参数量为8.91 M,较原算法减少72.73%,改进后的算法在遮挡目标、重叠目标以及与背景相似目标的检测效果上表现更优。且在SODA10M数据集中,文中提出的算法的检测精度与速度也都优于当前主流算法。该研究对算法的优化及实验为智能车辆在实际工程中的应用奠定了技术支撑。
岳永恒 , 宁睿厚 . 基于轻量化CenterNet的智能车辆目标检测算法[J]. 华南理工大学学报(自然科学版), 2024 , 52(8) : 45 -55 . DOI: 10.12141/j.issn.1000-565X.230513
Aiming at the problem that the current object detection algorithm has many parameters and larger computation amount, resulting in slower response and difficulties of application in intelligent vehicle systems, this paper proposed an improved CenterNet object detection algorithm. That is, the lightweight MobileNetV3 network was applied to replace the original ResNet-50 network to reduce the amount of computation; the depth-separable PANet was applied to replace the feature enhancement network to obtain the features after the fusion of the multi-scale feature information, and the SimAM attention mechanism was introduced to strengthen the attention of the target features before the fusion of the features, then the SiLU activation function was used to replace the original object detection network in the ReLU activation function to enhance the network learning. Finally, the CPAN-ASFF module was proposed to fuse the depth-separable PANet output multi-scale feature maps to improve the object detection accuracy. The optimized KITTI dataset was applied to train and detect the improved CenterNet object detection algorithm for validation. And the results show that its mean average precision is 80.7%, which is 12 percentage points higher than that of the original CenterNet object detection algorithm; its detection speed is 65 f/s, and its number of parameters is 8.91 M, which is 72.73% less than the original algorithm. The improved algorithm performs better in the detection of occluded objects, overlapped objects and objects similar to the background. In the SODA10M dataset, the detection accuracy and speed of the algorithm are better than the current mainstream algorithms. The optimization of the algorithm and the experiments in the paper laid the technical support for the application of intelligent vehicles in practical engineering.
| 1 | GIRSHICK R, DONAHUE J, DARRELL T,et al .Rich feature hierarchies for accurate object detection and semantic segmentation[C]∥ Proceedings of 2014 IEEE Conference on Computer Vision and Pattern Recognition(CVPR).Columbus:IEEE,2014:580-587. |
| 2 | GIRSHICK R, DONAHUE J, DARRELL T,et al .Region-based convolutional networks for accurate object detection and segmentation[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2015,38(1):142-158. |
| 3 | GIRSHICK R .Fast R-CNN[C]∥ Proceedings of 2015 IEEE International Conference on Computer Vision (ICCV).Santiago:IEEE,2015:1440-1448. |
| 4 | HE K M, ZHANG X Y, REN S Q,et al .Spatial pyramid pooling in deep convolutional networks for visual re-cognition[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2015,37(9):1904-1916. |
| 5 | REN S Q, HE K M, GIRSHICK R,et al .Faster R-CNN:towards real-time object detection with region proposal networks[J].Advances in Neural Information Processing Systems,2017,39(6):1137-1149. |
| 6 | REDMON J, DIVVALA S, GIRSHICK R,et al .You only look once:unified,real-time object detection[C]∥ Proceedings of 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Las Vegas:IEEE,2016:779-788. |
| 7 | LIU W, ANGUELOV D, ERHAN D,et al .Ssd:single shot multibox detector[C]∥ Proceedings of 2016 European Conference on Computer Vision (ECCV).Amsterdam:Springer,2016:21-37. |
| 8 | SONG Y K, HONG S H, HU C T,et al .MEB-YOLO:an efficient vehicle detection method in complex traffic road scenes[J].Computers Materials & Continua,2023,75(3):5761-5784. |
| 9 | CAO G M, XIE X M, YANG W Z,et al .Feature-fused SSD:fast detection for small objects[C]∥ Proceedings of Ninth International Conference on Graphic and Image Processing (ICGIP 2017).Qingdao:SPIE,2018:381-388. |
| 10 | LIN T Y, GOYAL P, GIRSHICK R,et al .Focal loss for dense object detection[C]∥ Proceedings of 2017 IEEE International Conference on Computer Vision (ICCV).Venice:IEEE,2017:2980-2988. |
| 11 | ZHAO G C, PANG J, ZHANG H,et al .Anchor-free network for multi-class object detection in remote sensing images[C]∥ Proceedings of 2020 39th Chinese Control Conference (CCC).Shenyang:IEEE,2020:7510-7515. |
| 12 | LAW H, DENG J .CornerNet:detecting objects as paired keypoints[C]∥ Proceedings of 2018 European Conference on Computer Vision (ECCV).Munich:Springer,2018:734-750. |
| 13 | CARION N, MASSA F, SYNNAEVE G,et al .End-to-end object detection with transformers[C]∥ Proceedings of 2020 European Conference on Computer Vision (ECCV).Glasgow:Springer,2020:213-229. |
| 14 | ZHOU X Y, WANG D Q, KR?HENBüHL P .Objects as points[J].arXiv preprint arXiv:,2019. |
| 15 | HOWARD A, SANDLER M, CHU G,et al .Searching for mobilenetv3[C]∥ Proceedings of 2019 IEEE International Conference on Computer Vision (ICCV).Seoul:IEEE,2019:1314-1324. |
| 16 | LIU S, QI L, QIN H F,et al .Path aggregation network for instance segmentation[C]∥ Proceedings of 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Salt Lake City:IEEE,2018:8759-8768. |
| 17 | YANG L X, ZHANG R Y, LI L D,et al .Simam:a simple,parameter-free attention module for convolutional neural networks[C]∥ Proceedings of International Conference on Machine Learning (ICML).Williamstown:PMLR,2021:11863-11874. |
| 18 | RAMACHANDRAN P, ZOPH B, LE Q V .Searching for activation functions[J].arXiv preprint arXiv:,2017. |
| 19 | GLOROT X, BORDES A, BENGIO Y .Deep sparse rectifier neural networks[C]∥ Proceedings of Fourteenth International Conference on Artificial Intelligence and Statistics.Fort Lauderdale:JMLR,2011:315-323. |
| 20 | GEIGER A, LENZ P, URTASUN R .Are we ready for autonomous driving?the KITTI vision benchmark suite[C]∥ Proceedings of 2012 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Providence:IEEE,2012:3354-3361. |
| 21 | NEUBECK A, VAN GOOL L .Efficient non-maximum suppression[C]∥ Proceedings of 18th International Conference on Pattern Recognition (ICPR).Hong Kong:IEEE,2006:850-855. |
| 22 | LIN T Y, DOLLáR P, GIRSHICK R,et al .Feature pyramid networks for object detection[C]∥ Proceedings of 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Honolulu:IEEE,2017:2117-2125. |
| 23 | ZHANG X X, LI N, ZHANG R X .An improved lightweight network MobileNetv3 based YOLOv3 for pedestrian detection[C]∥ Proceedings of 2021 IEEE International Conference on Consumer Electronics and Computer Engineering (ICCECE).Guangzhou:IEEE,2021:114-118. |
| 24 | WANG S Y, ZHAO J,TA N,et al .A real-time deep learning forest fire monitoring algorithm based on an improved Pruned+ KD model[J].Journal of Real-Time Image Processing,2021,18(6):2319-2329. |
| 25 | HOWARD G, ZHU M L, CHEN B,et al .Mobilenets:efficient convolutional neural networks for mobile vision applications[J].arXiv preprint arXiv:,2017. |
| 26 | SANDLER M, HOWARD A, ZHU M L,et al .Mobilenetv2:inverted residuals and linear bottlenecks[C]∥ Proceedings of 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Salt Lake City:IEEE,2018:4510-4520. |
| 27 | HU J, SHEN L, SUN G .Squeeze-and-excitation networks[C]∥ Proceedings of 2018 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).Salt Lake City:IEEE,2018:7132-7141. |
| 28 | ELFWING S, UCHIBE E, DOYA K .Sigmoid-weighted linear units for neural network function approximation in reinforcement learning[J].Neural Networks,2018,107:3-11. |
| 29 | MA N N, ZHANG X Y, LIU M,et al .Activate or not:learning customized activation[C]∥ Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).Nashville:IEEE,2021:8032-8042. |
| 30 | LIU S T, HUANG D, WANG Y H .Learning spatial fusion for single-shot object detection[J].arXiv preprint arXiv:,2019. |
| 31 | GUO M H, XU T X, LIU J J,et al .Attention mechanisms in computer vision:a survey[J].Computational Visual Media,2022,8(3):331-368. |
| 32 | GEIGER A, LENZ P, STILLER C,et al .Vision meets robotics:the KITTI dataset[J].The International Journal of Robotics Research,2013,32(11):1231-1237. |
| 33 | OKSUZ K, CAM B C, KALKAN S,et al .Imbalance problems in object detection:a review[J].IEEE Transactions on Pattern Analysis and Machine Intelligence,2020,43(10):3388-3415. |
/
| 〈 |
|
〉 |