Dropped Object Detection Method Based on Feature Similarity Learning

doi:10.12141/j.issn.1000-565X.220604

Abstract

To overcome the inability of existing dropped object detection methods to identify objects of an “unknown category”, this study proposes a dropped object detection architecture based on feature similarity learning. First, a reference image and a query image to be detected are obtained during the dropping process, and their appearance features are extracted by a weight-shared siamese convolutional network. Then, the Euclidean distance is used to measure the dissimilarity between the features of the reference image and those of the query image. Finally, dropped objects are detected by selecting the pixels in the distance map whose distance values exceed a fixed threshold. To improve robustness to noise such as illumination changes, this paper proposes a novel attention mask unit, and the semantic discriminability of the mask is improved by constructing long-span contextual information and by applying a strongly supervised learning method. This guides the feature response to focus on the appearance changes caused by dropped objects while ignoring the disturbance caused by noise, which resolves the feature entanglement between noise and dropped objects. To verify the effectiveness of the method, data were collected in a real highway scene and a standard dataset was built. The results show that the attention mask unit effectively improves the semantic discriminability of the features and greatly improves the accuracy of dropped object detection, raising the F₁ score by 6.4 percentage points. Meanwhile, the algorithm runs at 30 FPS, which allows real-time detection. The long-span contextual information constructed by the feature sequence state transition method helps the attention mask focus on the feature information of dropped objects and provides stronger resistance to noise interference. The attention mask contour obtained by strongly supervised learning is more accurate, and the model accuracy is higher.
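The distance-and-threshold step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the siamese network and the attention mask unit are not modeled, the toy feature maps stand in for the network outputs, and the function names and the threshold value of 1.0 are hypothetical choices made for illustration only.

```python
import math


def distance_map(ref_feat, qry_feat):
    """Per-pixel Euclidean distance between two feature maps.

    Each feature map is a nested list of shape H x W x C:
    rows of pixels, each pixel a list of C channel values.
    """
    return [
        [math.dist(r_px, q_px) for r_px, q_px in zip(r_row, q_row)]
        for r_row, q_row in zip(ref_feat, qry_feat)
    ]


def detect_changes(dist, threshold):
    """Binary mask: 1 where the distance exceeds the fixed threshold."""
    return [[1 if d > threshold else 0 for d in row] for row in dist]


# Toy 2 x 2 feature maps with 2 channels (assumed values, standing in
# for the outputs of the weight-shared siamese network).
ref = [[[0.0, 0.0], [1.0, 1.0]],
       [[0.5, 0.5], [0.2, 0.1]]]
qry = [[[0.0, 0.1], [1.0, 1.0]],
       [[3.5, 4.5], [0.2, 0.1]]]

dist = distance_map(ref, qry)
mask = detect_changes(dist, threshold=1.0)  # threshold is a hypothetical value
```

In this toy input only the bottom-left pixel differs strongly between the two maps, so it is the only one flagged in `mask`. A practical version would compute the same quantity on tensors from a deep learning library rather than on nested lists.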

Key words: dropped object detection, deep learning, feature similarity learning, attention mechanism, context information

CLC Number: U495

GUO Enqiang, FU Xinsha. Dropped Object Detection Method Based on Feature Similarity Learning[J]. Journal of South China University of Technology(Natural Science Edition), 2023, 51(6): 30-41.

