Video Object Detection Based on Correlation Feature and Convolutional Neural Network

LIU Yujie, CAO Xianzhi, LI Zongmin, LI Hua

doi:10.3969/j.issn.1000-565X.2018.12.004

Journal of South China University of Technology (Natural Science Edition)

2018, Vol. 46, Issue 12: 26-33


Computer Science and Technology


1. College of Computer & Communication Engineering, China University of Petroleum (East China), Qingdao 266580, Shandong, China; 2. Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190, China; 3. University of Chinese Academy of Sciences, Beijing 100190, China

LIU Yujie (born 1971), male, Ph.D., associate professor; his research covers computer graphics and image processing, multimedia databases, and multimedia data compression.

Received: 2018-07-15

Published online: 2018-11-01

Supported by the National Natural Science Foundation of China (61379106) and the Natural Science Foundation of Shandong Province, China (ZR2015FM011, ZR2013FM036)

Abstract

Applying image detection algorithms to video object detection forces a trade-off between speed and precision. To make full use of the motion information of objects between frames, a video detection method that combines correlation features with a convolutional neural network is proposed. First, an image detection algorithm extracts features from the current video frame; second, the correlation features between two frames are used to predict the feature map of the current frame; finally, the motion information contained in the correlation features is used to refine the final result. Experiments on the ImageNet dataset show that the method achieves better precision than current methods while maintaining a fast speed.

Key words: video object detection; convolutional neural network; correlation feature

Cite this article

LIU Yujie, CAO Xianzhi, LI Zongmin, LI Hua. Video Object Detection Based on Correlation Feature and Convolutional Neural Network[J]. Journal of South China University of Technology (Natural Science Edition), 2018, 46(12): 26-33. DOI: 10.3969/j.issn.1000-565X.2018.12.004
