运动模糊
人工智能
计算机视觉
计算机科学
目标检测
特征(语言学)
光学(聚焦)
对象(语法)
像素
视频跟踪
帧(网络)
模式识别(心理学)
图像(数学)
光学
哲学
物理
电信
语言学
作者
Yujie Wu,Hong Zhang,Yawei Li,Y. F. Yang,Ding Yuan
出处
期刊:IEEE Access
[Institute of Electrical and Electronics Engineers]
日期:2020-01-01
卷期号:8: 208554-208565
被引量:30
标识
DOI:10.1109/access.2020.3038913
摘要
In recent years, the excellent image-based object detection algorithms are transferred to the video object detection directly. These frame-by-frame processing methods are suboptimal owing to the degenerate object appearance such as motion blur, defocus and rare poses. The existing works for video object detection mostly focus on the feature aggregation at pixel level and instance level, but the blur impact in the aggregation process has not been exploited well so far. In this article, we propose an end-to-end blur-aid feature aggregation network (BFAN) for video object detection. The proposed BFAN focuses on the aggregation process influenced by the blur including motion blur and defocus with high accuracy and little increased computation. In BFAN, we evaluate the object blur degree of each frame as the weight for aggregation. Noteworthy, the background is usually flat which has a negative impact on the object blur degree evaluation. Therefore, we introduce a light saliency detection network to alleviate the background interference. The experiments conducted on the ImageNet VID dataset show that BFAN achieves the state-of-the-art detection performance, exactly 79.1% mAP, with 3 points improvement compared to the video object detection baseline.
科研通智能强力驱动
Strongly Powered by AbleSci AI