计算机科学
目标检测
人工智能
对象(语法)
基线(sea)
计算机视觉
探测器
点(几何)
感知器
模式识别(心理学)
人工神经网络
几何学
数学
电信
海洋学
地质学
作者
Siyu Chen,Yiling Liu,Jinhe Su,Ruixin Zheng,Zhihui Chen,Lefan Wang
标识
DOI:10.1145/3573428.3573457
摘要
Vessel detection has received wide attention in object detection, and the recently proposed DETR has successfully achieved true end-to-end object detection and has shown good performance. However, DETR is not sensitive to detect small objects, resulting in its unsatisfactory performance in vessel detection. In this paper, we use Deformable DETR as the baseline model and modify it on top of that. Firstly, we add reference point information to object queries to make the features learned by object queries richer to improve the performance of the detector. Secondly, we use multi-layer perceptron instead of multi-head self-attention to reduce the computational effort of the decoder. In addition, we collected 85 videos annotated with 4563 images and used these images to make a vessel dataset. The experimental data on our vessel dataset shows that VDDT performs better compared to the baseline.
科研通智能强力驱动
Strongly Powered by AbleSci AI