计算机科学
人工智能
目标检测
计算机视觉
特征(语言学)
感兴趣区域
对象(语法)
像素
水下
相似性(几何)
编码(内存)
编码(集合论)
特征提取
模式识别(心理学)
图像(数学)
海洋学
地质学
哲学
集合(抽象数据类型)
程序设计语言
语言学
作者
Xutao Liang,Pinhao Song
标识
DOI:10.1109/icip46576.2022.9897515
摘要
Self-attention is one of the most successful designs in deep learning, which calculates the similarity of different tokens and reconstructs the feature based on the attention matrix. Originally designed for NLP, self-attention is also popular in computer vision, and can be categorized into pixel-level attention and patch-level attention. In object detection, RoI features can be seen as patches from base feature maps. This paper aims to apply the attention module to RoI features to improve performance. Instead of employing an original self-attention module, we choose the external attention module, a modified self-attention with reduced parameters. With the proposed Double Head structure and the Positional Encoding module, our method can achieve promising performance in object detection. The comprehensive experiments show that it achieves promising performance, especially in the under-water object detection dataset. The code will be avaiable in: https://github.com/zsyasd/Excavating-RoI-Attention-for-Underwater-Object-Detection
科研通智能强力驱动
Strongly Powered by AbleSci AI