交叉口(航空)
透视图(图形)
维数(图论)
对象(语法)
计算机科学
空间分析
频道(广播)
模式识别(心理学)
人工智能
分割
数学
计算机视觉
地图学
地理
统计
纯数学
计算机网络
作者
Gaoge Han,Shaoli Huang,Fang Zhao,Jinglei Tang
标识
DOI:10.1016/j.patcog.2024.110509
摘要
Attention mechanisms have been shown to play a crucial role in enhancing visual perception tasks. However, in most existing approaches, channel and spatial attention maps are estimated separately without considering the varying importance of each other. This results in a coarse attention weight for objects of interest from a holistic 3-D perspective. To address this issue, we propose a novel Parameter-free Spatial Intersection Attention Module (SIAM), which estimates 3D attention maps with spatial intersection using a parameter-free way. Specifically, SIAM first generates two independent mean queries from two spatial axes and views input as keys. Then, by computing a dot product between these mean queries and keys, SIAM generates two cross-dimension (channel and spatial) attention maps from two spatial directions and combines them into 3-D attention maps. By doing so, the produced attention maps reason important areas with spatial intersection, which can capture location-aware information to facilitate difficult objects' location in the images. We evaluate our method in image classification, object detection, and object segmentation tasks. Extensive experimental results consistently demonstrate our approach is superior to its counterparts.
科研通智能强力驱动
Strongly Powered by AbleSci AI