计算机科学
分割
变压器
人工智能
利用
编码器
杠杆(统计)
图像分割
计算机视觉
模式识别(心理学)
工程类
电压
计算机安全
操作系统
电气工程
作者
Shen Yao,Lei Wang,Yue Jin
标识
DOI:10.1109/cvprw56347.2022.00177
摘要
The semantic segmentation of agricultural aerial images is very important for the recognition and analysis of farmland anomaly patterns, such as drydown, endrow, nutrient deficiency, etc. Methods for general semantic segmentation such as Fully Convolutional Networks can extract rich semantic features, but are difficult to exploit the long-range information. Recently, vision Transformer architectures have made outstanding performances in image segmentation tasks, but transformer-based models have not been fully explored in the field of agriculture.Therefore, we propose a novel architecture called Agricultural Aerial Transformer (AAFormer) to solve the semantic segmentation of aerial farmland images. We adopt Mix Transformer (MiT) in the encoder stage to enhance the ability of field anomaly pattern recognition and leverage the Squeeze-and-Excitation (SE) module in the decoder stage to improve the effectiveness of key channels. The boundary maps of farmland are introduced into the decoder. Evaluated on the Agriculture-Vision validation set, the mIoU of our proposed model reaches 45.44%.
科研通智能强力驱动
Strongly Powered by AbleSci AI