分割
增采样
计算机科学
人工智能
变压器
模式识别(心理学)
小波
计算机视觉
图像(数学)
物理
量子力学
电压
作者
Xinxin Zhang,Fei Li,Haibin Jin,Weisong Mu
标识
DOI:10.1016/j.asoc.2023.110392
摘要
Grape leaf diseases segmentation is an essential basis for achieving precise diagnosis and identification of diseases. However, the complex background renders it difficult for small disease areas to be precisely segmented. The existing Transformer mainly focuses on utilizing key and value downsampling to improve model performance while neglecting that downsampling is irreversible with the loss of contextual information. To this end, this paper proposed a novel Locally Reversible Transformer (LRT) segmentation model for grape leaf diseases in natural scene images, whose representation is learned in a reversible downsampling manner. Specifically, a Local Learning Bottleneck (LLB) is developed to enhance local perception and extract richer semantic information of grape leaf diseases via inverted residual convolution. Furthermore, motivated by the wavelet theory, the Reversible Attention (RA) is designed to replace the original downsampling operation by introducing wavelet transform into the multi-headed attention and solving the problem of difficult detection and segmentation of small disease targets with complex backgrounds. Extensive experiments demonstrate that the segmentation performance of LRT outperforms state-of-the-art models with comparable GFLOPs and parameters. Moreover, LRT can retain more multi-grain information and can increase the receptive field to focus on small disease regions with complex backgrounds.
科研通智能强力驱动
Strongly Powered by AbleSci AI