判别式
过度拟合
计算机科学
人工智能
推论
光学(聚焦)
特征(语言学)
深度学习
机器学习
集合(抽象数据类型)
种植
模式识别(心理学)
作者
Jianpin Chen,Heng Li,Junlin Liang,Xiaofan Su,Zhenzhen Zhai,Xinyu Chai
标识
DOI:10.1016/j.neucom.2022.06.041
摘要
• Attention regions cropping and erasing data augmentation approaches are proposed for fine-grained visual classification. • A coarse-to-fine refinement strategy is proposed to refine the classification result with the defined confidence value. • Analyses of three challenging fine-grained datasets along with currently outstanding methods. • The comprehensive experimental results on three challenging FGVC datasets show the effectiveness of our approach. Fine-grained visual classification is challenging due to similarities within classes and discriminative features located in subtle regions. Conventional methods focus on extracting features from the most discriminative parts, which may underperform when these parts are occluded or invisible. And the limited training data also leads to serious overfitting problem. In this paper, we propose an Attention-based Cropping and Erasing Network (ACEN) with a coarse-to-fine refinement strategy to address these problems. By convolving the feature maps from CNN, we obtain a set of attention maps which focus on discriminative object parts. Guided by the attention maps, we propose attention region cropping and erasing operations to augment training data. Moreover, the attention region cropping enhances local discriminative feature learning, and the attention region erasing promotes multi-attention learning. During inference phase, the coarse-to-fine refinement strategy is proposed to refine the model prediction. Extensive experiments demonstrate that our approach achieves state-of-the-art performance on challenging benchmarks, including CUB-200-2011, FGVC-Aircraft and Stanford Cars.
科研通智能强力驱动
Strongly Powered by AbleSci AI