计算机科学
深度学习
过度拟合
人工智能
学习迁移
残差神经网络
卷积神经网络
稳健性(进化)
上下文图像分类
模式识别(心理学)
机器学习
图像(数学)
人工神经网络
生物化学
基因
化学
作者
Shaojie Zhao,Siqi Yi,Cong Cao,Juan Cheng,Yuqing Ye,Menglin Kong
标识
DOI:10.1109/iccnea60107.2023.00012
摘要
In this paper, we propose a framework for endoscopic image classification based on deep transfer learning (TL), specifically designed to address the unique challenges of endoscopic medical image classification. Our approach focuses on the performance of a convolutional neural network (CNN) model based on the ResNet architecture. To further improve the model's prediction accuracy and robustness, we introduce two ResNet variants: ResNe-SE and ResNet-CBAM, which incorporate the Squeeze-Excitation Module and Convolutional Block Attention Module, respectively. These modules allow the model to selectively learn significant features while suppressing noisy and unimportant features by capturing the dependencies between channels and spatial locations, ultimately achieving optimal performance compared to numerous baseline models. Furthermore, since limited training data can lead to overfitting of deep learning models, we apply deep TL to the field of medical image classification by fine-tuning parameters during model training based on models pre-trained on the publicly available ImageNet dataset. This approach addresses the problem of a limited number of endoscopic images. The results of our ablation experiments demonstrate the effectiveness of using deep TL techniques for this task, the improvements are 22.8%, 25.9%, and 13.3% for VGG-16, ResNet-34, and ResNet-50, respectively.
科研通智能强力驱动
Strongly Powered by AbleSci AI